Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Cannot replicate prior results3 #554

Open
karafecho opened this issue Apr 13, 2021 · 8 comments
Open

Cannot replicate prior results3 #554

karafecho opened this issue Apr 13, 2021 · 8 comments

Comments

@karafecho
Copy link

This issue is essentially the same as #552 and #553 except for isopropranolol - gene - allergic disease.

image

@karafecho
Copy link
Author

Here is a link to the answers.

@karafecho
Copy link
Author

Note that the answer set is missing the gene DDX39B, which was returned with the original query.

@cbizon
Copy link
Contributor

cbizon commented Apr 19, 2021

Do you happen to have any information on the original result? maybe the link to the original question? Trying to track down DDX39B, and it would be helpful to know where the edges originally came from.

@cbizon
Copy link
Contributor

cbizon commented Apr 19, 2021

OK, so I think I see what's going on here. It has to do with a change in how we decided to handle subclasses. We used to add subclass-inferred edges into the database directly. So, DDX39B has an edge from biolink that it "causes" atopic eczema (MONDO:0004980), which is a subclass of Allergic Disease (MONDO:0005271) (see https://www.ebi.ac.uk/ols/ontologies/mondo/terms?iri=http%3A%2F%2Fpurl.obolibrary.org%2Fobo%2FMONDO_0004980).

We used to infer a DDX39B-[causes]->AllergicDisease edge and add it to the graph, but we decided that this was messy and better handled elsewhere. We do keep the subclass edges in the graph, as you can see from this query: https://robokop.renci.org/a/5986af83-34d5-4f4c-bce7-78930af62102_60a5cb79-9f78-43ac-966b-3e5e99995d1e/ so in principle, a question that added another disease node between the gene and allergic disease, with a subclass edge to allergic disease should reproduce the initial result.

Unfortunately, it looks like there is also some kind of load or versioning problem that did not create a subclass of edge between atopic eczema and allergic disease. We could rectify this perhaps by reloading the MONDO subclass of edges, in which case the query outlined above would work.

Going forward, when the new robokop is talking to a RKG being served from automat/plater, the subclass of edges will be followed automatically in accordance with the agreed upon KP behavior, but this is not yet implemented.

@karafecho
Copy link
Author

karafecho commented Apr 19, 2021

Thanks for looking into this. To clarify, is the plan to reload the MONDO subclass of edges? Or, should I simply explain the issue in the rebuttal letter?

@cbizon
Copy link
Contributor

cbizon commented Apr 19, 2021

Looking into it - it's a bit complicated because the robokop graph is old and so it requires some backporting.

@cbizon
Copy link
Contributor

cbizon commented Apr 20, 2021

This has now been updated. Here's a link to a new version of the question that includes explicit subclassing: https://robokop.renci.org/q/8d5556c4-2826-491b-99a0-941086840239/. It includes DDX39B, but also looks like it may add in another gene as well.

@karafecho
Copy link
Author

Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants