Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Query will not run in ROBOKOP but is executable in ROBOKOP KG #495

Open
karafecho opened this issue Feb 18, 2020 · 11 comments
Open

Query will not run in ROBOKOP but is executable in ROBOKOP KG #495

karafecho opened this issue Feb 18, 2020 · 11 comments

Comments

@karafecho
Copy link

karafecho commented Feb 18, 2020

I attempted to use ROBOKOP to answer the MCAT question below.

MCAT Question
Steatorrhea is the presence of increased fat in feces. Which organ is least likely to be the cause of a patient’s steatorrhea?

Correct: Stomach

Incorrect: Pancreas; Small intestine; Liver

Initially, this question was problematic due to a lack of identifiers for stomach, pancreas, and liver; since fixed on 2/13/2020. However, the query still does not run, even when I do not specify "anatomical entity".

image

@cbizon
Copy link
Contributor

cbizon commented Feb 18, 2020

A little more information. Here's a robokop query that returns no answers:

https://robokop.renci.org/q/3692d7a6-83c1-4c5b-8641-66d718a55f8a/

But, if I run the query in robokopdb2 (I think the live one?) then I get answers:

match (p:phenotypic_feature {id:"HP:0002570"})<-[:has_phenotype]-(d:disease)-[:located_in]->(a:anatomical_entity) return * limit 10

@patrickkwang
Copy link
Contributor

patrickkwang commented Feb 18, 2020

Perhaps some strange synonymization is happening here. This is the query graph:

  {
    "nodes": [
      {
        "id": "n0",
        "type": "disease_or_phenotypic_feature",
        "set": false,
        "name": "Steatorrhea",
        "curie": [
          "HP:0002570"
        ]
      },
      {
        "id": "n1",
        "type": "disease",
        "set": false
      },
      {
        "id": "n2",
        "type": "anatomical_entity",
        "set": false
      }
    ],
    "edges": [
      {
        "type": [
          "has_phenotype"
        ],
        "id": "e0",
        "source_id": "n1",
        "target_id": "n0"
      },
      {
        "type": [
          "located_in"
        ],
        "id": "e1",
        "source_id": "n1",
        "target_id": "n2"
      }
    ]
  }

and this is the derived Cypher query:

MATCH (n1:disease {})-[e0]->(n0:disease_or_phenotypic_feature {})
USING INDEX n0:disease_or_phenotypic_feature(id)
WHERE (n0.id = 'MONDO:0001075') AND (type(e0) = "has_phenotype")
MATCH (n1)-[e1]->(n2:anatomical_entity {})
WHERE (type(e1) = "located_in")
WITH n0.id as n0, n1.id as n1, n2.id as n2, collect(distinct e0.id) as e0, collect(distinct e1.id) as e1
RETURN {n0:n0, n1:n1, n2:n2} as nodes, {e0:e0, e1:e1} as edges

@cbizon
Copy link
Contributor

cbizon commented Feb 18, 2020

Hmm. @YaphetKG are we by any chance using the production redis server for the new robokop mini-
crawls?

@YaphetKG
Copy link
Contributor

we are using another instance for those,

@karafecho
Copy link
Author

karafecho commented Feb 19, 2020

Thanks, all.

Note that I kind of need this issue resolved sooner rather than later so that I can complete my second MCAT 'experiment' by the end of the month and include the results on a poster that I am preparing for the AMIA 2020 Informatics Summit (you all are authors, btw). Poster will then be turned into a journal article. (Think: incentive.) :-)

However, I don't want to distract from other activities, so if the issue will require too much time to resolve, then I'll develop Plan B.

@karafecho
Copy link
Author

@patrickkwang @YaphetKG @cbizon : Any status updates on this issue?

@cbizon
Copy link
Contributor

cbizon commented Feb 24, 2020

It's going to take some time to fix. Can you try using the MONDO term MONDO:0001075 for steatorrhea instead?

@karafecho
Copy link
Author

I had already tried the MONDO ID.

I've given up on the original query, at least for the time being. I am probably going to use this answer list. Thoughts?

@karafecho
Copy link
Author

karafecho commented Feb 24, 2020

But check out what happens when I add the predicate "gene_to_expression_site_association".

@cbizon
Copy link
Contributor

cbizon commented Feb 25, 2020

I had already tried the MONDO ID.

And what happened in that case? Still no results?

@karafecho
Copy link
Author

karafecho commented Feb 25, 2020

Sorry, yes, the MONDO ID yielded no results. I tried a few query permutations with both the MONDO ID and the HP ID---nothing ran.

But the newly structured query works: MONDO 0001075 - gene - anatomical entity.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants