Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Large decrease in number of OCID annotations #90

Open
ravwojdyla opened this issue Jan 18, 2024 · 1 comment
Open

Large decrease in number of OCID annotations #90

ravwojdyla opened this issue Jan 18, 2024 · 1 comment

Comments

@ravwojdyla
Copy link

We have observed a large decrease in number of OCID annotations available in the recent Google Patents public data. We specifically consume the OCID associated with patents, so I will focus on that here. It appears that large number of patents that used to be annotated with OCIDs of specific entities (in our case genes), are no longer annotated by those OCIDs.

To give one specific example, if we take STAT1/ENSG00000115415 OCID:102100019657 and application US-201816499393-A, previous release had 32 OCID IDs associated with this application:

OCIDs

102100004941 102100002816 102100017157 102100004159 102100016295 102100020485 102100017509 102100019667 102100019388 102100008658 102100018913 102100015895 102100017329 102100019517 102100009637 102100000197 102100008614 102100005617 102100016662 102100009641 102100017996 102100019657 102100003514 102100017933 102100009664 102100019816 102100015722 102100017932 102100019099 102100012464 102100010255 102100002212

The most recent release doesn't have any, missing STAT1 annotation completely even though it's clearly in the text of the patent. Further if we count unique patents annotated with STAT1 OCID over time:

image

In the most recent public data there appears to be half as many publications with STAT1 annotations. Is there any specific reason for this?

Could be related to #88

@ravwojdyla
Copy link
Author

👋 @wetherbeei, in the past your help in #54 (comment) was invaluable, I wonder if you have any immediate thoughts or recommendation on this issue? Thank you in advance.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant