Skip to content

Commit

Permalink
Add SSSOM and RDF outputs (#29)
Browse files Browse the repository at this point in the history
This PR adds two new output formats:

| Format | File | Description |

|--------|----------------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| SSSOM |
[tiwid.sssom.tsv](https://github.com/bridgedb/tiwid/blob/more-outputs/artifacts/tiwid.sssom.tsv)
| An output in the [Simple Standard for Sharing Ontological Mappings
(SSSOM)](https://mapping-commons.github.io/sssom/) format |
| RDF |
[tiwid.ttl](https://github.com/bridgedb/tiwid/blob/more-outputs/artifacts/tiwid.ttl)
| An output in RDF, serialized with Turtle. This file can be readily
converted to an OWL ontology with [ROBOT](https://robot.obolibrary.org/)
with `robot convert -i tiwid.ttl -o tiwid.owl`. |

It also makes minor updates to the README, such as noting that the data
is standardized on Bioregistry and not Identifiers.org (though,
effectively this is the same thing)
  • Loading branch information
cthoyt authored Jan 8, 2025
1 parent 243f7f9 commit 8fbee9a
Show file tree
Hide file tree
Showing 6 changed files with 83,278 additions and 7 deletions.
14 changes: 11 additions & 3 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,7 +5,7 @@

This project collects identifiers that are, for some reason, no longer pointing to active records
in the corresponding database. As such, these identifiers should no longer be used. By collecting
them in a machine readable way, we can make it easier to automate the detection of dead identifiers
them in a machine-readable way, we can make it easier to automate the detection of dead identifiers
in data curation workflows.

The following is a short summary of the contents of this resource:
Expand All @@ -15,7 +15,7 @@ The following is a short summary of the contents of this resource:
## Data

The data in the `data` folder is structured like as follows. Filenames follow the namespace of
the resources as defined by identifiers.org and used in compact identifiers. The content of
the resources as defined by the Bioregistry and used in compact URIs. The content of
the file is a comma-separated values file with one or more columns:

* `#did`: the dead identifier
Expand All @@ -24,7 +24,15 @@ the file is a comma-separated values file with one or more columns:

This model is incomplete and volatile, but applies to Release 1.

A collation of all curations can be found in [`artifacts/collated.tsv`](artifacts/collated.tsv).
## Outputs

Automatically generated artifacts can be found in the [`artifacts`](artifacts) folder.

| Format | File | Description |
|--------|----------------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| TSV | [collated.tsv](artifacts/collated.tsv) | A collated version of all TSV in the [data/](data) folder |
| SSSOM | [tiwid.sssom.tsv](artifacts/tiwid.sssom.tsv) | An output in the [Simple Standard for Sharing Ontological Mappings (SSSOM)](https://mapping-commons.github.io/sssom/) format |
| RDF | [tiwid.ttl](artifacts/tiwid.ttl) | An output in RDF, serialized with Turtle. This file can be readily converted to an OWL ontology with [ROBOT](https://robot.obolibrary.org/) with `robot convert -i tiwid.ttl -o tiwid.owl`. |

## Tests

Expand Down
Loading

0 comments on commit 8fbee9a

Please sign in to comment.