Replies: 2 comments
-
Related to #966 |
Beta Was this translation helpful? Give feedback.
-
I think there are use cases for both structural and semantic specification of the metadata. Structural validationA structural schema defines the shape of the data. JSON Schema is the most common technology for specifying the data model and validating that input metadata conforms to it. Use cases:
Semantic labelingSemantic labels assign meaning to data fields. Annotations are supported by JSON-LD or RDF formats, and generally consist of annotating data with a Compact uniform resource identifiers (CURIE). Semantic validation typically does not enforce a particular data shape, but rather relies on normalization technices such as graph expansion/compression to validate semantic properties. Use cases:
|
Beta Was this translation helpful? Give feedback.
-
(Documenting SciCatCon discussion)
Background
There are different requirements for schemas for the top-level dataset fields and the scientificMetadata.
Top level fields...
scientificMetadata...
Top-Level validation
One option is to validate the full dataset DTO. This would involve
Example
(JSON-LD syntax)
where
dataset_schema.json
inherits from the main scicat dataset schema and includes both top-level and scientific fields.Advantages:
Disadvantages:
Only validate scientificMetadata
Here the schema applies only to the
scientificMetadata
. Top-level fields are in principle validated in the code, rather than following the custom schema.Example
(JSON-LD syntax)
where 'metadata_schema.json' only validates within the scientificMetadata subtree.
JSON Schema would probably make it a top-level field:
Advantages:
Disadvantages:
/dataset/validate
endopoint, which will check everything.Beta Was this translation helpful? Give feedback.
All reactions