Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GBIF validation of export from read_wi() and write_dwc() #168

Closed
2 of 3 tasks
Rafnuss opened this issue Sep 13, 2022 · 10 comments
Closed
2 of 3 tasks

GBIF validation of export from read_wi() and write_dwc() #168

Rafnuss opened this issue Sep 13, 2022 · 10 comments
Labels
camtrapdp/camtraptor To be decided if this is related to camtrapdp or camtraptor documentation Improvements or additions to documentation
Milestone

Comments

@Rafnuss
Copy link
Collaborator

Rafnuss commented Sep 13, 2022

Hey, just wanted to check with you @peterdesmet that these validation issues from the GBIF files are not something we can do at the read_wi() or write_dwc() level.

https://www.gbif.org/tools/data-validator/5547190a-10d5-4667-af3d-dc00be586117
https://www.gbif.org/tools/data-validator/47d69d10-0142-45c5-b997-ffed38be6ef5 (doesn't seem to work for audubon)

@peterdesmet
Copy link
Member

Coordinate rounded means that GBIF has rounded the coordinates for us, but best practice is to provide the coordinates as they are in the source database, so I would not add a default round.

Taxon match fuzzy: you can verify where it goes wrong by providing a CSV of all your scientific names to https://www.gbif.org/tools/species-lookup

@peterdesmet
Copy link
Member

Where is the Basis of record invalid coming from in the second link?

@Rafnuss
Copy link
Collaborator Author

Rafnuss commented Sep 13, 2022

Where is the Basis of record invalid coming from in the second link?

The second link is the validation of the audubon file. Is this not due to the fact that the validation tool is unaware of the audubon class? (i.e., there is not validation possible for this class).

@peterdesmet
Copy link
Member

Yes, that is definitely possible. Did you upload both files as a single DwC-Archive or separately?

@Rafnuss
Copy link
Collaborator Author

Rafnuss commented Sep 13, 2022

Yes, it was uploaded seperatly. Here is the one uploaded as a zip file: https://www.gbif.org/tools/data-validator/bdbd3cb4-224b-4cfc-b479-49588b64334a

here is the emf file:
https://gist.github.com/Rafnuss/5dc72f6ac42430135375d52085f9f396

@peterdesmet
Copy link
Member

Ok, never mind, is indeed missing an EML and meta.xml

@peterdesmet
Copy link
Member

Note: here's a published (sequence level) dataset that made use of write_eml() and write_dwc(): https://www.gbif.org/dataset/8a5cbaec-2839-4471-9e1d-98df301095dd

@Rafnuss
Copy link
Collaborator Author

Rafnuss commented Sep 13, 2022

Do you mean this code? https://github.com/inbo/mica-occurrences/blob/master/datasets/mica-agouti-occurrences/src/dwc_mapping.Rmd
It would be nice to have that as an article (vignette) somewhere no? Useful for me at least.

@peterdesmet
Copy link
Member

Yeah, good idea to add this as a vignette. It would have to be for the example small dataset.

@damianooldoni damianooldoni added this to the version 1 milestone Sep 14, 2022
@PietrH PietrH added the documentation Improvements or additions to documentation label Sep 5, 2023
@peterdesmet peterdesmet added the camtrapdp/camtraptor To be decided if this is related to camtrapdp or camtraptor label Mar 6, 2024
@peterdesmet
Copy link
Member

Closing this issue, @Rafnuss let me know if there are remaining items to be tackled.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
camtrapdp/camtraptor To be decided if this is related to camtrapdp or camtraptor documentation Improvements or additions to documentation
Projects
None yet
Development

No branches or pull requests

4 participants