This repository has been archived by the owner on Aug 4, 2023. It is now read-only.
v1.2.0 - Data refresh scheduling, deduplication, and provider fixes
·
296 commits
to refs/heads/main
since this release
Improvements
- Add data refresh to Airflow (#397) @stacimc
- Change PhyloPic date range & schedule interval (#423) @AetherUnbound
- Round duration for provider ingestion completion message (#422) @AetherUnbound
- Enable XCom pickling in Airflow (#421) @stacimc
- Handle duplicate keys in load_data task (#395) @stacimc
- Make 'sound' category more specific (#402) @AetherUnbound
- Group test runs by module or class (#409) @stacimc
- Report the environment in TSV loader Slack notifications (#382) @stacimc
Internal Improvements
- Add LRU cache to
is_valid_license_info
(#424) @AetherUnbound - Use published Docker image in primary docker-compose.yml (#417) @AetherUnbound
- 🔄 Synced file(s) with WordPress/openverse (#404) @dhruvkb
- 🔄 Synced file(s) with WordPress/openverse (#403) @dhruvkb
Bug Fixes
- Fix invalid license urls from Finnish Museum API (#418) @stacimc
- Reduce noise in NYPL ingestion (#415) @AetherUnbound
- Add ConnectionError to acceptable flaky exceptions for Freesound (#413) @AetherUnbound
- Fix schedule intervals on Cleveland Museum & Wikimedia Commons (#416) @AetherUnbound
- Update API requests for Museum Victoria DAG (#414) @stacimc
- Add OFEO-SG subprovider (#412) @stacimc
- 🔄 Synced file(s) with WordPress/openverse (#404) @dhruvkb
- 🔄 Synced file(s) with WordPress/openverse (#403) @dhruvkb
Credits
Thanks to @AetherUnbound, @dhruvkb and @stacimc for their contributions!