Releases: gambolputty/newscorpus
Releases · gambolputty/newscorpus
2.0.3
See CHANGELOG.md.
2.0.1
Changed
- Provide example sources.example.json file
Added
- Add database method
iter_articles
to iterate over all articles in the database
Removed
- Remove default sources.json file
v2.0.0
Changed
- Remove Docker setup and use Poetry for dependencies
- Replace MongoDB with SQLite
Added
- Optional CLI arguments
v1.2.0
Added
- Shell script that executes the crawl command
Changed
- Directory structure
- Simplified crawl command:
docker-compose run --rm crawler ./crawl.sh
v1.1.0
v1.0
The initial release.