All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
- Change Poetry run script and remove init.py
- Pass iter_articles arguments down to sqlite-utils rows_where()
- Set default value for order_by argument in iter_articles
- Provide example sources.example.json file
- Add database method
iter_articles
to iterate over all articles in the database
- Remove default sources.json file
- Remove Docker setup and use Poetry for dependencies
- Replace MongoDB with SQLite
- Optional CLI arguments
- Set
--wiredTigerCacheSizeGB
flag to limit memory consumption of MongoDB
- Shell script that executes the crawl command
- Directory structure
- Simplified crawl command
- Three new config variables (
MONGO_OUTSIDE_PORT
,MAX_WORKERS
,KEEP_DAYS
). See readme for details. - Network name for this project (
network name
) - Changelog file
- Config variables are now uppercase
- Readme
- Initial project release