Improved tables; links and references
Table improvements
- Tables now handle colspans and rowspans properly
- Improved table model with better accuracy
- Tables merge across pages if you pass --use_llm
- New table benchmarks
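To illustrate what handling colspans and rowspans involves, here is a minimal sketch that expands spanned cells into a rectangular grid. It is not the table model shipped in this release; the `(text, rowspan, colspan)` cell layout and the `expand_spans` name are assumptions made for the example.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical cell representation: (text, rowspan, colspan).
Cell = Tuple[str, int, int]


def expand_spans(rows: List[List[Cell]]) -> List[List[Optional[str]]]:
    """Expand spanned cells so every grid position holds the text covering it."""
    grid: Dict[Tuple[int, int], str] = {}
    n_cols = 0
    for r, row in enumerate(rows):
        c = 0
        for text, rowspan, colspan in row:
            # Skip positions already claimed by a rowspan from an earlier row.
            while (r, c) in grid:
                c += 1
            # Fill every position the cell spans.
            for dr in range(rowspan):
                for dc in range(colspan):
                    grid[(r + dr, c + dc)] = text
            c += colspan
            n_cols = max(n_cols, c)
    n_rows = max((r for r, _ in grid), default=-1) + 1
    # Positions no cell covers are returned as None.
    return [[grid.get((r, c)) for c in range(n_cols)] for r in range(n_rows)]


table = [
    [("A", 2, 1), ("B", 1, 2)],  # "A" spans two rows, "B" spans two columns
    [("C", 1, 1), ("D", 1, 1)],
]
expanded = expand_spans(table)
```

Here `expanded` has two rows of three columns each, with "A" repeated down the first column and "B" repeated across the rest of the first row.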
Links and references
- Links and references are now extracted from the PDF and are clickable
- Anchors are placed on elements as targets
Better configuration
- Any configuration option can now be passed on the CLI
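One common way to expose every configuration option on a command line is to generate the flags from the configuration object itself. The sketch below shows that general technique with `argparse` and a dataclass. It is not the parsing code from this release; `ExampleConfig`, `example_label`, and `example_threshold` are hypothetical names, and only `use_llm` comes from these notes.

```python
import argparse
from dataclasses import dataclass, fields
from typing import List


@dataclass
class ExampleConfig:
    # Hypothetical config class; only use_llm is named in these notes.
    use_llm: bool = False
    example_label: str = ""
    example_threshold: int = 4


def build_parser(config_cls) -> argparse.ArgumentParser:
    """Create one CLI flag per dataclass field, typed from its default value."""
    parser = argparse.ArgumentParser()
    for f in fields(config_cls):
        flag = f"--{f.name}"
        if isinstance(f.default, bool):
            # Boolean options become on/off switches.
            parser.add_argument(flag, action="store_true", default=f.default)
        else:
            parser.add_argument(flag, type=type(f.default), default=f.default)
    return parser


def parse_config(config_cls, argv: List[str]):
    """Parse argv and build a config instance from the resulting values."""
    args = build_parser(config_cls).parse_args(argv)
    return config_cls(**vars(args))


cfg = parse_config(ExampleConfig, ["--use_llm", "--example_threshold", "8"])
```

Because the flags are derived from the fields, adding a field to the config class makes it available on the command line without touching the parser.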
Misc
- With --use_llm, handwriting is now recognized (if the layout detects it)
- Better LLM mode overall
What's Changed
- Dynamic option parsing and updated documentation by @iammosespaulr in #457
- Add context manager support for PdfProvider by @tarun-menta in #481
- Table benchmarks by @tarun-menta in #480
- PDF Link and References Support by @iammosespaulr in #474
- Improved References by @iammosespaulr in #491
- New table model by @VikParuchuri in #472
- Vik dev by @VikParuchuri in #497
- OCR Error Detection Hotfix by @tarun-menta in #498
- Vik dev by @VikParuchuri in #500
- Dev by @VikParuchuri in #499
- Table improvements by @VikParuchuri in #503
Full Changelog: v1.2.7...v1.3.0