You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When attempting to open an Excel document with complex tables, Docling fails to extract the tables correctly.
Steps to reproduce
fromdocling.document_converterimportDocumentConvertersource="./excel-tests.xlsx"# document per local path or URLconverter=DocumentConverter()
result=converter.convert(source)
print(result.document.export_to_markdown()) # output: "## Docling Technical Report[...]"
I understand that the table is complex, so I would like to know what would be the requirements for an Excel document to work with Docling. Digging into the code, I noticed this:
Bug
When attempting to open an Excel document with complex tables, Docling fails to extract the tables correctly.
Steps to reproduce
excel-tests.xlsx
Output
Docling version
Docling version: 2.17.0
Docling Core version: 2.16.0
Docling IBM Models version: 3.3.0
Docling Parse version: 3.1.2
Python: cpython-310 (3.10.7)
Platform: Windows-10-10.0.19045-SP0
Python version
Python 3.10.7
Final Considerations
I understand that the table is complex, so I would like to know what would be the requirements for an Excel document to work with Docling. Digging into the code, I noticed this:
Hope it helps,
Let me know if you need more information.
Have a nice day!
The text was updated successfully, but these errors were encountered: