-
Notifications
You must be signed in to change notification settings - Fork 16
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How can I read the table that have started on page 1 and extends on multiple pages. #192
Comments
Are there 2 separate issues here?
This could be a bug.
Have you tried the output with the Network parser? |
With this code `import pypdf_table_extraction file_path = r"C:\Projects\temp123\attachments\test\er\er3.pdf" flavors = ["hybrid", "lattice", "network", "stream"] for flavor in flavors:
print("\nTable extraction process completed.")
NOTE: I have uninstalled the Camelot and pypdf_table_extraction and Installed again only pypdf_table_extraction library so there should be no conflicts or any other issues. Can you please help/advise? |
Based on the following error message:
It looks like somhow you are running an old code base.
Maybe uninstall both again. |
Maybe my response is too long so I have cut out most of the data to make it more concise. Here is the listing of pypdf libraries that I have installed: pypdf 4.3.1 pypi_0 pypi I have uninstalled the camelot as per @stefan6419846 suggestion in a thread here Any feedback is greatly appreciated. |
pypdf_table_extraction/camelot does not recognize the table on pages after page 1 with the lattice flavor.
With the stream method, I get a messed-up output like this one
This is the output from the lattice from page one which looks great
The document is a PDF bank statement.
NOTE: I have randomized the numbers in the output for privacy and security purposes.
The text was updated successfully, but these errors were encountered: