Footnote extraction error #1620

Open
jasonzou opened this issue Jan 23, 2025 · 1 comment
Labels
bug Something isn't working

Comments

@jasonzou

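Description of the bug

The file is processed normally overall, but the footer (footnote) region is extracted incorrectly on some pages.
See the screenshots: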

Image
Image
Image

How to reproduce the bug

The PDF file can be downloaded here: pdf

Operating system

Linux

Python version

3.10

Software version (magic-pdf --version)

1.0.x

Device mode

cuda

@jasonzou jasonzou added the bug Something isn't working label Jan 23, 2025
@jasonzou
Author

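The version I am using is 1.1.0.
I also tried other files from the same journal, and the footer is misclassified fairly often; one or two misclassifications per file is the better case. Even so, this is considerably better than other PDF content extraction tools. Thanks for open-sourcing it!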
