Suggestion: return the coordinates of each table cell to the pipeline #649

Open
Vawter-001 opened this issue Sep 23, 2024 · 0 comments
Labels
enhancement New feature or request

Comments

@Vawter-001

Is your feature request related to a problem? Please describe.
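1. Using PaddleOCR directly makes large tables hard to parse and prone to errors.
2. MinerU currently returns the table as an HTML string, in which each cell's width and height are lost, but some business scenarios need this information.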

Describe the solution you'd like
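1. Return the coordinates of each table cell to the pipeline, so that when OCR falls short, the cell text can be extracted directly with fitz using those coordinates.
2. Exposing the cell coordinates in the pipeline would also make it easier for other developers to build on top of it.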
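
To illustrate the first point, here is a minimal sketch of how a downstream developer might use per-cell coordinates with fitz (PyMuPDF), if the pipeline returned them. The `TableCell` record, its field names, and the `scale` parameter are hypothetical: MinerU does not currently expose this structure, and the coordinate space it would use is an assumption. The sketch also assumes the PDF has a text layer; for scanned pages there is nothing for `get_text` to read.

```python
from dataclasses import dataclass
from typing import Iterable, List, Tuple

BBox = Tuple[float, float, float, float]  # (x0, y0, x1, y1)


@dataclass
class TableCell:
    """Hypothetical per-cell record; not part of MinerU's current output."""
    row: int
    col: int
    bbox: BBox  # assumed to be in pixels of the rendered page image


def to_pdf_points(bbox: BBox, scale: float) -> BBox:
    """Convert an image-space box to PDF points (scale = 72 / render DPI)."""
    x0, y0, x1, y1 = bbox
    return (x0 * scale, y0 * scale, x1 * scale, y1 * scale)


def extract_cell_texts(page, cells: Iterable[TableCell],
                       scale: float = 1.0) -> List[List[str]]:
    """Read each cell's text from the PDF text layer into a row-major grid.

    `page` is a PyMuPDF page, or any object with get_text(option, clip=...).
    Grid positions with no matching cell stay as empty strings.
    """
    cells = list(cells)
    if not cells:
        return []
    n_rows = max(c.row for c in cells) + 1
    n_cols = max(c.col for c in cells) + 1
    grid = [["" for _ in range(n_cols)] for _ in range(n_rows)]
    for cell in cells:
        clip = to_pdf_points(cell.bbox, scale)
        # Only text inside the clip rectangle is returned.
        grid[cell.row][cell.col] = page.get_text("text", clip=clip).strip()
    return grid


def extract_from_pdf(path: str, page_number: int,
                     cells: Iterable[TableCell],
                     scale: float = 1.0) -> List[List[str]]:
    """Open a PDF with PyMuPDF and extract the given cells from one page."""
    import fitz  # PyMuPDF; imported here so the helpers above stay dependency-free

    with fitz.open(path) as doc:
        return extract_cell_texts(doc[page_number], cells, scale)
```

If the cell boxes were reported in pixels of a page image rendered at 200 DPI, for example, `scale` would be `72 / 200`; if they were already in PDF points, the default of `1.0` applies.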

Describe alternatives you've considered
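1. Consider using the GOT-OCR2.0 project, or let users choose between PaddleOCR and GOT-OCR2.0. GOT-OCR2.0 is slower but more accurate, which is entirely adequate for scenarios that do not need real-time responses.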

Additional context
None.

Vawter-001 added the enhancement label Sep 23, 2024