Issue with coordinates of bounding boxes for some PDF #330

Closed
kermitt2 opened this issue Jul 16, 2018 · 2 comments
Labels
bug From Hemiptera and especially its suborder Heteroptera

Comments

@kermitt2
Owner

Coordinates for bounding boxes are incorrect for some PDFs; see this PMC example.

[Screenshot from 2018-07-16 20-48-47]

https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5181807/pdf/medi-95-e5368.pdf

@kermitt2 kermitt2 added the bug From Hemiptera and especially its suborder Heteroptera label Jul 16, 2018
@kermitt2
Owner Author

kermitt2 commented Mar 4, 2019

This will be fixed by the latest pdfalto, see kermitt2/pdfalto#43

Aazhar pushed a commit that referenced this issue Mar 5, 2019
* Use numerical mapping when ocr is not activated.
@kermitt2
Owner Author

kermitt2 commented Mar 5, 2019

Thanks @Aazhar, it is working as expected now!

@kermitt2 kermitt2 closed this as completed Mar 5, 2019
tantikristanti pushed a commit that referenced this issue Nov 15, 2019
* Use numerical mapping when ocr is not activated.


Former-commit-id: 82395ac