Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

OCR picks up all the text, but alignment is off #1009

Closed
nchammas opened this issue Aug 29, 2022 · 2 comments
Closed

OCR picks up all the text, but alignment is off #1009

nchammas opened this issue Aug 29, 2022 · 2 comments

Comments

@nchammas
Copy link

This is an amazing tool. Thank you for sharing it with the world.

Here is an example PDF:

The OCR is working well, however the alignment seems to be off. Highlighting a bit of text or a word seems to miss the last character or so, even though if you try copying and pasting it's clear that you got everything.

Some examples:
Screen Shot 2022-08-29 at 5 12 50 PM
Screen Shot 2022-08-29 at 5 13 02 PM

In both these cases, the expected text is actually highlighted, even though it doesn't look like it. In other words, the full date string will actually be copied, as well as the full word "Multiple". But the highlight suggests that somehow the final letter or so wasn't picked up by OCR.

I'm running OCRmyPDF 13.7.0 and Preview 11.0 on macOS.

@amitdo
Copy link

amitdo commented Oct 2, 2022

@jbarlow83
Copy link
Collaborator

Will be fixed by changes in #1194

jbarlow83 added a commit that referenced this issue Dec 3, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants