-
-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Images unnecessarily compressed? #163
Comments
No, it can't do anything fancy with unpaper.
I think this could a recent Ghostscript behavior change. If you don't care about PDF/A you can use I'll make sure Ghostscript gets explicit directions about recompression in the next release. |
Ghostscript's default behavior (that is,
Because of this I will add a new argument to choose the output image type: auto, jpeg, or lossless. Auto will let Ghostscript decide, or when |
I'm starting a workflow with my own (sometimes badly) scanned PDFs from books, which I convert to pgm, then feed to unpaper to get each of the two visible pages into one document each (which I don't think ocrmypdf can use unpaper to do, right?). Then I clean these up a little and run
img2pdf output.pfg | ocrmypdf —image-dpi 150 - result.pdf
.What I see with this is that ocrmypdf is converting the input images to jpeg, which I thought it would do only if it has to force-ocr them. Do I misunderstand? Is it possible to leave the original images (which is #125 )?
(I didn't quite understand if a solution was found here.)
The text was updated successfully, but these errors were encountered: