-
Notifications
You must be signed in to change notification settings - Fork 15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ocrd-segment-repair: handle case where points is empty #60
Comments
The problem is that you have a text region with empty
as error message. How was the empty PAGE generated? If it's by an OCR-D processor, we need to fix it. |
This error I have made by my own ;-) - I know that I need to correct something in my code - but still as it only occurs once, I cannot go on with all the other regions ... Just, would be nice, if extract-lines would be a bit more robust ... |
We've discussed whether OCR-D processors should be robust to invalid or unconventional PAGE in the past. IIRC the general consensus was that it would overstretch both the coding effort (much more boilerplate and things one can do wrong or forget to do) and the performance. So the idea is to selectively use ocrd-segment-repair if you know you have problems in your input (or after some processor's output). Not sure if your particular case (missing |
Understood, of course. In this special I already have fixed the root cause. Therefore, no need to do something like |
Too bad – I was quite curious how it would handle that case, you know :-) |
You guessed it: it wouldn't work! I created OCR-D/core#877 for the core side, but we also have to handle that case differently in the repair code here. So let's keep open, and I'll rename the issue. |
Version 0.1.20, ocrd/core 2.33.0
I have a PAGE file, which does not have any real content - like this:
If I call
ocrd-segment-extract-lines
, I get an expection like this:My expection would be, that this PAGE file simply would be ignored.
--> please, clarify ...
The text was updated successfully, but these errors were encountered: