Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add hocr__page transformation #113

Closed
zuphilip opened this issue Dec 31, 2019 · 1 comment
Closed

Add hocr__page transformation #113

zuphilip opened this issue Dec 31, 2019 · 1 comment
Labels
transformation Suggestions for new transformations to include

Comments

@zuphilip
Copy link
Member

zuphilip commented Dec 31, 2019

The transformation alto__page does actually also support hocr input and therefore we can simply use it for the transformation hocr__page as well. One should only need some symlinking and updateing the documentations. But we may want after PR #106 has been integrated to do this.

BTW one can already try this out by using the alto__page transformation on a hocr file, e.g. https://digi.bib.uni-mannheim.de/~stweil/ocr-praxis/0001-tesseract.hocr .

@zuphilip zuphilip added the transformation Suggestions for new transformations to include label Dec 31, 2019
@stweil
Copy link
Member

stweil commented Dec 31, 2019

Commit cedace7 (now added to PR #106) should do that.

@zuphilip zuphilip closed this as completed Jan 6, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
transformation Suggestions for new transformations to include
Projects
None yet
Development

No branches or pull requests

2 participants