Add OCR Service #389

Closed
dannylamb opened this issue Oct 4, 2016 · 7 comments

@dannylamb
Contributor

Depends on #387

Add a simple OCR web service that executes Tesseract on any TIFF that gets POSTed to it. It should respect a header such as X-Islandora-Ocr-Args so you can pass in command line arguments.

Service will consume image/tiff.
Service will produce text/plain, text/html, and application/pdf (because that's what Tesseract produces).
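
To make the proposed contract concrete, here is a minimal Python sketch of the negotiation step. Everything in it is an assumption for discussion and not an agreed design: the `negotiate` function, the `OUTPUT_CONFIGS` table, and the choice of status codes (415 and 406) are hypothetical. The `hocr` and `pdf` names are the stock Tesseract config files that select hOCR and PDF output. Quality values in the Accept header are ignored for simplicity.

```python
# Hypothetical mapping from the media types listed above to Tesseract config names.
OUTPUT_CONFIGS = {
    "text/plain": None,        # plain text is Tesseract's default output
    "text/html": "hocr",       # hOCR is HTML carrying layout information
    "application/pdf": "pdf",  # searchable PDF
}


def negotiate(content_type, accept="text/plain"):
    """Return (status_code, tesseract_config) for an incoming request.

    Assumed behaviour: 415 for anything that is not image/tiff,
    406 when none of the requested output types is supported.
    """
    # Drop media type parameters such as "; charset=..." before comparing.
    if content_type.split(";")[0].strip().lower() != "image/tiff":
        return 415, None
    for item in accept.split(","):
        media_type = item.split(";")[0].strip().lower()
        if media_type in ("", "*/*"):
            media_type = "text/plain"  # assumed default when the client has no preference
        if media_type in OUTPUT_CONFIGS:
            return 200, OUTPUT_CONFIGS[media_type]
    return 406, None
```

For example, `negotiate("image/tiff", "application/pdf")` picks the `pdf` config, while a POST with `Content-Type: image/png` is rejected before Tesseract is ever invoked.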

@dannylamb dannylamb self-assigned this Oct 4, 2016
@DiegoPino
Contributor

Do you have some planning/sketching on this? Want to use something from Aaron?
I do have a Camel example on thumbnail creation for JPEG. Maybe we could expand on that. Thanks!

@dannylamb
Contributor Author

I've got a skeleton wired up to the Gradle build machinery. I'll clean it up and provide it after I clear out stuff for #387.

@dannylamb
Contributor Author

We'll talk about its mini API first, before going anywhere with it.

@whikloj
Member

whikloj commented Oct 4, 2016

@dannylamb
Contributor Author

Yeah, pretty much. For a web service we should stream the TIFF to stdin and read the result from stdout, tacking the args from the header onto the end of the command.
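
A rough Python sketch of that stdin/stdout idea, for discussion only. `build_command` and `run_ocr` are hypothetical names, and `config` stands for an optional Tesseract config such as `hocr` or `pdf`. It relies on Tesseract accepting the literal words `stdin` and `stdout` in place of the image name and output base. The header value is split with `shlex` and passed as an argument list, so no shell is involved. A real service would probably also want to restrict which args it accepts.

```python
import shlex
import subprocess


def build_command(ocr_args="", config=None, binary="tesseract"):
    """Build the argv list: tesseract stdin stdout [header args...] [config]."""
    cmd = [binary, "stdin", "stdout"]
    # Args from the X-Islandora-Ocr-Args header are tacked on at the end.
    cmd += shlex.split(ocr_args)
    # Tesseract expects config names (e.g. "hocr", "pdf") after the options.
    if config:
        cmd.append(config)
    return cmd


def run_ocr(tiff_bytes, ocr_args="", config=None):
    """Pipe the POSTed TIFF to Tesseract on stdin and return what it writes to stdout."""
    result = subprocess.run(
        build_command(ocr_args, config),
        input=tiff_bytes,
        stdout=subprocess.PIPE,
        stderr=subprocess.PIPE,
        check=True,  # raises CalledProcessError on a non-zero exit status
    )
    return result.stdout
```

With a header of `X-Islandora-Ocr-Args: -l eng` and a PDF response requested, this would run `tesseract stdin stdout -l eng pdf`.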

@ruebot
Member

ruebot commented Oct 21, 2016

@ruebot
Member

ruebot commented Mar 23, 2017

@ruebot ruebot closed this as completed Mar 23, 2017