This library is slower than linux #2

Open
bradosia opened this issue Mar 23, 2020 · 1 comment

Comments

@bradosia
Owner

bradosia commented Mar 23, 2020

This library is allegedly slower than Tesseract built on Linux.

Originally mentioned by lucaszanella:

I've just compiled the tesseract example in Linux and it recognized the same photo in 0.3 seconds, something's very wrong with your library.

Originally posted by @LucasZanella in #1 (comment)

@bradosia
Owner Author

bradosia commented Mar 23, 2020

@LucasZanella I compiled Tesseract with MinGW-w64 for the x86_64 architecture using release flags. The compiler info and options are listed in the README in the root of this repository.

I have researched many issues:
tesseract-ocr/tesseract#263
tesseract-ocr/tesseract#2611
tesseract-ocr/tesseract#1278

I found that the legacy engine is much faster than the LSTM engine in Tesseract 4, but less accurate. With the Tesseract 4 API, changing the tesseract::OcrEngineMode to tesseract::OEM_TESSERACT_ONLY greatly improved the speed. This is probably what users of this library are running into when recognition seems slow.
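For anyone who wants to check this on their own machine, here is a minimal sketch of a timing helper for comparing the two engine modes on the same image. `time_ms` is a hypothetical name, not part of this library or of Tesseract. The commented usage assumes the Tesseract 4 C++ API and a placeholder image path, and it assumes the installed `eng` traineddata actually contains the legacy model (as far as I know, LSTM-only data files will not initialize with tesseract::OEM_TESSERACT_ONLY).

```cpp
#include <chrono>
#include <utility>

// Hypothetical helper (not part of this library or of Tesseract):
// runs `fn` once and returns the elapsed wall-clock time in milliseconds.
template <typename Fn>
double time_ms(Fn&& fn) {
    const auto start = std::chrono::steady_clock::now();
    std::forward<Fn>(fn)();
    const auto stop = std::chrono::steady_clock::now();
    return std::chrono::duration<double, std::milli>(stop - start).count();
}

// Intended use with the Tesseract 4 C++ API (needs <tesseract/baseapi.h>
// and <leptonica/allheaders.h>; "photo.png" is a placeholder path):
//
//   tesseract::TessBaseAPI api;
//   // Init returns 0 on success. Swap in tesseract::OEM_LSTM_ONLY
//   // to time the LSTM engine on the same image.
//   if (api.Init(nullptr, "eng", tesseract::OEM_TESSERACT_ONLY) != 0) {
//       /* handle the initialization failure */
//   }
//   Pix* image = pixRead("photo.png");
//   api.SetImage(image);
//   double ms = time_ms([&] {
//       char* text = api.GetUTF8Text();  // caller owns the buffer
//       delete[] text;
//   });
//   api.End();
//   pixDestroy(&image);
```

Timing only the recognition call, with initialization and image loading kept outside the measured lambda, keeps the comparison between engine modes from being skewed by model loading time.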

So the perceived slowness seems to come from the engine mode chosen by the caller rather than from the compiler options.
