Skip to content
This repository has been archived by the owner on Sep 21, 2019. It is now read-only.

Can not download tessdata #55

Closed
longnguyencmg opened this issue Aug 26, 2016 · 9 comments
Closed

Can not download tessdata #55

longnguyencmg opened this issue Aug 26, 2016 · 9 comments

Comments

@longnguyencmg
Copy link

Please help to check this link to download tessdata. URL not found :(

http://tesseract-ocr.googlecode.com/files/

@rmtheis
Copy link
Owner

rmtheis commented Aug 26, 2016

This project doesn't use that as a download location, it uses tesseract-ocr.googlecode.com/files/.

@rmtheis rmtheis closed this as completed Aug 26, 2016
@rmtheis
Copy link
Owner

rmtheis commented Aug 26, 2016

Oops, I misread your comment before. You're saying the trained data download links are broken in the app, right? Do you know of an alternative download location?

@rmtheis rmtheis reopened this Aug 26, 2016
@longnguyencmg
Copy link
Author

Hello, I'm not sure if this https://github.com/tesseract-ocr/tessdata is enough information?
Currently, I'm looking at your OcrInitAsyncTask.java, you need .traineddata and osd.traineddata. But I just can find .traineddata from the link above.

@rmtheis
Copy link
Owner

rmtheis commented Aug 27, 2016

Hmm, OK. I think using Firebase or S3 would be better than hotlinking to Github.

Maybe we can include the English/OSD data in the application assets and link to Firebase for the rest.

@longnguyencmg
Copy link
Author

Actually, I fixed it download .traindeddata from github directly without unzip, etc. But I'm not sure if it's a good solution. I didn't try S3 or Firebase. Waiting for your solution 🎯 . Cheers

@fdocharles
Copy link

Is any other way to do this.... please its urgent

@zyxrrr
Copy link

zyxrrr commented Nov 10, 2016

Please help to check this link to download tessdata. URL not found :(

http://tesseract-ocr.googlecode.com/files/

Or help to how to package the appropriate training data files in the app ?

@tabletguy
Copy link

By checking the base URL of http://tesseract-ocr.googlecode.com it will redirect to a page that says

_tesseract-ocr has Moved!

This project has moved to a new location on the internet. Its new home is at:_

https://github.com/tesseract-ocr

Source files in the program could (should) be changed to reflect this, IMO.

@rmtheis
Copy link
Owner

rmtheis commented Nov 10, 2016

I don't think hotlinking to Github is a good idea. I suggest packaging the data files in the app (like I've done with the English training data) or hosting the download yourself using Firebase.

@rmtheis rmtheis closed this as completed Nov 10, 2016
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants