-
Notifications
You must be signed in to change notification settings - Fork 591
Models
Konstantin Baierer edited this page May 30, 2020
·
16 revisions
This is a list of trained models for recognition with ocropy.
Download the models and copy them to one of:
-
$OCROPUS_DATA
, which you can define in the environment, e.g. in~/.bashrc
:OCROPUS_DATA=/data/ocropus-models/
- the working directory where you run
ocropus-*
- a subdirectory
./models
from the current working directory /usr/local/share/ocropus
/usr/share/ocropus
- English Default: https://github.com/zuphilip/ocropy-models/raw/master/en-default.pyrnn.gz (80 MB)
- Fraktur: https://github.com/zuphilip/ocropy-models/raw/master/fraktur.pyrnn.gz (42 MB)
- Fraktur: https://github.com/jze/ocropus-model_fraktur (including training data and a model for CLSTM)
- early 20th century antiqua including south-east European characters: https://github.com/jze/ocropus-model_oesterreich-ungarn (including training data)
- (Old) French: https://github.com/zuphilip/ocropy-french-models (26 MB)
- [Older models by @tmbdev: http://www.tmbdev.net/ocropy/OLD/]
- models (normal text, italics) trained with the Hume Dialogues text (English text, published 1779): https://github.com/urhub/ocropy/tree/master/models
- a mixed OCRopus model trained on twelve Latin books printed with Antiqua types between 1471 and 1686 with a focus (ten out of twelve) on early works produced before 1600: https://github.com/chreul/OCR_Testdata_EarlyPrintedBooks (16MB)
- Japanese by @isaomatsunami: https://github.com/isaomatsunami/clstm-Japanese (56 MB)
- Telugu model & training data: https://github.com/ChillarAnand/likitham
- Coptic models: https://github.com/KELLIA/CopticOCR