HTR_CalamariOCR

Handwritten Text Recognition using Calamari OCR framework.

About the Project

The aim of this work is to replicate the training and testing pipeline of a neural network about the problem of HTR.
This process is intended to demonstrate the efficiency of the Calamari OCR framework used for the model training and testing phases.
The training starts with the creation of the Ground Truth formed by segmented Text-lines in .png format and Transcriptions in associated text files. After the training of the neural network, the ability of the model in the HTR task is tested.

For more information, read the paper located in repo root.

Built with

Getting Started

To run this program you need to install some specific libraries version required by Calamari OCR, install the framework and clone the git repo for src code.

Prerequisites

The python 3.8.7 version of python3 is required to install Calamari OCR Framework.

Calamari requires a specific version of tensorflow to work with and some specific version of this python library, these are all the dependecies neded to work with Calamari OCR 2.0.1

tensorflow = 2.3.2
tfaip = 1.0.1
h5py = 2.10.0
numpy = 1.18.5

Installation

To install the package without a virtual environment simply run:

pip install calamari_ocr

To install the package from its source, download the source code and run

python setup.py install

Then download all dependecies. To download the source code check the Calamari OCR repo.

Usage

To create Ground Truth required by Calamari OCR you need to download from IAM Database the datasets. Inside the datasets you will find the .png for each text lines of the document and the corresponding trascriptions in a xml file.
To create the Ground Truth download src/ code and run the method in Parser.py .
After this run the the following lines from command line to compute the training, prediction and evaluate:

calamari-train --files your_images.*.png

Note, that calamari expects that each image file (.png) has a corresponding ground truth text file (.gt.txt) at the same location with the same base name. Required also by the evaluation step.

calamari-predict --checkpoint path_to_model.ckpt --files your_images.*.png

calamari-eval --gt *.gt.txt

Calamari OCR also presents the possibility of early stopping during training, providing a validation set. Many other training options can be found on the repo Calamari OCR.
In the src/ directory you can find the following files:

Parser.py : Creates the g Truth and manages the files within it
Utilities.py : Utilities methods and creation of Confusion Matrix
Lines_Detector.py : Image preprocessing and simple line segmentation

Authors

Lorenzo Gianassi
Francesco Gigli

Acknowledgments

Data and Document Mining Project © Course held by Professor Simone Marinai - Computer Engineering Master Degree @University of Florence

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
Images		Images
src		src
.gitignore		.gitignore
README.md		README.md
XML_IAM_template.txt		XML_IAM_template.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

HTR_CalamariOCR

Table of Contents

About the Project

Built with

Getting Started

Prerequisites

Installation

Usage

Authors

Acknowledgments

About

Releases

Packages

Contributors 2

Languages

LorenzoGianassi/HTR_CalamariOCR

Folders and files

Latest commit

History

Repository files navigation

HTR_CalamariOCR

Table of Contents

About the Project

Built with

Getting Started

Prerequisites

Installation

Usage

Authors

Acknowledgments

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages