Dot-matrix text recognition in Python via PyTesseract (based on Tesseract)
Runs Tesseract OCR on an image file to extract its text, uses a regular expression to keep only the strings that match a common pattern, and stores the results in an Excel spreadsheet.
- To install dependencies:
pip3 install -r requirements.txt
- To run:
python3 script.py
Then enter the image file to be scanned (e.g. test.jpg) and a common pattern string.
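The flow described above (OCR, regex filter, spreadsheet) can be sketched roughly as follows. This is a minimal illustration, not the contents of script.py: the function names `filter_matches`, `scan_image`, and `save_to_excel` are hypothetical, the use of `openpyxl` for the Excel step and the `results.xlsx` output name are assumptions, and the pattern is assumed to be passed as a regular expression.

```python
import re


def filter_matches(ocr_text, pattern):
    """Return every non-overlapping substring of ocr_text that matches pattern."""
    return [m.group(0) for m in re.finditer(pattern, ocr_text)]


def scan_image(image_path, pattern):
    """OCR an image with Tesseract and keep only the strings matching pattern."""
    # Imported lazily so filter_matches can be used without the OCR stack.
    from PIL import Image
    import pytesseract

    text = pytesseract.image_to_string(Image.open(image_path))
    return filter_matches(text, pattern)


def save_to_excel(matches, output_path="results.xlsx"):
    """Write one match per row to an .xlsx file (openpyxl is an assumption)."""
    from openpyxl import Workbook

    workbook = Workbook()
    sheet = workbook.active
    sheet.append(["match"])  # header row
    for match in matches:
        sheet.append([match])
    workbook.save(output_path)
```

Under these assumptions, a call such as `save_to_excel(scan_image("test.jpg", r"[A-Z]{2}\d{4}"))` would scan test.jpg for codes made of two capital letters followed by four digits and write them to results.xlsx. Keeping the regex step in its own function makes it easy to check the pattern against sample text before running OCR.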