You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This project uses the Tesseract OCR library to extract text from images. The text is then parsed using regular expressions to extract the numbers. The numbers are then written to a text file in the output directory. To use this project, simply place the input images in the input_images directory and run the Python script.
A wrapper on top of python-OCR tools such as pytesseract and easyocr, to recognize and extract text embedded in images. Also, convert scanned-PDFs to text searchable PDFs.