extract-text-from-image

Here are 3 public repositories matching this topic...

sudhanshu25012002 / Extracting-phone-numbers-from-multiple-images-using-Python

This project uses the Tesseract OCR library to extract text from images, then parses that text with regular expressions to pull out the phone numbers. The numbers are written to a text file in the output directory. To use the project, place the input images in the input_images directory and run the Python script.

python extraction img-to-pdf easyocr extract-text-from-image

Updated Apr 25, 2024
Python
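
The pipeline described for Extracting-phone-numbers-from-multiple-images-using-Python (OCR each image, match numbers with a regular expression, write them to a file) might look roughly like the sketch below. This is not the repository's code: the function names, the phone-number pattern (which assumes 10-digit, North American style numbers with an optional country code), the list of image extensions, and the output file name are all illustrative assumptions. The OCR step relies on pytesseract and Pillow plus an installed Tesseract binary.

```python
import re
from pathlib import Path

# Assumed pattern: optional country code, then 3-3-4 digits with optional
# space, dot, or hyphen separators. Real-world formats vary widely.
PHONE_RE = re.compile(
    r"(?<!\d)(?:\+?\d{1,3}[\s.-]?)?(?:\(\d{3}\)|\d{3})[\s.-]?\d{3}[\s.-]?\d{4}(?!\d)"
)

# Assumed set of file extensions treated as input images.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".tif", ".tiff", ".bmp"}


def ocr_image(path):
    """Return the text Tesseract recognizes in one image."""
    # Imported lazily so the parsing helpers work without the OCR stack.
    import pytesseract
    from PIL import Image

    with Image.open(path) as img:
        return pytesseract.image_to_string(img)


def extract_phone_numbers(text):
    """Return every substring of `text` that matches PHONE_RE, in order."""
    return PHONE_RE.findall(text)


def process_directory(input_dir, output_file, ocr=ocr_image):
    """OCR every image in `input_dir` and write the numbers found, one per line."""
    numbers = []
    for path in sorted(Path(input_dir).iterdir()):
        if path.suffix.lower() in IMAGE_SUFFIXES:
            numbers.extend(extract_phone_numbers(ocr(path)))

    out = Path(output_file)
    out.parent.mkdir(parents=True, exist_ok=True)
    out.write_text("\n".join(numbers) + ("\n" if numbers else ""), encoding="utf-8")
    return numbers


if __name__ == "__main__":
    # Hypothetical paths mirroring the directories named in the description.
    process_directory("input_images", "output/phone_numbers.txt")
```

Passing the OCR function as a parameter keeps the regular-expression step testable on plain strings, without Tesseract installed.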

sxaxmz / handle_scanned_pdf

A wrapper around Python OCR tools such as pytesseract and easyocr that recognizes and extracts text embedded in images. It also converts scanned PDFs into text-searchable PDFs.

tesseract-ocr pytesseract ocr-python scanned-image-pdfs searchable-pdf easyocr scanned-pdf-documents extract-text-from-image extract-text-from-pdf
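
A wrapper of the kind handle_scanned_pdf describes could, in outline, dispatch to either OCR backend behind one function and use Tesseract's PDF renderer to produce a searchable page. The sketch below is an assumption about how such a wrapper might be shaped, not that project's actual interface: `recognize_text`, `image_to_searchable_pdf`, and their parameters are hypothetical names. It handles a single page image; rasterizing a multi-page scanned PDF first would need an additional tool and is left out.

```python
from pathlib import Path

# Backends this hypothetical wrapper knows how to call.
BACKENDS = ("pytesseract", "easyocr")


def recognize_text(image_path, backend="pytesseract", easyocr_langs=("en",)):
    """Return the text found in an image using the chosen OCR backend."""
    if backend not in BACKENDS:
        raise ValueError(f"unknown backend {backend!r}; expected one of {BACKENDS}")

    if backend == "pytesseract":
        # Requires pytesseract, Pillow, and the Tesseract binary.
        import pytesseract
        from PIL import Image

        with Image.open(image_path) as img:
            return pytesseract.image_to_string(img)

    # easyocr: detail=0 makes readtext return plain strings only.
    import easyocr

    reader = easyocr.Reader(list(easyocr_langs))
    return "\n".join(reader.readtext(str(image_path), detail=0))


def image_to_searchable_pdf(image_path, pdf_path):
    """Write a one-page PDF containing the image plus an invisible text layer."""
    import pytesseract
    from PIL import Image

    with Image.open(image_path) as img:
        pdf_bytes = pytesseract.image_to_pdf_or_hocr(img, extension="pdf")
    Path(pdf_path).write_bytes(pdf_bytes)
```

The OCR libraries are imported inside the functions so that picking one backend does not require the other to be installed.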