Skip to content

This project uses OpenCV for Image Segmentation and Tesseract for Text Recognition.

Notifications You must be signed in to change notification settings

kbp12/Receipt-OCR

Repository files navigation

Receipt: Optical Character Recognition

This project uses OpenCV for Image Segmentation and Tesseract for Text Recognition.

Business Use Case -Invoices and Receipts are an essential aspect of any trade between two parties, be it between companies or between roadside stores and a consumer. Manually reconciling digital invoices is very time-consuming and can also cause manual human errors.

Extracting information from any document is an uphill task and involves a combination of object classification and object localisation.

OCR digitisation addresses the challenge of automatically extracting, which plays a critical role in streamlining document-intensive processes and office automation in many financial, accounting, and taxation areas.

TECH STACK:

Language: Python, Object detection: YOLO V4, Text Recognition: Tesseract OCR

Key learnings

[1] Understanding Object detection

[2] Understanding how to use pre-trained models of YOLO

[3] Training Custom Object Detector with YOLO

[4] Understanding Text extraction using Tesseract

About

This project uses OpenCV for Image Segmentation and Tesseract for Text Recognition.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published