mrizkimaulidan/sententia

sententia

OCR (Optical Character Recognition) is a technology that recognizes text within a digital image. It is commonly used to recognize text in scanned documents and images.

Thanks to the gosseract library.

You need to install a few dependencies before you can run this program.

Linux or WSL is recommended. The program has not been tested on native Windows, where every dependency would have to be compiled by hand. To keep setup simple, treat Linux and WSL as the only supported platforms.

Install g++ and the other dependencies:

$ sudo apt install g++
$ sudo apt install libtesseract-dev
$ sudo apt install libleptonica-dev
$ sudo apt install tesseract-ocr

Project installation:

Clone the repository:

$ git clone https://github.com/mrizkimaulidan/sententia.git
$ cd sententia

Download the required dependencies:

$ go mod download

Build:

$ go build .

Show help:

$ ./sententia --help
Usage of ./sententia:
  -location string
        -location=path/to/new-image path to a new grayscale image
  -path string
        -path=path/to/image path to original image
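The help output above matches what Go's standard `flag` package prints for two string flags. The following is a minimal sketch of how such flags could be declared, not the actual sententia source. The `options` type, the `parseArgs` helper, and the rule that both flags are required are assumptions made for illustration.

```go
package main

import (
	"errors"
	"flag"
	"fmt"
	"io"
)

// options holds the two values described in the help text.
// The type name is hypothetical.
type options struct {
	path     string // original image to read
	location string // where the new grayscale image is written
}

// parseArgs is a hypothetical helper that declares the -path and
// -location flags and parses the given argument list.
func parseArgs(args []string) (options, error) {
	var opts options
	fs := flag.NewFlagSet("sententia", flag.ContinueOnError)
	fs.SetOutput(io.Discard) // return errors instead of printing them
	fs.StringVar(&opts.path, "path", "", "-path=path/to/image path to original image")
	fs.StringVar(&opts.location, "location", "", "-location=path/to/new-image path to a new grayscale image")
	if err := fs.Parse(args); err != nil {
		return options{}, err
	}
	// Assumption: both flags are mandatory. The README does not say so.
	if opts.path == "" || opts.location == "" {
		return options{}, errors.New("both -path and -location are required")
	}
	return opts, nil
}

func main() {
	// Sample arguments; a real program would pass os.Args[1:].
	opts, err := parseArgs([]string{
		"--path=original/original-image.jpg",
		"--location=grayscale/grayscale-image.jpg",
	})
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println("original:", opts.path)
	fmt.Println("grayscale:", opts.location)
}
```

The `flag` package accepts both the single-dash and double-dash spellings, which is why `--path` works even though the help text lists `-path`.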

Usage:

$ ./sententia --path=original/original-image.jpg --location=grayscale/grayscale-image.jpg

About

Optical Character Recognition with gosseract
