Skip to content

automatically scan, ocr and store documents to your NAS

Notifications You must be signed in to change notification settings

hgruber/docfiling

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 

Repository files navigation

Automatic document filing

This tiny project allows you to automatically scan documents, automatically recognize the text and save it in your document folder. Thus, you can index your non-electronic communications in a very convenient way.

I use a fujitsu fi-5110C auto feeding duplex color scanner for this purpose. Most software (tesseract, sane, unpaper, ImageMagick, ghostscript) is also found on github.com. I run a docker image on my server to which the scanner is attached to via usb cable and with direct access to the document folder on the NAS:

docker run -d --privileged -v /dev/bus/usb:/dev/bus/usb -v documents:/documents scan

The script is nicely performing for two years now.

todos

  • provide config for other languages (currently for German)
  • try to run script on Raspberry Pi to create a cheap standalone scanning system
  • rewrite script in python
  • automatic folder storage on filter words

About

automatically scan, ocr and store documents to your NAS

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published