Document-Scanner

Document Scanner using 2 approaches viz. Hough Transform & Autoencoders

Photos - Original dataset for training
Annotations - XML files for annotations of document in the images
Maskedimages - Masks for the training images (Used in AutoEncoders)
Hough Images/Canny - Output of the images for the respective approach

This project was inspired by this: https://blogs.dropbox.com/tech/2016/08/fast-and-accurate-document-detection-for-scanning/

Training Dataset consists of training images as well as their augmented images (As sufficient data was not available)

Hough Transform Pipeline for this approach:
1. Opening operation on the image
2. Gaussian Blurring
3. Thresholding the images
4. Laplacian edge detection
5. Hough Transform
AutoEncoders This approach needed masks for the images which were generated with help of annotation files and Generating_masks.ipynb

These masks were feeded to a 7 layered convolutional network with only 20 training epochs. The results were better when compared with Hough Transform.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
Annotations		Annotations
Canny		Canny
Hough_images		Hough_images
Maskedimages		Maskedimages
photos		photos
AutoEncoders.ipynb		AutoEncoders.ipynb
Generating_masks.ipynb		Generating_masks.ipynb
Hough_transform.ipynb		Hough_transform.ipynb
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Document-Scanner

About

Releases

Packages

Languages

License

jkashish18/Document-Scanner

Folders and files

Latest commit

History

Repository files navigation

Document-Scanner

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages