Document Scanner using 2 approaches viz. Hough Transform & Autoencoders
- Photos - Original dataset for training
- Annotations - XML files for annotations of document in the images
- Maskedimages - Masks for the training images (Used in AutoEncoders)
- Hough Images/Canny - Output of the images for the respective approach
This project was inspired by this: https://blogs.dropbox.com/tech/2016/08/fast-and-accurate-document-detection-for-scanning/
Training Dataset consists of training images as well as their augmented images (As sufficient data was not available)
-
Hough Transform Pipeline for this approach:
- Opening operation on the image
- Gaussian Blurring
- Thresholding the images
- Laplacian edge detection
- Hough Transform
-
AutoEncoders This approach needed masks for the images which were generated with help of annotation files and Generating_masks.ipynb
These masks were feeded to a 7 layered convolutional network with only 20 training epochs. The results were better when compared with Hough Transform.