This repository contains the pipeline for breaking a text-based CAPTCHA using python and deep learning. The pipeline has the following 3 modules:
- CAPTCHA-Generator is used to generate a limited dataset of text-based CAPTCHAs and there are different variations to choose from.
- Pix2Pix is a GAN (Generative Adversarial Network) which is used to generate more of the selected variation of text-based CAPTCHAs.
- CAPTCHA-Classifier is a CNN (Convolutional Neural Network) which is used to recognize the characters contained in the text-based CAPTCHA.
- Tensorflow 1.4.1
- Python 3.x
- OpenCV
- Download the datasets and extract them inside the
CAPTCHA-Generator\Images
directory to be able to generate different variety of text-based CAPTCHAs having different background and textured alphabets. - Run the
CAPTCHA-Breaker.bat
file and you are good to go.