captcha-cracking

Simple captcha cracking methods

Using following steps you can easily convert and automatically recognize alpha-numeric captcha with high rate of success. It may be useful for fast creation of proof-of-concept.

Preparement: Install required packages

# apt install tesseract-ocr libtesseract4 imagemagick curl

Step 1: Download image

# curl -s -o captcha.png https://example.com/captcha.png

Step 2: The most import part in automation is to make image black and white, monochrome using imagemagic tools.

# convert captcha.png -fill black -fuzz 30% +opaque "#BAB5BB" -negate -monochrome result.png

Step 3: Run tesseract to recognize image and read out.txt with cat. Option '-c' is used for specifying config with alfabet

# tesseract result.png out --oem 0 -c tessedit_char_whitelist=abcdefghijklmnopqrstuvwxyz0123456789; cat out.txt
Tesseract Open Source OCR Engine v4.0.0 with Leptonica
Warning: Invalid resolution 0 dpi. Using 70 instead.
Estimating resolution as 321
29c70d

Additional info: I had to install eng.traineddata from github, because the tainedata file for tesseract from kali repo was broken

# rm /usr/share/tesseract-ocr/4.00/tessdata/eng.traineddata; wget https://github.com/tesseract-ocr/tessdata/raw/master/eng.traineddata -O /usr/share/tesseract-ocr/4.00/tessdata/eng.traineddata

Tesseract version used:

tesseract-ocr/kali-rolling,now 4.0.0-1+b1 amd64 [installed]

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
LICENSE		LICENSE
README.md		README.md
captcha.png		captcha.png
result.png		result.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

captcha-cracking

About

Releases

Packages

License

awakenine/captcha-cracking

Folders and files

Latest commit

History

Repository files navigation

captcha-cracking

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages