Apprentice Bot - Transcribers Of Reddit

This is the source code for a helper bot, making attempts at transcribing content as it is posted to the subreddit /r/TranscribersOfReddit, a community dedicated to transcribing images, audio, and video. It acts under the username "/u/transcribot".

This bot is still in training and might not be able to recognize everything it attempts. Some transcriptions might be complete trash, but the hope is that it will be a start to a more legitimate, volunteer-written transcription.

Resources

Redis (tracking completed posts and queue system) Tesseract (OCR solution)

NOTE:

This code is not complete. The praw.ini file is required to run the bots and contains information such as the useragents and certain secrets. It is built for Python 3.6.

Installation

$ git clone https://github.com/GrafeasGroup/tor_ocr.git tor_ocr
$ cd tor_ocr/
$ pip install --process-dependency-links .

OR

$ pip install --process-dependency-links 'git+https://github.com/GrafeasGroup/tor_ocr.git@master#egg=tor_ocr-0'

High-level functionality

Monitoring daemon (via Redis queue):

Pull job (by post id) off of queue:
- Download image
- OCR the image
- If OCR successful:
  - Post OCR-ed content to post on /r/TranscribersOfReddit in 9000 character chunks, replying to previous comment when over 9000 characters
- Delete local copy of image

Running Apprentice Bot

$ tor-apprentice
# => [daemon mode + logging]

Contributing

See CONTRIBUTING.md for more.

Name		Name	Last commit message	Last commit date
Latest commit History 63 Commits
test		test
tor_ocr		tor_ocr
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CODEOWNERS		CODEOWNERS
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE.txt		LICENSE.txt
Makefile		Makefile
README.md		README.md
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Apprentice Bot - Transcribers Of Reddit

Resources

Installation

High-level functionality

Running Apprentice Bot

Contributing

About

Releases

Packages

Languages

License

yukilikespie/tor_ocr

Folders and files

Latest commit

History

Repository files navigation

Apprentice Bot - Transcribers Of Reddit

Resources

Installation

High-level functionality

Running Apprentice Bot

Contributing

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages