GitHub - chrislarson/subtitle-ocr: Subtitle OCR using a CRNN implemented with Tensorflow

Subtitle OCR using a CRNN

The supplied model is a Convolution Recurrent Neural Work trained to convert image-based subtitles into text-based subtitles. This model was written and trained on a MacBook Pro, so the included instructions are based on a unix-like environment. The tensorflow-metal was plugin was used for GPU acceleration. If the requirement cannot be satisfied on your machine, use the `requirements_alt.txt` file during installation.

Requirements:

Python 3.10.12
Unix-like environment

Setup:

Create a virtual environment in the root of the repository. sh```python3 -m venv .venv


2. Activate the virtual environment.
   sh```source .venv/bin/activate```

3. Install dependencies (while in the activated virtual environment):
   sh```(.venv) pip install -r requirements.txt

Running:

Word-level inference model: sh(.venv) python3 words_inference.py

Line-level inference model: sh`(.venv) python3 lines_inference.py`

The inference model runs will open an image preview through OpenCV. Advance through the images by pressing any key.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
data_utils		data_utils
models		models
.gitignore		.gitignore
.tool-versions		.tool-versions
README.md		README.md
lines_inference.py		lines_inference.py
lines_train.py		lines_train.py
model.py		model.py
model_config.py		model_config.py
report.pdf		report.pdf
requirements.txt		requirements.txt
requirements_alt.txt		requirements_alt.txt
words_inference.py		words_inference.py
words_train.py		words_train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Subtitle OCR using a CRNN

About

Releases

Packages

Languages

chrislarson/subtitle-ocr

Folders and files

Latest commit

History

Repository files navigation

Subtitle OCR using a CRNN

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages