Extracting Transcripts from Video/Audio Files

This repository contains example code for extracting text from video or audio files.

Getting Started

Start by cloning the Git repository:

git clone https://github.com/lena-will/video-to-text.git

Data

The model takes a WAV file as input. To transcribe a video, first extract its audio track into a WAV file.
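
One way to do the extraction step is sketched below. It is not necessarily how this repository does it. The sketch assumes the `ffmpeg` command-line tool is installed and on the `PATH`; the helper names `build_ffmpeg_command` and `extract_audio` are hypothetical. The output is mono audio at 16 kHz, the sampling rate Whisper works with.

```python
import subprocess
from pathlib import Path


def build_ffmpeg_command(video_path, wav_path, sample_rate=16000):
    """Return the ffmpeg argument list that writes mono 16-bit PCM WAV audio.

    Hypothetical helper; assumes ffmpeg is the extraction tool.
    """
    return [
        "ffmpeg",
        "-y",                      # overwrite the output file if it exists
        "-i", str(video_path),     # input video (or audio) file
        "-vn",                     # drop the video stream
        "-ac", "1",                # downmix to a single channel
        "-ar", str(sample_rate),   # resample to the requested rate
        "-c:a", "pcm_s16le",       # uncompressed 16-bit PCM
        str(wav_path),
    ]


def extract_audio(video_path, wav_path=None):
    """Extract the audio track of video_path into a WAV file and return its path."""
    video_path = Path(video_path)
    # Default: same name as the input, with a .wav extension.
    wav_path = Path(wav_path) if wav_path else video_path.with_suffix(".wav")
    # check=True raises CalledProcessError if ffmpeg exits with an error.
    subprocess.run(build_ffmpeg_command(video_path, wav_path), check=True)
    return wav_path
```

Calling `extract_audio("talk.mp4")` would then write `talk.wav` next to the input file, where `talk.mp4` is a placeholder file name.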

Model

For the transcription, OpenAI's Whisper model is used. A version of the model is publicly available on HuggingFace: https://huggingface.co/openai/whisper-large-v3.
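
As a rough illustration, the model can be loaded through the Hugging Face `transformers` speech-recognition pipeline. This is a minimal sketch and may differ from the code in this repository. It assumes `transformers` and a backend such as PyTorch are installed, and that `ffmpeg` is available so the pipeline can decode the file. The function name `transcribe` and the file name `audio.wav` are placeholders.

```python
MODEL_ID = "openai/whisper-large-v3"


def transcribe(wav_path, model_id=MODEL_ID):
    """Transcribe a WAV file with Whisper and return the recognized text.

    Hypothetical helper; the model weights are downloaded on first use.
    """
    # Imported lazily so this module can be imported without transformers installed.
    from transformers import pipeline

    asr = pipeline(
        "automatic-speech-recognition",
        model=model_id,
        chunk_length_s=30,  # split long recordings into 30-second chunks
    )
    result = asr(str(wav_path))
    return result["text"]


# Example call (placeholder file name):
# print(transcribe("audio.wav"))
```

The large-v3 checkpoint is sizable, so a GPU is advisable for longer recordings; a smaller Whisper checkpoint can be passed via `model_id` if speed matters more than accuracy.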