Audio Transcriber

A basic project to make use of the OpenAI Whisper general-purpose speech recognition model to transcribe audio files.

Reference: https://github.com/openai/whisper

The transcriber function expects to be passed a model name to use, a blob container of input audio files, and (optionally) an output blob container for transcripts.

Installation

The recommended way to set up this project for development is using Poetry to install and manage a virtual Python environment. With Poetry installed, change into the project directory and run:

poetry install

Activate the virtualenv like so:

poetry shell

To run Python commands in the activated virtualenv, thereafter run them as normal:

python manage.py

Manage new or updating project dependencies with Poetry also, like so:

poetry add newpackage==1.0

Environment variables

This project uses confy to set environment variables (in a .env file). The following variables are required for the project to run:

AZURE_CONNECTION_STRING=AzureStorageAccountConnectionString

Running

Run locally like so:

python transcriber.py --help

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.github		.github
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
transcriber.py		transcriber.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Audio Transcriber

Installation

Environment variables

Running

About

Releases

Packages

Languages

License

dbca-wa/audio-transcriber

Folders and files

Latest commit

History

Repository files navigation

Audio Transcriber

Installation

Environment variables

Running

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages