Whisper ASR Webservice

Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitask model that can perform multilingual speech recognition as well as speech translation and language identification. For more details: github.com/openai/whisper

Features

Current release (v1.6.0) supports following whisper models:

Quick Usage

CPU

docker run -d -p 9000:9000 -e ASR_MODEL=base -e ASR_ENGINE=openai_whisper onerahmet/openai-whisper-asr-webservice:latest

GPU

docker run -d --gpus all -p 9000:9000 -e ASR_MODEL=base -e ASR_ENGINE=openai_whisper onerahmet/openai-whisper-asr-webservice:latest-gpu

for more information:

Documentation

Explore the documentation by clicking here.

Credits

This software uses libraries from the FFmpeg project under the LGPLv2.1

Name		Name	Last commit message	Last commit date
Latest commit History 248 Commits
.github		.github
app		app
docs		docs
.dockerignore		.dockerignore
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
Dockerfile		Dockerfile
Dockerfile.gpu		Dockerfile.gpu
LICENCE		LICENCE
README.md		README.md
docker-compose.gpu.yml		docker-compose.gpu.yml
docker-compose.yml		docker-compose.yml
mkdocs.yml		mkdocs.yml
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Whisper ASR Webservice

Features

Quick Usage

CPU

GPU

Documentation

Credits

About

Releases 19

Contributors 12

Languages

License

ahmetoner/whisper-asr-webservice

Folders and files

Latest commit

History

Repository files navigation

Whisper ASR Webservice

Features

Quick Usage

CPU

GPU

Documentation

Credits

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 19

Contributors 12

Languages