🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
-
Updated
Nov 21, 2024 - Python
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Port of OpenAI's Whisper model in C/C++
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
🧠 Leon is your open-source personal assistant.
kaldi-asr/kaldi is the official location of the Kaldi project.
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
Faster Whisper transcription with CTranslate2
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
A PyTorch-based Speech Toolkit
End-to-End Speech Processing Toolkit
Speech recognition module for Python, supporting several engines and APIs, online and offline.
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
💬 Speech recognition for your site
Facebook AI Research's Automatic Speech Recognition Toolkit
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
Add a description, image, and links to the speech-recognition topic page so that developers can more easily learn about it.
To associate your repository with the speech-recognition topic, visit your repo's landing page and select "manage topics."