mau_local_stt

A Maubot to transcribe audio messages in matrix rooms using local open-source libraries

Installation

FFmpeg must be in $PATH
Activate the maubot virtual environment (source ./bin/activate), and run
- pip install whispercpp numpy - if you want to use whisper as the backend.
- pip install vosk - if you want to use vosk as the backend.
Download maulocalstt from the releases (or download the repository and build with mbc build), and upload it to maubot.
Download a model for your backend:
- For wisper, download a model from https://huggingface.co/ggerganov/whisper.cpp and place it under models/whispercpp
- For vosk, download a zipped model from https://alphacephei.com/vosk/models and unpack it into models/vosk
Create an instance of the bot, and update the configuration:
- For whisper, specify
  - model_name - the name of the model you downloaded (the name of the file without the ggml- and .bin)
  - language - the language the audio will be in (you can set it to auto for whisper to auto-detect the language)
  - translate - if you want wisper to translate the transcription to english (true or false)
- For vosk, specify
  - model_path - the path to the top directory of the model you downloaded (the one with the folders am conf graph etc.), either absolute or related to maubot's working directory.

Simply invite the bot to a room, and it will reply to all audio messages with their transcription

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
maulocalstt		maulocalstt
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
base-config.yaml		base-config.yaml
maubot.yaml		maubot.yaml