Xircuits Audio Transcription Components

A Xircuits component library for transcribing audio into text with speaker diarization. This library provides components for:

Loading and processing audio files
Performing speaker diarization (identifying who spoke when)
Transcribing speech to text
Combining diarization and transcription results
Saving formatted transcripts

Prerequisites

This component library requires:

Access to the Hugging Face Hub models:
- You need to accept the terms of use for the pyannote models at:
  - https://huggingface.co/pyannote/speaker-diarization
  - https://huggingface.co/pyannote/segmentation
- A Hugging Face access token for the diarization models
Sufficient disk space for the downloaded models (approximately 1-2GB)

Installation

To use this component library, ensure you have Xircuits installed, then simply run:

xircuits install https://github.com/xpressai/xai-transcribe

Alternatively you may manually copy the directory / clone or submodule the repository to your working Xircuits project directory then install the packages using:

pip install -r requirements.txt

Usage

The library provides components for a complete audio transcription pipeline:

TranscribeLoadAudioFile - Load an audio file or use a sample dataset
TranscribeSpeakerDiarization - Identify different speakers in the audio
TranscribeSpeechTranscription - Transcribe the audio to text with timestamps
TranscribeCombineDiarizationAndTranscription - Combine speaker information with transcription
TranscribeSaveTranscriptToFile - Save the formatted transcript to a file

Example

Create a new Xircuits workflow and add the components in sequence:

Start with TranscribeLoadAudioFile and provide a path to your audio file
Connect to TranscribeSpeakerDiarization (set use_auth_token to True if using Hugging Face models)
Add TranscribeSpeechTranscription (defaults to Whisper base model)
Connect both to TranscribeCombineDiarizationAndTranscription
Finally connect to TranscribeSaveTranscriptToFile to save the results

Tests

A github action to test your workflow runs has been provided. Simply add the path of your workflows here.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.github		.github
examples		examples
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
transcribe_components.py		transcribe_components.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Xircuits Audio Transcription Components

Prerequisites

Installation

Usage

Example

Tests

About

Releases

Packages

Languages

License

XpressAI/xai-transcribe

Folders and files

Latest commit

History

Repository files navigation

Xircuits Audio Transcription Components

Prerequisites

Installation

Usage

Example

Tests

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages