Skip to content

We present a comprehensive pipeline for processing Kannada audio, leveraging state-of-the-art pre-trained models from Hugging Face. The pipeline is designed to handle multiple stages of audio processing, including denoising, speech segmentation, automatic speech recognition (ASR), punctuation, transliteration, translation, and grammar checking.

Notifications You must be signed in to change notification settings

bcsamrudh/AI-Assisted-Kannada-Transcription-and-Translation

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

46 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

AI-Assisted-Kannada-Transcription-and-Translation

We present a comprehensive pipeline for processing Kannada audio, leveraging state-of-the-art pre-trained models from Hugging Face. The pipeline is designed to handle multiple stages of audio processing, including denoising, speech segmentation, automatic speech recognition (ASR), punctuation, transliteration, translation, and grammar checking. The novelty of our approach lies in the seamless integration of these models into a user-friendly interface, allowing users to upload or record audio and receive outputs at each stage with the option to edit and classify the results.

Pipeline

Pipeline

About

We present a comprehensive pipeline for processing Kannada audio, leveraging state-of-the-art pre-trained models from Hugging Face. The pipeline is designed to handle multiple stages of audio processing, including denoising, speech segmentation, automatic speech recognition (ASR), punctuation, transliteration, translation, and grammar checking.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 89.9%
  • HTML 9.1%
  • Shell 1.0%