Speech-to-Text Transcription Script

Project Speech to Text Conversion

a. create helper function text to speech file under utils file b. create utils file under utils folder c. Start writing code under app file and copy paste tempplates file ( index.html)

Project Speech to Text Conversion ( Added new features to store output in txt file)

a. create helper function text_to_speech_main file under utils file

Speech-to-Text Transcription Script

This is a Python script that transcribes audio files to text using Google's speech recognition API. The script can handle audio files in WAV, MP3, M4A, OGG, or FLAC format.

Dependencies

The following libraries are required to run this script:

SpeechRecognition
Pydub

You can install these libraries using pip:

pip install SpeechRecognition pydub

Usage

To use the script, simply run main.py and follow the prompts:

python main.py

-clone https://github.com/iamsamkhan/audio-to-text.git

The script will ask you to enter the path to the input audio file, the path to the output file, and the language code for the audio file.

How It Works

The script first converts the input audio file to WAV format if necessary using the Pydub library. It then transcribes the audio data to text using the SpeechRecognition library and the Google speech recognition API. Finally, it writes the transcribed text to the output file.

Why Use This Script?

This script can be useful for anyone who needs to transcribe audio files to text, such as researchers, journalists, and content creators. It provides a simple and efficient way to transcribe audio data, with support for multiple audio formats and languages.

Supported Languages

Language	Code
English (US)	en-US
English (UK)	en-GB
French	fr-FR
German	de-DE
Spanish	es-ES
Italian	it-IT
Japanese	ja-JP
Korean	ko-KR
Mandarin Chinese	zh-CN
Russian	ru-RU

Note that this is not an exhaustive list of supported languages. For a full list of supported languages and their corresponding codes, see the SpeechRecognition documentation.

License

This project is licensed under the [MIT License].

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
audios		audios
dummy		dummy
static		static
templates		templates
uploads		uploads
utils		utils
IELTS-16-test-1-section-1.txt		IELTS-16-test-1-section-1.txt
README.md		README.md
Untitled.ipynb		Untitled.ipynb
app.py		app.py
demo.ipynb		demo.ipynb
examples_english_english.wav		examples_english_english.wav
main.py		main.py
mp3.wav.py		mp3.wav.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project Speech to Text Conversion

Speech-to-Text Transcription Script

Dependencies

Usage

How It Works

Why Use This Script?

Supported Languages

License

About

Releases

Packages

Languages

llmmodels/audio-to-text

Folders and files

Latest commit

History

Repository files navigation

Project Speech to Text Conversion

Speech-to-Text Transcription Script

Dependencies

Usage

How It Works

Why Use This Script?

Supported Languages

License

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages