README AND INSTRUCTIONS ARE CURRENTLY OUTDATED

Start with the command: pip install --upgrade -r OpenAIYouTubeTranscriber/requirements.txt or python -m pip install --upgrade -r OpenAIYouTubeTranscriber/requirements.txt to install dependencies, then python OpenAIYoutubeTranscriber.py to run this script.

OpenAI YouTube Transcriber

A powerful and intuitive automation multi-tool, primarily designed to extract audio from YouTube videos, transcribe it into text, detect the language, and save the transcription as a .txt file. This core feature is complemented by many other functionalities, making it an essential tool for streamlining your workflow with cutting-edge technology.

Project Overview
Developer

Quick setup for contributors and maintainers:
1. Install runtime dependencies:
```
make deps
```
1. Install developer tools (linters, formatter, test runner):
```
make dev
# or
pip install -r requirements-dev.txt
```
Useful make targets:
- make install — install package in editable mode
- make deps — install runtime deps
- make dev — install development deps
- make lint — run flake8
- make format — run black
- make run — run the script
Add or update dev tooling in requirements-dev.txt as needed.

Description

This script automates the transcription of YouTube videos into text format, eliminating the need for manual transcription. With an intuitive interface, users simply input a YouTube video URL, and the script processes the audio, transcribes it, detects the language, and saves the result in a .txt file. Perfect for quick, accurate transcriptions for research, content creation, or accessibility purposes.

Key Features

User-Friendly Interface: Easy to use—simply input the YouTube video URL to start the transcription process.
Efficient Audio Extraction: Uses the pytubefix library to reliably download the audio stream from YouTube videos.
High-Quality Transcription: Powered by the whisper library, offering accurate, state-of-the-art speech-to-text capabilities.
Convenient Output: Automatically saves the transcription in a .txt file for easy access and sharing.

Prerequisites

Python 3.6+
pip (Python Package Installer)

Required Libraries

pytubefix: A robust Python library for downloading YouTube videos and extracting audio. pytubefix resolves occasional issues in pytube, where certain regex expressions in cipher.py may occasionally fail.
whisper: OpenAI’s advanced speech-to-text model, known for its high accuracy and reliability in transcription.
langdetect: A powerful language detection library based on Google's language-detection algorithm.

Installation

Clone or download the repository.

Install the required libraries via pip:

pip install pytubefix
pip install git+https://github.com/openai/whisper.git
pip install langdetect

Install FFmpeg (necessary for audio processing):

Windows: If Scoop is not installed, run PowerShell as administrator and execute:

Set-ExecutionPolicy -ExecutionPolicy RemoteSigned -Scope CurrentUser
Invoke-RestMethod -Uri https://get.scoop.sh | Invoke-Expression
scoop install ffmpeg

Mac:
```
brew install ffmpeg
```

Ubuntu:

sudo apt update && sudo apt install ffmpeg

Usage

Run the script by executing WhisperYouTubeMultiTool.py:
```
python WhisperYouTubeMultiTool.py
```

Input the YouTube video URL when prompted:

Enter the YouTube video URL: https://www.youtube.com/watch?v=XXXXXXXXXXX

Example:

Enter the YouTube video URL: https://www.youtube.com/watch?v=jNQXAC9IVRw

The script will:
- Download the audio,
- Transcribe the audio to text,
- Detect the language,
- Save the transcription in a file named Transcript_{language}.txt.
Access the transcription file in the same directory as the script.

Workflow

The user provides the YouTube video URL.
pytubefix downloads the audio and saves it as an .mp3 file.
whisper transcribes the audio into text.
langdetect identifies the transcription language.
The transcription is saved as Transcript_{language}.txt, ready for review.

Known Issues

Punctuation Errors: Occasionally, punctuation may be missing or incorrect in some parts of the transcription. Manual editing or using tools like ChatGPT can help resolve these.
Transcription Accuracy: On rare occasions, the script might misinterpret words or produce spelling errors. These issues can be easily corrected using a text editor or ChatGPT.