This project enables the local installation and use of WhisperX, an advanced audio transcription system based on OpenAI's Whisper but optimized for running on local hardware with or without a GPU. This project is made possible thanks to Whisperx and Faster Whisper. This document provides a general overview of the installation and links to the website where the complete installation and usage procedure for this project can be found.
- Miniconda
- CUDA (only for NVIDIA GPU users)
environment-cuda.yml
: Configuration file for automatic installation on systems with NVIDIA GPU.environment-cpu.yml
: Configuration file for automatic installation on systems without NVIDIA GPU.app.py
: Script to run the WhisperX user interface on Gradio.setup_environment_cuda.bat
: Batch script for easy installation on Windows with NVIDIA GPU (requires CUDA 12.1 or higher).setup_environment_cpu.bat
: Batch script for easy installation on Windows without NVIDIA GPU.run_script.bat
: Batch script to run the application after installation.
If you have an NVIDIA GPU and CUDA 12.1 (or higher) installed, you can use the setup_environment_cuda.bat
script for an easy installation:
- Download the repository.
- Double-click the
setup_environment_cuda.bat
file. - The script will:
- Create a virtual environment.
- Install all necessary dependencies, including GPU-optimized versions of
torch
andtorchaudio
.
- Once the installation is complete, you can run the application by double-clicking the
run_script.bat
file.
If you do not have an NVIDIA GPU, use the setup_environment_cpu.bat
script:
- Download the repository.
- Double-click the
setup_environment_cpu.bat
file. - The script will:
- Create a virtual environment.
- Install all necessary dependencies, including CPU-only versions of
torch
andtorchaudio
.
- Once the installation is complete, you can run the application by double-clicking the
run_script.bat
file.
- Miniconda: Miniconda Installation
- CUDA: CUDA Installation (Only for NVIDIA GPU users)
- GitHub Repository: Download and setup of the repository. See details on our website.
For a detailed step-by-step installation process, please visit our website: View complete installation process
To use WhisperX after completing the installation:
-
For Windows Users:
- Simply double-click the
run_script.bat
file. This will:- Activate the virtual environment.
- Run the Gradio user interface (
app.py
).
- Simply double-click the
-
For Other Platforms:
- Activate the corresponding Conda environment.
- Run
python app.py
to start the Gradio user interface.
-
Modified default model selection:
- For CUDA-enabled devices, changed default model from "Large-v2" to "Medium"
- For CPU devices, kept default model as "Medium"
-
Improved compute type selection:
- For CPU devices, now uses "int8" compute type instead of "float32"
- For CUDA devices, kept "float16" compute type
-
Performance: These changes aim to balance performance and resource usage across different hardware configurations