Transcribe and summarize youtube video using mlx

Clone repository and cd into the repository

macOS Installation Guide

Below is the installation process for macOS.

Setting Up the Environment

conda create -n video_summarize_mlx python=3.11
conda activate video_summarize_mlx
pip install -r requirements.txt
python -m spacy download en_core_web_sm
brew install ffmpeg

The default model is Mistral7b-4bit-mlx. If you want to change it, go to summarize_model.py and change this

# Load MLX model and tokenizer
model, tokenizer = load("mlx-community/Mistral-7B-Instruct-v0.2-4bit-mlx")

But that means you also have to adjust this as appropriate

MODEL_MAX_TOKENS = 8192  # Maximum tokens for prompt and response
WINDOW_SIZE = 4096  # Maximum tokens for the input

Run the whole workflow using

python main.py --input_path "/path/to/your/video" --title "My Video Title"

Remember the longer the video, the more time it might take to summarize. If you have any suggested changes, submit a pull request and we can adjust as appropriate.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
files		files
transcribe		transcribe
whisper		whisper
LICENSE		LICENSE
README.md		README.md
The Vision.mp4		The Vision.mp4
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Transcribe and summarize youtube video using mlx

macOS Installation Guide

Setting Up the Environment

About

Releases

Packages

Languages

License

Peter-obi/Video_summarization_mlx

Folders and files

Latest commit

History

Repository files navigation

Transcribe and summarize youtube video using mlx

macOS Installation Guide

Setting Up the Environment

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages