Skip to content

Latest commit

 

History

History
28 lines (15 loc) · 687 Bytes

README.md

File metadata and controls

28 lines (15 loc) · 687 Bytes

Video Whisper

An experimental set of colab notebooks for video semantic segmentation.

Colab Notebook Link

The set of steps implemented:

  • text exctraction (transcription) from the youtube video with the latest ASR model: 'openai/whisper-large-v3'
  • text topic modeling with the BERTopic and HDBSCAN clustering
  • video scene segmentation with PySceneDetect

Use Google Colab Notebook with GPU.

Video timeline visualization with website text:

video timeline


Workflow diagram: