We used LanceDB to store a frame captured every thirty seconds, plus the title, of 13,000+ videos: 5 random videos from each top-level category of the YouTube 8M dataset. We then used the CLIP model to embed the frames and titles into a shared space. With LanceDB, we can perform embedding, keyword, and SQL search on these videos.
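To give a feel for the three query styles, here is a minimal sketch of running them against the table once it is downloaded (this is not the exact code in `main.py`; the `openai/clip-vit-base-patch32` checkpoint and the `vector` and `video_id` column names are assumptions, while the table name and the `text` column come from the dataset itself):

```python
import lancedb
import torch
from transformers import CLIPModel, CLIPTokenizerFast

MODEL_ID = "openai/clip-vit-base-patch32"  # assumption: CLIP variant used

tokenizer = CLIPTokenizerFast.from_pretrained(MODEL_ID)
model = CLIPModel.from_pretrained(MODEL_ID)

def embed_text(query: str) -> list[float]:
    """Embed a text query into the same space as the stored frames."""
    inputs = tokenizer(query, return_tensors="pt")
    with torch.no_grad():
        feats = model.get_text_features(**inputs)
    return feats.squeeze(0).tolist()

db = lancedb.connect("data/video-lancedb")
tbl = db.open_table("multimodal_video")

# 1. Embedding search: frames semantically close to a natural-language query.
frames = tbl.search(embed_text("a dog catching a frisbee")).limit(5).to_pandas()

# 2. Keyword search against the tantivy full-text index on the "text" column.
titles = tbl.search("frisbee", query_type="fts").limit(5).to_pandas()

# 3. Embedding search combined with a SQL filter.
filtered = (
    tbl.search(embed_text("cooking tutorial"))
    .where("video_id IS NOT NULL")  # hypothetical column; match the real schema
    .limit(5)
    .to_pandas()
)
```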
Download and extract the prepared dataset:

```bash
wget https://vectordb-recipes.s3.us-west-2.amazonaws.com/multimodal_video_lance.tar.gz
tar -xvf multimodal_video_lance.tar.gz
mkdir -p data/video-lancedb
mv multimodal_video.lance data/video-lancedb/
```
Then run the script:

```bash
python main.py
```
Here is how the `multimodal_video` dataset (the raw data) was generated:
- `downloadcategoryids.sh` - Uses the YouTube8M dataset to retrieve 5 video ids from each category
- `downloadvideos.py` - Uses youtube-dl to download the videos and take a screenshot every 30 seconds
- `insert.py` - Uses the CLIP embedding model to embed each screenshot and insert it into LanceDB (see the sketch after this list)
- `insert_titles.py` - Retrieves the video titles and embeds them into LanceDB as well; we also create a full-text search index over the titles using tantivy with `tbl.create_fts_index("text")`
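For reference, this is roughly what the insert step does, condensed into one sketch (the `openai/clip-vit-base-patch32` checkpoint, the `screenshots/` directory, and the exact row schema are assumptions; see `insert.py` and `insert_titles.py` for the real code):

```python
from pathlib import Path

import lancedb
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

MODEL_ID = "openai/clip-vit-base-patch32"  # assumption: CLIP variant used

model = CLIPModel.from_pretrained(MODEL_ID)
processor = CLIPProcessor.from_pretrained(MODEL_ID)

def embed_image(path: Path) -> list[float]:
    """Embed one screenshot with CLIP's image tower."""
    inputs = processor(images=Image.open(path), return_tensors="pt")
    with torch.no_grad():
        feats = model.get_image_features(**inputs)
    return feats.squeeze(0).tolist()

db = lancedb.connect("data/video-lancedb")

# One row per screenshot; "text" is searchable via the full-text index.
rows = [
    {
        "vector": embed_image(p),
        "text": p.stem,  # stand-in for the video title
        "frame_path": str(p),
    }
    for p in Path("screenshots").glob("*.jpg")  # hypothetical directory layout
]
tbl = db.create_table("multimodal_video", data=rows, mode="overwrite")

# Build the tantivy-backed full-text index, as insert_titles.py does.
tbl.create_fts_index("text")
```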
The dataset is available in our S3 bucket: https://vectordb-recipes.s3.us-west-2.amazonaws.com/multimodal_video_lance.tar.gz