Explore YouTube ASL

Messing around with YouTube-ASL.

Quickstart

Install requirements with conda

conda create -n explore-youtube-asl pip
conda activate explore-youtube-asl
python -m pip --version # should show the pip inside your env
pip install -r requirements.txt

Download the list of YouTube video IDs released by YouTube-ASL

python download_ids.py # should create a file called 'youtube_asl_video_ids.txt'

Random Video Viewer with a server

Start up the random video viewer. Uses flask to host a simple web server, and embed a grid of YouTube videos

python youtube-asl-viewer.py

Generate a .html you can just open in a browser

Do you want to just hardcode, say, 250 videos into a .html file, and then send that to someone who doesn't have Python or Flask, and then they can just open it in their browser? You're in luck! You can! Just run the following, and it will take youtube_asl_video_ids.txt, pull out the number of IDs you specified, and generate a .html with them hardcoded inside. Then when someone clicks on it, it'll open in their browser and just display 6 our of those 250.

# make HTML with 250 hardcoded ids, of which a few are displayed at random every time you reload the page
python create_static_html.py 250

# same thing, with 11000 videos (youtube_asl_video_ids.txt only has 11,096). Resulting html is 155 kb or so!
python create_static_html.py 11000

The resulting file is "yt_asl_static_demo.html"

Download videos. This script creates a folder called "downloads" and puts videos, subtitles, and audio tracks into it.

python download_vids.py --dataset_folder "downloads"

Features:

Implemented:

Randomly select and embed Youtube-ASL videos.
Display a grid of random videos
Download videos, and stats about them
Script to generate a static .html you can just click on, no Python needed.

Ideas:

go through the list and see which ones have audio tracks in English
download the audio tracks and run language ID, e.g. with https://huggingface.co/speechbrain/lang-id-voxlingua107-ecapa or Open Whisper: https://huggingface.co/espnet/owsm_v3
Load it into a fiftyone dataset, e.g. like in https://github.com/voxel51/fiftyone-examples/blob/master/examples/Video%20Labels.ipynb, which loads in WLASL data
Extend to YT-SL25

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
count_occurrences.sh		count_occurrences.sh
create_static_html.py		create_static_html.py
download_ids.py		download_ids.py
download_ids.sh		download_ids.sh
download_vids.py		download_vids.py
example_embed.html		example_embed.html
requirements.txt		requirements.txt
youtube-asl-viewer.py		youtube-asl-viewer.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Explore YouTube ASL

Quickstart

Features:

Ideas:

Related

About

Releases

Packages

Languages

License

cleong110/explore-youtube-asl

Folders and files

Latest commit

History

Repository files navigation

Explore YouTube ASL

Quickstart

Features:

Ideas:

Related

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages