AVSnap

This project has migrated to a new location: https://github.com/tavihalperin/AV-sync

This repository contains demo code for the paper "Dynamic Temporal Alignment of Speech to Lips". It is a fork of the demo for the audio-to-video synchronisation network (SyncNet).

The model and code can be used for research purposes under the Creative Commons Attribution License. Please cite the papers below if you make use of the software.

Prerequisites

The following packages are required:

pytorch (0.4.0)
numpy (1.14.3)
scipy (1.0.1)
python_speech_features (0.6)
cuda (8.0)
ffmpeg (3.4.2)
tensorflow (1.2, 1.4)
scikit-image (0.14.1)
imageio (2.2.0)

The demo has been tested with the package versions shown above, but may also work on other versions.
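
As a quick sanity check before running the demo, the Python packages listed above can be compared against what is installed. The following is a minimal sketch and is not part of the repository: the pip distribution names (for example torch for pytorch) and all function names are assumptions, and importlib.metadata requires Python 3.8 or newer, which may differ from the environment the demo was tested in. cuda and ffmpeg are not Python packages and are not covered.

```python
from importlib import metadata

# Versions the demo was tested with, taken from the list above.
# The keys are ASSUMED pip distribution names, not taken from the repository.
TESTED_VERSIONS = {
    "torch": ("0.4.0",),
    "numpy": ("1.14.3",),
    "scipy": ("1.0.1",),
    "python_speech_features": ("0.6",),
    "tensorflow": ("1.2", "1.4"),
    "scikit-image": ("0.14",),  # matched on the 0.14 series
    "imageio": ("2.2.0",),
}


def matches(installed, tested):
    """True if `installed` equals or extends a tested version (1.4.1 matches 1.4)."""
    return any(installed == t or installed.startswith(t + ".") for t in tested)


def report(found_versions):
    """Map each package name to 'ok', 'missing', or 'differs (found X)'."""
    result = {}
    for name, tested in TESTED_VERSIONS.items():
        found = found_versions.get(name)
        if found is None:
            result[name] = "missing"
        elif matches(found, tested):
            result[name] = "ok"
        else:
            result[name] = "differs (found %s)" % found
    return result


def installed_versions():
    """Look up installed versions; packages that are not installed are omitted."""
    versions = {}
    for name in TESTED_VERSIONS:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            pass
    return versions


if __name__ == "__main__":
    for name, status in sorted(report(installed_versions()).items()):
        print("%-24s %s" % (name, status))
```

A "differs" result is not necessarily a problem, since the demo may also work on other versions.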

Demo

First, download the face detection and SyncNet models:

sh download_model.sh

To run the demo, aligning the audio from video2.avi to the video from video1.avi:

python run_pipeline.py data/video1.avi
python run_pipeline.py data/video2.avi
python align_audio.py data/video1.avi data/video2.avi --modality=all

With --modality=all, the alignment is computed from both the video and the audio of the input videos. To use only one modality from each input, set the --modality flag to one of va, aa, av, or vv. The first letter indicates which modality is used from the first input, and the second letter indicates which is used from the second input.
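
To make the flag values concrete, the sketch below decodes a --modality value into the modality taken from each input. It is illustrative only and is not the parsing code in align_audio.py: the function name decode_modality is hypothetical, and reading v as video and a as audio is an assumption based on the description above.

```python
# ASSUMED meaning of each letter in a two-letter --modality value.
LETTERS = {"v": "video", "a": "audio"}


def decode_modality(flag):
    """Return (modalities used from the first input, modalities used from the second)."""
    if flag == "all":
        both = ("video", "audio")
        return both, both
    if len(flag) != 2 or any(ch not in LETTERS for ch in flag):
        raise ValueError("expected 'all' or one of va/aa/av/vv, got %r" % flag)
    return (LETTERS[flag[0]],), (LETTERS[flag[1]],)


print(decode_modality("va"))  # (('video',), ('audio',))
```

Under this reading, va would compare the video of the first input with the audio of the second.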

Outputs:

data/out/video1_video2/modality_all.mp4
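
The output location appears to follow the pattern data/out/<first>_<second>/modality_<flag>.mp4. The sketch below reproduces that pattern from the single example above. It is an inference rather than the repository's code, and the function name expected_output_path and the out_root parameter are hypothetical.

```python
import posixpath


def expected_output_path(first_video, second_video, modality="all", out_root="data/out"):
    """Guess the output path from the two input paths, following the example above."""
    def stem(path):
        # File name without directory or extension: data/video1.avi -> video1
        return posixpath.splitext(posixpath.basename(path))[0]

    pair = "%s_%s" % (stem(first_video), stem(second_video))
    return posixpath.join(out_root, pair, "modality_%s.mp4" % modality)


print(expected_output_path("data/video1.avi", "data/video2.avi"))
# data/out/video1_video2/modality_all.mp4
```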

Publications

@inproceedings{halperin19dynamic,
  title     =   {Dynamic Temporal Alignment of Speech to Lips},
  author    =   {Halperin, Tavi and Ephrat, Ariel and Peleg, Shmuel},
  booktitle =   {2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
  year      =   {2019},
}

SyncNet is taken from:

@InProceedings{Chung16a,
  author       = "Chung, J.~S. and Zisserman, A.",
  title        = "Out of time: automated lip sync in the wild",
  booktitle    = "Workshop on Multi-view Lip-reading, ACCV",
  year         = "2016",
}
