SingingHead: A Large-scale 4D Dataset for Singing Head Animation

arXiv | Project Page | Dataset

TODO

Release the codes for calculating the metrics of two benchmarks.
Release the scripts for visualizing the 3D facial motion.
Release the SingingHead dataset.

SingingHead Dataset

Download

The dataset can be downloaded from Hugging Face.

If you are unable to download from Hugging Face, please first fill out the required information on Hugging Face to obtain authorization, and then contact us (wusijing@sjtu.edu.cn) using the same email address to get the download link of Baidu (百度网盘).

Please note that by requesting the dataset, you confirm that you have read, understood, and agree to be bound by the terms of the agreement.

Agreement

The SingingHead dataset is available for non-commercial research purposes only.
You agree not to reproduce, modified, duplicate, copy, sell, trade, resell or exploit any portion of the images and any portion of the derived data for commercial purposes.
You agree not to further copy, publish or distribute any portion of the SingingHead dataset to any third party for any purpose. Except, for internal use at a single site within the same organization it is allowed to make copies of the dataset.
Shanghai Jiao Tong University reserves the right to terminate your access to the SingingHead dataset at any time.

Overview

The SingingHead dataset is a large-scale 4D dataset for singing head animation. It contains more than 27 hours of synchronized singing video, 3D facial motion, singing audio, and background music collected from 76 subjects. The video is captured in 30fps and cropped into a resolution of 1024×1024. The 3D facial motion is represented by 59-dimensional FLAME parameters (50 expression + 3 neck pose + 3 jaw pose). All the data sequences are cut into equal-length 8s segments, resulting in a total of 12196 sequences.

Data Structure

SingingHead
├── train.txt
├── val.txt
├── test.txt
├── video_seqs.zip
│   ├── id0_10_0_0.mp4
│   └── ...
├── flame_seqs.zip
│   ├── id0_10_0_0.pkl
│   └── ...
├── audio_seqs.zip
│   ├── id0_10_0_0.wav
│   └── ...
└── bgm_seqs.zip
    ├── id0_10_0_0_bgm.wav
    └── ...

Citation

If you use this dataset, please consider citing

@article{wu2023singinghead,
  title={SingingHead: A Large-scale 4D Dataset for Singing Head Animation},
  author={Wu, Sijing and Li, Yunhao and Zhang, Weitian and Jia, Jun and Zhu, Yucheng and Yan, Yichao and Zhai, Guangtao},
  journal={arXiv preprint arXiv:2312.04369},
  year={2023}
}

Contact

Sijing Wu (wusijing@sjtu.edu.cn)

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
assets		assets
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SingingHead: A Large-scale 4D Dataset for Singing Head Animation

arXiv | Project Page | Dataset

TODO

SingingHead Dataset

Download

Overview

Data Structure

Citation

Contact

About

Releases

Packages

wsj-sjtu/SingingHead

Folders and files

Latest commit

History

Repository files navigation

SingingHead: A Large-scale 4D Dataset for Singing Head Animation

arXiv | Project Page | Dataset

TODO

SingingHead Dataset

Download

Overview

Data Structure

Citation

Contact

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages