This is the official implementation code of our CVPR 2024 paper.
[Paper] [Project page]
Create the environment by running the following commands:
git clone https://github.com/Wenchao-M/SAGE.git
cd SAGE
# create a conda environment named SAGE
conda create -n SAGE python=3.8
# activate the environment
conda activate SAGE
# You can modify the version of PyTorch or CUDA depending on your situation.
pip install torch==1.13.1+cu117 torchvision==0.14.1+cu117 torchaudio==0.13.1 --extra-index-url https://download.pytorch.org/whl/cu117
# install the other Python dependencies
pip install -r requirements.txt
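To verify the installation, a quick check that the expected PyTorch build is active and that CUDA is visible (the version comment below assumes the pip command above; any CUDA-enabled build should also work):

# Quick sanity check: prints the installed PyTorch version and whether CUDA is visible.
import torch

print(torch.__version__)          # expected: 1.13.1+cu117 with the command above
print(torch.cuda.is_available())  # True on a machine with a working CUDA setup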
- Before using SAGE, you must register at SMPL (https://smpl.is.tue.mpg.de/index.html), SMPL-X (https://smpl-x.is.tue.mpg.de/index.html), and AMASS (https://amass.is.tue.mpg.de/index.html), and agree to their licenses.
- After registration, you can download the SMPL+H and DMPL body models from here and unzip them in your project directory; it should look like this:
SAGENet
├─ body_models/
├─── dmpls/
├───── female/
├───── male/
├───── neutral/
├─── smplh/
├───── female/
├───── male/
└───── neutral/
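As a quick sanity check of the download, here is a minimal sketch of loading the neutral SMPL+H model with the human_body_prior package (one of the repo's acknowledged dependencies). The file name model.npz follows the standard AMASS layout and is an assumption; adjust the paths to match your download.

# Minimal sketch: load the neutral SMPL+H body model with DMPL components.
# Assumes human_body_prior is installed and the files follow the standard
# AMASS naming (body_models/smplh/neutral/model.npz etc.).
from human_body_prior.body_model.body_model import BodyModel

bm = BodyModel(
    bm_fname="body_models/smplh/neutral/model.npz",
    dmpl_fname="body_models/dmpls/neutral/model.npz",
    num_betas=16,
    num_dmpls=8,
)
print(bm)  # prints the module; its forward() returns posed vertices and joints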
- Download the datasets from AMASS. In this repo, ACCAD, BMLmovi, BMLrub, CMU, EKUT, Eyes Japan Dataset, HDM05, HumanEva, KIT, MoSh, PosePrior, SFU, TotalCapture, and Transitions are needed. (NOTE: only the SMPL+H G downloads are needed.)
- Unzip them into a directory.
- Run prepare_data.py:
# before running, please replace raw_data_dir with the dataset dir you downloaded before,
# and replace path_you_want_to_save with the dir where you want to save the processed dataset.
python prepare_data.py --support_dir body_models --root_dir /raw_data_dir --save_dir /path_you_want_to_save
- If you save the processed dataset in another dir, you can use a symlink:
ln -s /path_you_want_to_save dataset
- The generated dataset should look like this:
SAGENet
├─dataset/
├─── ACCAD/
├───── 1.pt
├───── 2.pt
├───── ...
├─── BMLmovi/
├─── BioMotionLab_NTroje_test/
├─── BioMotionLab_NTroje_train/
├─── CMU_test/
├─── CMU_train/
├─── EKUT/
├─── Eyes_Japan_Dataset/
├─── HumanEva/
├─── KIT/
├─── MPI_HDM05_test/
├─── MPI_HDM05_train/
├─── MPI_Limits/
├─── MPI_mosh/
├─── SFU/
├─── TotalCapture/
└─── Transitions_mocap/
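To check that preprocessing succeeded, you can inspect one of the generated .pt files. The exact keys stored by prepare_data.py are not documented here, so the snippet below simply dumps whatever it finds.

# Inspect a processed sample; key names depend on prepare_data.py, so this
# only prints whatever is stored in the file.
import torch

sample = torch.load("dataset/ACCAD/1.pt", map_location="cpu")
if isinstance(sample, dict):
    for key, value in sample.items():
        print(key, getattr(value, "shape", type(value)))
else:
    print(type(sample))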
- Before continuing, please remember to enable developer mode on your Quest.
- We provide an app to capture real motion sequences on Quest 2. You can find it at realdemo/UnityProject/First3D/SAGENet.apk; download and install it on your Quest 2 device. This is what the program looks like when it runs:
- This toy app starts recording data as soon as it is opened and stops recording when it is exited.
- After recording, you can find the data at Android/data/SAGENet/*****.txt
- We also open-source the code of this toy app at real_demo/UnityProject/First3D.
- After getting the data ****.txt, you can use process_quan.py to transform the data into the coordinate system that the model can process (a rough sketch of this kind of transform is shown after this section).
- If you want to use our model in the real demo, please download the weights from here.
- Run
python inference_realdemo.py
to get the results. The results are saved as pickle files, e.g., dataset-76610.pkl.
- After getting the results, you can visualize them in Blender using smpl_animation_blender.py.
- Notice that before using it, you must agree to the licenses of SMPL and SMPL-X.
- To use smpl_animation_blender.py, you need to download the SMPL Blender add-on from here.
- If you want the SMPL model to have a texture, please download the texture file from here.
- Your project should look like this:
├─ data/
├─── smpl-model-20200803.blend
├─── smpl_joint_regressor_male.npz
├─── smpl_joint_regressor_female.npz
├─── f_01_alb.002.png
├─── m_01_alb.002.png
└─ smpl_animation_blender.py
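The sketch below illustrates the kind of conversion process_quan.py has to perform: Unity (and hence the Quest recordings) uses a left-handed, y-up frame, while SMPL-style models expect a right-handed one. Mirroring the x axis is one common way to flip handedness; the actual axes and conventions used by the script may differ, so treat this purely as an illustration.

# Illustrative handedness flip (mirror the x axis): positions negate x,
# and quaternions (x, y, z, w) negate the two non-mirrored axis components.
# The real process_quan.py may use a different convention.
import numpy as np

def unity_to_right_handed(pos_xyz, quat_xyzw):
    x, y, z = pos_xyz
    qx, qy, qz, qw = quat_xyzw
    pos_rh = np.array([-x, y, z])
    quat_rh = np.array([qx, -qy, -qz, qw])
    return pos_rh, quat_rh

# example: identity rotation at (1, 2, 3) maps to (-1, 2, 3) with an unchanged rotation
print(unity_to_right_handed((1.0, 2.0, 3.0), (0.0, 0.0, 0.0, 1.0)))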
You can also download the weights under Setting 1 from here, the weights under Setting 2 from here, and the weights under Setting 3 from here. Unzip the weights into the outputs dir; it should look like this:
SAGENet
├─ outputs/
├─── upper_vqvae/
├───── best.pth.tar
├─── lower_vqvae/
├───── best.pth.tar
├─── decoder/
├───── best.pth.tar
├─── refiner/
└───── best.pth.tar
Then run the inference command:
python test_refiner.py --cfg config_decoder/refiner.yaml
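To confirm the downloaded checkpoints are intact, you can load one directly. The internal layout of the checkpoint dict (state_dict, optimizer state, etc.) is repo-specific and an assumption here, so the snippet only lists the top-level keys.

# Peek at a downloaded checkpoint without running the full test script.
import torch

ckpt = torch.load("outputs/refiner/best.pth.tar", map_location="cpu")
print(type(ckpt))
if isinstance(ckpt, dict):
    print(list(ckpt.keys()))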
If you want to retrain the model, please follow these commands sequentially.
You can train only specific parts of the model. For example, you can choose to skip retraining the VQVAE stage and focus solely on training the diffusion stage and its subsequent stages.
# NOTE: In the VQVAE stage, training for the upper body and lower body can be conducted simultaneously.
CUDA_VISIBLE_DEVICES=0 python train_vqvae.py --cfg config_vqvae/upper_vqvae.yaml
CUDA_VISIBLE_DEVICES=0 python train_vqvae.py --cfg config_vqvae/lower_vqvae.yaml
# NOTE: In the diffusion stage, the lower-body diffusion model builds on the upper-body diffusion model,
# so the following commands must be executed sequentially.
# train the upper body diffusion model
CUDA_VISIBLE_DEVICES=0 python train_first.py --cfg config_diffusion/first.yaml
# train the lower body diffusion model
CUDA_VISIBLE_DEVICES=0 python train_second.py --cfg config_diffusion/second.yaml
# freeze the upper and lower diffusion models, then train the decoder model
CUDA_VISIBLE_DEVICES=0 python train_decoder.py --cfg config_decoder/decoder.yaml
# (optional) train an RNN to smooth the results
CUDA_VISIBLE_DEVICES=0 python train_refiner.py --cfg config_decoder/refiner.yaml
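For reference, the "freeze" step in the decoder stage boils down to disabling gradients on the already-trained diffusion networks. A minimal sketch follows; the variable and function names are illustrative, not the repo's actual ones.

# Minimal sketch of freezing a trained sub-model before the decoder stage:
# gradients are disabled and the module is switched to eval mode.
import torch.nn as nn

def freeze(module: nn.Module) -> nn.Module:
    for p in module.parameters():
        p.requires_grad_(False)
    return module.eval()

# e.g. upper_diffusion = freeze(upper_diffusion)
#      lower_diffusion = freeze(lower_diffusion)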
Contact: wmm5390@psu.edu · whuerfff@whu.edu.cn
This project is built on source code shared by body_visualizer, human_body_prior, AvatarPoser, AGRoL, and BoDiffusion. We thank the authors for their great work!