PF-GRT, a Pose-Free framework for the Generalizable Rendering Transformer, eliminates the need for pre-computed camera poses and renders novel views of unseen scenes in a single feed-forward pass.
- Novel-view synthesis on custom scenes.
- Training and evaluation code for PF-GRT is released.
1. Download the training and evaluation datasets, following IBRNet's data preparation:
cd data/
# IBRNet captures
gdown https://drive.google.com/uc?id=1rkzl3ecL3H0Xxf5WTyc2Swv30RIyr1R_
unzip ibrnet_collected.zip
# LLFF
gdown https://drive.google.com/uc?id=1ThgjloNt58ZdnEuiCeRf9tATJ-HI0b01
unzip real_iconic_noface.zip
## [IMPORTANT] remove scenes that appear in the test set
cd real_iconic_noface/
rm -rf data2_fernvlsb data2_hugetrike data2_trexsanta data3_orchid data5_leafscene data5_lotr data5_redflower
cd ../
# Spaces dataset
git clone https://github.com/augmentedperception/spaces_dataset
# RealEstate 10k
## make sure to install ffmpeg - sudo apt-get install ffmpeg
git clone https://github.com/qianqianwang68/RealEstate10K_Downloader
cd RealEstate10K_Downloader
python3 generate_dataset.py train
cd ../
# Google Scanned Objects
gdown https://drive.google.com/uc?id=1w1Cs0yztH6kE3JIz7mdggvPGCwIKkVi2
unzip google_scanned_objects_renderings.zip
# Blender dataset
gdown https://drive.google.com/uc?id=18JxhpWD-4ZmuFKLzKlAw-w5PpzZxXOcG
unzip nerf_synthetic.zip
# LLFF dataset (eval)
gdown https://drive.google.com/uc?id=16VnMcF1KJYxN9QId6TClMsZRahHNMW5g
unzip nerf_llff_data.zip
The resulting directory structure should look like this:
${ROOT}
├── 📂 data/
│   ├── 📂 ibrnet_collected_1/
│   │   ├── 📂 ...
│   │   └── 📜 ...
│   ├── 📂 ibrnet_collected_2/
│   ├── 📂 real_iconic_noface/
│   ├── 📂 spaces_dataset/
│   ├── 📂 RealEstate10K-subset/
│   ├── 📂 google_scanned_objects/
│   ├── 📂 nerf_synthetic/
│   └── 📂 nerf_llff_data/
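As a quick sanity check (a minimal sketch; the folder names are taken from the tree above), you can verify that every expected dataset directory is in place:
# Run from ${ROOT}; prints MISSING for any dataset folder that is not found
for d in ibrnet_collected_1 ibrnet_collected_2 real_iconic_noface spaces_dataset \
         RealEstate10K-subset google_scanned_objects nerf_synthetic nerf_llff_data; do
  [ -d "data/$d" ] && echo "OK      $d" || echo "MISSING $d"
done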
The code is tested with Python 3.9, CUDA 11.3, and PyTorch 1.10.1. Additional dependencies include:
torchvision
ConfigArgParse
imageio
matplotlib
numpy
opencv_contrib_python
Pillow
scipy
imageio-ffmpeg
lpips
scikit-image
loguru
Setup with Conda:
conda create -n pfgrt python=3.9
conda activate pfgrt
pip3 install torch==1.10.1+cu113 torchvision==0.11.2+cu113 torchaudio==0.10.1 -f https://download.pytorch.org/whl/torch_stable.html
pip3 install -r ./requirements.txt
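To confirm the environment matches the tested versions (a quick check; the expected output assumes the CUDA 11.3 wheels above):
# Should print 1.10.1+cu113 and True (on a machine with a CUDA-capable GPU)
python3 -c "import torch; print(torch.__version__, torch.cuda.is_available())"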
# Usage: python3 train.py --config <config> [--<other kwargs>]
# Train the view selector:
python3 train.py --config configs/view_selector.yaml
# Train the pose-free rendering transformer:
python3 train.py --config configs/pose_free_transfomer.yaml
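Since the project uses ConfigArgParse, command-line flags override values from the config file. For example, naming the run explicitly (a sketch; it assumes train.py accepts --expname the way eval.py does below):
# Hypothetical override: give the experiment an explicit name
python3 train.py --config configs/pose_free_transfomer.yaml --expname pfgrt_run1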
# Usage: python3 eval.py --run_val --N_samples 192 --config <config> [--<other kwargs>]
# --chunk_size sets the number of rays processed per forward pass; reduce it if you run out of GPU memory.
# Evaluate a single scene, e.g. orchids (LLFF) or drums (Blender):
python3 eval.py --config configs/pose_free_transfomer.yaml --eval_scenes orchids --expname gnt_orchids --chunk_size 10240 --run_val --N_samples 192
python3 eval.py --config configs/pose_free_transfomer.yaml --eval_scenes drums --expname gnt_drums --chunk_size 10240 --run_val --N_samples 192
# Evaluate all scenes in a dataset (e.g. LLFF):
python3 eval.py --config configs/pose_free_transfomer.yaml --expname llff --chunk_size 10240 --run_val --N_samples 192
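To sweep several scenes in one go, the single-scene command can be wrapped in a loop (the scene names here are standard LLFF scenes; adjust them to the folders in nerf_llff_data/):
# Evaluate a list of LLFF scenes one after another
for scene in orchids fern flower; do
  python3 eval.py --config configs/pose_free_transfomer.yaml --eval_scenes $scene \
    --expname gnt_$scene --chunk_size 10240 --run_val --N_samples 192
done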
# Render novel views on a custom scene
bash demo.sh
The code was recently tidied up for release and may still contain minor bugs. If anything breaks, please feel free to open an issue.
If you find our work useful for your research, please consider citing the paper:
@inproceedings{Fan2023PoseFreeGR,
  title={Pose-Free Generalizable Rendering Transformer},
  author={Zhiwen Fan and Panwang Pan and Peihao Wang and Yifan Jiang and Hanwen Jiang and Dejia Xu and Zehao Zhu and Dilin Wang and Zhangyang Wang},
  year={2023},
  url={https://api.semanticscholar.org/CorpusID:263671855}
}