
GenN2N: Generative NeRF2NeRF Translation

Xiangyue Liu, Han Xue, Kunming Luo, Ping Tan, Li Yi

CVPR 2024

Abstract

We present GenN2N, a unified NeRF-to-NeRF translation framework for various NeRF translation tasks such as text-driven NeRF editing, colorization, super-resolution, inpainting, etc. Unlike previous methods designed for individual translation tasks with task-specific schemes, GenN2N achieves all these NeRF editing tasks by employing a plug-and-play image-to-image translator to perform editing in the 2D domain and lifting 2D edits into the 3D NeRF space. Since the 3D consistency of 2D edits may not be assured, we propose to model the distribution of the underlying 3D edits through a generative model that can cover all possible edited NeRFs. To model the distribution of 3D edited NeRFs from 2D edited images, we carefully design a VAE-GAN that encodes images while decoding NeRFs. The latent space is trained to align with a Gaussian distribution and the NeRFs are supervised through an adversarial loss on their renderings. To ensure the latent code does not depend on 2D viewpoints but truly reflects the 3D edits, we also regularize the latent code through a contrastive learning scheme. Extensive experiments on various editing tasks show that GenN2N, as a universal framework, performs as well as or better than task-specific specialists while possessing flexible generative power.
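
For intuition, a plausible form of the overall training objective implied by the abstract (our sketch only; the term names and λ weights are our notation, not the paper's exact formulation):

$$\mathcal{L} \;=\; \mathcal{L}_{\mathrm{recon}} \;+\; \lambda_{\mathrm{KL}}\, D_{\mathrm{KL}}\big(q(z \mid I_{\mathrm{edited}}) \,\|\, \mathcal{N}(0, I)\big) \;+\; \lambda_{\mathrm{adv}}\,\mathcal{L}_{\mathrm{adv}} \;+\; \lambda_{\mathrm{contr}}\,\mathcal{L}_{\mathrm{contr}}$$

Here the KL term aligns the latent space with a Gaussian, the adversarial term supervises renderings of the edited NeRF, and the contrastive term encourages the latent code to reflect the 3D edit rather than the 2D viewpoint.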

Setup

Environment

  • Clone this repo
    git clone https://github.com/Lxiangyue/GenN2N.git
    cd GenN2N
  • Install dependencies in a fresh conda environment:
    conda create -n genn2n python=3.10
    conda activate genn2n
    pip install -r requirements.txt
    We modified nerfstudio and instruct-nerf2nerf, so install our versions from this repo in place of the upstream packages (required; the rm -r of the stale .egg-info directories may not be needed on a first install):
    cd nerfstudio && rm -r nerfstudio.egg-info && pip install -e . && cd ..
    cd instruct-nerf2nerf && rm -r in2n.egg-info && pip install -e . && cd ..
    All commands in the sections below are run from the instruct-nerf2nerf directory (a quick sanity check follows this list):
    cd instruct-nerf2nerf
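
To sanity-check the installation, nerfstudio's CLI should now be available (a minimal check, assuming the editable installs above succeeded):

# Lists the available training methods; "in2n" should appear among them.
ns-train --help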

Data

Model

  • To run the inference examples below, download our trained models and place them under instruct-nerf2nerf/outputs (a placement sketch follows this list):
    • Download fangzhou-small for the Text-driven Editing task.
    • Download redroof_palace for the Colorization task.
    • Download trex_lr for the Super-resolution task.
    • Download statue_order for the Inpainting task.
    • Organize the paths as:
      /instruct-nerf2nerf
          /outputs
              /fangzhou-small
              /redroof_palace
              /statue_order
              /trex_lr
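
A minimal placement sketch (the archive names below are hypothetical; substitute the actual download filenames):

cd instruct-nerf2nerf && mkdir -p outputs
# Hypothetical archive names -- adjust to whatever the downloads are called.
unzip fangzhou-small.zip -d outputs/
unzip redroof_palace.zip -d outputs/
unzip trex_lr.zip -d outputs/
unzip statue_order.zip -d outputs/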

Reproducing Experiments

Text-driven Editing

For example, to run inference with the editing prompt "Turn him into the Tolkien Elf" on fangzhou-small using our trained model:

ns-train in2n --data ../Data/fangzhou-small/ --load-dir outputs/fangzhou-small/in2n/2023-11-27_024729/nerfstudio_models --pipeline.prompt "Turn him into the Tolkien Elf" --pipeline.edited_imgs_path "../Data/fangzhou-small-editing/fangzhou_small_elf_1.25_6.5/fangzhou*" --pipeline.edited_imgs_path_type "png"  --pipeline.inference_num_per_ckpt 30  --max_num_iterations 1 --load_step 35000

For example, to train with the editing prompt "Turn him into the Tolkien Elf" on fangzhou-small:

ns-train in2n --data ../Data/fangzhou-small/ --load-dir outputs/fangzhou-small/nerfacto/2023-11-14_204006/nerfstudio_models --pipeline.prompt "Turn him into the Tolkien Elf" --pipeline.edited_imgs_path "../Data/fangzhou-small-editing/fangzhou_small_elf_1.25_6.5/fangzhou*" --pipeline.edited_imgs_path_type "png" --pipeline.inference_num_per_ckpt 30
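
Our reading of the recurring flags, inferred from the commands in this README (a hedged summary, not official documentation):

# --data                             original (unedited) scene used to train the source NeRF
# --load-dir                         checkpoint to initialize from: a nerfacto checkpoint for
#                                    training, an in2n checkpoint for inference
# --pipeline.prompt                  text instruction passed to the 2D image editor
# --pipeline.edited_imgs_path        glob matching the pre-edited 2D images
# --pipeline.edited_imgs_path_type   file extension of those images ("png" or "jpg")
# --pipeline.inference_num_per_ckpt  number of edited-NeRF samples rendered per checkpoint
# --max_num_iterations 1             skip optimization, i.e. run inference only
# --load_step                        training step of the checkpoint to load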

Colorization

For example, to run inference for colorization on redroof_palace with our trained model:

ns-train in2n --data ../Data/redroof_palace/ --load-dir outputs/redroof_palace/in2n/2023-11-15_135805/nerfstudio_models --pipeline.prompt "Make it colorful" --pipeline.edited_imgs_path "../Data/redroof_palace_color/redroof_palace*" --pipeline.edited_imgs_path_type "jpg"  --pipeline.inference_num_per_ckpt 30  --max_num_iterations 1 --load_step 35750

For example, to train colorization on redroof_palace:

ns-train in2n --data ../Data/redroof_palace/ --load-dir outputs/redroof_palace/nerfacto/2023-11-15_120546/nerfstudio_models --pipeline.prompt "Make it colorful" --pipeline.edited_imgs_path "../Data/redroof_palace_color/redroof_palace*" --pipeline.edited_imgs_path_type "jpg"  --pipeline.inference_num_per_ckpt 30  

Super-resolution

For example, to run inference for super-resolution on trex_lr with our trained model:

ns-train in2n --data ../Data/trex_lr/ --load-dir outputs/trex_lr/in2n/2023-11-17_040719/nerfstudio_models --pipeline.prompt "Make the image higher resolution" --pipeline.edited_imgs_path "../Data/trex_SR/trex_*" --pipeline.edited_imgs_path_type "png" --max_num_iterations 1 --load_step 36250

For example, to train super-resolution on trex_lr:

ns-train in2n --data ../Data/trex_lr --load-dir outputs/trex_lr/nerfacto/2023-11-14_204456/nerfstudio_models --pipeline.prompt "Make the image higher resolution" --pipeline.edited_imgs_path "../Data/trex_SR/trex_*" --pipeline.edited_imgs_path_type "png" 

Inpainting

For example, to run inference for inpainting on statue_order with our trained model:

ns-train in2n --data ../Data/statue_order/ --load-dir outputs/statue_order/in2n/2023-11-17_024339/nerfstudio_models --pipeline.prompt "remove the statue" --pipeline.edited_imgs_path "../Data/statue_repeat_removal/statue*" --pipeline.edited_imgs_path_type "png" --max_num_iterations 1 

For example, to train inpainting on statue_order:

ns-train in2n --data ../Data/statue_order/ --load-dir outputs/statue_order/nerfacto/2024-10-08_203621/nerfstudio_models --pipeline.prompt "remove the statue" --pipeline.edited_imgs_path "../Data/statue_repeat_removal/statue*" --pipeline.edited_imgs_path_type "png" 
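
All four tasks follow the same command pattern; the template below distills it from the examples above (a hedged sketch; the angle-bracket placeholders are ours, not literal values):

# Training: start from a plain nerfacto reconstruction of the scene.
ns-train in2n --data ../Data/<scene>/ --load-dir outputs/<scene>/nerfacto/<timestamp>/nerfstudio_models --pipeline.prompt "<instruction>" --pipeline.edited_imgs_path "../Data/<edited_imgs_dir>/<prefix>*" --pipeline.edited_imgs_path_type "<png|jpg>" --pipeline.inference_num_per_ckpt 30

# Inference: load the trained in2n checkpoint and render without optimizing.
ns-train in2n --data ../Data/<scene>/ --load-dir outputs/<scene>/in2n/<timestamp>/nerfstudio_models --pipeline.prompt "<instruction>" --pipeline.edited_imgs_path "../Data/<edited_imgs_dir>/<prefix>*" --pipeline.edited_imgs_path_type "<png|jpg>" --pipeline.inference_num_per_ckpt 30 --max_num_iterations 1 --load_step <step>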

Citation

If you find our work useful in your research, please cite:

@inproceedings{liu2024genn2n,
  title={GenN2N: Generative NeRF2NeRF Translation},
  author={Liu, Xiangyue and Xue, Han and Luo, Kunming and Tan, Ping and Yi, Li},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={5105--5114},
  year={2024}
}

Acknowledgements

The implementation of GenN2N is based on Instruct-NeRF2NeRF. Thanks to the authors for releasing their code.
