DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion


Weicai Ye, Chenhao Ji, Zheng Chen, Junyao Gao, Xiaoshui Huang, Song-Hai Zhang, Wanli Ouyang, Tong He, Cairong Zhao, Guofeng Zhang

NeurIPS 2024

Teaser

demo_vid

DiffPano enables scalable and consistent panorama generation (i.e. room switching) from unseen text descriptions and camera poses. Each column shows the multi-view panoramas generated while switching from one room to another.

Panoramic Video-Text Dataset Pipeline

demo_vid

Framework

demo_vid

Text to Single-View Panorama Generation

demo_vid

Text to Multi-View Panorama Generation

demo_vid

Text to Panoramic Video Generation

demo_vid

Brewing🍺, code coming soon.

Citation

If you find this work useful for your research, please cite it using the following BibTeX entry.

@article{Ye2024DiffPano,
  title={DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion},
  author={Weicai Ye and Chenhao Ji and Zheng Chen and Junyao Gao and Xiaoshui Huang and Song-Hai Zhang and Wanli Ouyang and Tong He and Cairong Zhao and Guofeng Zhang},
  journal={arXiv preprint},
  year={2024},
}