[ECCV 2022 Oral] RealFlow: EM-based Realistic Optical Flow Dataset Generation from Videos (Paper)

Yunhui Han¹, Kunming Luo², Ao Luo², Jiangyu Liu², Haoqiang Fan², Guiming Luo¹, Shuaicheng Liu^3,2*

1. Tsinghua University, 2. Megvii Research

3. University of Electronic Science and Technology of China

Abstract

Obtaining the ground truth labels from a video is challenging since the manual annotation of pixel-wise flow labels is prohibitively expensive and laborious. Besides, existing approaches try to adapt the trained model on synthetic datasets to authentic videos, which inevitably suffers from domain discrepancy and hinders the performance for realworld applications. To solve these problems, we propose RealFlow, an Expectation-Maximization based framework that can create large-scale optical flow datasets directly from any unlabeled realistic videos. Specifically, we first estimate optical flow between a pair of video frames, and then synthesize a new image from this pair based on the predicted flow. Thus the new image pairs and their corresponding flows can be regarded as a new training set. Besides, we design a Realistic Image Pair Rendering (RIPR) module that adopts softmax splatting and bi-directional hole filling techniques to alleviate the artifacts of the image synthesis. In the E-step, RIPR renders new images to create a large quantity of training data. In the M-step, we utilize the generated training data to train an optical flow network, which can be used to estimate optical flows in the next E-step. During the iterative learning steps, the capability of the flow network is gradually improved, so is the accuracy of the flow, as well as the quality of the synthesized dataset. Experimental results show that RealFlow outperforms previous dataset generation methods by a considerably large margin. Moreover, based on the generated dataset, our approach achieves state-of-the-art performance on two standard benchmarks compared with both supervised and unsupervised optical flow methods

Motivation

Top: previous methods use synthetic motion to produce training pairs. Bottom: we propose to construct training pairs with realistic motion labels from the real-world video sequence. We estimate optical flow between two frames as the training label and synthesize a ‘New Image 2’. Both the new view and flow labels are refined iteratively in the EM-based framework for mutual improvements.

Requirements

torch>=1.8.1
torchvision>=0.9.1
opencv-python>=4.5.2
timm>=0.4.5
cupy>=5.0.0
numpy>=1.15.0

Rendered Datasets

Download

You can download all the generated datasets and pretrained models in our paper:

Download the generated datasets using shell scripts dataset_download.sh

sh dataset_download.sh

the dataset will be downloaded in ./RF_dataset

Download the pretrained models using this link: pretrained_models.

Render New Data

Download the pretrained DPT model from here and pretrained RAFT C+T model (raft-things.pth) from here

Download KITTI multi-view Datasets. You can run the following command to render RF-Ktrain:

python RealFlow.py

You can also download ALOV and BDD100k from their official website to render RF-AB. Using utils/video2img.py to capture pictures.

You can simply render a new pair using:

python demo.py

Citation

If you find this work useful for your research, please cite:

@inproceedings{han2022realflow,
  title={RealFlow: EM-Based Realistic Optical Flow Dataset Generation from Videos},
  author={Han, Yunhui and Luo, Kunming and Luo, Ao and Liu, Jiangyu and Fan, Haoqiang and Luo, Guiming and Liu, Shuaicheng},
  booktitle={European Conference on Computer Vision},
  pages={288--305},
  year={2022}
}

Acknowledgements

Part of the code is adapted from previous works:

Our datasets are generated from KITTI, Sintel, BDD100k, DAVIS, and ALOV.

We thank all the authors for their contributions.

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
DPT/dpt		DPT/dpt
RAFT		RAFT
RFdata		RFdata
sample		sample
softmax_splatting		softmax_splatting
utils		utils
.gitignore		.gitignore
ACKNOWLEDGMENTS		ACKNOWLEDGMENTS
LICENSE		LICENSE
README.md		README.md
RealFLow.py		RealFLow.py
dataset_download.sh		dataset_download.sh
demo.py		demo.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

[ECCV 2022 Oral] RealFlow: EM-based Realistic Optical Flow Dataset Generation from Videos (Paper)

Yunhui Han¹, Kunming Luo², Ao Luo², Jiangyu Liu², Haoqiang Fan², Guiming Luo¹, Shuaicheng Liu^3,2*

1. Tsinghua University, 2. Megvii Research

3. University of Electronic Science and Technology of China

Abstract

Motivation

Requirements

Rendered Datasets

Download

Render New Data

Citation

Acknowledgements

About

Releases

Packages

Contributors 4

Languages

License

megvii-research/RealFlow

Folders and files

Latest commit

History

Repository files navigation

[ECCV 2022 Oral] RealFlow: EM-based Realistic Optical Flow Dataset Generation from Videos (Paper)

Yunhui Han1, Kunming Luo2, Ao Luo2, Jiangyu Liu2, Haoqiang Fan2, Guiming Luo1, Shuaicheng Liu3,2*

1. Tsinghua University, 2. Megvii Research

3. University of Electronic Science and Technology of China

Abstract

Motivation

Requirements

Rendered Datasets

Download

Render New Data

Citation

Acknowledgements

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Yunhui Han¹, Kunming Luo², Ao Luo², Jiangyu Liu², Haoqiang Fan², Guiming Luo¹, Shuaicheng Liu^3,2*

Packages