Add Kitti and Sintel datasets for optical flow #4845

NicolasHug · 2021-11-03T11:29:55Z

This PR adds the Kitti and Sintel datasets for optical flow.

Here are a few design decisions, following our initial discussion (in this PR):

Unlike Sintel, Kitti has a built-in valid mask indicating which flow values are valid. As a result, Sintel returns img1, img2, flow while Kitti returns img1, img2, flow, valid.
For both these datasets, the targets aren't known if split="test", so we return img1, img2, None (Sintel) and img1, img2, None, None (Kitti) in these cases.
For both, flow is a numpy array of shape (2, H, W) and valid is a boolean numpy array of shape (H, W). The images are PIL images.
The transforms expect to receive img1, img2, flow, valid, even for Sintel, and even in test mode.
add tests
write docs
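
The return convention in the list above can be sketched as follows. This is a minimal illustration, not the code from this PR: `as_transform_args` is a hypothetical helper, and the placeholder strings stand in for the PIL images and numpy arrays that the datasets actually return.

```python
def as_transform_args(sample):
    """Pad a dataset sample to the (img1, img2, flow, valid) 4-tuple.

    Hypothetical helper for illustration; not part of torchvision.
    """
    if len(sample) == 3:
        # Sintel-style sample: there is no valid mask, so pass None for it.
        img1, img2, flow = sample
        return img1, img2, flow, None
    if len(sample) == 4:
        # Kitti-style sample: already has a valid slot (None in test mode).
        return tuple(sample)
    raise ValueError(f"expected 3 or 4 items, got {len(sample)}")


# Placeholder values stand in for PIL images and numpy arrays.
sintel_train = ("img1", "img2", "flow")
sintel_test = ("img1", "img2", None)
kitti_train = ("img1", "img2", "flow", "valid")
kitti_test = ("img1", "img2", None, None)
```

With every sample padded to four items, a transform can be written once against four positional arguments and treat a `None` flow or a `None` valid mask as "nothing to transform".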

cc @pmeier

facebook-github-bot · 2021-11-03T11:30:01Z

💊 CI failures summary and remediations

As of commit ac4eaab (more details on the Dr. CI page):

2/2 failures introduced in this PR

2 failures not recognized by patterns:

Job	Step
binary_linux_conda_py3.6_cu111	packaging/build_conda.sh
binary_win_conda_py3.9_cu113	Build conda packages

1 job timed out:

binary_linux_conda_py3.6_cu111

torchvision/datasets/_optical_flow.py

torchvision/datasets/__init__.py

datumbox

Just a few thoughts:

torchvision/datasets/_optical_flow.py

NicolasHug · 2021-11-04T10:07:44Z

@datumbox @pmeier @fmassa thanks a lot for your very useful feedback.

I updated the PR accordingly (see original PR message above) and added tests and docs. This is now ready for a proper review.

NicolasHug · 2021-11-04T10:21:32Z

Rendered docs: https://963455-73328905-gh.circle-artifacts.com/0/docs/datasets.html

fmassa

Thanks!

vadimkantorov · 2021-11-04T11:42:20Z

Older datasets had the problem that returning a tuple doesn't allow for easily adding extra fields in the future, e.g. #3608. I propose returning an (img, target_dict) tuple, or even better, always returning a single dict, which also covers the case of skipping actual image loading. This also gives more uniformity when iterating over different related datasets.

pmeier · 2021-11-04T11:57:07Z

@vadimkantorov We are aware of that, and the new prototype dataset API will always return a single dictionary per sample. There is no documentation yet, but if you want to, you can have a look into torchvision/prototype/datasets to see what things will look like in the future. Plus, if you have feedback, that is very welcome!

vadimkantorov · 2021-11-04T11:58:37Z

If you're committed to using only tuples, please also return some internal example id, image/video file name, or something of that form. Then at least people can hack around and fetch additional information.
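
One way to work around tuple samples on the user side, in the spirit of the suggestion above, is a thin wrapper that names the fields and attaches the sample index. The sketch below assumes only that the dataset supports `len()` and integer indexing; `DictSampleView` and the key names are hypothetical and are not part of torchvision or of the prototype API.

```python
class DictSampleView:
    """Expose a tuple-returning dataset as dict samples with an index.

    Hypothetical wrapper for illustration; not part of torchvision.
    """

    def __init__(self, dataset, keys):
        self.dataset = dataset
        self.keys = tuple(keys)

    def __len__(self):
        return len(self.dataset)

    def __getitem__(self, index):
        sample = self.dataset[index]
        if len(sample) != len(self.keys):
            raise ValueError(
                f"sample has {len(sample)} items, expected {len(self.keys)}"
            )
        # Name each positional field, then record where the sample came from.
        out = dict(zip(self.keys, sample))
        out["index"] = index
        return out


# A plain list of tuples stands in for a Kitti-style dataset here.
fake_kitti = [("a1", "a2", "flow_a", "valid_a"), ("b1", "b2", "flow_b", "valid_b")]
view = DictSampleView(fake_kitti, keys=("img1", "img2", "flow", "valid"))
second = view[1]
```

The index is enough to look up a file name or other metadata if the underlying dataset keeps such a list, and new fields can be added to the dict later without breaking code that unpacks by key.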

NicolasHug · 2021-11-04T12:26:28Z

Thanks for your input @vadimkantorov

I don't think any of the datasets return that kind of info so far (sadly), so returning a dict with extra info would make KittiFlow and Sintel outliers w.r.t. what we currently have. Returning the ids etc. is definitely useful, but such a new design is better suited for a complete re-work of the datasets, which we are currently doing, as @pmeier pointed out above.

datumbox

LGTM, thanks!

It will definitely be interesting to see how the ideas mentioned above can be incorporated into the new API, but I think it's worth merging this PR to unblock your work on the flow models.

pmeier

LGTM, thanks @NicolasHug!

Reviewed By: kazhang Differential Revision: D32216685 fbshipit-source-id: ec74c2a573eace36bd4a0cf9913ea1dc77fcf260

Add Kitti and Sintel

f5df5fc

pytorch-probot bot added the ciflow/default label Nov 3, 2021

facebook-github-bot added the cla signed label Nov 3, 2021

NicolasHug marked this pull request as draft November 3, 2021 11:30

datumbox reviewed Nov 3, 2021

NicolasHug added 5 commits November 3, 2021 18:40

Add tests

c3dd41b

Add some docs

8c76602

More docs

e6ecc4e

more docs

721b94b

more docs

b59a5c7

NicolasHug marked this pull request as ready for review November 4, 2021 10:06

Merge branch 'main' of github.com:pytorch/vision into add_flow_datasets

e03f37f

fmassa approved these changes Nov 4, 2021

NicolasHug added 2 commits November 4, 2021 10:38

test -> testing for Kitti

6f95da0

less vert space

3b8ba30

datumbox mentioned this pull request Nov 4, 2021

Add support of the test split on ImageNet #4855

Closed

datumbox approved these changes Nov 4, 2021

This was referenced Nov 4, 2021

Add FlyingThings3D dataset for optical flow #4858

Merged

Add FlyingChairs dataset for optical flow #4860

Merged

pmeier approved these changes Nov 4, 2021

Merge branch 'main' into add_flow_datasets

ac4eaab

NicolasHug merged commit 50a3571 into pytorch:main Nov 4, 2021

NicolasHug added the module: datasets and new feature labels Nov 4, 2021

NicolasHug mentioned this pull request Nov 4, 2021

RAFT model and training reference #4644

Closed

12 tasks

facebook-github-bot pushed a commit that referenced this pull request Nov 8, 2021

[fbsync] Add Kitti and Sintel datasets for optical flow (#4845)

315f1a2


cyyever pushed a commit to cyyever/vision that referenced this pull request Nov 16, 2021

Add Kitti and Sintel datasets for optical flow (pytorch#4845)

346ee36
