Compatibility layer between stable datasets and prototype transforms #6663

pmeier · 2022-09-28T15:22:19Z

This is the proof-of-concept implementation for #6662. Let's keep the discussion there unless a technical issue of the proposal needs to be discussed here.

cc @vfdev-5 @bjuncek

torchvision/prototype/features/__init__.py

torchvision/prototype/features/_dataset_wrapper.py

torchvision/prototype/features/_feature.py

pmeier · 2023-01-31T08:16:06Z

dbfac05 is a refactor of the internals. Here are the main points:

Change the main entry point from the classmethod VisionDatasetDatapointWrapper.from_torchvision_dataset to a standalone function wrap_dataset_for_transforms_v2 (naming TBD). With this we can make the class private, so users should have no touchpoints with the internals. On top of that, it simplifies the implementation quite a bit.
Drop support for additional parameters of the wrapper to specify the dtypes of the wrapped objects, the bounding box format, and whether we want to keep images as PIL. See
Simplify the internal machinery. The discussion in "compatibility layer between stable datasets and prototype transforms? #6662 (comment)" established that there is no easy way to provide standalone wrappers for the individual datapoints of a sample. Thus, there is no need to keep the bits that try to do so. This addresses "Compatibility layer between stable datasets and prototype transforms #6663 (comment)", although not as proposed there with a standalone class per task such as classification. @NicolasHug PTAL and LMK if my new implementation is simple enough.

NicolasHug

Thanks Philip, I haven't looked at the individual wrappers in depth, but the overall design looks great (I appreciate the simplifications!). I left some comments / Qs below

torchvision/prototype/datapoints/_dataset_wrapper.py

pmeier

The latest commits add the following changes:

Following "introduce heuristic for simple tensor handling of transforms v2" (#7170), we no longer need special handling for any non-image parts of the samples, since all our datasets fit the heuristic.
Following "remove categories metadata from (OneHot)Label datapoint" (#7171 (comment)), we no longer use the datapoints.Label class. Together with the pass-through for PIL images, this means that this wrapper is a no-op for classification datasets. This also resolves all discussions regarding the categories, since they are no longer present.
Instead of having a wrapper that takes the dataset as well as the sample as inputs, i.e.
```
def wrapper(dataset, sample):
    ...
    return wrapped_sample
```
the architecture was changed to a factory pattern:
```
def wrapper_factory(dataset):
    def wrapper(sample):
        ...
        return wrapped_sample

    return wrapper
```
In addition to making the wrapper more "natural", this also enables us to raise on unsupported behavior at wrapping time rather than when the first sample is drawn. That should improve UX.
I've added automated smoke tests to our datasets v1 tests to make sure wrapping and drawing samples doesn't raise anything.
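As a rough illustration of why the factory pattern lets errors surface at wrapping time, here is a minimal, self-contained sketch. Everything in it is an assumption made for illustration: `ToyDetectionDataset`, `WRAPPER_FACTORIES`, `register`, and `WrappedDataset` are hypothetical names and do not reflect the actual torchvision code.

```python
from typing import Any, Callable, Dict

# Hypothetical registry: dataset class -> wrapper factory.
WRAPPER_FACTORIES: Dict[type, Callable[[Any], Callable[[Any], Any]]] = {}


def register(dataset_cls: type) -> Callable:
    """Decorator that records a wrapper factory for a dataset class."""
    def decorator(factory: Callable) -> Callable:
        WRAPPER_FACTORIES[dataset_cls] = factory
        return factory
    return decorator


class ToyDetectionDataset:
    """Stand-in for a stable dataset; not a torchvision class."""

    def __init__(self, target_type: str = "boxes") -> None:
        self.target_type = target_type
        self._samples = [("image-0", [0, 0, 10, 10]), ("image-1", [5, 5, 20, 20])]

    def __getitem__(self, idx: int):
        return self._samples[idx]

    def __len__(self) -> int:
        return len(self._samples)


@register(ToyDetectionDataset)
def toy_detection_wrapper_factory(dataset: ToyDetectionDataset):
    # Stage 1: runs once, when the dataset is wrapped.
    if dataset.target_type != "boxes":
        raise ValueError(f"Unsupported target_type: {dataset.target_type!r}")

    # Stage 2: runs for every sample that is drawn.
    def wrapper(sample):
        image, box = sample
        return image, {"boxes": [box], "format": "XYXY"}

    return wrapper


class WrappedDataset:
    def __init__(self, dataset: Any) -> None:
        factory = WRAPPER_FACTORIES.get(type(dataset))
        if factory is None:
            raise TypeError(f"No wrapper registered for {type(dataset).__name__}")
        self._dataset = dataset
        # Calling the factory here means configuration errors are raised now.
        self._wrapper = factory(dataset)

    def __getitem__(self, idx: int):
        return self._wrapper(self._dataset[idx])

    def __len__(self) -> int:
        return len(self._dataset)
```

In this sketch, `WrappedDataset(ToyDetectionDataset(target_type="masks"))` fails immediately with a `ValueError`, whereas a wrapper taking `(dataset, sample)` would only fail once the first sample is requested.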

test/test_datasets.py

NicolasHug

Thanks a lot Philip, LGTM. I just have minor comments (that we discussed offline and are probably already addressed) + one minor Q about wrap_target_by_type.

I'll admit I haven't taken a very deep look at the individual wrappers. We may have to do a bit of manual testing for those later.

test/datasets_utils.py

torchvision/prototype/datapoints/_dataset_wrapper.py

github-actions · 2023-02-10T14:32:29Z

Hey @pmeier!

You merged this PR, but no labels were added. The list of valid labels is available at https://github.com/pytorch/vision/blob/main/.github/process_commit.py

…ype transforms (#6663) Reviewed By: vmoens Differential Revision: D44416279 fbshipit-source-id: a3c1ba2048917c5af3005beef6cec77896ab20f8

pmeier added 5 commits September 21, 2022 21:56

PoC

d6786ac

Merge branch 'main'

3a916c8

Conflicts: torchvision/prototype/features/__init__.py torchvision/prototype/features/_feature.py

Merge branch 'main' into dataset-wrappers

d77ef0b

cleanup

63e1148

Merge branch 'main' into dataset-wrappers

13a820c

facebook-github-bot added the cla signed label Sep 28, 2022

pmeier mentioned this pull request Sep 28, 2022

compatibility layer between stable datasets and prototype transforms? #6662

Closed

pmeier commented Sep 28, 2022

Merge branch 'main' into dataset-wrappers

fb600a7

pmeier mentioned this pull request Nov 4, 2022

[FEEDBACK] Transforms V2 API #6753

Closed

pmeier mentioned this pull request Jan 16, 2023

[NOMRG] TransformsV2 TODOs #7082

Closed

NicolasHug reviewed Jan 25, 2023

torchvision/prototype/features/_dataset_wrapper.py (outdated, resolved)

NicolasHug reviewed Jan 25, 2023

torchvision/prototype/features/_dataset_wrapper.py (outdated, resolved)

pmeier added 2 commits January 30, 2023 17:25

Merge branch 'main' into dataset-wrappers

cae3e71

refactor

dbfac05

pmeier added 2 commits January 31, 2023 09:18

handle None label for test set use case

2dba1c7

minor cleanup

bcd7620

pmeier requested a review from NicolasHug January 31, 2023 08:27

pmeier mentioned this pull request Jan 31, 2023

remove datapoints compatibility for prototype datasets #7154

Merged

NicolasHug reviewed Feb 1, 2023

pmeier added 9 commits February 1, 2023 16:50

Merge branch 'main' into dataset-wrappers

f72ed86

minor refactorings

fe6be60

minor cache refactoring for COCO

cff9092

remove GenericDatapoint for now

9965492

Merge branch 'main' into dataset-wrappers

a588686

add all detection and segmentation datasets

d64e1a9

add Image/DatasetFolder

49cc8e7

add video datasets

8e12bad

nuke annotations

7a9f083

pmeier added 3 commits February 9, 2023 13:03

remove categories and refactor wrapping architecture

22288ce

add tests

a88aec3

cleanup

ce740c1

pmeier marked this pull request as ready for review February 9, 2023 14:14

Merge branch 'main' into dataset-wrappers

edad790

pmeier commented Feb 9, 2023

test/test_datasets.py (resolved)

pmeier added 6 commits February 9, 2023 15:28

remove GenericDatapoint

3398822

Merge branch 'dataset-wrappers' of https://github.com/pmeier/vision i…

b565426

…nto dataset-wrappers

Merge branch 'main' into dataset-wrappers

a236f9c

move wrapper instantiation into the class

331a66d

use decorator registering everywhere

48405b8

hard depend on wrapper in stable tests

0286238

NicolasHug approved these changes Feb 9, 2023

pmeier added 5 commits February 9, 2023 17:13

remove target type wrapping default

be42cc9

make test more strict

e3c4d50

fix cityscapes instance return

351becb

add comment for two stage design

8ed41ba

Merge branch 'main' into dataset-wrappers

f0e1af7

pmeier mentioned this pull request Feb 10, 2023

remove videos from test for DatasetFolder #7216

Merged

NicolasHug mentioned this pull request Feb 10, 2023

TODOs before 0.15 release #7217

Closed

49 tasks

Merge branch 'main' into dataset-wrappers

dbebe40

pmeier linked an issue Feb 10, 2023 that may be closed by this pull request

compatibility layer between stable datasets and prototype transforms? #6662

Closed

pmeier merged commit a9d2572 into pytorch:main Feb 10, 2023

pmeier deleted the dataset-wrappers branch February 10, 2023 14:32

NicolasHug changed the title ~~[PoC] compatibility layer between stable datasets and prototype transforms~~ Compatibility layer between stable datasets and prototype transforms Feb 10, 2023

NicolasHug added the prototype, module: datasets, and module: transforms labels Feb 10, 2023
