Detection Datasets in Torchvision #3047

Closed
@oke-aditya

🚀 Feature

Can we have the Penn-Fudan dataset in torchvision.datasets?

Motivation

We use this dataset very often in tutorials!
It would be much easier to prototype if we had

torchvision.datasets.PenFudan(root: str, transform: Optional[Callable] = None, target_transform: Optional[Callable] = None, download: bool = False)

Then we could easily load this dataset in VOC format.
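As a rough illustration of what "VOC format" targets could look like, here is a minimal sketch that turns an instance mask into `(xmin, ymin, xmax, ymax)` boxes. It assumes the mask stores 0 for background and a distinct positive integer per pedestrian, and it uses plain nested lists so that it runs without any dependencies. The function name `voc_target_from_mask` and the returned dict layout are hypothetical, not an existing torchvision API.

```python
from typing import Dict, List

# A mask is a list of rows; 0 is background, k > 0 marks pixels of instance k.
Mask = List[List[int]]


def voc_target_from_mask(mask: Mask) -> Dict[str, list]:
    """Build a VOC-style target (xmin, ymin, xmax, ymax) from an instance mask.

    Hypothetical helper for illustration only.
    """
    extents: Dict[int, List[int]] = {}
    for y, row in enumerate(mask):
        for x, value in enumerate(row):
            if value == 0:  # skip background pixels
                continue
            if value not in extents:
                # first pixel seen for this instance
                extents[value] = [x, y, x, y]
            else:
                box = extents[value]
                box[0] = min(box[0], x)
                box[1] = min(box[1], y)
                box[2] = max(box[2], x)
                box[3] = max(box[3], y)
    instance_ids = sorted(extents)
    return {
        "boxes": [extents[i] for i in instance_ids],
        # single foreground class, labelled 1 so that 0 stays background
        "labels": [1] * len(instance_ids),
    }


toy_mask = [
    [0, 1, 1, 0, 0],
    [0, 1, 1, 0, 2],
    [0, 0, 0, 0, 2],
]
target = voc_target_from_mask(toy_mask)
print(target["boxes"])   # [[1, 0, 2, 1], [4, 1, 4, 2]]
print(target["labels"])  # [1, 1]
```

A real dataset class would read the mask from disk and return tensors; the sketch only shows the box extraction step.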

Pitch

It is very commonly used and is often the first dataset that comes to mind for detection and segmentation tasks.
Penn-Fudan is a simple dataset to prototype with, compared to COCO.
It would be simple to load the data in VOC format, which is directly compatible with torchvision models.
This would keep quickstarts and prototyping very fast, like CIFAR10 does!

I'm not sure about some aspects:

  1. Should the targets start from 0 or 1? In torchvision we assume 0 to be background, but that might not always be true.
  2. Should we load the boxes in VOC format, or have a parameter to control that? We could use box_convert and return the boxes in the format people need.
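To make the two questions concrete, here is a minimal dependency-free sketch of both choices. The `out_fmt` strings mirror the `"xyxy"`, `"xywh"` and `"cxcywh"` formats accepted by `torchvision.ops.box_convert`, which performs the same conversion on tensors. The helper names `convert_box` and `shift_labels`, and the `background_is_zero` flag, are hypothetical and only illustrate what such a parameter could do.

```python
from typing import List, Sequence


def convert_box(box: Sequence[float], out_fmt: str = "xyxy") -> List[float]:
    """Convert one VOC-style (xmin, ymin, xmax, ymax) box to another format."""
    xmin, ymin, xmax, ymax = box
    width, height = xmax - xmin, ymax - ymin
    if out_fmt == "xyxy":
        return [xmin, ymin, xmax, ymax]
    if out_fmt == "xywh":
        return [xmin, ymin, width, height]
    if out_fmt == "cxcywh":
        return [xmin + width / 2, ymin + height / 2, width, height]
    raise ValueError(f"unsupported box format: {out_fmt!r}")


def shift_labels(labels: Sequence[int], background_is_zero: bool = True) -> List[int]:
    """Map zero-based class indices to targets.

    With background_is_zero=True, every class index is shifted up by one so
    that 0 is reserved for background; otherwise labels are returned unchanged.
    """
    return [label + 1 for label in labels] if background_is_zero else list(labels)


voc_box = [10, 20, 50, 80]
print(convert_box(voc_box, "xywh"))    # [10, 20, 40, 60]
print(convert_box(voc_box, "cxcywh"))  # [30.0, 50.0, 40, 60]
print(shift_labels([0, 0]))            # [1, 1]
```

Keeping VOC format as the default and converting on request would leave the output directly usable by torchvision detection models while still serving other conventions.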

Alternatives

Currently, the Object Detection tutorial shows how to load and use it.

It would be a nice addition, as we don't have a handy detection dataset to prototype with (apart from COCO).

Additional context

I couldn't find the paper and citation count, so I'm not sure whether those are needed to add a dataset to torchvision.

cc @pmeier
