Description
🚀 Feature
Can we have the Penn-Fudan dataset in torchvision.datasets?
Motivation
We use this dataset often, especially in tutorials.
It would be much easier to prototype if we had
torchvision.datasets.PenFudan(root: str, transform: Optional[Callable] = None, target_transform: Optional[Callable] = None, download: bool = False)
Then we could easily load this dataset in VOC format.
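To make "VOC format" concrete, here is a minimal, dependency-free sketch of how a target could be derived from one instance mask. It assumes that pixel value 0 is background and that every other value marks one pedestrian instance; the helper name `mask_to_voc_target` and the plain-list representation are hypothetical and only for illustration. A real implementation would work on tensors.

```python
from typing import Dict, List


def mask_to_voc_target(mask: List[List[int]]) -> Dict[str, list]:
    """Build a detection target from an instance mask (hypothetical helper).

    Assumption: 0 is background, every other pixel value is one instance.
    Boxes are returned as [xmin, ymin, xmax, ymax] in pixel coordinates.
    """
    extents: Dict[int, List[int]] = {}
    for y, row in enumerate(mask):
        for x, instance_id in enumerate(row):
            if instance_id == 0:
                continue  # skip background pixels
            if instance_id not in extents:
                extents[instance_id] = [x, y, x, y]
            else:
                box = extents[instance_id]
                box[0] = min(box[0], x)
                box[1] = min(box[1], y)
                box[2] = max(box[2], x)
                box[3] = max(box[3], y)
    ids = sorted(extents)
    return {
        "boxes": [extents[i] for i in ids],
        # Single foreground class; label 1 assumes 0 is reserved for background.
        "labels": [1] * len(ids),
    }


# Tiny 3x5 mask with two instances (ids 1 and 2).
mask = [
    [0, 1, 1, 0, 0],
    [0, 1, 1, 0, 2],
    [0, 0, 0, 0, 2],
]
target = mask_to_voc_target(mask)
```

In this sketch every instance gets the same label, since the dataset has a single foreground class.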
Pitch
It is very commonly used and is often the first dataset that comes to mind for detection and segmentation tasks.
Penn-Fudan is a simple dataset to prototype with instead of COCO.
It would be really simple to load the data in VOC format, which is directly compatible with torchvision models.
This would keep quickstarts and prototyping very fast, like CIFAR10 does!
I'm not sure about some aspects:
- Should the targets start from 0 or 1? In torchvision we assume 0 to be background, but that might not always be true.
- Should we load the boxes in VOC format, or have a parameter to control that? We could use box_convert and return the boxes in the format people need.
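The two open questions can be sketched as options on the loading code. The snippet below is a plain-Python illustration only: `convert_xyxy`, `make_labels`, and the `background_is_zero` flag are hypothetical names, not part of torchvision. In torchvision itself the conversion would go through `torchvision.ops.box_convert(boxes, in_fmt, out_fmt)` on tensors; the arithmetic here mirrors its "xyxy", "xywh", and "cxcywh" formats.

```python
from typing import List, Sequence


def convert_xyxy(box: Sequence[float], out_fmt: str = "xyxy") -> List[float]:
    """Convert one [xmin, ymin, xmax, ymax] box to the requested format."""
    xmin, ymin, xmax, ymax = box
    if out_fmt == "xyxy":
        return [xmin, ymin, xmax, ymax]
    width, height = xmax - xmin, ymax - ymin
    if out_fmt == "xywh":
        return [xmin, ymin, width, height]
    if out_fmt == "cxcywh":
        return [xmin + width / 2, ymin + height / 2, width, height]
    raise ValueError(f"Unsupported box format: {out_fmt!r}")


def make_labels(num_instances: int, background_is_zero: bool = True) -> List[int]:
    """Label every instance 1 if 0 is reserved for background, else 0."""
    label = 1 if background_is_zero else 0
    return [label] * num_instances


voc_box = [10, 20, 50, 100]
xywh_box = convert_xyxy(voc_box, "xywh")
cxcywh_box = convert_xyxy(voc_box, "cxcywh")
labels = make_labels(2)
```

Defaulting to VOC-style boxes and labels starting at 1 would match what the torchvision detection models expect, while the options leave room for other conventions.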
Alternatives
Currently, the Object Detection tutorial shows how to load and use it.
It would be a nice addition, as we don't have a handy detection dataset to prototype with (apart from COCO).
Additional context
I couldn't find the paper or its citation count, and I'm not sure whether those are required for adding a dataset to torchvision.
cc @pmeier