
DALI support #608

Open
moskomule opened this issue Sep 20, 2018 · 13 comments

@moskomule
Contributor

Hi, is there any plan to integrate DALI (https://docs.nvidia.com/deeplearning/sdk/dali-developer-guide/docs/index.html) into torchvision for faster preprocessing? I found that Chainer is trying to integrate it (chainer/chainer#5067).

@fmassa
Member

fmassa commented Sep 21, 2018

Hi,
Thanks for opening the issue. I'll have a look at this

@moskomule
Contributor Author

Thank you. Lately I've found that the image preprocessing steps are the bottleneck. I'll try DALI myself and report how much it speeds up processing.

@sotte
Contributor

sotte commented Oct 2, 2018

albumentations is also a contender for faster image augmentation.

In my experience, IO is often a bigger bottleneck than a "slow" pre-processing library. SSDs and NVMe drives(!) help a lot.
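
For reference, a minimal albumentations pipeline looks roughly like this (a sketch only; it assumes an HWC uint8 NumPy image, and the exact transform set is just an example):

import albumentations as A
import numpy as np

# A sketch of a basic augmentation pipeline; Resize, HorizontalFlip and
# Normalize are standard albumentations transforms.
transform = A.Compose([
    A.Resize(256, 256),
    A.HorizontalFlip(p=0.5),
    A.Normalize(),
])

image = np.random.randint(0, 256, (512, 512, 3), dtype=np.uint8)  # dummy HWC image
augmented = transform(image=image)["image"]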

@msaroufim
Member

Hi @datumbox, it's been a while since this issue has had any discussion. I'm curious whether there are any plans to make this happen?

@datumbox
Contributor

@msaroufim we are currently working to improve the data loading process using PyTorch Data. We do not have immediate plans to integrate DALI directly at the moment, but we can revisit this in the future. As we have very limited resources, I think it's more realistic that such an investigation happens after the release of the new Datasets API.

cc'ing @NicolasHug and @pmeier, who lead the work on datasets.

@msaroufim
Member

msaroufim commented Apr 25, 2022

Oh interesting, so the way you'd integrate new backends in the future is to integrate them within torch.data? Also, where can I learn more about the new Datasets API?

cc @VitalyFedyunin @ejguan @wenleix

@pmeier
Collaborator

pmeier commented Apr 26, 2022

Oh interesting so the way you'd integrate new backends in the future is to integrate them within torch.data?

Not sure what you mean by "backends" here. In general you are right though. torchdata is the way to go for the new datasets.

Also where can I learn more about the new Datasets API?

There is no public documentation yet. However, we already have quite a large collection of datasets ported to the new structure. You can access them with torchvision.prototype.datasets.load(name), where name is the name of the dataset you want to load. For example:

from torchvision.prototype import datasets

dataset = datasets.load("voc")

The dataset object is a regular IterDataPipe defined by torchdata. To transform it you can use the .map method. It takes a callable that will be executed for each sample in the dataset. This sample will be a dictionary with str keys. For example, a simple data pipeline could look like this:

from torchvision.prototype import transforms

transform = transforms.Compose(
    transforms.DecodeImage(),
    transforms.Resize(256),
    transforms.CenterCrop(256),
)

for sample in dataset.map(transform):
    ...

For everything else, please also have a look at the torchdata documentation.
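
As a rough sketch of how such a pipe plugs into training (assuming the dataset and transform from the snippet above), further datapipe operations can be chained before handing the result to a DataLoader; shuffle and batch are standard IterDataPipe operations:

from torch.utils.data import DataLoader

# Sketch only: .shuffle() and .batch() are built-in datapipe operations;
# batch_size=None disables the DataLoader's own batching, since the pipe
# already yields lists of 4 samples.
pipe = dataset.map(transform).shuffle().batch(4)
loader = DataLoader(pipe, batch_size=None)

for batch in loader:
    ...  # batch is a list of 4 sample dicts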

@abhi-glitchhg
Contributor

Adding to @pmeier's comment, this tutorial might help you.

@msaroufim
Member

msaroufim commented Apr 26, 2022

@pmeier to clarify, by backend I mean one of these: https://github.com/pytorch/vision#image-backend, i.e. Pillow, accimage, Pillow-SIMD, etc.

Overall the new interface for adding datasets looks good, but I'm more curious about adding new backends like DALI. In particular, DALI has accelerated image processing kernels and accelerated image decoding, which I think would be very useful to integrate into torchvision directly; it feels too domain-specific to live in torch.data IMHO, and it is similar enough to other backends like accimage to belong in vision. What's the process for adding a new backend? If it's similar to the one for accimage (https://github.com/pytorch/vision/blob/main/torchvision/transforms/functional.py#L13), I can make a PR for this.
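
For context, the existing backend switch is essentially a global flag plus per-call dispatch; a DALI-backed loader could hypothetically hook into the same pattern (a sketch only: the "dali" backend name and load_with_dali helper below do not exist in torchvision, only the PIL/accimage dispatch does):

import torchvision

# Real torchvision API: the image backend is a global switch between "PIL" and "accimage".
torchvision.set_image_backend("accimage")  # requires accimage to be installed

def load_image(path):
    backend = torchvision.get_image_backend()
    if backend == "accimage":
        import accimage
        return accimage.Image(path)   # how torchvision's accimage loader works
    if backend == "dali":             # hypothetical backend name, illustrative only
        return load_with_dali(path)   # hypothetical helper
    from PIL import Image
    return Image.open(path).convert("RGB")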

The other option is to integrate the DALI data loader as a DataPipe in torch.data.

Here's a good primer on DALI and its value proposition https://cceyda.github.io/blog/dali/cv/image_processing/2020/11/10/nvidia_dali.html
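
For a sense of the value proposition, a minimal DALI pipeline sketch (assuming nvidia-dali is installed and JPEGs live under a hypothetical /data/images directory) moves both decoding and resizing onto the GPU:

from nvidia.dali import pipeline_def, fn
from nvidia.dali.plugin.pytorch import DALIGenericIterator

@pipeline_def(batch_size=32, num_threads=4, device_id=0)
def decode_and_resize():
    # device="mixed" reads files on the CPU and decodes on the GPU (nvJPEG).
    jpegs, labels = fn.readers.file(file_root="/data/images", random_shuffle=True, name="reader")
    images = fn.decoders.image(jpegs, device="mixed")
    images = fn.resize(images, resize_x=256, resize_y=256)
    return images, labels

pipe = decode_and_resize()
pipe.build()
loader = DALIGenericIterator(pipe, ["images", "labels"], reader_name="reader")

for batch in loader:
    images = batch[0]["images"]  # already a CUDA tensor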

@VitalyFedyunin @wenleix please chime in on where you think the most natural place for a DALI integration is

@ejguan
Contributor

ejguan commented Apr 26, 2022

The other option is to integrate the DALI data loader as a data pipe in torch.data

Thanks @msaroufim, I had the same feeling about making it a separate DataPipe, because it requires different behavior compared with datapipe.map, such as making sure this DataPipe only runs in a single process to prevent the CUDA context from being copied around. It definitely needs a deeper look at DALI itself.
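
A rough sketch of that shape, purely illustrative (the class and argument names are made up; the single-process constraint from the comment above is only noted in the docstring):

from torch.utils.data import IterDataPipe

class DALIPipelineDataPipe(IterDataPipe):
    """Hypothetical wrapper exposing a built DALI iterator as an IterDataPipe.

    It must run in a single process/worker so the CUDA context created by
    DALI is not copied into DataLoader workers.
    """

    def __init__(self, dali_iterator):
        self.dali_iterator = dali_iterator  # e.g. a DALIGenericIterator

    def __iter__(self):
        for batch in self.dali_iterator:
            yield batch[0]  # one dict of CUDA tensors per GPU
        self.dali_iterator.reset()  # allow re-iteration across epochs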

@msaroufim
Member

Seems like there's a good workaround too: NVIDIA/DALI#3081 (comment). I'll take a more thorough look.

@pmeier
Collaborator

pmeier commented Apr 27, 2022

@msaroufim

to clarify, by backend I mean one of these: https://github.com/pytorch/vision#image-backend, i.e. Pillow, accimage, Pillow-SIMD, etc.

The new datasets will return a features.EncodedImage, which is a 1D uint8 tensor just storing the raw bytes. You can decode it however you want. Right now, transforms.DecodeImage() uses PIL as the backend:

class DecodeImage(Transform):
    def _transform(self, input: Any, params: Dict[str, Any]) -> Any:
        if isinstance(input, features.EncodedImage):
            output = F.decode_image_with_pil(input)
            return features.Image(output)
        else:
            return input

def decode_image_with_pil(encoded_image: torch.Tensor) -> torch.Tensor:
    image = torch.as_tensor(np.array(PIL.Image.open(ReadOnlyTensorBuffer(encoded_image)), copy=True))
    if image.ndim == 2:
        image = image.unsqueeze(2)
    return image.permute(2, 0, 1)

but you can use arbitrary backends there.
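
For instance, a drop-in decoder built on torchvision.io instead of PIL might look like this (a sketch; torchvision.io.decode_image is an existing function, the wrapper name is made up):

import torch
from torchvision.io import decode_image

def decode_image_with_torchvision_io(encoded_image: torch.Tensor) -> torch.Tensor:
    # `encoded_image` is the 1D uint8 tensor of raw bytes described above;
    # decode_image returns a CHW uint8 tensor, matching the PIL-based path.
    return decode_image(encoded_image)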

@abhi-glitchhg
Contributor

Similar issue on the torchdata repo: pytorch/data#761.
Might be good to keep an eye on this :)
