
PIL version check for enum change appears to break SIMD versions #6153

Closed
rwightman opened this issue Jun 10, 2022 · 23 comments · Fixed by #6154

Comments

@rwightman
Contributor

🐛 Describe the bug

This change (#5898) appears to break the current Pillow-SIMD version:

  File "/home/.../lib/python3.10/site-packages/torchvision/transforms/_pil_constants.py", line 7, in <genexpr>
    if tuple(int(part) for part in PIL.__version__.split(".")) >= (9, 1):
ValueError: invalid literal for int() with base 10: 'post1'
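For reference, the failure is easy to reproduce without Pillow-SIMD installed, and a tolerant parse (a sketch of the general idea, not necessarily the fix torchvision adopted) would only compare the leading numeric segments:

```python
# Minimal repro: Pillow-SIMD reports a version like "9.0.0.post1",
# whose "post1" segment is not an integer.
version = "9.0.0.post1"  # value of PIL.__version__ under Pillow-SIMD

try:
    parsed = tuple(int(part) for part in version.split("."))
except ValueError as exc:
    print(exc)  # invalid literal for int() with base 10: 'post1'

def release_tuple(version_string):
    """Keep only the leading numeric segments of a version string."""
    parts = []
    for part in version_string.split("."):
        if not part.isdigit():
            break
        parts.append(int(part))
    return tuple(parts)

print(release_tuple("9.0.0.post1") >= (9, 1))  # False
print(release_tuple("9.1.0") >= (9, 1))        # True
```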

Amusingly enough, I warned against this approach in a user's PR in timm: huggingface/pytorch-image-models#1256

Would be nice to have this fixed before 1.12 is finalized; I just hit it while trying out the RC.

Versions

PT 1.12 RC, TV 0.13.0

@rwightman
Contributor Author

The version string for Pillow-SIMD:

pillow-simd               9.0.0.post1              pypi_0    pypi

@vfdev-5
Collaborator

vfdev-5 commented Jun 10, 2022

We should definitely use from packaging.version import Version instead of manual parsing:

from packaging.version import Version

v = Version("9.0.0.post1")
v > Version("9.1.0")
# False

However, the main drawback with packaging is that it is not a built-in package :(
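As a side note, packaging also exposes the numeric part of the version directly, which sidesteps post-release suffixes entirely (a sketch, assuming packaging is importable):

```python
from packaging.version import Version

v = Version("9.0.0.post1")   # Pillow-SIMD-style version string
print(v.release)             # (9, 0, 0) -- numeric release segments only
print(v.is_postrelease)      # True
print(v.release >= (9, 1))   # False: a post-release of 9.0.0 still predates 9.1
```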

@rwightman
Contributor Author

rwightman commented Jun 10, 2022

@vfdev-5 I suggested in the timm PR that the user check for the existence of the type. For 9.1.1:

>>> from PIL import Image
>>> 'Resampling' in dir(Image)
True

and it returns False for 9.0.0.

@vfdev-5
Collaborator

vfdev-5 commented Jun 10, 2022

Thanks for the suggestion. I hope the Pillow folks won't move it elsewhere in 10.X :)

@adamjstewart
Contributor

PyTorch Lightning also uses packaging.version.Version: https://github.com/PyTorchLightning/pytorch-lightning/blob/master/pytorch_lightning/utilities/imports.py

@rwightman Checking for the existence of the value (in a try-except or with dir) will work, but we'll still see the warning message. I think it's better to add packaging to install_requires; it's a pretty light dependency to add.

@rwightman
Contributor Author

@adamjstewart It's an extra dep, though, since it's not built in. And why would the warning message still exist? If the type exists, you'd use the new names and never the old ones... how could it warn?

@adamjstewart
Contributor

Oh true, if you use the newer one first that should be fine. Disregard what I said, both are valid approaches.

@vfdev-5
Collaborator

vfdev-5 commented Jun 10, 2022

@adamjstewart If PyTorch brought in packaging as a dependency, then we could use it; otherwise, adding it to torchvision's requirements should be carefully validated. The solution with dir works without warnings.

@vfdev-5
Collaborator

vfdev-5 commented Jun 10, 2022

@rwightman By the way, any particular reason for using Pillow-SIMD instead of tensor images and tensor data-aug pipelines?

@rwightman
Contributor Author

@vfdev-5 Pillow-SIMD was still significantly faster the last time I checked (last fall, I think). I'm often CPU-bound in dataloader pre-processing, so it has a big impact. Doing it on the GPU eats precious GPU memory.

@rwightman
Contributor Author

rwightman commented Jun 10, 2022

It's often underrated how fast Pillow-SIMD actually is. Once you add proper filtering to OpenCV (to prevent aliasing on downsampling), Pillow-SIMD is usually faster than cv2 as well for typical pre-processing/augmentation pipelines. It's just a shame it's not portable or easy to integrate into most environments...

@vfdev-5
Collaborator

vfdev-5 commented Jun 10, 2022

@rwightman What kinds of ops do you typically do in the data-aug pipeline? Resizing, padding, cropping, maybe some radiometric transforms (e.g., color jitter)?

@rwightman
Contributor Author

rwightman commented Jun 10, 2022

@vfdev-5 Yes, those basic ones for inference, but a wider array for train-time augmentation: solarize, posterize, other LUTs, rotate, skew, other geometric ops... But even just the resize is a big one. Pillow-SIMD is heavily optimized to do many (most?) of the ops in uint8; last I looked, many of the tensor ops necessitated using float, which is a big hit.

I also try to move to the GPU whenever I can before converting to float, to keep bandwidth down.

@vfdev-5
Collaborator

vfdev-5 commented Jun 10, 2022

Yeah, doing a resize on uint8 would be great.

Right now, here are a few numbers for PIL (without SIMD) vs. torch interpolate on CPU (multithreaded) vs. torch interpolate on CUDA:
pytorch/pytorch#70930

@rwightman
Contributor Author

Cool, thanks for the numbers. I measured Pillow-SIMD to be 8x faster than Pillow (I think that was bicubic), so that would maybe put it ~3x faster than the CPU tensor numbers.

@vfdev-5
Collaborator

vfdev-5 commented Jun 10, 2022

Well, there's room for improvement in PyTorch :)

@rwightman
Contributor Author

Yup! You've been making great progress; I just need ALL the CPU, munch munch :)

@rwightman
Contributor Author

@vfdev-5 One last thing to keep in mind re: the gap and future performance characterization: the Pillow-SIMD numbers are single-threaded. I don't believe it uses any thread parallelization, so if 8 threads are pinned for those tensor numbers, that is significant... especially when you're often pinning 16-48 cores with pre-processing.

@vfdev-5
Collaborator

vfdev-5 commented Jun 10, 2022

Yes, you are right, PIL is single-threaded.

Numbers for a single thread are a bit different. Torch interpolate with AA on CPU (3-channel float32) vs. PIL RGB (uint8) are almost similar, with torch a bit slower: pytorch/pytorch#68819. So, ~8x slower than Pillow-SIMD, according to your estimates.

In the numbers I posted previously, I meant to present the difference between CUDA and multithreaded CPU.

@datumbox
Contributor

@rwightman Thanks for reporting. Is there any specific reason you propose checking in dir() instead of doing hasattr(Image, "Resampling")?

@vfdev-5 Thanks for the ultra fast fix. I left this comment on the PR. I've tested with SIMD, PIL 9.0 and 9.1 and it works. Let me know what you think, thanks!

@rwightman
Contributor Author

rwightman commented Jun 10, 2022

@datumbox No strong reason; hasattr works. I think in this case there wouldn't be any dir gotchas, as it's an enum in a module, but hasattr does have stronger guarantees re: attribute resolution in all cases.
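A compatibility shim along the lines discussed might look like the following (a sketch; the constant names and the no-Pillow fallback are illustrative, not the actual #6154 fix):

```python
try:
    from PIL import Image
except ImportError:
    # Stand-in so this sketch runs without Pillow installed;
    # real code would simply require PIL.
    import types
    Image = types.SimpleNamespace(BILINEAR=2, BICUBIC=3)

if hasattr(Image, "Resampling"):
    # Pillow >= 9.1: filters live in the Resampling enum
    BILINEAR = Image.Resampling.BILINEAR
    BICUBIC = Image.Resampling.BICUBIC
else:
    # Older Pillow and current Pillow-SIMD builds (e.g. 9.0.0.post1)
    BILINEAR = Image.BILINEAR
    BICUBIC = Image.BICUBIC
```

Because the new enum is tried first, this never touches the deprecated module-level constants on Pillow >= 9.1, so no deprecation warning is emitted.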

@t-vi
Contributor

t-vi commented Jun 11, 2022

hasattr also seems conceptually a better fit for checking a single thing, as dir creates an inventory of all symbols just to check for one...

@rwightman
Contributor Author

@t-vi Yeah, true. I forgot that dir returns a list (it was vars that returns a dict), so the membership lookup is meh; hasattr should be faster.
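The cost difference is easy to see with timeit. Illustrated here on a stdlib module rather than PIL.Image (a sketch; absolute timings will vary by machine):

```python
import math
import timeit

# dir() builds and sorts a full list of the module's names on every call,
# then the `in` test scans that list; hasattr() is a single attribute lookup.
t_dir = timeit.timeit(lambda: "sqrt" in dir(math), number=50_000)
t_hasattr = timeit.timeit(lambda: hasattr(math, "sqrt"), number=50_000)

print(f"'sqrt' in dir(math):    {t_dir:.4f}s")
print(f"hasattr(math, 'sqrt'):  {t_hasattr:.4f}s")
```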

5 participants