Speed up starting compression #9169

bdraco · 2024-09-17T20:01:45Z

What do these changes do?

Enumerating a enum and accessing all the .value is not performant. Switching to a pre-built dict is significantly faster

Are there changes in behavior for the user?

no

Is it a substantial burden for the maintainers to support this?

no

related issue #2779

before

after

Enumerating a enum and accessing all the .value is not performant. Switching to a dict is significantly faster

codecov · 2024-09-17T20:11:29Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 98.24%. Comparing base (98b363e) to head (03afd40).

✅ All tests successful. No failed tests found.

Additional details and impacted files

@@            Coverage Diff             @@
##           master    #9169      +/-   ##
==========================================
- Coverage   98.31%   98.24%   -0.07%     
==========================================
  Files         107      107              
  Lines       34485    34486       +1     
  Branches     4095     4096       +1     
==========================================
- Hits        33903    33882      -21     
- Misses        411      429      +18     
- Partials      171      175       +4

Flag	Coverage Δ
CI-GHA	`98.15% <100.00%> (-0.06%)`	⬇️
OS-Linux	`97.81% <100.00%> (-0.06%)`	⬇️
OS-Windows	`96.28% <100.00%> (+<0.01%)`	⬆️
OS-macOS	`97.54% <100.00%> (+<0.01%)`	⬆️
Py-3.10.11	`97.64% <100.00%> (+<0.01%)`	⬆️
Py-3.10.14	`97.49% <100.00%> (-0.09%)`	⬇️
Py-3.10.15	`97.39% <100.00%> (?)`
Py-3.11.9	`97.81% <100.00%> (+<0.01%)`	⬆️
Py-3.12.5	`97.58% <100.00%> (+<0.01%)`	⬆️
Py-3.12.6	`97.64% <100.00%> (+<0.01%)`	⬆️
Py-3.9.13	`97.53% <100.00%> (+<0.01%)`	⬆️
Py-3.9.19	`97.47% <100.00%> (+<0.01%)`	⬆️
Py-pypy7.3.16	`?`
VM-macos	`97.54% <100.00%> (+<0.01%)`	⬆️
VM-ubuntu	`97.81% <100.00%> (-0.06%)`	⬇️
VM-windows	`96.28% <100.00%> (+<0.01%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

Dreamsorcerer · 2024-09-18T11:41:39Z

aiohttp/web_response.py

- for coding in ContentCoding:
- if coding.value in accept_encoding:
+ for value, coding in CONTENT_CODINGS.items():
+ if value in accept_encoding:


I'm struggling with this one. It looks to me like all we've saved is one attribute access...

With the dict, I was expecting a reversal of logic or something here, like for enc in accept_encoding: CONTENT_CODINGS.get(enc).

Iterating enums in python is surprisingly expensive and good chunk of its the __get__. _start_compression was the most expensive call in _prepare_headers before:

We save:

creating the iterator for ContentCoding enum (although it might be an even exchange to iterate .items(), but thats all going to be in native code for sure)

creating a enum singleton for each one in the loop

accessing the value which does some magic under the hood (its not a simple attribute)

... maybe we can save more here but I was thinking the in 3x was cheaper than a dict lookup

Zoomed in on _prepare_headers before the change:

Zoomed in on _start_compression before the change:

a dict get might make sense if we parsed the string but parsing is probably more expensive Accept-Encoding: deflate, gzip;q=1.0, *;q=0.5

Speed up starting compression

c85f1e4

Enumerating a enum and accessing all the .value is not performant. Switching to a dict is significantly faster

bdraco added backport-3.10 Trigger automatic backporting to the 3.10 release branch by Patchback robot backport-3.11 Trigger automatic backporting to the 3.11 release branch by Patchback robot labels Sep 17, 2024

changelog

46c29db

psf-chronographer bot added the bot:chronographer:provided There is a change note present in this PR label Sep 18, 2024

bdraco marked this pull request as ready for review September 18, 2024 05:51

bdraco requested review from webknjaz and asvetlov as code owners September 18, 2024 05:51

Dreamsorcerer reviewed Sep 18, 2024

View reviewed changes

bdraco mentioned this pull request Sep 18, 2024

Add a cache to must_be_empty_body #9174

Merged

bdraco added 2 commits September 18, 2024 16:39

Merge branch 'master' into start_compression_enum

3d7dd4b

symlink

03afd40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Speed up starting compression #9169

Speed up starting compression #9169

bdraco commented Sep 17, 2024 •

edited

Loading

codecov bot commented Sep 17, 2024 •

edited

Loading

Dreamsorcerer Sep 18, 2024

bdraco Sep 18, 2024 •

edited

Loading

bdraco Sep 18, 2024

Speed up starting compression #9169

Are you sure you want to change the base?

Speed up starting compression #9169

Conversation

bdraco commented Sep 17, 2024 • edited Loading

What do these changes do?

Are there changes in behavior for the user?

Is it a substantial burden for the maintainers to support this?

codecov bot commented Sep 17, 2024 • edited Loading

Codecov Report

Dreamsorcerer Sep 18, 2024

Choose a reason for hiding this comment

bdraco Sep 18, 2024 • edited Loading

Choose a reason for hiding this comment

bdraco Sep 18, 2024

Choose a reason for hiding this comment

bdraco commented Sep 17, 2024 •

edited

Loading

codecov bot commented Sep 17, 2024 •

edited

Loading

bdraco Sep 18, 2024 •

edited

Loading