Explicitly name the allocgroups on GPU schedules "allocgroup__..." #7883

mcourteaux · 2023-10-06T19:11:39Z

I was flabbergasted when inspecting the schedule using the conceptual_stmt of a GPU schedule. The loads and stores didn't make any sense to me. After digging through the lowering passes, I found out about the AllocGroups and the clustering in the FuseGPUThreadLoops pass.

I prepended "allocgroup_" to the name of the Allocate node now, to make that clearer in the Stmt.

I still have another concern with this, and don't know what to do about it. After working on a schedule for a while, the names of these AllocGroups get really long, like:

allocate allocgroup__blurring_filters_noisy_im_in_denoise_conv_noisy$0.0_exp_logit.4__clamped_noisy_in_denoise_conv_noisy$0.1_filter_response_noisy_global_wrapper$0.3_sum_exp_logit_global_wrapper$0.6__denoise_conv_noisy_global_wrapper$0.2[float32 * (t1386 + 768)] in GPUShared

Which is anything but readable, especially in code like this:

I was thinking that it might be useful to just rename the allocations to allocgroup_0, allocgroup_1, etc... and somehow keep the list of shared allocations in this allocation group as meta information of the Allocate IR node, which can then be used to produce Stmt code with comments on what is contained within this grouped alloc.

abadams · 2023-10-09T19:07:03Z

Is this ready to merge? (It's still marked as a draft). IMO even if a better solution comes along for very long merged allocation names we should merge it because it's a small PR that makes a positive change.

mcourteaux · 2023-10-09T19:08:18Z

Okay, go ahead! Marked as ready.

mcourteaux · 2023-10-09T19:10:42Z

Wait, actually... I discovered today that it still prepends "allocgroup_" even if the group is of size 1. I'll see if I can squeeze that out.

…ains more than 1 allocation prepend the prefix.

mcourteaux · 2023-10-09T19:40:50Z

Okay, I did it. I was a little surprised by the double layered hierarchy of grouping: SharedAllocations go into a AllocGroup, which are then again clusterd into the clustered_allocs std::map. However, I think I did it right.

abadams · 2023-10-09T19:52:13Z

Here's the reason for the double-layered hierarchy (IIRC): Sets of allocations with non-overlapping lifetime go into an AllocGroup, the size of which is the max across all the allocations in the group. Then those groups are concatenated into a single big allocation, the size of which is the sum across all the groups.

mcourteaux · 2023-10-12T18:29:14Z

Are we merging this? It's ready, if you ask me. 😄

…alide#7883) * 50cents readibility improvement to allocgroups on GPU schedules. * Improve allocation group prefix: only if the alloc group cluster contains more than 1 allocation prepend the prefix.

abadams requested a review from halidebuildbots October 6, 2023 19:25

abadams approved these changes Oct 9, 2023

View reviewed changes

mcourteaux marked this pull request as ready for review October 9, 2023 19:07

mcourteaux marked this pull request as draft October 9, 2023 19:10

mcourteaux added 2 commits October 9, 2023 21:20

50cents readibility improvement to allocgroups on GPU schedules.

aaba1c3

Improve allocation group prefix: only if the alloc group cluster cont…

d9559db

…ains more than 1 allocation prepend the prefix.

mcourteaux force-pushed the clearer-allocgroups branch from 92339f2 to d9559db Compare October 9, 2023 19:36

mcourteaux marked this pull request as ready for review October 9, 2023 19:38

mcourteaux requested a review from abadams October 9, 2023 19:48

abadams approved these changes Oct 12, 2023

View reviewed changes

abadams merged commit a3911bb into halide:main Oct 12, 2023
17 of 19 checks passed

BrewTestBot mentioned this pull request Feb 2, 2024

halide 17.0.0 Homebrew/homebrew-core#161602

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Explicitly name the allocgroups on GPU schedules "allocgroup__..." #7883

Explicitly name the allocgroups on GPU schedules "allocgroup__..." #7883

mcourteaux commented Oct 6, 2023

abadams commented Oct 9, 2023

mcourteaux commented Oct 9, 2023

mcourteaux commented Oct 9, 2023

mcourteaux commented Oct 9, 2023 •

edited

Loading

abadams commented Oct 9, 2023

mcourteaux commented Oct 12, 2023

Explicitly name the allocgroups on GPU schedules "allocgroup__..." #7883

Explicitly name the allocgroups on GPU schedules "allocgroup__..." #7883

Conversation

mcourteaux commented Oct 6, 2023

abadams commented Oct 9, 2023

mcourteaux commented Oct 9, 2023

mcourteaux commented Oct 9, 2023

mcourteaux commented Oct 9, 2023 • edited Loading

abadams commented Oct 9, 2023

mcourteaux commented Oct 12, 2023

mcourteaux commented Oct 9, 2023 •

edited

Loading