Repromote from shared to private memory #554

ftynse · 2018-07-05T12:47:28Z

If a tensor reference group is promoted to shared memory at some scope, it may be interesting to promote it to registers at some deeper scope. There are two possibilities:

promote to registers instead of promoting to shared (freeing the shared memory for other uses or for increased occupancy);
promote to registers from shared, hiding global access latency and/or having more coalescing when copying from global to shared.

#161 and #217 attempted this behavior; first by demoting from shared memory, then by promoting from shared to private. Demotion from shared was mostly harmful, principally because promotion to registers was too deep and rarely beneficial by itself. The effect may be different with tunable promotion depth, so we can start by having this behavior controlled by a flag.

ftynse added polyhedral enhancement New feature or request labels Jul 5, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repromote from shared to private memory #554

Repromote from shared to private memory #554

ftynse commented Jul 5, 2018

Repromote from shared to private memory #554

Repromote from shared to private memory #554

Comments

ftynse commented Jul 5, 2018