[Relax] Allow softmax to work on a large tensor when dimension is not the last one #17720

hugolatendresse · 2025-03-08T20:18:41Z

For large tensors with non-last dimension softmax, we transpose to move the softmax dimension to the end, apply softmax, and then transpose back to the original shape.

Since the issue here that using softmax on non-last dimension could cause python/tvm/dlight/gpu/general_reduction.py to create arrays that are too big for the GPU shared memory, I tried to address this TODO by making changes to general_reduction.py, without success. However, as I was experimenting, I added a suggested handling for the case where num_leading_s = 0 in general_reduction.py. I thought I might as well leave that in the PR.

cc: @MasterJH5574

Edit: we may not merge this at all because it's better to fix the reduction directly, and the fix in this PR may simply be extra overhead

…tting and transposing

tqchen · 2025-03-12T13:21:07Z

let us instead to work and allow dlight to work correctly for non-last dimension cases

hugolatendresse · 2025-03-12T13:43:21Z

let us instead to work and allow dlight to work correctly for non-last dimension cases

Sounds good, closing the PR

MasterJH5574 · 2025-03-16T21:32:37Z

Fixed in #17754

hugolatendresse added 4 commits March 8, 2025 15:09

Handle case where num_leading_s = 0 in general reduction

4a786cb

handle large softmax on non-last dimension for large tensors by permu…

1cb02d0

…tting and transposing

unit test: test_softmax_non_last_dim_large_tensor

550472b

test from exported to cuda

0778b3b

hugolatendresse marked this pull request as ready for review March 8, 2025 21:21

hugolatendresse changed the title ~~Allow softmax to work on a large tensor when dimension is not the last one~~ [Relax] Allow softmax to work on a large tensor when dimension is not the last one Mar 8, 2025

hugolatendresse added 3 commits March 9, 2025 17:44

Black formatter

7c08976

updated cuda test syntax and ran Black Python formatter with version 22

1567fc5

remove void try..except from fx graph translator

fe071c4

hugolatendresse marked this pull request as draft March 10, 2025 19:04

hugolatendresse closed this Mar 12, 2025

hugolatendresse deleted the fix_softmax_not_last_dim branch May 4, 2025 19:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Relax] Allow softmax to work on a large tensor when dimension is not the last one #17720

[Relax] Allow softmax to work on a large tensor when dimension is not the last one #17720

Uh oh!

hugolatendresse commented Mar 8, 2025 •

edited

Loading

Uh oh!

tqchen commented Mar 12, 2025 •

edited

Loading

Uh oh!

hugolatendresse commented Mar 12, 2025

Uh oh!

MasterJH5574 commented Mar 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[Relax] Allow softmax to work on a large tensor when dimension is not the last one #17720

[Relax] Allow softmax to work on a large tensor when dimension is not the last one #17720

Uh oh!

Conversation

hugolatendresse commented Mar 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tqchen commented Mar 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

hugolatendresse commented Mar 12, 2025

Uh oh!

MasterJH5574 commented Mar 16, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

hugolatendresse commented Mar 8, 2025 •

edited

Loading

tqchen commented Mar 12, 2025 •

edited

Loading