Add gather and scatter_add strategies #81
Conversation
They were taken from #29.
Would be good to have those in PyTorch, but I've seen that `gather` was useful for `CrossEntropyLoss` as well, so probably better to unblock first.
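For context, a minimal sketch (not from the PR) of why a `gather` sharding strategy matters for `CrossEntropyLoss`: the loss can be written as a log-softmax followed by a `gather` along the class dimension, so sharding support for `gather` unblocks that pattern. Shapes and names below are illustrative only.

```python
import torch
import torch.nn.functional as F

logits = torch.randn(4, 10)
target = torch.randint(0, 10, (4,))

# Cross-entropy expressed via gather along the class dim; this is the access
# pattern that makes a gather sharding strategy useful for CrossEntropyLoss.
log_probs = F.log_softmax(logits, dim=1)
loss = -log_probs.gather(1, target.unsqueeze(1)).squeeze(1).mean()

assert torch.allclose(loss, F.cross_entropy(logits, target))
```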
```python
single_mesh_dim_strategies = []

# placement list stores placements of [output, input, index]
```
nit: [output, input, index, src]
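For reference, a hedged sketch of the four-slot convention this nit refers to: each strategy is a `PlacementList` ordered as `[output, input, index, src]`. The variable names and import path below are assumptions for illustration, not the PR's actual code.

```python
# Hypothetical illustration of the [output, input, index, src] placement order.
from torch.distributed.tensor import Replicate  # public DTensor placements in recent PyTorch

single_mesh_dim_strategies = []

# Fully replicated strategy: always valid, serves as a fallback.
all_replicate = [Replicate(), Replicate(), Replicate(), Replicate()]
single_mesh_dim_strategies.append(all_replicate)
```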
| """ | ||
| # index sharding, input replicated, index sharded, output follows index | ||
| # this only works when the sharding dimension is the gather dimension | ||
| index_sharding: PlacementList = [Shard(dim), Replicate(), Shard(dim), Shard(dim)] |
I feel this may not be correct. Taking the example from https://docs.pytorch.org/docs/stable/generated/torch.Tensor.scatter_add_.html#torch.Tensor.scatter_add_:

```python
>>> src = torch.ones((2, 5))
>>> index = torch.tensor([[0, 1, 2, 0, 0], [0, 1, 2, 2, 2]])
>>> torch.zeros(3, 5, dtype=src.dtype).scatter_add_(0, index, src)
tensor([[2., 0., 0., 1., 1.],
        [0., 2., 0., 0., 0.],
        [0., 0., 2., 1., 1.]])
```

the output can become Partial as: `[Partial(), Replicate(), Shard(dim), Shard(dim)]`
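To make the Partial argument concrete, here is a minimal sketch (plain torch, two simulated ranks, following the zero-initialized example above rather than the PR's DTensor code): sharding `index` and `src` along `dim` and running the local scatter_add on each rank produces partial results whose elementwise sum equals the full result, which is exactly the `Partial()` output placement.

```python
import torch

dim = 0
src = torch.ones((2, 5))
index = torch.tensor([[0, 1, 2, 0, 0], [0, 1, 2, 2, 2]])

full = torch.zeros(3, 5).scatter_add_(dim, index, src)

# "Shard" index and src along dim across two simulated ranks; the input stays a
# replicated zero tensor, so each local output only holds that rank's contribution.
partials = []
for rank in range(2):
    local_index = index[rank : rank + 1]  # Shard(0) of index
    local_src = src[rank : rank + 1]      # Shard(0) of src
    partials.append(torch.zeros(3, 5).scatter_add_(dim, local_index, local_src))

# Summing the per-rank outputs (the Partial() reduction) recovers the full result.
assert torch.equal(sum(partials), full)
```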
Yes, thanks for the review! I had roughly copy-pasted the gather rule and didn't fix this part. Will adapt it shortly.
Also, do you think we could have this implemented natively in PyTorch?
Yes! The current upstream scatter_add strategy is just a quick workaround. We can follow up.
zpcore left a comment:
I think scatter_add may produce incorrect output.
```python
if len(input_shape) == len(index_shape):
    for d in range(len(input_shape)):
        if d != dim:
            sharding = [Shard(d), Shard(d), Shard(d), Shard(d)]
```
I tried more tests and noticed that with `[Shard(d), Shard(d), Shard(d), Shard(d)]`, we can't simply shard the output and input. E.g., if dim = 1 and we want to shard on d = 0, the input can have many more rows than the index, so we would most likely only modify the first shard of the input, because input rows and index rows map one to one.
Correct me if I'm wrong, but I thought the shapes needed to match except on dim?
Right, the shapes need to match. I changed it to `if d != dim and input_shape[d] == index_shape[d]:` and the op coverage tests pass now. I created the PR with the update here: pytorch/pytorch#160140.
Oh that's true, I definitely missed that case!
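For anyone following along, a minimal sketch (plain torch, two simulated ranks; shapes chosen for illustration, not taken from the PR) of the failure mode discussed in this thread: with `dim = 1` and sharding on `d = 0`, if the input has more rows than the index, naively halving every tensor along `d = 0` breaks the one-to-one row correspondence between input and index.

```python
import torch

dim = 1
inp = torch.zeros(4, 3)                        # 4 rows
index = torch.tensor([[0, 1, 2], [2, 1, 0]])   # only 2 rows
src = torch.ones(2, 3)

expected = inp.clone().scatter_add_(dim, index, src)  # only input rows 0 and 1 are touched

# Naive Shard(0) on everything: input rows [0,1] | [2,3], index/src rows [0] | [1].
shards = []
for rank in range(2):
    local_inp = inp[2 * rank : 2 * rank + 2].clone()
    local_index = index[rank : rank + 1]
    local_src = src[rank : rank + 1]
    shards.append(local_inp.scatter_add_(dim, local_index, local_src))

naive = torch.cat(shards, dim=0)
# False: rank 1 applied index row 1 to global input row 2 instead of row 1,
# which is why the extra input_shape[d] == index_shape[d] check is needed.
print(torch.equal(naive, expected))
```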
Subsumed by pytorch/pytorch#160140
As title. This PR made a small fix on top of meta-pytorch/autoparallel#81. Pull Request resolved: #160140 Approved by: https://github.com/fmassa