Support tuple of tensors in estimate_strategy_runtime_cost #102

fmassa · 2025-08-21T09:51:38Z

Previously, if a function took a tuple of tensors as an argument, we wouldn't apply any sharding to it. This is split from #26, where I originally found this issue.

wconstab · 2025-08-21T18:12:16Z

autoparallel/compute_estimation.py

    args = tree_map_only(torch.fx.Node, lambda x: x.meta["val"], node.args)
    kwargs = tree_map_only(torch.fx.Node, lambda x: x.meta["val"], node.kwargs)

-    fake_mode = torch._guards.detect_fake_mode(args)


is this bc we are already under a fake mode now, but we weren't in the initial autop?

fmassa · 2025-08-21

Yes, that's right: the whole of AutoParallel now runs under fake mode, so we can remove it.

xmfan · 2025-08-21T18:12:56Z

autoparallel/compute_estimation.py

    args = tree_map_only(torch.fx.Node, lambda x: x.meta["val"], node.args)
    kwargs = tree_map_only(torch.fx.Node, lambda x: x.meta["val"], node.kwargs)

-    fake_mode = torch._guards.detect_fake_mode(args)


this is removed because we're already running this in a fake mode?

wconstab · 2025-08-21T18:13:06Z

autoparallel/compute_estimation.py

        _get_sharded_shape_stride(spec) for spec in strategy.input_specs
    )

+    flat_args, treespec = tree_flatten(args)


shouldn't we just call tree_map_only(Tensor, torch.empty) here instead of doing the for loop?

fmassa · 2025-08-21

We need to get the size from `args_sizes_strides` (which comes from the spec) rather than from the tensor itself, so I think we need this indirection.

But if there is a cleaner way of doing this I'm happy to change the code!
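To make the indirection concrete, here is a rough sketch of the flatten, substitute, unflatten pattern under discussion. It is not the PR's implementation: `rebuild_with_sharded_shapes`, the `sizes_strides` argument, and the example shapes are hypothetical, and the sketch assumes one `(size, stride)` pair per tensor leaf, in flattening order. A plain `tree_map_only(Tensor, ...)` callback only sees the tensor itself, so it has no direct way to pick up a size that lives in a separate list.

```python
import torch
from torch.utils._pytree import tree_flatten, tree_unflatten


def rebuild_with_sharded_shapes(args, sizes_strides):
    """Replace every tensor leaf in `args` with an empty tensor whose
    size/stride come from `sizes_strides` (hypothetical helper)."""
    flat_args, treespec = tree_flatten(args)
    pairs = iter(sizes_strides)
    new_flat = []
    for arg in flat_args:
        if isinstance(arg, torch.Tensor):
            # The shape comes from the external list, not from `arg`.
            size, stride = next(pairs)
            arg = torch.empty_strided(
                size, stride, dtype=arg.dtype, device=arg.device
            )
        new_flat.append(arg)
    # Restore the original nesting (e.g. a tuple of tensors).
    return tree_unflatten(new_flat, treespec)


# A tuple of tensors nested inside the args, plus a non-tensor leaf.
# The "meta" device avoids allocating real memory.
args = ((torch.empty(8, 4, device="meta"), torch.empty(8, 6, device="meta")), 1)
sharded = rebuild_with_sharded_shapes(
    args, [((4, 4), (4, 1)), ((4, 6), (6, 1))]
)
```

Flattening first means nested containers such as tuples of tensors are handled the same way as top-level tensor arguments, and non-tensor leaves pass through untouched.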

Support tuple of tensors in estimate_strategy_runtime_cost

6ca8611

fmassa requested review from bdhirsh and wconstab August 21, 2025 09:51

meta-cla bot added the CLA Signed label Aug 21, 2025

Fix bad copy-paste

0a4ba18

xmfan approved these changes Aug 21, 2025

wconstab reviewed Aug 21, 2025

xmfan reviewed Aug 21, 2025

wconstab reviewed Aug 21, 2025

wconstab approved these changes Aug 21, 2025

fmassa merged commit 20e53b0 into main Aug 21, 2025
6 checks passed

fmassa deleted the fmassa/improve_flop_estimation branch August 21, 2025 18:19

4 participants
