FSDP sharding_strategy parameter fails with KeyError when passed as string #809

Open

junuMoon opened this issue Dec 5, 2024 · 1 comment

junuMoon commented Dec 5, 2024

System Info

PyTorch version: 2.1.0
Python version: 3.10
OS: Ubuntu 22.04

Information

  • The official example scripts
  • My own modified scripts

🐛 Describe the bug

When sharding_strategy is passed to the FSDP config as a command line argument, FSDP initialization fails with a KeyError because the string value is never converted to the corresponding ShardingStrategy enum member.

Running with --fsdp_config.sharding_strategy "FULL_SHARD" results in:

KeyError: 'FULL_SHARD'
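
For reference, SHARDING_STRATEGY_MAP in torch's _init_utils.py appears to be keyed by ShardingStrategy enum members, so indexing it with the raw string can never succeed. A minimal illustration of the mismatch (plain Python, no FSDP wrapping needed):

    from torch.distributed.fsdp import ShardingStrategy

    strategy = "FULL_SHARD"                                 # what arrives from the command line
    print(strategy in [s.name for s in ShardingStrategy])   # True  -> it matches a member *name*
    print(strategy in list(ShardingStrategy))               # False -> it is not a member, hence the KeyError
    print(ShardingStrategy[strategy])                       # ShardingStrategy.FULL_SHARD after conversion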

Error logs

Traceback (most recent call last):
  File "finetuning.py", line 272, in main
    model = FSDP(
  ...
  File "torch/distributed/fsdp/_init_utils.py", line 652, in _init_param_handle_from_params
    SHARDING_STRATEGY_MAP[state.sharding_strategy],
KeyError: 'FULL_SHARD'

Expected behavior

The string value "FULL_SHARD" should be converted to the ShardingStrategy.FULL_SHARD enum member when the config is updated.
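
A minimal sketch of the conversion the config-update step could perform before FSDP is constructed (normalize_sharding_strategy is a hypothetical helper, not an existing function in this repo):

    from torch.distributed.fsdp import ShardingStrategy

    def normalize_sharding_strategy(value):
        """Accept either a ShardingStrategy member or its name as a string (hypothetical helper)."""
        if isinstance(value, ShardingStrategy):
            return value
        try:
            return ShardingStrategy[str(value).upper()]  # "FULL_SHARD" -> ShardingStrategy.FULL_SHARD
        except KeyError:
            valid = ", ".join(s.name for s in ShardingStrategy)
            raise ValueError(f"Unknown sharding_strategy {value!r}; expected one of: {valid}") from None

    # e.g. in the config update, before the FSDP(...) call in finetuning.py:
    # fsdp_config.sharding_strategy = normalize_sharding_strategy(fsdp_config.sharding_strategy)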

vz-2244 commented Dec 21, 2024

Same issue here. Any suggestions on how to change the FSDP config from the command line?
