Hi, I am getting the error below when running the recommendation on multiple GPUs. I have already set gpu_id in the config.
  File "/home/ubuntu/recommender/.venv/lib/python3.12/site-packages/torch/utils/data/distributed.py", line 77, in __init__
    num_replicas = dist.get_world_size()
                   ^^^^^^^^^^^^^^^^^^^^^
  File "/home/ubuntu/recommender/.venv/lib/python3.12/site-packages/torch/distributed/distributed_c10d.py", line 2020, in get_world_size
    return _get_group_size(group)
           ^^^^^^^^^^^^^^^^^^^^^^
  File "/home/ubuntu/recommender/.venv/lib/python3.12/site-packages/torch/distributed/distributed_c10d.py", line 986, in _get_group_size
    default_pg = _get_default_group()
                 ^^^^^^^^^^^^^^^^^^^^
  File "/home/ubuntu/recommender/.venv/lib/python3.12/site-packages/torch/distributed/distributed_c10d.py", line 1150, in _get_default_group
    raise ValueError(
ValueError: Default process group has not been initialized, please make sure to call init_process_group.
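For context, the traceback shows `DistributedSampler` calling `dist.get_world_size()` before any process group exists. The sketch below reproduces that condition in isolation and shows one possible way around it: initializing the default group before building the sampler. It is not the recommender's own launch code. The helper name `build_sampler`, the `gloo` backend, and the fallback address and port are assumptions chosen so the example runs as a single CPU process; a real multi-GPU run would normally get `RANK`, `WORLD_SIZE`, `MASTER_ADDR`, and `MASTER_PORT` from its launcher.

```python
import os

import torch
import torch.distributed as dist
from torch.utils.data import TensorDataset
from torch.utils.data.distributed import DistributedSampler


def build_sampler(dataset):
    """Hypothetical helper: make sure a default process group exists,
    then construct the DistributedSampler."""
    if not dist.is_initialized():
        # Assumed single-process fallbacks; a distributed launcher
        # would normally export all four of these variables.
        os.environ.setdefault("MASTER_ADDR", "127.0.0.1")
        os.environ.setdefault("MASTER_PORT", "29500")
        rank = int(os.environ.get("RANK", "0"))
        world_size = int(os.environ.get("WORLD_SIZE", "1"))
        # "gloo" is used here only because it works without GPUs.
        dist.init_process_group(backend="gloo", rank=rank, world_size=world_size)
    return DistributedSampler(dataset, shuffle=False)


dataset = TensorDataset(torch.arange(8))

# Without an initialized group, the sampler cannot look up the world size.
try:
    DistributedSampler(dataset)
except (ValueError, RuntimeError) as err:
    print("before init:", err)

# With the group initialized first, the same constructor succeeds.
sampler = build_sampler(dataset)
print("replicas:", sampler.num_replicas, "samples per replica:", len(sampler))

dist.destroy_process_group()
```

If this reading is right, setting gpu_id alone would not be enough: each process still has to call `init_process_group` (directly or through the framework's distributed entry point) before the data loader is created.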