Imported from GitHub PR #21683
Requires #20395 which adds the NVSHMEM library dependency.
This PR adds the following:
1. Adds an NVSHMEM flag to enable NVSHMEM.
2. Sets the NVSHMEM initialization info when the GPU PJRT client is created. NVSHMEM itself is initialized the first time it is used.
3. Uses the user buffer memory pool for NVSHMEM. If NVSHMEM is enabled, the pool is allocated using `nvshmem_malloc`. The same memory can be used by user buffers if NCCL user buffers are also enabled.
4. Updates the `CollectiveColorer` so that mosaic_gpu custom calls use the NVSHMEM memory space.
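The flow in items 2 and 3 (record the initialization info at client creation, initialize on first use, then allocate the pool through NVSHMEM when the flag is on) can be sketched roughly as follows. This is an illustrative model only: `LazyNvshmem`, `SetEnvInfo`, `EnsureInitialized`, and `AllocateFromPool` are hypothetical names, not the XLA or NVSHMEM API, and `std::malloc` stands in for `nvshmem_malloc`.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdlib>

// Hypothetical model of lazy NVSHMEM setup; none of these names come from XLA.
class LazyNvshmem {
 public:
  // Called when the client is created: only records how to initialize later.
  void SetEnvInfo(int process_id, int num_processes) {
    process_id_ = process_id;
    num_processes_ = num_processes;
  }

  // Called on first use: runs the (stubbed) initialization at most once.
  // Not thread-safe; a real implementation would need synchronization.
  void EnsureInitialized() {
    if (!initialized_) {
      initialized_ = true;
      ++init_count_;  // Stand-in for the real library initialization.
    }
  }

  int init_count() const { return init_count_; }

 private:
  int process_id_ = 0;
  int num_processes_ = 1;
  bool initialized_ = false;
  int init_count_ = 0;
};

// Allocates backing memory for the pool. With the flag on, initialization is
// triggered first and the allocation would go through nvshmem_malloc;
// std::malloc is used here as a stand-in for both paths.
void* AllocateFromPool(LazyNvshmem& nvshmem, bool nvshmem_enabled,
                       std::size_t bytes) {
  assert(bytes > 0);
  if (nvshmem_enabled) {
    nvshmem.EnsureInitialized();
    return std::malloc(bytes);  // Assumption: nvshmem_malloc in real code.
  }
  return std::malloc(bytes);  // Default (non-NVSHMEM) allocation path.
}
```

In this sketch, creating the client costs nothing beyond storing two integers, and repeated allocations with the flag enabled trigger the initialization step only once.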
Copybara import of the project:
--
aee3379 by Trevor Morris <tmorris@nvidia.com>:
Add nvshmem flag, memory allocation, and memory space assignment
Set Nvshmem env info during client creation
Rename flag and use absl::string_view
--
f8fca39 by Trevor Morris <tmorris@nvidia.com>:
Use explicit types in test
--
e41faa3 by Trevor Morris <tmorris@nvidia.com>:
Add user buffer allgather and allreduce tests with and without nvshmem alloc
Set nvshmem in XLA_FLAGS
test fixes
formatting
--
cf0c368 by Trevor Morris <tmorris@nvidia.com>:
Fixes
--
3b4d111 by Trevor Morris <tmorris@nvidia.com>:
Remove early dso check
--
359f2b2 by Trevor Morris <tmorris@nvidia.com>:
Add flag comment
--
fd15a7c by Trevor Morris <tmorris@nvidia.com>:
Also assign memory space for mosaic_gpu_v2
Merging this change closes #21683
COPYBARA_INTEGRATE_REVIEW=#21683 from trevor-m:nvshmem-upstream-2 fd15a7c
PiperOrigin-RevId: 747816712