Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add accelerator-awareness to most allreduce implementations #12611

Open
wants to merge 1 commit into
base: main
Choose a base branch
from

Commits on Jun 10, 2024

  1. Add accelerator-awareness to most allreduce implementations

    Adding accelerator-awareness requires allocation of temporary memory
    on the a device selected based on the input buffer
    (possibly cached by the allocator) and reintroduces
    the use of 3buff reductions to combine copy and operator application.
    This change also improves performance on CPU for larger operations.
    Where possible
    
    Signed-off-by: Joseph Schuchart <joseph.schuchart@stonybrook.edu>
    devreal committed Jun 10, 2024
    Configuration menu
    Copy the full SHA
    e823e51 View commit details
    Browse the repository at this point in the history