Develop a UCCL plugin for NIXL #38

@praveingk

Description

During inference, we see the following scenarios:

  1. NIXL/UCX is used for P2P communication in prefill/decode (PD) disaggregation (KV cache transfers).
  2. NVSHMEM is used for all-to-all communication in MoE (https://github.com/ppl-ai/pplx-kernels). However, the underlying transport is UCX.

UCCL's transport optimizations could apply to both of these scenarios.

As an enhancement, we would like to explore how UCCL can be integrated into the inference transport stack.

One solution could be to develop a UCX transport (UCT) plugin backed by UCCL, which would cover both P2P and all-to-all communication for inference.
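
To illustrate why one transport-level plugin could serve both scenarios, here is a minimal sketch in Python. It is not UCX, NIXL, or UCCL code: `Transport`, `LoopbackTransport`, `p2p_transfer`, and `all_to_all` are hypothetical names, and the in-memory queue only stands in for where a real plugin would call into UCCL. The point is that both the P2P path and the all-to-all path reduce to the same `send`/`recv` primitives, so swapping the backend underneath affects both.

```python
from abc import ABC, abstractmethod
from collections import defaultdict


class Transport(ABC):
    """Hypothetical pluggable transport, loosely analogous to a UCT-style backend."""

    @abstractmethod
    def send(self, src: int, dst: int, payload: bytes) -> None: ...

    @abstractmethod
    def recv(self, src: int, dst: int) -> bytes: ...


class LoopbackTransport(Transport):
    """In-memory placeholder (assumption): a real plugin would call into UCCL here."""

    def __init__(self) -> None:
        # One FIFO queue per (source rank, destination rank) pair.
        self._queues = defaultdict(list)

    def send(self, src: int, dst: int, payload: bytes) -> None:
        self._queues[(src, dst)].append(payload)

    def recv(self, src: int, dst: int) -> bytes:
        return self._queues[(src, dst)].pop(0)


def p2p_transfer(t: Transport, src: int, dst: int, kv_block: bytes) -> bytes:
    """Scenario 1: a single point-to-point transfer, e.g. a KV cache block."""
    t.send(src, dst, kv_block)
    return t.recv(src, dst)


def all_to_all(t: Transport, chunks: list) -> list:
    """Scenario 2: chunks[i][j] is what rank i sends to rank j.

    Returns out, where out[j][i] is what rank j received from rank i.
    """
    n = len(chunks)
    for i in range(n):
        for j in range(n):
            t.send(i, j, chunks[i][j])
    return [[t.recv(i, j) for i in range(n)] for j in range(n)]
```

In this sketch, replacing `LoopbackTransport` with a different `Transport` subclass changes the data path for both functions without touching their callers, which is the property the UCT-plugin approach is after.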

Labels: enhancement