Skip to content
@ParCIS

ParCIS Lab, BUPT

Parallel Computing and Intelligent Systems Laboratory (ParCIS Lab), Beijing University of Posts and Telecommunications

Popular repositories Loading

  1. Magicube Magicube Public

    Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.

    C++ 88 17

  2. Chimera Chimera Public

    Chimera: bidirectional pipeline parallelism for efficiently training large-scale models.

    Python 63 8

  3. Ok-Topk Ok-Topk Public

    Ok-Topk is a scheme for distributed training with sparse gradients. Ok-Topk integrates a novel sparse allreduce algorithm (less than 6k communication volume which is asymptotically optimal) with th…

    Python 26 9

  4. FlashSparse FlashSparse Public

    FlashSparse significantly reduces the computation redundancy for unstructured sparsity (for SpMM and SDDMM) on Tensor Cores through a Swap-and-Transpose mapping strategy. FlashSparse is accepted by…

    Cuda 19 3

  5. DNN-cpp-proxies DNN-cpp-proxies Public

    C++/MPI proxies for distributed training of deep neural networks.

    C++ 1

Repositories

Showing 5 of 5 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…