    Repositories list

    • The Ascend platform plugin for vLLM.
      Python
      Apache License 2.0
      4200 Updated Jan 14, 2025
    • vllm (Public)
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      5.2k000 Updated Jan 14, 2025
    • Dockerfile
      Apache License 2.0
      1412 Updated Jan 13, 2025
    • Integration testing of different accelerators with PyTorch
      Python
      BSD 3-Clause "New" or "Revised" License
      0071 Updated Dec 28, 2024
    • Tensors and Dynamic neural networks in Python with strong GPU acceleration
      Python
      Other
      23k001 Updated Dec 9, 2024
    • DeepSpeed (Public)
      DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
      Python
      Apache License 2.0
      4.2k000 Updated Nov 4, 2024
    • C++
      Other
      0380 Updated Oct 21, 2024
    • A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
      Python
      BSD 3-Clause "New" or "Revised" License
      9.6k011 Updated Sep 20, 2024
    • op-plugin (Public)
      C++
      Other
      0010 Updated Jul 27, 2024
    • FastChat (Public)
      An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
      Python
      Apache License 2.0
      4.6k000 Updated Apr 23, 2024
    • HTML
      21144 Updated Apr 2, 2024
    • .github (Public)
      0000 Updated Jul 20, 2023
    • Daily Ceph build and test on openEuler
      Apache License 2.0
      1000 Updated Jul 20, 2023
    • ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
      4100 Updated Jul 20, 2023