-
Fuser Public
Forked from NVIDIA/Fuser
A Fusion Code Generator for NVIDIA GPUs (commonly known as "nvFuser")
C++ Other Updated Dec 16, 2024 -
chakra Public
Forked from mlcommons/chakra
Repository for MLCommons Chakra schema and tools
Python Apache License 2.0 Updated Sep 21, 2024 -
lightning-thunder Public
Forked from Lightning-AI/lightning-thunder
Make PyTorch models up to 40% faster! Thunder is a source to source compiler for PyTorch. It enables using different hardware executors at once; across one or thousands of GPUs.
Python Apache License 2.0 Updated Aug 19, 2024 -
sccl Public
Forked from microsoft/msccl-tools
Synthesizer for optimal collective communication algorithms
Python MIT License Updated Jul 10, 2024 -
ucc Public
Forked from openucx/ucc
Unified Collective Communication Library
C BSD 3-Clause "New" or "Revised" License Updated Nov 4, 2023 -
pytorch Public
Forked from pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Python Other Updated Sep 29, 2023 -
transformers Public
Forked from huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Python Apache License 2.0 Updated Jun 23, 2023 -
ml-testing-accelerators Public
Forked from GoogleCloudPlatform/ml-testing-accelerators
Testing framework for Deep Learning models (TensorFlow and PyTorch) on Google Cloud hardware accelerators (TPU and GPU)
Jsonnet Apache License 2.0 Updated Jun 23, 2023 -
cgo-artifact-2020 Public
Artifact repository for paper Automatic Generation of High-Performance Quantized Machine Learning Kernels
-
tvm Public
Forked from apache/tvm
bring deep learning workloads to bare metal
-
tvmai.github.io Public
Forked from tvmai/tvmai.github.io
TVM homepage repo
Ruby Updated Dec 18, 2018 -
Bring deep learning to bare metal
C++ Apache License 2.0 Updated May 13, 2018 -
gemmlowp Public
Forked from google/gemmlowp
Low-precision matrix multiplication
C++ Apache License 2.0 Updated Apr 5, 2018 -