unavailableun

Follow

unavailableun

Follow

2 followers · 1 following

Microsoft Corporation

Popular repositories Loading

vllm vllm Public

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python
flash-attention flash-attention Public

Forked from ROCm/flash-attention

Fast and memory-efficient exact attention

Python
transformers transformers Public

Forked from hackyon/transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python