Popular repositories
- bitsandbytes (Python; forked from bitsandbytes-foundation/bitsandbytes): 8-bit CUDA functions for PyTorch. (Usage sketch below.)
- vllm-fork (Python; forked from HabanaAI/vllm-fork): a high-throughput and memory-efficient inference and serving engine for LLMs. (Usage sketch below.)
- neural-compressor (Python; forked from intel/neural-compressor): SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) and sparsity; leading model compression techniques for TensorFlow, PyTorch, and ONNX Runtime. (Usage sketch below.)
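bitsandbytes is commonly used through its Hugging Face transformers integration. A minimal sketch, assuming the upstream `transformers` `BitsAndBytesConfig` API; the model id is a placeholder chosen for illustration, not something pinned by this fork:

```python
# Minimal sketch: load a causal LM with 8-bit bitsandbytes weights via transformers.
# The model id is a hypothetical example.
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

model_id = "facebook/opt-350m"  # placeholder model
quant_config = BitsAndBytesConfig(load_in_8bit=True)  # replace Linear layers with 8-bit kernels

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=quant_config,
    device_map="auto",  # place layers on available accelerators
)

inputs = tokenizer("Quantization reduces memory by", return_tensors="pt").to(model.device)
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=20)[0], skip_special_tokens=True))
```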
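vLLM also offers an offline inference API alongside its server. A minimal sketch, assuming the upstream `vllm` package (the HabanaAI fork targets Intel Gaudi accelerators); the model id is again a placeholder:

```python
# Minimal sketch of vLLM offline batched inference; model id is a placeholder.
from vllm import LLM, SamplingParams

llm = LLM(model="facebook/opt-125m")  # placeholder model
params = SamplingParams(temperature=0.8, max_tokens=64)

for out in llm.generate(["What does a serving engine do?"], params):
    print(out.outputs[0].text)
```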
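For neural-compressor, a minimal post-training dynamic quantization sketch, assuming the 2.x Python API (`PostTrainingQuantConfig` and `quantization.fit`); the toy model is an assumption for illustration and the exact API can differ between releases:

```python
# Minimal sketch: post-training dynamic quantization with Intel Neural Compressor.
# The model and config are illustrative assumptions, not taken from this repository.
import torch
from neural_compressor import PostTrainingQuantConfig
from neural_compressor.quantization import fit

# Placeholder FP32 model; dynamic PTQ mainly targets Linear layers.
fp32_model = torch.nn.Sequential(
    torch.nn.Linear(128, 256),
    torch.nn.ReLU(),
    torch.nn.Linear(256, 10),
)

conf = PostTrainingQuantConfig(approach="dynamic")  # dynamic PTQ needs no calibration data
q_model = fit(model=fp32_model, conf=conf)          # returns a wrapper around the quantized model
print(type(q_model))
```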