🎯 Focusing
Currently studying at the University of Science and Technology of China.
Pinned
- vllm-project/vllm (Public): A high-throughput and memory-efficient inference and serving engine for LLMs
- casper-hansen/AutoAWQ (Public archive): AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
- huggingface/transformers (Public): 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
- LightBinPack (Public): A lightweight library for solving packing problems in LLM training
C++ 2