Repositories list
18 repositories
UMbreLLa (Public): LLM Inference on consumer devices
gsm_infinite (Public)
APE-Page (Public)
APE (Public)
RULER (Public)
Sequoia (Public): scalable and robust tree-based speculative decoding algorithm
lm-evaluation-harness (Public)
S2FT (Public)
S2FT-Page (Public): [ICLR2025 Spotlight] MagicPIG: LSH Sampling for Efficient LLM Generation
MagicDec (Public)
MagicPIG-Page (Public)
Factor (Public)
MagicDec-part1 (Public)
Sirius (Public)
MagicDec-part2 (Public)
TriForce (Public): [COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding
Sequoia-Page (Public)