Skip to content

Pull requests: mit-han-lab/qserve

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Support bf16 for quant, layernorm and gemm OPs
#47 opened Dec 10, 2024 by guoyuhong Loading…
[Minor] Fix KV cache block size
#39 opened Oct 15, 2024 by dasistwo Loading…
Update README.md
#6 opened May 14, 2024 by eltociear Loading…
ProTip! Filter pull requests by the default branch with base:main.