Skip to content

merge paged attention feature and moe feature into llama_fp8_12062024#370

Draft
yuzho-amd wants to merge 11 commits intollama_fp8_12062024from yuzho/moe_final_0121

Commits

Commits on Dec 20, 2024

Commits on Dec 27, 2024

Commits on Dec 29, 2024

Commits on Dec 30, 2024

Commits on Jan 9, 2025