merge paged attention feature and moe feature into llama_fp8_12062024 #370
Draft
yuzho-amd wants to merge 11 commits into llama_fp8_12062024 from yuzho/moe_final_0121
+2,008 -463
Commits
Commits on Dec 20, 2024
Commits on Dec 27, 2024
Commits on Dec 29, 2024
- committed by vllmellm
Commits on Dec 30, 2024
- committed by vllmellm
Commits on Jan 9, 2025
- authored