Skip to content

Move kv cache scales from k/v_proj.output_scale to self_attn.k/v_scale#133

Merged
mgoin merged 2 commits intomainfrom
move-kv_cache_scheme-to-kv_scales
Aug 15, 2024
Merged

Move kv cache scales from k/v_proj.output_scale to self_attn.k/v_scale#133
mgoin merged 2 commits intomainfrom
move-kv_cache_scheme-to-kv_scales

Commits

Commits on Aug 14, 2024

Commits on Aug 15, 2024