Fix fusion ordering for partial rotary embedding (#2402)
The partial-rotary-embedding fusion depends on the cos-sin-cache fusion,
so it must run after it. Fix the fusion ordering accordingly. This is
necessary for GQA fusion in models like Phi4, which use a partial rotary
embedding.
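
Why the ordering matters can be sketched with a toy pipeline. This is
only an illustration under assumptions: the op strings, the two pass
functions, and `run_pipeline` are hypothetical stand-ins, not the actual
fusion rules or API touched by this commit.

```python
# Toy model of order-dependent fusion passes. All names here are hypothetical;
# a "graph" is just a list of op-name strings.

def fuse_cos_sin_cache(ops):
    """Rewrite a plain rotary-embedding op into its cos/sin-cached form."""
    return ["RotaryEmbedding(cached)" if op == "RotaryEmbedding" else op for op in ops]


def fuse_partial_rotary_embedding(ops):
    """Match only the cached form, so this pass needs the one above to run first."""
    return ["PartialRotaryEmbedding" if op == "RotaryEmbedding(cached)" else op for op in ops]


def run_pipeline(ops, passes):
    """Apply each fusion pass once, in the given order."""
    for fusion in passes:
        ops = fusion(ops)
    return ops


graph = ["MatMul", "RotaryEmbedding", "Attention"]

# Dependent pass first: it finds nothing to match, so the fusion is missed.
wrong_order = run_pipeline(graph, [fuse_partial_rotary_embedding, fuse_cos_sin_cache])

# Prerequisite pass first: the dependent pass now sees the pattern it expects.
right_order = run_pipeline(graph, [fuse_cos_sin_cache, fuse_partial_rotary_embedding])
```

In this toy setting the first ordering leaves the cached op unfused,
while the second produces the fused op that a later pass (GQA fusion in
the real case) would look for.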
TODO: Add a test case. The one I have is huge, so a smaller one needs
to be created.
Signed-off-by: Ganesan Ramalingam <grama@microsoft.com>