benchdnn: graph: inputs: add a sdpa implicit causal mask case
ElaineBao authored and gyhintel committed Jan 3, 2025
1 parent 70686e0 commit e38a675
Showing 3 changed files with 532 additions and 0 deletions.
1 change: 1 addition & 0 deletions tests/benchdnn/inputs/graph/complex_fusion/harness_mha_all
@@ -14,6 +14,7 @@
--reset --dt=f32,bf16,f16 --case=complex_fusion/mha/GQA-fp16.json
--reset --dt=f32,bf16,f16 --case=complex_fusion/mha/sdpa-plain-wo-mask-f16.json
--reset --dt=f32,bf16,f16 --case=complex_fusion/mha/sdpa-plain-scale-by-mul-f16.json
+--reset --dt=f32,bf16,f16 --case=complex_fusion/mha/sdpa-plain-implicit-causal-mask-fp32-bs1.json

# int8 graphs
--reset --case=complex_fusion/mha/MHA-GPT-inf-int8-bs1.json
1 change: 1 addition & 0 deletions tests/benchdnn/inputs/graph/complex_fusion/harness_mha_ci
@@ -12,6 +12,7 @@
--reset --dt=f32,bf16,f16 --case=complex_fusion/mha/GQA-fp16.json
--reset --dt=f32,bf16,f16 --case=complex_fusion/mha/sdpa-plain-wo-mask-f16.json
--reset --dt=f32,bf16,f16 --case=complex_fusion/mha/sdpa-plain-scale-by-mul-f16.json
+--reset --dt=f32,bf16,f16 --case=complex_fusion/mha/sdpa-plain-implicit-causal-mask-fp32-bs1.json

# int8 graphs
--reset --case=complex_fusion/mha/MHA-GPT-inf-int8-bs1.json