[WebNN EP] Support local attention feature for GQA #26565

peishenyan · 2025-11-13T07:38:05Z

Description

Support the local_window_size attribute in GroupQueryAttention Operator, which is designed for sliding window attention and may influence the attention mask pattern.

For local window size not equal to -1, new attention mask pattern will be created as follows for applying sliding window.

     condition_1 (old attn_mask) ---> CumSum (axis=3, exclusive=true, reversed=true)
          |                             |
          |                           Lesser <--- local_window_size
          |                             |
      LogicalAnd <----------------- condition_2
          |
    new attn_mask

Motivation and Context

add log info temp Support local_window_size for WebNN GQA

peishenyan · 2025-11-17T01:44:58Z

PTAL, thanks. @Honry

Honry

Thanks, LGTM, pls. also mention the needs to use 'expand' for mask_shape_ones_shape_constant in your commit message.

Honry · 2025-11-17T02:22:17Z

@fdwr, please take another look, thanks!

fdwr

fdwr · 2025-11-20T00:29:40Z

/azp run ONNX Runtime Web CI Pipeline,Windows GPU CI Pipeline,Linux Android Emulator QNN CI Pipeline,Windows GPU WebGPU CI Pipeline,Windows OpenVINO CI Pipeline

fdwr · 2025-11-20T00:29:44Z

/azp run Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline,Linux OpenVINO CI Pipeline,MacOS CI Pipeline,Windows ARM64 QNN CI Pipeline,Windows CPU CI Pipeline

fdwr · 2025-11-20T00:29:47Z

/azp run Windows GPU CUDA CI Pipeline,Windows GPU DML CI Pipeline,Windows GPU Doc Gen CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI

fdwr · 2025-11-20T00:29:49Z

/azp run Windows GPU TensorRT CI Pipeline,onnxruntime-binary-size-checks-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,Windows x64 QNN CI Pipeline,Big Models

azure-pipelines · 2025-11-20T00:29:50Z

Azure Pipelines successfully started running 1 pipeline(s).

fdwr · 2025-11-20T00:29:52Z

/azp run Test Linux CUDA x64 Release,Test Linux TensorRT x64 Release,web_Debug / build_onnxruntime_web,web_Release / build_onnxruntime_web

azure-pipelines · 2025-11-20T00:29:53Z

Azure Pipelines successfully started running 1 pipeline(s).

fdwr · 2025-11-20T00:29:53Z

/azp run Linux QNN CI Pipeline

azure-pipelines · 2025-11-20T00:29:56Z

No pipelines are associated with this pull request.

azure-pipelines · 2025-11-20T00:29:58Z

No pipelines are associated with this pull request.

azure-pipelines · 2025-11-20T00:30:00Z

Azure Pipelines successfully started running 2 pipeline(s).

azure-pipelines · 2025-11-20T00:30:03Z

Azure Pipelines successfully started running 1 pipeline(s).

peishenyan added 2 commits November 3, 2025 14:10

temp local window size

6af6346

add log info temp Support local_window_size for WebNN GQA

decompose large constant

e9ef8ae

Honry approved these changes Nov 17, 2025

View reviewed changes

fdwr approved these changes Nov 20, 2025

View reviewed changes

[WebNN EP] Support local attention feature for GQA #26565

Are you sure you want to change the base?

[WebNN EP] Support local attention feature for GQA #26565

Uh oh!

Conversation

peishenyan commented Nov 13, 2025

Description

Motivation and Context

Uh oh!

peishenyan commented Nov 17, 2025

Uh oh!

Honry left a comment

Choose a reason for hiding this comment

Uh oh!

Honry commented Nov 17, 2025

Uh oh!

fdwr left a comment

Choose a reason for hiding this comment

Uh oh!

fdwr commented Nov 20, 2025

Uh oh!

fdwr commented Nov 20, 2025

Uh oh!

fdwr commented Nov 20, 2025

Uh oh!

fdwr commented Nov 20, 2025

Uh oh!

azure-pipelines bot commented Nov 20, 2025

Uh oh!

fdwr commented Nov 20, 2025

Uh oh!

azure-pipelines bot commented Nov 20, 2025

Uh oh!

fdwr commented Nov 20, 2025

Uh oh!

azure-pipelines bot commented Nov 20, 2025

Uh oh!

azure-pipelines bot commented Nov 20, 2025

Uh oh!

azure-pipelines bot commented Nov 20, 2025

Uh oh!

azure-pipelines bot commented Nov 20, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants