[webgpu] Enable graph capture #24900

qjia7 · 2025-05-29T10:01:24Z

This PR enables graph capture capabilities in the WebGPU provider, which is similar with jsep one #18989.

All limitations are similar with JS/CUDA EP:

Models with control-flow ops (i.e. If, Loop and Scan ops) are not supported.
Usage of graph capture is limited to models where-in all ops in the model can be partitioned to the WebGPU EP or CPU EP and no memory copy between them.
Shapes of inputs/outputs cannot change across inference calls.
IOBinding is required. And all inputs/outputs are pre-allocated gpu buffers.

qjia7 · 2025-05-29T10:10:36Z

onnxruntime/core/providers/webgpu/buffer_manager.cc

      } else {
-        wgpuBufferRelease(buffer);
+        // TODO: Reuse the captured buffers for storage buffer to reduce peak memory.


To reuse the captured buffers, we need to change BufferManager::Create/TryAcquireCachedBuffer to add one additional parameter session_id so that we can check whether this session has been captured. If yes, we can get buffer from captured_buffers_ to reuse the buffer in TryAcquireCachedBuffer. How do you think? @fs-eire

qjia7 added 2 commits May 29, 2025 17:10

[webgpu] Add graph capture support

3458625

clean code

08bfa2b

qjia7 commented May 29, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[webgpu] Enable graph capture #24900

[webgpu] Enable graph capture #24900

qjia7 commented May 29, 2025

Uh oh!

qjia7 May 29, 2025

Uh oh!

Uh oh!

[webgpu] Enable graph capture #24900

Are you sure you want to change the base?

[webgpu] Enable graph capture #24900

Conversation

qjia7 commented May 29, 2025

Uh oh!

qjia7 May 29, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!