[GPU] Use onednn impl for dynamic gemm #27212

Lyamin-Roman · 2024-10-23T18:42:30Z

Details:

Performance improvement for LoRA

sshlyapn · 2024-10-24T07:27:12Z

src/plugins/intel_gpu/src/graph/impls/registry/gemm_impls.cpp

-        OV_GPU_GET_INSTANCE_OCL(gemm, shape_types::dynamic_shape)
+        OV_GPU_GET_INSTANCE_OCL(gemm, shape_types::dynamic_shape,
+            [](const program_node& node) {
+                return !node.can_use(impl_types::onednn);


Does oneDNN support efficient kernels caching for gemm? If not, this could cause runtime kernel recompilation and drop performance in some cases. This change probably requires wider performance check

Checked on LNL with qwen2 and llama3, no performance drops were detected (with and w/o sdpa)

Lyamin-Roman added the category: GPU OpenVINO GPU plugin label Oct 23, 2024

Lyamin-Roman requested review from a team as code owners October 23, 2024 18:42

sshlyapn reviewed Oct 24, 2024

View reviewed changes

Lyamin-Roman force-pushed the use_gemm_onednn_impl branch 3 times, most recently from cb044a5 to 6025088 Compare October 31, 2024 00:32

[GPU] Use onednn impl for dynamic gemm

f57f45d

Lyamin-Roman force-pushed the use_gemm_onednn_impl branch from 6025088 to f57f45d Compare October 31, 2024 00:35

vladimir-paramuzov enabled auto-merge October 31, 2024 06:34

vladimir-paramuzov approved these changes Oct 31, 2024

View reviewed changes

vladimir-paramuzov added this pull request to the merge queue Oct 31, 2024

Merged via the queue into openvinotoolkit:master with commit a6a113c Oct 31, 2024
150 checks passed

Lyamin-Roman deleted the use_gemm_onednn_impl branch October 31, 2024 11:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[GPU] Use onednn impl for dynamic gemm #27212

[GPU] Use onednn impl for dynamic gemm #27212

Lyamin-Roman commented Oct 23, 2024

sshlyapn Oct 24, 2024

Lyamin-Roman Oct 24, 2024

[GPU] Use onednn impl for dynamic gemm #27212

[GPU] Use onednn impl for dynamic gemm #27212

Conversation

Lyamin-Roman commented Oct 23, 2024

Details:

sshlyapn Oct 24, 2024

Choose a reason for hiding this comment

Lyamin-Roman Oct 24, 2024

Choose a reason for hiding this comment