[CIR][CodeGen] Special treatment of 3-element extended vector load and store #674

seven-mile · 2024-06-09T16:34:45Z

Continues the work of #613.

The original CodeGen treats vec3 as vec4 to get aligned memory access. This PR enables these paths.
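
As a rough illustration of the "vec3 as vec4" idea, here is a minimal sketch in portable C++. It is not the code from this PR, which works on CIR vector types. The names `Vec3`, `Vec4`, `Vec3Slot`, `loadVec3`, `storeVec3` and `roundTrips` are hypothetical stand-ins, and writing zero into the padding lane is an assumption made only to keep the example deterministic.

```cpp
#include <array>
#include <cstdint>
#include <cstring>

// Hypothetical stand-ins for 3-element and 4-element vector values.
using Vec3 = std::array<std::int32_t, 3>;
using Vec4 = std::array<std::int32_t, 4>;

// Memory slot for a vec3, padded to the size and alignment of a vec4.
struct alignas(16) Vec3Slot {
    std::int32_t lanes[4];
};
static_assert(sizeof(Vec3Slot) == 16, "a vec3 slot occupies four lanes");

// Load: read all four lanes in one aligned access, then drop the padding lane.
Vec3 loadVec3(const Vec3Slot &slot) {
    Vec4 wide;
    std::memcpy(wide.data(), slot.lanes, sizeof(wide));
    return Vec3{{wide[0], wide[1], wide[2]}};
}

// Store: widen to four lanes, then write the whole slot in one aligned access.
// Assumption: the padding lane is set to 0 here for determinism.
void storeVec3(Vec3Slot &slot, const Vec3 &v) {
    Vec4 wide{{v[0], v[1], v[2], 0}};
    std::memcpy(slot.lanes, wide.data(), sizeof(wide));
}

// Usage check: a value survives a store followed by a load.
bool roundTrips(const Vec3 &v) {
    Vec3Slot slot{};
    storeVec3(slot, v);
    return loadVec3(slot) == v;
}
```

The widen-then-narrow step on each side is what lets every memory access cover a full, aligned four-lane slot while the value itself still has three elements.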

jopperm

Could you add a test (or file an issue) for arrays of 3-component vectors, please?

jopperm · 2024-06-10T09:57:55Z

clang/test/CIR/CodeGen/vectype-ext.cpp

+  // CIR-NEXT: cir.store %[[#RESULT]], %[[#PVECC]] : !cir.vector<!s32i x 3>, !cir.ptr<!cir.vector<!s32i x 3>>
+
+  // LLVM-NEXT: %[[#VECB:]] = load <2 x i32>, ptr %[[#PVECB]], align 8
+  // LLVM-NEXT: %[[#VECC:]] = load <3 x i32>, ptr %[[#PVECC]], align 16


This matches clang's codegen, but do we understand why the vector is not loaded as <4 x i32> here?

Seems a mistake from upstream?

Not blocking the review, but I agree with @jopperm. Since you're going to be looking at OpenCL, it might be good to understand why, for your own knowledge of how vectors should play out in general.

bcardosolopes

LGTM with a few clarifying questions / comments for future PRs.

bcardosolopes · 2024-06-10T19:02:16Z


clang/test/CIR/CodeGen/vectype-ext.cpp

clang/lib/CIR/CodeGen/CIRGenExpr.cpp


[CIR][CIRGen] Special treatment of 3-element vector load and store

ad84428

jopperm reviewed Jun 10, 2024


bcardosolopes approved these changes Jun 10, 2024


bcardosolopes merged commit 5e9148f into llvm:main Jun 11, 2024
7 checks passed

This was referenced Jun 13, 2024

Add a test for arrays of 3-component extended vectors #685

Open

[GSoC] Add OpenCL support to compile GPU kernels #689

Closed

lanza pushed a commit that referenced this pull request Nov 5, 2024

[CIR][CodeGen] Special treatment of 3-element extended vector load an…

b217246

…d store (#674) Continue the work of #613 . Original CodeGen treat vec3 as vec4 to get aligned memory access. This PR enable these paths.
