[CIR] shufflevector and convertvector built-ins #530

dkolsen-pgi · 2024-04-03T18:38:48Z

Implement __builtin_shufflevector and __builtin_convertvector in ClangIR. This change contributes to the implemention of issue #284.

__builtin_convertvector is implemented as a cast. LLVM IR uses the same instructions for arithmetic conversions of both individual scalars and entire vectors. So ClangIR does the same. The code for handling conversions, in both CodeGen and Lowering, is cleaned up to correctly handle vector types. To simplify the lowering code and avoid if (type.isa<VectorType>()) statements everywhere, the utility function elementTypeIfVector was added to LowerToLLVM.cpp.

__builtin_shufflevector has two forms, only one of which appears to be documented.

The documented form, which takes a variable-sized list of integer constants for the indices, is implemented with the new ClangIR operation cir.vec.shuffle.ints. This operation is lowered to the llvm.shufflevector op.

The undocumented form, which gets the indices from a vector operand, is implemented with the new ClangIR operation cir.vec.shuffle.vec. LLVM IR does not have an instruction for this, so it gets lowered to a long series of llvm.extractelement and llvm.insertelement operations.

Implement `__builtin_shufflevector` and `__builtin_convertvector` in ClangIR. This change contributes to the implemention of issue llvm#284. `__builtin_convertvector` is implemented as a cast. LLVM IR uses the same instructions for arithmetic conversions of both individual scalars and entire vectors. So ClangIR does the same. The code for handling conversions, in both CodeGen and Lowering, is cleaned up to correctly handle vector types. To simplify the lowering code and avoid `if (type.isa<VectorType>())` statements everywhere, the utility function `elementTypeIfVector` was added to `LowerToLLVM.cpp`. `__builtin_shufflevector` has two forms, only one of which appears to be documented. The documented form, which takes a variable-sized list of integer constants for the indices, is implemented with the new ClangIR operation `cir.vec.shuffle.ints`. This operation is lowered to the `llvm.shufflevector` op. The undocumented form, which gets the indices from a vector operand, is implemented with the new ClangIR operation `cir.vec.shuffle.vec`. LLVM IR does not have an instruction for this, so it gets lowered to a long series of `llvm.extractelement` and `llvm.insertelement` operations.

bcardosolopes

Thanks for adding shuffle support David.

Feels like the fundamental difference in these shuffles is static indicies vs dynamic ones, with the caveat that the dynamic one operates on a single vector source. My concern with adding two CIR operations for shuffles is passes having to know about more than one. However, unifying both in one doesn't also feel great (e.g. it would require making second vector optional and tricks to differentiate attribute vs value for indicies).

I suggest we keep the CIR operations separate but make them share a ShuffleLikeInterface. Passes (including LLVM lowering) should use the interface when handling shuffles. The interface can be initially designed with the LLVM lowering use case so it could answer queries like "shuffle indicies dynamic?", "give me the input vectors", etc. Wdyt?

For naming, perhaps we can incorporate the semantic, I suggest:
vec.shuffle.ints -> vec.shuffle.static
vec.shuffle.vec -> ``vec.shuffle.dynamicAlternatively, the most common used one could bevec.shuffle` and the other suffixed with static or dynamic.

clang/lib/CIR/Lowering/DirectToLLVM/LowerToLLVM.cpp

clang/lib/CIR/Dialect/IR/CIRDialect.cpp

dkolsen-pgi · 2024-04-03T23:48:47Z

I think that creating ShuffleLikeInterface now is a case of designing for reuse too early. The lowering for the two vec.shuffle operations are so completely different (1 LLVM dialect operation vs. 26 LLVM dialect operations in the test case in this PR) that the interface would not enable any shared code.

I do like your naming suggestions for the operations. That was the part of this PR I was least sure about. I will rename vec.shuffle.ints to vec.shuffle (since I think that will be the more common one) and rename vec.shuffle.vec to vec.shuffle.dynamic.

bcardosolopes · 2024-04-03T23:55:35Z

I think that creating ShuffleLikeInterface now is a case of designing for reuse too early. The lowering for the two vec.shuffle operations are so completely different (1 LLVM dialect operation vs. 26 LLVM dialect operations in the test case in this PR) that the interface would not enable any shared code.

I'm not too worried about the lowering, but more on the consistency when looking at shuffles, the more operations the more passes need to be thought about and it becomes easier to miss things. I'm fine with this getting added later though, it's indeed some early design, can you add a TODO/FIXME comment to one of the operations to highlight the future intent?

Rename `cir.vec.shuffle.ints` and `cir.vec.shuffle.vec` to `cir.vec.shuffle` and `cir.vec.shuffle.dynamic` respectively.

dkolsen-pgi · 2024-04-04T21:05:13Z

I pushed a new commit that renames cir.vec.shuffle.ints and cir.vec.shuffle.vec to cir.vec.shuffle and cir.vec.shuffle.dynamic respectively. The only other changes are slight improvements to the descriptions of the operations and a TODO comment to create a vector-shuffle interface.

Implement `__builtin_shufflevector` and `__builtin_convertvector` in ClangIR. This change contributes to the implemention of issue #284. `__builtin_convertvector` is implemented as a cast. LLVM IR uses the same instructions for arithmetic conversions of both individual scalars and entire vectors. So ClangIR does the same. The code for handling conversions, in both CodeGen and Lowering, is cleaned up to correctly handle vector types. To simplify the lowering code and avoid `if (type.isa<VectorType>())` statements everywhere, the utility function `elementTypeIfVector` was added to `LowerToLLVM.cpp`. `__builtin_shufflevector` has two forms, only one of which appears to be documented. The documented form, which takes a variable-sized list of integer constants for the indices, is implemented with the new ClangIR operation `cir.vec.shuffle.ints`. This operation is lowered to the `llvm.shufflevector` op. The undocumented form, which gets the indices from a vector operand, is implemented with the new ClangIR operation `cir.vec.shuffle.vec`. LLVM IR does not have an instruction for this, so it gets lowered to a long series of `llvm.extractelement` and `llvm.insertelement` operations.

Implement `__builtin_shufflevector` and `__builtin_convertvector` in ClangIR. This change contributes to the implemention of issue llvm#284. `__builtin_convertvector` is implemented as a cast. LLVM IR uses the same instructions for arithmetic conversions of both individual scalars and entire vectors. So ClangIR does the same. The code for handling conversions, in both CodeGen and Lowering, is cleaned up to correctly handle vector types. To simplify the lowering code and avoid `if (type.isa<VectorType>())` statements everywhere, the utility function `elementTypeIfVector` was added to `LowerToLLVM.cpp`. `__builtin_shufflevector` has two forms, only one of which appears to be documented. The documented form, which takes a variable-sized list of integer constants for the indices, is implemented with the new ClangIR operation `cir.vec.shuffle.ints`. This operation is lowered to the `llvm.shufflevector` op. The undocumented form, which gets the indices from a vector operand, is implemented with the new ClangIR operation `cir.vec.shuffle.vec`. LLVM IR does not have an instruction for this, so it gets lowered to a long series of `llvm.extractelement` and `llvm.insertelement` operations.

Implement `__builtin_shufflevector` and `__builtin_convertvector` in ClangIR. This change contributes to the implemention of issue #284. `__builtin_convertvector` is implemented as a cast. LLVM IR uses the same instructions for arithmetic conversions of both individual scalars and entire vectors. So ClangIR does the same. The code for handling conversions, in both CodeGen and Lowering, is cleaned up to correctly handle vector types. To simplify the lowering code and avoid `if (type.isa<VectorType>())` statements everywhere, the utility function `elementTypeIfVector` was added to `LowerToLLVM.cpp`. `__builtin_shufflevector` has two forms, only one of which appears to be documented. The documented form, which takes a variable-sized list of integer constants for the indices, is implemented with the new ClangIR operation `cir.vec.shuffle.ints`. This operation is lowered to the `llvm.shufflevector` op. The undocumented form, which gets the indices from a vector operand, is implemented with the new ClangIR operation `cir.vec.shuffle.vec`. LLVM IR does not have an instruction for this, so it gets lowered to a long series of `llvm.extractelement` and `llvm.insertelement` operations.

Implement `__builtin_shufflevector` and `__builtin_convertvector` in ClangIR. This change contributes to the implemention of issue llvm#284. `__builtin_convertvector` is implemented as a cast. LLVM IR uses the same instructions for arithmetic conversions of both individual scalars and entire vectors. So ClangIR does the same. The code for handling conversions, in both CodeGen and Lowering, is cleaned up to correctly handle vector types. To simplify the lowering code and avoid `if (type.isa<VectorType>())` statements everywhere, the utility function `elementTypeIfVector` was added to `LowerToLLVM.cpp`. `__builtin_shufflevector` has two forms, only one of which appears to be documented. The documented form, which takes a variable-sized list of integer constants for the indices, is implemented with the new ClangIR operation `cir.vec.shuffle.ints`. This operation is lowered to the `llvm.shufflevector` op. The undocumented form, which gets the indices from a vector operand, is implemented with the new ClangIR operation `cir.vec.shuffle.vec`. LLVM IR does not have an instruction for this, so it gets lowered to a long series of `llvm.extractelement` and `llvm.insertelement` operations.

Implement `__builtin_shufflevector` and `__builtin_convertvector` in ClangIR. This change contributes to the implemention of issue #284. `__builtin_convertvector` is implemented as a cast. LLVM IR uses the same instructions for arithmetic conversions of both individual scalars and entire vectors. So ClangIR does the same. The code for handling conversions, in both CodeGen and Lowering, is cleaned up to correctly handle vector types. To simplify the lowering code and avoid `if (type.isa<VectorType>())` statements everywhere, the utility function `elementTypeIfVector` was added to `LowerToLLVM.cpp`. `__builtin_shufflevector` has two forms, only one of which appears to be documented. The documented form, which takes a variable-sized list of integer constants for the indices, is implemented with the new ClangIR operation `cir.vec.shuffle.ints`. This operation is lowered to the `llvm.shufflevector` op. The undocumented form, which gets the indices from a vector operand, is implemented with the new ClangIR operation `cir.vec.shuffle.vec`. LLVM IR does not have an instruction for this, so it gets lowered to a long series of `llvm.extractelement` and `llvm.insertelement` operations.

bcardosolopes requested changes Apr 3, 2024

View reviewed changes

clang/lib/CIR/Lowering/DirectToLLVM/LowerToLLVM.cpp Show resolved Hide resolved

clang/lib/CIR/Dialect/IR/CIRDialect.cpp Show resolved Hide resolved

[CIR] Rename vector shuffle operations

8339e4e

Rename `cir.vec.shuffle.ints` and `cir.vec.shuffle.vec` to `cir.vec.shuffle` and `cir.vec.shuffle.dynamic` respectively.

dkolsen-pgi requested a review from bcardosolopes April 4, 2024 21:09

bcardosolopes approved these changes Apr 5, 2024

View reviewed changes

bcardosolopes merged commit c3b2ac4 into llvm:main Apr 5, 2024
6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CIR] shufflevector and convertvector built-ins #530

[CIR] shufflevector and convertvector built-ins #530

dkolsen-pgi commented Apr 3, 2024

bcardosolopes left a comment

dkolsen-pgi commented Apr 3, 2024

bcardosolopes commented Apr 3, 2024

dkolsen-pgi commented Apr 4, 2024

[CIR] shufflevector and convertvector built-ins #530

[CIR] shufflevector and convertvector built-ins #530

Conversation

dkolsen-pgi commented Apr 3, 2024

bcardosolopes left a comment

Choose a reason for hiding this comment

dkolsen-pgi commented Apr 3, 2024

bcardosolopes commented Apr 3, 2024

dkolsen-pgi commented Apr 4, 2024