[Clang] [Sema] Reject non-power-of-2 `_BitInt` matrix element types #117487

Sirraide · 2024-11-24T14:11:44Z

Essentially, this pr makes this ill-formed:

using mat4 = _BitInt(12) [[clang::matrix_type(3, 3)]];

This matches preexisting behaviour for vector types (e.g. ext_vector_type), and given that LLVM IR intrinsics for matrices also take vector types, it seems like a sensible thing to do.

This is currently especially problematic since we sometimes lower matrix types to LLVM array types instead, and while e.g. [4 x i32] and <4 x i32> probably have the same similar memory layout (though I don’t think it’s sound to rely on that either, see #117486), [4 x i12] and <4 x i12> definitely don’t.

llvmbot · 2024-11-24T14:12:15Z

@llvm/pr-subscribers-clang

Author: None (Sirraide)

Changes

Essentially, this pr makes this ill-formed:

using mat4 = _BitInt(12) [[clang::matrix_type(3, 3)]];

This matches preexisting behaviour for vector types (e.g. ext_vector_type), and given that LLVM IR intrinsics for matrices also take vector types, it seems like a sensible thing to do.

This is currently especially problematic since we sometimes lower matrix types to LLVM array types instead, and while e.g. [4 x i32] and <4 x i32> probably have the same similar memory layout (though I don’t think it’s sound to rely on that either, see #117486), [4 x i12] and <4 x i12> definitely don’t.

Full diff: https://github.com/llvm/llvm-project/pull/117487.diff

4 Files Affected:

(modified) clang/docs/ReleaseNotes.rst (+3)
(modified) clang/include/clang/Basic/DiagnosticSemaKinds.td (+2-2)
(modified) clang/lib/Sema/SemaType.cpp (+23-18)
(modified) clang/test/SemaCXX/matrix-type.cpp (+11-1)

diff --git a/clang/docs/ReleaseNotes.rst b/clang/docs/ReleaseNotes.rst
index 8bd06fadfdc984..1c0e4043bbe276 100644
--- a/clang/docs/ReleaseNotes.rst
+++ b/clang/docs/ReleaseNotes.rst
@@ -371,6 +371,9 @@ Non-comprehensive list of changes in this release
 - ``__builtin_reduce_and`` function can now be used in constant expressions.
 - ``__builtin_reduce_or`` and ``__builtin_reduce_xor`` functions can now be used in constant expressions.
 
+- Clang now rejects ``_BitInt`` matrix element types if the bit width is less than ``CHAR_WIDTH`` or
+  not a power of two, matching preexisting behaviour for vector types.
+
 New Compiler Flags
 ------------------
 
diff --git a/clang/include/clang/Basic/DiagnosticSemaKinds.td b/clang/include/clang/Basic/DiagnosticSemaKinds.td
index eb05a6a77978af..f049e72a6b8694 100644
--- a/clang/include/clang/Basic/DiagnosticSemaKinds.td
+++ b/clang/include/clang/Basic/DiagnosticSemaKinds.td
@@ -3233,8 +3233,8 @@ def err_attribute_too_few_arguments : Error<
   "%0 attribute takes at least %1 argument%s1">;
 def err_attribute_invalid_vector_type : Error<"invalid vector element type %0">;
 def err_attribute_invalid_bitint_vector_type : Error<
-  "'_BitInt' vector element width must be %select{a power of 2|"
-  "at least as wide as 'CHAR_BIT'}0">;
+  "'_BitInt' %select{vector|matrix}0 element width must be %select{a power of 2|"
+  "at least as wide as 'CHAR_BIT'}1">;
 def err_attribute_invalid_matrix_type : Error<"invalid matrix element type %0">;
 def err_attribute_bad_neon_vector_size : Error<
   "Neon vector size must be 64 or 128 bits">;
diff --git a/clang/lib/Sema/SemaType.cpp b/clang/lib/Sema/SemaType.cpp
index f32edc5ac06440..06b779f5ef3aa2 100644
--- a/clang/lib/Sema/SemaType.cpp
+++ b/clang/lib/Sema/SemaType.cpp
@@ -2312,6 +2312,18 @@ QualType Sema::BuildArrayType(QualType T, ArraySizeModifier ASM,
   return T;
 }
 
+bool CheckBitIntElementType(Sema &S, SourceLocation AttrLoc,
+                            const BitIntType *BIT, bool ForMatrixType = false) {
+  // Only support _BitInt elements with byte-sized power of 2 NumBits.
+  unsigned NumBits = BIT->getNumBits();
+  if (!llvm::isPowerOf2_32(NumBits) || NumBits < 8) {
+    S.Diag(AttrLoc, diag::err_attribute_invalid_bitint_vector_type)
+        << ForMatrixType << (NumBits < 8);
+    return true;
+  }
+  return false;
+}
+
 QualType Sema::BuildVectorType(QualType CurType, Expr *SizeExpr,
                                SourceLocation AttrLoc) {
   // The base type must be integer (not Boolean or enumeration) or float, and
@@ -2324,15 +2336,10 @@ QualType Sema::BuildVectorType(QualType CurType, Expr *SizeExpr,
     Diag(AttrLoc, diag::err_attribute_invalid_vector_type) << CurType;
     return QualType();
   }
-  // Only support _BitInt elements with byte-sized power of 2 NumBits.
-  if (const auto *BIT = CurType->getAs<BitIntType>()) {
-    unsigned NumBits = BIT->getNumBits();
-    if (!llvm::isPowerOf2_32(NumBits) || NumBits < 8) {
-      Diag(AttrLoc, diag::err_attribute_invalid_bitint_vector_type)
-          << (NumBits < 8);
-      return QualType();
-    }
-  }
+
+  if (const auto *BIT = CurType->getAs<BitIntType>();
+      BIT && CheckBitIntElementType(*this, AttrLoc, BIT))
+    return QualType();
 
   if (SizeExpr->isTypeDependent() || SizeExpr->isValueDependent())
     return Context.getDependentVectorType(CurType, SizeExpr, AttrLoc,
@@ -2402,15 +2409,9 @@ QualType Sema::BuildExtVectorType(QualType T, Expr *ArraySize,
     return QualType();
   }
 
-  // Only support _BitInt elements with byte-sized power of 2 NumBits.
-  if (T->isBitIntType()) {
-    unsigned NumBits = T->castAs<BitIntType>()->getNumBits();
-    if (!llvm::isPowerOf2_32(NumBits) || NumBits < 8) {
-      Diag(AttrLoc, diag::err_attribute_invalid_bitint_vector_type)
-          << (NumBits < 8);
-      return QualType();
-    }
-  }
+  if (const auto *BIT = T->getAs<BitIntType>();
+      BIT && CheckBitIntElementType(*this, AttrLoc, BIT))
+    return QualType();
 
   if (!ArraySize->isTypeDependent() && !ArraySize->isValueDependent()) {
     std::optional<llvm::APSInt> vecSize =
@@ -2455,6 +2456,10 @@ QualType Sema::BuildMatrixType(QualType ElementTy, Expr *NumRows, Expr *NumCols,
     return QualType();
   }
 
+  if (const auto *BIT = ElementTy->getAs<BitIntType>();
+      BIT && CheckBitIntElementType(*this, AttrLoc, BIT, true))
+    return QualType();
+
   if (NumRows->isTypeDependent() || NumCols->isTypeDependent() ||
       NumRows->isValueDependent() || NumCols->isValueDependent())
     return Context.getDependentSizedMatrixType(ElementTy, NumRows, NumCols,
diff --git a/clang/test/SemaCXX/matrix-type.cpp b/clang/test/SemaCXX/matrix-type.cpp
index af31e267fdcae8..bb7a8421ca9e37 100644
--- a/clang/test/SemaCXX/matrix-type.cpp
+++ b/clang/test/SemaCXX/matrix-type.cpp
@@ -1,4 +1,4 @@
-// RUN: %clang_cc1 -fsyntax-only -pedantic -fenable-matrix -std=c++11 -verify -triple x86_64-apple-darwin %s
+// RUN: %clang_cc1 -fsyntax-only -fenable-matrix -std=c++11 -verify -triple x86_64-apple-darwin %s
 
 using matrix_double_t = double __attribute__((matrix_type(6, 6)));
 using matrix_float_t = float __attribute__((matrix_type(6, 6)));
@@ -29,3 +29,13 @@ void matrix_unsupported_element_type() {
   using matrix3_t = bool __attribute__((matrix_type(1, 1)));     // expected-error{{invalid matrix element type 'bool'}}
   using matrix4_t = TestEnum __attribute__((matrix_type(1, 1))); // expected-error{{invalid matrix element type 'TestEnum'}}
 }
+
+void matrix_unsupported_bit_int() {
+  using m1 = _BitInt(2) __attribute__((matrix_type(4, 4))); // expected-error{{'_BitInt' matrix element width must be at least as wide as 'CHAR_BIT'}}
+  using m2 = _BitInt(7) __attribute__((matrix_type(4, 4))); // expected-error{{'_BitInt' matrix element width must be at least as wide as 'CHAR_BIT'}}
+  using m3 = _BitInt(9) __attribute__((matrix_type(4, 4))); // expected-error{{'_BitInt' matrix element width must be a power of 2}}
+  using m4 = _BitInt(12) __attribute__((matrix_type(4, 4))); // expected-error{{'_BitInt' matrix element width must be a power of 2}}
+  using m5 = _BitInt(8) __attribute__((matrix_type(4, 4)));
+  using m6 = _BitInt(64) __attribute__((matrix_type(4, 4)));
+  using m7 = _BitInt(256) __attribute__((matrix_type(4, 4)));
+}

clang/lib/Sema/SemaType.cpp

Fznamznon

It feels like that the non-power-of-2 _BitInt vectors were disabled because it wasn't known how to handle them. https://reviews.llvm.org/D133634#3793653
It probably is still so, however, does it make sense to document the new behavior?
Hopefully also, since this never worked properly, this is not a breaking change.

clang/lib/Sema/SemaType.cpp

fhahn

Makes sense to me, but do we have tests that make sure codegen for matrixes with power-of-2 bitints is sane?

Sirraide · 2024-11-25T11:26:54Z

Makes sense to me, but do we have tests that make sure codegen for matrixes with power-of-2 bitints is sane?

I’ll check and if not, I’ll add some

Sirraide · 2024-11-25T11:35:09Z

Interestingly, it seems we don’t even have CodeGen tests for _BitInt vectors...

Sirraide · 2024-11-25T11:45:24Z

Er, well this doesn’t look right:

using i8x3 = _BitInt(32) __attribute__((ext_vector_type(3)));
i8x3 v1(i8x3 a) { return a; }

%a.addr = alloca <3 x i32>, align 16
%extractVec = shufflevector <3 x i32> %a, <3 x i32> poison, <4 x i32> <i32 0, i32 1, i32 2, i32 poison>
store <4 x i32> %extractVec, ptr %a.addr, align 16

We’re storing a <4 x i32> into a <3 x i32> alloca; I don’t think that’s sound, is it?

efriedma-quic · 2024-11-25T20:06:39Z

The type of an alloca is only used to determine the number of bytes allocated; it doesn't have any other semantics. So it's fine as long as the alloca is 16 bytes. Which I think it is in the given example, because of the datalayout rules for <3 x i32>.

Sirraide · 2024-11-25T20:08:51Z

The type of an alloca is only used to determine the number of bytes allocated;

Yeah, that part I know; I was mainly just wondering if there’s a platform/abi/whatever where that doesn’t work out.

efriedma-quic · 2024-11-25T21:51:34Z

(For context, grep for PreserveVec3Type.)

Probably the code should be defensive and check for the allocation size of the vector type, but I don't think any targets define the alignment for non-power-of-2 vectors in a way that would cause issues.

fhahn · 2024-11-26T11:12:08Z

For reference, as per https://clang.llvm.org/docs/MatrixTypes.html#matrix-type, the only allowed element types for matrixes are

an integer type (as in C23 6.2.5p22), but excluding enumerated types and bool
the standard floating types float or double
a half-precision floating point type, if one is supported on the target

so we should either reject all BitInts or update the spec

Sirraide · 2024-11-26T14:29:44Z

an integer type (as in C23 6.2.5p22)

Doesn’t 6.2.5p22 include 6.2.5.p5 i.e. _BitInts?

CC @AaronBallman

AaronBallman · 2024-12-03T19:03:06Z

an integer type (as in C23 6.2.5p22)

Doesn’t 6.2.5p22 include 6.2.5.p5 i.e. _BitInts?

CC @AaronBallman

That's correct, _BitInt is an integer type in C's words of power.

…lowed

Sirraide · 2024-12-04T08:11:13Z

I’ve added some tests and also updated the docs to mention that non-power-of-2 _BitInts are not valid matrix element types.

AaronBallman

LGTM!

Sirraide added 2 commits November 24, 2024 14:38

[Clang] [Sema] Reject non-power-of-2 _BitInt matrix element types

a612c8f

Remove -pedantic

00fc390

Sirraide added clang:frontend Language frontend issues, e.g. anything involving "Sema" extension:clang labels Nov 24, 2024

Sirraide requested review from fhahn, cor3ntin, AaronBallman and Fznamznon November 24, 2024 14:11

llvmbot added the clang Clang issues not falling into any other category label Nov 24, 2024

Sirraide mentioned this pull request Nov 24, 2024

[Clang] [CG] Inconsistent lowering of matrix types to LLVM IR vector/array types #117486

Closed

tbaederr reviewed Nov 25, 2024

View reviewed changes

clang/lib/Sema/SemaType.cpp Outdated Show resolved Hide resolved

Fznamznon reviewed Nov 25, 2024

View reviewed changes

clang/lib/Sema/SemaType.cpp Outdated Show resolved Hide resolved

clang/lib/Sema/SemaType.cpp Outdated Show resolved Hide resolved

fhahn reviewed Nov 25, 2024

View reviewed changes

Apply feedback from code review

4e4fdb9

Sirraide added 2 commits December 4, 2024 09:09

Add tests for _BitInt matrix and vector types

7a54083

Update docs to mention that non-power-of-2 _BitInt matrices are disal…

0d8fd8f

…lowed

Sirraide requested a review from fhahn December 4, 2024 08:11

RST...

99a2574

AaronBallman approved these changes Dec 4, 2024

View reviewed changes

Sirraide merged commit 6e58e99 into llvm:main Dec 17, 2024
9 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Clang] [Sema] Reject non-power-of-2 `_BitInt` matrix element types #117487

[Clang] [Sema] Reject non-power-of-2 `_BitInt` matrix element types #117487

Sirraide commented Nov 24, 2024

llvmbot commented Nov 24, 2024

Fznamznon left a comment

fhahn left a comment

Sirraide commented Nov 25, 2024

Sirraide commented Nov 25, 2024 •

edited

Loading

Sirraide commented Nov 25, 2024

efriedma-quic commented Nov 25, 2024

Sirraide commented Nov 25, 2024

efriedma-quic commented Nov 25, 2024

fhahn commented Nov 26, 2024

Sirraide commented Nov 26, 2024

AaronBallman commented Dec 3, 2024

Sirraide commented Dec 4, 2024

AaronBallman left a comment

[Clang] [Sema] Reject non-power-of-2 _BitInt matrix element types #117487

[Clang] [Sema] Reject non-power-of-2 _BitInt matrix element types #117487

Conversation

Sirraide commented Nov 24, 2024

llvmbot commented Nov 24, 2024

Fznamznon left a comment

Choose a reason for hiding this comment

fhahn left a comment

Choose a reason for hiding this comment

Sirraide commented Nov 25, 2024

Sirraide commented Nov 25, 2024 • edited Loading

Sirraide commented Nov 25, 2024

efriedma-quic commented Nov 25, 2024

Sirraide commented Nov 25, 2024

efriedma-quic commented Nov 25, 2024

fhahn commented Nov 26, 2024

Sirraide commented Nov 26, 2024

AaronBallman commented Dec 3, 2024

Sirraide commented Dec 4, 2024

AaronBallman left a comment

Choose a reason for hiding this comment

[Clang] [Sema] Reject non-power-of-2 `_BitInt` matrix element types #117487

[Clang] [Sema] Reject non-power-of-2 `_BitInt` matrix element types #117487

Sirraide commented Nov 25, 2024 •

edited

Loading