[Clang] Remove preprocessor guards and global feature checks for NEON #95102

Lukacma · 2024-06-11T12:14:49Z

To enable function multi-versioning (FMV), current checks which rely on cmd line options or global macros to see if target feature is present need to be removed. This patch removes those for NEON and also implements changes to NEON header file as proposed in ACLE.

llvmbot · 2024-06-11T12:15:40Z

@llvm/pr-subscribers-clang

Author: None (Lukacma)

Changes

To enable function multi-versioning (FMV), current checks which rely on cmd line options or global macros to see if target feature is present need to be removed. This patch removes those for NEON and also implements changes to NEON header file as proposed in ACLE.

Full diff: https://github.com/llvm/llvm-project/pull/95102.diff

4 Files Affected:

(modified) clang/lib/Sema/SemaType.cpp (-23)
(modified) clang/test/Sema/arm-vector-types-support.c (-2)
(removed) clang/test/SemaCUDA/neon-attrs.cu (-22)
(modified) clang/utils/TableGen/NeonEmitter.cpp (-5)

diff --git a/clang/lib/Sema/SemaType.cpp b/clang/lib/Sema/SemaType.cpp
index 441fdcca0758f..65b87f62e294f 100644
--- a/clang/lib/Sema/SemaType.cpp
+++ b/clang/lib/Sema/SemaType.cpp
@@ -8084,29 +8084,6 @@ static void HandleNeonVectorTypeAttr(QualType &CurType, const ParsedAttr &Attr,
         AuxTI && (AuxTI->getTriple().isAArch64() || AuxTI->getTriple().isARM());
   }
 
-  // Target must have NEON (or MVE, whose vectors are similar enough
-  // not to need a separate attribute)
-  if (!(S.Context.getTargetInfo().hasFeature("neon") ||
-        S.Context.getTargetInfo().hasFeature("mve") ||
-        S.Context.getTargetInfo().hasFeature("sve") ||
-        S.Context.getTargetInfo().hasFeature("sme") ||
-        IsTargetCUDAAndHostARM) &&
-      VecKind == VectorKind::Neon) {
-    S.Diag(Attr.getLoc(), diag::err_attribute_unsupported)
-        << Attr << "'neon', 'mve', 'sve' or 'sme'";
-    Attr.setInvalid();
-    return;
-  }
-  if (!(S.Context.getTargetInfo().hasFeature("neon") ||
-        S.Context.getTargetInfo().hasFeature("mve") ||
-        IsTargetCUDAAndHostARM) &&
-      VecKind == VectorKind::NeonPoly) {
-    S.Diag(Attr.getLoc(), diag::err_attribute_unsupported)
-        << Attr << "'neon' or 'mve'";
-    Attr.setInvalid();
-    return;
-  }
-
   // Check the attribute arguments.
   if (Attr.getNumArgs() != 1) {
     S.Diag(Attr.getLoc(), diag::err_attribute_wrong_number_arguments)
diff --git a/clang/test/Sema/arm-vector-types-support.c b/clang/test/Sema/arm-vector-types-support.c
index ed5f5ba175a94..1d2e1c9336fc6 100644
--- a/clang/test/Sema/arm-vector-types-support.c
+++ b/clang/test/Sema/arm-vector-types-support.c
@@ -2,6 +2,4 @@
 // RUN: %clang_cc1 %s -triple aarch64 -fsyntax-only -verify
 // RUN: %clang_cc1 %s -triple aarch64 -target-feature -fp-armv8 -target-abi aapcs-soft -fsyntax-only -verify
 
-typedef __attribute__((neon_vector_type(2))) int int32x2_t; // expected-error{{'neon_vector_type' attribute is not supported on targets missing 'neon', 'mve', 'sve' or 'sme'; specify an appropriate -march= or -mcpu=}}
-typedef __attribute__((neon_polyvector_type(16))) short poly8x16_t; // expected-error{{'neon_polyvector_type' attribute is not supported on targets missing 'neon' or 'mve'; specify an appropriate -march= or -mcpu=}}
 typedef __attribute__((arm_sve_vector_bits(256))) void nosveflag; // expected-error{{'arm_sve_vector_bits' attribute is not supported on targets missing 'sve'; specify an appropriate -march= or -mcpu=}}
diff --git a/clang/test/SemaCUDA/neon-attrs.cu b/clang/test/SemaCUDA/neon-attrs.cu
deleted file mode 100644
index 129056741ac9a..0000000000000
--- a/clang/test/SemaCUDA/neon-attrs.cu
+++ /dev/null
@@ -1,22 +0,0 @@
-// CPU-side compilation on ARM with neon enabled (no errors expected).
-// RUN: %clang_cc1 -triple arm64-linux-gnu -target-feature +neon -aux-triple nvptx64 -x cuda -fsyntax-only -verify=quiet %s
-
-// CPU-side compilation on ARM with neon disabled.
-// RUN: %clang_cc1 -triple arm64-linux-gnu -target-feature -neon -aux-triple nvptx64 -x cuda -fsyntax-only -verify %s
-
-// GPU-side compilation on ARM (no errors expected).
-// RUN: %clang_cc1 -triple nvptx64 -aux-triple arm64-linux-gnu -fcuda-is-device -x cuda -fsyntax-only -verify=quiet %s
-
-// Regular C++ compilation on ARM with neon enabled (no errors expected).
-// RUN: %clang_cc1 -triple arm64-linux-gnu -target-feature +neon -x c++ -fsyntax-only -verify=quiet %s
-
-// Regular C++ compilation on ARM with neon disabled.
-// RUN: %clang_cc1 -triple arm64-linux-gnu -target-feature -neon -x c++ -fsyntax-only -verify %s
-
-// quiet-no-diagnostics
-typedef __attribute__((neon_vector_type(4))) float float32x4_t;
-// expected-error@-1 {{'neon_vector_type' attribute is not supported on targets missing 'neon', 'mve', 'sve' or 'sme'}}
-// expect
-typedef unsigned char poly8_t;
-typedef __attribute__((neon_polyvector_type(8))) poly8_t poly8x8_t;
-// expected-error@-1 {{'neon_polyvector_type' attribute is not supported on targets missing 'neon' or 'mve'}}
diff --git a/clang/utils/TableGen/NeonEmitter.cpp b/clang/utils/TableGen/NeonEmitter.cpp
index 56f1fdf9ef574..626031d38cf00 100644
--- a/clang/utils/TableGen/NeonEmitter.cpp
+++ b/clang/utils/TableGen/NeonEmitter.cpp
@@ -2370,10 +2370,6 @@ void NeonEmitter::run(raw_ostream &OS) {
         "Please use -mfloat-abi=softfp or -mfloat-abi=hard\"\n";
   OS << "#else\n\n";
 
-  OS << "#if !defined(__ARM_NEON)\n";
-  OS << "#error \"NEON support not enabled\"\n";
-  OS << "#else\n\n";
-
   OS << "#include <stdint.h>\n\n";
 
   OS << "#include <arm_bf16.h>\n";
@@ -2450,7 +2446,6 @@ void NeonEmitter::run(raw_ostream &OS) {
   OS << "#undef __ai\n\n";
   OS << "#endif /* if !defined(__ARM_NEON) */\n";
   OS << "#endif /* ifndef __ARM_FP */\n";
-  OS << "#endif /* __ARM_NEON_H */\n";
 }
 
 /// run - Read the records in arm_fp16.td and output arm_fp16.h.  arm_fp16.h

CarolineConcatto · 2024-06-11T12:36:51Z

clang/lib/Sema/SemaType.cpp

-    Attr.setInvalid();
-    return;
-  }
-  if (!(S.Context.getTargetInfo().hasFeature("neon") ||


I believe we may need to leave still the test for MVE. We need to leave a comment in the ACLE stating that the MVE header is still needed.

Do we ? From my understanding this checks are Neon vector types and are unrelated to MVE. The only reason MVE is used is because MVE vectors are similar enough so we can use them as neon vectors

You can preserve the behaviour for MVE if you alter the diagnostics condition to be
"NEON type seen" && "no MVE" && "compiling for M-class".

Introduced by 2cf1439 (llvm#94279). See also 6c369cf. The build system cannot track transitive dependencies on generated headers for some reason.

…m#94907) After enhancing MSVC's STL to statically initialize our condition_variable, Clang began noticing that some mutexes in the test suite were unused.

nvcc warns about the following code: `void f(); __device__ void f() {}` but clang does not since clang allows device function to overload host function. Users want clang to emit similar warning to help code to be compatible with nvcc. Since this may cause regression with existing code, the warning is off by default and can be enabled by -Wnvcc-compat. It won't cause warning in system headers, even with -Wnvcc-compat.

…ly (llvm#94212) The patch didn't consistently clean up `#ifdef LLVM_ENABLE_ABI_BREAKING_CHECKS` and '#if defined(LLVM_ENABLE_ABI_BREAKING_CHECKS)' paths, causing a lot of build failures

Otherwise these tests would fail when using gnuwin32.

…#94982) Reverts llvm#94212 Some codes assume that `NDEBUG` implies `LLVM_ENABLE_ABI_BREAKING_CHECKS`, thus llvm#94212 breaks some build bots.

…updates (llvm#91725) Related review: llvm#91724 This patch updates the RemoveDIs migration document to include details on the textual IR changes, including steps to update any downstream lit tests accordingly. These steps are the same as those used to update the lit tests in the LLVM/Clang lit tests, as detailed in the review linked above.

This was probably intended to test the immediate offset addressing mode. Add some tests to check immediate offsets, and clean up run lines.

Since inreg now works for the default calling convention, we can check the output with fewer argument shuffling instructions.

See Buildbot failures: - https://lab.llvm.org/buildbot/#/builders/257/builds/13450 - https://lab.llvm.org/buildbot/#/builders/256/builds/14531

We can't use `dladdr()` in the tests, because when we're statically linking with musl that function is a no-op. Additionally, because musl disables emission of unwind information in its build, and because its signal trampolines don't include unwind information, tests that involve unwinding through a signal handler won't work and need to be disabled for musl. rdar://123436891

Just use the original type and let it hit a standard legalization error.

… 1.0)` (llvm#94887) close: llvm#92538

…vm#94931) The test clang/test/OpenMP/error_unsupport_feature.c don't check the output written to the current directory. The current directory may be write protected e.g. in a sandboxed environment. This patch replace the -emit-llvm option with -emit-llvm-only as it don't care about the outputed llvm IR.

… of Linux targets (llvm#94672) The different build configuration and target Linux system can load a different number of .so libraries. Add and check own libraries.

…4712) When the integer range analysis was first develop, a pass that did integer range-based constant folding was developed and used as a test pass. There was an intent to add such a folding to SCCP, but that hasn't happened. Meanwhile, -int-range-optimizations was added to the arith dialect's transformations. The cmpi simplification in that pass is a strict subset of the constant folding that lived in -test-int-range-inference. This commit moves the former test pass into -int-range-optimizaitons, subsuming its previous contents. It also adds an optimization from rocMLIR where `rem{s,u}i` operations that are noops are replaced by their left operands.

…94773) Part of llvm#93566.

Static verifier reports unchecked use of pointer after explicitly checking earlier in the function. It appears the pointer won't be a nullptr, so remove the unneeded check for consistency.

…ession (llvm#94356) This commit reimplements the functionality of the Clang Static Analyzer checker `alpha.core.SizeofPointer` within clang-tidy by adding a new (off-by-default) option to bugprone-sizeof-expression which activates reporting all the `sizeof(ptr)` expressions (where ptr is an expression that produces a pointer). The main motivation for this change is that `alpha.core.SizeofPointer` was an AST-based checker, which did not rely on the path sensitive capabilities of the Static Analyzer, so there was no reason to keep it in the Static Analyzer instead of the more lightweight clang-tidy. After this commit I'm planning to create a separate commit that deletes `alpha.core.SizeofPointer` from Clang Static Analyzer. It was natural to place this moved logic in bugprone-sizeof-expression, because that check already provided several heuristics that reported various especially suspicious classes of `sizeof(ptr)` expressions. The new mode `WarnOnSizeOfPointer` is off-by-default, so it won't surprise the existing users; but it can provide a more through coverage for the vulnerability CWE-467 ("Use of sizeof() on a Pointer Type") than the existing partial heuristics. Previously this checker had an exception that the RHS of a `sizeof(array) / sizeof(array[0])` expression is not reported; I generalized this to an exception that the check doesn't report `sizeof(expr[0])` and `sizeof(*expr)`. This idea is taken from the Static Analyzer checker `alpha.core.SizeofPointer` (which had an exception for `*expr`), but analysis of open source projects confirmed that this indeed eliminates lots of unwanted results. Note that the suppression of `sizeof(expr[0])` and `sizeof(*expr)` reports also affects the "old" mode `WarnOnSizeOfPointerToAggregate` which is enabled by default. This commit also replaces the old message "suspicious usage of 'sizeof(A*)'; pointer to aggregate" with two more concrete messages; but I feel that this tidy check would deserve a through cleanup of all the diagnostic messages that it can produce. (I added a FIXME to mark one outright misleading message.)

Those BitVectors get expensive on targets like AMDGPU with thousands of registers, and RegAliasIterator is also expensive. We can move all liveness calculations to use RegUnits instead to speed it up for targets where RegAliasIterator is expensive, like AMDGPU. On targets where RegAliasIterator is cheap, this alternative can be a little more expensive, but I believe the tradeoff is worth it.

This PR is required to fix `std/algorithms/alg.nonmodifying/mismatch/mismatch.pass.cpp` test for big endian platrofrms such as z/OS.

…anslation (llvm#95098)" Reverted due to failure on buildbot due to missing use of the WriteNewDbgInfoFormat flag in MLIR. This reverts commit ca920bb.

Add a section about fence & address spaces that covers amdgpu-as.

[Clang] Remove preprocessor guards and global feature checks for NEON

cdffede

Lukacma requested review from momchil-velikov, hassnaaHamdi and CarolineConcatto June 11, 2024 12:14

llvmbot added clang Clang issues not falling into any other category clang:frontend Language frontend issues, e.g. anything involving "Sema" labels Jun 11, 2024

CarolineConcatto reviewed Jun 11, 2024

View reviewed changes

e-kwsm and others added 22 commits June 12, 2024 11:17

[libc++] Fix invalid escape sequences in Python comments (llvm#94032)

89e1180

[AArch64] fix Windows buildbot failure

3e1ccd7

Introduced by 2cf1439 (llvm#94279). See also 6c369cf. The build system cannot track transitive dependencies on generated headers for some reason.

[libc++][test] Avoid -Wunused-variable warnings in mutex tests (llv…

3d6b2f3

…m#94907) After enhancing MSVC's STL to statically initialize our condition_variable, Clang began noticing that some mutexes in the test suite were unused.

[libc] Add WordTypeSelector<16> specialization (llvm#94979)

982dc4d

Revert 291b415 : [Misc] Use LLVM_ENABLE_ABI_BREAKING_CHECKS correct…

32016ef

…ly (llvm#94212) The patch didn't consistently clean up `#ifdef LLVM_ENABLE_ABI_BREAKING_CHECKS` and '#if defined(LLVM_ENABLE_ABI_BREAKING_CHECKS)' paths, causing a lot of build failures

Restore 'REQUIRES: shell' for some tests after 878deae

226d7ac

Otherwise these tests would fail when using gnuwin32.

Revert "[Misc] Use LLVM_ENABLE_ABI_BREAKING_CHECKS correctly" (llvm…

06a008f

…#94982) Reverts llvm#94212 Some codes assume that `NDEBUG` implies `LLVM_ENABLE_ABI_BREAKING_CHECKS`, thus llvm#94212 breaks some build bots.

AMDGPU: Fix using useless cachepolicy argument in buffer atomic test

8828fb4

This was probably intended to test the immediate offset addressing mode. Add some tests to check immediate offsets, and clean up run lines.

AMDGPU: Simplify some tests by not using amdgpu_ps

3c5cde1

Since inreg now works for the default calling convention, we can check the output with fewer argument shuffling instructions.

[libc][math][c23] Temporarily disable float16 on RISC-V (llvm#94984)

8c85a91

See Buildbot failures: - https://lab.llvm.org/buildbot/#/builders/257/builds/13450 - https://lab.llvm.org/buildbot/#/builders/256/builds/14531

[bazel][libc]Fix test bazel for 6b21e17

828528a

AMDGPU: Do not assert on v6x16 buffer load intrinsics (llvm#94966)

cedf266

Just use the original type and let it hit a standard legalization error.

[InstCombine] fold ldexp(x, zext(i1 y)) to `fmul x, (select y, 2.0,…

60c8eaf

… 1.0)` (llvm#94887) close: llvm#92538

[AMDGPU] Restore non-buffer atomic tests lost in llvm#93801 (llvm#94978)

9a23d26

[lldb] Fix TestModuleLoadedNotifys API test to work correctly on most…

a02f6ca

… of Linux targets (llvm#94672) The different build configuration and target Linux system can load a different number of .so libraries. Add and check own libraries.

[libc][math][c23] Add {remainder,remquo}f16 C23 math functions (llvm#…

cf39856

…94773) Part of llvm#93566.

[clang][OpenMP][NFC] Remove unnecessary nullptr check (llvm#94680)

7830792

Static verifier reports unchecked use of pointer after explicitly checking earlier in the function. It appears the pointer won't be a nullptr, so remove the unneeded check for consistency.

NagyDonat and others added 6 commits June 12, 2024 11:18

[libc++] Fix endianness for algorithm mismatch (llvm#93082)

6584372

This PR is required to fix `std/algorithms/alg.nonmodifying/mismatch/mismatch.pass.cpp` test for big endian platrofrms such as z/OS.

Revert "[MLIR][Flang][DebugInfo] Set debug info format in MLIR->IR tr…

20b7309

…anslation (llvm#95098)" Reverted due to failure on buildbot due to missing use of the WriteNewDbgInfoFormat flag in MLIR. This reverts commit ca920bb.

[AMDGPU] Document amdgpu-as in AMDGPUUsage (llvm#94335)

a708fb2

Add a section about fence & address spaces that covers amdgpu-as.

fix for mve

74fcd2d

Lukacma requested review from rupprecht, keith, aaupov, maksfb, rafaelauler, ayermolo, dcci, aartbik, PeimingLiu, yinying-lisa-li, matthias-springer, antiagainst, kuhar, JDevlieghere, tbaederr, nikic and a team as code owners June 12, 2024 11:21

Lukacma closed this Jun 12, 2024

ldionne removed request for a team, ayermolo, tbaederr and yinying-lisa-li June 12, 2024 15:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Clang] Remove preprocessor guards and global feature checks for NEON #95102

[Clang] Remove preprocessor guards and global feature checks for NEON #95102

Lukacma commented Jun 11, 2024

llvmbot commented Jun 11, 2024

CarolineConcatto Jun 11, 2024

Lukacma Jun 11, 2024

momchil-velikov Jun 11, 2024

[Clang] Remove preprocessor guards and global feature checks for NEON #95102

[Clang] Remove preprocessor guards and global feature checks for NEON #95102

Conversation

Lukacma commented Jun 11, 2024

llvmbot commented Jun 11, 2024

CarolineConcatto Jun 11, 2024

Choose a reason for hiding this comment

Lukacma Jun 11, 2024

Choose a reason for hiding this comment

momchil-velikov Jun 11, 2024

Choose a reason for hiding this comment