clang-p2996

Author	SHA1	Message	Date
Tomas Matheson	7bd17212ef	Re-land "[AArch64] Codegen support for FEAT_PAuthLR" (#75947 ) This reverts commit `9f0f558742`. Fix expensive checks failure by properly marking register def for ADR.	2023-12-21 18:32:55 +00:00
Tomas Matheson	9f0f558742	Revert "[AArch64] Codegen support for FEAT_PAuthLR" This reverts commit `5992ce90b8`. Builtbot failures with expensive checks enabled.	2023-12-21 16:25:55 +00:00
Tomas Matheson	5992ce90b8	[AArch64] Codegen support for FEAT_PAuthLR - Adds a new +pc option to -mbranch-protection that will enable the use of PC as a diversifier in PAC branch protection code. - When +pauth-lr is enabled (-march=armv9.5a+pauth-lr) in combination with -mbranch-protection=pac-ret+pc, the new 9.5-a instructions (pacibsppc, retaasppc, etc) are used. Documentation for the relevant instructions can be found here: https://developer.arm.com/documentation/ddi0602/2023-09/Base-Instructions/ Co-authored-by: Lucas Prates <lucas.prates@arm.com>	2023-12-21 14:18:33 +00:00
Dimitry Andric	2c27013fa9	[clang] Add getClangVendor() and use it in CodeGenModule.cpp (#75935 ) In `9a38a72f1d` `ProductId` was assigned from the stringified value of `CLANG_VENDOR`, if that macro was defined. However, `CLANG_VENDOR` is supposed to be a string, as it is defined (optionally) as such in the top-level clang `CMakeLists.txt`. Furthermore, `CLANG_VENDOR` is only passed as a build-time define when compiling `Version.cpp`, so add a `getClangVendor()` function to `Version.h`, and use it in `CodegGenModule.cpp`, instead of relying on the macro. Fixes: `9a38a72f1d`	2023-12-20 20:09:39 +01:00
Dimitry Andric	5c1a41f8ad	Revert "[clang] Add getClangVendor() and use it in CodeGenModule.cpp (#75935 )" This reverts commit `9055519103`, due to an incorrectly chosen commit message.	2023-12-20 20:07:22 +01:00
Dimitry Andric	9055519103	[clang] Add getClangVendor() and use it in CodeGenModule.cpp (#75935 ) In `9a38a72f1d` `ProductId` was assigned from the stringified value of `CLANG_VENDOR`, if that macro was defined. However, `CLANG_VENDOR` is supposed to be a string, as it is defined (optionally) as such in the top-level clang `CMakeLists.txt`. Move the addition of `-DCLANG_VENDOR` to the compiler flags from `clang/lib/Basic/CMakeLists.txt` to the top-level `CMakeLists.txt`, so it is consistent across the whole clang codebase. Then remove the stringification from `CodeGenModule.cpp`, to make it work correctly. Fixes: `9a38a72f1d`	2023-12-20 20:03:19 +01:00
Eric Biggers	09058654f6	[RISCV] Remove experimental from Vector Crypto extensions (#74213 ) The RISC-V vector crypto extensions have been ratified. This patch updates the Clang and LLVM support for these extensions to be non-experimental, while leaving the C intrinsics as experimental since the C intrinsics are not yet standardized. Co-authored-by: Brandon Wu <brandon.wu@sifive.com>	2023-12-18 22:04:22 -08:00
Jessica Del	32f9983c06	[AMDGPU] - Add address space for strided buffers (#74471 ) This is an experimental address space for strided buffers. These buffers can have structs as elements and a stride > 1. These pointers allow the indexed access in units of stride, i.e., they point at `buffer[index * stride]`. Thus, we can use the `idxen` modifier for buffer loads. We assign address space 9 to 192-bit buffer pointers which contain a 128-bit descriptor, a 32-bit offset and a 32-bit index. Essentially, they are fat buffer pointers with an additional 32-bit index.	2023-12-15 15:49:25 +01:00
Kazu Hirata	f3dcc2351c	[clang] Use StringRef::{starts,ends}_with (NFC) (#75149 ) This patch replaces uses of StringRef::{starts,ends}with with StringRef::{starts,ends}_with for consistency with std::{string,string_view}::{starts,ends}_with in C++20. I'm planning to deprecate and eventually remove StringRef::{starts,ends}with.	2023-12-13 08:54:13 -08:00
Fangrui Song	1c830b787c	[Preprocessor] Define __GCC_HAVE_SYNC_COMPARE_AND_SWAP_16 for AArch64 (#74954 ) GCC sets `#define HAVE_atomic_compare_and_swapti 1` and therefore defines `__GCC_HAVE_SYNC_COMPARE_AND_SWAP_16`. Clang compiles the 16-byte legacy `__sync_bool_compare_and_swap` and new `__atomic_compare_exchange` compile to LDXP/STXP or (with LSE) CASP{,A,L,AL}. Link: https://github.com/llvm/llvm-project/issues/71883	2023-12-11 23:09:14 -08:00
Artem Belevich	631c6e834c	[CUDA] Add support for CUDA-12.3 and sm_90a (#74895 )	2023-12-11 12:18:28 -08:00
Philip Reames	99c0a3ea98	[RISCV] Enable target attribute when invoked through clang driver (#74889 ) `d80e46d` added support for the target function attribute. However, it turns out that commit has a nasty bug/oversight. As the tests in that revision show, everything works if clang -cc1 is directly invoked. I was suprised to learn this morning that compiling with clang (i.e. the typical user workflow) did not work. The bug is that if a set of explicit negative extensions is passed to cc1 at the command line (as the clang driver always does), we were copying these negative extensions to the end of the rewritten extension list. When this was later parsed, this had the effect of turning back off any extension that the target attribute had enabled. This patch updates the logic to only propagate the features from the input which don't appear in the rewritten form in either positive or negative form. Note that this code structure is still highly suspect. In particular I'm fairly sure that mixing extension versions with this code will result in odd results. However, I figure its better to have something which mostly works than something which doesn't work at all.	2023-12-11 08:55:21 -08:00
Dominik Adamski	276a024b49	[NFC][AMDGPU] Unify AMDGPU address space enum (#73944 ) Types of AMDGPU address space were defined not only in Clang-specific class but also in LLVM header. If we unify the AMD GPU address space enumeration, then we can reuse it in Clang, Flang and LLVM.	2023-12-11 10:45:21 +01:00
Craig Topper	bf9125294d	[RISCV] Remove unnecessary call to isSupportedExtensionFeature. hasExtension already checks if the extension is supported.	2023-12-09 14:38:03 -08:00
Kazu Hirata	cc4ecfd68b	[ADT] Rename SmallString::{starts,ends}with to {starts,ends}_with (#74916 ) This patch renames {starts,ends}with to {starts,ends}_with for consistency with std::{string,string_view}::{starts,ends}_with in C++20. Since there are only a handful of occurrences, this patch skips the deprecation phase and simply renames them.	2023-12-09 14:28:45 -08:00
Craig Topper	5c8755f9f4	[RISCV] Use Triple::isRISCV64(). NFC	2023-12-09 14:02:58 -08:00
Richard Dzenis	b3e6ff3319	[clang-cl] Add support for [[msvc::constexpr]] C++11 attribute (#71300 ) This commit introduces support for the MSVC-specific C++11-style attribute `[[msvc::constexpr]]`, which was introduced in MSVC 14.33. The semantics of this attribute are enabled only under MSVC compatibility (`-fms-compatibility-version`) 14.33 and higher. Additionally, the default value of `_MSC_VER` has been raised to 1433. The current implementation lacks support for: - `[[msvc::constexpr]]` constructors (see #72149); at the time of this implementation, such support would have required an unreasonable number of changes in Clang. - `[[msvc::constexpr]] return ::new` (constexpr placement new) from non-std namespaces (see #74924). Relevant to: #57696	2023-12-09 14:35:38 +04:00
Juergen Ributzka	5ad3a32c79	[clang][modules] Reset codegen options (take 2). (#74388 ) CodeGen options do not affect the AST, so they usually can be ignored. The only exception to the rule is when a PCM is created with `-gmodules`. In that case the Clang module format is switched to object file container and contains also serialized debug information that can be affected by debug options. There the following approach was choosen: 1.) Split out all the debug options into a separate `DebugOptions.def` file. The file is included by `CodeGenOptions.def`, so the change is transparent to all existing users of `CodeGenOptions.def`. 2.) Reset all CodeGen options, but excluding affecting debug options. 3.) Conditionally reset debug options that can affect the PCM. This fixes rdar://113135909.	2023-12-05 08:31:21 -08:00
Jonas Paulsson	c568927f3e	[SystemZ] Properly support 16 byte atomic int/fp types and ops. (#73134 ) - Clang FE now has MaxAtomicPromoteWidth / MaxAtomicInlineWidth set to 128, and now produces IR instead of calls to __atomic instrinsics for 16 bytes as well. - Atomic __int128 (and long double) variables are now aligned to 16 bytes by default (like gcc 14). - AtomicExpand pass now expands 16 byte operations as well. - tests for __atomic builtins for all integer widths, and __atomic_is_lock_free with friends. - TODO: AtomicExpand pass handles with this patch expansion of i128 atomicrmw:s. As a next step smaller integer types should also be possible to handle this way instead of by the backend.	2023-12-05 17:17:21 +01:00
Juergen Ributzka	1157bee5ce	Revert "[clang][modules] Reset codegen options. (#74006 )" This reverts commit `fef1854318`.	2023-12-04 14:28:22 -08:00
Juergen Ributzka	fef1854318	[clang][modules] Reset codegen options. (#74006 ) CodeGen options do not affect the AST, so they usually can be ignored. The only exception to the rule is when a PCM is created with `-gmodules`. In that case the Clang module format is switched to object file container and contains also serialized debug information that can be affected by debug options. There the following approach was choosen: 1.) Split out all the debug options into a separate `DebugOptions.def` file. The file is included by `CodeGenOptions.def`, so the change is transparent to all existing users of `CodeGenOptions.def`. 2.) Reset all CodeGen options, but excluding affecting debug options. 3.) Conditionally reset debug options that can affect the PCM. This fixes rdar://113135909.	2023-12-04 13:54:57 -08:00
Shengchen Kan	6d6baef5c9	[X86] Support CFE flags for APX features (#74199 ) Positive options: -mapx-features=<comma-separated-features> Negative options: -mno-apx-features=<comma-separated-features> -m[no-]apx-features is designed to be able to control separate APX features. Besides, we also support the flag -m[no-]apxf, which can be used like an alias of -m[no-]apx-features=< all APX features covered by CPUID APX_F> Behaviour when positive and negative options are used together: For boolean flags, the last one wins -mapxf -mno-apxf -> -mno-apxf -mno-apxf -mapxf -> -mapxf For flags that take a set as arguments, it sets the mask by order of the flags -mapx-features=egpr,ndd -mno-apx-features=egpr -> -egpr,+ndd -mapx-features=egpr -mno-apx-features=egpr,ndd -> -egpr,-ndd -mno-apx-features=egpr -mapx-features=egpr,ndd -> +egpr,+ndd -mno-apx-features=egpr,ndd -mapx-features=egpr -> -ndd,+egpr The design is aligned with gcc https://gcc.gnu.org/pipermail/gcc-patches/2023-August/628905.html	2023-12-04 19:22:56 +08:00
Philip Reames	e817966718	[RISCV] Collapse fast unaligned access into a single feature [nfc-ish] (#73971 ) When we'd originally added unaligned-scalar-mem and unaligned-vector-mem, they were separated into two parts under the theory that some processor might implement one, but not the other. At the moment, we don't have evidence of such a processor. The C/C++ level interface, and the clang driver command lines have settled on a single unaligned flag which indicates both scalar and vector support unaligned. Given that, let's remove the test matrix complexity for a set of configurations which don't appear useful. Given these are internal feature names, I don't think we need to provide any forward compatibility. Anyone disagree? Note: The immediate trigger for this patch was finding another case where the unaligned-vector-mem wasn't being properly serialized to IR from clang which resulted in problems reproducing assembly from clang's -emit-llvm feature. Instead of fixing this, I decided getting rid of the complexity was the better approach.	2023-12-01 11:00:59 -08:00
Craig Topper	0123608822	[RISCV] Minor improvements/cleanup to target attribute handling. NFC (#73851 ) Use ArrayRef to avoid a vector copy. Replace a push_back loop with a call to std::vector::insert.	2023-11-29 12:57:48 -08:00
Sunil Kuravinakop	d033f51a0a	[OpenMP] atomic compare fail : Parser & AST support Diff Revision: https://reviews.llvm.org/D123235	2023-11-26 13:34:34 -06:00
Kadir Cetinkaya	076ec9f5f5	Fix build failure on certain bots	2023-11-24 11:39:16 +01:00
Piyou Chen	d80e46da7d	[RISCV] Support target attribute for function The proposal of target attribute is https://github.com/riscv-non-isa/riscv-c-api-doc/pull/35 This patch implements it by emitting .option arch during codegen. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D151730	2023-11-23 23:05:21 -08:00
Jay Foad	cf1e0c0b07	[AMDGPU] Define new targets gfx1200 and gfx1201 (#73133 ) Define target names and ELF numbers for new GFX12 targets gfx1200 and gfx1201. For now they behave identically to GFX11.	2023-11-23 16:44:05 +00:00
Krzysztof Parzyszek	ddfed815c9	Revert "[OpenMP] atomic compare fail : Parser & AST support" This reverts commit `edd675ac28`. This breaks clang build where every component is a shared library. The file clang/lib/Basic/OpenMPKinds.cpp, which is a part of libclangBasic.so, uses `getOpenMPClauseName` which isn't: /usr/bin/ld: CMakeFiles/obj.clangBasic.dir/OpenMPKinds.cpp.o: in functio n `clang ::getOpenMPSimpleClauseTypeName(llvm::omp::Clause, unsigned int )': OpenMPKinds.cpp:(.text._ZN5clang29getOpenMPSimpleClauseTypeNameEN4llvm3o mp6ClauseEj+0x9b): undefined reference to `llvm::omp::getOpenMPClauseNam e(llvm::omp::Clause)'	2023-11-20 10:48:06 -06:00
Sunil Kuravinakop	edd675ac28	[OpenMP] atomic compare fail : Parser & AST support Diff Revision: https://reviews.llvm.org/D123235	2023-11-20 03:05:31 -06:00
Matthew Devereau	cdf6693f07	[AArch64][SME] Add support for sme-fa64 (#70809 )	2023-11-20 08:37:52 +00:00
Qiu Chaofan	d572c4cdef	[PowerPC] Disable float128 on AIX in Clang (#67298 ) PowerPC AIX backend does not support float128 at all. Diagnose even when specifying -mfloat128 to avoid backend crash. --------- Co-authored-by: Kai Luo <gluokai@gmail.com>	2023-11-20 15:20:57 +08:00
Brad Smith	23c47eba87	[Driver] Enable __float128 support on X86 on FreeBSD / NetBSD (#72788 )	2023-11-19 03:00:05 -05:00
Egor Zhdan	f049395fc8	[APINotes] Upstream APINotesManager This upstreams more of the Clang API Notes functionality that is currently implemented in the Apple fork: https://github.com/apple/llvm-project/tree/next/clang/lib/APINotes	2023-11-17 13:28:51 +00:00
Lucas Duarte Prates	59b2301508	[AArch64] Introduce the Armv9.5-A architecture version (#72392 ) This introduces the Armv9.5-A architecture version, including the relevant command-line option for -march. Mode details about the Armv9.5-A architecture version can be found at: * https://community.arm.com/arm-community-blogs/b/architectures-and-processors-blog/posts/arm-a-profile-architecture-developments-2023 * https://developer.arm.com/documentation/ddi0602/2023-09/ Patch by Oliver Stannard.	2023-11-16 15:38:32 +00:00
Phoebe Wang	e96eddec5e	Reland "[X86][AVX10] Fix a bug when using -march with no-evex512 attribute (#72126 )" Fixes #72106	2023-11-14 15:39:30 +08:00
Phoebe Wang	17dd0c70c8	Revert "[X86][AVX10] Fix a bug when using -march with no-evex512 attribute (#72126 )" This reverts commit `451c594bcb`. Revert due to buildbot fails.	2023-11-14 15:34:38 +08:00
Phoebe Wang	451c594bcb	[X86][AVX10] Fix a bug when using -march with no-evex512 attribute (#72126 ) #71318 failed to clear EVEX512 feature for intended intrinsics. Fixes #72106	2023-11-14 15:15:34 +08:00
PiJoules	2c65860667	[clang] Remove fixed point arithmetic error (#71884 ) Prior to this, clang would always report ``` compile with '-ffixed-point' to enable fixed point types ``` whenever it sees `_Accum`, `_Fract`, or `_Sat` when fixed point arithmetic is not enabled. This can break existing code that uses these as variable names and doesn't use fixed point arithmetic like in some microsoft headers (https://github.com/llvm/llvm-project/pull/67750#issuecomment-1775264907). Fixed point should not raise this error for these cases, so this removes the error altogether and defaults to the usual error clang gives where it can see these keywords as either unknown types or regular variables.	2023-11-13 12:31:49 -08:00
yonghong-song	4e67234357	[Clang][BPF] Add __BPF_CPU_VERSION__ macro (#71856 ) Sometimes bpf developer might want to develop different codes based on particular cpu versioins. For example, cpu v1/v2/v3 branch target is 16bit while cpu v4 branch target is 32bit, thus cpu v4 allows more aggressive loop unrolling than cpu v1/v2/v3 (see [1] for a kernel selftest failure due to this). We would like to maintain aggressive loop unrolling for cpu v4 while limit loop unrolling for earlier cpu versions. Another example, signed divide also only available with cpu v4. Actually, adding cpu specific macros are fairly common in llvm. For example, x86 has maco like 'i486', '__pentium_mmx__', etc. AArch64 has '__ARM_NEON', '__ARM_FEATURE_SVE', etc. This patch added __BPF_CPU_VERSION__ macro. Current possible values are 0/1/2/3/4. The following are the -mcpu=... to __BPF_CPU_VERSION__ mapping: ``` cpu __BPF_CPU_VERSION__ no -mcpu=<...> 1 -mcpu=v1 1 -mcpu=v2 2 -mcpu=v3 3 -mcpu=v4 4 -mcpu=generic 1 -mcpu=probe 0 ``` This patch also added some macros for developers to identify some cpu insn features: ``` feature macro enabled in which cpu __BPF_FEATURE_JMP_EXT >= v2 __BPF_FEATURE_JMP32 >= v3 __BPF_FEATURE_ALU32 >= v3 __BPF_FEATURE_LDSX >= v4 __BPF_FEATURE_MOVSX >= v4 __BPF_FEATURE_BSWAP >= v4 __BPF_FEATURE_SDIV_SMOD >= v4 __BPF_FEATURE_GOTOL >= v4 __BPF_FEATURE_ST >= v4 ``` [1] https://lore.kernel.org/bpf/3e3a8a30-dde0-43a1-981e-2274962780ef@linux.dev/	2023-11-10 10:18:54 -08:00
Phoebe Wang	f229ba4e8d	[X86][AVX10] Permit AVX512 options/features used together with AVX10 (#71318 ) This patch relaxes the driver logic to permit combinations between AVX512 and AVX10 options and makes sure we have a unified behavior between options and features combination. Here are rules we are following when handle these combinations: 1. evex512 can only be used for avx512xxx options/features. It will be ignored if used without them; 2. avx512xxx and avx10.xxx are options in two worlds. Avoid to use them together in any case. It will enable a common super set when they are used together. E.g., "-mavx512f -mavx10.1-256" euqals "-mavx10.1-512". Compiler emits warnings when user using combinations like "-mavx512f -mavx10.1-256" in case they won't get unexpected result silently. Function target feature attribute follows the same rule now. We have to add "no-evex512" feature for intrinsics shared between AVX512 and AVX10. We also add "no-evex512" for early ISAs like AVX etc., because some of them are called by AVX512 intrinsics.	2023-11-10 15:21:05 +08:00
Mitch Phillips	a141a9fa97	Revert "[OpenMP] atomic compare fail : Parser & AST support" This reverts commit `086b65340c`. Reason: Broke under -Werror. More details in https://reviews.llvm.org/D123235	2023-11-08 11:20:17 +01:00
Sunil Kuravinakop	086b65340c	[OpenMP] atomic compare fail : Parser & AST support This is a support for " #pragma omp atomic compare fail ". It has Parser & AST support for now. Reviewed By: tianshilei1992, ABataev Differential Revision: https://reviews.llvm.org/D123235	2023-11-07 16:57:50 -06:00
Phoebe Wang	c78aeabaec	[X86] Add a EVEX256 macro to match with GCC and MSVC (#71317 )	2023-11-07 14:39:24 +08:00
Jan Svoboda	4f31d328aa	[clang] Improve `SourceManager::PrintStats()` This fixes a typo ("SLocEntry's" -> "SLocEntries"), fixes capitalization ("Sloc" -> "SLoc") and adds extra information (capacity in bytes of `LoadedSLocEntryTable`).	2023-11-06 14:45:04 -08:00
Paul Walker	de88371d9d	[LLVM][AArch64] Add ASM constraints for reduced GPR register ranges. (#70970 ) [LLVM][AArch64] Add ASM constraints for reduced GPR register ranges. The patch adds the follow ASM constraints: Uci => w8-w11/x8-x11 Ucj => w12-w15/x12-x15 These constraints are required for SME load/store instructions where a reduced set of GPRs are used to specify ZA array vectors. NOTE: GCC has agreed to use the same constraint syntax.	2023-11-03 15:34:45 +00:00
Jan Svoboda	a3efd892fa	[clang][modules] Don't prevent translation of FW_Private includes when explicitly building FW (#70714 ) We prevent translating `#include <FW/PrivateHeader.h>` into an import of FW_Private when compiling the implementation of FW or FW_Private. This is specified via `-fmodule-name=` on the TU command line (used to be `-fmodule-implementation-of`). This logic is supposed to only kick in when imported directly from a TU, but it currently also kicks in when compiling the public FW module explicitly (since it also has `-fmodule-name=` on the command line). This patch makes sure this logic only kicks in for the case that used to be `-fmodule-implementation-of` (for the TU), and not for all `-fmodule-name=` cases (especially for the explicit compile of a module). rdar://101051277; related: rdar://37500098&38434694	2023-11-01 12:00:54 -07:00
licongtian	8d4e35600f	[Clang][LoongArch] Support compiler options -mlsx/-mlasx for clang This patch adds compiler options -mlsx/-mlasx which enables the instruction sets of LSX and LASX, and sets related predefined macros according to the options.	2023-10-31 15:52:05 +08:00
Yusra Syeda	703895b131	[clang] Language to String function (#69487 ) This PR adds a function which converts the language to string. This is intended to be used by the z/OS target, see the patch here: https://github.com/llvm/llvm-project/pull/68926 --------- Co-authored-by: Yusra Syeda <yusra.syeda@ibm.com>	2023-10-27 17:22:49 -04:00
Brad Smith	15254eb740	[Driver] Clean up unused architecture related bits for *BSD's (#69809 ) - FreeBSD removed big-endian arm with 12.0. - OpenBSD never had big-endian arm support. I added it just in case, but it has never been used. - Remove sparcel bits. It was sprinkled in a few places but it will never be a thing. - Remove 32-bit sparc bits for FreeBSD. FreeBSD has never had 32-bit sparc support. - Remove sparc64 IAS test as support was enabled across the board awhile ago.	2023-10-26 14:44:12 -04:00

1 2 3 4 5 ...

4790 Commits