clang-p2996

Author	SHA1	Message	Date
Alex Richardson	9114ac67a9	Overload all llvm.annotation intrinsics for globals argument The global constant arguments could be in a different address space than the first argument, so we have to add another overloaded argument. This patch was originally made for CHERI LLVM (where globals can be in address space 200), but it also appears to be useful for in-tree targets as can be seen from the test diffs. Differential Revision: https://reviews.llvm.org/D138722	2022-12-07 18:29:18 +00:00
David Sherwood	bfb6f47e9e	[SVE] Change some bfloat lane intrinsics to use i32 immediates Almost all of the other SVE LLVM IR intrinsics take i32 values for lane indices or other immediates. We should bring the bfloat intrinsics in line with that. It will also make it easier to add support for the SVE2.1 float intrinsics in future, since they reuse the same underlying instruction classes. I've maintained backwards compatibility with the old i64 variants and used the autoupgrade mechanism. Differential Revision: https://reviews.llvm.org/D138788	2022-12-07 09:19:54 +00:00
Qiu Chaofan	62f20f51ce	[PowerPC] Support test data class intrinsic of 128-bit float We've exploited test data class instructions introduced in ISA 3.0. This change unifies the scalar intrinsics into ppc_test_data_class and add support for 128-bit precision float values using xststdcqp. Vector versions of the intrinsic can't be unified because they return vector int instead of int. Reviewed By: shchenz Differential Revision: https://reviews.llvm.org/D138105	2022-12-07 16:44:12 +08:00
David Blaikie	6ab6085c77	Revert "DebugInfo: Add/support new DW_LANG codes for recent C and C++ versions" Some buildbots are failing in Clang and LLDB tests. (I guess the LLDB failure is due to the explicit C language tests in DwarfUnit.cpp that need to be updated - not sure what the Clang failures are about, they seem to be still emitting C99 when we're expecting C11 and I checked those tests pass... maybe systems with a different C language version default?) This reverts commit `3c312e48f3`.	2022-12-06 22:52:47 +00:00
Paul Robinson	fe21126112	[Windows] Convert tests to check 'target=...' Part of the project to eliminate special handling for triples in lit expressions.	2022-12-06 13:15:48 -08:00
David Blaikie	3c312e48f3	DebugInfo: Add/support new DW_LANG codes for recent C and C++ versions This may be a breaking change for consumers if they're trying to detect if code is C or C++, since it'll start using new codes that they may not be ready to recognize, in which case they may fall back to non-C handling. Differential Revision: https://reviews.llvm.org/D138597	2022-12-06 21:11:08 +00:00
Jonas Paulsson	f926826c2e	[SystemZ] Add "REQUIRES: systemz-registered-target" on test. The clang test that emits assembly needs this line as well.	2022-12-06 13:38:48 -06:00
Jonas Paulsson	481bb44baa	[SystemZ] Emit a .gnu_attribute for an externally visible vector abi. On SystemZ, the vector ABI changes depending on the presence of hardware vector support. Therefore, each binary compiled with a visible vector ABI (e.g. one that calls an external function with a vector argument) should be marked with a .gnu_attribute describing this. Reviewed By: uweigand Differential Revision: https://reviews.llvm.org/D105067	2022-12-06 12:53:40 -06:00
Paul Robinson	26e50c4c4d	[ARM/Darwin] Convert tests to check 'target=' Part of the project to eliminate special handling for triples in lit expressions.	2022-12-06 06:58:39 -08:00
Archibald Elliott	83b3304dd2	[AArch64] Implement __arm_rsr128/__arm_wsr128 This only contains the SelectionDAG implementation. GlobalISel to follow. The broad approach is: - Introduce new builtins for 128-bit wide instructions. - Lower these to @llvm.read_register.i128/@llvm.write_register.i128 - Introduce target-specific ISD nodes which have legal operands (two i64s rather than an i128). These are named AArch64::{MRRS, MSRR} to match the instructions they are for. These are a little complex as they need to match the "shape" of what they're replacing or the legaliser complains. - Select these using the existing tryReadRegister/tryWriteRegister to share the MDString parsing code, and introduce additional code to ensure these are selected into the right MRRS/MSRR instructions. What makes this hard is ensuring that the two i64s end up in an XSeqPair register pair, because SelectionDAG doesn't care that much about register classes if it can avoid doing so. The main change to existing code is the reorganisation of tryReadRegister and tryWriteRegister to try to keep the string parsing code separate from the instruction creating code. This also includes the changes to clang to define and use the ACLE feature macro named `__ARM_FEATURE_SYSREG128`. Contributors: Sam Elliott Lucas Prates Differential Revision: https://reviews.llvm.org/D139086	2022-12-06 11:39:05 +00:00
Vitaly Buka	166c8cccde	[msan][CodeGen] Set noundef for C return value Msan needs noundef consistency between interface and implementation. If we call C++ from C we can have noundef on C++ side, and no noundef on caller C side, noundef implementation will not set TLS for return value, no noundef caller will expect it. Then we have false reports in msan. The workaround could be set TLS to zero even for noundef return values. However if we do that always it will increase binary size by about 10%. If we do that selectively we need to handle "address is taken" functions, any non local functions, and probably all function which have musttail callers. Which is still a lot. The existing implementation of HasStrictReturn refers to C standard as the reason not enforcing noundef. I believe it applies only to the case when return statement is omitted. Testing on Google codebase I never see such cases, however I've see tens of cases where C code returns actual uninitialized variables, but we ignore that it because of "omitted return" case. So this patch will: 1. fix false-positives with TLS missmatch. 2. detect bugs returning uninitialized variables for C as well. 3. report "omitted return" cases stricter than C, which is already a warning and very likely a bug in a code anyway. Reviewed By: kda Differential Revision: https://reviews.llvm.org/D139296	2022-12-05 22:58:29 -08:00
Freddy Ye	def720726b	[X86][clang] Lift _BitInt() supported max width. Reviewed By: mgehre-amd Differential Revision: https://reviews.llvm.org/D139170	2022-12-06 11:02:27 +08:00
Matt Arsenault	0b01e3d0ae	clang: Convert builtins test to opaque pointers	2022-12-05 09:01:52 -05:00
John McIver	553bdf4fde	[NFC][clang] Strengthen checks in matrix-type-operators.c * Add tbaa attribute checks * Add end-of-line check to load instructions	2022-12-05 10:13:35 +00:00
Vitaly Buka	e92fe7af3f	[test][msan] Update for noundef on retval	2022-12-04 22:47:56 -08:00
Weining Lu	47edc70866	[LoongArch] Specify registers used for exception handling See definition in backend D134709 and the doc [1] for more detail. With the benefit of this change, most libcxx and libcxxabi tests pass. [1]: https://llvm.org/docs/ExceptionHandling.html Reviewed By: xen0n, wangleiat Differential Revision: https://reviews.llvm.org/D139177	2022-12-05 11:42:41 +08:00
Vitaly Buka	9e8787821f	[test][CodeGen] Check noundef for omited return	2022-12-04 19:10:17 -08:00
Vitaly Buka	262d6d495c	[test][CodeGen] Check noundef for return value	2022-12-04 19:10:17 -08:00
Fangrui Song	eecb22d8e1	[SanitizerBinaryMetadata] Use weak __start_/__stop_ instead of dummy empty section D130887 uses a dummy empty section `sanmd_covered` (with the SHF_GNU_RETAIN flag on ELF) to prevent `undefined symbol: __start_sanmd_covered` if all `sanmd_covered` are discarded by `ld --gc-sections` (in `-z start-stop-gc` mode). The dummy `sanmd_covered` does not have the SHF_LINK_ORDER flag, so mixing it with SHF_LINK_ORDER `sanmd_covered` causes an issue to GNU ld<2.36 (https://sourceware.org/bugzilla/show_bug.cgi?id=26256). Similar to D98903 for SanitizerCoverage, let's make encapsulation symbols undefined weak[1]. This additionally avoids size cost due to the dummy section and symbol. [1]: https://maskray.me/blog/2021-01-31-metadata-sections-comdat-and-shf-link-order Reviewed By: melver Differential Revision: https://reviews.llvm.org/D139276	2022-12-04 15:06:34 -08:00
John McIver	ee13633c46	[NFC][clang] Strengthen checks in avx512fp16-builtins.c * Add end-of-line check to load instructions	2022-12-04 14:57:43 +00:00
John McIver	2389488437	[NFC][clang] Strengthen checks in avx512f-builtins.c * Add check to unnamed portion of nontemporal attribute * Add end-of-line check to load instructions	2022-12-04 14:55:41 +00:00
Paul Robinson	64e4d03c68	[lit][AIX] Convert clang tests to use 'target={{.}}-aix{{.}}' Part of the project to eliminate special handling for triples in lit expressions. Differential Revision: https://reviews.llvm.org/D137437	2022-12-02 09:44:15 -08:00
Xiang1 Zhang	94c5df8a76	[AMX] Support AMX-FP16 new intrinsic interface We support AMX-FP16 isa in https://reviews.llvm.org/D135941 now. The old intrinsic interface need to manually write tile registers. So we support its new intrinsic interface to let it be able to do register allocation. Reviewed By: LuoYuanke Differential Revision: https://reviews.llvm.org/D138987	2022-12-01 09:47:53 +08:00
gonglingqin	624401612c	[LoongArch] Add remaining intrinsics for CRC check instructions After D137316 implements the intrinsics of the first crc check instruction and related diagnosis, this patch implements the intrinsics of all remaining crc check instructions. Differential Revision: https://reviews.llvm.org/D138418	2022-12-01 09:40:50 +08:00
Paul Robinson	2fbcf8b9b3	[Hexagon] Convert tests to check 'target=hexagon-.*' Part of the project to eliminate special handling for triples in lit expressions.	2022-11-30 13:36:10 -08:00
Henrik G. Olsson	8fa2e93538	[clang] Do not merge traps in functions annotated optnone This aligns the behaviour with that of disabling optimisations for the translation unit entirely. Not merging the traps allows us to keep separate debug information for each, improving the debugging experience when finding the cause for a ubsan trap. Differential Revision: https://reviews.llvm.org/D137714	2022-11-30 15:06:32 +01:00
Bjorn Pettersson	076cda0aaa	[clang][CodeGen] Switch tests to use opt -passes	2022-11-28 12:12:49 +01:00
Ayke van Laethem	131cddcba2	[AVR] Fix broken bitcast for aliases in non-zero address space This was triggered by some code in picolibc. The minimal version looks like this: double infinity(void) { return 5; } extern long double infinityl() __attribute__((__alias__("infinity"))); These two declarations have a different type (not because of the 'long double', which is also 'double' in IR, but because infinityl has variadic parameters). This led to a crash in the bitcast which assumed address space 0. Differential Revision: https://reviews.llvm.org/D138681	2022-11-27 15:27:42 +01:00
Alex Richardson	54ad4d2dd1	Drop redundant pipe to opt -instnamer in clang tests This used to be required, but the difference between asserts/!asserts builds no longer exists for %clang_cc1 (only for %clang), so they pass just fine without this flag.	2022-11-25 11:34:55 +00:00
Sami Tolvanen	5a3d6ce956	[Clang][Driver] Add KCFI to SupportsCoverage Allow `-fsanitize=kcfi` to be enabled with `-fsanitize-coverage=` modes such as `trace-{pc,cmp}`. Link: https://github.com/ClangBuiltLinux/linux/issues/1743 Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D138458	2022-11-22 18:20:04 +00:00
KAWASHIMA Takahiro	3a95d7d098	[clang] Fix -fp-model={strict\|precise} to disable -fapprox-func `-fapprox-func` should be disabled by `-fp-model={strict\|precise}`, as well as other fast-math flags. See the last changes in `clang/test/Driver/fp-model.c`. Probably this route (`case options::OPT_ffp_model_EQ`) was forgot to update in D106191 and D114564. There is no appropriate reason not to disable the flag. This commit also updates other regression tests, which are not directly related to this bug, for consistency with other fast-math flags. Differential Revision: https://reviews.llvm.org/D138109	2022-11-22 13:04:26 +09:00
Thomas Lively	ae96b5bd2d	[WebAssembly] Update relaxed-simd instruction names Including builtin and intrinsic names. These should be the final names for the proposal. https://github.com/WebAssembly/relaxed-simd/blob/main/proposals/relaxed-simd/Overview.md Reviewed By: aheejin, maratyszcza Differential Revision: https://reviews.llvm.org/D138249	2022-11-21 12:40:15 -08:00
Nathan Sidwell	eff9d72b9b	[clang] NFC: Robustify sret test regex Replace old-style, brittle, grep with new-fangled FileCheck technology. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D137941	2022-11-21 14:20:47 -05:00
John Brawn	9e3264ab20	[FPEnv] Enable strict fp for AArch64 in clang The AArch64 target now has the necessary support for strict fp, so enable it in clang. Differential Revision: https://reviews.llvm.org/D138143	2022-11-21 16:02:54 +00:00
gonglingqin	c2ec455f18	[LoongArch] Add intrinsics for ibar, break and syscall Diagnostics for intrinsic input parameters have also been added. Differential Revision: https://reviews.llvm.org/D138094	2022-11-21 09:31:26 +08:00
yronglin	80f444646c	[CodeGen][ARM] Fix ARMABIInfo::EmitVAAarg crash with empty record type variadic arg Fix ARMABIInfo::EmitVAAarg crash with empty record type variadic arg Open issue: https://github.com/llvm/llvm-project/issues/58794 Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D138137	2022-11-19 15:14:10 +08:00
Xing Xue	fa7477eb87	[Clang][CodeGen][AIX] Map __builtin_frexpl, __builtin_ldexpl, and __builtin_modfl to 'double' version lib calls in 64-bit 'long double' mode Summary: AIX library functions frexpl(), ldexpl(), and modfl() are for 128-bit IBM long double, i.e. __ibm128. Other *l() functions, e.g., acosl(), are for 64-bit long double. The AIX Clang compiler currently maps builtin functions __builtin_frexpl(), __builtin_ldexpl(), and __builtin_modfl() to frexpl(), ldexpl(), and modfl() in 64-bit long double mode which results in seg-faults or incorrect return values. This patch changes to map __builtin_frexpl(), __builtin_ldexpl(), and __builtin_modfl() to double version lib functions frexp(), ldexp() and modf() in 64-bit long double mode. Reviewed by: hubert.reinterpretcast, daltenty Differential Revision: https://reviews.llvm.org/D137986	2022-11-18 11:36:56 -05:00
Alexander Shaposhnikov	f102fe7304	Revert "Revert "[opt][clang] Enable using -module-summary/-flto=thin with -S/-emit-llvm"" This reverts commit `7f608a2497` and removes the dependency of Object on IRPrinter.	2022-11-18 08:58:31 +00:00
Mikhail Goncharov	7f608a2497	Revert "[opt][clang] Enable using -module-summary/-flto=thin with -S/-emit-llvm" This reverts commit `34ab474348`. as it has introduced circular dependency lib - analysis	2022-11-18 09:25:45 +01:00
Alexander Shaposhnikov	34ab474348	[opt][clang] Enable using -module-summary/-flto=thin with -S/-emit-llvm Enable using -module-summary with -S (similarly to what currently can be achieved with opt <input> -o - \| llvm-dis). This is a recommit of `ef9e62469`. Test plan: ninja check-all Differential revision: https://reviews.llvm.org/D137768	2022-11-18 05:04:07 +00:00
Qiu Chaofan	cab9c02bd9	[Clang] Fix behavior of -ffp-model option when overriden -ffp-model=strict -ffp-model=fast will still enable strict exception handling behavior, therefore clang still emits constrained FP operations in IR. -ffp-model=fast -ffp-model=strict emits two warnings: one for strict overriding fast, the other for strict overriding strict, which is confusing. Reviewed By: zahiraam Differential Revision: https://reviews.llvm.org/D137618	2022-11-18 10:34:41 +08:00
Craig Topper	c9320bc871	[X86] Use correctly sized floating point literals in *zero_ps/pd. This avoids depending on int->float or double->float conversion. Improving codegen with #pragma STDC FENV_ACCESS ON. Really we should improve constant folding somewhere, but this was a cheap and easy improvement. Fixes PR59052.	2022-11-17 14:28:52 -08:00
Roman Lebedev	8adfa29706	[Pipelines] Introduce SROA after (final, run-time) loop unrolling Now that we are done with loop unrolling, be it either by LoopVectorizer, or LoopUnroll passes, some variable-offset GEP's into alloca's could have become constant-offset, thus enabling SROA and alloca promotion, yet we don't capitalize on that, which is surprizing. While it would be good to not introduce one more SROA invocation, but instead move the one from `PassBuilder::buildFunctionSimplificationPipeline()`, the existing test coverage says that is a bad idea, though it would be fine compile-time wise: https://llvm-compile-time-tracker.com/compare.php?from=b150d34c47efbd8fa09604bce805c0920360f8d7&to=5a9a5c855158b482552be8c7af3e73d67fa44805&stat=instructions So instead, i add yet another SROA run. I have checked, and it needs to be at least after said final loop unrolling. This is still fine compile-time wise: https://llvm-compile-time-tracker.com/compare.php?from=70324cd88328c0924e605fa81b696572560aa5c9&to=fb489bbef687ad821c3173a931709f9cad9aee8a&stat=instructions I've encountered this in a real code, `SROA-after-final-loop-unrolling.ll` has been reduced from https://godbolt.org/z/fsdMhETh3 Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D136806	2022-11-17 21:31:30 +03:00
Alex Brachet	0dff945bbc	Fix debug-info test	2022-11-17 16:02:54 +00:00
Ben Shi	84ef723573	[clang] Fix wrong ABI of AVRTiny. A scalar which exceeds 4 bytes should be returned via a stack slot, on an AVRTiny device. Reviewed By: aykevl Differential Revision: https://reviews.llvm.org/D138125	2022-11-17 08:38:44 +08:00
gonglingqin	ddbb21bdb5	[LoongArch] Add immediate operand validity check for __builtin_loongarch_dbar Differential Revision: https://reviews.llvm.org/D137809	2022-11-16 14:47:45 +08:00
Michele Scandale	b7d7c448df	Fix `unsafe-fp-math` attribute emission. The conditions for which Clang emits the `unsafe-fp-math` function attribute has been modified as part of `84a9ec2ff1ee97fd7e8ed988f5e7b197aab84a7`. In the backend code generators `"unsafe-fp-math"="true"` enable floating point contraction for the whole function. The intent of the change in `84a9ec2ff1ee97fd7e8ed988f5e7b197aab84a7` was to prevent backend code generators performing contractions when that is not expected. However the change is inaccurate and incomplete because it allows `unsafe-fp-math` to be set also when only in-statement contraction is allowed. Consider the following example ``` float foo(float a, float b, float c) { float tmp = a * b; return tmp + c; } ``` and compile it with the command line ``` clang -fno-math-errno -funsafe-math-optimizations -ffp-contract=on \ -O2 -mavx512f -S -o - ``` The resulting assembly has a `vfmadd213ss` instruction which corresponds to a fused multiply-add. From the user perspective there shouldn't be any contraction because the multiplication and the addition are not in the same statement. The optimized IR is: ``` define float @test(float noundef %a, float noundef %b, float noundef %c) #0 { %mul = fmul reassoc nsz arcp afn float %b, %a %add = fadd reassoc nsz arcp afn float %mul, %c ret float %add } attributes #0 = { [...] "no-signed-zeros-fp-math"="true" "no-trapping-math"="true" [...] "unsafe-fp-math"="true" } ``` The `"unsafe-fp-math"="true"` function attribute allows the backend code generator to perform `(fadd (fmul a, b), c) -> (fmadd a, b, c)`. In the current IR representation there is no way to determine the statement boundaries from the original source code. Because of this for in-statement only contraction the generated IR doesn't have instructions with the `contract` fast-math flag and `llvm.fmuladd` is being used to represent contractions opportunities that occur within a single statement. Therefore `"unsafe-fp-math"="true"` can only be emitted when contraction across statements is allowed. Moreover the change in `84a9ec2ff1ee97fd7e8ed988f5e7b197aab84a7` doesn't take into account that the floating point math function attributes can be refined during IR code generation of a function to handle the cases where the floating point math options are modified within a compound statement via pragmas (see `CGFPOptionsRAII`). For consistency `unsafe-fp-math` needs to be disabled if the contraction mode for any scope/operation is not `fast`. Similarly for consistency reason the initialization of `UnsafeFPMath` of in `TargetOptions` for the backend code generation should take into account the contraction mode as well. Reviewed By: zahiraam Differential Revision: https://reviews.llvm.org/D136786	2022-11-14 20:40:57 -08:00
Roman Lebedev	b2fbafc911	[NFC][Clang] Autogenerate checklines in a test being affected by a patch	2022-11-15 03:51:24 +03:00
Fangrui Song	77bf0df376	Revert "[opt][clang] Enable using -module-summary/-flto=thin with -S/-emit-llvm" This reverts commit `bf8381a8bc`. There is a layering violation: LLVMAnalysis depends on LLVMCore, so LLVMCore should not include LLVMAnalysis header llvm/Analysis/ModuleSummaryAnalysis.h	2022-11-14 15:51:03 -08:00
Alexander Shaposhnikov	bf8381a8bc	[opt][clang] Enable using -module-summary/-flto=thin with -S/-emit-llvm Enable using -module-summary with -S (similarly to what currently can be achieved with opt <input> -o - \| llvm-dis). This is a recommit of `ef9e62469`. Test plan: ninja check-all Differential revision: https://reviews.llvm.org/D137768	2022-11-14 23:24:08 +00:00

1 2 3 4 5 ...

7890 Commits