clang-p2996

Author	SHA1	Message	Date
David Sherwood	ceb6c23b70	[NFC][LoopVectorize] Explicitly disable tail-folding on some SVE tests This patch is in preparation for enabling vectorisation with tail-folding by default for SVE targets. Once we do that many existing tests will break that depend upon having normal unpredicated vector loops. For all such tests I have added the flag: -prefer-predicate-over-epilogue=scalar-epilogue Differential Revision: https://reviews.llvm.org/D129137	2022-07-21 15:23:00 +01:00
Qiu Chaofan	708084ec37	[PowerPC] Support x86 compatible intrinsics on AIX These headers used to be guarded only on PowerPC64 Linux or FreeBSD, but they can also be enabled for AIX OS target since it's big-endian ready. Reviewed By: shchenz Differential Revision: https://reviews.llvm.org/D129461	2022-07-21 16:33:41 +08:00
Chen Zheng	ecdeabef38	enable P10 vector builtins test on AIX 64 bit; NFC Verify that P10 vector builtins with type `vector signed __int128` and `vector unsigned __int128` work well on AIX 64 bit.	2022-07-21 03:51:30 -04:00
Arthur Eubanks	7e77d31af7	[test] Remove unnecessary -verify-machineinstrs=0 Issue #38784 seems to be fixed and removing these doesn't cause any issues.	2022-07-20 10:55:54 -07:00
Nicolai Hähnle	1ddc51d89d	Inliner: don't mark call sites as 'nounwind' if that would be redundant When F calls G calls H, G is nounwind, and G is inlined into F, then the inlined call-site to H should be effectively nounwind so as not to lose information during inlining. If H itself is nounwind (which often happens when H is an intrinsic), we no longer mark the callsite explicitly as nounwind. Previously, there were cases where the inlined call-site of H differs from a pre-existing call-site of H in F only in the explicitly added nounwind attribute, thus preventing common subexpression elimination. v2: - just check CI->doesNotThrow v3 (resubmit after revert at `3443788087`): - update Clang tests Differential Revision: https://reviews.llvm.org/D129860	2022-07-20 14:17:23 +02:00
Nicolai Hähnle	7af2818a99	Update some more tests with update_cc_test_checks.py	2022-07-20 13:27:18 +02:00
Fangrui Song	23ba688f02	[X86] Use Min behavior for cf-protection-{return,branch}/ibt-seal module flags These features require that all object files are compiled with the support. When the feature is disabled for an object file, the merge behavior should treat the file having a value of 0 (see D129911). Reviewed By: xiangzhangllvm Differential Revision: https://reviews.llvm.org/D130065	2022-07-19 21:20:02 -07:00
Nicolai Hähnle	3d2256cc75	Revert "Update some more tests with update_cc_test_checks.py" This reverts commit `9fb33d52b0`. Buildbots are showing a number of regressions that don't reproduce locally. Needs more investigating.	2022-07-19 16:39:57 +02:00
Nicolai Hähnle	9fb33d52b0	Update some more tests with update_cc_test_checks.py	2022-07-19 16:18:35 +02:00
Xiang1 Zhang	4bb19de4b6	[X86] Add 64 bit implement for __SSC_MARK Reviewed By: craig.topper, pengfei.wang, jinsong Differential Revision: https://reviews.llvm.org/D129826	2022-07-19 16:13:41 +08:00
Nicolai Hähnle	5fc6213551	Revert change to clang/test/CodeGen/arm_acle.c For some reason, update_cc_test_checks.py produced a failing test. Partial revert of `301011fa60`	2022-07-19 09:11:23 +02:00
Nicolai Hähnle	301011fa60	Rerun ./utils/update_cc_test.py on a bunch of tests Due to update script changes; this reduces the size of a later "real" diff.	2022-07-19 08:53:05 +02:00
serge-sans-paille	f764dc99b3	[clang] Introduce -fstrict-flex-arrays=<n> for stricter handling of flexible arrays Some code [0] consider that trailing arrays are flexible, whatever their size. Support for these legacy code has been introduced in `f8f6324983` but it prevents evaluation of __builtin_object_size and __builtin_dynamic_object_size in some legit cases. Introduce -fstrict-flex-arrays=<n> to have stricter conformance when it is desirable. n = 0: current behavior, any trailing array member is a flexible array. The default. n = 1: any trailing array member of undefined, 0 or 1 size is a flexible array member n = 2: any trailing array member of undefined or 0 size is a flexible array member This takes into account two specificities of clang: array bounds as macro id disqualify FAM, as well as non standard layout. Similar patch for gcc discuss here: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101836 [0] https://docs.freebsd.org/en/books/developers-handbook/sockets/#sockets-essential-functions	2022-07-18 12:45:52 +02:00
Fangrui Song	0d5a62faca	[sanitizer] Add "mainfile" prefix to sanitizer special case list When an issue exists in the main file (caller) instead of an included file (callee), using a `src` pattern applying to the included file may be inappropriate if it's the caller's responsibility. Add `mainfile` prefix to check the main filename. For the example below, the issue may reside in a.c (foo should not be called with a misaligned pointer or foo should switch to an unaligned load), but with `src` we can only apply to the innocent callee a.h. With this patch we can use the more appropriate `mainfile:a.c`. ``` //--- a.h // internal linkage static inline int load(int x) { return x; } //--- a.c, -fsanitize=alignment #include "a.h" int foo(void *x) { return load(x); } ``` See the updated clang/docs/SanitizerSpecialCaseList.rst for a caveat due to C++ vague linkage functions. Reviewed By: #sanitizers, kstoimenov, vitalybuka Differential Revision: https://reviews.llvm.org/D129832	2022-07-15 10:39:26 -07:00
Nikita Popov	2a721374ae	[IR] Don't use blockaddresses as callbr arguments Following some recent discussions, this changes the representation of callbrs in IR. The current blockaddress arguments are replaced with `!` label constraints that refer directly to callbr indirect destinations: ; Before: %res = callbr i8* asm "", "=r,r,i"(i8* %x, i8* blockaddress(@test8, %foo)) to label %asm.fallthrough [label %foo] ; After: %res = callbr i8* asm "", "=r,r,!i"(i8* %x) to label %asm.fallthrough [label %foo] The benefit of this is that we can easily update the successors of a callbr, without having to worry about also updating blockaddress references. This should allow us to remove some limitations: * Allow unrolling/peeling/rotation of callbr, or any other clone-based optimizations (https://github.com/llvm/llvm-project/issues/41834) * Allow duplicate successors (https://github.com/llvm/llvm-project/issues/45248) This is just the IR representation change though, I will follow up with patches to remove limtations in various transformation passes that are no longer needed. Differential Revision: https://reviews.llvm.org/D129288	2022-07-15 10:18:17 +02:00
Jonas Devlieghere	888673b6e3	Revert "[clang] Implement ElaboratedType sugaring for types written bare" This reverts commit `7c51f02eff` because it stills breaks the LLDB tests. This was re-landed without addressing the issue or even agreement on how to address the issue. More details and discussion in https://reviews.llvm.org/D112374.	2022-07-14 21:17:48 -07:00
Matheus Izvekov	7c51f02eff	[clang] Implement ElaboratedType sugaring for types written bare Without this patch, clang will not wrap in an ElaboratedType node types written without a keyword and nested name qualifier, which goes against the intent that we should produce an AST which retains enough details to recover how things are written. The lack of this sugar is incompatible with the intent of the type printer default policy, which is to print types as written, but to fall back and print them fully qualified when they are desugared. An ElaboratedTypeLoc without keyword / NNS uses no storage by itself, but still requires pointer alignment due to pre-existing bug in the TypeLoc buffer handling. --- Troubleshooting list to deal with any breakage seen with this patch: 1) The most likely effect one would see by this patch is a change in how a type is printed. The type printer will, by design and default, print types as written. There are customization options there, but not that many, and they mainly apply to how to print a type that we somehow failed to track how it was written. This patch fixes a problem where we failed to distinguish between a type that was written without any elaborated-type qualifiers, such as a 'struct'/'class' tags and name spacifiers such as 'std::', and one that has been stripped of any 'metadata' that identifies such, the so called canonical types. Example: ``` namespace foo { struct A {}; A a; }; ``` If one were to print the type of `foo::a`, prior to this patch, this would result in `foo::A`. This is how the type printer would have, by default, printed the canonical type of A as well. As soon as you add any name qualifiers to A, the type printer would suddenly start accurately printing the type as written. This patch will make it print it accurately even when written without qualifiers, so we will just print `A` for the initial example, as the user did not really write that `foo::` namespace qualifier. 2) This patch could expose a bug in some AST matcher. Matching types is harder to get right when there is sugar involved. For example, if you want to match a type against being a pointer to some type A, then you have to account for getting a type that is sugar for a pointer to A, or being a pointer to sugar to A, or both! Usually you would get the second part wrong, and this would work for a very simple test where you don't use any name qualifiers, but you would discover is broken when you do. The usual fix is to either use the matcher which strips sugar, which is annoying to use as for example if you match an N level pointer, you have to put N+1 such matchers in there, beginning to end and between all those levels. But in a lot of cases, if the property you want to match is present in the canonical type, it's easier and faster to just match on that... This goes with what is said in 1), if you want to match against the name of a type, and you want the name string to be something stable, perhaps matching on the name of the canonical type is the better choice. 3) This patch could exposed a bug in how you get the source range of some TypeLoc. For some reason, a lot of code is using getLocalSourceRange(), which only looks at the given TypeLoc node. This patch introduces a new, and more common TypeLoc node which contains no source locations on itself. This is not an inovation here, and some other, more rare TypeLoc nodes could also have this property, but if you use getLocalSourceRange on them, it's not going to return any valid locations, because it doesn't have any. The right fix here is to always use getSourceRange() or getBeginLoc/getEndLoc which will dive into the inner TypeLoc to get the source range if it doesn't find it on the top level one. You can use getLocalSourceRange if you are really into micro-optimizations and you have some outside knowledge that the TypeLocs you are dealing with will always include some source location. 4) Exposed a bug somewhere in the use of the normal clang type class API, where you have some type, you want to see if that type is some particular kind, you try a `dyn_cast` such as `dyn_cast<TypedefType>` and that fails because now you have an ElaboratedType which has a TypeDefType inside of it, which is what you wanted to match. Again, like 2), this would usually have been tested poorly with some simple tests with no qualifications, and would have been broken had there been any other kind of type sugar, be it an ElaboratedType or a TemplateSpecializationType or a SubstTemplateParmType. The usual fix here is to use `getAs` instead of `dyn_cast`, which will look deeper into the type. Or use `getAsAdjusted` when dealing with TypeLocs. For some reason the API is inconsistent there and on TypeLocs getAs behaves like a dyn_cast. 5) It could be a bug in this patch perhaps. Let me know if you need any help! Signed-off-by: Matheus Izvekov <mizvekov@gmail.com> Differential Revision: https://reviews.llvm.org/D112374	2022-07-15 04:16:55 +02:00
Vitaly Buka	a629ec2f47	[test] Fix D129789 for 32bit platforms	2022-07-14 12:44:15 -07:00
Ellis Hoag	af58684f27	[InstrProf] Add options to profile function groups Add two options, `-fprofile-function-groups=N` and `-fprofile-selected-function-group=i` used to partition functions into `N` groups and only instrument the functions in group `i`. Similar options were added to xray in https://reviews.llvm.org/D87953 and the goal is the same; to reduce instrumented size overhead by spreading the overhead across multiple builds. Raw profiles from different groups can be added like normal using the `llvm-profdata merge` command. Reviewed By: ianlevesque Differential Revision: https://reviews.llvm.org/D129594	2022-07-14 11:41:30 -07:00
Nick Desaulniers	140bfdca60	[clang][CodeGen] add fn_ret_thunk_extern to synthetic fns Follow up fix to commit `2240d72f15` ("[X86] initial -mfunction-return=thunk-extern support") https://reviews.llvm.org/D129572 @nathanchance reported that -mfunction-return=thunk-extern was failing to annotate the asan and tsan contructors. https://lore.kernel.org/llvm/Ys7pLq+tQk5xEa%2FB@dev-arch.thelio-3990X/ I then noticed the same occurring for gcov synthetic functions. Similar to commit `2786e67` ("[IR][sanitizer] Add module flag "frame-pointer" and set it for cc1 -mframe-pointer={non-leaf,all}") define a new module level MetaData, "fn_ret_thunk_extern", then when set adds the fn_ret_thunk_extern IR Fn Attr to synthetically created Functions. Fixes https://github.com/llvm/llvm-project/issues/56514 Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D129709	2022-07-14 11:25:24 -07:00
Vitaly Buka	a2e01bdcc2	[test][CodeGen] Don't miss lifetime markers in lifetime tests Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D129789	2022-07-14 11:14:07 -07:00
Nick Desaulniers	d2792e7d37	[clang][test] fix typo in fn attr While testing backports of https://reviews.llvm.org/D129572#inline-1245936 commit `2240d72f15` ("[X86] initial -mfunction-return=thunk-extern support") I noticed that one of my unit tests mistyped a function attribute. The unit test was intended to test fn attr merging behavior, but with the typo it was not. Small fixup. Reviewed By: aaron.ballman, erichkeane Differential Revision: https://reviews.llvm.org/D129691	2022-07-14 09:49:11 -07:00
Jonas Devlieghere	3968936b92	Revert "[clang] Implement ElaboratedType sugaring for types written bare" This reverts commit `bdc6974f92` because it breaks all the LLDB tests that import the std module. import-std-module/array.TestArrayFromStdModule.py import-std-module/deque-basic.TestDequeFromStdModule.py import-std-module/deque-dbg-info-content.TestDbgInfoContentDequeFromStdModule.py import-std-module/forward_list.TestForwardListFromStdModule.py import-std-module/forward_list-dbg-info-content.TestDbgInfoContentForwardListFromStdModule.py import-std-module/list.TestListFromStdModule.py import-std-module/list-dbg-info-content.TestDbgInfoContentListFromStdModule.py import-std-module/queue.TestQueueFromStdModule.py import-std-module/stack.TestStackFromStdModule.py import-std-module/vector.TestVectorFromStdModule.py import-std-module/vector-bool.TestVectorBoolFromStdModule.py import-std-module/vector-dbg-info-content.TestDbgInfoContentVectorFromStdModule.py import-std-module/vector-of-vectors.TestVectorOfVectorsFromStdModule.py https://green.lab.llvm.org/green/view/LLDB/job/lldb-cmake/45301/	2022-07-13 09:20:30 -07:00
Mitch Phillips	7045519359	Add missing sanitizer metadata plumbing from CFE. clang misses attaching sanitizer metadata for external globals. Reviewed By: eugenis Differential Revision: https://reviews.llvm.org/D129492	2022-07-13 08:54:41 -07:00
Mitch Phillips	90e5a8ac47	Remove 'no_sanitize_memtag'. Add 'sanitize_memtag'. For MTE globals, we should have clang emit the attribute for all GV's that it creates, and then use that in the upcoming AArch64 global tagging IR pass. We need a positive attribute for this sanitizer (rather than implicit sanitization of all globals) because it needs to interact with other parts of LLVM, including: 1. Suppressing certain global optimisations (like merging), 2. Emitting extra directives by the ASM writer, and 3. Putting extra information in the symbol table entries. While this does technically make the LLVM IR / bitcode format non-backwards-compatible, nobody should have used this attribute yet, because it's a no-op. Reviewed By: eugenis Differential Revision: https://reviews.llvm.org/D128950	2022-07-13 08:54:41 -07:00
Cullen Rhodes	7c3cda551a	[AArch64][SVE] Prefer SIMD&FP variant of clast[ab] The scalar variant with GPR source/dest has considerably higher latency than the SIMD&FP scalar variant across a variety of micro-architectures: Core Scalar SIMD&FP -------------------------------- Neoverse V1 9 cyc 3 cyc Neoverse N2 8 cyc 3 cyc Cortex A510 8 cyc 4 cyc A64FX 29 cyc 6 cyc	2022-07-13 08:53:36 +00:00
Matheus Izvekov	bdc6974f92	[clang] Implement ElaboratedType sugaring for types written bare Without this patch, clang will not wrap in an ElaboratedType node types written without a keyword and nested name qualifier, which goes against the intent that we should produce an AST which retains enough details to recover how things are written. The lack of this sugar is incompatible with the intent of the type printer default policy, which is to print types as written, but to fall back and print them fully qualified when they are desugared. An ElaboratedTypeLoc without keyword / NNS uses no storage by itself, but still requires pointer alignment due to pre-existing bug in the TypeLoc buffer handling. Signed-off-by: Matheus Izvekov <mizvekov@gmail.com> Differential Revision: https://reviews.llvm.org/D112374	2022-07-13 02:10:09 +02:00
Nick Desaulniers	2240d72f15	[X86] initial -mfunction-return=thunk-extern support Adds support for: * `-mfunction-return=<value>` command line flag, and * `__attribute__((function_return("<value>")))` function attribute Where the supported <value>s are: * keep (disable) * thunk-extern (enable) thunk-extern enables clang to change ret instructions into jmps to an external symbol named __x86_return_thunk, implemented as a new MachineFunctionPass named "x86-return-thunks", keyed off the new IR attribute fn_ret_thunk_extern. The symbol __x86_return_thunk is expected to be provided by the runtime the compiled code is linked against and is not defined by the compiler. Enabling this option alone doesn't provide mitigations without corresponding definitions of __x86_return_thunk! This new MachineFunctionPass is very similar to "x86-lvi-ret". The <value>s "thunk" and "thunk-inline" are currently unsupported. It's not clear yet that they are necessary: whether the thunk pattern they would emit is beneficial or used anywhere. Should the <value>s "thunk" and "thunk-inline" become necessary, x86-return-thunks could probably be merged into x86-retpoline-thunks which has pre-existing machinery for emitting thunks (which could be used to implement the <value> "thunk"). Has been found to build+boot with corresponding Linux kernel patches. This helps the Linux kernel mitigate RETBLEED. * CVE-2022-23816 * CVE-2022-28693 * CVE-2022-29901 See also: * "RETBLEED: Arbitrary Speculative Code Execution with Return Instructions." * AMD SECURITY NOTICE AMD-SN-1037: AMD CPU Branch Type Confusion * TECHNICAL GUIDANCE FOR MITIGATING BRANCH TYPE CONFUSION REVISION 1.0 2022-07-12 * Return Stack Buffer Underflow / Return Stack Buffer Underflow / CVE-2022-29901, CVE-2022-28693 / INTEL-SA-00702 SystemZ may eventually want to support "thunk-extern" and "thunk"; both options are used by the Linux kernel's CONFIG_EXPOLINE. This functionality has been available in GCC since the 8.1 release, and was backported to the 7.3 release. Many thanks for folks that provided discrete review off list due to the embargoed nature of this hardware vulnerability. Many Bothans died to bring us this information. Link: https://www.youtube.com/watch?v=IF6HbCKQHK8 Link: https://github.com/llvm/llvm-project/issues/54404 Link: https://gcc.gnu.org/legacy-ml/gcc-patches/2018-01/msg01197.html Link: https://www.intel.com/content/www/us/en/developer/articles/technical/software-security-guidance/advisory-guidance/return-stack-buffer-underflow.html Link: https://arstechnica.com/information-technology/2022/07/intel-and-amd-cpus-vulnerable-to-a-new-speculative-execution-attack/?comments=1 Link: https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=ce114c866860aa9eae3f50974efc68241186ba60 Link: https://www.intel.com/content/www/us/en/security-center/advisory/intel-sa-00702.html Link: https://www.intel.com/content/www/us/en/security-center/advisory/intel-sa-00707.html Reviewed By: aaron.ballman, craig.topper Differential Revision: https://reviews.llvm.org/D129572	2022-07-12 09:17:54 -07:00
Iain Sandoe	b19d3ee712	Revert "[C++20][Modules] Build module static initializers per P1874R1." This reverts commit `ac507102d2`. reverting while we figuere out why one of the green dragon lldb test fails.	2022-07-11 19:50:31 +01:00
Iain Sandoe	e7c8ded6df	Revert "[C++20][Modules] Fix two tests for CTORs that return pointers [NFC]." This reverts commit `4328b96017`. reverting while we figure out why one of the Greendragon lldb tests fails.	2022-07-11 19:49:48 +01:00
Iain Sandoe	4328b96017	[C++20][Modules] Fix two tests for CTORs that return pointers [NFC]. The test are to check that we call the correctly mangled CTORs, so that the return values from them are irrelevant. I forgot that some targets return a pointer, apologies for the breakage.	2022-07-09 11:23:20 +01:00
Iain Sandoe	ac507102d2	[C++20][Modules] Build module static initializers per P1874R1. Currently we only implement this for the Itanium ABI since the correct mangling for the initializers in other ABIs is not yet known. Intended result: For a module interface [which includes partition interface and implementation units] (instead of the generic CXX initializer) we emit a module init that: - wraps the contained initializations in a control variable to ensure that the inits only happen once, even if a module is imported many times by imports of the main unit. - calls module initializers for imported modules first. Note that the order of module import is not significant, and therefore neither is the order of imported module initializers. - We then call initializers for the Global Module Fragment (if present) - We then call initializers for the current module. - We then call initializers for the Private Module Fragment (if present) For a module implementation unit, or a non-module TU that imports at least one module we emit a regular CXX init that: - Calls the initializers for any imported modules first. - Then proceeds as normal with remaining inits. For all module unit kinds we include a global constructor entry, this allows for the (in most cases unusual) possibility that a module object could be included in a final binary without a specific call to its initializer. Implementation: - We provide the module pointer in the AST Context so that CodeGen can act on it and its sub-modules. - We need to account for module build lines like this: ` clang -cc1 -std=c++20 Foo.pcm -emit-obj -o Foo.o` or ` clang -cc1 -std=c++20 -xc++-module Foo.cpp -emit-obj -o Foo.o` - in order to do this, we add to ParseAST to set the module pointer in the ASTContext, once we establish that this is a module build and we know the module pointer. To be able to do this, we make the query for current module public in Sema. - In CodeGen, we determine if the current build requires a CXX20-style module init and, if so, we defer any module initializers during the "Eagerly Emitted" phase. - We then walk the module initializers at the end of the TU but before emitting deferred inits (which adds any hidden and static ones, fixing https://github.com/llvm/llvm-project/issues/51873 ). - We then proceed to emit the deferred inits and continue to emit the CXX init function. Differential Revision: https://reviews.llvm.org/D126189	2022-07-09 09:09:09 +01:00
Masoud Ataei	fe06b9f02c	Bringing back the test with the required target related to commit `96515df816`	2022-07-06 13:04:17 -07:00
Masoud Ataei	d34315e71a	Removing this test temporarily beacuse of a failure in x86_64	2022-07-06 12:41:13 -07:00
Masoud Ataei	96515df816	[PowerPC] Fix the check for scalar MASS conversion Proposing to move the check for scalar MASS conversion from constructor of PPCTargetLowering to the lowerLibCallBase function which decides about the lowering. The Target machine option Options.PPCGenScalarMASSEntries is set in PPCTargetMachine.cpp. But an object of the class PPCTargetLowering is created in one of the included header files. So, the constructor will run before setting PPCGenScalarMASSEntries to correct value. So, we cannot check this option in the constructor. Differential: https://reviews.llvm.org/D128653 Reviewer: @bmahjour	2022-07-06 11:44:00 -07:00
Paul Robinson	08e4fe6c61	[X86] Add RDPRU instruction Add support for the RDPRU instruction on Zen2 processors. User-facing features: - Clang option -m[no-]rdpru to enable/disable the feature - Support is implicit for znver2/znver3 processors - Preprocessor symbol __RDPRU__ to indicate support - Header rdpruintrin.h to define intrinsics - "rdpru" mnemonic supported for assembler code Internal features: - Clang builtin __builtin_ia32_rdpru - IR intrinsic @llvm.x86.rdpru Differential Revision: https://reviews.llvm.org/D128934	2022-07-06 07:17:47 -07:00
Bruno De Fraine	5b3247bf9f	[tbaa] Handle base classes in struct tbaa This is a fix for the miscompilation reported in https://github.com/llvm/llvm-project/issues/55384 Not adding a new test case since existing test cases already cover base classes (including new-struct-path tbaa). Reviewed By: jeroen.dobbelaere Differential Revision: https://reviews.llvm.org/D126956	2022-07-06 14:37:59 +02:00
Archibald Elliott	1666f09933	[ARM] Add Support for Cortex-M85 This patch adds support for Arm's Cortex-M85 CPU. The Cortex-M85 CPU is an Arm v8.1m Mainline CPU, with optional support for MVE and PACBTI, both of which are enabled by default. Parts have been coauthored by by Mark Murray, Alexandros Lamprineas and David Green. Differential Revision: https://reviews.llvm.org/D128415	2022-07-05 10:43:31 +01:00
Stephan Bergmann	4996e3f683	[test] Check for more -fsanitize=array-bounds behavior ...that had temporarily regressed with (since reverted) <`886715af96`> "[clang] Introduce -fstrict-flex-arrays=<n> for stricter handling of flexible arrays", and had then been seen to cause issues in the wild: For one, the HarfBuzz project has various "fake" flexible array members of the form > Type arrayZ[HB_VAR_ARRAY]; in <https://github.com/harfbuzz/harfbuzz/blob/main/src/hb-open-type.hh>, where HB_VAR_ARRAY is a macro defined as > #ifndef HB_VAR_ARRAY > #define HB_VAR_ARRAY 1 > #endif in <https://github.com/harfbuzz/harfbuzz/blob/main/src/hb-machinery.hh>. For another, the Firebird project in <https://github.com/FirebirdSQL/firebird/blob/master/src/lock/lock_proto.h> uses a trailing member > srq lhb_hash[1]; // Hash table as a "fake" flexible array, but declared in a > struct lhb : public Firebird::MemoryHeader that is not a standard-layout class (because the Firebird::MemoryHeader base class also declares non-static data members). (The second case is specific to C++. Extend the test setup so that all the other tests are now run for both C and C++, just in case the behavior could ever start to diverge for those two languages.) A third case where -fsanitize=array-bounds differs from -Warray-bounds (and which is also specific to C++, but which doesn't appear to have been encountered in the wild) is when the "fake" flexible array member's size results from template argument substitution. Differential Revision: https://reviews.llvm.org/D128783	2022-07-05 08:12:53 +02:00
Ben Dunbobbin	325e7e8b87	[LLVM][LTO][LLD] Enable Profile Guided Layout (--call-graph-profile-sort) for FullLTO The CGProfilePass needs to be run during FullLTO compilation at link time to emit the .llvm.call-graph-profile section to the compiled LTO object file. Currently, it is being run only during the initial LTO-prelink compilation stage (to produce the bitcode files to be consumed by the linker) and so the section is not produced. ThinLTO is not affected because: - For ThinLTO-prelink compilation the CGProfilePass pass is not run because ThinLTO-prelink passes are added via buildThinLTOPreLinkDefaultPipeline. Normal and FullLTO-prelink passes are both added via buildPerModuleDefaultPipeline which uses the LTOPreLink parameter to customize its behavior for the FullLTO-prelink pass differences. - ThinLTO backend compilation phase adds the CGProfilePass (see: buildModuleOptimizationPipeline). Adjust when the pass is run so that the .llvm.call-graph-profile section is produced correctly for FullLTO. Fixes #56185 (https://github.com/llvm/llvm-project/issues/56185)	2022-07-01 13:57:36 +01:00
Phoebe Wang	abeeae570e	[X86] Support `_Float16` on SSE2 and up This is split from D113107 to address #56204 and https://discourse.llvm.org/t/how-to-build-compiler-rt-for-new-x86-half-float-abi/63366 Reviewed By: zahiraam, rjmccall, bkramer, MaskRay Differential Revision: https://reviews.llvm.org/D128571	2022-06-30 17:21:37 +08:00
Lei Huang	caf7243a6b	[PowerPC] Fix signatures for vec_replace_unaligned builtin ``vec_replace_unaligned`` is meant to return vuc to emphasize that elements are being inserted on unnatural boundaries. Reviewed By: amyk, quinnp Differential Revision: https://reviews.llvm.org/D128288	2022-06-29 09:35:52 -05:00
Ben Langmuir	eab2a06f0f	Revert "Reland "[X86] Support `_Float16` on SSE2 and up"" Broke compiler-rt on Darwin: https://green.lab.llvm.org/green/job/clang-stage1-RA/29920/ This reverts commit `527ef8ca98`.	2022-06-28 10:59:03 -07:00
Mitch Phillips	a2095d1aff	Allow mangled names in sanitizer clang IR gen tests. Looks like with https://reviews.llvm.org/D127911, Windows emits more globals with mangled names into the IR. Relax the tests in order to allow these mangled names.	2022-06-28 09:27:43 -07:00
Phoebe Wang	527ef8ca98	Reland "[X86] Support `_Float16` on SSE2 and up" Enable `COMPILER_RT_HAS_FLOAT16` to solve the lit fail. This is split from D113107 to address #56204 and https://discourse.llvm.org/t/how-to-build-compiler-rt-for-new-x86-half-float-abi/63366 Reviewed By: zahiraam, rjmccall, bkramer Differential Revision: https://reviews.llvm.org/D128571	2022-06-28 14:38:56 +08:00
Craig Topper	17a36c7c40	[RISCV] Zero extend immediate for vget/vset builtins to match vector.insert/extract intrinsics. The vector.insert/extract intrinsics require an i64 immediate argument. This fixes a crash on RV32. Differential Revision: https://reviews.llvm.org/D128624	2022-06-27 20:26:54 -07:00
Fangrui Song	efd90ffbfc	[test] Add -fsanitize=array-bounds test for pseudo flexible array member This behavior (from commit `539e4a77bb` in 2013) was untested. The test can help detect regression introduced by `886715af96`	2022-06-27 16:40:52 -07:00
Mitch Phillips	dacfa24f75	Delete 'llvm.asan.globals' for global metadata. Now that we have the sanitizer metadata that is actually on the global variable, and now that we use debuginfo in order to do symbolization of globals, we can delete the 'llvm.asan.globals' IR synthesis. This patch deletes the 'location' part of the __asan_global that's embedded in the binary as well, because it's unnecessary. This saves about ~1.7% of the optimised non-debug with-asserts clang binary. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D127911	2022-06-27 14:40:40 -07:00
Vitaly Buka	cdfa15da94	Revert "[clang] Introduce -fstrict-flex-arrays=<n> for stricter handling of flexible arrays" This reverts D126864 and related fixes. This reverts commit `572b08790a`. This reverts commit `886715af96`.	2022-06-27 14:03:09 -07:00
Vitaly Buka	8f7cca90af	Revert "[X86] Support `_Float16` on SSE2 and up" Breaks buildbot https://lab.llvm.org/buildbot/#/builders/37/builds/14334 This reverts commit `f5d781d627`.	2022-06-27 12:43:29 -07:00

1 2 3 4 5 ...

7647 Commits