clang-p2996

Author	SHA1	Message	Date
Hans	3bb39690d7	[coro] Lower `llvm.coro.await.suspend.handle` to resume with tail call (#89751 ) The C++ standard requires that symmetric transfer from one coroutine to another is performed via a tail call. Failure to do so is a miscompile and often breaks programs by quickly overflowing the stack. Until now, the coro split pass tried to ensure this in the `addMustTailToCoroResumes()` function by searching for `llvm.coro.resume` calls to lower as tail calls if the conditions were right: the right function arguments, attributes, calling convention etc., and if a `ret void` was sure to be reached after traversal with some ad-hoc constant folding following the call. This was brittle, as the kind of implicit variants required for a tail call to happen could easily be broken by other passes (e.g. if some instruction got in between the `resume` and `ret`), see for example `9d1cb18d19` and `284da049f5`. Also the logic seemed backwards: instead of searching for possible tail call candidates and doing them if the circumstances are right, it seems better to start with the intention of making the tail calls we need, and forcing the circumstances to be right. Now that we have the `llvm.coro.await.suspend.handle` intrinsic (since `f786881340`) which corresponds exactly to symmetric transfer, change the lowering of that to also include the `resume` part, always lowered as a tail call.	2024-05-15 15:29:08 +02:00
Jay Foad	1650f1b3d7	Fix typo "indicies" (#92232 )	2024-05-15 13:10:16 +01:00
Florian Mayer	80f8ae3f84	[NFC] add explanation to register flags doc (#91803 )	2024-05-14 11:34:27 -07:00
Krzysztof Drewniak	ac0d415552	Update documentation for buffer fat pointers (#92034 ) Now that we've got (minus some issues around datatypes and invariant loads) working lowerings for address space 7, update the table in the AMDGPU usage guide to properly indicate the nature of these address spaces.	2024-05-14 10:03:48 -05:00
Graham Hunter	2b15c4a62b	[AArch64] Postcommit fixes for histogram intrinsic (#92095 ) A buildbot with expensive checks enabled flagged some problems with my patch. There was also a post-commit nit on the langref changes.	2024-05-14 15:16:42 +01:00
Vyacheslav Levytskyy	be9b4dab40	[SPIR-V] Introduce support for 'spirv.Decorations' metadata node in SPIR-V Backend (#91736 ) This PR is to introduce support for 'spirv.Decorations' metadata node in SPIR-V Backend. See also https://github.com/KhronosGroup/SPIRV-LLVM-Translator/blob/main/docs/SPIRVRepresentationInLLVM.rst that describes `spirv.Decorations` as an important part of SPIRV-friendly LLVM IR.	2024-05-14 11:35:11 +02:00
appujee	f3b8d91ca8	LLVM vectorizer working group (#92068 ) Recurring meeting at 3rd Thursday of every month.	2024-05-13 22:39:51 -07:00
Fangrui Song	23f8fac745	Revert "Repply#2 "[RemoveDIs] Load into new debug info format by default in LLVM (#89799 )"" This reverts commit `91446e2aa6` and a unittest followup `1530f31931` (#90476). In a stage-2 -flto=thin -gsplit-dwarf -g -fdebug-info-for-profiling -fprofile-sample-use= build of clang, a ThinLTO backend compile has assertion failures: Global is external, but doesn't have external or weak linkage! ptr @_ZN5clang12ast_matchers8internal18makeAllOfCompositeINS_8QualTypeEEENS1_15BindableMatcherIT_EEN4llvm8ArrayRefIPKNS1_7MatcherIS5_EEEE function declaration may only have a unique !dbg attachment ptr @_ZN5clang12ast_matchers8internal18makeAllOfCompositeINS_8QualTypeEEENS1_15BindableMatcherIT_EEN4llvm8ArrayRefIPKNS1_7MatcherIS5_EEEE The failures somehow go away if -fprofile-sample-use= is removed.	2024-05-13 16:37:39 -07:00
Graham Hunter	fbb37e9606	[AArch64] Add an all-in-one histogram intrinsic Based on discussion from https://discourse.llvm.org/t/rfc-vectorization-support-for-histogram-count-operations/74788 Current interface is: llvm.experimental.histogram(<vecty> ptrs, <intty> inc_amount, <vecty> mask) The integer type used by 'inc_amount' needs to match the type of the buckets in memory. The intrinsic covers the following operations: * Gather load * histogram on the elements of 'ptrs' * multiply the histogram results by 'inc_amount' * add the result of the multiply to the values loaded by the gather * scatter store the results of the add Supports lowering to histcnt instructions for AArch64 targets, and scalarization for all others at present.	2024-05-13 11:35:28 +01:00
Min-Yih Hsu	f8063ffe73	[VP][RISCV] Add vp.reduce.fmaximum/fminimum and its RISC-V codegen (#91782 ) `vp.reduce.fmaximum/fminimum` are the VP version of `vector.reduce.fmaximum/fminimum`.	2024-05-10 16:01:47 -07:00
Jack Styles	6aac30fa43	Update FEAT_PAuth_LR behaviour for AArch64 (#90614 ) Currently, LLVM enables `-mbranch-protection=standard` as `bti+pac-ret`. To align LLVM with the behaviour in GNU, this has been updated to `bti+pac-ret+pc` when FEAT_PAuth_LR is enabled as an optional feature via the `-mcpu=` options. If this is not enabled, then this will revert to the existing behaviour.	2024-05-10 08:09:02 +01:00
Jonas Devlieghere	1e97d114b5	[dsymutil] Add -q/--quiet flag to suppress warnings (#91658 ) Add a -q/--quiet flag to suppress dsymutil output. For now the flag is limited to dsymutil, though there might be other places in the DWARF linker that could be conditionalized by this flag. The motivation is having a way to silence the "no debug symbols in executable" warning. This is useful when we want to generate a dSYM for a binary not containing debug symbols, but still want a dSYM that can be indexed by spotlight. rdar://127843467	2024-05-09 15:55:36 -07:00
Fangrui Song	aacea0d0f6	[utils] Add script to generate elaborated IR and assembly tests (#89026 ) Generally, IR and assembly test files benefit from being cleaned to remove unnecessary details. However, for tests requiring elaborate IR or assembly files where cleanup is less practical (e.g., large amount of debug information output from Clang), the current practice is to include the C/C++ source file and the generation instructions as comments. This is inconvenient when regeneration is needed. This patch adds `llvm/utils/update_test_body.py` to allow easier regeneration. `ld.lld --debug-names` tests (#86508) utilize this script for Clang-generated assembly tests. Note: `-o pipefail` is standard (since https://www.austingroupbugs.net/view.php?id=789) but not supported by dash. Link: https://discourse.llvm.org/t/utility-to-generate-elaborated-assembly-ir-tests/78408	2024-05-08 23:58:55 -07:00
Mircea Trofin	96568f3539	[llvm][ctx_profile] Add instrumentation lowering (#90821 ) This adds the instrumentation lowering pass. (Tracking Issue: #89287, RFC referenced there)	2024-05-08 16:49:08 -07:00
XChy	08011cf845	[Docs][NFC] Use opaque ptr in the example (#91502 )	2024-05-09 01:15:49 +08:00
Farzon Lotfi	3e82442ff7	[SPIRV] Add tan intrinsic part 3 (#90278 ) This change is an implementation of #87367's investigation on supporting IEEE math operations as intrinsics. Which was discussed in this RFC: https://discourse.llvm.org/t/rfc-all-the-math-intrinsics/78294 If you want an overarching view of how this will all connect see: https://github.com/llvm/llvm-project/pull/90088 Changes: - `llvm/docs/GlobalISel/GenericOpcode.rst` - Document the `G_FTAN` opcode - `llvm/include/llvm/IR/Intrinsics.td` - Create the tan intrinsic - `llvm/include/llvm/Support/TargetOpcodes.def` - Create a `G_FTAN` Opcode handler - `llvm/include/llvm/Target/GenericOpcodes.td` - Define the `G_FTAN` Opcode - `llvm/lib/CodeGen/GlobalISel/IRTranslator.cpp` Map the tan intrinsic to `G_FTAN` Opcode - `llvm/lib/Target/SPIRV/SPIRVInstructionSelector.cpp` - Map the `G_FTAN` opcode to the GLSL 4.5 and openCL tan instructions. - `llvm/lib/Target/SPIRV/SPIRVLegalizerInfo.cpp` - Define `G_FTAN` as a legal spirv target opcode.	2024-05-08 00:57:39 -04:00
Farzon Lotfi	31b45a9d0d	[clang][hlsl] Add tan intrinsic part 1 (#90276 ) This change is an implementation of #87367's investigation on supporting IEEE math operations as intrinsics. Which was discussed in this RFC: https://discourse.llvm.org/t/rfc-all-the-math-intrinsics/78294 If you want an overarching view of how this will all connect see: https://github.com/llvm/llvm-project/pull/90088 Changes: - `clang/docs/LanguageExtensions.rst` - Document the new elementwise tan builtin. - `clang/include/clang/Basic/Builtins.td` - Implement the tan builtin. - `clang/lib/CodeGen/CGBuiltin.cpp` - invoke the tan intrinsic on uses of the builtin - `clang/lib/Headers/hlsl/hlsl_intrinsics.h` - Associate the tan builtin with the equivalent hlsl apis - `clang/lib/Sema/SemaChecking.cpp` - Add generic sema checks as well as HLSL specifc sema checks to the tan builtin - `llvm/include/llvm/IR/Intrinsics.td` - Create the tan intrinsic - `llvm/docs/LangRef.rst` - Document the tan intrinsic	2024-05-07 22:54:15 -04:00
Benji Smith	584253c4e2	[C API] Add getters and build function for CallBr (#91154 ) This adds LLVMBuildCallBr to create CallBr instructions, and getters for the CallBr-specific data. The remainder of its data, e.g. arguments/function, can be accessed using existing getters.	2024-05-08 10:59:53 +09:00
Chris Copeland	651bdb96b1	[ARM] Armv8-R does not require fp64 or neon. (#88287 ) This was [addressed for AArch64 here](https://github.com/llvm/llvm-project/pull/79004), but the same applies to ARM. Move the enablement of neon+fp64 to `-mcpu=cortex-r52`, which optionally supports these features.	2024-05-07 11:48:30 +01:00
Peter Waller	1de0535e84	[llvm-mca] Abort on parse error without -skip-unsupported-instructions (#90474 ) [llvm-mca] Abort on parse error without -skip-unsupported-instructions Prior to this patch, llvm-mca would continue executing after parse errors. These errors can lead to some confusion since some analysis results are printed on the standard output, and they're printed after the errors, which could otherwise be easy to miss. However it is still useful to be able to continue analysis after errors; so extend the recently added -skip-unsupported-instructions to support this. Two tests which have parse errors for some of the 'RUN' branches are updated to use -skip-unsupported-instructions so they can remain as-is. Add a description of -skip-unsupported-instructions to the llvm-mca command guide, and add it to the llvm-mca --help output: ``` --skip-unsupported-instructions=<value> - Force analysis to continue in the presence of unsupported instructions =none - Exit with an error when an instruction is unsupported for any reason (default) =lack-sched - Skip instructions on input which lack scheduling information =parse-failure - Skip lines on the input which fail to parse for any reason =any - Skip instructions or lines on input which are unsupported for any reason ``` Tests within this patch are intended to cover each of the cases. Reason \| Flag \| Comment --------------\|------\|------- none \| none \| Usual case, existing test suite lack-sched \| none \| Advises user to use -skip-unsupported-instructions=lack-sched, tested in llvm/test/tools/llvm-mca/X86/BtVer2/unsupported-instruction.s parse-failure \| none \| Advises user to use -skip-unsupported-instructions=parse-failure, tested in llvm/test/tools/llvm-mca/bad-input.s any \| none \| (N/A, covered above) lack-sched \| any \| Continues, prints warnings, tested in llvm/test/tools/llvm-mca/X86/BtVer2/unsupported-instruction.s parse-failure \| any \| Continues, prints errors, tested in llvm/test/tools/llvm-mca/bad-input.s lack-sched \| parse-failure \| Advises user to use -skip-unsupported-instructions=lack-sched, tested in llvm/test/tools/llvm-mca/X86/BtVer2/unsupported-instruction.s parse-failure \| lack-sched \| Advises user to use -skip-unsupported-instructions=parse-failure, tested in llvm/test/tools/llvm-mca/bad-input.s none \| * \| This would be any test case with skip-unsupported-instructions, coverage added in llvm/test/tools/llvm-mca/X86/BtVer2/simple-test.s any \| * \| (Logically covered by the other cases)	2024-05-07 09:13:44 +01:00
Nikita Popov	de8cf69abf	[LangRef] callbr result can be used in all successors (#91167 ) Originally, the callbr result could only be used on the fallthrough destination. This limitation has been lifted, and the result is now also available on the indirect destinations. However, LangRef was not updated to reflect this.	2024-05-07 09:46:16 +09:00
Chris B	afeedd9c3d	[DirectX][docs] Document DXContainer format (#90908 ) This adds a document to describe the DXContainer format and the structures of data inside the file. Resolves #88775	2024-05-06 16:20:31 -05:00
Matt Arsenault	d654278bde	Reapply "AMDGPU: Implement llvm.set.rounding (#88587 )" series (#91113 ) Revert "Revert 4 last AMDGPU commits to unbreak Windows bots" This reverts commit `0d493ed2c6`. MSVC does not like constexpr on the definition after an extern declaration of a global.	2024-05-06 09:09:19 +02:00
Mehdi Amini	0d493ed2c6	Revert 4 last AMDGPU commits to unbreak Windows bots Revert "AMDGPU: Try to fix build error with old gcc" This reverts commit `c7ad12d0d7`. Revert "AMDGPU: Use umin in set.rounding expansion" This reverts commit `a56f0b51dd`. Revert "AMDGPU: Optimize set_rounding if input is known to fit in 2 bits (#88588)" This reverts commit `b4e751e2ab`. Revert "AMDGPU: Implement llvm.set.rounding (#88587)" This reverts commit `9731b77e80`.	2024-05-04 19:57:33 +02:00
Andreas Jonson	1343e68862	[C API] Add function to create ConstantRange attributes to C API (#90505 )	2024-05-04 16:01:59 +09:00
Nikita Popov	f16e234f11	[InstCombine] Do not request non-splat vector support in code reviews (NFC) (#90709 ) The InstCombine contributor guide already says: > Handle non-splat vector constants if doing so is free, but do > not add handling for them if it adds any additional complexity > to the code. This change strengthens this guideline to explicitly discourage asking (new) contributors to implement non-splat support during code reviews. Doing so will almost certainly increase the number of necessary review iterations, or result in outright contradictory review feedback, as different people are willing to accept a different degree of complexity for non-splat vector support.	2024-05-04 16:01:36 +09:00
Maksim Levental	b958ef1948	Update GettingInvolved.rst (#91008 )	2024-05-03 19:02:28 -05:00
Fangrui Song	121bef76df	[docs,utils] Convert text files from CRLF to LF Skip .bat, .natvis, utils/lit/tests/Inputs/shtest-shell/diff-in.dos	2024-05-03 10:16:54 -07:00
Stephen Tozer	91446e2aa6	Repply#2 "[RemoveDIs] Load into new debug info format by default in LLVM (#89799 )" Reapplies the original commit: `2f01fd99eb` The previous application of this patch failed due to some missing DbgVariableRecord support in clang, which has been added now by commit `8805465e`. This will probably break some downstream tools that don't already handle debug records. If your downstream code breaks as a result of this change, the simplest fix is to convert the module in question to the old debug format before you process it, using `Module::convertFromNewDbgValues()`. For more information about how to handle debug records or about what has changed, see the migration document: https://llvm.org/docs/RemoveDIsDebugInfo.html This reverts commit `4fd319ae27`.	2024-05-03 12:55:31 +01:00
Matt Arsenault	9731b77e80	AMDGPU: Implement llvm.set.rounding (#88587 ) Use a shift of a magic constant and some offseting to convert from flt_rounds values. I don't know why the enum defines Dynamic = 7. The standard suggests -1 is the cannot determine value. If we could start the extended values at 4 we wouldn't need the extra compare sub and select. https://reviews.llvm.org/D153257	2024-05-03 09:41:27 +02:00
Stephen Tozer	4fd319ae27	Revert#2 "[RemoveDIs] Load into new debug info format by default in LLVM (#89799 )" Reverted following probably-causing failures on some clang buildbots: https://lab.llvm.org/buildbot/#/builders/245/builds/24037 This reverts commit `a12622543d`.	2024-05-02 17:52:02 +01:00
Craig Topper	44645996b0	[RISCV] Add smstateen extension (#90818 )	2024-05-02 09:12:44 -07:00
Stephen Tozer	a12622543d	Reapply "[RemoveDIs] Load into new debug info format by default in LLVM (#89799 )" Fixes the broken tests in the original commit: `2f01fd99eb` This will probably break some downstream tools that don't already handle debug records. If your downstream code breaks as a result of this change, the simplest fix is to convert the module in question to the old debug format before you process it, using `Module::convertFromNewDbgValues()`. For more information about how to handle debug records or about what has changed, see the migration document: https://llvm.org/docs/RemoveDIsDebugInfo.html This reverts commit `00821fed09`.	2024-05-02 16:32:12 +01:00
Stephen Tozer	00821fed09	Revert "[RemoveDIs] Load into new debug info format by default in LLVM (#89799 )" A unit test was broken by the above commit: https://lab.llvm.org/buildbot/#/builders/139/builds/64627 This reverts commit `2f01fd99eb`.	2024-05-01 16:56:34 +01:00
Stephen Tozer	2f01fd99eb	[RemoveDIs] Load into new debug info format by default in LLVM (#89799 ) This patch enables parsing and creating modules directly into the new debug info format. Prior to this patch, all modules were constructed with the old debug info format by default, and would be converted into the new format just before running LLVM passes. This is an important milestone, in that this means that every tool will now be exposed to debug records, rather than those that run LLVM passes. As far as I've tested, all LLVM tools/projects now either handle debug records, or convert them to the old intrinsic format. There are a few unit tests that need updating for this patch; these are either cases of tests that previously needed to set the debug info format to function, or tests that depend on the old debug info format in some way. There should be no visible change in the output of any LLVM tool as a result of this patch, although the likelihood of this patch breaking downstream code means an NFC tag might be a little misleading, if not technically incorrect: This will probably break some downstream tools that don't already handle debug records. If your downstream code breaks as a result of this change, the simplest fix is to convert the module in question to the old debug format before you process it, using `Module::convertFromNewDbgValues()`. For more information about how to handle debug records or about what has changed, see the migration document: https://llvm.org/docs/RemoveDIsDebugInfo.html	2024-05-01 16:50:12 +01:00
Eli Friedman	a754ce0489	[LangRef] Fix build warning.	2024-04-30 10:33:37 -07:00
Eli Friedman	600cae7d42	[LangRef] Try to clarify mustprogress wording. (#90510 ) Ensure it's clear that: - Infinite loops in non-mustprogress functions are well-defined, even if they're called by mustprogress functions. - Infinite recursion in mustprogress functions is not well-defined. Looking at D86233, it's clear this was the intent, but the "transitive" wording is ambiguous. Instead, just explicitly state that infinite loops written in non-mustprogress functions count as progress.	2024-04-30 10:16:12 -07:00
Min-Yih Hsu	539f626ecd	[VP][RISCV] Add vp.cttz.elts intrinsic and its RISC-V codegen (#90502 ) This intrinsic is the VP version of `experimental.cttz.elts`.	2024-04-30 09:27:10 -07:00
Jonathan Thackray	e50a857fb1	[AArch64] Add support for Cortex-R82AE and improve Cortex-R82 (#90440 )	2024-04-30 14:15:01 +01:00
Kristof Beyls	853344d3ae	[docs] Document which online sync-ups are no longer happening (#89361 ) Some of the online sync-ups on our Getting Involved page seem to no longer be happening. Document them as no longer happening, so that people don't get confused when dialing in to one of these.	2024-04-30 09:49:57 +02:00
Maciej Gabka	bfc0317153	Move several vector intrinsics out of experimental namespace (#88748 ) This patch is moving out following intrinsics: * vector.interleave2/deinterleave2 * vector.reverse * vector.splice from the experimental namespace. All these intrinsics exist in LLVM for more than a year now, and are widely used, so should not be considered as experimental.	2024-04-29 10:16:45 +01:00
Sameer Sahasrabuddhe	256d76f480	[Docs] Improve the description of convergence (#89038 ) - Clarify convergence of threads v/s convergence of operations. - Explicitly address operations that are not in any cycle. This was inspired by a discussion on Discourse: https://discourse.llvm.org/t/llvm-convergence-semantics/77642	2024-04-28 19:56:14 +05:30
Craig Topper	b27f86b40b	[RISCV] Add an instruction PrettyPrinter to llvm-objdump (#90093 ) This prints the opcode bytes in the same order as GNU objdump without a space between them.	2024-04-26 11:27:28 -07:00
Jonathan Thackray	a670cdadca	[AArch64] Add support for Neoverse-N3, Neoverse-V3 and Neoverse-V3AE (#90143 ) Neoverse-N3, Neoverse-V3 and Neoverse-V3AE are Armv9.2 AArch64 CPUs. Technical Reference Manual for Neoverse-N3: https://developer.arm.com/documentation/107997/latest/ Technical Reference Manual for Neoverse-V3: https://developer.arm.com/documentation/107734/latest/ Technical Reference Manual for Neoverse-V3AE: https://developer.arm.com/documentation/101595/latest/	2024-04-26 13:04:35 +01:00
Alex Bradbury	357530f113	Revert "[llvm][RISCV] Enable trailing fences for seq-cst stores by default (#87376 )" This reverts commit `733b271db7`. Reverting in order to revert the companion patch adding the atomics ABI ELF attributes due to the reported incompatibility with GNU ld. https://github.com/llvm/llvm-project/pull/84597#issuecomment-2079128332	2024-04-26 12:16:53 +01:00
bd1976bris	88a733f8e6	[llvm-objcopy][docs] Use "Mark" rather than "Make" in the objcopy docs for consistency (#90080 ) llvm-objcopy --help uses the term "Mark" rather than "Make". e.g. "Mark all symbols local" Change llvm/docs to align.	2024-04-26 09:13:17 +01:00
Thorsten Schütt	65fb80beae	[GlobalIsel] Add Gallery to MIR Patterns (#89974 ) examples for fold of zext(trunc:nuw)	2024-04-26 07:06:49 +02:00
Paul Kirth	733b271db7	[llvm][RISCV] Enable trailing fences for seq-cst stores by default (#87376 ) With the tag merging in place, we can safely change the default for +seq-cst-trailing-fence to the default, according to the recommendation in https://github.com/riscv-non-isa/riscv-elf-psabi-doc/blob/master/riscv-atomic.adoc This tag changes the default for the feature flag, and moves to more consistent naming with respect to existing features.	2024-04-25 16:33:10 -07:00
Mircea Trofin	ddb67e6847	[llvm][ctx_profile] Add the `llvm.instrprof.callsite` intrinsic (#89939 ) Add the callsite intrinsic. Structurally, it is very similar to the counter intrinsics, hence the inheritance relationship. We can probably rename `InstrProfCntrInstBase` to `InstrProfIndexedBase` later - because the "counting" aspect is really left to derived types of `InstrProfCntrInstBase`, and it only concerns itself with the index aspect (which is what we care about for `callsite`, too) (Tracking Issue: #89287, RFC referenced there)	2024-04-25 15:00:09 -07:00
Andy Kaylor	2575cd8a90	Add ics link for Floating Point WG (#82545 ) This adds a link to an ics file for the LLVM Floating Point WG line in the Getting Involved page.	2024-04-24 11:55:17 -07:00

1 2 3 4 5 ...

10794 Commits