clang-p2996

Author	SHA1	Message	Date
David Spickett	8b3cc4d1ed	[llvm][docs] Remove LLDB DWARF64 release note As a precaution until the issues reported (by me) in https://github.com/llvm/llvm-project/issues/135208 are investigated.	2025-07-02 14:02:57 +00:00
Steven Perron	4e213159af	[SPIRV] Add FloatControl2 capability (#144371 ) Add handling for FPFastMathMode in SPIR-V shaders. This is a first pass that simply does a direct translation when the proper extension is available. This will unblock work for HLSL. However, it is not a full solution. The default math mode for spir-v is determined by the API. When targeting Vulkan many of the fast math options are assumed. We should do something particular when targeting Vulkan. We will also need to handle the hlsl "precise" keyword correctly when FPFastMathMode is not available. Unblockes https://github.com/llvm/llvm-project/issues/140739, but we are keeing it open to track the remaining issues mentioned above.	2025-07-02 08:48:57 -04:00
Antonio Frighetto	f1cc0b607b	[IR] Introduce `dead_on_return` attribute Add `dead_on_return` attribute, which is meant to be taken advantage by the frontend, and states that the memory pointed to by the argument is dead upon function return. As with `byval`, it is supposed to be used for passing aggregates by value. The difference lies in the ABI: `byval` implies that the pointer is explicitly passed as argument to the callee (during codegen the copy is emitted as per byval contract), whereas a `dead_on_return`-marked argument implies that the copy already exists in the IR, is located at a specific stack offset within the caller, and this memory will not be read further by the caller upon callee return – or otherwise poison, if read before being written. RFC: https://discourse.llvm.org/t/rfc-add-dead-on-return-attribute/86871.	2025-07-02 09:29:36 +02:00
Hemang Gadhavi	da0828b1e9	[lldb] Enable support for DWARF64 format handling (#145645 ) This PR introduces support for the DWARF64 format, enabling handling of 64-bit DWARF sections as defined by the DWARF specification. The update includes adjustments to header parsing and modification of form values to accommodate 64-bit offsets and values. Also Added the testcase to verify the DWARF64 format.	2025-07-01 18:35:40 +05:30
David Spickett	cd60247a0f	[llvm][docs] Document how to ask for things other than commit access (#146340 ) This is the implicit process but useful to spell it out I think. To give folks more confidence in asking. Inspired by https://discourse.llvm.org/t/request-github-issue-triage-permission-for-labeling-clang-issues/87126/1	2025-07-01 10:59:59 +01:00
S. VenkataKeerthy	a2dc64cd03	[NFC][MLGO] Convert notes to proper RST note directives in MLGO.rst (#146450 )	2025-06-30 18:25:12 -07:00
S. VenkataKeerthy	9438048816	[IR2Vec] Simplifying creation of Embedder (#143999 ) This change simplifies the API by removing the error handling complexity. - Changed `Embedder::create()` to return `std::unique_ptr<Embedder>` directly instead of `Expected<std::unique_ptr<Embedder>>` - Updated documentation and tests to reflect the new API - Added death test for invalid IR2Vec kind in debug mode - In release mode, simply returns nullptr for invalid kinds instead of creating an error (Tracking issue - #141817)	2025-06-30 18:24:08 -07:00
S. VenkataKeerthy	6620db50fe	[NFC][MLGO] Fix indentation in MLGO.rst (#146444 )	2025-06-30 18:00:09 -07:00
S. VenkataKeerthy	0745eb501d	[IR2Vec] Scale embeddings once in vocab analysis instead of repetitive scaling (#143986 ) Changes to scale opcodes, types and args once in `IR2VecVocabAnalysis` so that we can avoid scaling each time while computing embeddings. This PR refactors the vocabulary to explicitly define 3 sections---Opcodes, Types, and Arguments---used for computing Embeddings. (Tracking issue - #141817 ; partly fixes - #141832)	2025-06-30 23:09:19 +02:00
ur4t	38df9ebdc6	[docs] Fix a typo in llvm/docs/CMake.rst (#138639 )	2025-06-29 23:57:46 +01:00
Wendi	2b48ce7034	[docs] Add documentation for LLVM Qualification Group (#145331 ) This patch adds a new document describing the LLVM Qualification Group, modeled after the Security Group documentation. The goal is to create an open working group focused on enabling LLVM use in safety-critical applications, such as those requiring ISO 26262 qualification. The group is intended to be non-enforcing and collaborative, and to act as a public coordination point for contributors working on safety-relevant concerns in LLVM. See: https://discourse.llvm.org/t/rfc-proposal-to-establish-a-safety-group-in-llvm/86916 In this review, I’d really appreciate your feedback on both the overall structure and wording, especially if anything could be made clearer, more balanced, or more aligned with LLVM’s values and documentation tone. What feels right? What could be improved to better reflect LLVM community expectations? --------- Co-authored-by: Wendi Urribarri (Woven by Toyota <wendi.urribarri@woven-planet.global>	2025-06-27 14:23:53 +01:00
Nicolai Hähnle	61739d76f0	AMDGPU: Trivial doc fixes (#146021 )	2025-06-26 20:55:16 -07:00
Paschalis Mpeis	2e196a0ed1	[DOCS][BUILDBOT] Bump recommended worker version (#144853 ) Now recommend 3.11.7 to match the `#testing-a-builder-config-locally` section.	2025-06-26 10:30:55 +01:00
Nikita Popov	5fb0ae1a5b	[LegacyPM] Remove unused getAdjustedAnalysisPointer() method (NFC) (#145738 ) This never actually gets overridden and always returns this, so drop it. Noticed this looking into why the pass vtables are so huge.	2025-06-26 09:04:07 +02:00
Rahul Joshi	00f6d6ab66	[NFC][CodingStandard] Require``[[maybe_unused]]`` for unused variables in asserts (#142850 ) Require using attribute `[[maybe_unused]` for assert-only variables that may be unused in non-assert enabled builds to suppress unused variable warnings. --------- Co-authored-by: James Henderson <James.Henderson@sony.com> Co-authored-by: Nikita Popov <github@npopov.com>	2025-06-25 15:47:07 -07:00
Craig Topper	c3b160bcaa	[RISCV] Remove -mattr=+no-rvc-hints (#145138 ) As far as I know binutils does not have a similar option and I don't know of a reason we shouldn't accept the RVC hint instructions. The wording in the spec in the past suggested that maybe these weren't valid instruction names, but that's been modified recently.	2025-06-25 08:24:24 -07:00
Alexis Engelke	36819eaed1	[llvm-objdump] Support --symbolize-operand on AArch64 Similar to the existing implementations for X86 and PPC, support symbolizing branch targets for AArch64. Do not omit the address for ADRP as the target is typically not at an intended location. Pull Request: https://github.com/llvm/llvm-project/pull/145009	2025-06-25 17:09:25 +02:00
Scott Linder	c9910d5ef8	[LangRef] Clarify definition of fragments (#145582 ) Try to give a more complete description of what we call a "fragment", and how debug records interact when fragments are involved.	2025-06-25 10:54:51 -04:00
Tobias Hieta	31545ca5f4	[Docs] Incorporate the recent discussed changes to release process (#145285 ) * Move up the date 2 weeks * Make RC3 required * Add more releases to the table to align with the release every two weeks RFC. This has been discussed in the following RFCs: https://discourse.llvm.org/t/rfc-llvm-releases-every-two-weeks/84245 https://discourse.llvm.org/t/rfc-updating-and-aligning-the-llvm-release-process-before-llvm-21/86493	2025-06-25 09:36:42 +02:00
Aaron Ballman	f8ed45611b	Remove PSTL from the top-level monorepo (#143717 ) The RFC for this removal can be found at: https://discourse.llvm.org/t/rfc-removing-pstl/86807 Note, libc++ still supports PSTL. That support is integrated directly into the libc++ source tree. There is no release note for this removal because it's not really clear that this was user-facing facilities or where such a release note should live.	2025-06-24 13:26:19 -04:00
Andrew Rogers	e175ecff93	[llvm] revisions to `LLVM_ABI` export macro definitions (#144598 ) ## Purpose Simplify the logic used to define `LLVM_ABI` and related macros, eliminate the `LLVM_ABI_FRIEND` macro, and update the `LLVM_ABI` macro to always resolve to `__attribute__((visibility("default")))` when building LLVM as a shared library for ELF or Mach-O targets. ## Background Previously, `LLVM_ABI` was defined to the C++ style attribute `[[gnu::visibility("default")]]` when compiling with gcc, which has more restrictions on its placement. Of note, the C++ style attributes cannot decorate `friend` functions and must not appear after `extern` on variable declarations. Documentation for `LLVM_ABI` and related annotations is found in the LLVM repo [here](https://github.com/llvm/llvm-project/blob/main/llvm/docs/InterfaceExportAnnotations.rst). ## Overview - Define a new CMake config value, `LLVM_ENABLE_LLVM_EXPORT_ANNOTATIONS`, which is implicitly set whenever `LLVM_BUILD_LLVM_DYLIB`, `LLVM_BUILD_SHARED_LIBS`, or `LLVM_ENABLE_PLUGINS` is set. Add it as a `#cmakedefine` to llvm-config.h so its definition is available to projects building against LLVM as required so clients see `__declspec(dllimport)` on Windows. - Gate the `LLVM_ABI` macro definitions in Compiler.h behind the new `LLVM_ENABLE_LLVM_EXPORT_ANNOTATIONS` definition. This is simpler/cleaner, but should be equivalent to the previous logic. - Maintain `LLVM_BUILD_STATIC` as an override to be used by specific targets that don't want to build against the DLL/shared library, such as tablegen. - For ELF and Mach-O targets, directly define `LLVM_ABI` as `__attribute__((visibility("default")))` instead of `LLVM_ATTRIBUTE_VISIBILITY_DEFAULT`, which resolves to C++ style `[[gnu::visibility("default")]]` when compiling with gcc. - Remove the `LLVM_ABI_FRIEND` macro and replace all usages of it with `LLVM_ABI`. - Update the documentation for exporting friend functions to no longer reference `LLVM_ABI_FRIEND`. ## Validation - Built as static lib with clang and gcc on Linux. - Built as static with clang-cl and MSVC on Windows. - Built as shared lib with clang and gcc on Linux (+ additional local changes not yet merged). - Built as DLL with clang-cl and MSVC on Windows (+ additional local changes not yet merged). --------- Co-authored-by: SquallATF <squallatf@gmail.com>	2025-06-24 09:18:34 -07:00
Diana Picus	a201f8872a	[AMDGPU] Replace dynamic VGPR feature with attribute (#133444 ) Use a function attribute (amdgpu-dynamic-vgpr) instead of a subtarget feature, as requested in #130030.	2025-06-24 11:09:36 +02:00
Matt Arsenault	f0d898f36b	DAG: Move get_dynamic_area_offset type check to IR verifier (#145268 ) Also fix the LangRef to match the implementation. This was checking against the alloca address space size rather than the default address space. The check was also more permissive than the LangRef. The error check permitted any size less than the pointer size; follow the stricter wording of the LangRef.	2025-06-24 11:11:52 +09:00
Jim Lin	2f9c97c030	[RISCV] Add Andes AX45MPV processor definition (#145267 ) Andes AX45MPV is 64-bit in-order dual-issue 8-stage pipeline linux-capable CPU implementing the RV64IMAFDCV ISA extension. That is developed by Andes Technology https://www.andestech.com, a RISC-V IP provider. The overviews for AX45MPV: https://www.andestech.com/en/products-solutions/andescore-processors/riscv-ax45mpv/ Scheduling model for RVV extension will be implemented a follow-up PR.	2025-06-24 08:57:55 +08:00
Wenju He	9d570d568b	[ValueTracking] Return true for AddrSpaceCast in canCreateUndefOrPoison (#144686 ) In our downstream GPU target, following IR is valid before instcombine although the second addrspacecast causes UB. define i1 @test(ptr addrspace(1) noundef %v) { %0 = addrspacecast ptr addrspace(1) %v to ptr addrspace(4) %1 = call i32 @llvm.xxxx.isaddr.shared(ptr addrspace(4) %0) %2 = icmp eq i32 %1, 0 %3 = addrspacecast ptr addrspace(4) %0 to ptr addrspace(3) %4 = select i1 %2, ptr addrspace(3) null, ptr addrspace(3) %3 %5 = icmp eq ptr addrspace(3) %4, null ret i1 %5 } We have a custom optimization that replaces invalid addrspacecast with poison, and IR is still valid since `select` stops poison propagation. However, instcombine pass optimizes `select` to `or`: %0 = addrspacecast ptr addrspace(1) %v to ptr addrspace(4) %1 = call i32 @llvm.xxxx.isaddr.shared(ptr addrspace(4) %0) %2 = icmp eq i32 %1, 0 %3 = addrspacecast ptr addrspace(1) %v to ptr addrspace(3) %4 = icmp eq ptr addrspace(3) %3, null %5 = or i1 %2, %4 ret i1 %5 The transform is invalid for our target. --------- Co-authored-by: Nikita Popov <github@npopov.com>	2025-06-24 08:43:47 +08:00
Benji Smith	8f01edfa11	[C API] Add getter/setter for samesign flag on icmp (#145247 ) This was added to the C++ API in https://github.com/llvm/llvm-project/pull/111419 so this change adds accessors in the C API, along with a couple tests.	2025-06-22 18:05:17 -04:00
Chenguang Wang	72de0e4584	[TableGen][Docs] Fix empty list syntax in TableGen doc. (#145041 ) `[]<list<int>>` actually produces `list<list<int>>`.	2025-06-20 09:07:35 -07:00
Stanislav Mekhanoshin	69974658f0	[AMDGPU] Initial support for gfx1250 target. (#144965 ) This is just a stub for now.	2025-06-19 22:52:51 -07:00
Nicolai Hähnle	3bee9ba015	AMDGPU/GFX12: Fix s_barrier_signal_isfirst for single-wave workgroups (#143634 ) Barrier instructions are no-ops in single-wave workgroups. This includes s_barrier_signal_isfirst, which will leave SCC unmodified. Model this correctly (via an implicit use of SCC) and ensure SCC==1 before the barrier instruction (if the wave is the only one of the workgroup, then it is the first). --------- Co-authored-by: Matt Arsenault <arsenm2@gmail.com>	2025-06-19 11:22:49 -07:00
Rajat Bajpai	590066bee7	[NVPTX] Add family-specific architectures support (#141899 ) This change adds family-specific architecture variants support added in [PTX ISA 8.8](https://docs.nvidia.com/cuda/parallel-thread-execution/#ptx-isa-version-8-8). These architecture variants have "f" suffix. For example, sm_100f. This change doesn't promote existing features to family-specific architecture.	2025-06-19 12:18:17 +05:30
Tobias Stadler	1f34d68c4f	[Remarks] Remove yaml-strtab format (#144527 ) Background: The yaml-strtab format looks just like the yaml format, except that the values in the key/value pairs of the remarks are deduplicated and replaced by indices into a string table (see removed test cases for examples). The motivation behind this format was to reduce size of the remarks files. However, it was quickly superseded by the bitstream format. Therefore, remove the yaml-strtab format, as it doesn't have a good usecase anymore: - It isn't particularly efficient - It isn't human-readable - It isn't straightforward to parse in external tools that can't use the remarks library. We don't even support it in opt-viewer. llvm-remarkutil is also missing options to parse/convert yaml-strtab, so the chance that anyone is actually using this format is low.	2025-06-18 14:25:41 +01:00
Jim Lin	8ddada41df	[RISCV] Add Andes XAndesVBFHCvt (Andes Vector BFLOAT16 Conversion) extension (#144320 ) The spec can be found at: https://github.com/andestech/andes-v5-isa/releases/tag/ast-v5_4_0-release. This patch only supports assembler. The instructions are similar to `Zvfbfmin` and the only difference with `Zvfbfmin` is that `XAndesVBFHCvt` doesn't have mask variant.	2025-06-18 09:17:46 +08:00
Sam Elliott	98c6c371d6	[RISCV] Xqccmp v0.3 (#137854 ) All the changes for v0.2 and v0.3 are either already implemented, or irrelevant to the compiler implementation.	2025-06-16 22:13:45 -07:00
Sam Elliott	c0ac95181e	[RISCV] Update Xqci to v0.13.0 (#144398 )	2025-06-16 22:12:12 -07:00
Scott Linder	e8362234f6	[Object][AMDGPU] Support REL relocations (#143966 ) Shaders compiled with DXC/LLPC generate these relocations, and even if that changes in the future we want to handle existing binaries. The friction to support this and the maintenance cost long term both seem incredibly low, considering other targets like ARM support both REL/RELA static relocations behind the same interface.	2025-06-16 15:03:02 -04:00
Yingwei Zheng	299a55a88f	[InstCombine][Docs] Update InstCombine contributor guide (#144228 ) Update the guideline to reduce the chance of miscompilation/performance regression. --------- Co-authored-by: Nikita Popov <github@npopov.com> Co-authored-by: Antonio Frighetto <me@antoniofrighetto.com>	2025-06-16 18:07:27 +08:00
Javier Lopez-Gomez	383b326879	[llvm-debuginfo-analyzer] Fix ODR violation in llvm::logicalview::LVObject (#140265 ) Some data members are only part of a class definition in a Debug build, e.g. `LVObject::ID`. If `debuginfologicalview` is used as a library, `NDEBUG` cannot be used for this purpose, as this PP macro may have a different definition in a downstream project, which in turn triggers an ODR violation. Fix it by - Making `LVObject::ID` an unconditional data member. - Making `LVObject::dump()` non-virtual. Rationale: `virtual` is not needed (and it calls `print()`, which is virtual anyway). Fixes #139098.	2025-06-16 10:47:00 +02:00
Sudharsan Veeravalli	7d9a451d87	[RISCV] Change input register type for QC_SWM and QC_SWMI (#144294 ) Version 0.13 of the `Xqci` spec changes the register type of input operand `rs3` from `GPR` to `GPRNoX0` for these two instructions. The spec can be found at https://github.com/quic/riscv-unified-db/releases/tag/Xqci-0.13.0	2025-06-16 12:28:12 +05:30
Fabian Meumertzheim	83f215b035	Reland "[llvm-cov] Add support for baseline coverage" (#144130 ) When no profile is provided, but the new --empty-profile option is specified, the export/report/show commands now emit coverage data equivalent to that obtained from a profile with all zero counters ("baseline coverage"). This is useful for build systems (e.g. Bazel) that can track coverage information for each build target, even those that are never linked into tests and thus don't have runtime coverage data recorded. By merging in baseline coverage, lines in files that aren't linked into tests are correctly reported as uncovered. Reland with fixes to `CoverageMappingTest.cpp`. Reverts llvm/llvm-project#144121	2025-06-13 12:09:58 -07:00
Keith Smiley	65d88d31ea	Revert "[llvm-cov] Add support for baseline coverage" (#144121 ) Reverts llvm/llvm-project#117910 ``` /home/buildbots/llvm-external-buildbots/workers/ppc64le-lld-multistage-test/ppc64le-lld-multistage-test/llvm-project/llvm/unittests/ProfileData/CoverageMappingTest.cpp /home/buildbots/llvm-external-buildbots/workers/ppc64le-lld-multistage-test/ppc64le-lld-multistage-test/llvm-project/llvm/unittests/ProfileData/CoverageMappingTest.cpp:281:28: error: 'std::reference_wrapper' may not intend to support class template argument deduction [-Werror,-Wctad-maybe-unsupported] 281 \| std::make_optional(std::reference_wrapper(*ProfileReader)); \| ^ /usr/lib/gcc/ppc64le-redhat-linux/8/../../../../include/c++/8/bits/refwrap.h:289:11: note: add a deduction guide to suppress this warning 289 \| class reference_wrapper \| ^ ```	2025-06-13 10:04:45 -07:00
Daniel Hernandez-Juarez	68b6f392ed	[MLIR][AMDGPU] Fix bug in GatherToLDSOpLowering, get the correct MemRefType for destination (#142915 ) This PR fixes a bug in GatherToLDSOpLowering, we were getting the MemRefType of source for the destination. Additionally, some related typos are corrected. CC: @krzysz00 @umangyadav @lialan	2025-06-13 11:33:51 -05:00
Fabian Meumertzheim	dc9e300f12	[llvm-cov] Add support for baseline coverage (#117910 ) When no profile is provided, but the new --empty-profile option is specifed, the export/report/show commands now emit coverage data equivalent to that obtained from a profile with all zero counters ("baseline coverage"). This is useful for build systems (e.g. Bazel) that can track coverage information for each build target, even those that are never linked into tests and thus don't have runtime coverage data recorded. By merging in baseline coverage, lines in files that aren't linked into tests are correctly reported as uncovered.	2025-06-13 08:49:30 -07:00
Stephen Tozer	cc365331af	[DLCov] Origin-Tracking: Add config options (#143590 ) This patch is part of a series that adds origin-tracking to the debugify source location coverage checks, allowing us to report symbolized stack traces of the point where missing source locations appear. This patch adds the configuration options needed to enable this feature, in the form of a new CMake option that enables a flag in `llvm-config.h`; this is not an entirely new CMake flag, but a new option, `COVERAGE_AND_ORIGIN`, for the existing flag `LLVM_ENABLE_DEBUGLOC_COVERAGE_TRACKING`. This patch contains documentation, but no actual implementation for the flag itself.	2025-06-13 12:54:30 +01:00
Diana Picus	a5cbd2ab0b	Revert "[AMDGPU] Skip register uses in AMDGPUResourceUsageAnalysis (#… (#144039 ) …133242)" This reverts commit `130080fab1` because it causes issues in testcases similar to coalescer_remat.ll [1], i.e. when we use a VGPR tuple but only write to its lower parts. The high VGPRs would then not be included in the vgpr_count, and accessing them would be an out of bounds violation. [1] https://github.com/llvm/llvm-project/blob/main/llvm/test/CodeGen/AMDGPU/coalescer_remat.ll	2025-06-13 12:48:24 +02:00
Saiyedul Islam	432d06ab91	[NFC][AMDGPU] Fix stale links to ROCm repositories (#143949 ) Following GitHub organizations were merged into the ROCm org: * ROCm-Developer-Tools * RadeonOpenCompute * ROCmSoftwarePlatform Ensure that all hyperlinks to the old organizations now point to the new organization at https://github.com/ROCm.	2025-06-13 11:33:52 +05:30
Durgadoss R	3e5d50f9c6	[NVPTX] Add cta_group support to TMA G2S intrinsics (#143178 ) This patch extends the TMA G2S intrinsics with the support for cta_group::1/2 available from Blackwell onwards. The existing intrinsics are auto-upgraded with a default value of '0' for the `cta_group` flag operand. * lit tests are added for all combinations of the newer variants. * Negative tests are added to validate the error-handling when the value of the cta_group flag falls out-of-range. * The generated PTX is verified with a 12.8 ptxas executable. Signed-off-by: Durgadoss R <durgadossr@nvidia.com>	2025-06-12 15:20:39 +05:30
Paul Kirth	ad2a2b8eed	[llvm] Add a tool to check mustache compliance against the public spec (#142813 ) This is a cli tool to that tests the conformance of LLVM's mustache implementation against the public Mustache spec, hosted at https://github.com/mustache/spec. This is a revised version of the patches in #111487. Co-authored-by: Peter Chou <peter.chou@mail.utoronto.ca>	2025-06-11 13:05:21 -07:00
RonDahan101	fa9e1a1515	[AArch64] Expand llvm.histogram intrinsic to support umax, umin, and uadd.sat operations (#138447 ) This patch extends the llvm.histogram intrinsic to support additional update operations beyond the existing add. Specifically, the new supported operations are: * umax: unsigned maximum * umin: unsigned minimum * uadd.sat: unsigned saturated addition Based on the discussion from: https://discourse.llvm.org/t/rfc-expanding-the-experimental-histogram-intrinsic/84673	2025-06-11 15:15:24 +01:00
arysef	78af498035	[LLVM][IR] Support target extension types in vectors (#140630 ) This change is to support target extension types in vectors. The change allows sized target extension types to opt-in to being a valid vector element. Allowing target extension types as vector elements will allow backends to use vector operations such as `insertelement` and `extractelement` on their target types with minimal changes. RFC: https://discourse.llvm.org/t/rfc-supporting-sized-target-extension-types-in-vector/86431	2025-06-09 12:03:15 -07:00
Durgadoss R	c4012bb5de	[NVPTX] Add pm_event intrinsics (#141278 ) This patch adds the pm_event.mask intrinsic and its clang-builtin. Signed-off-by: Durgadoss R <durgadossr@nvidia.com>	2025-06-06 19:39:33 +05:30

1 2 3 4 5 ...

11732 Commits