This is the intrinsic version of #146349, and handles fabs as well as
other intrinsics.
It's largely a copy of InstCombinerImpl::foldShuffledIntrinsicOperands
but a bit simpler since we don't need to find a common mask.
Creating a separate function seems cleaner than trying to shoehorn
it into the existing one.
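A standalone sketch (not the InstCombine code itself) of why the fold is
sound for an elementwise intrinsic like fabs: applying the intrinsic before
or after the shuffle yields the same elements.
```cpp
#include <array>
#include <cmath>
#include <cstdio>

int main() {
  const std::array<double, 4> X = {-1.0, 2.0, -3.0, 4.0};
  const std::array<int, 4> Mask = {3, 2, 1, 0};

  // fabs(shuffle(x, mask))
  std::array<double, 4> FabsOfShuffle;
  for (int I = 0; I < 4; ++I)
    FabsOfShuffle[I] = std::fabs(X[Mask[I]]);

  // shuffle(fabs(x), mask)
  std::array<double, 4> Fabs, ShuffleOfFabs;
  for (int I = 0; I < 4; ++I)
    Fabs[I] = std::fabs(X[I]);
  for (int I = 0; I < 4; ++I)
    ShuffleOfFabs[I] = Fabs[Mask[I]];

  std::printf("%d\n", FabsOfShuffle == ShuffleOfFabs); // prints 1
}
```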
Delay the erasure of an op, so that the insertion point of the rewriter
remains valid.
This commit is in preparation for the One-Shot Dialect Conversion
refactoring. (The current implementation works with the current dialect
conversion driver because op erasure is delayed.)
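A generic illustration of the delayed-erasure pattern (a sketch only, not
the dialect conversion API; the `Rewriter` interface here is assumed):
```cpp
#include <vector>

// Defer erasure until all rewriting is done, so that no insertion point can
// still refer to an op that has already been erased.
template <typename Op, typename Rewriter>
void rewriteAll(const std::vector<Op *> &Ops, Rewriter &R) {
  std::vector<Op *> ToErase;
  for (Op *O : Ops)
    if (R.rewrite(O))       // may leave insertion points at or near O
      ToErase.push_back(O); // do not erase yet
  for (Op *O : ToErase)
    R.erase(O);             // safe: all rewriting has finished
}
```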
This patch is part of an effort to remove the
`ResolveSDKPathFromDebugInfo` method, and more specifically the variant
which takes a Module as argument.
See the following PR for a follow-up on what to do:
- https://github.com/llvm/llvm-project/pull/144913.
---------
Co-authored-by: Michael Buch <michaelbuch12@gmail.com>
Add a flag to tco for emitting the final MLIR, prior to lowering to LLVM
IR. This is intended to produce output that can be passed directly to
mlir-translate.
---------
Signed-off-by: Kajetan Puchalski <kajetan.puchalski@arm.com>
Summary:
This adds a basic outline for adding 'conformance' tests. These are
tests that are intended to check device code against a standard. In this
case, we will expect this to be filled with math conformance tests to
make sure their results are within the ULP requirements we demand.
Right now this just *assumes* the GPU libc is there, meaning you'll
likely need to do a manual `ninja` before doing `ninja -C
runtimes/runtimes-bins offload.conformance`.
Given an atomic operation `w = max(w, x1, x2, ...)`, rewrite it as `w =
max(w, max(x1, x2, ...))`. This will avoid unnecessary non-atomic
comparisons inside the atomic operation (min/max are expanded
inline).
In particular, if some of the x_i's are optional dummy parameters in the
containing function, this will avoid any presence tests within the
atomic operation.
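A minimal C++ sketch of the rewritten shape (illustrative only, not the
generated code), assuming integer max:
```cpp
#include <algorithm>
#include <atomic>

void atomicMax3(std::atomic<int> &W, int X1, int X2) {
  int Rhs = std::max(X1, X2); // max(x1, x2): plain, non-atomic code
  int Old = W.load();
  // Atomic part reduced to a single two-operand max: w = max(w, rhs).
  while (!W.compare_exchange_weak(Old, std::max(Old, Rhs))) {
    // On failure, Old is refreshed and the max is recomputed.
  }
}
```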
Fixes https://github.com/llvm/llvm-project/issues/144838
std::prev(Paired) returns the previous instruction, which might skip
over the instructions in a bundle to the BUNDLE instruction itself. Change
it to Paired->getPrevNode() to make sure we update the registers in each
instruction in the bundle.
This unbreaks the C++20 buildbot that has been broken since
402baea0a9.
With implicit conversions in C++20 compilation mode, the spaceship
operator will unintentionally be based on `operator bool`:
```cpp
auto foo(Location L, Location R) {
  return L <=> R;
  // Equivalent to the following line due to implicit conversions.
  // return L.operator bool() <=> R.operator bool();
}
```
The spaceship operator is rarely used explicitly, but its implicit uses
in the STL may cause surprising results, as exposed by the use of `std::tie`
in 402baea0a9, which ended up changing the
comparison results unintentionally.
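A minimal standalone reproduction of the pitfall, using a stand-in `Loc`
type in place of `Location`:
```cpp
#include <compare>

struct Loc {
  int Id = 0;
  constexpr operator bool() const { return Id != 0; }
};

// No operator<=> is declared for Loc, yet this compiles in C++20: both
// operands implicitly convert via bool first, so two distinct locations
// unexpectedly compare equal.
static_assert((Loc{1} <=> Loc{2}) == 0);
```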
Add a convenience wrapper struct for the `bit_value_t` enum type to host
various constructors, queries, and printing support. Also refactor related
code in several places. In `getBitsField`, use `llvm::append_range` and
`SmallVector::append()` and eliminate manual loops. Eliminate
`emitNameWithID` and instead use the `operator<<` that does the same
thing as this function. Have `BitValue::getValue()` (the replacement for
`Value`) return `std::optional<>` instead of -1 for unset bits. Terminate
with a fatal error when a decoding conflict is encountered.
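A hypothetical sketch of the wrapper's shape (names and enumerators here
are illustrative, not copied from the patch):
```cpp
#include <cstdint>
#include <optional>

struct BitValue {
  enum bit_value_t { BIT_FALSE, BIT_TRUE, BIT_UNSET, BIT_UNFILTERED } V;

  BitValue(bit_value_t V) : V(V) {}

  bool isSet() const { return V == BIT_FALSE || V == BIT_TRUE; }

  // Replacement for `Value`: unset bits are std::nullopt rather than -1.
  std::optional<uint64_t> getValue() const {
    if (isSet())
      return V == BIT_TRUE ? 1 : 0;
    return std::nullopt;
  }
};
```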
As a preliminary to making DIL the default implementation for
'frame var', I ran check-lldb forcing 'frame var' to always use DIL
and discovered a few failing tests. This fixes most of them. The only
remaining failing test is TestDAP_evaluate.py, which now passes
a test case that the test says should fail (still investigating this).
Changes in this PR:
- Sets the correct VariableSP, as well as returning the ValueObjectSP
(needed for several watchpoint tests).
- Updates error messages, when looking up members, to match what the
rest of LLDB expects. Also updates the appropriate DIL tests to expect
the updated error messages.
- Updates DIL parser to look for and accept "(anonymous namespace)::" at
the front of a variable name.
This PR adds a mechanism so that downstream consumers can pass in
control functions for the application of these patterns. This change
shouldn't affect any consumers of this method that do not specify a
controlFn. The controlFn always receives the source operand of the consumer
in each of the patterns as a parameter.
In IREE, we (will) use it to prevent folding patterns that
would inhibit fusion. See IREE issue
[#20896](https://github.com/iree-org/iree/issues/20896) for more
details.
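A generic sketch of the mechanism (the `ControlFn` shape here is assumed,
not the exact MLIR signature):
```cpp
#include <functional>

// Each pattern consults a caller-provided control function with the source
// operand of the consumer; a null function keeps the old, unconditional
// behavior.
template <typename OpOperandT>
struct ControlledPattern {
  std::function<bool(OpOperandT *)> ControlFn;

  bool canApply(OpOperandT *SourceOperand) const {
    return !ControlFn || ControlFn(SourceOperand);
  }
};
```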
Commit 40aab0412f "[test]
Migrate -gcc-toolchain with space separator to --gcc-toolchain=" made
two previously different RUN lines identical.
Remove one of them.
I found this peculiar comment in EarlyCSE:
1c78d8d9d7/llvm/lib/Transforms/Scalar/EarlyCSE.cpp (L1620-L1624)
Looking back over history, this seems to be referring to the
aarch64.neon.stN intrinsics, which are indeed not marked writeonly
(though the ldN intrinsics are readonly).
Possibly I'm missing something special about these intrinsics, but I
think it is safe to mark them as writeonly.
In both `bubbleUpPackOpThroughGenericOp()` and
`pushDownUnPackOpThroughGenericOp()`, we can simplify the lowered IR by
removing the pack of an empty tensor when the init tensor isn't used in
the generic op. Instead of packing an empty tensor, the empty tensor can
be forwarded to the generic output. This produces a cleaner result after
data layout propagation.
This pass reifies the shapes of a subset of
`ReifyRankedShapedTypeOpInterface` ops with `tensor` results.
The pass currently only supports result shape type reification for:
- tensor::PadOp
- tensor::ConcatOp
It addresses a representation gap where implicit op semantics are needed
to infer static result types from dynamic
operands. But it does so by using `ReifyRankedShapedTypeOpInterface` as
the source of truth rather than the op itself.
As a consequence, this cannot generalize today.
TODO: in the future, we should consider coupling this information with
op "transfer functions" (e.g.
`IndexingMapOpInterface`) to provide a source of truth that can work
across result shape inference, canonicalization and
op verifiers.
The pass replaces the operations with their reified versions when more
static information can be derived, and inserts
casts when result shapes are updated.
Example:
```mlir
#map = affine_map<(d0) -> (-d0 + 256)>
func.func @func(%arg0: f32, %arg1: index, %arg2: tensor<64x?x64xf32>) -> tensor<1x?x64xf32> {
  %0 = affine.apply #map(%arg1)
  %extracted_slice = tensor.extract_slice %arg2[0, 0, 0] [1, %arg1, 64] [1, 1, 1] : tensor<64x?x64xf32> to tensor<1x?x64xf32>
  %padded = tensor.pad %extracted_slice low[0, 0, 0] high[0, %0, 0] {
  ^bb0(%arg3: index, %arg4: index, %arg5: index):
    tensor.yield %arg0 : f32
  } : tensor<1x?x64xf32> to tensor<1x?x64xf32>
  return %padded : tensor<1x?x64xf32>
}

// mlir-opt --reify-result-shapes
#map = affine_map<()[s0] -> (-s0 + 256)>
func.func @func(%arg0: f32, %arg1: index, %arg2: tensor<64x?x64xf32>) -> tensor<1x?x64xf32> {
  %0 = affine.apply #map()[%arg1]
  %extracted_slice = tensor.extract_slice %arg2[0, 0, 0] [1, %arg1, 64] [1, 1, 1] : tensor<64x?x64xf32> to tensor<1x?x64xf32>
  %padded = tensor.pad %extracted_slice low[0, 0, 0] high[0, %0, 0] {
  ^bb0(%arg3: index, %arg4: index, %arg5: index):
    tensor.yield %arg0 : f32
  } : tensor<1x?x64xf32> to tensor<1x256x64xf32>
  %cast = tensor.cast %padded : tensor<1x256x64xf32> to tensor<1x?x64xf32>
  return %cast : tensor<1x?x64xf32>
}
```
---------
Co-authored-by: Fabian Mora <fabian.mora-cordero@amd.com>
Fix a couple of unhandled edge cases in offload-tblgen that were found
by static analysis:
* `LineStart` may wrap around to 0 when processing multi-line strings.
The value is not actually used in that case, but it is still better to
handle it explicitly.
* Possible unchecked nullptr when processing parameter flags
This PR introduces support for the DWARF64 format, enabling handling of
64-bit DWARF sections as defined by the DWARF specification. The update
includes adjustments to header parsing and modification of form values
to accommodate 64-bit offsets and values.
Also added a test case to verify the DWARF64 format.
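A sketch of the core DWARF64 detection rule from the DWARF specification: a
32-bit initial length of 0xffffffff is an escape indicating that a 64-bit
unit length follows (the helper below is illustrative, not the patch's
actual parser):
```cpp
#include <cstdint>
#include <cstring>

struct InitialLength {
  uint64_t Length;
  bool IsDWARF64;
};

InitialLength readInitialLength(const uint8_t *&P) {
  uint32_t Len32;
  std::memcpy(&Len32, P, sizeof(Len32));
  P += sizeof(Len32);
  if (Len32 == 0xffffffffU) { // DWARF64 escape value
    uint64_t Len64;
    std::memcpy(&Len64, P, sizeof(Len64));
    P += sizeof(Len64);
    return {Len64, /*IsDWARF64=*/true};
  }
  // Values 0xfffffff0..0xfffffffe are reserved by the spec.
  return {Len32, /*IsDWARF64=*/false};
}
```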
This implements the async, wait, if, and if_present lowering (as well as
device_type, but that is a detail of async/wait). All of
these are implemented the same way they are for the compute constructs,
so this is a fairly mild set of changes.
Avoid constructing invalid ConstantRange when Offset + Length in memset
overflows signed 64-bit integer space. This prevents assertion failures
when inferring the initializes attribute.
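A sketch of the guard (the helper name is hypothetical; it relies on the
Clang/GCC `__builtin_add_overflow` builtin):
```cpp
#include <cstdint>

// The range [Offset, Offset + Length) may only be formed when the end point
// does not overflow a signed 64-bit integer.
bool canFormInitializesRange(int64_t Offset, int64_t Length) {
  int64_t End;
  return !__builtin_add_overflow(Offset, Length, &End);
}
```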
Fixes #140345
In addBranchWeightToMiddleTerminator we attempt to add branch weights to
the middle block terminator. We pessimistically assume vscale=1, whereas
we can improve the estimate by using the value of vscale used for
tuning.
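A worked example with assumed numbers (the function name is illustrative):
```cpp
// For a scalable VF of (vscale x 4) and a vscale-for-tuning value of 4, the
// estimate is 16 lanes per vector iteration, not the pessimistic 4 obtained
// by assuming vscale = 1.
unsigned estimateRuntimeVF(unsigned KnownMinVF, unsigned VScaleForTuning) {
  return KnownMinVF * VScaleForTuning; // 4 * 4 = 16
}
```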
This will convert loads of constant strings to immediate values. Put
this behind a flag that is enabled by default so that we can toggle it
if need be.
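A conceptual illustration (standalone C++, not the transform itself): a
fixed-width load from a constant string has a known value at compile time,
so it can be rewritten as an immediate.
```cpp
#include <cstdint>
#include <cstring>

static const char Str[] = "ABCD";

uint32_t loadWord() {
  uint32_t V;
  std::memcpy(&V, Str, sizeof(V)); // foldable to 0x44434241 (little-endian)
  return V;
}
```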
CTTZ/CTLZ_ZERO_UNDEF nodes can only create poison if the source value is zero, so check with isKnownNeverZero.
Pulled out of #146361 and reapplied now that #146490 has landed.
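An illustration of the reasoning (using the Clang/GCC `__builtin_ctz`
builtin as a stand-in for the CTTZ node):
```cpp
// Counting trailing zeros is only undefined/poison for a zero input, so a
// value proven nonzero (isKnownNeverZero) cannot yield poison.
unsigned cttzNonZero(unsigned X) {
  // Precondition established by the caller: X != 0.
  return __builtin_ctz(X);
}
```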
Following on from #118638, this handles widened induction variables with
EVL tail folding by setting the VF operand to be EVL, calculated in the
vector body.
We need to do this for correctness, since with EVL tail folding the
number of elements processed in the penultimate iteration may not be VF
but the runtime EVL, and we need to take this into account when updating
the backedge value.
- Because the VF may now not be a live-in, we need to move the insertion
point to just after the VF's definition.
- We also need to avoid truncating it when it's the same size as the
step type; previously this wasn't a problem for live-ins.
- Also, because the VF may be smaller than the IV type (the EVL is
always i32), we may need to zext it, as in the sketch below.
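A sketch of the updated backedge computation (illustrative types and names):
```cpp
#include <cstdint>

// Step the widened IV by the runtime EVL rather than by VF, zero-extending
// the i32 EVL to the IV's width before scaling by the step.
uint64_t nextWidenedIV(uint64_t IV, uint32_t EVL, uint64_t Step) {
  return IV + static_cast<uint64_t>(EVL) * Step; // zext i32 EVL to i64
}
```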
On -march=rva23u64 -O3 we get 87.1% more loops vectorized on TSVC, and
42.8% more loops vectorized on SPEC CPU 2017.
When declaring multiple arrays of 1 exabyte each in a struct, the offset
can exceed 2 EB, causing the struct size to be reported incorrectly (as
only 1 EB). This fix ensures an error is emitted, preventing the
generation of incorrect assembly (#60272).
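A hypothetical reproduction (sizes chosen for illustration): eight 1 EiB
(2^60 byte) arrays give a struct size of 2^63 bytes, which overflows a
signed 64-bit offset.
```cpp
// Expected behavior after the fix: the compiler reports an error instead of
// silently emitting a truncated struct size.
struct Huge {
  char A0[1ULL << 60], A1[1ULL << 60], A2[1ULL << 60], A3[1ULL << 60];
  char A4[1ULL << 60], A5[1ULL << 60], A6[1ULL << 60], A7[1ULL << 60];
};
```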