clang-p2996

Author	SHA1	Message	Date
Fabian Mora	878d3594ed	[mlir][vector] Avoid setting padding by default to `0` in `vector.transfer_read` prefer `ub.poison` (#146088 ) Context: `vector.transfer_read` always requires a padding value. Most of its builders take no `padding` value and assume the safe value of `0`. However, this should be a conscious choice by the API user, as it makes it easy to introduce bugs. For example, I found several occasions while making this patch that the padding value was not getting propagated (`vector.transfer_read` was transformed into another `vector.transfer_read`). These bugs were always caused by constructors that don't require specifying padding. Additionally, using `ub.poison` as a possible default value is better, as it indicates the user "doesn't care" about the actual padding value, forcing users to specify the actual padding semantics they want. With that in mind, this patch changes the builders in `vector.transfer_read` to always take a `std::optional<Value> padding` argument. This argument is never optional, but for convenience users can pass `std::nullopt`, padding the transfer read with `ub.poison`. --------- Signed-off-by: Fabian Mora <fabian.mora-cordero@amd.com>	2025-06-30 15:20:42 -04:00
Erich Keane	a99fee6989	[OpenACC][CIR] Implement 'exit data' construct + clauses (#146167 ) Similar to 'enter data', except the data clauses have a 'getdeviceptr' operation before, so that they can use the 'exit' operation correctly. While this is a touch awkward, it fits perfectly into the existing infrastructure. Same as with 'enter data', we had to add some add-functions for async and wait.	2025-06-30 06:19:43 -07:00
Luke Hutton	2e7aa7ead6	[mlir][tosa] Add custom operand getters for select op (#145921 ) The select op has 3 inputs: input1, input2, input3 according to the tosa specification. However, use of getInput1(), getInput2() and getInput3() in the codebase can be confusing and hinder readability. This commit adds custom getters to help improve readability: - input1 -> getPred() - input2 -> getOnTrue() - input3 -> getOnFalse() They should be preferred as they are more descriptive, however, the ODS generated getters (getInputX()) may still be used. Unfortunately the custom getters don't propagate to Adaptors such as `FoldAdaptor`, so the ODS generated getters must be used there.	2025-06-30 10:11:09 +01:00
Markus Böck	8602204d9f	[mlir][tensor] Relax input type requirement on `tensor.splat` (#145893 ) `tensor.splat` is currently restricted to only accepting input values that are of integer, index or float type. This is much more restrictive than the tensor type itself as well as any lowerings of it. This PR therefore removes this restriction by using `AnyType` for the input value. Whether the type is actually valid or not for a tensor remains verified through the type equality of the result tensor element type and the input type.	2025-06-30 09:49:19 +02:00
Jaden Angella	d4b5905a25	Add `final` specifier to the classop (#145977 ) In some use cases of the `ClassOp`, e.g. MLGO, we would like to be able to declare the class as final. This specifier allows for that.	2025-06-29 19:00:59 -07:00
Kazu Hirata	b5cd49eff0	[mlir] Remove unused includes (NFC) (#146278 ) These are identified by misc-include-cleaner. I've filtered out those that break builds. Also, I'm staying away from llvm-config.h, config.h, and Compiler.h, which likely cause platform- or compiler-specific build failures.	2025-06-29 12:13:12 -07:00
Uday Bondhugula	80625c16f0	[MLIR][Affine] Fix memref replacement in affine-data-copy-generate (#139016 ) Fixes: https://github.com/llvm/llvm-project/issues/130257 Fix affine-data-copy-generate in certain cases that involved users in multiple blocks. Perform the memref replacement correctly during copy generation. Improve/clean up memref affine use replacement API. Instead of supporting dominance and post dominance filters (which aren't adequate in most cases) and computing dominance info expensively each time in RAMUW, provide a user filter callback, i.e., force users to compute dominance if needed.	2025-06-28 10:27:11 +05:30
Christopher McGirr	29b1054835	[mlir][linalg] Update pack and unpack documentation (#143903 ) * Clarified the `inner_dim_pos` attribute in the case of high dimensionality tensors. * Added 5D examples to showcase the use cases that triggered this update. * Added a reminder for linalg.unpack that the number of elements is not required to be the same between input/output due to padding being dropped. I encountered some odd variations of `linalg.pack` and `linalg.unpack` while working on some TFLite models and the definition in the documentation did not match what I saw pass in IR verification. The following changes reconcile those differences. --------- Signed-off-by: Christopher McGirr <mcgirr@roofline.ai>	2025-06-27 13:55:26 -07:00
Erich Keane	33d20828d1	[OpenACC][CIR] Implement enter-data + clause lowering (#146146 ) 'enter data' is a new construct type that requires one of the data clauses, so we had to wait for all clauses to be ready before we could commit this. Most of the clauses are simple, but there is a little bit of work to get 'async' and 'wait' to have similar interfaces in the ACC dialect, where helpers were added.	2025-06-27 13:47:42 -07:00
Kazu Hirata	9d6cbc3c20	[ADT] Deprecate MutableArrayRef(std::nullopt) (#146113 ) ArrayRef(std::nullopt) just got deprecated. This patch does the same to MutableArrayRef(std::nullopt). Since there are only a couple of uses, this patch does migration and deprecation at the same time.	2025-06-27 11:31:11 -07:00
Momchil Velikov	3876e887d0	[MLIR][ArmSVE] Add an ArmSVE dialect operation mapping to `bfmmla` (#145064 )	2025-06-27 15:37:13 +01:00
Erich Keane	3463aba45f	[OpenACC][CIR] Implement copyin/copyout/create lowering for compute/combined (#145976 ) This patch does the lowering of copyin (represented as a acc.copyin/acc.delete), copyout (acc.create/acc.copyin), and create (acc.create/acc.delete). Additionally, it found a few problems with #144806, so it fixes those as well.	2025-06-27 07:25:58 -07:00
Andrzej Warzyński	541f33e075	[mlir][linalg] Prevent hoisting of transfer pairs in the presence of aliases (#145235 ) This patch adds additional checks to the hoisting logic to prevent hoisting of `vector.transfer_read` / `vector.transfer_write` pairs when the underlying memref has users that introduce aliases via operations implementing `ViewLikeOpInterface`. Note: This may conservatively block some valid hoisting opportunities and could affect performance. However, as demonstrated by the included tests, the current logic is too permissive and can lead to incorrect transformations. If this change prevents hoisting in cases that are provably safe, please share a minimal repro - I'm happy to explore ways to relax the check. Special treatment is given to `memref.assume_alignment`, mainly to accommodate recent updates in: * https://github.com/llvm/llvm-project/pull/139521 Note that such special casing does not scale and should generally be avoided. The current hoisting logic lacks robust alias analysis. While better support would require more work, the broader semantics of `memref.assume_alignment` remain somewhat unclear. It's possible this op may eventually be replaced with the "alignment" attribute added in: * https://github.com/llvm/llvm-project/pull/144344	2025-06-27 13:18:15 +01:00
long.chen	aed8f1992a	[NFC][mlir][memref] refine debug message about memref::SubViewOp. (#145470 )	2025-06-27 18:34:45 +08:00
Twice	c3e08c9b89	[MLIR] Replace getVoidPtrType with getPtrType in ConvertToLLVMPattern (#145657 ) `ConversionPattern::getVoidPtrType` looks a little confusing since the opaque pointer migration is already done. Also we cannot specify address space in this method. Maybe we can mark them as deprecated and add new method `getPtrType()`, as this PR did : )	2025-06-27 12:31:53 +02:00
Christopher McGirr	96c1611163	[mlir][linalg] fix OuterUnitDims linalg.pack decomposition pattern (#141613 ) Given the following example: ``` module { func.func @main(%arg0: tensor<1x1x1x4x1xf32>, %arg1: tensor<1x1x4xf32>) -> tensor<1x1x1x4x1xf32> { %pack = linalg.pack %arg1 outer_dims_perm = [1, 2, 0] inner_dims_pos = [2, 0] inner_tiles = [4, 1] into %arg0 : tensor<1x1x4xf32> -> tensor<1x1x1x4x1xf32> return %pack : tensor<1x1x1x4x1xf32> } } ``` We would generate an invalid transpose operation because the calculated permutation would be `[0, 2, 0]`, which is semantically incorrect: the permutation must contain unique integers corresponding to the source tensor dimensions. The following change modifies how we calculate the permutation array and ensures that the dimension indices given in the permutation array are unique. The above example would then translate to a transpose having a permutation of `[1, 2, 0]`, following the rule that the `inner_dim_pos` is appended to the permutation array and the preceding indices are filled with the remaining dimensions.	2025-06-27 09:24:33 +02:00
Kazu Hirata	c7b34b0b44	[mlir] Use a new constructor of ArrayRef (NFC) (#146009 ) ArrayRef now has a new constructor that takes a parameter whose type has data() and size(). This patch migrates: ArrayRef<T>(X.data(), X.size()) to: ArrayRef<T>(X)	2025-06-26 23:38:20 -07:00
Han-Chung Wang	0515449f6d	[mlir][tensor][memref] Enhance collapse(expand(src)) canonicalization pattern. (#145995 )	2025-06-26 19:39:50 -07:00
Jaden Angella	1dfdd1e6de	[mlir][emitC] Add support to emitter for `classop`, `fieldop` and `getfieldop` (#145605 ) Add support to the emitter for `ClassOp`, `FieldOp` and `GetFieldOp`. These ops were introduced in #141158	2025-06-26 13:54:05 -07:00
Abid Qadeer	232c2921e1	Reland [mlir][OpenMP] Use correct debug location with link clause. (#145889 ) https://github.com/llvm/llvm-project/pull/145026 was reverted because it failed a sanitizer test. That issue has been fixed in https://github.com/llvm/llvm-project/pull/145883.	2025-06-26 19:32:30 +01:00
Diego Caballero	7842e9eada	[mlir][Vector] Lower `vector.to_elements` to LLVM (#145766 ) Only elements with at least one use are lowered to `llvm.extractelement` op.	2025-06-26 10:36:08 -07:00
Momchil Velikov	e0b83ca8a4	[MLIR][ArmNeon] Add a couple of negative tests for BFMMLA with scalable dimensions (#145882 )	2025-06-26 17:09:59 +01:00
Kazu Hirata	abc2c3a538	[mlir] Use llvm::is_contained instead of llvm::any_of (NFC) (#145845 ) llvm::is_contained is shorter than llvm::any_of plus a lambda.	2025-06-26 08:41:26 -07:00
Kazu Hirata	70dce3d987	[mlir] Migrate away from std::nullopt (NFC) (#145842 ) ArrayRef has a constructor that accepts std::nullopt. This constructor dates back to the days when we still had llvm::Optional. Since the use of std::nullopt outside the context of std::optional is kind of abuse and not intuitive to newcomers, I would like to move away from the constructor and eventually remove it. This patch replaces std::nullopt with {}.	2025-06-26 08:41:02 -07:00
Momchil Velikov	d13e223a89	[MLIR][AArch64] Add integration test for lowering of `vector.contract` to Neon FEAT_I8MM (#144699 )	2025-06-26 16:01:00 +01:00
Frank Schlimbach	2bece2194a	[mlir][mesh] resubmitting #144079 (#145897 ) #144079 introduced a test with an uninitialized access Buildbot failure: https://lab.llvm.org/buildbot/#/builders/164/builds/11140 and got reverted #145531 This PR is an exact copy of #144079 plus a trivial fix (96c8525c82c5474d8522a973553e85a2033bee5b).	2025-06-26 16:36:17 +02:00
Nicolas Vasilache	5bf43634c4	[mlir][affine] NFC Rename SimplifyAffineMinMax -> SimplifyAffineMinMaxPass (#145905 ) This is more consistent re. auto-generated names like `createSimplifyAffineMinMaxPass`.	2025-06-26 16:34:47 +02:00
Nicolas Vasilache	a1d5b9d1cd	[mlir][affine] Wrap SimplifyAffineMinMax in a pass (#145741 ) This revision adds a pass working on FunctionOpInterface to connect recently introduced AffineMin/Max simplification patterns. Additionally fixes some minor issues that have surfaced upon larger scale testing.	2025-06-26 15:32:51 +02:00
Benjamin Kramer	6a5469bb81	[bazel] Fixes for `e5a8c51c9d`	2025-06-26 15:06:18 +02:00
Nicolas Vasilache	e5a8c51c9d	[mlir][tensor] Make tensor::PadOp a ReifyRankedShapedTypeOpInterface (#145867 ) Co-authored-by: Fabian Mora <fmora.dev@gmail.com>	2025-06-26 14:40:57 +02:00
Benjamin Kramer	3287c1c176	[XeGPU] Move targetinfo constants to their own header file This breaks the dependency from Dialect to Utils, which would be cyclic.	2025-06-26 13:53:00 +02:00
Momchil Velikov	a2265423d0	[MLIR][ArmNeon] Add an ArmNeon operation which maps to `bfmmla` (#145038 )	2025-06-26 10:43:36 +01:00
Andrzej Warzyński	e9e25f02e6	[mlir][vector] Restrict vector.insert/vector.extract to disallow 0-d vectors (#121458 ) This patch enforces a restriction in the Vector dialect: the non-indexed operands of `vector.insert` and `vector.extract` must no longer be 0-D vectors. In other words, rank-0 vector types like `vector<f32>` are disallowed as the source or result. EXAMPLES -------- The following are now illegal (note the use of `vector<f32>`): ```mlir %0 = vector.insert %v, %dst[0, 0] : vector<f32> into vector<2x2xf32> %1 = vector.extract %src[0, 0] : vector<f32> from vector<2x2xf32> ``` Instead, use scalars as the source and result types: ```mlir %0 = vector.insert %v, %dst[0, 0] : f32 into vector<2x2xf32> %1 = vector.extract %src[0, 0] : f32 from vector<2x2xf32> ``` Note, this change serves three goals. These are summarised below. ## 1. REDUCED AMBIGUITY By enforcing scalar-only semantics when the result (`vector.extract`) or source (`vector.insert`) are rank-0, we eliminate ambiguity in interpretation. Prior to this patch, both `f32` and `vector<f32>` were accepted. ## 2. MATCH IMPLEMENTATION TO DOCUMENTATION The current behaviour contradicts the documented intent. For example, `vector.extract` states: > Degenerates to an element type if n-k is zero. This patch enforces that intent in code. ## 3. ENSURE SYMMETRY BETWEEN INSERT AND EXTRACT With the stricter semantics in place, it’s natural and consistent to make `vector.insert` behave symmetrically to `vector.extract`, i.e., degenerate the source type to a scalar when n = 0. NOTES FOR REVIEWERS ------------------- 1. Main change is in "VectorOps.cpp", where stricter type checks are implemented. 2. Test updates in "invalid.mlir" and "ops.mlir" are minor cleanups to remove now-illegal examples. 3. Lowering changes in "VectorToSCF.cpp" are the main trade-off: we now require an additional `vector.extract` when a preceding `vector.transfer_read` generates a rank-0 vector.
RELATED RFC ----------- * https://discourse.llvm.org/t/rfc-should-we-restrict-the-usage-of-0-d-vectors-in-the-vector-dialect	2025-06-26 09:47:06 +01:00
Andrzej Warzyński	a8998fa062	[mlir][vector][nfc] Move vector.splat test (#145699 ) Moves a test for vector.splat so that all invalid tests are grouped together and easy to find.	2025-06-26 09:27:07 +01:00
Matthias Springer	9ccf613b34	[mlir][Transforms][NFC] Store per-pattern IR modifications in separate state (#145319 ) This commit adds extra state to `ConversionPatternRewriterImpl` to store all modified / newly-created operations and moved / newly-created blocks in separate lists on a per-pattern basis. This is in preparation of the One-Shot Dialect Conversion refactoring: the new driver will no longer maintain a list of all IR rewrites, so information about newly-created operations (which is needed to trigger recursive legalization) must be retained in a different data structure. This commit is also expected to improve the performance of the existing driver. The previous implementation iterated over all new IR modifications and then filtered them by type. It also required an additional pointer indirection (through `std::unique_ptr<IRRewrite>`) to retrieve the operation/block pointer.	2025-06-26 08:54:39 +02:00
Alan Li	3f3282cee8	[AMDGPU] Adding AMDGPU dialect wrapper for ROCDL transpose loads. (#145395 ) * 1-to-1 mapping wrapper op. * Direct lowering from AMDGPU wrapper to ROCDL intrinsics.	2025-06-25 22:58:14 -04:00
Chao Chen	36fbc6a8d2	[MLIR][XeGPU] Remove the transpose attribute from Gather/Scatter ops and Cleanup the documents (#145389 )	2025-06-25 19:43:53 -05:00
Charitha Saumya	c539ec0db5	[mlir][vector] Add support for vector extract/insert_strided_slice in vector distribution. (#145421 ) This PR adds initial support for `vector.extract_strided_slice` and `vector.insert_strided_slice` ops in vector distribution.	2025-06-25 16:41:28 -07:00
Mehdi Amini	ff0dcc4614	[MLIR][Linalg] Harden parsing Linalg named ops (#145337 ) This threads through proper error handling / reporting capabilities to avoid hitting llvm_unreachable while parsing linalg ops. Fixes #132755 Fixes #132740 Fixes #129185	2025-06-26 00:22:08 +02:00
Mircea Trofin	62f8281e08	[IR][PGO] Verify invalid `MD_prof` metadata on instructions (#145576 ) This PR places the validation of `MD_prof` instruction metadata in the Verifier.	2025-06-25 13:10:43 -07:00
Abid Qadeer	a75279e4a5	Revert "[mlir][OpenMP] Use correct debug location with link clause." (#145768 ) Reverts llvm/llvm-project#145026 Caused a CI failure on https://lab.llvm.org/buildbot/#/builders/169/builds/12504.	2025-06-25 20:06:36 +01:00
MaheshRavishankar	c873e5f87d	[mlir][TilingInterface] Handle multi operand consumer fusion. (#145193 ) For consumer fusion cases of this form ``` %0:2 = scf.forall .. shared_outs(%arg0 = ..., %arg1 = ...) { tensor.parallel_insert_slice ... into %arg0 tensor.parallel_insert_slice ... into %arg1 } %1 = linalg.generic ... ins(%0#0, %0#1) ``` the current consumer fusion that handles one slice at a time cannot fuse the consumer into the loop, since fusing along one slice will create an SSA violation on the other use from the `scf.forall`. The solution is to let consumer fusion consider multiple slices at once. This PR changes the `TilingInterface` methods related to consumer fusion, i.e. - `getTiledImplementationFromOperandTile` - `getIterationDomainFromOperandTile` to allow fusion while considering multiple operands. It is up to the `TilingInterface` implementation to return an error if a list of tiles of the operands cannot result in a consistent implementation of the tiled operation. The Linalg operation implementation of `TilingInterface` has been modified to account for these changes and to handle cases where the operand tiles can result in a consistent tiling implementation. --------- Signed-off-by: MaheshRavishankar <mahesh.ravishankar@gmail.com>	2025-06-25 11:54:38 -07:00
Kazu Hirata	28f6f87061	[mlir] Migrate away from std::nullopt (NFC) (#145523 ) ArrayRef has a constructor that accepts std::nullopt. This constructor dates back to the days when we still had llvm::Optional. Since the use of std::nullopt outside the context of std::optional is kind of abuse and not intuitive to newcomers, I would like to move away from the constructor and eventually remove it. This patch migrates away from std::nullopt in favor of ArrayRef<T>() where we use perfect forwarding. Note that {} would be ambiguous for perfect forwarding to work.	2025-06-25 11:49:22 -07:00
Andrzej Warzyński	5f8e7ed5a3	[mlir][linalg][nfc] Split hoisting tests into dedicated test functions (#145234 ) Refactors the `@hoist_vector_transfer_pairs` test function in `hoisting.mlir` into smaller, focused test functions - each covering a specific `vector.transfer_read`/`vector.transfer_write` pair. This makes it easier to identify which edge cases are tested, spot duplication, and write more targeted and readable check lines, with less surrounding noise. This refactor also helped identify some issues with the original `@hoist_vector_transfer_pairs` test: * Input variables `%val` and `%cmp` were unused. * There were no check lines for reads from `memref5`. Note for reviewers (current and future): This PR is split into small, incremental, and self-contained commits. It should be easier to follow the changes by reviewing those commits individually, rather than reading the full squashed diff. However, this will be merged as a single commit to avoid adding unnecessary history noise in-tree.	2025-06-25 19:32:38 +01:00
Ivan Tadeu Ferreira Antunes Filho	626be98c35	[mlir] Remove dependency on LinalgTransformOps.h from #144657 (#145749 ) https://github.com/llvm/llvm-project/pull/144657 added #include "mlir/Dialect/Linalg/TransformOps/LinalgTransformOps.h" to mlir/test/lib/Dialect/Linalg/TestLinalgTransforms.cpp, though it is not in use	2025-06-25 14:21:26 -04:00
Spenser Bauman	532c15a718	[mlir][linalg] Fix module dependency issue due to unused import (#145727 ) This include introduces a dependency for LinalgTransforms on LinalgTransformOps, which is unspecified in the module dependencies, and would produce a cyclic dependency if it were specified. The include is unused in WinogradConv2D.cpp, so this change removes it.	2025-06-25 12:54:49 -04:00
Andrzej Warzyński	c92b580d63	[mlir][ArmSME] Remove `ConvertIllegalShapeCastOpsToTransposes` (#139706 ) As a follow-up to PR #135841 (see discussion for background), this patch removes the `ConvertIllegalShapeCastOpsToTransposes` pattern from the SME legalization pass. This change unblocks folding for ShapeCastOp involving scalable vectors. Originally, the `ConvertIllegalShapeCastOpsToTransposes` pattern was introduced to rewrite certain `vector.shape_cast` ops that could not be lowered otherwise. Based on local end-to-end testing, this workaround is no longer required, and the pattern can now be safely removed. This patch also removes a special case from `ShapeCastOp::fold`, simplifying the fold logic. As a side effect of removing `ConvertIllegalShapeCastOpsToTransposes`, we lose the mechanism that enabled lowering of certain ops like: ```mlir %res = vector.transfer_read %mem[%a, %b] (...) : memref<?x?xf32>, vector<[4]x1xf32> ``` Previously, such cases were handled by: * Rewriting a nearby `vector.shape_cast` to a `vector.transpose` (via `ConvertIllegalShapeCastOpsToTransposes`) * Then lowering the result with `LiftIllegalVectorTransposeToMemory`. This patch introduces a new dedicated pattern, `LowerColumnTransferReadToLoops`, that directly handles illegal `vector.transfer_read` ops involving leading scalable dimensions.	2025-06-25 16:40:14 +01:00
Rolf Morel	c08502defe	[MLIR][Transform] expose transform.debug extension in Python (#145550 ) Removes the Debug... prefix on the ops in tablegen, in line with pretty much all other Transform-dialect extension ops. This means that the ops in Python look like `debug.EmitParamAsRemarkOp`/`debug.emit_param_as_remark` instead of `debug.DebugEmitParamAsRemarkOp`/`debug.debug_emit_param_as_remark`.	2025-06-25 16:39:01 +01:00
Momchil Velikov	ea1e181571	[MLIR][AArch64] Lower `vector.contract` with mixed signed/unsigned arguments to Neon FEAT_I8MM (#144698 )	2025-06-25 16:10:02 +01:00
Jack Frankland	022e1e99f3	[mlir][memref]: Fix Bug in GlobalOp Verifier (#144900 ) When comparing the type of the initializer in a `memref::GlobalOp` against its result only consider the element type and the shape. Other attributes such as memory space should be ignored since comparing these between tensors and memrefs doesn't make sense and constructing a memref in a specific memory space with a tensor that has no such attribute should be valid. Signed-off-by: Jack Frankland <jack.frankland@arm.com>	2025-06-25 15:14:55 +01:00
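
The permutation rule described in 96c1611163 can be sketched as a small standalone function. This is only an illustration of the rule as worded in the commit message, not the code from the patch: the name `outer_unit_dims_perm` and its parameters are hypothetical, the sketch ignores `outer_dims_perm`, and it assumes the remaining dimensions are placed in increasing order.

```python
def outer_unit_dims_perm(src_rank, inner_dims_pos):
    """Build a transpose permutation following the rule in the commit message.

    Hypothetical helper, not an MLIR API. Dimensions that are not named in
    inner_dims_pos come first (assumed: in increasing order), and
    inner_dims_pos itself is appended at the end.
    """
    if len(set(inner_dims_pos)) != len(inner_dims_pos):
        raise ValueError("inner_dims_pos must not repeat a dimension")
    if any(d < 0 or d >= src_rank for d in inner_dims_pos):
        raise ValueError("inner_dims_pos entry out of range")
    tiled = set(inner_dims_pos)
    remaining = [d for d in range(src_rank) if d not in tiled]
    return remaining + list(inner_dims_pos)


# The example from the commit: source tensor<1x1x4xf32>, inner_dims_pos = [2, 0].
perm = outer_unit_dims_perm(3, [2, 0])
print(perm)  # [1, 2, 0]
```

Because every source dimension appears exactly once, either among the remaining dimensions or in `inner_dims_pos`, the result is always a valid permutation, which is the property the invalid `[0, 2, 0]` lacked.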
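
The builder design described in 878d3594ed (a padding argument that must always be written, but may be "empty") can be modelled with a toy example. Everything below is a hypothetical stand-in, not the MLIR API: `build_transfer_read`, `TransferRead`, and the `POISON` marker are invented for illustration, with `None` playing the role of `std::nullopt`.

```python
from dataclasses import dataclass
from typing import Optional

# Stand-in marker for "the caller does not care about the padding value".
POISON = "ub.poison"


@dataclass(frozen=True)
class TransferRead:
    """Toy model of a transfer read; values are plain strings here."""
    source: str
    padding: str


def build_transfer_read(source: str, padding: Optional[str]) -> TransferRead:
    # `padding` deliberately has no default value: every call site must spell
    # out either a concrete padding value or None (meaning poison).
    return TransferRead(source, POISON if padding is None else padding)


explicit = build_transfer_read("%mem", "%c0")   # caller chose a padding value
dont_care = build_transfer_read("%mem", None)   # caller opted into poison
```

Omitting the argument entirely, as in `build_transfer_read("%mem")`, raises a `TypeError`, so a rewrite that forgets to propagate the padding fails loudly instead of silently falling back to `0`.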


23314 Commits