clang-p2996

Author	SHA1	Message	Date
Aart Bik	f388a3a446	[mlir][sparse] update doc and examples of the [dis]assemble operations (#88213 ) The doc and examples of the [dis]assemble operations did not reflect all the recent changes on order of the operands. Also clarified some of the text.	2024-04-10 09:42:12 -07:00
Jeff Niu	f2ade91a9f	[mlir] Optimize getting properties on concrete ops (#88259 ) This makes retrieving properties on concrete operations faster by removing a branch when it is known that the operation must have properties.	2024-04-10 14:11:45 +02:00
Raghu Maddhipatla	eec41d2f8d	Revert "[Flang] [OpenMP] [Semantics] [MLIR] [Lowering] Add lowering support for IS_DEVICE_PTR and HAS_DEVICE_ADDR clauses on OMP TARGET directive." (#88198 ) Reverts llvm/llvm-project#74187	2024-04-09 16:18:56 -05:00
srcarroll	b79db39659	[mlir][linalg] Support `ParamType` in `vector_sizes` option of `VectorizeOp` transform (#87557 )	2024-04-09 15:52:40 -05:00
Raghu Maddhipatla	9d9560facb	[Flang] [OpenMP] [Semantics] [MLIR] [Lowering] Add lowering support for IS_DEVICE_PTR and HAS_DEVICE_ADDR clauses on OMP TARGET directive. (#74187 ) Added lowering support for IS_DEVICE_PTR and HAS_DEVICE_ADDR clauses for OMP TARGET directive and added related tests for these changes. IS_DEVICE_PTR and HAS_DEVICE_ADDR clauses apply to OMP TARGET directive OpenMP spec states `The is_device_ptr clause indicates that its list items are device pointers.` `The has_device_addr clause indicates that its list items already have device addresses and therefore they may be directly accessed from a target device.` Whereas USE_DEVICE_PTR and USE_DEVICE_ADDR clauses apply to OMP TARGET DATA directive and OpenMP spec for them states `Each list item in the use_device_ptr clause results in a new list item that is a device pointer that refers to a device address` `Each list item in a use_device_addr clause that is present in the device data environment is treated as if it is implicitly mapped by a map clause on the construct with a map-type of alloc`	2024-04-09 14:59:20 -05:00
xiaoleis-nv	8d6469b0e0	[mlir][vector] Add lower-vector-multi-reduction pass (#87333 ) This MR adds the `lower-vector-multi-reduction` pass to lower the vector.multi_reduction operation. While the Transform Dialect includes an operation, `transform.apply_patterns.vector.lower_multi_reduction`, intended for a similar purpose, its utility is limited to projects that have adopted the Transform Dialect. Recognizing that not all projects are equipped to integrate this dialect, the proposed pass serves as a vital standalone alternative. It ensures that projects solely dependent on the traditional pass infrastructure can also benefit from the optimized lowering of `multi_reduction` operation. --------- Co-authored-by: Xiaolei Shi <xiaoleis@nvidia.com>	2024-04-09 10:04:25 -07:00
Billy Zhu	6f6336858e	[MLIR][LLVM] Add DebugNameTableKind to DICompileUnit (#87974 ) Add the DebugNameTableKind field to DICompileUnit, along with its importer & exporter.	2024-04-09 06:18:07 -07:00
Sergio Afonso	6528f10366	[MLIR][OpenMP] Group clause operands into structures and use them to define simplified op builders (#86797 ) This patch introduces a set of composable structures grouping the MLIR operands associated to each OpenMP clause. This makes it easier to keep the MLIR representation for the same clause consistent throughout all operations that accept it. The relevant clause operand structures are grouped into per-operation structures using a mixin pattern and used to define new operation constructors. These constructors can be used to avoid having to get the order of a possibly large list of operands right. Missing clauses are documented as TODOs, as well as operands which are part of the relevant operation's operand structure but cannot be attached to the associated operation yet, due to missing op arguments to its MLIR definition. A follow-up patch will update Flang lowering to make use of these structures, simplifying the passing of information from clause processing to operation-generating functions and also simplifying the creation of operations through the use of the new operation constructors.	2024-04-09 13:40:18 +01:00
Uday Bondhugula	0e5a53cc01	[MLIR] Fix typo bug in AffineExprVisitor for WalkResult return case (#86138 ) Fix typo bug in AffineExprVisitor for the WalkResult return case. This didn't show up immmediately because most walks in the tree didn't use walk result.	2024-04-09 08:37:57 +05:30
Andrei Golubev	be006372f3	[mlir][OpPrintingFlags] Allow to disable ElementsAttr hex printing (#85766 ) At present, large ElementsAttr is unconditionally printed with a hex string. This means that in IR large constant values often look like: dense<"0x000000000004000000080000000004000000080000000..."> : tensor<10x10xi32> Hoisting hex printing control to the user level for tooling means that one can disable the feature and get human-readable values when necessary: dense<[16, 32, 48, 500...]> : tensor<10x10xi32> Note: AsmPrinterOptions::printElementsAttrWithHexIfLarger is not always possible to be used as it requires that one exposes MLIR's command-line options in user tooling (including an actual compiler). Co-authored-by: Harald Rotuna <harald.razvan.rotuna@intel.com>	2024-04-09 02:08:32 +02:00
Prashant Kumar	9ffecef1c6	[mlir][vector][NFC] Fix typo temp -> tmp. (#87878 )	2024-04-08 08:34:36 +02:00
Fabian Mora	a2c4b7c8e2	[mlir] Add `convertInstruction` and `getSupportedInstructions` to `LLVMImportInterface` (#86799 ) This patch adds the `convertInstruction` and `getSupportedInstructions` to `LLVMImportInterface`, allowing any non-LLVM dialect to specify how to import LLVM IR instructions and overriding the default import of LLVM instructions.	2024-04-07 08:46:21 +02:00
Matthias Springer	76435f2dca	[mlir][SCF] `ValueBoundsConstraintSet`: Support `scf.if` (branches) (#87860 ) This commit adds support for `scf.if` to `ValueBoundsConstraintSet`. Example: ``` %0 = scf.if ... -> index { scf.yield %a : index } else { scf.yield %b : index } ``` The following constraints hold for %0: * %0 >= min(%a, %b) * %0 <= max(%a, %b) Such constraints cannot be added to the constraint set; min/max is not supported by `IntegerRelation`. However, if we know which one of %a and %b is larger, we can add constraints for %0. E.g., if %a <= %b: * %0 >= %a * %0 <= %b This commit required a few minor changes to the `ValueBoundsConstraintSet` infrastructure, so that values can be compared while we are still in the process of traversing the IR/adding constraints. Note: This is a re-upload of #85895, which was reverted. The bug that caused the failure was fixed in #87859.	2024-04-06 13:04:49 +09:00
Jeff Niu	0f52f4ddd9	[mlir][ods] Emit "trivial" ODS getter/setters inline (#87741 ) Emitting trivial getters that amount to `(*this)->getOperand(1)` out-of-line or `getProperties().foo` is a pretty significant performance hit on these basic MLIR APIs for manipulating ops (3-4x). Emit them inline (without adding additional dependencies to header files).	2024-04-06 04:01:37 +02:00
Christian Ulmann	541962306d	[MLIR][LLVM] Remove bitcast pattern from type consistency pass (#87755 ) This commit removes the no longer required bitcast inserting pattern in LLVM dialect's type consistency pattern. This was previously required to enable Mem2Reg and SROA to promote accesses that had different types. Recent changes to both passes added direct support for this feature to them, so the pattern has no further use.	2024-04-05 15:47:16 +02:00
Mehdi Amini	8487e05967	Revert "[mlir][SCF] `ValueBoundsConstraintSet`: Support `scf.if` (branches) (#85895 )" This reverts commit `6b30ffef28`. gcc7 bot is broken	2024-04-05 03:00:35 -07:00
Benjamin Maxwell	0b7362c257	[mlir][arith] Add result pretty printing for constant vscale values (#83565 ) In scalable code it is very common to have constant multiples of vscale, e.g. `4 * vscale`. This updates `arith.muli` to pretty print the result name in cases like this, so `4 * vscale` would be `%c4_vscale`. This makes reading IR dumps of scalable code a little nicer.	2024-04-05 10:48:16 +01:00
Matthias Springer	6b30ffef28	[mlir][SCF] `ValueBoundsConstraintSet`: Support `scf.if` (branches) (#85895 ) This commit adds support for `scf.if` to `ValueBoundsConstraintSet`. Example: ``` %0 = scf.if ... -> index { scf.yield %a : index } else { scf.yield %b : index } ``` The following constraints hold for %0: * %0 >= min(%a, %b) * %0 <= max(%a, %b) Such constraints cannot be added to the constraint set; min/max is not supported by `IntegerRelation`. However, if we know which one of %a and %b is larger, we can add constraints for %0. E.g., if %a <= %b: * %0 >= %a * %0 <= %b This commit required a few minor changes to the `ValueBoundsConstraintSet` infrastructure, so that values can be compared while we are still in the process of traversing the IR/adding constraints.	2024-04-05 13:14:00 +09:00
MaheshRavishankar	5aeb604c7c	[mlir][SCF] Modernize `coalesceLoops` method to handle `scf.for` loops with iter_args (#87019 ) As part of this extension this change also does some general cleanup 1) Make all the methods take `RewriterBase` as arguments instead of creating their own builders that tend to crash when used within pattern rewrites 2) Split `coalesePerfectlyNestedLoops` into two separate methods, one for `scf.for` and other for `affine.for`. The templatization didnt seem to be buying much there. Also general clean up of tests.	2024-04-04 13:44:24 -07:00
Fabian Mora	220cdf940e	[mlir] Add `requiresReplacedValues` and `visitReplacedValues` to `PromotableOpInterface` (#86792 ) Add `requiresReplacedValues` and `visitReplacedValues` methods to `PromotableOpInterface`. These methods allow `PromotableOpInterface` ops to transforms definitions mutated by a `store`. This change is necessary to correctly handle the promotion of `LLVM_DbgDeclareOp`. --------- Co-authored-by: Théo Degioanni <30992420+Moxinilian@users.noreply.github.com>	2024-04-04 13:34:46 -04:00
Tom Eccles	cc34ad91f0	[MLIR][OpenMP] Add cleanup region to omp.declare_reduction (#87377 ) Currently, by-ref reductions will allocate the per-thread reduction variable in the initialization region. Adding a cleanup region allows that allocation to be undone. This will allow flang to support reduction of arrays stored on the heap. This conflation of allocation and initialization in the initialization should be fixed in the future to better match the OpenMP standard, but that is beyond the scope of this patch.	2024-04-04 11:19:42 +01:00
Matthias Springer	5e4a44380e	[mlir][Interfaces][NFC] `ValueBoundsConstraintSet`: Pass stop condition in the constructor (#86099 ) This commit changes the API of `ValueBoundsConstraintSet`: the stop condition is now passed to the constructor instead of `processWorklist`. That makes it easier to add items to the worklist multiple times and process them in a consistent manner. The current `ValueBoundsConstraintSet` is passed as a reference to the stop function, so that the stop function can be defined before the the `ValueBoundsConstraintSet` is constructed. This change is in preparation of adding support for branches.	2024-04-04 17:05:47 +09:00
Hsiangkai Wang	362aa434cc	[mlir] Enhance TimingManager Printing Flexibility (#85821 ) Revise the printing functionality of TimingManager to accommodate various output formats. At present, TimingManager is limited to outputting data solely in plain text format. To overcome this limitation, I have introduced an abstract class that serves as the foundation for printing. This approach allows users to implement additional output formats by extending this abstract class. As part of this update, I have integrated support for JSON as a new output format, enhancing the ease of parsing for subsequent processing scripts.	2024-04-03 16:58:01 +01:00
Adrian Kuegel	269d0aaec1	[mlir] Apply ClangTidy findings. modernize-use-override ClangTidy check. This warning appears on overridden virtual functions not marked with override or final keywords or marked with more than one of virtual, override, final.	2024-04-03 14:14:06 +00:00
Simon Camphausen	95f9b083d0	[mlir][EmitC] Fix examples in op descriptions (#87478 ) - Remove trailing type from value attributes as emitc.opaque attributes are untyped. - Replace invalid trailing * in opaque type by wrapping it into an !emitc.ptr.	2024-04-03 15:22:15 +02:00
Christian Ulmann	a2acf31323	[MLIR] Add endianness accessors to the data layout (#87347 ) This commit extends the data layout subsystem with accessors for the endianness. The implementation follows the structure implemented for alloca, global, and program memory spaces.	2024-04-03 14:54:29 +02:00
Simon Camphausen	1f268092c7	[mlir][EmitC] Add support for pointer and opaque types to subscript op (#86266 ) For pointer types the indices are restricted to one integer-like operand. For opaque types no further restrictions are made.	2024-04-03 13:06:14 +02:00
Chenguang Wang	b714fc7f86	Move format internal code from llvm::detail to llvm::support::detail. (#87288 ) Some support code, e.g. llvm/Support/Endian.h, uses llvm::support::detail, but the format-related code uses llvm::detail. On VS2019, when a C++ file includes both headers, a `detail::` from `namespace llvm { ... }` becomes ambiguous. `44253a9c` breaks TensorFlow and [JAX](https://github.com/google/jax/actions/runs/8507773013/job/23300219405) build because of this. Since llvm::X::detail seems like a cleaner solution and is used in other places as well (e.g. llvm::yaml::detail), we should probably migrate all llvm::detail usages to llvm::X::detail.	2024-04-02 08:35:08 -07:00
Ivan Butygin	5b66b6a32a	[mlir][pass] Add composite pass utility (#87166 ) Composite pass allows to run sequence of passes in the loop until fixed point or maximum number of iterations is reached. The usual candidates are canonicalize+CSE as canonicalize can open more opportunities for CSE and vice-versa.	2024-04-02 13:30:45 +03:00
Haojian Wu	89cfae41ec	[mlir] Add missing #include header for std::is_pointer	2024-04-02 11:41:04 +02:00
Matthias Springer	9067f54705	[mlir][IR][NFC] Make `replaceAllUsesWith` non-templatized (#84722 ) Turn `RewriterBase::replaceAllUsesWith` into a non-templatized implementation, so that it can be made virtual and be overridden in the `ConversionPatternRewriter` in a subsequent change. This change is in preparation of adding dialect conversion support for `replaceAllUsesWith`.	2024-04-02 11:03:12 +09:00
Matthias Springer	38113a0832	[mlir][IR] Trigger `notifyOperationReplaced` on `replaceAllOpUsesWith` (#84721 ) Before this change: `notifyOperationReplaced` was triggered when calling `RewriteBase::replaceOp`. After this change: `notifyOperationReplaced` is triggered when `RewriterBase::replaceAllOpUsesWith` or `RewriterBase::replaceOp` is called. Until now, every `notifyOperationReplaced` was always sent together with a `notifyOperationErased`, which made that `notifyOperationErased` callback irrelevant. More importantly, when a user called `RewriterBase::replaceAllOpUsesWith`+`RewriterBase::eraseOp` instead of `RewriterBase::replaceOp`, no `notifyOperationReplaced` callback was sent, even though the two notations are semantically equivalent. As an example, this can be a problem when applying patterns with the transform dialect because the `TrackingListener` will only see the `notifyOperationErased` callback and the payload op is dropped from the mappings. Note: It is still possible to write semantically equivalent code that does not trigger a `notifyOperationReplaced` (e.g., when op results are replaced one-by-one), but this commit already improves the situation a lot.	2024-04-02 10:53:57 +09:00
Ivan Butygin	1079fc4f54	[mlir][pass] Add `errorHandler` param to `Pass::initializeOptions` (#87289 ) There is no good way to report detailed errors from inside `Pass::initializeOptions` function as context may not be available at this point and writing directly to `llvm::errs()` is not composable. See https://github.com/llvm/llvm-project/pull/87166#discussion_r1546426763 * Add error handler callback to `Pass::initializeOptions` * Update `PassOptions::parseFromString` to support custom error stream instead of using `llvm::errs()` directly. * Update default `Pass::initializeOptions` implementation to propagate error string from `parseFromString` to new error handler. * Update `MapMemRefStorageClassPass` to report error details using new API.	2024-04-02 02:43:04 +03:00
Peiming Liu	a54930e696	[mlir][sparse] allow YieldOp to yield multiple values. (#87261 )	2024-04-01 10:30:36 -07:00
Jakub Kuderski	a8cfa7cbdf	[mlir][TD] Allow op printing flags as `transform.print` attrs (#86846 ) Introduce 3 new optional attributes to the `transform.print` ops: * `assume_verified` * `use_local_scope` * `skip_regions` The primary motivation is to allow printing on large inputs that otherwise take forever to print and verify. For the full context, see this IREE issue: https://github.com/openxla/iree/issues/16901. Also add some tests and fix the op description.	2024-04-01 12:32:23 -04:00
Victor Perez	8827ff92b9	[MLIR][Arith] Add rounding mode attribute to `truncf` (#86152 ) Add rounding mode attribute to `arith`. This attribute can be used in different FP `arith` operations to control rounding mode. Rounding modes correspond to IEEE 754-specified rounding modes. Use in `arith.truncf` folding. As this is not supported in dialects other than LLVM, conversion should fail for now in case this attribute is present. --------- Signed-off-by: Victor Perez <victor.perez@codeplay.com>	2024-04-01 11:57:14 +02:00
Prashant Kumar	10a57f3aff	[mlir][math] Expand powfI operation for constant power operand. (#87081 ) -- Convert `math.fpowi` to a series of `arith.mulf` operations. -- If the power is negative, we divide the result by 1.	2024-04-01 13:18:27 +05:30
Philip Lassen	a0c019ae9e	[mlir][linalg] Delete unused SameVariadicOperandSize trait from ops (#87124 ) Both `Transpose` and `Broadcast` specify the `SameVariadicOperandSize` trait. However neither has a variadic operand let alone more than one. This is likely a relic from copying the boilerplate of the `Reduce` definition.	2024-03-31 23:35:55 +02:00
Mehdi Amini	82c6eeed08	[MLIR] Add a second map for registered OperationName in MLIRContext (NFC) (#87170 ) This speeds up registered op creation by 10-11% by allowing lookup by TypeID instead of StringRef. This can break your build/tests at runtime with an error that you're creating an unregistered operation that you have registered. If so you are likely using a class inheriting from the "real" operation. See for example in this patch the case of: class ConstantIndexOp : public arith::ConstantOp { If one is using `builder.create<ConstantIndexOp>()` they actually create an `arith.constant` operation, but the builder will fetch the TypeID for the `ConstantIndexOp` class which does not correspond to any registered operation. To fix it the `ConstantIndexOp` class got this addition: static ::mlir::TypeID resolveTypeID() { return TypeID::get<ConstantOp>(); }	2024-03-31 21:28:05 +02:00
Aart Bik	dc4cfdbb8f	[mlir][sparse] provide an AoS "view" into sparse runtime support lib (#87116 ) Note that even though the sparse runtime support lib always uses SoA storage for COO storage (and provides correct codegen by means of views into this storage), in some rare cases we need the true physical SoA storage as a coordinate buffer. This PR provides that functionality by means of a (costly) coordinate buffer call. Since this is currently only used for testing/debugging by means of the sparse_tensor.print method, this solution is acceptable. If we ever want a performing version of this, we should truly support AoS storage of COO in addition to the SoA used right now.	2024-03-29 15:30:36 -07:00
Jakub Kuderski	d61ec513c4	[mlir][spirv] Add IsInf/IsNan expansion for WebGPU (#86903 ) These non-finite math ops are supported by SPIR-V but not by WGSL. Assume finite floating point values and expand these ops into `false`. Previously, this worked by adding fast math flags during conversion from arith to spirv, but this got removed in https://github.com/llvm/llvm-project/pull/86578. Also do some misc cleanups in the surrounding code.	2024-03-28 14:13:04 -04:00
Andrzej Warzyński	d3aa92ed14	[mlir][vector] Add support for scalable vectors to VectorLinearize (#86786 ) Adds support for scalable vectors to patterns defined in VectorLineralize.cpp. Linearization is disable in 2 notable cases: * vectors with more than 1 scalable dimension (we cannot represent vscale^2), * vectors initialised with arith.constant that's not a vector splat (such arith.constant Ops cannot be flattened).	2024-03-28 14:53:21 +00:00
Rolf Morel	eacda36c7d	[SCF][Transform] Add support for scf.for in LoopFuseSibling op (#81495 ) Adds support for fusing two scf.for loops occurring in the same block. Uses the rudimentary checks already in place for scf.forall (like the target loop's operands being dominated by the source loop). - Fixes a bug in the dominance check whereby it was checked that values in the target loop themselves dominated the source loop rather than the ops that define these operands. - Renames the LoopFuseSibling op to LoopFuseSiblingOp. - Updates LoopFuseSiblingOp's description. - Adds tests for using LoopFuseSiblingOp on scf.for loops, including one which fails without the fix for the dominance check. - Adds tests checking the different failure modes of the dominance checker. - Adds test for case whereby scf.yield is automatically generated when there are no loop-carried variables.	2024-03-28 14:13:08 +01:00
Oleksandr "Alex" Zinenko	91856b34e3	[mlir] move MatchOpInterface under Transform/Interfaces (#86899 ) This is similar to the TransformOpInterface move.	2024-03-28 14:00:22 +01:00
Justin Fargnoli	35d55f2894	[NFC][mlir] Reorder `declarePromisedInterface()` operands (#86628 ) Reorder the template operands of `declarePromisedInterface()` to match `declarePromisedInterfaces()`.	2024-03-27 10:30:17 -07:00
Alex Voicu	ab7dba233a	[CodeGen][LLVM] Make the `va_list` related intrinsics generic. (#85460 ) Currently, the builtins used for implementing `va_list` handling unconditionally take their arguments as unqualified `ptr`s i.e. pointers to AS 0. This does not work for targets where the default AS is not 0 or AS 0 is not a viable AS (for example, a target might choose 0 to represent the constant address space). This patch changes the builtins' signature to take generic `anyptr` args, which corrects this issue. It is noisy due to the number of tests affected. A test for an upstream target which does not use 0 as its default AS (SPIRV for HIP device compilations) is added as well.	2024-03-27 11:41:34 +00:00
Matthias Springer	10b07f2324	[mlir][Interfaces] `ValueBoundsConstraintSet`: Add `dump` helper function (#86634 ) This commit adds a helper function that dumps the constraint set and the mapping of columns to values/dims. For debugging only. Example output: ``` ========== Columns: (column dim value) 0 1 linalg.fill (result 0) 1 1 tensor.extract_slice (result 0) 2 n/a affine.min (result 0) 3 n/a scf.for (bbarg 0) 4 n/a func.func (bbarg 2) Constraint set: Domain: 0, Range: 1, Symbols: 4, Locals: 0 6 constraints (None None None None None const) 1 -1 0 0 0 0 = 0 0 1 -1 0 0 0 = 0 0 0 -1 -1 1 0 >= 0 0 0 -1 0 0 4 >= 0 0 0 0 1 0 0 >= 0 0 0 0 -1 1 -1 >= 0 ========== ```	2024-03-27 10:49:24 +09:00
Matthias Gehre	c6d419c15b	[TOSA] Allow all integer types in most ops (#86509 ) As discussed in one of the previous TOSA community meetings, we would like to allow for more integer types in the TOSA dialect to enable more use cases. For strict standards conformance, the TosaValidation pass can be used. Follow up PRs will extend conversions from TOSA where needed.	2024-03-26 22:27:11 +01:00
Alex Zinenko	cf6e62d8c3	[mlir][doc] NFC fix incorrect filename in td match interface	2024-03-26 18:48:46 +00:00
Ivan Butygin	f050a098b5	[mlir][spirv] Remove `enableFastMathMode` flag from SPIR-V conversion (#86578 ) Most of arith/math ops support fastmath attribute, use it instead of global flag.	2024-03-26 20:06:06 +03:00

1 2 3 4 5 ...

10090 Commits