Delete the backslash. It was there to compile the tablegen file; it looks like a space also works fine.
Reviewed By: springerm
Differential Revision: https://reviews.llvm.org/D155474
This revision fixes `hasTensorSemantics` and `hasBufferSemantics` for vector transfer ops, which may have a vector operand: `VectorType` implements `ShapedType`, but such operands do not affect whether an op has tensor or buffer semantics. Also implement `DestinationStyleOpInterface` on `TransferReadOp` so that `hasTensorSemantics`/`hasBufferSemantics` can be called on it. (The op has no inits, but this makes it symmetric to `TransferWriteOp`.)
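For illustration, a minimal sketch (shapes and value names invented): the first read below has buffer semantics and the second has tensor semantics; the `vector<4xf32>` result and the padding value play no role in that classification.

%0 = vector.transfer_read %buffer[%c0], %pad : memref<8xf32>, vector<4xf32>
%1 = vector.transfer_read %tensor[%c0], %pad : tensor<8xf32>, vector<4xf32>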
Differential Revision: https://reviews.llvm.org/D155469
This work introduces `cp.async.bulk.tensor.shared.cluster.global` to the NVVM dialect; the op executes a load using TMA (the Tensor Memory Accelerator).
Depends on D155056
Reviewed By: nicolasvasilache
Differential Revision: https://reviews.llvm.org/D155060
Add a simple transform operation to the NVGPU extension that performs
software pipelining of copies to shared memory. The functionality is
extremely minimalistic in this version and only supports copies from
global to shared memory inside an `scf.for` loop with either
`vector.transfer` or `nvgpu.device_async_copy` operations when
pipelining preconditions are already satisfied in the IR. This is the
minimally useful version that uses the more general loop pipeliner in an
NVGPU-specific way. Further extensions and orthogonalizations will be
necessary.
This required a change to the loop pipeliner itself to properly
propagate errors should the predicate generator fail.
This is loosely inspired by the version in IREE, but makes fewer unsafe assumptions and communicates decisions in a more principled way.
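For illustration, a rough sketch of the kind of loop the transform targets (names, shapes, and the placeholder compute op are invented):

scf.for %i = %c0 to %c128 step %c16 {
  // Asynchronous global -> shared copy; the transform stages such copies
  // across pipeline iterations.
  %token = nvgpu.device_async_copy %global[%i], %shared[%i], 16 : memref<128xf32> to memref<128xf32, 3>
  %group = nvgpu.device_async_create_group %token
  nvgpu.device_async_wait %group
  "test.compute"(%shared) : (memref<128xf32, 3>) -> ()
}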
Reviewed By: nicolasvasilache
Differential Revision: https://reviews.llvm.org/D155223
* Move passes to `Transforms` directory.
* Add `Utils.h` (will be utilized in a subsequent change).
Differential Revision: https://reviews.llvm.org/D155427
reshape(reshape(x)) -> reshape(x) can be written directly as a fold instead of a canonicalization, which helps other passes clean up while they work.
This initially broke ReshapeConverterExpand/Collapse, which relied on creating foldable reshapes and a carefully crafted benefit priority among patterns.
I turned this into a single pattern on reshapes, which does expand and/or collapse as needed in one go.
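For example, a sketch with invented shapes (using `tosa.reshape`, per the converter names above):

%a = tosa.reshape %x {new_shape = array<i64: 6, 4>} : (tensor<2x3x4xf32>) -> tensor<6x4xf32>
%b = tosa.reshape %a {new_shape = array<i64: 24>} : (tensor<6x4xf32>) -> tensor<24xf32>

now folds directly to:

%b = tosa.reshape %x {new_shape = array<i64: 24>} : (tensor<2x3x4xf32>) -> tensor<24xf32>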
Differential Revision: https://reviews.llvm.org/D155266
* Rename functions with underscores to camel case.
* Return the "in_bounds" values as C++ bools instead of an `ArrayAttr`.
Differential Revision: https://reviews.llvm.org/D155277
Using MLIR attributes instead of metadata has many advantages:
* No indirection: Attributes can simply refer to each other seamlessly, without the indirection of `SymbolRefAttr`. This also gives us correctness by construction in a lot of places
* Multithreading safe: The Attribute infrastructure gives us thread-safety for free, whereas creating operations and inserting them into a block is not thread-safe. This matters e.g. for the inliner in MLIR, which runs in parallel
* Easier to create: There is no need for a builder or a metadata region
This patch therefore does exactly that. It leverages the new distinct attributes to create distinct access groups in a deterministic and thread-safe manner.
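For illustration, a minimal sketch of the attribute-based encoding (op context invented; `distinct[...]` is the distinct-attribute syntax):

#group = #llvm.access_group<id = distinct[0]<>>
%0 = llvm.load %ptr {access_groups = [#group]} : !llvm.ptr -> i32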
Differential Revision: https://reviews.llvm.org/D155285
Using MLIR attributes instead of metadata has many advantages:
* No indirection: Attributes can simply refer to each other seamlessly, without the indirection of `SymbolRefAttr`. This also gives us correctness by construction in a lot of places
* Multithreading safe: The Attribute infrastructure gives us thread-safety for free, whereas creating operations and inserting them into a block is not thread-safe. This matters e.g. for the inliner in MLIR, which runs in parallel
* Easier to create: There is no need for a builder or a metadata region
This patch therefore does exactly that. It leverages the new distinct attributes to create distinct alias domains and scopes in a deterministic and thread-safe manner.
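A minimal sketch of the resulting encoding (op context invented):

#domain = #llvm.alias_scope_domain<id = distinct[0]<>>
#scope = #llvm.alias_scope<id = distinct[1]<>, domain = #domain>
%0 = llvm.load %ptr {alias_scopes = [#scope]} : !llvm.ptr -> i32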
Differential Revision: https://reviews.llvm.org/D155159
It also unifies the computation of `StridedLayoutAttr`: if the stride is a statically known value, we can just use it.
Differential Revision: https://reviews.llvm.org/D155017
When targeting NVIDIA GPUs, seeing the generated PTX is important, but we currently have no simple way to do it.
This work adds a `dump-ptx` option to the gpu-to-cubin pass. One can use it like `gpu-to-cubin{chip=sm_90 features=+ptx80 dump-ptx}`.
Reviewed By: nicolasvasilache
Differential Revision: https://reviews.llvm.org/D155166
This patch implements an early outlining transform of omp.target operations in
flang. The pass is needed because optimizations may cross target op region
boundaries, but with the outlining the resulting functions only contain a
single omp.target op plus a func.return, so there should not be any opportunity
to optimize across region boundaries.
The patch also adds an interface to store and retrieve the parent function name of the original target operation, which is needed to create correct kernel function names when lowering to LLVM IR.
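For illustration, a rough sketch of an outlined function (the naming scheme is invented; only the target op and the return remain):

func.func @foo_omp_target_region() {
  omp.target {
    omp.terminator
  }
  return
}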
Reviewed By: kiranchandramohan, domada
Differential Revision: https://reviews.llvm.org/D154879
`SubElementInterface::replaceImmediateSubElementsImpl` specializes tuples so that arguments are forwarded to the type getter. However, pairs are currently not supported, even though an example in the documentation uses a pair as a key type. This patch adds support for pairs as well.
Reviewed By: Mogball
Differential Revision: https://reviews.llvm.org/D155043
Remove Tosa_Tensor1Dto4D and Tosa_TensorUpto4D in the Tosa dialect and add level checks to the TosaValidation pass to validate per the spec.
Signed-off-by: Tai Ly <tai.ly@arm.com>
Change-Id: Icd32137e9f8051f99994cee9f388f20c1a840f4b
Reviewed By: eric-k256
Differential Revision: https://reviews.llvm.org/D154273
Generalize `extractFromI64ArrayAttr` to `extractFromIntegerArrayAttr`, so that arbitrary integer/bool types can be extracted.
Differential Revision: https://reviews.llvm.org/D154974
Support extra concrete class declarations and definitions under NativeTrait that get injected into the class that specifies the trait. Extra declarations and definitions can be passed in as template arguments to NativeOpTrait, NativeAttrTrait, and NativeTypeTrait.
Usage examples of this feature include:
- Creating a wrapper trait for authoring inferReturnTypes with the OpAdaptor, by specifying the necessary op-specific declarations and definitions directly in the trait
- Refactoring the InferTensorType trait
Reviewed By: jpienaar
Differential Revision: https://reviews.llvm.org/D154731
The `static_(num_threads|tile_sizes)` attributes of this op are
`DefaultValuedOptionalAttr`s, so the op can be constructed *without* such
an attribute. In other words, the following is a valid op (note the
absence of the `static_num_threads` attribute):
"builtin.module"() ({
"transform.sequence"() <{failure_propagation_mode = 1 : i32, operand_segment_sizes = array<i32: 0, 0>}> ({
^bb0(%arg0: !pdl.operation, %arg1: !transform.op<"linalg.matmul">, %arg2: !transform.op<"linalg.elemwise_binary">):
%0 = "transform.structured.match"(%arg0) <{ops = ["test.dummy"]}> : (!pdl.operation) -> !pdl.operation
%1:2 = "transform.structured.tile_to_forall_op"(%arg1, %0) <{operand_segment_sizes = array<i32: 1, 0, 0, 0, 1>}> : (!transform.op<"linalg.matmul">, !pdl.operation) -> (!transform.op<"scf.forall">, !transform.op<"linalg.matmul">)
"transform.yield"() : () -> ()
}) : () -> ()
}) : () -> ()
However, the custom printing directive converted those to an `ArrayRef`,
which crashes if done on an empty `ArrayAttr`. This patch changes the
signature such that no automatic conversion takes place and extends the
test to check for the existence of the attribute.
Reviewed By: nicolasvasilache
Differential Revision: https://reviews.llvm.org/D155062
getPtx used to return `const char*`, which is not flexible when one needs to build the string inside the function. This work changes the return type.
Reviewed By: springerm
Differential Revision: https://reviews.llvm.org/D155056
This adds operations for binary multiplicative arithmetic operators to
EmitC. The input and output arguments for the remainder operator are
restricted to index (emitted as `size_t`), integers, and the EmitC opaque
types (as the operator can be overloaded for a custom type). The
multiplication and division operators additionally support floating-point
numbers.
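For illustration, a minimal sketch (assuming the ops are named `emitc.mul`, `emitc.div`, and `emitc.rem`; operand values invented):

%p = emitc.mul %a, %b : (i32, i32) -> i32  // emitted as a * b
%q = emitc.div %c, %d : (f32, f32) -> f32  // emitted as c / d
%r = emitc.rem %a, %b : (i32, i32) -> i32  // emitted as a % b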
Reviewed By: jpienaar
Differential Revision: https://reviews.llvm.org/D154846
This patch is a follow-up to the previous patch https://reviews.llvm.org/D151519.
With this patch, a vector.load op with a narrow bitwidth element type (e.g., i4) can be converted to a
supported wider bitwidth (e.g., i8).
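For illustration, a rough before/after sketch (shapes invented; the real pattern also rewrites the memref and linearizes indices):

%v = vector.load %src[%c0] : memref<32xi4>, vector<8xi4>

becomes, roughly,

%w = vector.load %src_i8[%c0] : memref<16xi8>, vector<4xi8>
%v = vector.bitcast %w : vector<4xi8> to vector<8xi4>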
Reviewed By: hanchung
Differential Revision: https://reviews.llvm.org/D154178
To complement the bf16 expansion and truncation patterns added to
ExpandOps, define a pass that replaces, for any arithmetic operation
op,
%y = arith.op %v0, %v1, ... : T
with
%e0 = arith.extf %v0 : T to U
%e1 = arith.extf %v1 : T to U
...
%y.ext = arith.op %e0, %e1, ... : U
%y = arith.truncf %y.ext : U to T
This allows for "emulating" floating-point operations not supported on
a given target (such as bfloat operations or most arithmetic on 8-bit
floats) by extending those types to supported ones, performing the
arithmetic operation, and then truncating back to the original
type (which ensures appropriate rounding behavior).
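As a concrete (illustrative) instance, with T = bf16 and U = f32,

%y = arith.addf %a, %b : bf16

becomes

%a.ext = arith.extf %a : bf16 to f32
%b.ext = arith.extf %b : bf16 to f32
%y.ext = arith.addf %a.ext, %b.ext : f32
%y = arith.truncf %y.ext : f32 to bf16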
The lowering of the extf and truncf ops introduced by this
transformation should be handled by subsequent passes.
Reviewed By: rsuderman
Differential Revision: https://reviews.llvm.org/D154539
This is the counterpart to the forward dense dataflow analysis and
integrates into the dataflow framework. The implementation follows the
structure of existing dataflow analyses.
Reviewed By: Mogball, phisiart
Differential Revision: https://reviews.llvm.org/D154713
`mbarrier` is a barrier created in shared memory that supports flavors of thread synchronization beyond `__syncthreads`; for more information see:
https://docs.nvidia.com/cuda/parallel-thread-execution/#parallel-synchronization-and-communication-instructions-mbarrier
This work adds initial `mbarrier` ops to the nvgpu dialect.
First, it introduces two types:
* `mbarrier.barrier`, a barrier object in shared memory
* `mbarrier.barrier.token`, a token produced by arrive-on operations
It then introduces the following ops:
* `mbarrier.create` creates an `mbarrier.barrier`
* `mbarrier.init` initializes an `mbarrier.barrier`
* `mbarrier.arrive` performs arrive-on on an `mbarrier.barrier` and returns an `mbarrier.barrier.token`
* `mbarrier.arrive.nocomplete` performs a non-blocking arrive-on and returns an `mbarrier.barrier.token`
* `mbarrier.test_wait` waits on an `mbarrier.barrier` given an `mbarrier.barrier.token`
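For illustration, a rough usage sketch (operand names invented; the exact assembly may differ):

%barrier = nvgpu.mbarrier.create -> !nvgpu.mbarrier.barrier<memorySpace = #gpu.address_space<workgroup>>
nvgpu.mbarrier.init %barrier, %num_threads : !nvgpu.mbarrier.barrier<memorySpace = #gpu.address_space<workgroup>>
%token = nvgpu.mbarrier.arrive %barrier : !nvgpu.mbarrier.barrier<memorySpace = #gpu.address_space<workgroup>> -> !nvgpu.mbarrier.token
%done = nvgpu.mbarrier.test_wait %barrier, %token : !nvgpu.mbarrier.barrier<memorySpace = #gpu.address_space<workgroup>>, !nvgpu.mbarrier.token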
Reviewed By: nicolasvasilache
Differential Revision: https://reviews.llvm.org/D154090
Add a new option that allows users to specify a memcpy op: "memref.tensor_store", "memref.copy" or "linalg.copy".
Differential Revision: https://reviews.llvm.org/D154968
This unit attribute indicates to the bufferization that the resulting buffer will not be written to by another op.
Differential Revision: https://reviews.llvm.org/D154967
Return all ops that were generated as part of the bufferization, so that users do not have to match them in the enclosing op.
Differential Revision: https://reviews.llvm.org/D154966
Define `llvm.intr.var.annotation`, `llvm.intr.ptr.annotation` and
`llvm.intr.annotation` in the LLVM dialect as counterparts of the
`llvm.var.annotation`, `llvm.ptr.annotation` and `llvm.annotation` intrinsics.
Signed-off-by: Victor Perez <victor.perez@codeplay.com>
Differential Revision: https://reviews.llvm.org/D154842