clang-p2996

Author	SHA1	Message	Date
Sean Silva	be88ba09d5	[NFC] Make assertion more informative. This assert just caught me, and this improved message would have saved me some time.	2020-05-21 13:36:21 -07:00
Thomas Raoux	0712eac766	[mlir][spirv] Enable composite instructions for cooperative matrix type. Enable inset/extract/construct composite ops as well as access chain for cooperative matrix. ConstantComposite requires more change and will be done in a separate patch. Also fix the getNumElements function for coopMatrix per feedback from Jeff Bolz. The number of element is implementation dependent so it cannot be known at compile time. Differential Revision: https://reviews.llvm.org/D80321	2020-05-21 12:19:55 -07:00
Thomas Raoux	15389cdc5b	[mlir][spirv] Add remaining cooperative matrix instructions Adds support for cooperative matrix support for arithmetic and cast instructions. It also adds cooperative matrix store, muladd and matrixlength instructions which are part of the extension. Differential Revision: https://reviews.llvm.org/D80181	2020-05-21 11:55:33 -07:00
jerryyin	9c53ac08de	[mlir][rocdl] Exposing buffer load/store intrinsic Summary: * Updated ROCDLOps tablegen * Added parsing and printing function for new intrinsic * Added unit tests Reviewers: ftynse Subscribers: mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, stephenneuendorffer, Joonsoo, grosul1, frgossen, Kayjukh, jurahul, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80233	2020-05-21 14:14:35 +00:00
Wen-Heng (Jack) Chung	2cbbc266ec	[mlir][gpu] Refactor ConvertGpuLaunchFuncToCudaCalls pass. Due to similar APIs between CUDA and ROCm (HIP), ConvertGpuLaunchFuncToCudaCalls pass could be used on both platforms with some refactoring. In this commit: - Migrate ConvertLaunchFuncToCudaCalls from GPUToCUDA to GPUCommon, and rename. - Rename runtime wrapper APIs be platform-neutral. - Let GPU binary annotation attribute be specifiable as a PassOption. - Naming changes within the implementation and tests. Subsequent patches would introduce ROCm-specific tests and runtime wrapper APIs. Differential Revision: https://reviews.llvm.org/D80167	2020-05-21 08:53:47 -05:00
Nicolas Vasilache	941005f51a	[mlir] NFC - Add a builder to vector.transpose Summary: Also expose some more vector ops to EDSCs. Differential Revision: https://reviews.llvm.org/D80333	2020-05-21 05:18:58 -04:00
Mehdi Amini	5c3ebd7725	Revert "[mlir][gpu] Refactor ConvertGpuLaunchFuncToCudaCalls pass." This reverts commit `cdb6f05e2d`. The build is broken with: You have called ADD_LIBRARY for library obj.MLIRGPUtoCUDATransforms without any source files. This typically indicates a problem with your CMakeLists.txt file	2020-05-21 03:44:35 +00:00
Wen-Heng (Jack) Chung	ad398164ba	[mlir][gpu] Refactor functions for workgroup and private buffer attributions. Summary: Consolidate interfaces adding workgroup and private buffer attributions in GPU dialect. Note all private buffer attributions must follow workgroup buffer attributions. Reviewers: herhut Subscribers: mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, csigg, arpith-jacob, mgester, lucyrfox, liufengdb, stephenneuendorffer, Joonsoo, grosul1, frgossen, Kayjukh, llvm-commits Tags: #llvm, #mlir Differential Revision: https://reviews.llvm.org/D79508	2020-05-20 16:20:27 -05:00
Wen-Heng (Jack) Chung	cdb6f05e2d	[mlir][gpu] Refactor ConvertGpuLaunchFuncToCudaCalls pass. Due to similar APIs between CUDA and ROCm (HIP), ConvertGpuLaunchFuncToCudaCalls pass could be used on both platforms with some refactoring. In this commit: - Migrate ConvertLaunchFuncToCudaCalls from GPUToCUDA to GPUCommon, and rename. - Rename runtime wrapper APIs be platform-neutral. - Let GPU binary annotation attribute be specifiable as a PassOption. - Naming changes within the implementation and tests. Subsequent patches would introduce ROCm-specific tests and runtime wrapper APIs. Differential Revision: https://reviews.llvm.org/D80167	2020-05-20 16:11:48 -05:00
MaheshRavishankar	0e88eb5c51	[mlir][spirv] Adapt subview legalization to the updated op semantics. The subview semantics changes recently to allow for more natural representation of constant offsets and strides. The legalization of subview op for lowering to SPIR-V needs to account for this. Also change the linearization to use the strides from the affine map of a memref. Differential Revision: https://reviews.llvm.org/D80270	2020-05-20 12:00:21 -07:00
Nicolas Vasilache	7c3c5b11b1	[mlir][Vector] Add option to fully unroll for VectorTransfer to SCF lowering Summary: Previously, the only support partial lowering from vector transfers to SCF was going through loops. This requires a dedicated allocation and extra memory roundtrips because LLVM aggregates cannot be indexed dynamically (for more details see the [deep-dive](https://mlir.llvm.org/docs/Dialects/Vector/#deeperdive)). This revision allows specifying full unrolling which removes this additional roundtrip. This should be used carefully though because full unrolling will spill, negating the benefits of removing the interim alloc in the first place. Proper heuristics are left for a later time. Differential Revision: https://reviews.llvm.org/D80100	2020-05-20 11:02:13 -04:00
Alex Zinenko	3ccf4a5bd1	[mlir] ensureRegionTerminator: take OpBuilder The SingleBlockImplicitTerminator op trait provides a function `ensureRegionTerminator` that injects an appropriate terminator into the block if necessary, which is used during operation constructing and parsing. Currently, this function directly modifies the IR using low-level APIs on Operation and Block. If this function is called from a conversion pattern, these manipulations are not reflected in the ConversionPatternRewriter and thus cannot be undone or, worse, lead to tricky memory errors and malformed IR. Change `ensureRegionTerminator` to take an instance of `OpBuilder` instead of `Builder`, and use it to construct the block and the terminator when required. Maintain overloads taking an instance of `Builder` and creating a simple `OpBuilder` to use in parsers, which don't have an `OpBuilder` and cannot interact with the dialect conversion mechanism. This change was one of the reasons to make `<OpTy>::build` accept an `OpBuilder`. Differential Revision: https://reviews.llvm.org/D80138	2020-05-20 16:14:46 +02:00
Nicolas Vasilache	19e5b2bccb	[mlir][Linalg] NFC - Simplify GenericNestLoop builder Summary: This revision trims unnecessary complexity. Differential Revision: https://reviews.llvm.org/D80290	2020-05-20 09:44:15 -04:00
Nicolas Vasilache	004a3d4f56	[mlir][Linalg] Refactor linalg tiling Summary: This revision refactors the Linalg tiling pass to be written as pattern applications and retires the use of the folder in Linalg tiling. In the early days, tiling was written as a pass that would create (partially) folded and canonicalized operations on the fly for better composability. As this evolves towards composition of patterns, the pass-specific folder is counter-productive and is retired. The tiling options struct evolves to take a tile size creation function which allows materializing tile sizes on the fly (in particular constant tile sizes). This plays better with folding and DCE. With the folder going away in Tiling, the check on whether subviews are the same in linalg fusion needs to be more robust. This revision also implements such a check. In the current form, there are still some canonicalizations missing due to AffineMin/Max ops fed by scf::ForOp. These will be improved at a later time. Differential Revision: https://reviews.llvm.org/D80267	2020-05-20 09:39:56 -04:00
Tres Popp	02035580d3	[mlir] Add custom assembly formats to shape.witness ops. The assembly formats are essentially the generic forms without quotations and type information. Differential Revision: https://reviews.llvm.org/D80180	2020-05-20 13:25:33 +02:00
Tres Popp	fb6986ef69	[mlir] Custom printing/parsing for Shape::AssumingOp Summary: Additionally, this adds traits and builder methods to AssumingYieldOp and names the input witness to the AssumingOp. Differential Revision: https://reviews.llvm.org/D80187	2020-05-20 10:39:26 +02:00
Tres Popp	44226c1fea	[mlir] Mark witness related Shape dialect ops as NoSideEffect. Differential Revision: https://reviews.llvm.org/D80179	2020-05-20 10:26:35 +02:00
Chintan Kaur	78453e3705	Mark AffineMap::replaceDimsAndSymbols as const (NFC) This is consistent to the other methods of the class, as well as AffineExpr::replaceDimsAndSymbols. Differential Revision: https://reviews.llvm.org/D80266	2020-05-20 03:11:41 +00:00
Thomas Raoux	b359bbaa8b	[mlir][spirv] First step to support spirv cooperative matrix extension. Add a new type to SPIRV dialect for cooperative matrix and add new op for cooperative matrix load. This is missing most instructions to support cooperative matrix extension but this is a stop-gap patch to avoid creating big review. Differential Revision: https://reviews.llvm.org/D80043	2020-05-19 19:29:41 -07:00
Diego Caballero	a45fb1942f	[mlir][Affine] Introduce affine memory interfaces This patch introduces interfaces for read and write ops with affine restrictions. I used `read`/`write` intead of `load`/`store` for the interfaces so that they can also be implemented by dma ops. For now, they are only implemented by affine.load, affine.store, affine.vector_load and affine.vector_store. For testing purposes, this patch also migrates affine loop fusion and required analysis to use the new interfaces. No other changes are made beyond that. Co-authored-by: Alex Zinenko <zinenko@google.com> Reviewed By: bondhugula, ftynse Differential Revision: https://reviews.llvm.org/D79829	2020-05-19 17:32:50 -07:00
Sean Silva	21b0eff773	[mlir][shape] Add `shape.from_extents`. Summary: This is a basic op needed for creating shapes from SSA values representing the extents. Differential Revision: https://reviews.llvm.org/D79833	2020-05-19 14:26:08 -07:00
Ehsan Toosi	3468300511	[MLIR] Update the FunctionAndBlockSignatureConverter and NonVoidToVoidReturnOpConverter of Buffer Assignment Making these two converters more generic. FunctionAndBlockSignatureConverter now moves only memref results (after type conversion) to the function argument and keeps other legal function results unchanged. NonVoidToVoidReturnOpConverter is renamed to NoBufferOperandsReturnOpConverter. It removes only the buffer operands from the operands of the converted ReturnOp and inserts CopyOps to copy each buffer to the target function argument. Differential Revision: https://reviews.llvm.org/D79329	2020-05-19 17:04:59 +02:00
Alex Zinenko	d1560f3956	[mlir] scf::ForOp: provide builders with callbacks for loop body Thanks to a recent change that made `::build` functions take an instance of `OpBuilder`, it is now possible to build operations within a region attached to the operation about to be created. Exercise this on `scf::ForOp` by taking a callback that populates the loop body while the loop is being created. Additionally, provide helper functions to build perfect nests of `ForOp`s, with support for iteration arguments. These functions provide the same functionality as EDSC LoopNestBuilder with simpler implementation, without relying on edsc::ScopedContext, and using `OpBuilder` in an unambiguous way. Compatibility functions for EDSC are provided, but may be removed in the future. Differential Revision: https://reviews.llvm.org/D79688	2020-05-19 16:26:29 +02:00
George	e984b7f2a2	Added a TanOp to SPIR-V dialect GLSL ops Implemented tangent op from SPIR-V's GLSL extended instruction set. Added a round-trip and serialization/deserialization tests for the op. Differential Revision: https://reviews.llvm.org/D80152	2020-05-19 09:15:29 -04:00
Frederik Gossen	5afd86b0de	[MLIR] Add helper functions for common integer types Add helper functions for 32-bit and 64-bit integer types. Differential Revision: https://reviews.llvm.org/D80111	2020-05-19 11:42:41 +00:00
Kiran Kumar T P	fa8fc9ffcc	[MLIR, OpenMP] Support for flush operation, and translating the same to LLVM IR Summary: This patch adds support for flush operation in OpenMP dialect and translation of this construct to LLVM IR. The OpenMP IRBuilder is used for this translation. The patch includes code changes and testcase modifications. Reviewed By: ftynse, kiranchandramohan Differential Revision: https://reviews.llvm.org/D79937	2020-05-19 17:01:25 +05:30
Pierre Oechsel	d1866f8947	[MLIR] [Linalg] Add option to use the partial view after promotion. For now the promoted buffer is indexed using the `full view`. The full view might be slightly bigger than the partial view (which is accounting for boundaries). Unfortunately this does not compose easily with other transformations when multiple buffers with shapes related to each other are involved. Take `linalg.matmul A B C` (with A of size MxK, B of size KxN and C of size MxN) and suppose we are: - Tiling over M by 100 - Promoting A only This is producing a `linalg.matmul promoted_A B subview_C` where `promoted_A` is a promoted buffer of `A` of size (100xK) and `subview_C` is a subview of size mxK where m could be smaller than 100 due to boundaries thus leading to a possible incorrect behavior. We propose to: - Add a new parameter to the tiling promotion allowing to enable the use of the full tile buffer. - By default all promoted buffers will be indexed by the partial view. Note that this could be considered as a breaking change in comparison to the way the tiling promotion was working. Differential Revision: https://reviews.llvm.org/D79927	2020-05-18 18:28:18 +02:00
Nicolas Vasilache	1870e787af	[mlir][Vector] Add an optional "masked" boolean array attribute to vector transfer operations Summary: Vector transfer ops semantic is extended to allow specifying a per-dimension `masked` attribute. When the attribute is false on a particular dimension, lowering to LLVM emits unmasked load and store operations. Differential Revision: https://reviews.llvm.org/D80098	2020-05-18 11:52:08 -04:00
Nicolas Vasilache	36cdc17f8c	[mlir][Vector] Make minor identity permutation map optional in transfer op printing and parsing Summary: This revision makes the use of vector transfer operatons more idiomatic by allowing to omit and inferring the permutation_map. Differential Revision: https://reviews.llvm.org/D80092	2020-05-18 11:41:27 -04:00
Nicolas Vasilache	03092f2fa7	[mlir] Add BoolArrayAttr in Tablegen + Builder support Differential Revision: https://reviews.llvm.org/D80090	2020-05-18 09:42:12 -04:00
Nicolas Vasilache	1d6eb09d22	[mlir] NFC - VectorTransforms use OpBuilder where relevant Summary: This will allow using unrolling outside of only rewrite patterns. Differential Revision: https://reviews.llvm.org/D80083	2020-05-17 10:17:12 -04:00
Eli Friedman	4f04db4b54	AllocaInst should store Align instead of MaybeAlign. Along the lines of D77454 and D79968. Unlike loads and stores, the default alignment is getPrefTypeAlign, to match the existing handling in various places, including SelectionDAG and InstCombine. Differential Revision: https://reviews.llvm.org/D80044	2020-05-16 14:53:16 -07:00
Stephen Neuendorffer	ec44e08940	[MLIR] Move JitRunner to live with ExecutionEngine The JitRunner library is logically very close to the execution engine, and shares similar dependencies. find -name "*.cpp" -exec sed -i "s/Support\/JitRunner/ExecutionEngine\/JitRunner/" "{}" \; Differential Revision: https://reviews.llvm.org/D79899	2020-05-15 14:37:10 -07:00
Stephen Neuendorffer	eb623ae832	[MLIR] Continue renaming of "SideEffects" MLIRSideEffects -> MLIRSideEffectInterfaces SideEffects.h -> SideEffectInterfaces.h SideEffects.cpp -> SideEffectInterface.cpp Note that I haven't renamed TableGen/SideEffects.h or TableGen/SideEffects.cpp find -name "*.h" -exec sed -i "s/SideEffects.h/SideEffectInterfaces.h/" "{}" \; find -name "CMakeLists.txt" -exec sed -i "s/MLIRSideEffects/MLIRSideEffectInterfaces/" "{}" \; Differential Revision: https://reviews.llvm.org/D79890	2020-05-15 14:37:09 -07:00
Stephen Neuendorffer	9de4ee3815	[MLIR] Allow unreachable blocks to violate dominance property. It is possible for optimizations to create SSA code which violates the dominance property in unreachable blocks. Equivalently, dominance computed using normal mechanisms is undefined in unreachable blocks. See discussion here: https://llvm.discourse.group/t/rfc-allowing-dialects-to-relax-the-ssa-dominance-condition/833/51 This patch only checks the dominance condition inside blocks which are reachable from the the entry block of their region. Note that the dominance conditions of regions contained in an unreachable block are still checked. Differential Revision: https://reviews.llvm.org/D79922	2020-05-15 10:31:57 -07:00
Tres Popp	a26883e5aa	[MLIR] Add shape.witness type and ops Summary: These represent shape based preconditions on execution of code. Differential Revision: https://reviews.llvm.org/D79717	2020-05-15 14:33:54 +02:00
Alex Zinenko	4ead2cf76c	[mlir] Rename conversions involving ex-Loop dialect to mention SCF The following Conversions are affected: LoopToStandard -> SCFToStandard, LoopsToGPU -> SCFToGPU, VectorToLoops -> VectorToSCF. Full file paths are affected. Additionally, drop the 'Convert' prefix from filenames living under lib/Conversion where applicable. API names and CLI options for pass testing are also renamed when applicable. In particular, LoopsToGPU contains several passes that apply to different kinds of loops (`for` or `parallel`), for which the original names are preserved. Differential Revision: https://reviews.llvm.org/D79940	2020-05-15 10:45:11 +02:00
Nicolas Vasilache	f1b972041a	[mlir][Linalg] Start a LinalgToStandard pass and move conversion to library calls. This revision starts decoupling the include the kitchen sink behavior of Linalg to LLVM lowering by inserting a -convert-linalg-to-std pass. The lowering of linalg ops to function calls was previously lowering to memref descriptors by having both linalg -> std and std -> LLVM patterns in the same rewrite. When separating this step, a new issue occurred: the layout is automatically type-erased by this process. This revision therefore introduces memref casts to perform these type erasures explicitly. To connect everything end-to-end, the LLVM lowering of MemRefCastOp is relaxed because it is artificially more restricted than the op semantics. The op semantics already guarantee that source and target MemRefTypes are cast-compatible. An invalid lowering test now becomes valid and is removed. Differential Revision: https://reviews.llvm.org/D79468	2020-05-15 00:24:03 -04:00
Eugene Zhulenev	3a11ca7bed	[MLIR] Add symbol map to mlir ExecutionEngine Add additional symbol mapping to be able to provide custom symbols to jitted code at runtime. Differential Revision: https://reviews.llvm.org/D79812	2020-05-14 22:30:03 +02:00
Diego Caballero	bc5565f9ea	[mlir][Affine] Introduce affine.vector_load and affine.vector_store This patch adds `affine.vector_load` and `affine.vector_store` ops to the Affine dialect and lowers them to `vector.transfer_read` and `vector.transfer_write`, respectively, in the Vector dialect. Reviewed By: bondhugula, nicolasvasilache Differential Revision: https://reviews.llvm.org/D79658	2020-05-14 13:17:58 -07:00
Stephan Herhut	9ffaba86e5	[mlir] Fix the example for std.rank Summary: The assembly format for std.rank expects the operand type and not the result type after the colon. Differential Revision: https://reviews.llvm.org/D79857	2020-05-14 10:45:07 +02:00
Stephen Neuendorffer	ce3bbeb915	[MLIR] refactor cmake specification of tablegen'd interfaces. Introduce add_mlir_interface to avoid lots of boilerplate Differential Revision: https://reviews.llvm.org/D79841	2020-05-13 10:37:06 -07:00
Alex Zinenko	60f443bb3b	[mlir] Change dialect namespace loop->scf All ops of the SCF dialect now use the `scf.` prefix instead of `loop.`. This is a part of dialect renaming. Differential Revision: https://reviews.llvm.org/D79844	2020-05-13 19:20:21 +02:00
Pierre Oechsel	a5d80818fa	[mlir] [VectorOps] Add missing EDSC intrinsics. Differential Revision: https://reviews.llvm.org/D79858	2020-05-13 10:11:30 -04:00
Nicolas Vasilache	e0b99a5de4	[mlir] Add SubViewOp::getOrCreateRanges and fix folding pattern The existing implementation of SubViewOp::getRanges relies on all offsets/sizes/strides to be dynamic values and does not work in combination with canonicalization. This revision adds a SubViewOp::getOrCreateRanges to create the missing constants in the canonicalized case. This allows reactivating the fused pass with staged pattern applications. However another issue surfaces that the SubViewOp verifier is now too strict to allow folding. The existing folding pattern is turned into a canonicalization pattern which rewrites memref_cast + subview into subview + memref_cast. The transform-patterns-matmul-to-vector can then be reactivated. Differential Revision: https://reviews.llvm.org/D79759	2020-05-13 10:11:30 -04:00
Marcel Koester	881c3bb6a7	[mlir] Adapted standard Alloc and Alloca ops to use new side-effect resources. The current standard Alloca node is not annotated with the MemEffect<Alloc> trait. This CL updates the Alloc and Alloca memory-effect annotations using the latest Resource objects. Alloca nodes will use a newly defined AutomaticAllocationScopeResource to distinguish between Alloc and Alloca memory effects. Differential Revision: https://reviews.llvm.org/D79620	2020-05-13 13:46:39 +02:00
MaheshRavishankar	49e6c19100	[mlir][StandardToLLVM] Add SinOp to LLVM dialect and lowering of std.sin to this op. Differential Revision: https://reviews.llvm.org/D79505	2020-05-12 23:15:25 -07:00
MaheshRavishankar	5440d0a12d	[mlir][Linalg] Add folders and canonicalizers for linalg.reshape/linalg.tensor_reshape operations. Differential Revision: https://reviews.llvm.org/D79765	2020-05-12 23:03:26 -07:00
MaheshRavishankar	d2a9569850	[mlir][Linalg] Allow reshapes to collapse to a zero-rank tensor. This is only valid if the source tensors (result tensor) is static shaped with all unit-extents when the reshape is collapsing (expanding) dimensions. Differential Revision: https://reviews.llvm.org/D79764	2020-05-12 23:03:25 -07:00
Nicolas Vasilache	63c0e72b2f	[mlir] Revisit std.subview handling of static information. The main objective of this revision is to change the way static information is represented, propagated and canonicalized in the SubViewOp. In the current implementation the issue is that canonicalization may strictly lose information because static offsets are combined in irrecoverable ways into the result type, in order to fit the strided memref representation. The core semantics of the op do not change but the parser and printer do: the op always requires `rank` offsets, sizes and strides. These quantities can now be either SSA values or static integer attributes. The result type is automatically deduced from the static information and more powerful canonicalizations (as powerful as the representation with sentinel `?` values allows). Previously static information was inferred on a best-effort basis from looking at the source and destination type. Relevant tests are rewritten to use the idiomatic `offset: x, strides : [...]`-form. Bugs are corrected along the way that were not trivially visible in flattened strided memref form. Lowering to LLVM is updated, simplified and now supports all cases. A mixed static-dynamic mode test that wouldn't previously lower is added. It is an open question, and a longer discussion, whether a better result type representation would be a nicer alternative. For now, the subview op carries the required semantic. Differential Revision: https://reviews.llvm.org/D79662	2020-05-12 20:04:44 -04:00

1 2 3 4 5 ...

2708 Commits