clang-p2996

Author	SHA1	Message	Date
Matthias Springer	e7790fbed3	[mlir] Add `test-convergence` option to Canonicalizer tests This new option is set to `false` by default. It should be set only in Canonicalizer tests to detect faulty canonicalization patterns. I.e., patterns that prevent the canonicalizer from converging. The canonicalizer should always convergence on such small unit tests that we have in `canonicalize.mlir`. Two faulty canonicalization patterns were detected and fixed with this change. Differential Revision: https://reviews.llvm.org/D140873	2023-01-04 12:02:21 +01:00
Krzysztof Drewniak	be575c5dfc	Re-land D139865 "Add known_block_size and known_grid_size to gpu.func" This should fix the MSVC warning that caused the previous revert. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D140766	2023-01-02 16:39:00 +00:00
Stella Stamenova	828b4762ca	Revert "[mlir][GPU] Add known_block_size and known_grid_size to gpu.func" This reverts commit `85e38d7cd6`. This broke the windows mlir buildbot: https://lab.llvm.org/buildbot/#/builders/13/builds/30180/steps/6/logs/stdio	2022-12-23 17:29:42 -08:00
Krzysztof Drewniak	85e38d7cd6	[mlir][GPU] Add known_block_size and known_grid_size to gpu.func In many cases, the the number of workgroups (the grid size) and the number of workitems within each group (the block size) that a GPU kernel will be launched with are known. For example, if gpu.launch is called with constant block and grid sizes, we know that those are the only possible sizes that will be used to launch that kernel. In other cases, a custom code-generation pipeline that eventually produces GPU kernels may know the launch dimensions of those kernels, or at least may be able to provide an upper bound on them. Other GPU programming systems, such as OpenCL, allow capturing such information to enable compiler optimizations - see reqd_work_group_size, but MLIR currently has no mechanism for doing so. This set of attributes is the first step in enabling optimizations based on the known launch dimensions of kernels. It extends the kernel outline pass to set these bounds on kernels with constant launch dimensions and extends integer range inference for GPU index operations to account for the bounds when they are known. Subsequent revisions will use this data when lowering GPU operations to the ROCDL dialect. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D139865	2022-12-22 21:41:46 +00:00
Ramkumar Ramachandra	22426110c5	mlir/tblgen: use std::optional in generation This is part of an effort to migrate from llvm::Optional to std::optional. This patch changes the way mlir-tblgen generates .inc files, and modifies tests and documentation appropriately. It is a "no compromises" patch, and doesn't leave the user with an unpleasant mix of llvm::Optional and std::optional. A non-trivial change has been made to ControlFlowInterfaces to split one constructor into two, relating to a build failure on Windows. See also: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716 Signed-off-by: Ramkumar Ramachandra <r@artagnon.com> Differential Revision: https://reviews.llvm.org/D138934	2022-12-17 11:13:26 +01:00
Jeff Niu	53406427cd	[mlir] FunctionOpInterface: turn required attributes into interface methods (Reland) Reland D139447, D139471 With flang actually working - FunctionOpInterface: make get/setFunctionType interface methods This patch removes the concept of a `function_type`-named type attribute as a requirement for implementors of FunctionOpInterface. Instead, this type should be provided through two interface methods, `getFunctionType` and `setFunctionTypeAttr` (*Attr because functions may use different concrete function types), which should be automatically implemented by ODS for ops that define a `$function_type` attribute. This also allows FunctionOpInterface to materialize function types if they don't carry them in an attribute, for example. Importantly, all the function "helper" still accept an attribute name to use in parsing and printing functions, for example. - FunctionOpInterface: arg and result attrs dispatch to interface This patch removes the `arg_attrs` and `res_attrs` named attributes as a requirement for FunctionOpInterface and replaces them with interface methods for the getters, setters, and removers of the relevent attributes. This allows operations to use their own storage for the argument and result attributes. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D139736	2022-12-10 15:17:09 -08:00
David Spickett	cf98e8273c	Revert "[mlir] FunctionOpInterface: make get/setFunctionType interface methods" and "[mlir] Fix examples build" This reverts commit `fbc253fe81` and `96cf183bcc`. Which I missed in the first revert in `f3379feabe`.	2022-12-09 15:36:48 +00:00
David Spickett	f3379feabe	Revert "[mlir] FunctionOpInterface: arg and result attrs dispatch to interface" and "[flang] Fix flang after MLIR update" This reverts commit `dd74e6b6f4` and `1897b67ae8` due to ongoing test failures on flang bots e.g. https://lab.llvm.org/buildbot/#/builders/179/builds/5050	2022-12-09 15:02:39 +00:00
Jeff Niu	dd74e6b6f4	[mlir] FunctionOpInterface: arg and result attrs dispatch to interface This patch removes the `arg_attrs` and `res_attrs` named attributes as a requirement for FunctionOpInterface and replaces them with interface methods for the getters, setters, and removers of the relevent attributes. This allows operations to use their own storage for the argument and result attributes. Depends on D139471 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D139472	2022-12-08 11:32:38 -08:00
Jeff Niu	fbc253fe81	[mlir] FunctionOpInterface: make get/setFunctionType interface methods This patch removes the concept of a `function_type`-named type attribute as a requirement for implementors of FunctionOpInterface. Instead, this type should be provided through two interface methods, `getFunctionType` and `setFunctionTypeAttr` (*Attr because functions may use different concrete function types), which should be automatically implemented by ODS for ops that define a `$function_type` attribute. This also allows FunctionOpInterface to materialize function types if they don't carry them in an attribute, for example. Importantly, all the function "helper" still accept an attribute name to use in parsing and printing functions, for example. Reviewed By: rriddle, lattner Differential Revision: https://reviews.llvm.org/D139447	2022-12-08 11:32:27 -08:00
River Riddle	b74192b7ae	[mlir] Remove support for non-prefixed accessors This finishes off a year long pursuit to LLVMify the generated operation accessors, prefixing them with get/set. Support for any other accessor naming is fully removed after this commit. https://discourse.llvm.org/t/psa-raw-accessors-are-being-removed/65629 Differential Revision: https://reviews.llvm.org/D136727	2022-12-02 13:32:36 -08:00
Guray Ozen	beaffb041c	[mlir][transform] Decouple GPUDeviceMapping attribute from the GPU transfrom dialect code generator `DeviceMappingAttrInterface` is implemented as unifiying mechanism for thread mapping. A code generator could use any attribute that implements this interface to lower `scf.foreach_thread` to device specific code. It is allowed to choose its own mapping and interpretation. Currently, GPU transform dialect supports only `GPUThreadMapping` and `GPUBlockMapping`; however, other mappings should to be supported as well. This change addresses this issue. It decouples gpu transform dialect from the `GPUThreadMapping` and `GPUBlockMapping`. Now, they can work any other mapping. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D138020	2022-11-15 18:16:32 +01:00
Mehdi Amini	3bed21bdd2	Apply clang-tidy fixes for readability-simplify-boolean-expr in GPUDialect.cpp (NFC)	2022-11-14 20:04:16 +00:00
Mehdi Amini	c310a6dda2	Apply clang-tidy fixes for readability-simplify-boolean-expr in GPUDialect.cpp (NFC)	2022-11-06 20:15:00 +00:00
Nirvedh Meshram	c441070665	[mlir][spirv] Add conversion from GPU WMMA ops to SPIRV Cooperative matrix Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D136521	2022-10-22 18:29:40 -07:00
Hanhan Wang	00767cb452	[mlir] Delete dup code and use unified methods. The foldMemRefCast method is defined in memref namespace; the foldTensorCast method is defined in tensor namespace. This revision deletes the dup code and use the unified methods. Reviewed By: dcaballe Differential Revision: https://reviews.llvm.org/D136379	2022-10-21 16:51:44 -07:00
Ivan Butygin	b845addae8	[mlir][gpu] Add `subgroup_reduce` operation Introduce `subgroup_reduce` operation, similar to `all_reduce`, but operating on subgroup scope instead of workgroup. It is intended as low-level building block for more high level abstractions (e.g for workgroup-wide `all_reduce` ops). Only introduce version taking reduce operation enum for simplicity sake. Differential Revision: https://reviews.llvm.org/D135323	2022-10-11 11:47:15 +02:00
River Riddle	10c04f4641	[mlir:GPU][NFC] Update GPU API to use prefixed accessors This doesn't flip the switch for prefix generation yet, that'll be done in a followup.	2022-09-30 15:27:10 -07:00
Jakub Kuderski	abc362a107	[mlir][arith] Change dialect name from Arithmetic to Arith Suggested by @lattner in https://discourse.llvm.org/t/rfc-define-precise-arith-semantics/65507/22. Tested with: `ninja check-mlir check-mlir-integration check-mlir-mlir-spirv-cpu-runner check-mlir-mlir-vulkan-runner check-mlir-examples` and `bazel build --config=generic_clang @llvm-project//mlir:all`. Reviewed By: lattner, Mogball, rriddle, jpienaar, mehdi_amini Differential Revision: https://reviews.llvm.org/D134762	2022-09-29 11:23:28 -04:00
River Riddle	986b5c56ea	[mlir] Flip Async/GPU/OpenACC/OpenMP to use Both accessors This allows for incrementally updating the old API usages without needing to update everything at once. These will be left on Both for a little bit and then flipped to prefixed when all APIs have been updated. Differential Revision: https://reviews.llvm.org/D134386	2022-09-21 17:36:13 -07:00
Christian Sigg	50c33a3a9c	[MLIR] Harden gpu.func verification GPUFuncOpLowering moves the body out of gpu.func op and erases it. An empty gpu.func may fail verification but should not crash it. Verification of an erased op is triggered e.g. with debug printing on. Reviewed By: akuegel Differential Revision: https://reviews.llvm.org/D132446	2022-08-23 14:58:46 +02:00
Jeff Niu	58a47508f0	(Reland) [mlir] Switch segment size attributes to DenseI32ArrayAttr This reland includes changes to the Python bindings. Switch variadic operand and result segment size attributes to use the dense i32 array. Dense integer arrays were introduced primarily to represent index lists. They are a better fit for segment sizes than dense elements attrs. Depends on D131801 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D131803	2022-08-12 19:44:52 -04:00
Alex Zinenko	e8e718fa4b	Revert "[mlir] Switch segment size attributes to DenseI32ArrayAttr" This reverts commit `30171e76f0`. Breaks Python tests in MLIR, missing C API and Python changes.	2022-08-12 10:22:47 +02:00
Jeff Niu	30171e76f0	[mlir] Switch segment size attributes to DenseI32ArrayAttr Switch variadic operand and result segment size attributes to use the dense i32 array. Dense integer arrays were introduced primarily to represent index lists. They are a better fit for segment sizes than dense elements attrs. Depends on D131738 Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D131702	2022-08-11 20:56:45 -04:00
Benjamin Kramer	9fa59e7643	[mlir] Use C++17 structured bindings instead of std::tie where applicable. NFCI	2022-08-09 13:34:17 +02:00
Jacques Pienaar	136d746ec7	[mlir] Flip accessors to prefixed form (NFC) Another mechanical sweep to keep diff small for flip to _Prefixed.	2022-07-10 21:19:11 -07:00
Kazu Hirata	037f09959a	[mlir] Don't use Optional::hasValue (NFC)	2022-06-20 11:22:37 -07:00
Mogball	d7ef488bb6	[mlir][gpu] Move GPU headers into IR/ and Transforms/ Depends on D127350 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D127352	2022-06-09 22:49:03 +00:00
Arnab Dutta	16219f8c94	[MLIR][GPU] Add canonicalizer for gpu.memcpy Erase gpu.memcpy op when only uses of dest are the memcpy op in question, its allocation and deallocation ops. Reviewed By: bondhugula, csigg Differential Revision: https://reviews.llvm.org/D124257	2022-05-14 19:01:04 +05:30
Christian Sigg	0e3d1ca54a	[MLIR][GPU] NFC: simplify kernel operand accessor implementations. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D125112	2022-05-14 14:14:42 +02:00
Thomas Raoux	15bcc36eed	[mlir][gpu] Move async copy ops to NVGPU and add caching hints Move async copy operations to NVGPU as they only exist on NV target and are designed to match ptx semantic. This allows us to also add more fine grain caching hint attribute to the op. Add hint to bypass L1 and hook it up to NVVM op. Differential Revision: https://reviews.llvm.org/D125244	2022-05-10 22:30:24 +00:00
Chris Lattner	d85eb4e2d6	[AsmParser] Introduce a new "Argument" abstraction + supporting logic MLIR has a common pattern for "arguments" that uses syntax like `%x : i32 {attrs} loc("sourceloc")` which is implemented in adhoc ways throughout the codebase. The approach this uses is verbose (because it is implemented with parallel arrays) and inconsistent (e.g. lots of things drop source location info). Solve this by introducing OpAsmParser::Argument and make addRegion (which sets up BlockArguments for the region) take it. Convert the world to propagating this down. This means that we correctly capture and propagate source location information in a lot more cases (e.g. see the affine.for testcase example), and it also simplifies much code. Differential Revision: https://reviews.llvm.org/D124649	2022-04-29 12:19:34 -07:00
Chris Lattner	5dedf911de	[AsmParser] Rework logic around "region argument parsing" The asm parser had a notional distinction between parsing an operand (like "%foo" or "%4#3") and parsing a region argument (which isn't supposed to allow a result number like #3). Unfortunately the implementation has two problems: 1) It didn't actually check for the result number and reject it. parseRegionArgument and parseOperand were identical. 2) It had a lot of machinery built up around it that paralleled operand parsing. This also was functionally identical, but also had some subtle differences (e.g. the parseOptional stuff had a different result type). I thought about just removing all of this, but decided that the missing error checking was important, so I reimplemented it with a `allowResultNumber` flag on parseOperand. This keeps the codepaths unified and adds the missing error checks. Differential Revision: https://reviews.llvm.org/D124470	2022-04-28 11:12:44 -07:00
Chris Lattner	31c8abc3f1	[AsmParser/Printer] Rework sourceloc support for function arguments. When Location tracking support for block arguments was added, we discussed various approaches to threading support for this through function-like argument parsing. At the time, we added a parallel array of locations that could hold this. It turns out that that approach was verbose and error prone, roughly no one adopted it. This patch takes a different approach, adding an optional source locator to the UnresolvedOperand class. This fits much more naturally into the standard structure we use for representing locators, and gives all the function like dialects locator support for free (e.g. see the test adding an example for the LLVM dialect). Differential Revision: https://reviews.llvm.org/D124188	2022-04-21 12:43:36 -07:00
Fangrui Song	ae46b3e01f	Revert D121279 "[MLIR][GPU] Add canonicalizer for gpu.memcpy" This reverts commit `12f55cac69`. Causes miscompile. Will follow up with a reproduce.	2022-04-21 08:55:13 -07:00
Uday Bondhugula	f47a38f517	Add async dependencies support for gpu.launch op Add async dependencies support for gpu.launch op: this allows specifying a list of async tokens ("streams") as dependencies for the launch. Update the GPU kernel outlining pass lowering to propagate async dependencies from gpu.launch to gpu.launch_func op. Previously, a new stream was being created and destroyed for a kernel launch. The async deps support allows the kernel launch to be serialized on an existing stream. Differential Revision: https://reviews.llvm.org/D123499	2022-04-21 16:25:59 +05:30
Uday Bondhugula	d7565de6cc	[MLIR] NFC. Drop trailing white space in GPU async ops print NFC. Drop trailing end of line white space in GPU async ops' printer whenever the list of async deps is empty. Reviewed By: mehdi_amini, rriddle Differential Revision: https://reviews.llvm.org/D123754	2022-04-20 17:56:53 +05:30
Arnab Dutta	12f55cac69	[MLIR][GPU] Add canonicalizer for gpu.memcpy Fold away gpu.memcpy op when only uses of dest are the memcpy op in question, its allocation and deallocation ops. Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D121279	2022-04-19 17:54:00 +05:30
Mehdi Amini	21b251624b	Apply clang-tidy fixes for readability-identifier-naming in GPUDialect.cpp (NFC)	2022-04-18 18:15:30 +00:00
Arnab Dutta	392d55c1e2	[MLIR][GPU] Add canonicalization patterns for folding simple gpu.wait ops. * Fold away redundant %t = gpu.wait async + gpu.wait [%t] pairs. * Fold away %t = gpu.wait async ... ops when %t has no uses. * Fold away gpu.wait [] ops. * In case of %t1 = gpu.wait async [%t0], replace all uses of %t1 with %t0. Differential Revision: https://reviews.llvm.org/D121878	2022-04-14 12:30:55 +05:30
Thomas Raoux	d77f483640	[mlir][gpu] Relax restriction on mma load/store op Those ops can support more complex layout as long as the most inner dimension is contiguous. Differential Revision: https://reviews.llvm.org/D122452	2022-03-25 04:03:40 +00:00
Markus Böck	e13d23bc6c	[mlir] Rename `OpAsmParser::OperandType` to `OpAsmParser::UnresolvedOperand` I am not sure about the meaning of Type in the name (was it meant be interpreted as Kind?), and given the importance and meaning of Type in the context of MLIR, its probably better to rename it. Given the comment in the source code, the suggestion in the GitHub issue and the final discussions in the review, this patch renames the OperandType to UnresolvedOperand. Fixes https://github.com/llvm/llvm-project/issues/54446 Differential Revision: https://reviews.llvm.org/D122142	2022-03-21 21:42:13 +01:00
River Riddle	4a3460a791	[mlir:FunctionOpInterface] Rename the "type" attribute to "function_type" This removes any potential confusion with the `getType` accessors which correspond to SSA results of an operation, and makes it clear what the intent is (i.e. to represent the type of the function). Differential Revision: https://reviews.llvm.org/D121762	2022-03-16 17:07:04 -07:00
River Riddle	bbfec2a1b0	[mlir] Remove the deprecated ODS Op verifier/parser/printer code blocks These have been deprecated for ~1 month now and can be removed. Differential Revision: https://reviews.llvm.org/D121090	2022-03-15 01:17:30 -07:00
Chia-hung Duan	ed645f6336	[mlir] Support verification order (3/3) In this CL, update the function name of verifier according to the behavior. If a verifier needs to access the region then it'll be updated to `verifyRegions`. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D120373	2022-03-11 01:16:28 +00:00
Akshay Baviskar	f1efac7f08	Add verifier for gpu.alloc op Add verifier for gpu.alloc op to verify if the dimension operand counts and symbol operand counts are same as their memref counterparts. Differential Revision: https://reviews.llvm.org/D117427	2022-02-15 15:57:58 +05:30
Thomas Raoux	5ab04bc068	[mlir][gpu] Add device side async copy operations Add new operations to the gpu dialect to represent device side asynchronous copies. This also add the lowering of those operations to nvvm dialect. Those ops are meant to be low level and map directly to llvm dialects like nvvm or rocdl. We can further add higher level of abstraction by building on top of those operations. This has been discuss here: https://discourse.llvm.org/t/modeling-gpu-async-copy-ampere-feature/4924 Differential Revision: https://reviews.llvm.org/D119191	2022-02-10 17:25:59 -08:00
Matthias Springer	69f7647158	[mlir][GPU] Add ShuffleOp builder for constant offset/width Differential Revision: https://reviews.llvm.org/D119345	2022-02-10 02:55:44 +09:00
River Riddle	2418cd92c0	[mlir] Update uses of `parser`/`printer` ODS op field to `hasCustomAssemblyFormat` The parser/printer fields are deprecated and in the process of being removed.	2022-02-07 19:03:58 -08:00
River Riddle	094ede6d20	[mlir][NFC] Update GPU/SCF operations to use `hasVerifier` instead of `verifier` The verifier field is deprecated, and slated for removal. Differential Revision: https://reviews.llvm.org/D118827	2022-02-02 13:34:31 -08:00

1 2 3 4

167 Commits