clang-p2996

Author	SHA1	Message	Date
Kazu Hirata	1a36588ec6	[mlir] Use std::nullopt instead of None (NFC) This patch mechanically replaces None with std::nullopt where the compiler would warn if None were deprecated. The intent is to reduce the amount of manual work required in migrating from Optional to std::optional. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-03 18:50:27 -08:00
River Riddle	b74192b7ae	[mlir] Remove support for non-prefixed accessors This finishes off a year long pursuit to LLVMify the generated operation accessors, prefixing them with get/set. Support for any other accessor naming is fully removed after this commit. https://discourse.llvm.org/t/psa-raw-accessors-are-being-removed/65629 Differential Revision: https://reviews.llvm.org/D136727	2022-12-02 13:32:36 -08:00
Nicolas Vasilache	a8850312c1	[mlir][Transform][NFC] Use a single rewriter instead of duplicating it everywhere Differential Revision: https://reviews.llvm.org/D139094	2022-12-01 03:54:31 -08:00
Ron Lieberman	ca856fff1c	Revert "enable code-object-version=5" very sorry wrong repo. This reverts commit `d882ba7aea`.	2022-11-29 15:21:09 -06:00
Ron Lieberman	d882ba7aea	enable code-object-version=5	2022-11-29 15:11:57 -06:00
Guray Ozen	c5798fae05	[mlir] [transform] Error for duplicated processor mapping In a nested loop nest, it is not feasible to map different loops to the same processing unit; for an example, check the code below. This modification includes a check in this circumstance. ``` scf.foreach_thread (%i, %j) in (%c32, %c32) {...} { mapping = [#gpu.thread<x>, #gpu.thread<x>] } ``` Note: It also deletes a test because it is not possible to reproduce this error. Depends on D138020 Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D138032	2022-11-18 08:38:53 +01:00
Guray Ozen	5ce68f4284	[mlir] Introduce `replaceUsesOfWith` to `RewriterBase` Finding uses of a value and replacing them with a new one is a common method. I have not seen a safe and easy shortcut that does that. This revision attempts to address that by introducing `replaceUsesOfWith` to `RewriterBase`. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D138110	2022-11-16 17:53:11 +01:00
Guray Ozen	63ca939783	[mlir] [transform] Fix for RAUW error in transform gpu dialect The given test fails due to the error below. One `memref.store` and two `memref.load` are consumers of the loop index for which I do RAUW. `memref.store` is first in the list. If I RAUW on this, the loop over `llvm::make_early_inc_range(threadIdx.getUsers())` does not return the two `memref.load` as users. They remain unchanged. I'm not really certain why. This change applies RAUW after collecting the users. If a better solution exists, I would be happy to implement it. ``` mlir-opt: ...llvm-project/mlir/include/mlir/IR/UseDefLists.h:175: mlir::IRObjectWithUseList<mlir::OpOperand>::~IRObjectWithUseList() [OperandType = mlir::OpOperand]: Assertion `use_empty() && "Cannot destroy a value that still has uses!"' failed. PLEASE submit a bug report to https://github.com/llvm/llvm-project/issues/ and include the crash backtrace. ``` Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D138029	2022-11-16 09:55:24 +01:00
Krzysztof Drewniak	38ec0a07d0	[mlir][SerializeToHsaco] Minimize dependencies of AMDGPU compilation The SerializeToHsaco uses functions from ExecutionEngineUtils to set up LLVM pass pipelines, but does not otherwise depend on the execution engine (except indirectly via a dependency on IPO). This commit removes the dependency on the execution engine to prevent unnecessary compilations. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D138041	2022-11-15 20:31:06 +00:00
Mahesh Ravishankar	fc367dfa67	[mlir] Remove `Transforms/SideEffectUtils.h` and move the methods into `Interface/SideEffectInterfaces.h`. The methods in `SideEffectUtils.h` (and their implementations in `SideEffectUtils.cpp`) seem to have similar intent to methods already existing in `SideEffectInterfaces.h`. Move the declaration (and implementation) from `SideEffectUtils.h` (and `SideEffectUtils.cpp`) into `SideEffectInterfaces.h` (and `SideEffectInterface.cpp`). Also drop the `SideEffectInterface::hasNoEffect` method in favor of `mlir::isMemoryEffectFree` which actually recurses into the operation instead of just relying on the `hasRecursiveMemoryEffectTrait` exclusively. Differential Revision: https://reviews.llvm.org/D137857	2022-11-15 20:07:35 +00:00
Guray Ozen	beaffb041c	[mlir][transform] Decouple GPUDeviceMapping attribute from the GPU transform dialect code generator `DeviceMappingAttrInterface` is implemented as a unifying mechanism for thread mapping. A code generator could use any attribute that implements this interface to lower `scf.foreach_thread` to device specific code. It is allowed to choose its own mapping and interpretation. Currently, GPU transform dialect supports only `GPUThreadMapping` and `GPUBlockMapping`; however, other mappings should be supported as well. This change addresses this issue. It decouples gpu transform dialect from the `GPUThreadMapping` and `GPUBlockMapping`. Now, they can work with any other mapping. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D138020	2022-11-15 18:16:32 +01:00
Mehdi Amini	3bed21bdd2	Apply clang-tidy fixes for readability-simplify-boolean-expr in GPUDialect.cpp (NFC)	2022-11-14 20:04:16 +00:00
Nicolas Vasilache	f0a411da77	[mlir][Transform]Significantly cleanup scf.foreach_thread and GPU transform permutation handling Previously, the need for a dense permutation leaked into the thread_dim_mapping specification. This revision allows using a sparse specification of the thread_dim_mapping and the proper completion / sorting is applied automatically. In the process, the semantics of scf.foreach_thread is tightened to require a matching number of thread dimensions and mappings. The relevant negative test is added. Differential Revision: https://reviews.llvm.org/D137906	2022-11-14 09:19:49 -08:00
Guray Ozen	0d845660f4	[mlir] Fix asan errors in gpu transform dialect	2022-11-11 11:57:00 +01:00
Guray Ozen	6663f34704	[mlir] Introduce device mapper attribute for `thread_dim_map` and `mapped to dims` `scf.foreach_thread` defines the mapping of its loops to processors via an integer array; see the example below. A lowering can use this mapping. However, expressing mapping as an integer array is very confusing, especially when there are multiple levels of parallelism. In addition, the op does not verify the integer array. This change introduces device mapping attribute to make mapping descriptive and verifiable. Then it makes GPU transform dialect use it. ``` scf.foreach_thread (%i, %j) in (%c1, %c2) { scf.foreach_thread (%i2, %j2) in (%c1, %c2) {...} { thread_dim_mapping = [0, 1]} } { thread_dim_mapping = [0, 1]} ``` It first introduces a `DeviceMappingInterface` which is an attribute interface. `scf.foreach_thread` defines its mapping via this interface. A lowering must define its attributes and implement this interface as well. This gives us clear validation. The change also introduces two new attributes (`#gpu.thread<x/y/z>` and `#gpu.block<x,y,z>` ). After this change, the above code prints as below, which clarifies the loop mappings. The change also implements consuming of these two new attributes by the transform dialect. Transform dialect binds the outermost loops to the thread blocks and innermost loops to threads. ``` scf.foreach_thread (%i, %j) in (%c1, %c2) { scf.foreach_thread (%i2, %j2) in (%c1, %c2) {...} { thread_dim_mapping = [#gpu.thread<x>, #gpu.thread<y>]} } { thread_dim_mapping = [#gpu.block<x>, #gpu.block<y>]} ``` Reviewed By: ftynse, nicolasvasilache Differential Revision: https://reviews.llvm.org/D137413	2022-11-11 08:44:57 +01:00
Mehdi Amini	c310a6dda2	Apply clang-tidy fixes for readability-simplify-boolean-expr in GPUDialect.cpp (NFC)	2022-11-06 20:15:00 +00:00
Nirvedh Meshram	c441070665	[mlir][spirv] Add conversion from GPU WMMA ops to SPIRV Cooperative matrix Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D136521	2022-10-22 18:29:40 -07:00
Hanhan Wang	00767cb452	[mlir] Delete dup code and use unified methods. The foldMemRefCast method is defined in memref namespace; the foldTensorCast method is defined in tensor namespace. This revision deletes the dup code and uses the unified methods. Reviewed By: dcaballe Differential Revision: https://reviews.llvm.org/D136379	2022-10-21 16:51:44 -07:00
Alex Zinenko	b0bf7ffffc	[mlir] add utilities for DiagnosedSilenceableFailure This class adds helper functions similar to `emitError` for the DiagnosedSilenceableFailure class in both the silenceable and definite failure cases. These helpers simplify the use of said class and make transform op application code idiomatic. Reviewed By: springerm Differential Revision: https://reviews.llvm.org/D136072	2022-10-17 15:31:28 +00:00
Mehdi Amini	5fbec2dfbb	Apply clang-tidy fixes for readability-identifier-naming in GPUTransformOps.cpp (NFC)	2022-10-11 16:54:33 +00:00
Ivan Butygin	b845addae8	[mlir][gpu] Add `subgroup_reduce` operation Introduce `subgroup_reduce` operation, similar to `all_reduce`, but operating on subgroup scope instead of workgroup. It is intended as a low-level building block for more high level abstractions (e.g. for workgroup-wide `all_reduce` ops). Only introduce version taking reduce operation enum for simplicity's sake. Differential Revision: https://reviews.llvm.org/D135323	2022-10-11 11:47:15 +02:00
Guray Ozen	e68a7bed59	[mlir][transform] Add failing test for GPU transform dialect The GPU transform dialect currently has restrictions and several situations where we can't use transform dialect. This update includes a method to test failing cases in GPU transform dialect. Differential Revision: https://reviews.llvm.org/D135063	2022-10-05 13:10:13 +02:00
Guray Ozen	78305720f3	[mlir][transform][nfc] typo fix fix typo Reviewed By: nicolasvasilache, ftynse Differential Revision: https://reviews.llvm.org/D135242	2022-10-05 13:05:46 +02:00
Guray Ozen	89bb0cae46	[mlir][transform] Create GPU transform dialect This revision adds GPU transform dialect. It also introduces a prefix such as "transform.gpu" for all ops related to this dialect. MLIR already had two GPU transform ops in linalg. This revision moves these ops into GPUTransformOps. The Ops are as follows: `transform.structured.map_nested_foreach_thread_to_gpu_blocks` -> `transform.gpu.map_foreach_to_blocks` This op selects the outermost (toplevel) foreach_thread and parallelizes it across GPU blocks. It can also generate `gpu_launch`. `transform.structured.map_nested_foreach_thread_to_gpu_threads` -> `transform.gpu.map_nested_foreach_to_threads` This op parallelizes nested foreach_thread that are inside `gpu_launch` across GPU threads. It doesn't add new functionality, but there is some minor refactoring of the code. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D134800	2022-10-04 13:09:08 +02:00
River Riddle	10c04f4641	[mlir:GPU][NFC] Update GPU API to use prefixed accessors This doesn't flip the switch for prefix generation yet, that'll be done in a followup.	2022-09-30 15:27:10 -07:00
River Riddle	a5aa783685	[mlir:Async][NFC] Update Async API to use prefixed accessors This doesn't flip the switch for prefix generation yet, that'll be done in a followup.	2022-09-30 15:27:10 -07:00
Jakub Kuderski	abc362a107	[mlir][arith] Change dialect name from Arithmetic to Arith Suggested by @lattner in https://discourse.llvm.org/t/rfc-define-precise-arith-semantics/65507/22. Tested with: `ninja check-mlir check-mlir-integration check-mlir-mlir-spirv-cpu-runner check-mlir-mlir-vulkan-runner check-mlir-examples` and `bazel build --config=generic_clang @llvm-project//mlir:all`. Reviewed By: lattner, Mogball, rriddle, jpienaar, mehdi_amini Differential Revision: https://reviews.llvm.org/D134762	2022-09-29 11:23:28 -04:00
Ivan Radanov Ivanov	e01c7f092f	[MLIR] Revert default NVIDIA GPU version Due to integration tests failing, revert mlir::SerializeToCubinPass defaults to the old ones (changed in https://reviews.llvm.org/D134153) Reviewed By: akuegel Differential Revision: https://reviews.llvm.org/D134414	2022-09-22 10:19:38 +02:00
River Riddle	986b5c56ea	[mlir] Flip Async/GPU/OpenACC/OpenMP to use Both accessors This allows for incrementally updating the old API usages without needing to update everything at once. These will be left on Both for a little bit and then flipped to prefixed when all APIs have been updated. Differential Revision: https://reviews.llvm.org/D134386	2022-09-21 17:36:13 -07:00
Ivan Radanov Ivanov	f9211330f6	[MLIR] Set default NVIDIA GPU version	2022-09-21 18:10:59 -04:00
Ivan Radanov Ivanov	2f7a774ed7	[MLIR] Add a create function for mlir::SerializeToCubinPass Differential Revision: https://reviews.llvm.org/D134153	2022-09-21 18:02:59 -04:00
Mehdi Amini	28c17a4b06	Apply clang-tidy fixes for performance-unnecessary-value-param in InferIntRangeInterfaceImpls.cpp (NFC)	2022-09-01 14:50:14 +00:00
Michele Scuttari	67d0d7ac0a	[MLIR] Update pass declarations to new autogenerated files The patch introduces the required changes to update the pass declarations and definitions to use the new autogenerated files and allow dropping the old infrastructure. Reviewed By: mehdi_amini, rriddle Differential Review: https://reviews.llvm.org/D132838	2022-08-31 12:28:45 +02:00
Michele Scuttari	039b969b32	Revert "[MLIR] Update pass declarations to new autogenerated files" This reverts commit `2be8af8f0e`.	2022-08-30 22:21:55 +02:00
Michele Scuttari	2be8af8f0e	[MLIR] Update pass declarations to new autogenerated files The patch introduces the required changes to update the pass declarations and definitions to use the new autogenerated files and allow dropping the old infrastructure. Reviewed By: mehdi_amini, rriddle Differential Review: https://reviews.llvm.org/D132838	2022-08-30 21:56:31 +02:00
Christian Sigg	50c33a3a9c	[MLIR] Harden gpu.func verification GPUFuncOpLowering moves the body out of gpu.func op and erases it. An empty gpu.func may fail verification but should not crash it. Verification of an erased op is triggered e.g. with debug printing on. Reviewed By: akuegel Differential Revision: https://reviews.llvm.org/D132446	2022-08-23 14:58:46 +02:00
Jeff Niu	58a47508f0	(Reland) [mlir] Switch segment size attributes to DenseI32ArrayAttr This reland includes changes to the Python bindings. Switch variadic operand and result segment size attributes to use the dense i32 array. Dense integer arrays were introduced primarily to represent index lists. They are a better fit for segment sizes than dense elements attrs. Depends on D131801 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D131803	2022-08-12 19:44:52 -04:00
Alex Zinenko	e8e718fa4b	Revert "[mlir] Switch segment size attributes to DenseI32ArrayAttr" This reverts commit `30171e76f0`. Breaks Python tests in MLIR, missing C API and Python changes.	2022-08-12 10:22:47 +02:00
Jeff Niu	30171e76f0	[mlir] Switch segment size attributes to DenseI32ArrayAttr Switch variadic operand and result segment size attributes to use the dense i32 array. Dense integer arrays were introduced primarily to represent index lists. They are a better fit for segment sizes than dense elements attrs. Depends on D131738 Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D131702	2022-08-11 20:56:45 -04:00
Benjamin Kramer	9fa59e7643	[mlir] Use C++17 structured bindings instead of std::tie where applicable. NFCI	2022-08-09 13:34:17 +02:00
River Riddle	c60b897d22	[mlir] Refactor the Parser library in preparation for an MLIR binary format The current Parser library is solely focused on providing API for the textual MLIR format, but MLIR will soon also provide a binary format. This commit renames the current Parser library to AsmParser to better correspond to what the library is actually intended for. A new Parser library is added which will act as a unified parser interface between both text and binary formats. Most parser clients are unaffected, given that the unified interface is essentially the same as the current interface. Only clients that rely on utilizing the AsmParserState, or those that want to parse Attributes/Types need to be updated to point to the AsmParser library. Differential Revision: https://reviews.llvm.org/D129605	2022-07-25 16:33:01 -07:00
Jeff Niu	b7f93c2809	[mlir] (NFC) run clang-format on all files	2022-07-14 13:32:13 -07:00
Kazu Hirata	c27d815249	[mlir] Use value instead of getValue (NFC)	2022-07-14 00:19:59 -07:00
Kazu Hirata	491d27013d	[mlir] Use has_value instead of hasValue (NFC)	2022-07-13 00:57:02 -07:00
Jacques Pienaar	136d746ec7	[mlir] Flip accessors to prefixed form (NFC) Another mechanical sweep to keep diff small for flip to _Prefixed.	2022-07-10 21:19:11 -07:00
Christian Sigg	3e01af093f	[mlir] Add InferIntRangeInterface to gpu.launch Infers block/grid dimensions/indices or ranges of such dimensions/indices. Reviewed By: krzysz00 Differential Revision: https://reviews.llvm.org/D129036	2022-07-05 07:14:54 +02:00
Kazu Hirata	3b7c3a654c	Revert "Don't use Optional::hasValue (NFC)" This reverts commit `aa8feeefd3`.	2022-06-25 11:56:50 -07:00
Kazu Hirata	aa8feeefd3	Don't use Optional::hasValue (NFC)	2022-06-25 11:55:57 -07:00
Kazu Hirata	6d5fc1e3d5	[mlir] Don't use Optional::getValue (NFC)	2022-06-20 23:20:25 -07:00
Mogball	d883a02a7c	[mlir][ods] Remove StructAttr Depends on D127373 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D127375	2022-06-21 01:10:05 +00:00


347 Commits