clang-p2996

Author	SHA1	Message	Date
Vitaly Buka	6e1ac68a0c	[mlir] Don't iterate mutable user list executeOp.operandsMutable().append(asyncTokens) in addAsyncDependencyAfter can resize and invalidate iterators. Fixes reports like https://reviews.llvm.org/P8286 Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D124577	2022-04-28 08:59:55 -07:00
Uday Bondhugula	f47a38f517	Add async dependencies support for gpu.launch op Add async dependencies support for gpu.launch op: this allows specifying a list of async tokens ("streams") as dependencies for the launch. Update the GPU kernel outlining pass lowering to propagate async dependencies from gpu.launch to gpu.launch_func op. Previously, a new stream was being created and destroyed for a kernel launch. The async deps support allows the kernel launch to be serialized on an existing stream. Differential Revision: https://reviews.llvm.org/D123499	2022-04-21 16:25:59 +05:30
River Riddle	58ceae9561	[mlir:NFC] Remove the forward declaration of FuncOp in the mlir namespace FuncOp has been moved to the `func` namespace for a little over a month, the using directive can be dropped now.	2022-04-18 12:01:55 -07:00
River Riddle	1269f96d2e	[mlir] Add MLIR_DEFINE_EXPLICIT_INTERNAL_INLINE_TYPE_ID to SerializeToCubinPass This pass is defined in an anonymous namespace and requires an explicit TypeID	2022-04-04 14:28:10 -07:00
River Riddle	5e50dd048e	[mlir] Rework the implementation of TypeID This commit restructures how TypeID is implemented to ideally avoid the current problems related to shared libraries. This is done by changing the "implicit" fallback path to use the name of the type, instead of using a static template variable (which breaks shared libraries). The major downside to this is that it adds some additional initialization costs for the implicit path. Given the use of type names for uniqueness in the fallback, we also no longer allow types defined in anonymous namespaces to have an implicit TypeID. To simplify defining an ID for these classes, a new `MLIR_DEFINE_EXPLICIT_INTERNAL_INLINE_TYPE_ID` macro was added to allow for explicitly defining a TypeID directly on an internal class. To help identify when types are using the fallback, `-debug-only=typeid` can be used to log which types are using implicit ids. This change generally only requires changes to the test passes, which are all defined in anonymous namespaces, and thus can't use the fallback any longer. Differential Revision: https://reviews.llvm.org/D122775	2022-04-04 13:52:26 -07:00
River Riddle	3655069234	[mlir] Move the Builtin FuncOp to the Func dialect This commit moves FuncOp out of the builtin dialect, and into the Func dialect. This move has been planned in some capacity from the moment we made FuncOp an operation (years ago). This commit handles the functional aspects of the move, but various aspects are left untouched to ease migration: func::FuncOp is re-exported into mlir to reduce the actual API churn, the assembly format still accepts the unqualified `func`. These temporary measures will remain for a little while to simplify migration before being removed. Differential Revision: https://reviews.llvm.org/D121266	2022-03-16 17:07:03 -07:00
River Riddle	9eaff42360	[mlir][NFC] Move Parser.h to Parser/ There is no reason for this file to be at the top-level, and its current placement predates the Parser/ folder's existence. Differential Revision: https://reviews.llvm.org/D121024	2022-03-07 01:05:38 -08:00
Krzysztof Drewniak	4e817b3fa3	[MLIR][AMDGPU] Fix typo and add comment to SerializeToHsaco Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D120943	2022-03-04 17:15:11 +00:00
River Riddle	1f971e23f0	[mlir] Trim a huge number of unnecessary dependencies on the Func dialect The Func has a large number of legacy dependencies carried over from the old Standard dialect, which was pervasive and contained a large number of varied operations. With the split of the standard dialect and its demise, a lot of lingering dead dependencies have survived to the Func dialect. This commit removes a large majority of then, greatly reducing the dependence surface area of the Func dialect.	2022-03-01 12:10:04 -08:00
River Riddle	23aa5a7446	[mlir] Rename the Standard dialect to the Func dialect The last remaining operations in the standard dialect all revolve around FuncOp/function related constructs. This patch simply handles the initial renaming (which by itself is already huge), but there are a large number of cleanups unlocked/necessary afterwards: * Removing a bunch of unnecessary dependencies on Func * Cleaning up the From/ToStandard conversion passes * Preparing for the move of FuncOp to the Func dialect See the discussion at https://discourse.llvm.org/t/standard-dialect-the-final-chapter/6061 Differential Revision: https://reviews.llvm.org/D120624	2022-03-01 12:10:04 -08:00
Ivan Butygin	d271fc04d5	[mlir][gpu] Split ops sinking from gpu-kernel-outlining pass into separate pass Previously `gpu-kernel-outlining` pass was also doing index computation sinking into gpu.launch before actual outlining. Split ops sinking from `gpu-kernel-outlining` pass into separate pass, so users can use theirs own sinking pass before outlining. To achieve old behavior users will need to call both passes: `-gpu-launch-sink-index-computations -gpu-kernel-outlining`. Differential Revision: https://reviews.llvm.org/D119932	2022-02-17 10:34:20 +03:00
Shao-Ce SUN	2aed07e96c	[NFC][MC] remove unused argument `MCRegisterInfo` in `MCCodeEmitter` Reviewed By: skan Differential Revision: https://reviews.llvm.org/D119846	2022-02-16 13:10:09 +08:00
Shao-Ce SUN	9cc49c1951	Revert "[NFC][MC] remove unused argument `MCRegisterInfo` in `MCCodeEmitter`" This reverts commit `fe25c06cc5`.	2022-02-16 11:57:49 +08:00
Shao-Ce SUN	fe25c06cc5	[NFC][MC] remove unused argument `MCRegisterInfo` in `MCCodeEmitter` For ten years, it seems that `MCRegisterInfo` is not used by any target. Reviewed By: skan Differential Revision: https://reviews.llvm.org/D119846	2022-02-16 11:47:17 +08:00
Krzysztof Drewniak	1aa71944cf	[MLIR][GPU] Add missing include to SerilazeToHsaco Differential Revision: https://reviews.llvm.org/D119852	2022-02-15 17:11:33 +00:00
Ivan Butygin	a2e2fbba17	[mlir][gpu] sinkOperationsIntoLaunchOp: Add user hook for isSinkingBeneficiary Differential Revision: https://reviews.llvm.org/D119632	2022-02-15 16:50:49 +03:00
Sameer Sahasrabuddhe	d8f99bb6e0	[AMDGPU] replace hostcall module flag with function attribute The module flag to indicate use of hostcall is insufficient to catch all cases where hostcall might be in use by a kernel. This is now replaced by a function attribute that gets propagated to top-level kernel functions via their respective call-graph. If the attribute "amdgpu-no-hostcall-ptr" is absent on a kernel, the default behaviour is to emit kernel metadata indicating that the kernel uses the hostcall buffer pointer passed as an implicit argument. The attribute may be placed explicitly by the user, or inferred by the AMDGPU attributor by examining the call-graph. The attribute is inferred only if the function is not being sanitized, and the implictarg_ptr does not result in a load of any byte in the hostcall pointer argument. Reviewed By: jdoerfert, arsenm, kpyzhov Differential Revision: https://reviews.llvm.org/D119216	2022-02-11 22:51:56 +05:30
Krzysztof Drewniak	1ce314ce6b	[MLIR][GPU][lld] Use LLD bundled in ROCm, removing workaround Having clarified that executing the SerializeToHsaco pass can depend on a ROCm installation, switch from calling lld as a library to using the copy of lld guaranteed to be included in a ROCm install. This removes the workaround introduced in D119277 Reviewed By: whchung Differential Revision: https://reviews.llvm.org/D119463	2022-02-10 19:37:30 +00:00
Krzysztof Drewniak	c37b3e4108	[MLIR][GPU] Add now-required include to SerializeToHsaco Reviewed By: whchung Differential Revision: https://reviews.llvm.org/D119455	2022-02-10 18:36:38 +00:00
Alexandre Ganea	1e661e583d	[MLIR] Temporary workaround for calling the LLD ELF driver as-a-lib This fixes the situation described in https://github.com/llvm/llvm-project/issues/53475 with a repro exposed by https://github.com/ROCmSoftwarePlatform/D108850-lld-bug-reproduction This is purposely just a workaround to unblock users. This could be transplanted to the release/14.x branch if need be. A proper fix will later be provided in https://reviews.llvm.org/D119049. Differential Revision: https://reviews.llvm.org/D119277	2022-02-08 19:12:15 -05:00
River Riddle	ace01605e0	[mlir] Split out a new ControlFlow dialect from Standard This dialect is intended to model lower level/branch based control-flow constructs. The initial set of operations are: AssertOp, BranchOp, CondBranchOp, SwitchOp; all split out from the current standard dialect. See https://discourse.llvm.org/t/standard-dialect-the-final-chapter/6061 Differential Revision: https://reviews.llvm.org/D118966	2022-02-06 14:51:16 -08:00
River Riddle	dec8af701f	[mlir] Move SelectOp from Standard to Arithmetic This is part of splitting up the standard dialect. See https://llvm.discourse.group/t/standard-dialect-the-final-chapter/ for discussion. Differential Revision: https://reviews.llvm.org/D118648	2022-02-02 14:45:12 -08:00
Alexandre Ganea	dc3b9365b6	[mlir] Silence warnings when building with MSVC Differential Revision: https://reviews.llvm.org/D118536	2022-01-30 17:31:35 -05:00
Krzysztof Drewniak	e7d0dae76e	[MLIR][GPU] Add missing #include to SerializeToHsaco.cpp llvm/Support/Path.h was likely previously implicitly included, and a refactoring removed that inclusion, breaking the pass. Differential Revision: https://reviews.llvm.org/D118508	2022-01-28 22:56:38 +00:00
Alexandre Ganea	1cf9876661	[mlir] Fix build after `83d59e05b2` Differential Revision: https://reviews.llvm.org/D118510	2022-01-28 17:21:30 -05:00
River Riddle	6842ec42f6	[mlir][NFC] Add a using for llvm::SMLoc/llvm::SMRange to LLVM.h These are used pervasively during parsing. Differential Revision: https://reviews.llvm.org/D118291	2022-01-26 21:37:23 -08:00
River Riddle	65e7cd13bb	[mlir] Remove a bunch of unnecessary dialect dependencies A lot of dialects have dependencies that are unnecessary, either because of copy/paste of files when creating things or some other means. This commit cleans up a bunch of the simple ones: * Copy/Paste or missed during refactoring Most of the dependencies cleaned up here look like copy/paste errors when creating new dialects/transformations, or because the dependency wasn't removed during a refactoring (e.g. when splitting the standard dialect). * Unnecessary hard coding of constant operations in matchers There are a few instances where a dialect had a dependency because it was hardcoding checks for constant operations instead of using the better m_Constant approach. Differential Revision: https://reviews.llvm.org/D118062	2022-01-24 19:25:53 -08:00
River Riddle	a70aa7bb0d	[mlir:Transforms] Move out the remaining non-dialect independent transforms and utilities This has been a major TODO for a very long time, and is necessary for establishing a proper dialect-free dependency layering for the Transforms library. Code was moved to effectively two main locations: * Affine/ There was quite a bit of affine dialect related code in Transforms/ do to historical reasons (of a time way into MLIR's past). The following headers were moved to: Transforms/LoopFusionUtils.h -> Dialect/Affine/LoopFusionUtils.h Transforms/LoopUtils.h -> Dialect/Affine/LoopUtils.h Transforms/Utils.h -> Dialect/Affine/Utils.h The following transforms were also moved: AffineLoopFusion, AffinePipelineDataTransfer, LoopCoalescing * SCF/ Only one SCF pass was in Transforms/ (likely accidentally placed here): ParallelLoopCollapsing The SCF specific utilities in LoopUtils have been moved to SCF/Utils.h * Misc: mlir::moveLoopInvariantCode was also moved to LoopLikeInterface.h given that it is a simple utility defined in terms of LoopLikeOpInterface. Differential Revision: https://reviews.llvm.org/D117848	2022-01-24 19:25:53 -08:00
Krzysztof Drewniak	40aef79db0	[MLIR][GPU] Add debug output to enable dumping GPU assembly - Set the DEBUG_TYPE of SerializeToBlob to serialize-to-blob - Add debug output to print the assembly or PTX for GPU modules before they are assembled and linked Note that, as SerializeToBlob is a superclass of SerializeToCubin and SerializeToHsaco, --debug-only=serialize-to-blom will dump the intermediate compiler result for both of these passes. In addition, if LLVM options such as --stop-after are used to control the GPU kernel compilation process, the debug output will contain the appropriate intermediate IR. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D117519	2022-01-20 20:52:12 +00:00
River Riddle	e084679f96	[mlir] Make locations required when adding/creating block arguments BlockArguments gained the ability to have locations attached a while ago, but they have always been optional. This goes against the core tenant of MLIR where location information is a requirement, so this commit updates the API to require locations. Fixes #53279 Differential Revision: https://reviews.llvm.org/D117633	2022-01-19 17:35:35 -08:00
River Riddle	4157455425	[mlir][Pass] Deprecate FunctionPass in favor of OperationPass<FuncOp> The only benefit of FunctionPass is that it filters out function declarations. This isn't enough to justify carrying it around, as we can simplify filter out declarations when necessary within the pass. We can also explore with better scheduling primitives to filter out declarations at the pipeline level in the future. The definition of FunctionPass is left intact for now to allow time for downstream users to migrate. Differential Revision: https://reviews.llvm.org/D117182	2022-01-18 19:52:44 -08:00
Mogball	aae5125550	[mlir] Replace StrEnumAttr -> EnumAttr in core dialects Removes uses of `StrEnumAttr` in core dialects Reviewed By: mehdi_amini, rriddle Differential Revision: https://reviews.llvm.org/D117514	2022-01-18 17:15:00 +00:00
Duncan P. N. Exon Smith	b77d4d54f9	mlir: Avoid SmallVector::set_size in SerializeToHsacoPass::loadLibraries Spotted this in a final grep of projects I don't usually build before pushing https://reviews.llvm.org/D115380, which makes `SmallVector::set_size()` private. Update to `truncate()`, a new-ish variant of `resize()` that asserts the new size is not bigger and that avoids pulling in the allocation and initialization code for growing. Doesn't really look like the perf impact of that would matter here, but since `dirLength` is known to be a smaller size then we might as well. Differential Revision: https://reviews.llvm.org/D117073	2022-01-13 10:17:00 -08:00
Diego Caballero	e2b658cd5d	[mlir][GPU] Fix attribute name of DL specification D115722 added a DL spec to GPU modules. It happens that the DL default interface implementation is sensitive to the name of the DL spec attribute. This patch is fixing the name of the attribute to be the expected one. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D116956	2022-01-11 08:30:52 +00:00
Mehdi Amini	e4853be2f1	Apply clang-tidy fixes for performance-for-range-copy to MLIR (NFC)	2022-01-02 22:19:56 +00:00
Mehdi Amini	1fc096af1e	Apply clang-tidy fixes for performance-unnecessary-value-param to MLIR (NFC) Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D116250	2022-01-02 01:45:18 +00:00
Mehdi Amini	02b6fb218e	Fix clang-tidy issues in mlir/ (NFC) Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D115956	2021-12-20 20:25:01 +00:00
Diego Caballero	32fe1a8a25	[mlir][GPU] Extend GPU kernel outlining to generate DL specification This patch extends the GPU kernel outlining pass so that it can take in an optional data layout specification that will be attached to the GPU module operation generated. If the data layout specification is not provided the default data layout is used instead. Reviewed By: herhut, mehdi_amini Differential Revision: https://reviews.llvm.org/D115722	2021-12-16 11:35:53 +00:00
Krzysztof Drewniak	e1da62910e	[MLIR][GPU] Define gpu.printf op and its lowerings - Define a gpu.printf op, which can be lowered to any GPU printf() support (which is present in CUDA, HIP, and OpenCL). This op only supports constant format strings and scalar arguments - Define the lowering of gpu.pirntf to a call to printf() (which is what is required for AMD GPUs when using OpenCL) as well as to the hostcall interface present in the AMD Open Compute device library, which is the interface present when kernels are running under HIP. - Add a "runtime" enum that allows specifying which of the possible runtimes a ROCDL kernel will be executed under or that the runtime is unknown. This enum controls how gpu.printf is lowered This change does not enable lowering for Nvidia GPUs, but such a lowering should be possible in principle. And: [MLIR][AMDGPU] Always set amdgpu-implicitarg-num-bytes=56 on kernels This is something that Clang always sets on both OpenCL and HIP kernels, and failing to include it causes mysterious crashes with printf() support. In addition, revert the max-flat-work-group-size to (1, 256) to avoid triggering bugs in the AMDGPU backend. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D110448	2021-12-09 15:54:31 +00:00
Mehdi Amini	be0a7e9f27	Adjust "end namespace" comment in MLIR to match new agree'd coding style See D115115 and this mailing list discussion: https://lists.llvm.org/pipermail/llvm-dev/2021-December/154199.html Differential Revision: https://reviews.llvm.org/D115309	2021-12-08 06:05:26 +00:00
Krzysztof Drewniak	a6f53afbcb	[MLIR][GPU] Link in device libraries during HSA compilation if needed To perform some operations, such as sin() or printf(), code compiled for AMD GPUs must be linked to a series of device libraries. This commit adds support for linking in these libraries. However, since these device libraries are delivered as LLVM bitcode, raising the possibility of version incompatibilities, this commit only links in libraries when the functions from those libraries are called by the code being compiled. This code also sets the math flags to their most conservative values, as MLIR doesn't have a `-ffast-math` equivalent. Depends on D114114 Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D114117	2021-11-19 22:29:37 +00:00
rdzhabarov	d729f4c38f	[mlir] Bug fix. Stream must outlive the pass manager. Bug fix. Stream must outlive the pass manager. Reviewed By: Chia-hungDuan Differential Revision: https://reviews.llvm.org/D114277	2021-11-19 21:45:43 +00:00
Krzysztof Drewniak	20f79f8caa	[MLIR][GPU] Make the path to ROCm a runtime option Our current build assumes that the path to ROCm we find at build time will be the path at which ROCm is located when the built code is executed. This commit adds a --rocm-path option to SerializeToHsaco, and removes the HIP dependency that the SerializeToHsaco previously had. Depends on D114113 (though the dependency is to ensure the diffs apply cleanly and to capture the dependency on D114107) Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D114114	2021-11-19 20:51:54 +00:00
Krzysztof Drewniak	bd22554af0	[MLIR][GPU] Run generic LLVM optimizations when serializing (on AMD) - Adds hooks that allow SerializeTo* passes to arbitrarily transform the produced LLVM Module before it is passed to the code generation passes. - Uses these hooks within the SerializeToHsaco pass in order to run LLVM optimizations and to set the optimization level on the TargetMachine. - Adds an optLevel parameter to SerializeToHsaco Future work may include moving much of what's been added to SerializeToHsaco to SerializeToBlob, but that would require confirmation from the NVVM backend maintainers that it would be appropriate to do so. Depends on D114107 Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D114113	2021-11-19 19:21:24 +00:00
Krzysztof Drewniak	f849640a0c	[MLIR] Make the ROCM integration tests runnable - Move the #define s to the GPU Transform library from GPU Ops so that SerializeToHsaco is non-trivially compiled - Add required includes to SerializeToHsaco - Move MCSubtargetInfo creation to the correct point in the compilation process - Change mlir in ROCM tests to account for renamed/moved ops Differential Revision: https://reviews.llvm.org/D114184	2021-11-19 17:09:53 +00:00
Krzysztof Drewniak	fb1a06aa13	[MLIR][GPU] Add target arguments to SerializeToHsaco Compiling code for AMD GPUs requires knowledge of which chipset is being targeted, especially if the code uses chipset-specific intrinsics (which is the case in a downstream convolution generator). This commit adds `target`, `chipset` and `features` arguments to the SerializeToHsaco constructor to enable passing in this required information. It also amends the ROCm integration tests to pass in the target chipset, which is set to the chipset of the first GPU on the system executing the tests. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D114107	2021-11-18 16:28:44 +00:00
Vladislav Vinogradov	e41ebbecf9	[mlir][RFC] Refactor layout representation in MemRefType The change is based on the proposal from the following discussion: https://llvm.discourse.group/t/rfc-memreftype-affine-maps-list-vs-single-item/3968 * Introduce `MemRefLayoutAttr` interface to get `AffineMap` from an `Attribute` (`AffineMapAttr` implements this interface). * Store layout as a single generic `MemRefLayoutAttr`. This change removes the affine map composition feature and related API. Actually, while the `MemRefType` itself supported it, almost none of the upstream can work with more than 1 affine map in `MemRefType`. The introduced `MemRefLayoutAttr` allows to re-implement this feature in a more stable way - via separate attribute class. Also the interface allows to use different layout representations rather than affine maps. For example, the described "stride + offset" form, which is currently supported in ASM parser only, can now be expressed as separate attribute. Reviewed By: ftynse, bondhugula Differential Revision: https://reviews.llvm.org/D111553	2021-10-19 12:31:15 +03:00
Mogball	a54f4eae0e	[MLIR] Replace std ops with arith dialect ops Precursor: https://reviews.llvm.org/D110200 Removed redundant ops from the standard dialect that were moved to the `arith` or `math` dialects. Renamed all instances of operations in the codebase and in tests. Reviewed By: rriddle, jpienaar Differential Revision: https://reviews.llvm.org/D110797	2021-10-13 03:07:03 +00:00
Reid Kleckner	89b57061f7	Move TargetRegistry.(h\|cpp) from Support to MC This moves the registry higher in the LLVM library dependency stack. Every client of the target registry needs to link against MC anyway to actually use the target, so we might as well move this out of Support. This allows us to ensure that Support doesn't have includes from MC/*. Differential Revision: https://reviews.llvm.org/D111454	2021-10-08 14:51:48 -07:00
Uday Bondhugula	08b63db8bb	[MLIR][GPU] Add GPU launch op support for dynamic shared memory Add support for dynamic shared memory for GPU launch ops: add an optional operand to gpu.launch and gpu.launch_func ops to specify the amount of "dynamic" shared memory to use. Update lowerings to connect this operand to the GPU runtime. Differential Revision: https://reviews.llvm.org/D110800	2021-10-01 16:46:07 +05:30

1 2 3 4

157 Commits