clang-p2996

Author	SHA1	Message	Date
Billy Zhu	6f6336858e	[MLIR][LLVM] Add DebugNameTableKind to DICompileUnit (#87974 ) Add the DebugNameTableKind field to DICompileUnit, along with its importer & exporter.	2024-04-09 06:18:07 -07:00
Christian Ulmann	541962306d	[MLIR][LLVM] Remove bitcast pattern from type consistency pass (#87755 ) This commit removes the no longer required bitcast inserting pattern in LLVM dialect's type consistency pattern. This was previously required to enable Mem2Reg and SROA to promote accesses that had different types. Recent changes to both passes added direct support for this feature to them, so the pattern has no further use.	2024-04-05 15:47:16 +02:00
Christian Ulmann	ef8322f41d	[MLIR][LLVM] Improve bit- and addrspacecast folders (#87745 ) This commit extends the folders of chainable casts (bitcast and addrspacecast) to ensure that they fold a chain of the same casts into a single cast. Additionally cleans up the canonicalization test file, as this used some outdated constructs.	2024-04-05 09:14:13 +02:00
Christian Ulmann	974f1ee58d	[MLIR][LLVM][Mem2Reg] Relax type equality requirement for load and store (#87637 ) This commit relaxes Mem2Reg's type equality requirement for the LLVM dialect's load and store operations. For now, we only allow loads to be promoted if the reaching definition can be casted into a value of the target type. For stores, the same conversion casting check is applied and we ensure that their result is properly casted to the type of the memory slot. This is necessary to satisfy assumptions of the general mem2reg pass, as it creates block arguments with the types of the memory slot. This relands https://github.com/llvm/llvm-project/pull/87504	2024-04-05 08:25:36 +02:00
Fabian Mora	220cdf940e	[mlir] Add `requiresReplacedValues` and `visitReplacedValues` to `PromotableOpInterface` (#86792 ) Add `requiresReplacedValues` and `visitReplacedValues` methods to `PromotableOpInterface`. These methods allow `PromotableOpInterface` ops to transforms definitions mutated by a `store`. This change is necessary to correctly handle the promotion of `LLVM_DbgDeclareOp`. --------- Co-authored-by: Théo Degioanni <30992420+Moxinilian@users.noreply.github.com>	2024-04-04 13:34:46 -04:00
Christian Ulmann	e0e615efac	Revert "[MLIR][LLVM][Mem2Reg] Relax type equality requirement for load and store (#87504 )" (#87631 ) This reverts commit `d6e4582198` as it violates an assumption of Mem2Reg's block argument creation. Mem2Reg strongly assumes that all involved values have the same type as the alloca, which was relaxed by this PR. Therefore, branches got created that jumped to basic blocks with differently typed block arguments.	2024-04-04 15:07:18 +02:00
Christian Ulmann	d6e4582198	[MLIR][LLVM][Mem2Reg] Relax type equality requirement for load and store (#87504 ) This commit relaxes Mem2Reg's type equality requirement for the LLVM dialect's load and store operations. For now, we only allow loads to be promoted if the reaching definition can be casted into a value of the target type. For stores, all type checks are removed, as a non-volatile store that does not write out the alloca's pointer can always be deleted.	2024-04-04 09:34:37 +02:00
Christian Ulmann	63d22f7a5b	[MLIR][LLVM][SROA] Make GEP handling type agnostic (#86950 ) This commit removes SROA's type consistency constraints from LLVM dialect's GEPOp. The checks for valid indexing are now purely done by computing the GEP's offset with the aid of the data layout. To simplify handling of "nested subslots", we are tricking the SROA by handing in memory slots that hold byte array types. This ensures that subsequent accesses only need to check if their access will be in-bounds. This lifts the requirement of determining the sub-types for all but the first level of subslots.	2024-04-02 14:35:55 +02:00
Jakub Kuderski	971b852546	[mlir][NFC] Simplify type checks with isa predicates (#87183 ) For more context on isa predicates, see: https://github.com/llvm/llvm-project/pull/83753.	2024-04-01 11:40:09 -04:00
Justin Fargnoli	35d55f2894	[NFC][mlir] Reorder `declarePromisedInterface()` operands (#86628 ) Reorder the template operands of `declarePromisedInterface()` to match `declarePromisedInterfaces()`.	2024-03-27 10:30:17 -07:00
Christian Ulmann	cb300c3305	[MLIR][LLVM][SROA] Fix pointer escape through stores bug (#86291 ) This commit resolves a SROA bug caused by not properly checking if a llvm store operation writes the pointer to memory or not. Now, we do no longer consider stores that use a slot pointer as a value to store as fixable.	2024-03-22 16:44:06 +01:00
Christian Ulmann	0289ae51aa	[MLIR][LLVM][SROA] Support incorrectly typed memory accesses (#85813 ) This commit relaxes the assumption of type consistency for LLVM dialect load and store operations in SROA. Instead, there is now a check that loads and stores are in the bounds specified by the sub-slot they access. This commit additionally removes the corresponding patterns from the type consistency pass, as they are no longer necessary. Note: It will be necessary to extend Mem2Reg with the logic for differently sized accesses as well. This is non-the-less a strict upgrade for productive flows, as the type consistency pass can produce invalid IR for some odd cases.	2024-03-22 08:31:17 +01:00
Tobias Gysi	adda597388	[MLIR] Add index bitwidth to the DataLayout (#85927 ) When importing from LLVM IR the data layout of all pointer types contains an index bitwidth that should be used for index computations. This revision adds a getter to the DataLayout that provides access to the already stored bitwidth. The function returns an optional since only pointer-like types have an index bitwidth. Querying the bitwidth of a non-pointer type returns std::nullopt. The new function works for the built-in Index type and, using a type interface, for the LLVMPointerType.	2024-03-21 09:07:57 +01:00
Christian Ulmann	98c6bc531d	[MLIR][SROA][Mem2Reg] Add data layout to interface methods (#85644 ) This commit expends the Mem2Reg and SROA interface methods with passed in handles to a `DataLayout` structure. This is done to avoid superfluous retreiving of data layouts during each conversion of intrinsics. This change, additionally, enables subsequent changes to make the LLVM dialect implementation of these interfaces type agnostic.	2024-03-20 14:21:53 +01:00
Christian Ulmann	252e2551ea	[MLIR][LLVM][SROA] Avoid splitting dynamically indexed allocas (#85758 ) This commit ensures that SROA does no longer attempt to split allocas that are indexed into dynamically. Dynamic indices into arrays are allowed to be negative or out-of-bounds, when the alloca containing the array has memory backing these produced indices.	2024-03-19 17:18:28 +01:00
Guray Ozen	8819f87998	[MLIR][NVVM] Add barrier.arrive (#85412 ) PR adds `nvvm.barrier.arrive` Op. It is useful op for producer consumer modeling.	2024-03-19 16:51:32 +01:00
Billy Zhu	1e8dad3bef	[MLIR][LLVM] Support Recursive DITypes (#80251 ) Following the discussion from [this thread](https://discourse.llvm.org/t/handling-cyclic-dependencies-in-debug-info/67526/11), this PR adds support for recursive DITypes. This PR adds: 1. DIRecursiveTypeAttrInterface: An interface that DITypeAttrs can implement to indicate that it supports recursion. See full description in code. 2. Importer & exporter support (The only DITypeAttr that implements the interface is DICompositeTypeAttr, so the exporter is only implemented for composites too. There will be two methods that each llvm DI type that supports mutation needs to implement since there's nothing general). --------- Co-authored-by: Tobias Gysi <tobias.gysi@nextsilicon.com>	2024-03-15 09:58:25 -07:00
Matthias Springer	91d5653e3a	[mlir] Use `OpBuilder::createBlock` in op builders and patterns (#82770 ) When creating a new block in (conversion) rewrite patterns, `OpBuilder::createBlock` must be used. Otherwise, no `notifyBlockInserted` notification is sent to the listener. Note: The dialect conversion relies on listener notifications to keep track of IR modifications. Creating blocks without the builder API can lead to memory leaks during rollback.	2024-02-24 09:10:07 +01:00
Mehdi Amini	45c226d452	[MLIR] Add ODS support for generating helpers for dialect (discardable) attributes (#77024 ) This is a new ODS feature that allows dialects to define a list of key/value pair representing an attribute type and a name. This will generate helper classes on the dialect to be able to manage discardable attributes on operations in a type safe way. For example the `test` dialect can define: ``` let discardableAttrs = (ins "mlir::IntegerAttr":$discardable_attr_key, ); ``` And the following will be generated in the TestDialect class: ``` /// Helper to manage the discardable attribute `discardable_attr_key`. class DiscardableAttrKeyAttrHelper { ::mlir::StringAttr name; public: static constexpr ::llvm::StringLiteral getNameStr() { return "test.discardable_attr_key"; } constexpr ::mlir::StringAttr getName() { return name; } DiscardableAttrKeyAttrHelper(::mlir::MLIRContext ctx) : name(::mlir::StringAttr::get(ctx, getNameStr())) {} mlir::IntegerAttr getAttr(::mlir::Operation op) { return op->getAttrOfType<mlir::IntegerAttr>(name); } void setAttr(::mlir::Operation op, mlir::IntegerAttr val) { op->setAttr(name, val); } bool isAttrPresent(::mlir::Operation op) { return op->hasAttrOfType<mlir::IntegerAttr>(name); } void removeAttr(::mlir::Operation *op) { assert(op->hasAttrOfType<mlir::IntegerAttr>(name)); op->removeAttr(name); } }; DiscardableAttrKeyAttrHelper getDiscardableAttrKeyAttrHelper() { return discardableAttrKeyAttrName; } ``` User code having an instance of the TestDialect can then manipulate this attribute on operation using: ``` auto helper = testDialect.getDiscardableAttrKeyAttrHelper(); helper.setAttr(op, value); helper.isAttrPresent(op); ... ```	2024-02-19 23:30:03 -08:00
Guray Ozen	b5d694ba14	[mlir][nvvm] Introduce `nvvm.barrier` OP (#81487 ) This PR that introduces the `nvvm.barrier` OP to the NVVM dialect. Currently, NVVM only supports the `nvvm.barrier0`, which synchronizes all threads using barrier resource 0. The new `nvvm.barrier` has two essential arguments: the barrier resource and the number of threads. This added flexibility allows for selective synchronization of threads within a CTA, aligning with the capabilities provided by LLVM intrinsics or the PTX model. I think we can deprecate `nvvm.barrier0` in favor of the more generic `nvvm.barrier`. ``` // Equivalent to nvvm.barrier0 (or __syncthreads() in CUDA) nvvm.barrier // Synchronize all threads using the 3rd barrier resource. nvvm.barrier id = 3 // Synchronize %numberOfThreads threads using the 3rd barrier resource. nvvm.barrier id = 3 number_of_threads = %numberOfThreads ```	2024-02-14 08:28:45 +01:00
Rishi Surendran	fa6850a998	[mlir][nvvm]Add support for grid_constant attribute on LLVM function arguments (#78228 ) Add support for attribute nvvm.grid_constant on LLVM function arguments. The attribute can be attached only to arguments of type llvm.ptr that have llvm.byval attribute. Generate LLVM metadata for functions with nvvm.grid_constant arguments. The metadata node is a list of integers, where each integer n denotes that the nth parameter has the grid_constant annotation (numbering from 1). The generated metadata node will be handled by NVVM compiler. See https://docs.nvidia.com/cuda/nvvm-ir-spec/index.html#supported-properties for documentation on grid_constant property. This patch also adds convertParameterAttr to LLVMTranslationDialectInterface for supporting the translation of derived dialect attributes on function parameters	2024-02-12 13:16:59 -08:00
Kolya Panchenko	9f6c00565a	[MLIR][VCIX] Support VCIX intrinsics in LLVMIR dialect (#75875 ) The changeset extends LLVMIR intrinsics with VCIX intrinsics. The VCIX intrinsics allow MLIR users to interact with RISC-V co-processors that are compatible with `XSfvcp` extension Source: https://www.sifive.com/document-file/sifive-vector-coprocessor-interface-vcix-software	2024-02-07 15:23:28 -05:00
Andrei Golubev	89cd345667	[mlir][LLVM] Use int32_t to indirectly construct GEPArg (#79562 ) GEPArg can only be constructed from int32_t and mlir::Value. Explicitly cast other types (e.g. unsigned, size_t) to int32_t to avoid narrowing conversion warnings on MSVC. Some recent examples of such are: ``` mlir\lib\Dialect\LLVMIR\Transforms\TypeConsistency.cpp: error C2398: Element '1': conversion from 'size_t' to 'T' requires a narrowing conversion with [ T=mlir::LLVM::GEPArg ] mlir\lib\Dialect\LLVMIR\Transforms\TypeConsistency.cpp: error C2398: Element '1': conversion from 'unsigned int' to 'T' requires a narrowing conversion with [ T=mlir::LLVM::GEPArg ] ``` Co-authored-by: Nikita Kudriavtsev <nikita.kudriavtsev@intel.com>	2024-01-26 14:27:51 +01:00
Mehdi Amini	acf2f24ac3	Apply clang-tidy fixes for llvm-else-after-return in LLVMDialect.cpp (NFC)	2024-01-22 17:34:56 -08:00
Guray Ozen	12c241b365	[MLIR][NVVM] Explicit Data Type for Output in `wgmma.mma_async` (#78713 ) The current implementation of `nvvm.wgmma.mma_async` Op deduces the data type of the output matrix from the data type of struct member, which can be non-intuitive, especially in cases where types like `2xf16` are packed into `i32`. This PR addresses this issue by improving the Op to include an explicit data type for the output matrix. The modified Op now includes an explicit data type for Matrix-D (<f16>), and looks as follows: ``` %result = llvm.mlir.undef : !llvm.struct<(struct<(i32, i32, ... nvvm.wgmma.mma_async %descA, %descB, %result, #nvvm.shape<m = 64, n = 32, k = 16>, D [<f16>, #nvvm.wgmma_scale_out<zero>], A [<f16>, #nvvm.wgmma_scale_in<neg>, <col>], B [<f16>, #nvvm.wgmma_scale_in<neg>, <col>] ```	2024-01-22 08:37:20 +01:00
Matthias Springer	5fcf907b34	[mlir][IR] Rename "update root" to "modify op" in rewriter API (#78260 ) This commit renames 4 pattern rewriter API functions: * `updateRootInPlace` -> `modifyOpInPlace` * `startRootUpdate` -> `startOpModification` * `finalizeRootUpdate` -> `finalizeOpModification` * `cancelRootUpdate` -> `cancelOpModification` The term "root" is a misnomer. The root is the op that a rewrite pattern matches against (https://mlir.llvm.org/docs/PatternRewriter/#root-operation-name-optional). A rewriter must be notified of all in-place op modifications, not just in-place modifications of the root (https://mlir.llvm.org/docs/PatternRewriter/#pattern-rewriter). The old function names were confusing and have contributed to various broken rewrite patterns. Note: The new function names use the term "modify" instead of "update" for consistency with the `RewriterBase::Listener` terminology (`notifyOperationModified`).	2024-01-17 11:08:59 +01:00
Christian Ulmann	fa5255eee2	[MLIR][LLVM] Enable export of DISubprograms on function declarations (#78026 ) This commit changes the MLIR to LLVMIR export to also attach subprogram debug attachements to function declarations. This commit additonally fixes the two passes that produce subprograms to not attach the "Definition" flag to function declarations. This otherwise results in invalid LLVM IR.	2024-01-15 07:34:13 +01:00
Billy Zhu	422b84a771	[MLIR][LLVM] DI Expression Rewrite & Legalization (#77541 ) Add a rewriter for DIExpressions & use it to run legalization patterns before exporting to llvm (because LLVM dialect allows DI Expressions that may not be valid in LLVM IR). The rewriter driver works similarly to the existing mlir rewriter drivers, except it operates on lists of DIExpressionElemAttr (i.e. DIExpressionAttr). Each rewrite pattern transforms a range of DIExpressionElemAttr into a new list of DIExpressionElemAttr. In addition, this PR sets up a place to add legalization patterns that are broadly applicable internally to the LLVM dialect, and they will always be applied prior to export. This PR adds one pattern for merging fragment operators. --------- Co-authored-by: Tobias Gysi <tobias.gysi@nextsilicon.com>	2024-01-10 16:10:06 -08:00
Guray Ozen	2aec7083ad	[mlir][gpu] Use DenseI32Array for NVVM's maxntid and reqntid (NFC) (#77466 )	2024-01-09 16:44:25 +01:00
Tobias Gysi	7e54ae24d8	[mlir][llvm] Do not inline variadic functions (#77241 ) This revision updates the llvm dialect inliner to explicitly disallow the inlining of variadic functions. Already previously the inlining failed if the number of function arguments did not match the number of call arguments. After the change, inlining checks the function is not variadic and it does not contain a va_start intrinsic.	2024-01-08 08:30:10 +01:00
Christian Ulmann	bae1fdea71	[MLIR][LLVM] Add distinct identifier to the DISubprogram attribute (#77093 ) This commit adds an optional distinct attribute parameter to the DISubprogramAttr. This enables modeling of distinct subprograms, as required for LLVM IR. This change is required to avoid accidential uniquing of subprograms on functions that would lead to invalid LLVM IR post export.	2024-01-08 08:25:30 +01:00
Christian Ulmann	b3037ae1fc	[MLIR][LLVM] Add distinct identifier to DICompileUnit attribute (#77070 ) This commit adds a distinct attribute parameter to the DICompileUnit to enable the modeling of distinctness. LLVM requires DICompileUnits to be distinct and there are cases where one gets two equivalent compilation units but LLVM still requires differentiates them. We observed such cases for combinations of LTO and inline functions. This patch also changes the DIScopeForLLVMFuncOp pass to a module pass, to ensure that only one distinct DICompileUnit is created, instead of one for each function.	2024-01-08 07:42:33 +01:00
Krzysztof Drewniak	2aff7f3919	[mlir][LLVM] Add !invariant.load metadata support to llvm.load (#76754 ) Add support for !invariant.load metadata (by way of a unit attribute) to the MLIR representation of llvm.load.	2024-01-04 09:33:09 -06:00
gitoleg	8cf6bcf5a3	[mlir][llvm] Add assert in CallOp builder (#76240 ) This commit adds an assert in one of the CallOp builders to ensure it is not use to create an indirect call. Otherwise, the callee type would include the callee pointer type which is handed in as first argument.	2023-12-27 17:08:35 +01:00
Adam Paszke	85b2327192	[mlir][nvvm] Fix the PTX lowering of wgmma.mma_async (#76150 )	2023-12-22 14:46:34 +01:00
Tobias Gysi	9971b9ab19	[mlir][llvm] Improve alloca handling during inlining (#75961 ) This revision changes the alloca handling in the LLVM inliner. It ensures that alloca operations, even those nested within a region operation, can be relocated to the entry block of the function, or the closest ancestor region that is marked with either the isolated from above or automatic allocation scope trait. While the LLVM dialect does not have any region operations, the inlining interface may be used on IR that mixes different dialects.	2023-12-21 08:11:17 +01:00
Tobias Gysi	25d942403c	[mlir][llvm] Add invariant intrinsics (#75354 ) This commit implements the LLVM IR invariant intrinsics in LLVM dialect. These intrinsics can be used to mark a program regions in which the contents of a specific memory object will not change. The LLVM dialect implementation also implements the PromotableOpInterface to ensure Mem2Reg & SROA are able to promote pointers that are marked using the invariant intrinsics.	2023-12-14 14:58:45 +01:00
Kazu Hirata	88d319a29f	[mlir] Use StringRef::{starts,ends}_with (NFC) This patch replaces uses of StringRef::{starts,ends}with with StringRef::{starts,ends}_with for consistency with std::{string,string_view}::{starts,ends}_with in C++20. I'm planning to deprecate and eventually remove StringRef::{starts,ends}with.	2023-12-13 22:58:30 -08:00
Johannes de Fine Licht	ed5813c4aa	[MLIR][LLVM] Remove disallowlist from LLVM inliner (#75303 ) The disallowlist was used as a migration strategy while support was extended to more side effecting operations. We now (to the best of our knowledge) support all side effecting operations, so never fail `isLegalToInline` on any LLVM operation. There is no test included, because that's exactly the reason for this change: there are no more unsupported operations in inlining; the existing tests for unsupported inlines have already been burninated.	2023-12-13 10:31:27 +01:00
Rik Huijzer	3764f5e816	[mlir][llvm] Fix negative GEP crash in type consistency (#74859 ) Fixes https://github.com/llvm/llvm-project/issues/74453. The `gepToByteOffset` was implicitly casting an signed integer to an unsigned integer even though negative dimensions are valid for `llvm.getelementptr`. --------- Co-authored-by: Tobias Gysi <tobias.gysi@nextsilicon.com>	2023-12-11 12:29:53 +01:00
Tom Eccles	e9e1c411b6	[mlir][LLVM] Add nsw and nuw flags (#74508 ) The implementation of these are modeled after the existing fastmath flags for floating point arithmetic.	2023-12-07 10:35:00 +00:00
Billy Zhu	12e5148f9c	[MLIR][LLVM] Fix CallOp asm parser for attr-dict (#74372 ) Currently the parser & printer of `CallOp` do not match when both varargs and attr-dict are present (round tripping is broken). This fixes the parser so that it conforms to the written asm format in the comments.	2023-12-05 21:18:52 +01:00
Rik Huijzer	13da9a58c5	[mlir][llvm] Fix verifier for const int and dense (#74340 ) Continuation of https://github.com/llvm/llvm-project/pull/74247 to fix https://github.com/llvm/llvm-project/issues/56962. Fixes verifier for (Integer Attr): ```mlir llvm.mlir.constant(1 : index) : f32 ``` and (Dense Attr): ```mlir llvm.mlir.constant(dense<100.0> : vector<1xf64>) : f32 ``` ## Integer Attr The addition that this PR makes to `LLVM::ConstantOp::verify` is meant to be exactly verifying the code in `mlir::LLVM::detail::getLLVMConstant`: `9f78edbd20/mlir/lib/Target/LLVMIR/ModuleTranslation.cpp (L350-L353)` One failure mode is when the `type` (`llvm.mlir.constant(<value>) : <type>`) is not an `Integer`, because then the `cast` in `getIntegerBitWidth` will crash: `dca432cb7b/llvm/include/llvm/IR/DerivedTypes.h (L97-L99)` So that's now caught in the verifier. Apart from that, I don't see anything we could check for. `sextOrTrunc` means "Sign extend or truncate to width" and that one is quite permissive. For example, the following doesn't have to be caught in the verifier as it doesn't crash during `mlir-translate -mlir-to-llvmir`: ```mlir llvm.func @main() -> f32 { %cst = llvm.mlir.constant(100 : i64) : f32 llvm.return %cst : f32 } ``` ## Dense Attr Crash if not either a MLIR Vector type or one of these: `9f78edbd20/mlir/lib/Target/LLVMIR/ModuleTranslation.cpp (L375-L391)`	2023-12-05 12:31:49 +01:00
Benjamin Maxwell	17de468df1	[mlir][llvm] Add llvm.target_features features attribute (#71510 ) This patch adds a target_features (TargetFeaturesAttr) to the LLVM dialect to allow setting and querying the features in use on a function. The motivation for this comes from the Arm SME dialect where we would like a convenient way to check what variants of an operation are available based on the CPU features. Intended usage: The target_features attribute is populated manually or by a pass: ```mlir func.func @example() attributes { target_features = #llvm.target_features<["+sme", "+sve", "+sme-f64f64"]> } { // ... } ``` Then within a later rewrite the attribute can be checked, and used to make lowering decisions. ```c++ // Finds the "target_features" attribute on the parent // FunctionOpInterface. auto targetFeatures = LLVM::TargetFeaturesAttr::featuresAt(op); // Check a feature. // Returns false if targetFeatures is null or the feature is not in // the list. if (!targetFeatures.contains("+sme-f64f64")) return failure(); ``` For now, this is rather simple just checks if the exact feature is in the list, though it could be possible to extend with implied features using information from LLVM.	2023-12-05 11:29:31 +00:00
Tobias Gysi	7858071524	[mlir][llvm] Fix attribute printer warning (NFC)(#74351 ) This commit fixes a compilation warning caused by the printExpressionArg function that previously returned LogicalResult instead of void. The warning has been introduced by #73367.	2023-12-04 19:10:56 +01:00
Guray Ozen	80ff67be81	[mlir][nvvm] Introduce `nvvm.fence.proxy` (#74057 ) This PR introduce `nvvm.fence.proxy` OP for the following cases: ``` nvvm.fence.proxy { kind = #nvvm.proxy_kind<alias>} nvvm.fence.proxy { kind = #nvvm.proxy_kind<async>} nvvm.fence.proxy { kind = #nvvm.proxy_kind<async.global>} nvvm.fence.proxy { kind = #nvvm.proxy_kind<async.shared>, space = #nvvm.shared_space<cta>} nvvm.fence.proxy { kind = #nvvm.proxy_kind<async.shared>, space = #nvvm.shared_space<cluster>} ```	2023-12-04 16:49:07 +01:00
Justin Wilson	6da578cec1	[mlir] Add support for DIGlobalVariable and DIGlobalVariableExpression (#73367 ) This PR introduces DIGlobalVariableAttr and DIGlobalVariableExpressionAttr so that ModuleTranslation can emit the required metadata needed for debug information about global variable. The translator implementation for debug metadata needed to be refactored in order to allow translation of nodes based on MDNode (DIGlobalVariableExpressionAttr and DIExpression) in addition to DINode-based nodes. A DIGlobalVariableExpressionAttr can now be passed to the GlobalOp operation directly and ModuleTranslation will create the respective DIGlobalVariable and DIGlobalVariableExpression nodes. The compile unit that DIGlobalVariable is expected to be configured with will be updated with the created DIGlobalVariableExpression.	2023-12-04 15:52:02 +01:00
Rik Huijzer	67f9cd4670	[mlir][llvm] Fix verifier for const float (#74247 ) Fixes one of the cases of https://github.com/llvm/llvm-project/issues/56962. This PR basically moves some code from `mlir::LLVM::detail::getLLVMConstant` ([source](`9f78edbd20/mlir/lib/Target/LLVMIR/ModuleTranslation.cpp (L354-L371)`)) over to the verifier of `LLVM::ConstantOp`. For now, I focused just on the case where the attribute is a float and ignored the integer case of https://github.com/llvm/llvm-project/issues/56962. Note that without this patch, both added tests will crash inside `getLLVMConstant` during `mlir-translate -mlir-to-llvmir`.	2023-12-04 08:06:02 +01:00
Guray Ozen	68433f6b27	[mlir][nvvm] Introduce `setmaxregister.sync.aligned` Op (#73780 ) This PR introduce `setmaxregister.sync.aligned` Op to increase or decrease the register size. https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#miscellaneous-instructions-setmaxnreg	2023-11-29 15:26:30 +01:00
Guray Ozen	9ceea08859	[mlir] `im2col` & `l2cache` on cp.async.bulk.tensor.shared.cluster.global` (#72967 ) PR adds support of `im2col` and `l2cache` to `cp.async.bulk.tensor.shared.cluster.global`. The Op is now supports all the traits of the corresponding PTX instruction. The current structure of this operation looks somewhat like below. The PR also simplifies types so we don't need to write obvious types after `:` anymore. ``` nvvm.cp.async.bulk.tensor.shared.cluster.global %dest, %tmaDescriptor, %barrier, box[%crd0,%crd1,%crd2,%crd3,%crd4] im2col[%off0,%off1,%off2] <-- PR introduces multicast_mask = %ctamask l2_cache_hint = %cacheHint <-- PR introduces : !llvm.ptr<3>, !llvm.ptr ```	2023-11-22 16:08:09 +01:00

1 2 3 4 5 ...

619 Commits