clang-p2996

Author	SHA1	Message	Date
Ivan Butygin	f54cdc5d6e	[mlir] IntegerRangeAnalysis: add support for vector type (#112292 ) Treat integer range for vector type as union of ranges of individual elements. With this semantics, most arith ops on vectors will work out of the box, the only special handling needed for constants and vector elements manipulation ops. The end goal of these changes is to be able to optimize vectorized index calculations.	2024-11-01 23:58:16 +03:00
Henrich Lauko	cd340a4957	[mlir][Ptr] Fix license url typo (#114555 )	2024-11-01 20:10:04 +01:00
Razvan Lupusoru	c0a1597029	[mlir][acc] Consistency between acc.loop and acc compute ops (#114549 ) - GangPrivate and GangFirstPrivate renamed to just Private and Firstprivate respectively. This is makes compute ops consistent with the loop op (and also with the acc spec wording for the clause). - Added getBody to all compute ops - Verifier for firstprivate ops / recipes is enabled	2024-11-01 10:53:51 -07:00
c8ef	b57b3f6425	[NFC] Simple typo correction. (#114548 )	2024-11-02 00:40:57 +08:00
Manupa Karunaratne	a6e72f9392	[MLIR][Vector] Add Lowering for vector.step (#113655 ) Currently, the lowering for vector.step lives under a folder. This is not ideal if we want to do transformation on it and defer the materizaliztion of the constants much later. This commits adds a rewrite pattern that could be used by using `transform.structured.vectorize_children_and_apply_patterns` transform dialect operation. Moreover, the rewriter of vector.step is also now used in -convert-vector-to-llvm pass where it handles scalable and non-scalable types as LLVM expects it. As a consequence of removing the vector.step lowering as its folder, linalg vectorization will keep vector.step intact.	2024-11-01 16:38:36 +00:00
Ian Wood	d97bc388fd	Reapply "Extend getBackwardSlice to track values captured… (#114452 ) This commit fixes the failure in the original PR when building with shared libs. The problem is that `visitUsedValuesDefinedAbove` is defined in `MLIRTransformUtils`, but that lib depends on this lib (`MLIRAnalysis`). To fix, I dropped the use of `visitUsedValuesDefinedAbove` and use `Region::walk` to traverse values defined above. Reapplies PR https://github.com/llvm/llvm-project/pull/113478 Reverts PR https://github.com/llvm/llvm-project/pull/114432 This reverts commit `a9a8351`.	2024-11-01 08:42:12 -07:00
Wang Qiang	b77e40265c	[llvm][NFC] Fix typos: replace “avaliable” with “available” across various files (#114524 ) This pull request corrects multiple occurrences of the typo "avaliable" to "available" across the LLVM and Clang codebase. These changes improve the clarity and accuracy of comments and documentation. Specific modifications are in the following files: 1. clang-tools-extra/clang-tidy/readability/FunctionCognitiveComplexityCheck.cpp: Updated comments in readability checks for cognitive complexity. 2. llvm/include/llvm/ExecutionEngine/Orc/ExecutionUtils.h: Corrected documentation for JITDylib responsibilities. 3. llvm/include/llvm/Target/TargetMacroFusion.td: Fixed descriptions for FusionPredicate variables. 4. llvm/lib/CodeGen/SafeStack.cpp: Improved comments on DominatorTree availability. 5. llvm/lib/Target/RISCV/RISCVSchedSiFive7.td: Enhanced resource usage descriptions for vector units. 6. llvm/lib/Transforms/Scalar/LoopIdiomRecognize.cpp: Updated invariant description in shift-detect idiom logic. 7. llvm/test/MC/ARM/mve-fp-registers.s: Amended ARM MVE register availability notes. 8. mlir/lib/Bytecode/Reader/BytecodeReader.cpp: Adjusted forward reference descriptions for bytecode reader operations. These changes have no impact on code functionality, focusing solely on documentation clarity. Co-authored-by: wangqiang <wangqiang1@kylinos.cn>	2024-11-01 13:25:04 +00:00
Luke Hutton	36878b5542	[TOSA] Remove i64 from valid element datatypes in validation (#113380 ) Align the validation pass valid element datatypes check more closely to the specification by removing i64 as a supported datatype. The spec does not currently support it. Signed-off-by: Luke Hutton <luke.hutton@arm.com>	2024-11-01 10:12:43 +00:00
Peter Hawkins	51a4f319f0	[mlir:python] Avoid calls to get_op_result_or_results in generated value wrappers (#114491 ) If we know the output arity at tablegen time, we can often just call .result or .results directly. This saves almost 1s in a JAX-based LLM benchmark building a mixture of upstream dialects and StableHLO.	2024-11-01 00:54:20 +01:00
Jakub Kuderski	a8b4cb185c	[mlir] Remove debug prints from test pattern	2024-10-31 17:06:02 -04:00
Rolf Morel	5c1752e368	[MLIR][DLTI] Pretty parsing and printing for DLTI attrs (#113365 ) Unifies parsing and printing for DLTI attributes. Introduces a format of `#dlti.attr<key1 = val1, ..., keyN = valN>` syntax for all queryable DLTI attributes similar to that of the DictionaryAttr, while retaining support for specifying key-value pairs with `#dlti.dl_entry` (whether to retain this is TBD). As the new format does away with most of the boilerplate, it is much easier to parse for humans. This makes an especially big difference for nested attributes. Updates the DLTI-using tests and includes fixes for misc error checking/ error messages.	2024-10-31 19:18:24 +00:00
Mehdi Amini	a9a8351ef1	Revert "Extend `getBackwardSlice` to track values captured from above" (#114432 ) Reverts llvm/llvm-project#113478 Bot is broken when building with shared libs.	2024-10-31 18:29:05 +01:00
Sergio Afonso	6c28530ed0	[Flang][OpenMP] Properly bind arguments of composite operations (#113682 ) When composite constructs are lowered, clauses for each leaf construct are lowered before creating the set of loop wrapper operations, using these outside values to populate their operand lists. Then, when the loop nest associated to that composite construct is lowered, the binding of Fortran symbols to the entry block arguments defined by these loop wrappers is performed, resulting in the creation of `hlfir.declare` operations in the entry block of the `omp.loop_nest`. This approach prevents `hlfir.declare` operations related to the binding and other operations resulting from the evaluation of the clauses from being inserted between loop wrapper operations, which would be an illegal MLIR representation. However, this introduces the problem of entry block arguments defined by a wrapper that then should be used by one of its nested wrappers, because the corresponding Fortran symbol would still be mapped to an outside value at the time of gathering the list of operands for the nested wrapper. This patch adds operand re-mapping logic to update wrappers without changing when clauses are evaluated or where the `hlfir.declare` creation is performed.	2024-10-31 16:39:53 +00:00
Marius Brehler	f5e6c8e0b7	[mlir][python] Raise maximum allowed version (#114050 ) Raises the maximum allowed versions to more recent versions, which is a basic enabler to install them in a venv using Python 3.13.	2024-10-31 17:39:26 +01:00
Krzysztof Drewniak	3452149c05	[mlir][AMDGPU] Support vector<2xbf16> packed atomic fadd (#113929 ) Now that we use LLVM's native bfloat types in the AMDGPU lowering, enable vector<2xbf16> for AMDGPU.	2024-10-31 10:52:53 -05:00
Jakub Kuderski	0f8a6b7d03	[mlir] Add fast walk-based pattern rewrite driver (#113825 ) This is intended as a fast pattern rewrite driver for the cases when a simple walk gets the job done but we would still want to implement it in terms of rewrite patterns (that can be used with the greedy pattern rewrite driver downstream). The new driver is inspired by the discussion in https://github.com/llvm/llvm-project/pull/112454 and the LLVM Dev presentation from @matthias-springer earlier this week. This limitation comes with some limitations: * It does not repeat until a fixpoint or revisit ops modified in place or newly created ops. In general, it only walks forward (in the post-order). * `matchAndRewrite` can only erase the matched op or its descendants. This is verified under expensive checks. * It does not perform folding / DCE. We could probably relax some of these in the future without sacrificing too much performance.	2024-10-31 11:10:09 -04:00
Ian Wood	1bc58a258e	Extend `getBackwardSlice` to track values captured from above (#113478 ) This change modifies `getBackwardSlice` to track values captures by the regions of each operation that it traverses. Ignoring values captured from a parent region may lead to an incomplete program slice. However, there seems to be logic that depends on not traversing captured values, so this change preserves the default behavior by hiding this logic behind the `omitUsesFromAbove` flag.	2024-10-31 07:47:48 -07:00
Sergio Afonso	bd6c21460f	[MLIR][OpenMP] Emit descriptive errors for all unsupported clauses (#114037 ) This patch improves error reporting in the MLIR to LLVM IR translation pass for the 'omp' dialect by emitting descriptive errors when encountering clauses not yet supported by that pass. Additionally, not-yet-implemented errors previously missing for some clauses are added, to avoid silently ignoring them. Error messages related to inlining of `omp.private` and `omp.declare_reduction` regions have been updated to use the same format.	2024-10-31 11:59:51 +00:00
Sergio Afonso	21a6032eca	[MLIR][OpenMP] Simplify translation to LLVM IR error handling (#114036 ) This patch unifies the handling of errors passed through the OpenMPIRBuilder and removes some redundant error messages through the introduction of a custom `ErrorInfo` subclass. Additionally, the current list of operations and clauses unsupported by the MLIR to LLVM IR translation pass is added to a new Lit test to check they are being reported to the user.	2024-10-31 11:34:24 +00:00
Longsheng Mou	262afc8aec	[mlir][TosaToLinalg] `RescaleConverter` only support integer type (#114239 ) This PR fixes a bug in the `RescaleConverter` that allows non-integer types, which leads to a crash. Fixes #61383.	2024-10-31 11:32:19 +00:00
Abid Qadeer	89f2d50cda	[mlir][debug] Support DIGenericSubrange. (#113441 ) `DIGenericSubrange` is used when the dimensions of the arrays are unknown at build time (e.g. assumed-rank arrays in Fortran). It has same `lowerBound`, `upperBound`, `count` and `stride` fields as in `DISubrange` and its translation looks quite similar as a result. --------- Co-authored-by: Tobias Gysi <tobias.gysi@nextsilicon.com>	2024-10-31 10:09:26 +00:00
Marc Auberer	084889802d	[mlir][docs][NFC] Fix typo in bufferization/transforms documentation (#114313 ) Fixes #114202	2024-10-31 09:40:45 +01:00
Longsheng Mou	fdc78120bd	[mlir][docs] Fix typo in bufferization documentation(NFC) (#114342 )	2024-10-31 14:08:54 +08:00
Caio Oliveira	6e75eec866	[mlir][spirv] Remove code for de-duplicating symbols in SPIR-V grammar (#111778 ) SPIR-V grammar was updated in upstream to have an "aliases" field instead of duplicating symbols with same values. See https://github.com/KhronosGroup/SPIRV-Headers/pull/447 for details.	2024-10-30 18:40:08 -04:00
Caio Oliveira	67c485798a	[mlir][spirv] Ignore extra comma for category_args in gen_spirv_dialect.py (#111776 ) In the code being parsed, the comma separates following traits from the category args. If there's no category args, it is still present.	2024-10-30 18:39:32 -04:00
Matthias Springer	d043670d66	[mlir][func] Replace `ValueDecomposer` with target materialization (#114192 ) The `ValueDecomposer` in `DecomposeCallGraphTypes` was a workaround around missing 1:N support in the dialect conversion. Since #113032, the dialect conversion infrastructure supports 1:N type conversions and 1:N target materializations. The `ValueDecomposer` class is no longer needed. (However, target materializations must still be inserted manually, until we fully merge the 1:1 and 1:N drivers.) Note for LLVM integration: Register 1:N target materializations on the type converter instead of "decompose value conversions" on the `ValueDecomposer`.	2024-10-31 07:26:12 +09:00
Ilya Enkovich	d2109640a3	[MLIR] [AMX] Fix strides used by AMX lowering for tile loads and stores. (#113476 )	2024-10-30 20:41:28 +01:00
Simon Camphausen	95c2d79814	[mlir][EmitC] memref-to-emitc: insert conversion_casts (#114204 ) Add materializations to the conversion pass, such that types of non-converted operands are legalized.	2024-10-30 15:27:23 +01:00
Asher Mancinelli	6af275b72e	[mlir][doc] Fix nitpicks in documentation (#114157 ) A couple of these are probably up to preference, but the grammar/capitalization changes are probably more critical for readability.	2024-10-30 07:07:49 -07:00
Matthias Springer	217700baf7	[mlir][bufferization] Support bufferization of external functions (#113999 ) This commit adds support for bufferizing external functions that have no body. Such functions were previously rejected by One-Shot Bufferize if they returned a tensor value. This commit is in preparation of removing the deprecated `func-bufferize` pass. That pass can bufferize external functions. Also update a few comments.	2024-10-30 21:49:10 +09:00
Matthias Springer	ea050ab1a9	[mlir][Transforms][NFC] Dialect conversion: Reformat materialization error message (#114176 ) This commit changes the format of the materialization error message. Previously: `failed to legalize unresolved materialization from ('f64') to 'f32' that remained live after conversion` Now: `failed to legalize unresolved materialization from ('f64') to ('f32') that remained live after conversion` This commit is in preparation of merging the 1:1 and 1:N dialect conversions. At that point, target materializations may create more than one SSA value. I am sending this change as a separate PR to keep the main PR smaller.	2024-10-30 21:36:39 +09:00
donald chen	df0d249b65	[mlir] [linalg] fix side effect of linalg op (#114045 ) Linalg op need to take into account memory side effects happening inside the region when determining their own side effects. This patch fixed issue https://github.com/llvm/llvm-project/issues/112881	2024-10-30 14:01:49 +08:00
Jessica Clarke	9467645547	[CodeGen] Rename MVT::iPTRAny to MVT::pAny Whilst in upstream LLVM iPTRAny is only ever an integer, essentially an alias for iPTR, this is not true in CHERI LLVM, where it gets used to mean "iPTR or cPTR", i.e. either an integer address or a capability (with cPTR and cN being the capability equivalents of iPTR and iN). Moreover, iPTRAny is already not itself regarded as an integer (calling isInteger() will give false), so the "i" prefix is misleading, and it stands out as different from all the other xAny that have a single letter prefix denoting their type. Thus, rename it to pAny, reflecting that it is an overloaded pointer type, which could end up being specialised to an integer type, but does not have to be. This has been verified to have no effect on the generated files for LLVM itself or any in-tree target beyond the replacement of the identifier iPTRAny with pAny in GenVT.inc. Reviewers: arsenm Reviewed By: arsenm Pull Request: https://github.com/llvm/llvm-project/pull/113733	2024-10-30 03:27:48 +00:00
lialan	2c313259c6	[MLIR] VectorEmulateNarrowType to support loading of unaligned vectors (#113411 ) Previously, the pass only supported emulation of loading vector sizes that are multiples of the emulated data type. This patch expands its support for emulating sizes that are not multiples of byte sizes. In such cases, the element values are packed back-to-back to preserve memory space. To give a concrete example: if an input has type `memref<3x3xi2>`, it is actually occupying 3 bytes in memory, with the first 18 bits storing the values and the last 6 bits as padding. The slice of `vector<3xi2>` at index `[2, 0]` is stored in memory from bit 12 to bit 18. To properly load the elements from bit 12 to bit 18 from memory, first load byte 2 and byte 3, and convert it to a vector of `i2` type; then extract bits 4 to 10 (element index 2-5) to form a `vector<3xi2>`. A limitation of this patch is that the linearized index of the unaligned vector has to be known at compile time. Extra code needs to be emitted to handle it if the condition does not hold. The following ops are updated: * `vector::LoadOp` * `vector::TransferReadOp` * `vector::MaskedLoadOp`	2024-10-29 20:04:48 -07:00
Kunwar Grover	2c5eea0e88	[mlir][Vector] Fix vector.insert folder for scalar to 0-d inserts (#113828 ) The current vector.insert folder tries to replace a scalar with a 0-rank vector. This patch fixes this crash by not folding unless they types of the result and replacement are same.	2024-10-29 22:47:44 +00:00
Kazu Hirata	6f66530fd1	[mlir] Fix a warning This patch fixes: mlir/lib/Pass/PassRegistry.cpp:425:37: error: ISO C++ requires the name after '::~' to be found in the same scope as the name before '::~' [-Werror,-Wdtor-name]	2024-10-29 10:55:34 -07:00
Sergio Afonso	a1f2fb6078	[MLIR][OpenMP] Prevent composite omp.simd related crashes (#113680 ) This patch updates the translation of `omp.wsloop` with a nested `omp.simd` to prevent uses of block arguments defined by the latter from triggering null pointer dereferences. This happens because the inner `omp.simd` operation representing composite `do simd` constructs is currently skipped and not translated, but this results in block arguments defined by it not being mapped to an LLVM value. The proposed solution is to map these block arguments to the LLVM value associated to the corresponding operand, which is defined above.	2024-10-29 17:05:12 +00:00
Andrzej Warzyński	39ad84e4d1	[mlir][linalg] Split GenericPadOpVectorizationPattern into two patterns (#111349 ) At the moment, `GenericPadOpVectorizationPattern` implements two orthogonal transformations: 1. Rewrites `tensor::PadOp` into a sequence of `tensor::EmptyOp`, `linalg::FillOp` and `tensor::InsertSliceOp`. 2. Vectorizes (where possible) `tensor::InsertSliceOp` (see `tryVectorizeCopy`). This patch splits `GenericPadOpVectorizationPattern` into two separate patterns: 1. `GeneralizePadOpPattern` for the first transformation (note that currently `GenericPadOpVectorizationPattern` inherits from `GeneralizePadOpPattern`). 2. `InsertSliceVectorizePattern` to vectorize `tensor::InsertSliceOp`. With this change, we gain the following: * a clear separation between pre-processing and vectorization transformations/stages, * a path to support masked vectorisation for `tensor.insert_slice` (with a dedicated pattern for vectorization, it is much easier to specify the input vector sizes used in masking), * more opportunities to vectorize `tensor.insert_slice`. Note for downstream users: -------------------------- If you were using `populatePadOpVectorizationPatterns`, following this change you will also have to add `populateInsertSliceVectorizationPatterns`. Finer implementation details: ----------------------------- 1. The majority of changes in this patch are copy & paste + some edits. 1.1. The only functional change is that the vectorization of `tensor.insert_slice` is now broadly available (as opposed to being constrained to the pad vectorization pattern: `GenericPadOpVectorizationPattern`). 1.2. Following-on from the above, `@pad_and_insert_slice_dest` is updated. As expected, the input `tensor.insert_slice` Op is no longer "preserved" and instead gets vectorized successfully. 2. The `linalg.fill` case in `getConstantPadVal` works under the assumption that only _scalar_ source values can be used. That's consistent with the definition of the Op, but it's not tested at the moment. Hence a test case in Linalg/invalid.mlir is added. 3. The behaviour of the two TD vectorization Ops, `transform.structured.vectorize_children_and_apply_patterns` and `transform.structured.vectorize` is preserved.	2024-10-29 16:57:23 +00:00
Hugo Trachino	a9c417c28a	[MLIR][SCF] Fix LoopPeelOp documentation (NFC) (#113179 ) As an example, I added annotations to the peel_front unit test. ``` func.func @loop_peel_first_iter_op() { // CHECK: %[[C0:.+]] = arith.constant 0 // CHECK: %[[C41:.+]] = arith.constant 41 // CHECK: %[[C5:.+]] = arith.constant 5 // CHECK: %[[C5_0:.+]] = arith.constant 5 // CHECK: scf.for %{{.+}} = %[[C0]] to %[[C5_0]] step %[[C5]] // CHECK: arith.addi // CHECK: scf.for %{{.+}} = %[[C5_0]] to %[[C41]] step %[[C5]] // CHECK: arith.addi %0 = arith.constant 0 : index %1 = arith.constant 41 : index %2 = arith.constant 5 : index scf.for %i = %0 to %1 step %2 { arith.addi %i, %i : index } return } module attributes {transform.with_named_sequence} { transform.named_sequence @__transform_main(%arg1: !transform.any_op {transform.readonly}) { %0 = transform.structured.match ops{["arith.addi"]} in %arg1 : (!transform.any_op) -> !transform.any_op %1 = transform.get_parent_op %0 {op_name = "scf.for"} : (!transform.any_op) -> !transform.op<"scf.for"> %main_loop, %remainder = transform.loop.peel %1 {peel_front = true} : (!transform.op<"scf.for">) -> (!transform.op<"scf.for">, !transform.op<"scf.for">) transform.annotate %main_loop "main_loop" : !transform.op<"scf.for"> transform.annotate %remainder "remainder" : !transform.op<"scf.for"> transform.yield } } ``` Gives : ``` func.func @loop_peel_first_iter_op() { %c0 = arith.constant 0 : index %c41 = arith.constant 41 : index %c5 = arith.constant 5 : index %c5_0 = arith.constant 5 : index scf.for %arg0 = %c0 to %c5_0 step %c5 { %0 = arith.addi %arg0, %arg0 : index } {remainder} // The first iteration loop (second result) has been annotated remainder scf.for %arg0 = %c5_0 to %c41 step %c5 { %0 = arith.addi %arg0, %arg0 : index } {main_loop} // The main loop (first result) has been annotated main_loop return } ``` --------- Co-authored-by: Andrzej Warzyński <andrzej.warzynski@gmail.com>	2024-10-29 15:47:13 +00:00
goldsteinn	2e612f8d86	[MLIR][Arith] Improve accuracy of `inferDivU` (#113789 ) 1) We can always bound the maximum with the numerator. - https://alive2.llvm.org/ce/z/PqHvuT 2) Even if denominator min can be zero, we can still bound the minimum result with `lhs.umin u/ rhs.umax`. This is similar to https://github.com/llvm/llvm-project/pull/110169	2024-10-29 09:41:59 -05:00
Piotr Fusik	c370869cd6	[mlir][NFC] Avoid a warning (#114052 ) gcc 14.1 warning: template-id not allowed for destructor in C++20 [-Wtemplate-id-cdtor]	2024-10-29 15:01:37 +01:00
Matthias Springer	c0cba25cdd	[mlir][Transforms] Dialect conversion: Hardening `replaceOp` (#109540 ) This commit adds extra checks/assertions to the `ConversionPatternRewriterImpl::notifyOpReplaced` to improve its robustness. 1. Replacing an `unrealized_conversion_cast` op that was created by the driver is now forbidden and caught early during `replaceOp`. It may work in some cases, but it is generally dangerous because the conversion driver keeps track of these ops and performs some extra legalization steps during the "finalize" phase. (Erasing is them is fine.) 2. `null` replacement values are no longer registered in the `ConversionValueMapping`. This was an oversight in #106760. There is no benefit in having `null` values in the `ConversionValueMapping`. (It may even cause problems.) This change is in preparation of merging the 1:1 and 1:N dialect conversion drivers.	2024-10-29 21:13:54 +09:00
Matthias Springer	6588073724	[mlir][func] Fix incorrect API usage in `FuncOpConversion` (#113977 ) This commit fixes a case of incorrect dialect conversion API usage during `FuncOpConversion`. `replaceAllUsesExcept` (same as `replaceAllUsesWith`) is currently not supported in a dialect conversion. `replaceUsesOfBlockArgument` should be used instead. It sometimes works anyway (like in this case), but that's just because of the way we insert materializations. This commit is in preparation of merging the 1:1 and 1:N dialect conversion drivers. (At that point, the current use of `replaceAllUsesExcept` will no longer work.)	2024-10-29 13:19:43 +09:00
Matthias Springer	1549a0c183	[mlir][SCF] Remove `scf-bufferize` pass (#113840 ) The dialect conversion-based bufferization passes have been migrated to One-Shot Bufferize about two years ago. To clean up the code base, this commit removes the `scf-bufferize` pass, one of the few remaining parts of the old infrastructure. Most bufferization passes have already been removed. Note for LLVM integration: If you depend on this pass, migrate to One-Shot Bufferize or copy the pass to your codebase.	2024-10-29 09:10:30 +09:00
Thomas Preud'homme	7db4cacfd7	[MLIR] Add missing MLIRLLVMDialect dep to MLIRLinalgToStandard (#113561 ) This fixes the following failure when doing a clean build (in particular no .ninja* lying around) of lib/libMLIRLinalgToStandard.a only: ``` In file included from llvm/include/llvm/IR/Module.h:22, from mlir/include/mlir/Dialect/LLVMIR/LLVMDialect.h:37, from mlir/lib/Conversion/LinalgToStandard/LinalgToStandard.cpp:13: llvm/include/llvm/IR/Attributes.h:90:14: fatal error: llvm/IR/Attributes.inc: No such file or directory ```	2024-10-28 22:53:39 +01:00
Thomas Preud'homme	82cb22e735	[MLIR] Add missing MLIRLLVMDialect dep to MLIRMathToLibm (#113563 ) This fixes the following failure when doing a clean build (in particular no .ninja* lying around) of lib/libMLIRMathToLibm.a only: ``` In file included from llvm/include/llvm/IR/Module.h:22, from mlir/include/mlir/Dialect/LLVMIR/LLVMDialect.h:37, from mlir/lib/Conversion/MathToLibm/MathToLibm.cpp:13 llvm/include/llvm/IR/Attributes.h:90:14: fatal error: llvm/IR/Attributes.inc: No such file or directory ```	2024-10-28 22:50:23 +01:00
Petr Kurapov	7a710110fc	[MLIR][Vector] Remove unused and unimplemented Vector_WarpExecuteOnLa… (#112338 ) …ne0Op builder Removing the declaration instead of implementing the builder as discussed in #110106	2024-10-28 17:12:12 +01:00
donald chen	39ac64c1c0	[mlir][Arith] ValueBoundsInterface: speedup arith.select (#113531 ) When calculating value bounds in the arith.select op , the compare function is invoked to compare trueValue and falseValue. This function rebuilds constraints, resulting in repeated computations of value bounds. In large-scale programs, this redundancy significantly impacts compilation time.	2024-10-28 10:14:44 +08:00
Longsheng Mou	7ad63c0e44	[mlir][MathToFuncs] `MathToFuncs` only support integer type (#113693 ) This PR fixes a bug in `MathToFuncs` where it incorrectly converts index type for `math.ctlz` and `math.ipowi`, leading to a crash. Fixes #108150.	2024-10-28 09:54:51 +08:00
Durgadoss R	e33aec89ef	[MLIR][NVVM] Update the elect.sync Op to use intrinsics (#113757 ) Recently, we added an intrinsic for the elect.sync PTX instruction (PR 104780). This patch updates the corresponding Op in NVVM Dialect to lower to the intrinsic instead of inline-ptx. The existing test under Conversion/ is migrated to check for the new pattern. A separate test is added to verify the lowered intrinsic under the Target/ directory. Signed-off-by: Durgadoss R <durgadossr@nvidia.com>	2024-10-27 22:24:31 +05:30

1 2 3 4 5 ...

21084 Commits