clang-p2996

Author	SHA1	Message	Date
Fangrui Song	111fcb0df0	[llvm] Fix duplicate word typos. NFC Those fixes were taken from https://reviews.llvm.org/D137338	2023-09-01 18:25:16 -07:00
Nikita Popov	98cf20f890	Revert "[Verifier] Sanity check alloca size against DILocalVariable fragment size" This reverts commit `183f49c3e0`. The lang/cpp/trivial_abi/TestTrivialABI.py lldb test fails on buildbots.	2023-08-28 09:44:51 +02:00
Nikita Popov	183f49c3e0	[Verifier] Sanity check alloca size against DILocalVariable fragment size Add a check that the DILocalVariable fragment size in dbg.declare does not exceed the size of the alloca. This would have caught the invalid debuginfo regenerated by rustc in https://github.com/llvm/llvm-project/issues/64149. Differential Revision: https://reviews.llvm.org/D158743	2023-08-28 09:16:33 +02:00
LiaoChunyu	1b12427c01	[VP][RISCV] Add vp.is.fpclass and RISC-V support There is no vp.fpclass after FCLASS_VL(D151176), try to support vp.fpclass. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D152993	2023-08-25 15:40:55 +08:00
Sameer Sahasrabuddhe	0cb6d2f643	[LLVM][Convergence] further refactor convergence verifier This is in preparation for using the same convergence verifier for both LLVM IR and Machine IR. Reviewed By: yassingh Differential Revision: https://reviews.llvm.org/D158394	2023-08-23 12:02:30 +05:30
Felipe de Azevedo Piovezan	26ea983d22	[Verifier] Allow undef/poison in entry_values expressions. This patch relaxes the verifier when it checks whether an OP_entry_value has a valid Value associated with it. We now allow undef/poison values as well, since those may be introduced naturally through optimization. Differential Revision: https://reviews.llvm.org/D158101	2023-08-17 09:16:08 -04:00
Diana Picus	5a8ecd6456	[AMDGPU] More verifier checks for llvm.amdgcn.cs.chain Check that the SGPR arguments have the `inreg` attribute and the VGPR arguments don't. Differential Revision: https://reviews.llvm.org/D156409	2023-08-17 09:37:20 +02:00
Bjorn Pettersson	e53b28c833	[llvm] Drop some bitcasts and references related to typed pointers Differential Revision: https://reviews.llvm.org/D157551	2023-08-10 15:07:07 +02:00
Sameer Sahasrabuddhe	bd7a4d7b27	Restore "[LLVM] move verification of convergence control to a class template"" The refactored template can now be used with MachineVerifier. Resubmitted after fixing build errors: - Shared libraries build failed with undefined references due to "extern template" declarations. - Modules build failed due to a cycle dependence between llvm/ADT and llvm/IR. The Generic*Impl.h files should be in llvm/IR to prevent this. Differential Revision: https://reviews.llvm.org/D156522 This restores commit `93a3706711`. Originally reverted in `466bd99811`.	2023-08-03 10:36:57 +05:30
Sameer Sahasrabuddhe	466bd99811	Revert "[LLVM] move verification of convergence control to a class template" This reverts commit `93a3706711`. The "extern template" declaration of CycleInfo caused problems in a shared build when CycleInfo was removed from Verifier.cpp. There needs to be an explicit instantiation corresponding to an extern template in every SO.	2023-08-01 17:00:39 +05:30
Benjamin Kramer	502280ed35	[Verifier] Pass raw_ostream as pointer instead of reference This can be nullptr and ubsan found a couple of cases in LLVM's unit tests.	2023-08-01 10:44:28 +02:00
Sameer Sahasrabuddhe	93a3706711	[LLVM] move verification of convergence control to a class template The refactored template can now be used with MachineVerifier. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D156522	2023-08-01 11:21:48 +05:30
Sameer Sahasrabuddhe	f7b09516e4	[LLVM] Add missing verifier checks for convergence control	2023-07-31 11:08:19 +05:30
DianQK	2ee4d0386c	[Verifier] definition subprograms cannot be nested within DICompositeType when enabling ODR. Resolve https://github.com/llvm/llvm-project/issues/61932. We should add the validation. LLVM can't handle IR where subprogram definitions are nested within DICompositeType when doing LTO builds, because there's no good way to cross the CU boundary to insert a nested DISubprogram definition in one CU into a type defined in another CU. The test cases `cross-cu-inlining-2.ll` and `cross-cu-inlining-ranges.ll` can be deleted. In the `cross-cu-inlining-2.ll`, the low pc and high pc of the CU are also incorrect. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D152095	2023-07-26 06:08:32 +08:00
Zhongyunde	7203286329	[LangRef] vscale_range implies the vscale is power-of-two According the discuss on D154953, we need to make the LangRef change before the optimization relied on the new behaviour: vscale_range implies vscale is a power-of-two value, parse of the attribute to reject values that are not a power-of-two. Thanks nikic for the wonderful summary of discussing on D154953: To provide a bit more context here. We would like to have power of two vscale exposed in a target-independent way, so we can make use of this in places like ValueTracking, just like we currently do the vscale range. Some options that have been discussed are: - Remove support for non-power-of-two vscales entirely. (This is my personal preference, but this is hard to undo if it turns out someone does need them.) - Add an extra attribute vscale_pow2, or a data layout property. - Make vscale_range imply power-of-two vscale, as a compromise solution (what this patch does). This would be relatively easy to turn into one of the two above at a later point. Reviewed By: paulwalker-arm, nikic, efriedma Differential Revision: https://reviews.llvm.org/D155193	2023-07-15 09:13:48 +08:00
Nikita Popov	a8e76e89ce	[Verifier] Remove typed pointer verification (NFC)	2023-07-14 10:38:11 +02:00
Sameer Sahasrabuddhe	da61c865e7	[RFC] Introduce convergence control intrinsics This is a reboot of the original design and implementation by Nicolai Haehnle <nicolai.haehnle@amd.com>: https://reviews.llvm.org/D85603 This change also obsoletes an earlier attempt at restarting the work on convergence tokens: https://reviews.llvm.org/D104504 Changes relative to D85603: 1. Clean up the definition of a "convergent operation", a convergent call and convergent function. 2. Clean up the relationship between dynamic instances, sets of threads and convergence tokens. 3. Redistribute the formal rules into the definitions of the convergence intrinsics. 4. Expand on the semantics of entering a function from outside LLVM, and the environment-defined outcome of the entry intrinsic. 5. Replace the term "cycle" with "closed path". The static rules are defined in terms of closed paths, and then a relation is established with cycles. 6. Specify that if a function contains a controlled convergent operation, then all convergent operations in that function must be controlled. 7. Describe an optional procedure to infer tokens for uncontrolled convergent operations. 8. Introduce controlled maximal convergence-before and controlled m-converged property as an update to the original properties in UniformityAnalysis. 9. Additional constraint that a cycle heart can only occur in the header of a reducible cycle (natural loop). Reviewed By: nhaehnle Differential Revision: https://reviews.llvm.org/D147116	2023-07-12 12:31:42 +05:30
Matt Arsenault	53acadafdd	Verifier: Verify absolute_symbol metadata This is the same as !range except for one edge case.	2023-06-30 12:31:32 -04:00
Matt Arsenault	a2ce822a09	Verifier: Fix assertion on range metadata with equal bounds This only worked if the same values were the min or max. We also seem to be missing proper assembler tests for this.	2023-06-30 12:31:32 -04:00
Elliot Goodrich	f0fa2d7c29	[llvm] Move AttributeMask to a separate header Move `AttributeMask` out of `llvm/IR/Attributes.h` to a new file `llvm/IR/AttributeMask.h`. After doing this we can remove the `#include <bitset>` and `#include <set>` directives from `Attributes.h`. Since there are many headers including `Attributes.h`, but not needing the definition of `AttributeMask`, this causes unnecessary bloating of the translation units and slows down compilation. This commit adds in the include directive for `llvm/IR/AttributeMask.h` to the handful of source files that need to see the definition. This reduces the total number of preprocessing tokens across the LLVM source files in lib from (roughly) 1,917,509,187 to 1,902,982,273 - a reduction of ~0.76%. This should result in a small improvement in compilation time. Differential Revision: https://reviews.llvm.org/D153728	2023-06-27 15:26:17 +01:00
Diana Picus	8762bc77b4	[AMDGPU] Add llvm.amdgcn.cs.chain intrinsic to IR & verifier We only check a subset of the constraints in the verifier: * that we only call the intrinsic from functions with a restricted set of calling conventions * that the 'flags' argument is an immediate Other checks are (probably) more appropriate for codegen. Differential Revision: https://reviews.llvm.org/D151995	2023-06-22 10:02:45 +02:00
Diana Picus	29dcc4c143	[AMDGPU] Add amdgpu_cs_chain[_preserve] CCs to IR & verifier Add the amdgpu_cs_chain and amdgpu_cs_chain_preserve keywords to LLVM IR and make sure we can parse and print them. Also make sure we perform some basic checks in the IR verifier - similar to what we check for many of the other AMDGPU calling conventions, plus the additional restriction that we can't have direct calls to functions with these calling conventions. Differential Revision: https://reviews.llvm.org/D151994	2023-06-22 10:02:45 +02:00
Vladislav Dzhidzhoev	6bea8331f9	Revert "Reland "[DebugMetadata][DwarfDebug] Support function-local types in lexical block scopes (4/7)" (2)" This reverts commit `cb9ac70515`. It causes an assert in clang: virtual void llvm::DwarfDebug::endFunctionImpl(const llvm::MachineFunction*): Assertion `LScopes.getAbstractScopesList().size() == NumAbstractSubprograms && "getOrCreateAbstractScope() inserted an abstract subprogram scope"' failed. https://bugs.chromium.org/p/chromium/issues/detail?id=1456288#c2	2023-06-20 13:08:47 +02:00
Vladislav Dzhidzhoev	cb9ac70515	Reland "[DebugMetadata][DwarfDebug] Support function-local types in lexical block scopes (4/7)" (2) Test "local-type-as-template-parameter.ll" is now enabled only for x86_64. Authored-by: Kristina Bessonova <kbessonova@accesssoftek.com> Differential Revision: https://reviews.llvm.org/D144006 Depends on D144005	2023-06-20 03:01:46 +02:00
Vladislav Dzhidzhoev	fec7c6457c	Revert "Reland "[DebugMetadata][DwarfDebug] Support function-local types in lexical block scopes (4/7)"" This reverts commit `2da45172c4`. Test local-type-as-template-parameter.ll fails on ppc64-aix.	2023-06-20 01:54:48 +02:00
Vladislav Dzhidzhoev	2da45172c4	Reland "[DebugMetadata][DwarfDebug] Support function-local types in lexical block scopes (4/7)" Test "local-type-as-template-parameter.ll" now requires linux-system. Authored-by: Kristina Bessonova <kbessonova@accesssoftek.com> Differential Revision: https://reviews.llvm.org/D144006 Depends on D144005	2023-06-19 19:50:46 +02:00
Vladislav Dzhidzhoev	aeb99dc48a	Revert "[DebugMetadata][DwarfDebug] Support function-local types in lexical block scopes (4/7)" This reverts commit `66511b4010`. llvm/test/DebugInfo/Generic/local-type-as-template-parameter.ll is broken.	2023-06-19 19:16:13 +02:00
Vladislav Dzhidzhoev	66511b4010	[DebugMetadata][DwarfDebug] Support function-local types in lexical block scopes (4/7) RFC https://discourse.llvm.org/t/rfc-dwarfdebug-fix-and-improve-handling-imported-entities-types-and-static-local-in-subprogram-and-lexical-block-scopes/68544 Similar to imported declarations, the patch tracks function-local types in DISubprogram's 'retainedNodes' field. DwarfDebug is adjusted in accordance with the aforementioned metadata change and provided a support of function-local types scoped within a lexical block. The patch assumes that DICompileUnit's 'enums field' no longer tracks local types and DwarfDebug would assert if any locally-scoped types get placed there. Authored-by: Kristina Bessonova <kbessonova@accesssoftek.com> Differential Revision: https://reviews.llvm.org/D144006 Depends on D144005	2023-06-19 16:42:43 +02:00
Vladislav Dzhidzhoev	06a0ae6524	Reland "[DebugMetadata][DwarfDebug] Fix DWARF emisson of function-local imported entities (3/7)" Got rid of non-determinism in MetadataLoader.cpp. Authored-by: Kristina Bessonova <kbessonova@accesssoftek.com> Differential Revision: https://reviews.llvm.org/D144004	2023-06-16 00:49:59 +02:00
Vladislav Dzhidzhoev	b8ea03a4be	Revert "Reland "[DebugMetadata][DwarfDebug] Fix DWARF emisson of function-local imported entities (3/7)"" This reverts commit `fcc3981626`, since Bitcode-upgrading code doesn't seem to be deterministic.	2023-06-15 19:36:36 +02:00
Vladislav Dzhidzhoev	fcc3981626	Reland "[DebugMetadata][DwarfDebug] Fix DWARF emisson of function-local imported entities (3/7)" Run split-dwarf-local-impor3.ll only on x86_64-linux.	2023-06-15 18:15:16 +02:00
Vladislav Dzhidzhoev	fbdeb8cbc1	Revert "[DebugMetadata][DwarfDebug] Fix DWARF emisson of function-local imported entities (3/7)" This reverts commit `d80fdc6fc1`. split-dwarf-local-impor3.ll fails because of an issue with Dwo sections emission on Windows platform.	2023-06-15 18:04:32 +02:00
Vladislav Dzhidzhoev	d80fdc6fc1	[DebugMetadata][DwarfDebug] Fix DWARF emisson of function-local imported entities (3/7) RFC https://discourse.llvm.org/t/rfc-dwarfdebug-fix-and-improve-handling-imported-entities-types-and-static-local-in-subprogram-and-lexical-block-scopes/68544 Fixed PR51501 (tests from D112337). 1. Reuse of DISubprogram's 'retainedNodes' to track other function-local entities together with local variables and labels (this patch cares about function-local import while D144006 and D144008 use the same approach for local types and static variables). So, effectively this patch moves ownership of tracking local import from DICompileUnit's 'imports' field to DISubprogram's 'retainedNodes' and adjusts DWARF emitter for the new layout. The old layout is considered unsupported (DwarfDebug would assert on such debug metadata). DICompileUnit's 'imports' field is supposed to track global imported declarations as it does before. This addresses various FIXMEs and simplifies the next part of the patch. 2. Postpone emission of function-local imported entities from `DwarfDebug::endFunctionImpl()` to `DwarfDebug::endModule()`. While in `DwarfDebug::endFunctionImpl()` we do not have all the information about a parent subprogram or a referring subprogram (whether a subprogram inlined or not), so we can't guarantee we emit an imported entity correctly and place it in a proper subprogram tree. So now, we just gather needed details about the import itself and its parent entity (either a Subprogram or a LexicalBlock) during processing in `DwarfDebug::endFunctionImpl()`, but all the real work is done in `DwarfDebug::endModule()` when we have all the required information to make proper emission. Authored-by: Kristina Bessonova <kbessonova@accesssoftek.com> Differential Revision: https://reviews.llvm.org/D144004	2023-06-15 17:17:53 +02:00
Vladislav Dzhidzhoev	77f8f40cd4	Revert "[DebugMetadata][DwarfDebug] Fix DWARF emisson of function-local imported entities (3/7)" This reverts commit `ed578f02cf`. Tests llvm/test/DebugInfo/Generic/split-dwarf-local-import*.ll fail when x86_64 target is not registered.	2023-06-15 16:53:36 +02:00
Vladislav Dzhidzhoev	ed578f02cf	[DebugMetadata][DwarfDebug] Fix DWARF emisson of function-local imported entities (3/7) RFC https://discourse.llvm.org/t/rfc-dwarfdebug-fix-and-improve-handling-imported-entities-types-and-static-local-in-subprogram-and-lexical-block-scopes/68544 Fixed PR51501 (tests from D112337). 1. Reuse of DISubprogram's 'retainedNodes' to track other function-local entities together with local variables and labels (this patch cares about function-local import while D144006 and D144008 use the same approach for local types and static variables). So, effectively this patch moves ownership of tracking local import from DICompileUnit's 'imports' field to DISubprogram's 'retainedNodes' and adjusts DWARF emitter for the new layout. The old layout is considered unsupported (DwarfDebug would assert on such debug metadata). DICompileUnit's 'imports' field is supposed to track global imported declarations as it does before. This addresses various FIXMEs and simplifies the next part of the patch. 2. Postpone emission of function-local imported entities from `DwarfDebug::endFunctionImpl()` to `DwarfDebug::endModule()`. While in `DwarfDebug::endFunctionImpl()` we do not have all the information about a parent subprogram or a referring subprogram (whether a subprogram inlined or not), so we can't guarantee we emit an imported entity correctly and place it in a proper subprogram tree. So now, we just gather needed details about the import itself and its parent entity (either a Subprogram or a LexicalBlock) during processing in `DwarfDebug::endFunctionImpl()`, but all the real work is done in `DwarfDebug::endModule()` when we have all the required information to make proper emission. Authored-by: Kristina Bessonova <kbessonova@accesssoftek.com> Differential Revision: https://reviews.llvm.org/D144004	2023-06-15 16:15:39 +02:00
Vladislav Dzhidzhoev	a7e7d34dc1	Revert "[DebugMetadata][DwarfDebug] Fix DWARF emisson of function-local imported entities (3/7)" This reverts commit `d04452d548` since test llvm-project/llvm/test/Bitcode/DIImportedEntity_backward.ll is broken.	2023-06-15 14:35:54 +02:00
Vladislav Dzhidzhoev	d04452d548	[DebugMetadata][DwarfDebug] Fix DWARF emisson of function-local imported entities (3/7) RFC https://discourse.llvm.org/t/rfc-dwarfdebug-fix-and-improve-handling-imported-entities-types-and-static-local-in-subprogram-and-lexical-block-scopes/68544 Fixed PR51501 (tests from D112337). 1. Reuse of DISubprogram's 'retainedNodes' to track other function-local entities together with local variables and labels (this patch cares about function-local import while D144006 and D144008 use the same approach for local types and static variables). So, effectively this patch moves ownership of tracking local import from DICompileUnit's 'imports' field to DISubprogram's 'retainedNodes' and adjusts DWARF emitter for the new layout. The old layout is considered unsupported (DwarfDebug would assert on such debug metadata). DICompileUnit's 'imports' field is supposed to track global imported declarations as it does before. This addresses various FIXMEs and simplifies the next part of the patch. 2. Postpone emission of function-local imported entities from `DwarfDebug::endFunctionImpl()` to `DwarfDebug::endModule()`. While in `DwarfDebug::endFunctionImpl()` we do not have all the information about a parent subprogram or a referring subprogram (whether a subprogram inlined or not), so we can't guarantee we emit an imported entity correctly and place it in a proper subprogram tree. So now, we just gather needed details about the import itself and its parent entity (either a Subprogram or a LexicalBlock) during processing in `DwarfDebug::endFunctionImpl()`, but all the real work is done in `DwarfDebug::endModule()` when we have all the required information to make proper emission. Authored-by: Kristina Bessonova <kbessonova@accesssoftek.com> Differential Revision: https://reviews.llvm.org/D144004	2023-06-15 14:29:03 +02:00
Matt Arsenault	d065b1d65b	AutoUpgrade: Fix crash when tbaa has an empty argument Produce a verifier error instead.	2023-06-05 20:44:58 -04:00
Craig Topper	c5e6c886aa	[VP][SelectionDAG][RISCV] Add get_vector_length intrinsics and generic SelectionDAG support. The generic implementation is umin(TC, VF * vscale). Lowering to vsetvli for RISC-V will come in a future patch. This patch is a pre-requisite to be able to CodeGen vectorized code from D99750. Reviewed By: reames, frasercrmck Differential Revision: https://reviews.llvm.org/D149916	2023-05-26 09:06:38 -07:00
eopXD	c8eb535aed	[1/11][IR] Permit load/store/alloca for struct of the same scalable vector type This patch-set aims to simplify the existing RVV segment load/store intrinsics to use a type that represents a tuple of vectors instead. To achieve this, first we need to relax the current limitation for an aggregate type to be a target of load/store/alloca when the aggregate type contains homogeneous scalable vector types. Then to adjust the prolog of an LLVM function during lowering to clang. Finally we re-define the RVV segment load/store intrinsics to use the tuple types. The pull request under the RVV intrinsic specification is riscv-non-isa/rvv-intrinsic-doc#198 --- This is the 1st patch of the patch-set. This patch is originated from D98169. This patch allows aggregate type (StructType) that contains homogeneous scalable vector types to be a target of load/store/alloca. The RFC of this patch was posted in LLVM Discourse. https://discourse.llvm.org/t/rfc-ir-permit-load-store-alloca-for-struct-of-the-same-scalable-vector-type/69527 The main changes in this patch are: Extend `StructLayout::StructSize` from `uint64_t` to `TypeSize` to accommodate an expression of scalable size. Allow `StructType:isSized` to also return true for homogeneous scalable vector types. Let `Type::isScalableTy` return true when `Type` is `StructType` and contains scalable vectors Extra description is added in the LLVM Language Reference Manual on the relaxation of this patch. Authored-by: Hsiangkai Wang <kai.wang@sifive.com> Co-Authored-by: eop Chen <eop.chen@sifive.com> Reviewed By: craig.topper, nikic Differential Revision: https://reviews.llvm.org/D146872	2023-05-19 09:39:36 -07:00
Felipe de Azevedo Piovezan	becfcdfc81	[Verifier] Allow DW_OP_LLVM_entry_value in IR A follow up patch will make the CoroSplit pass introduce such operations in the IR level when it is safe to do so. Depends on D149748 Differential Revision: https://reviews.llvm.org/D149778	2023-05-10 14:35:04 -04:00
Zain Jaffal	5d3a884229	[IRGen] Change annotation metadata to support inserting tuple of strings into annotation metadata array. Annotation metadata supports adding singular annotation strings to annotation block. This patch adds the ability to insert a tuple of strings into the metadata array. The idea here is that each tuple of strings represents a piece of information that can be all related. It makes it easier to parse through related metadata information given it will be contained in one tuple. For example in remarks any pass that implements annotation remarks can have different type of remarks and pass additional information for each. The original behaviour of annotation remarks is preserved here and we can mix tuple annotations and single annotations for the same instruction. Reviewed By: paquette Differential Revision: https://reviews.llvm.org/D148328	2023-05-09 17:51:28 +03:00
Shraiysh Vaishay	7021182d6b	[nfc][llvm] Replace pointer cast functions in PointerUnion by llvm casting functions. This patch replaces the uses of PointerUnion.is function by llvm::isa, PointerUnion.get function by llvm::cast, and PointerUnion.dyn_cast by llvm::dyn_cast_if_present. This is according to the FIXME in the definition of the class PointerUnion. This patch does not remove them as they are being used in other subprojects. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D148449	2023-04-17 13:40:51 -05:00
Yeting Kuo	6858a920b8	[RISCV] Support vector type strict_[su]int_to_fp and strict_fp_to_[su]int. Also the patch loose the fixed vector contraint in llvm/lib/IR/Verifier.cpp. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D147380	2023-04-06 10:09:44 +08:00
Florian Hahn	1a7cf7a182	[Verifier] Verify sizes of matrix.multiply operands and specified shape. Extend the verifier to check if the size of the matrix operands of matrix.multiply match the sizes specified by the numeric arguments. Reviewed By: thegameg Differential Revision: https://reviews.llvm.org/D147466	2023-04-04 20:51:43 +01:00
Nikita Popov	af101f9ae0	[IR] Allow !range on vector of integer instructions Inspired by https://reviews.llvm.org/D144467#4188310, this allows !range on vector of integer instructions, with the usual element-wise interpretation, which is already used by various analysis APIs that support vectors. Differential Revision: https://reviews.llvm.org/D145920	2023-03-14 09:41:56 +01:00
Yeting Kuo	b2c48559c8	[IR][DAG][RISCV] Allow scalable vector ISD::STRICT_FP_EXTEND and RISC-V supports for vector ISD::STRICT_FP_EXTEND. The patch mainly does two things. The first is allowing scalable vector ISD::STRICT_FP_EXTEND. The second is making RISC-V customized lower strict_fpextend to riscv_strict_fpextend_vl, the strict version of riscv_fpextend_vl. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D145548	2023-03-09 17:37:59 +08:00
J. Ryan Stinnett	1f9d42aeda	[DebugInfo] Remove `dbg.addr` from IR Part of `dbg.addr` removal Discussed in https://discourse.llvm.org/t/what-is-the-status-of-dbg-addr/62898 Differential Revision: https://reviews.llvm.org/D144801	2023-03-02 09:29:44 +00:00
Matt Arsenault	c6f64c5d88	Verifier: Don't rely on bitmask enum when checking nofpclass value	2023-02-24 09:39:32 -04:00
Matt Arsenault	5da674492a	IR: Add nofpclass parameter attribute This carries a bitmask indicating forbidden floating-point value kinds in the argument or return value. This will enable interprocedural -ffinite-math-only optimizations. This is primarily to cover the no-nans and no-infinities cases, but also covers the other floating point classes for free. Textually, this provides a number of names corresponding to bits in FPClassTest, e.g. call nofpclass(nan inf) @must_be_finite() call nofpclass(snan) @cannot_be_snan() This is more expressive than the existing nnan and ninf fast math flags. As an added bonus, you can represent fun things like nanf: declare nofpclass(inf zero sub norm) float @only_nans() Compared to nnan/ninf: - Can be applied to individual call operands as well as the return value - Can distinguish signaling and quiet nans - Distinguishes the sign of infinities - Can be safely propagated since it doesn't imply anything about other operands. - Does not apply to FP instructions; it's not a flag This is one step closer to being able to retire "no-nans-fp-math" and "no-infs-fp-math". The one remaining situation where we have no way to represent no-nans/infs is for loads (if we wanted to solve this we could introduce !nofpclass metadata, following along with noundef/!noundef). This is to help simplify the GPU builtin math library distribution. Currently the library code has explicit finite math only checks, read from global constants the compiler driver needs to set based on the compiler flags during linking. We end up having to internalize the library into each translation unit in case different linked modules have different math flags. By propagating known-not-nan and known-not-infinity information, we can automatically prune the edge case handling in most functions if the function is only reached from fast math uses.	2023-02-24 07:41:29 -04:00

1 2 3 4 5 ...

1000 Commits