clang-p2996

Author	SHA1	Message	Date
Pavel Kosov	671ea052fb	[TableGen] Emit separate computeRequiredFeatures() function A function is already emitted in GenInstrInfo.inc that takes Opcode number and a set of supported Features and reports fatal error if some of the required features are missing. The information about features required by the particular opcode can be reused by llvm-exegesis, so move its computation info a separate computeRequiredFeatures() function. Then verifyInstructionPredicates() can just compare the sets of available and required features computed by the other functions. This commit moves the definition of FeatureBitsets[] as well as CEFBS_ enumerator values (that are indices into FeatureBitsets[] array) inside the computeRequiredFeatures() function because these are implementation details of that function. The inclusion of potentially huge computeRequiredFeatures() function is now controlled by a dedicated macro that is set for simplicity by TableGen-erated code itself if `defined(ENABLE_INSTR_PREDICATE_VERIFIER) && !defined(NDEBUG)`. ~~ Huawei RRI, OS Lab Reviewed By: courbet Differential Revision: https://reviews.llvm.org/D148516	2023-06-07 18:23:21 +03:00
pvanhout	d19a3834dc	[NFC][RFC][TableGen] Split GlobalISelEmitter.cpp This patch splits the GlobalISelEmitter.cpp file, which imports DAG ISel patterns for GISel, into separate "GISelMatchTable.h/cpp" files. The main motive is readability & maintainability. GlobalISelEmitter.cpp was about 6400 lines of mixed code, some bits implementing the match table codegen, some others dedicated to importing DAG patterns. Now it's down to 2700 + a 2150 header + 2000 impl. It's a tiny bit more lines overall but that's to be expected - moving inline definitions to out-of-line, adding comments in the .cpp, etc. all of that takes additional space, but I think the tradeoff is worth it. I did as little unrelated code changes as possible, I would say the biggest change is the introduction of the `gi` namespace used to prevent name conflicts/ODR violations with type common names such as `Matcher`. It was previously not an issue because all of the code was in an anonymous namespace. This moves all of the "match table" code out of the file, so predicates, rules, and actions are all separated now. I believe this helps separating concerns, now `GlobalISelEmitter.cpp` is more focused on importing DAG patterns into GI, instead of also containing the whole match table internals as well. Note: the new files have a "GISel" prefix to make them distinct from the other "GI" files in the same folder, which are for the combiner. Reviewed By: aemerson Differential Revision: https://reviews.llvm.org/D151432	2023-06-07 09:34:57 +02:00
Simon Pilgrim	7c80747d33	Fix unused variable warning. NFC.	2023-06-06 11:45:22 +01:00
pvanhout	bbcd998efd	Revert "[NFC][RFC][TableGen] Split GlobalISelEmitter.cpp" This reverts commit `79caedf5f8`.	2023-06-05 09:38:22 +02:00
pvanhout	79caedf5f8	[NFC][RFC][TableGen] Split GlobalISelEmitter.cpp This patch splits the GlobalISelEmitter.cpp file, which imports DAG ISel patterns for GISel, into separate "GISelMatchTable.h/cpp" files. The main motive is readability & maintainability. GlobalISelEmitter.cpp was about 6400 lines of mixed code, some bits implementing the match table codegen, some others dedicated to importing DAG patterns. Now it's down to 2700 + a 2150 header + 2000 impl. It's a tiny bit more lines overall but that's to be expected - moving inline definitions to out-of-line, adding comments in the .cpp, etc. all of that takes additional space, but I think the tradeoff is worth it. I did as little unrelated code changes as possible, I would say the biggest change is the introduction of the `gi` namespace used to prevent name conflicts/ODR violations with type common names such as `Matcher`. It was previously not an issue because all of the code was in an anonymous namespace. This moves all of the "match table" code out of the file, so predicates, rules, and actions are all separated now. I believe this helps separating concerns, now `GlobalISelEmitter.cpp` is more focused on importing DAG patterns into GI, instead of also containing the whole match table internals as well. Note: the new files have a "GISel" prefix to make them distinct from the other "GI" files in the same folder, which are for the combiner. Reviewed By: aemerson Differential Revision: https://reviews.llvm.org/D151432	2023-06-05 09:27:48 +02:00
Sergei Barannikov	7a258706e3	[CodeGen] Fix incorrect usage of MCPhysReg for diff list elements The lists contain differences between register numbers, not the register numbers themselves. Since a difference can also be negative, this also changes its type to signed. Changing the type to signed exposed a "bug". For AMDGPU, which has many registers, the first element of a sequence could be as big as ~45k. The value does not fit into int16_t, but fits into uint16_t. The bug didn't show up because of unsigned wrapping and truncation of the Val field in the advance() method. To fix the issue, I changed the way regunit difflists are encoded. The 4-bit 'scale' field of MCRegisterDesc::RegUnit was replaced by 12-bit number of the first regunit, and the first element of each of the lists was removed. The higher 20 bits of RegUnit field contain the initial offset into DiffLists array. AMDGPU has 1'409 regunits (2^12 = 4'096), and the biggest offset is 80'041 (2^20 = 1'048'576). That is, there is enough room. Changing the encoding method also resulted in a smaller array size, the numbers are below (I omitted targets with less than 100 elements). ``` AMDGPU \| 80052 \| 78741 \| -1,6% RISCV \| 6498 \| 6297 \| -3,1% ARM \| 4181 \| 3966 \| -5,1% AArch64 \| 2770 \| 2592 \| -6,4% PPC \| 1578 \| 1441 \| -8,7% Hexagon \| 994 \| 740 \| -25,6% R600 \| 508 \| 398 \| -21,7% VE \| 471 \| 459 \| -2,5% Sparc \| 381 \| 363 \| -4,7% X86 \| 326 \| 208 \| -36,2% Mips \| 253 \| 200 \| -20,9% SystemZ \| 186 \| 162 \| -12,9% ``` Reviewed By: foad, arsenm Differential Revision: https://reviews.llvm.org/D151036	2023-06-04 14:01:04 +03:00
Nitin John Raj	aa7eace843	[TableGen][GlobalISel] Account for HwMode in RegisterBank register sizes This patch adds logic for determining RegisterBank size to RegisterBankInfo, which allows accounting for the HwMode of the target. Individual RegisterBanks cannot be constructed with HwMode information as construction is generated by TableGen, but a RegisterBankInfo subclass can provide the HwMode as a constructor argument. The HwMode is used to select the appropriate RegisterBank size from an array relating sizes to RegisterBanks. Targets simply need to provide the HwMode argument to the <target>GenRegisterBankInfo constructor. The RISC-V RegisterBankInfo constructor has been updated accordingly (plus an unused argument removed). Reviewed By: simoncook, craig.topper Differential Revision: https://reviews.llvm.org/D76007	2023-06-02 23:14:17 -07:00
Stanislav Mekhanoshin	a15eb89aba	[TableGen] Allow bit fields in SearchableTables. Differential Revision: https://reviews.llvm.org/D151756	2023-06-02 13:49:07 -07:00
Amara Emerson	7f374b6902	[GlobalISel] Delete code in GIMatcher complaining about unreachable rules. Fixes #62897	2023-06-01 14:26:57 -07:00
Luo, Yuanke	752f9d02cc	[NFC][TableGen] Remove dead code. Differential Revision: https://reviews.llvm.org/D151635	2023-05-29 15:59:33 +08:00
Wang, Xin10	fbb241c552	use ref to avoid copy in range for-loop Use big obj copy in range for-loop will call copy constructor every time, which can be avoided by use ref instead. Reviewed By: skan Differential Revision: https://reviews.llvm.org/D150024	2023-05-28 22:24:21 -04:00
Craig Topper	d91f65ea36	[TableGen] Filter duplicate predicates in PatternToMatch::getPredicateRecords. Recent changes to RISC-V cause the same predicate to appear in the predicate list multiple times in some cases. This patch filters the duplicates to reduce the number of predicate string variations.	2023-05-23 13:32:18 -07:00
Sergei Barannikov	4faf3aaf28	Revert "[CodeGen] Fix incorrect usage of MCPhysReg for diff list elements" This reverts commit `fa2827f079`. Causes build bot failres: https://lab.llvm.org/buildbot/#/builders/38/builds/12037	2023-05-23 05:14:34 +03:00
Sergei Barannikov	fa2827f079	[CodeGen] Fix incorrect usage of MCPhysReg for diff list elements The lists contain differences between register numbers, not the register numbers themselves. Since a difference can also be negative, this also changes its type to signed. Changing the type to signed exposed a "bug". For AMDGPU, which has many registers, the first element of a sequence could be as big as ~45k. The value does not fit into int16_t, but fits into uint16_t. The bug didn't show up because of unsigned wrapping and truncation of the Val field in the advance() method. To fix the issue, I changed the way regunit difflists are encoded. The 4-bit 'scale' field of MCRegisterDesc::RegUnit was replaced by 12-bit number of the first regunit, and the first element of each of the lists was removed. The higher 20 bits of RegUnit field contain the initial offset into DiffLists array. AMDGPU has 1'409 regunits (2^12 = 4'096), and the biggest offset is 80'041 (2^20 = 1'048'576). That is, there is enough room. Changing the encoding method also resulted in a smaller array size, the numbers are below (I omitted targets with less than 100 elements). ``` AMDGPU \| 80052 \| 78741 \| -1,6% RISCV \| 6498 \| 6297 \| -3,1% ARM \| 4181 \| 3966 \| -5,1% AArch64 \| 2770 \| 2592 \| -6,4% PPC \| 1578 \| 1441 \| -8,7% Hexagon \| 994 \| 740 \| -25,6% R600 \| 508 \| 398 \| -21,7% VE \| 471 \| 459 \| -2,5% Sparc \| 381 \| 363 \| -4,7% X86 \| 326 \| 208 \| -36,2% Mips \| 253 \| 200 \| -20,9% SystemZ \| 186 \| 162 \| -12,9% ``` Reviewed By: foad, arsenm Differential Revision: https://reviews.llvm.org/D151036	2023-05-23 04:10:53 +03:00
Shengchen Kan	c81a121f3f	Revert "Revert "[X86] Remove patterns for ADC/SBB with immediate 8 and optimize during MC lowering, NFCI"" This reverts commit `cb16b33a03`. In fact, the test https://bugs.chromium.org/p/chromium/issues/detail?id=1446973#c2 already passed after `5586bc539a`	2023-05-19 22:21:56 +08:00
Hans Wennborg	cb16b33a03	Revert "[X86] Remove patterns for ADC/SBB with immediate 8 and optimize during MC lowering, NFCI" This caused compiler assertions, see comment on https://reviews.llvm.org/D150107. This also reverts the dependent follow-up change: > [X86] Remove patterns for ADD/AND/OR/SUB/XOR/CMP with immediate 8 and optimize during MC lowering, NFCI > > This is follow-up of D150107. > > In addition, the function `X86::optimizeToFixedRegisterOrShortImmediateForm` can be > shared with project bolt and eliminates the code in X86InstrRelaxTables.cpp. > > Differential Revision: https://reviews.llvm.org/D150949 This reverts commit `2ef8ae1348` and `5586bc539a`.	2023-05-19 14:43:33 +02:00
Shengchen Kan	5586bc539a	[X86] Remove patterns for ADD/AND/OR/SUB/XOR/CMP with immediate 8 and optimize during MC lowering, NFCI This is follow-up of D150107. In addition, the function `X86::optimizeToFixedRegisterOrShortImmediateForm` can be shared with project bolt and eliminates the code in X86InstrRelaxTables.cpp. Differential Revision: https://reviews.llvm.org/D150949	2023-05-19 18:22:30 +08:00
Kazu Hirata	43cd59d5df	[TableGen] Remove unused getMinimalTypeForEnumBitfield The last use was removed by: commit `e98944ed47` Author: Stanislav Mekhanoshin <Stanislav.Mekhanoshin@amd.com> Date: Mon Mar 11 17:04:35 2019 +0000	2023-05-17 20:32:37 -07:00
Sergei Barannikov	da42b2846c	[CodeGen] Support allocating of arguments by decreasing offsets Previously, `CCState::AllocateStack` always allocated stack space by increasing offsets. For targets with stack growing up (away from zero) it is more convenient to allocate arguments by decreasing offsets, so that the first argument is at the top of the stack. This is important when calling a function with variable number of arguments: the callee does not know the size of the stack, but must be able to access "fixed" arguments. For that to work, the "fixed" arguments should have fixed offsets relative to the stack top, i.e. the variadic arguments area should be at the stack bottom (at lowest addresses). The in-tree target with stack growing up is AMDGPU, but it allocates arguments by increasing addresses. It does not support variadic arguments. A drive-by change is to promote stack size/offset to 64-bit integer. This is what MachineFrameInfo expects. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D149575	2023-05-17 21:51:52 +03:00
XinWang10	744b12adb4	[X86]check that Uses, Defs are same for entries in memory folding table Add expensive check that Uses, Defs are same for entries in memory folding table. MemFolding could not change the Uses/Defs. Reviewed By: skan Differential Revision: https://reviews.llvm.org/D150633	2023-05-16 22:53:52 -04:00
Wang, Xin10	8a5450d322	Fix regression after D150436 llvm-clang-x86_64-expensive-checks-debian will fail after D150436 merged. The fail occurred in X86, I changed the sort rule in AsmMatcher in Patch D150436, so x86 code will arrive line 633 first(will not affect other targets). The logic here want to use the order record written in source file to make AsmMatcher to first use AVX instructions, it used field HasPositionOrder. But the condition here just makes sure one of the compared record is subclass of Instruction and has field HasPositionOrder true, and didn't check another. (Committing on behalf of @XinWang10 to unblock broken expensive-cjhecks builds) Differential Revision: https://reviews.llvm.org/D150651	2023-05-16 13:04:44 +01:00
Wang, Xin10	9a24ba2397	Correct the sort logic in AsmMatcherEmmitter.cpp The logic from line 633 to 640 is specific for ARM as the comments said, it will make all the targets will prefer to using instruction with more predicates when compiler do AsmMatching. And for code from line 642 to 649, X86 want to use the order records written in source file to sort the instructions. So X86 could be affected by this logic. (These code could be arrived only by X86) After change this, seems AVX instructions have not be affected but it exposed some other errors for instruction push and call. CALLpcrel16 could not be used in 64 bit mode, we need add Predicate for it. And for push instruction, previously because pushi32 has predicates = [Not64bitmode], so it precede pushi16, which is incorrect here, we should get pushw here and it also align with gcc. Reviewed By: skan Differential Revision: https://reviews.llvm.org/D150436	2023-05-16 02:44:02 -04:00
Francesco Petrogalli	4bfe410802	[TableGen][SubtargetEmitter] Add the StartAtCycles field in the WriteRes class. Conditions that need to be met: 1. count(StartAtCycle) == count(ReservedCycles); 2. For each i: StartAtCycles[i] < ReservedCycles[i]; 3. For each i: StartAtCycles[i] >= 0; 4. If left unspecified, the elements are set to 0. Differential Revision: https://reviews.llvm.org/D150310	2023-05-15 10:39:45 +02:00
Krzysztof Parzyszek	1b18064069	[TableGen] Print message about dropped patterns with -debug A selection pattern can be silently dropped if type inference fails to deduce some types. Print a message when that happens, and -debug was applied.	2023-05-10 14:51:57 -07:00
Matt Arsenault	a4610c2064	TableGen: Fix missing C++ mode comments	2023-05-10 08:01:27 +01:00
Alexey Vishnyakov	9c07aa75b9	[TableGen] Fix null pointer dereferences in TreePattern::ParseTreePattern() Bugs were found by Svace static analysis tool. Null pointers are dereferenced right after error checking that does not return from function. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D147706	2023-05-09 18:06:10 -07:00
Jon Roelofs	30b4351c7c	cmake: add missing dependencies on Attributes.inc Differential revision: https://reviews.llvm.org/D150144	2023-05-08 15:35:57 -07:00
Akshay Khadse	5c7c3af1d0	Reapply [Coverity] Fix explicit null dereferences This change fixes static code analysis errors Reviewed By: skan Differential Revision: https://reviews.llvm.org/D149506	2023-05-08 21:19:40 +08:00
NAKAMURA Takumi	631bfdbee5	Switch `llvm/CodeGen/MachineValueType.h` to the generated one Prune `SupportTests/MVTTest` since it is no longer needed. Depends on D148769 Differential Revision: https://reviews.llvm.org/D148770	2023-05-03 00:13:20 +09:00
NAKAMURA Takumi	5d71ec6e44	Split out `CodeGenTypes` from `CodeGen` for LLT/MVT This reduces dependencies on `llvm-tblgen` so much. `CodeGenTypes` depends on `Support` at the moment. Be careful to append deps on this, since Targets' tablegens depend on this. Depends on D149024 Differential Revision: https://reviews.llvm.org/D148769	2023-05-03 00:13:20 +09:00
NAKAMURA Takumi	c1221251fb	Restore CodeGen/MachineValueType.h from `Support` This is rework of; - rG13e77db2df94 (r328395; MVT) Since `LowLevelType.h` has been restored to `CodeGen`, `MachinveValueType.h` can be restored as well. Depends on D148767 Differential Revision: https://reviews.llvm.org/D149024	2023-05-03 00:13:20 +09:00
NAKAMURA Takumi	9cfeba5b12	Restore CodeGen/LowLevelType from `Support` This is rework of; - D30046 (LLT) Since I have introduced `llvm-min-tblgen` as D146352, `llvm-tblgen` may depend on `CodeGen`. `LowLevlType.h` originally belonged to `CodeGen`. Almost all userse are still under `CodeGen` or `Target`. I think `CodeGen` is the right place to put `LowLevelType.h`. `MachineValueType.h` may be moved as well. (later, D149024) I have made many modules depend on `CodeGen`. It is consistent but inefficient. It will be split out later, D148769 Besides, I had to isolate MVT and LLT in modmap, since `llvm::PredicateInfo` clashes between `TableGen/CodeGenSchedule.h` and `Transforms/Utils/PredicateInfo.h`. (I think better to introduce namespace llvm::TableGen) Depends on D145937, D146352, and D148768. Differential Revision: https://reviews.llvm.org/D148767	2023-05-03 00:13:19 +09:00
NAKAMURA Takumi	65365cff3b	Add deps on LLVMTableGenCommon even if it is actually unused.	2023-05-02 12:43:16 +09:00
NAKAMURA Takumi	137d8039e4	llvm-tblgen: Split out `obj.LLVMTableGenCommon` `$<TARGET_OBJECTS:llvm-min-tblgen>` was too lazy. It has `rc.res` in the list with MS toolchain. Fixup for D146352	2023-05-02 12:25:28 +09:00
NAKAMURA Takumi	243e8f8d23	Introduce `llvm-min-tblgen` to build public header files `llvm-min-tblgen` is capable of building `llvm/include/llvm`; - `-gen-attrs` - `-gen-directive-` - `-gen-intrinsics-` - `-gen-riscv-target-def` `llvm-min-tblgen` is built and used only when `llvm-tblgen` is built in-tree. This is not installed. `llvm-tblgen` is built with complete set and may be installed. `check-llvm` uses not `llvm-min-tblgen` but `llvm-tblgen`. `LLVM_TABLEGEN_PROJECT` overrides the definition of `tablegen(project)`. `LLVM_HEADERS` is used as the overridden prefix for LLVM header generators. If `EXPORT` is not specified in `add_tablegen`, its tablegen is treated as internal. Let `llvm-tblgen` depend on `intrinsics_gen` Depends on D149072 Differential Revision: https://reviews.llvm.org/D146352	2023-05-02 11:32:22 +09:00
Craig Topper	09f6bdda24	[RISCV] Remove INVALID from the list of CPUs in RISCVTargetParser. NFC This value is never used outside and is only used as a sentinel internally which we can solve with other means.	2023-05-01 15:26:09 -07:00
Sergei Barannikov	60f815d241	[TableGen] Forward declare CodeGenRegister et al. (NFC)	2023-04-30 07:01:00 +03:00
Matt Arsenault	bc37be1855	LangRef: Add "dynamic" option to "denormal-fp-math" This is stricter than the default "ieee", and should probably be the default. This patch leaves the default alone. I can change this in a future patch. There are non-reversible transforms I would like to perform which are legal under IEEE denormal handling, but illegal with flushing zero behavior. Namely, conversions between llvm.is.fpclass and fcmp with zeroes. Under "ieee" handling, it is legal to translate between llvm.is.fpclass(x, fcZero) and fcmp x, 0. Under "preserve-sign" handling, it is legal to translate between llvm.is.fpclass(x, fcSubnormal\|fcZero) and fcmp x, 0. I would like to compile and distribute some math library functions in a mode where it's callable from code with and without denormals enabled, which requires not changing the compares with denormals or zeroes. If an IEEE function transforms an llvm.is.fpclass call into an fcmp 0, it is no longer possible to call the function from code with denormals enabled, or write an optimization to move the function into a denormal flushing mode. For the original function, if x was a denormal, the class would evaluate to false. If the function compiled with denormal handling was converted to or called from a preserve-sign function, the fcmp now evaluates to true. This could also be of use for strictfp handling, where code may be changing the denormal mode. Alternative name could be "unknown". Replaces the old AMDGPU custom inlining logic with more conservative logic which tries to permit inlining for callees with dynamic handling and avoids inlining other mismatched modes.	2023-04-29 08:44:59 -04:00
NAKAMURA Takumi	24706aff15	TableGen: Replace `IntrinsicEmitter::ComputeFixedEncoding()` and cleanup Depends on D146915 Differential Revision: https://reviews.llvm.org/D145937	2023-04-26 23:49:08 +09:00
NAKAMURA Takumi	91b80ce417	TableGen: Implement TypeSig generator in `Intrinsics.td` This commit doesn't replace `IntrinsicEmitter::ComputeFixedEncoding()`, but compares outputs to it, to make sure implementation correct. Depends on D145871, D145872, D145874, and D146914 Differential Revision: https://reviews.llvm.org/D146915	2023-04-26 23:48:39 +09:00
NAKAMURA Takumi	5540e29a0a	Reformat	2023-04-26 23:47:15 +09:00
NAKAMURA Takumi	c49f850d55	Migrate `IIT_Info` into `Intrinsics.td` - Define `IIT_Info` in `Intrinsics.td` - Implement `EmitIITInfo` in `IntrinsicEmitter.cpp` - Use generated `IIT_Info` in `Function.cpp` Depends on D145873 and D146179 Differential Revision: https://reviews.llvm.org/D146914	2023-04-25 08:53:18 +09:00
NAKAMURA Takumi	ddaf085e7b	Fully generate `MachineValueType.h` Part of D146914	2023-04-25 08:53:18 +09:00
Tom Weaver	b63c08c773	Revert "[Coverity] Fix explicit null dereferences" This reverts commit `22b23a5213`. This commit caused the following two build bots to start failing: https://lab.llvm.org/buildbot/#/builders/216/builds/20322 https://lab.llvm.org/buildbot/#/builders/123/builds/18511	2023-04-24 11:14:10 +01:00
Craig Topper	d20c29d996	[TableGen] Make getRegisterValueType stricter about HwModes. I don't think this code would work correctly if the register class used used HwModes. Add asserts to make sure it's not used with HwModes. Also fix a long outdated comment on the function.	2023-04-23 23:28:23 -07:00
Shengchen Kan	9616fd1a7d	[X86][tablgen] Fix typo in comments, NFC	2023-04-24 13:57:00 +08:00
Craig Topper	7a6dc3d24c	[TableGen] Remove unused ForceMode and CodeGen fields from TypeInfer. NFC As well as the ForceMode field in PatternToMatch.	2023-04-23 20:32:14 -07:00
Craig Topper	24a8251b04	[TableGen] Remove unused method form ScopeMatcher. NFC	2023-04-23 00:44:50 -07:00
Craig Topper	a8aa43bb1a	[TableGen] Intialize vector with constructor instead of assign. NFC	2023-04-22 22:37:57 -07:00
Akshay Khadse	22b23a5213	[Coverity] Fix explicit null dereferences This change fixes static code analysis errors Reviewed By: skan Differential Revision: https://reviews.llvm.org/D148912	2023-04-23 12:07:11 +08:00

1 2 3 4 5 ...

5474 Commits