clang-p2996

Author	SHA1	Message	Date
Christudasan Devadasan	b5efec4b27	[CodeGen] Additional Register argument to storeRegToStackSlot/loadRegFromStackSlot With D134950, targets get notified when a virtual register is created and/or cloned. Targets can do the needful with the delegate callback. AMDGPU propagates the virtual register flags maintained in the target file itself. They are useful to identify a certain type of machine operands while inserting spill stores and reloads. Since RegAllocFast spills the physical register itself, there is no way its virtual register can be mapped back to retrieve the flags. It can be solved by passing the virtual register as an additional argument. This argument has no use when the spill interfaces are called during the greedy allocator or even the PrologEpilogInserter and can pass a null register in such cases. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D138656	2022-12-17 11:55:34 +05:30
Yeting Kuo	982a586ab4	[RISCV] Emit .variant_cc directives for vector function calls. The patch is splitted from D103435. The patch emits .variant_cc [0] for those function calls that have vector arguments or vector return values. [0]: https://github.com/riscv/riscv-elf-psabi-doc/pull/190 Initial authored by: HsiangKai Reviewed By: reames Differential Revision: https://reviews.llvm.org/D139414	2022-12-16 13:51:39 +08:00
Anton Sidorenko	37f9eec142	[RISCV] Allow conversion of fp divisions to fp multiplications by the reciprocal If the divisor is repeated at least twice, we will convert the FDIVs to the calculation of the reciprocal and FMULs. We perform the transformation only under fast-math mode. FDIVs must have 'arcp' flag. Differential Revision: https://reviews.llvm.org/D140024	2022-12-15 13:00:36 +03:00
Philip Reames	d86011984e	[RISCV] Avoid generate large LMUL vmv.s.x or fvmv.s.f This is a follow up to patch discussion on D139656. As noted there, M2/M4/M8 versions of these instructions don't actually exist, and using them results in overly constrained register allocation. In that review, we'd talked about moving towards a variant of the instructions which ignored LMUL. I decided to see what happened if we just stopped generating the high LMUL variants, and the results are surprisingly neutral. I only see one minor thing which looks like a real regression among all the churn. I think this is worth doing now to loosen register allocation constraints, and avoid digging our hole around these instructions deeper while thinking about the right model change. Differential Revision: https://reviews.llvm.org/D140027	2022-12-14 10:53:34 -08:00
Yeting Kuo	ad68586a37	[VP][RISCV] Add vp.ctpop and RISC-V support. The patch also adds expandVPCTPOP in TargetLowering to expand VP_CTPOP nodes. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D139920	2022-12-14 09:47:44 +08:00
Philip Reames	668cde81df	[RISCV] Reuse VL (if non-zero) when building single element vector for start of reduction chain This is an alternative patch on a path to D137530. The basic problem being tackled here is that we need to place a scalar into lane 0 of a vector register before our reduction instructions. Since we only care about lane 0 of the vector, we can use any VL >= 1 provided that the total amount of work performed matches the work performed for a VL=1. This change does not contain the logic from D137530 to perform the insert at the original VT, and then extract down to LMUL1. That turns out to be a good choice, as discussion in this review has indicated there are issues around LMUL2 and above with our representation of vmv.s.x. We'd also need to be careful with the splat logic for the same reasons. The only potentially concerning codegen change I spot here is that we stop using a broadcast load (for VL=1) and instead do a scalar load and insert. I think this is probably reasonable; if reviewers disagree, I can investigate using a broadcast load which writes to the undef lanes. If we want to do that, we should do it for VECTOR_INSERT_ELT as well, so that'll end up as it's own patch series. Differential Revision: https://reviews.llvm.org/D139656	2022-12-13 12:16:26 -08:00
Craig Topper	bee9a92aec	[RISCV] Use reduction result type for EXTRACT_VECTOR_ELT in lowerReductionSeq. Remove the call to getSExtOrTrunc. Reduction ISD nodes produce a scalar result and that result is allowed to be larger than the vector element type due to type legalization. This is the same rule we allow for EXTRACT_VECTOR_ELT for the same reason. We can copy the result type over from the reduction node to EXTRACT_VECTOR_ELT. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D139757	2022-12-13 09:10:36 -08:00
Philip Reames	ecabba04a3	[RISCV] Use lowerScalarInsert when folding op into reduction [nfc] This doesn't cause any functional change since this is being applied to a insert generated by the same routine. This is mostly about consolidating the logic for vmv.s.x into one place to simplify future changes.	2022-12-13 09:08:39 -08:00
Philip Reames	44e0427cf0	[RISCV] Use lowerScalarInsert in lowerReductionSeq [nfc] Use the newly introduced helper routine. At the moment, this generates the same code (at this call site!) since LMUL is restricted to LMUL1 or less, and VL is hard coded to 1. In a future patch, I will loosen the second part.	2022-12-13 09:08:39 -08:00
Philip Reames	8adde6941a	[RISCV] Use vmv.v.i for insertion into lane 0 of undef vector when profitable If we're initializing lane 0 of an undef vector, we can optionally write to other lanes of the vector. Doing so may require additional work, so we don't want to e.g. always use a splat. However, since we don't have an immediate form of vmv.s.x it's useful to use a vmv.v.i if the work required is expected to be equal in practice. We restrict this to when LMUL <= 1 to a) prevent doing additional work at higher LMULs, and b) avoid overconstraining the register allocator. At the moment, the new utility is only used by one case in INSERT_VECTOR_ELT lowering. My expectation is that we will reuse this in a couple other places, but each of those deserve individual review. This change is inspired by D137530, but is not directly related to it. I vaguely remember we discussed the tradeoffs of using vmv.v.i in another recent review, but couldn't find it. Differential Revision: https://reviews.llvm.org/D139648	2022-12-13 07:54:46 -08:00
Alexey Baturo	54e72dd4eb	re-land [RISC-V][HWASAN] Support tagging global variables for RISC-V HWASAN Now with fix to limit added tagged-globals.ll to risc-v platform -- [RISC-V][HWASAN] Support tagging global variables for RISC-V HWASAN Reviewed by: luismarques Differential Revision: https://reviews.llvm.org/D132995	2022-12-13 15:51:51 +03:00
Alexey Baturo	5e89876538	Revert "[RISC-V][HWASAN] Support tagging global variables for RISC-V HWASAN" This reverts commit `11937ca564`.	2022-12-13 15:17:40 +03:00
Alexey Baturo	11937ca564	[RISC-V][HWASAN] Support tagging global variables for RISC-V HWASAN Reviewed by: luismarques Differential Revision: https://reviews.llvm.org/D132995	2022-12-13 14:57:34 +03:00
Philip Reames	a4b45c28a1	[RISCV] Allow fractional LMUL for reduction start value For reductions, we need to put the start value into a source vector. For fractional LMULs, we can perform the operation at the original LMUL. For LMUL > 1, we eventually want to use a scalar insert, but that's outside the scope of this patch. Differential Revision: https://reviews.llvm.org/D139747	2022-12-12 09:08:21 -08:00
jacquesguan	c2f199fa48	[DAGCombiner] Scalarize extend/truncate for splat vector. This revision scalarizes extend/truncate for splat vector. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D122875	2022-12-12 14:53:10 +08:00
Yeting Kuo	47b9da72e0	[VP][RISCV] Add vp.bitreverse and RISC-V support. The patch also added function expandVPBITREVERSE to expand ISD::VP_BITREVERSE nodes. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D139697	2022-12-12 10:58:44 +08:00
Craig Topper	eec0ac9726	[RISCV] clang-format lowerReductionSeq. NFC Wraps a long line to 80 columns.	2022-12-09 17:03:35 -08:00
Philip Reames	1ebe8f4c45	[RISCV] Share reduction lowering code for vp.reduce We can consolidate code and clarify edge case behavior at the same time. There are two functional differences here. First, I remove the ResVT handling, and always use the reduction element type. This appears to be dead code. There's no test coverage, and this code doesn't need to account for scalar type legalization anyways. Second, if the VL happens to be known non-zero, we can avoid passing through start. This is mostly needed to allow reuse of the existing code; I don't consider it interesting as an optimization on it's own. Differential Revision: https://reviews.llvm.org/D139733	2022-12-09 12:22:59 -08:00
Philip Reames	4e5b3f6307	[RISCV] Consolidate a bit of common logic for forming reductions There's several patches in flght which change this code, better to only have one copy. The VP case is left seperate for the moment as the result value type differs.	2022-12-09 08:18:51 -08:00
Craig Topper	66ff073182	[RISCV] Support F16 vectors with Zfhmin+Zvfh. I've enabled Zfhmin on 2 basic tests to show this isn't completely broken. Reviewed By: monkchiang Differential Revision: https://reviews.llvm.org/D139562	2022-12-07 19:14:11 -08:00
Craig Topper	258bb453fb	[RISCV] Without Zfh, promote f16 inputs before creating RISCVISD::FCVT_W(U)_RV64 nodes. This allows us to remove a couple more Zfhmin isel patterns.	2022-12-07 12:25:30 -08:00
Craig Topper	e3540fb948	[RISCV] Promote f16 fp_to_int_sat with Zfhmin during lowering instead of isel. We already have a custom handler for FP_TO_(S/U)INT_SAT. It's easy enought to inject an FP_EXTEND in there.	2022-12-07 11:58:30 -08:00
Yeting Kuo	0f8c761c48	[VP][RISCV] Recommit "Add vp.fshl/fshr and RISC-V support." This reverts commit `7883e5b061`. The original commit was reverted that it didn't update test files after D136263 landed. The recommit fixed those. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D139509	2022-12-07 15:58:12 +08:00
Kazu Hirata	7883e5b061	Revert "[VP][RISCV] Add vp.fshl/fshr and RISC-V support." This reverts commit `70de0e0140`. I'm seeing: Failed Tests (2): LLVM :: CodeGen/RISCV/rvv/fixed-vectors-fshr-fshl-vp.ll LLVM :: CodeGen/RISCV/rvv/fshr-fshl-vp.ll Also reported at: https://lab.llvm.org/buildbot/#/builders/123/builds/14531	2022-12-06 22:27:43 -08:00
Monk Chiang	7b50c18360	[RISCV] Codegen support for Zfhmin. The Zfhmin subset only has FLH, FSH, FMV.X.H, FMV.H.X, FCVT.S.H, and FCVT.H.S. If the D extension is present, the FCVT.D.H and FCVT.H.D instructions are also included. Since most instructions are not included for Zfhmin, so most operations are promoted. The patch primarily about making f16 a legal type. RISC-V ISA info: https://wiki.riscv.org/display/HOME/Recently+Ratified+Extensions Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D139391	2022-12-06 22:14:15 -08:00
Yeting Kuo	70de0e0140	[VP][RISCV] Add vp.fshl/fshr and RISC-V support. The patch made VectorLegalizer expand ISD::VP_FSHL and ISD::VP_FSHR to achieve the codegen. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D138379	2022-12-07 12:16:36 +08:00
jacquesguan	f7a46aa8fb	[RISCV] Fold vector binary operatrion into select with identity constant. This patch implements shouldFoldSelectWithIdentityConstant for RISCV. It would try to generate vmerge after the binary instruction and let them folded to maksed instruction later. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D131551	2022-12-06 11:19:31 +08:00
ChunyuLiao	85834d8685	[RISCV]Keep (select c, 0/-1, X) during PerformDAGCombine D135833, lowerSelect: (select C, -1/0, X) -> or/and Keep (select c, 0/-1, X), thus making better use of lowerSelect to eliminate branch instructions. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D139272	2022-12-06 09:26:29 +08:00
Kazu Hirata	3c09ed006a	[llvm] Use std::nullopt instead of None in comments (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-04 17:12:44 -08:00
Fangrui Song	b0df70403d	[Target] llvm::Optional => std::optional The updated functions are mostly internal with a few exceptions (virtual functions in TargetInstrInfo.h, TargetRegisterInfo.h). To minimize changes to LLVMCodeGen, GlobalISel files are skipped. https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-04 22:43:14 +00:00
Kazu Hirata	20cde15415	[Target] Use std::nullopt instead of None (NFC) This patch mechanically replaces None with std::nullopt where the compiler would warn if None were deprecated. The intent is to reduce the amount of manual work required in migrating from Optional to std::optional. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-02 20:36:06 -08:00
Krzysztof Parzyszek	864aaa21b4	TargetLowering: convert Optional to std::optional	2022-12-01 16:19:10 -08:00
Philip Reames	7d82c99403	[RISCV][TTI] Account for constant materialization cost when costing arithmetic operations At the IR level, we generally assume that constants are free to materialize. However, for RISCV due to some quirks of the ISA, materializing arbitrary constants can be rather expensive. We frequently fallback to constant pool loads. We've been slowly moving in the direction of modeling the cost of the remat as part of the instruction cost. This has the effect of disincentivizing vectorization - mostly SLP - when we'd have to materialize an expensive constant. We need better modeling of which constants are expensive and not, but the moment let's be consistent with how we model arithmetic and memory instructions. The difference between the two is that arithmetic can sometimes fold a splat operation which stores can not. Differential Revision: https://reviews.llvm.org/D138941	2022-11-30 07:20:51 -08:00
Philip Reames	b25672ba82	[RISCV] Separate out helper for checking if vector splat supported for operand [nfc]	2022-11-29 11:05:46 -08:00
Kazu Hirata	2f61c6c639	[RISCV] Use std::optional in RISCVISelLowering.cpp (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-25 23:04:58 -08:00
LiaoChunyu	aa14f002d5	[RISCV] Branchless lowering for (select (x < 0), TrueConstant, FalseConstant) and (select (x >= 0), TrueConstant, FalseConstant) This patch reduces the number of unpredictable branches (select (x < 0), y, z) -> x >> (XLEN - 1) & (y - z) + z (select (x >= 0), y, z) -> x >> (XLEN - 1) & (z - y) + y Reviewed By: craig.topper, reames Differential Revision: https://reviews.llvm.org/D137949	2022-11-25 20:18:30 +08:00
wangpc	241accea2a	[RISCV] Lower unmasked zero-stride vector load to (scalar load + splat) So we have the opportunity to fold splat into .vx instruction as what D101138 has done. If failed, we can select zero-stride vector load again. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D138101	2022-11-24 11:09:45 +08:00
WuXinlong	219417b2c6	[RISCV] Add CodeGen support and MC testcase of RISCV Zca Extension This patch add the support of RISCV Zca ext `Zca` is a subset of C extension instructions that are compatible with the Zc extension. So this patch implements Zca code generation with reference to the C extension and sets the 2-byte alignment for the Zca extension, just like C extension does. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D130483	2022-11-22 17:22:26 +08:00
Han-Kuan Chen	7e6dbfcd9d	[RISCV] Make lowerVECTOR_SHUFFLEAsVSlidedown follow source until not EXTRACT_SUBVECTOR. Current lowerVECTOR_SHUFFLEAsVSlidedown only seeks whether input are EXTRACT_SUBVECTOR and their source are same. The commit will make the function seek input and their source until they are not EXTRACT_SUBVECTOR. Differential Revision: https://reviews.llvm.org/D138025	2022-11-17 22:32:53 -08:00
Stanislav Mekhanoshin	bcaf31ec3f	[AMDGPU] Allow finer grain control of an unaligned access speed A target can return if a misaligned access is 'fast' as defined by the target or not. In reality there can be different levels of 'fast' and 'slow'. This patch changes the boolean 'Fast' argument of the allowsMisalignedMemoryAccesses family of functions to an unsigned representing its speed. A target can still define it as it wants and the direct translation of the current code uses 0 and 1 for current false and true. This makes the change an NFC. Subsequent patch will start using an actual value of speed in the load/store vectorizer to compare if a vectorized access going to be not just fast, but not slower than before. Differential Revision: https://reviews.llvm.org/D124217	2022-11-17 09:23:53 -08:00
Craig Topper	7e15ea102f	[RISCV] Add a DAG combine to pre-promote (i1 (truncate (i32 (srl X, Y)))) with Zbs on RV64. Type legalization will want to turn (srl X, Y) into RISCVISD::SRLW, which will prevent us from using a BEXT instruction. This is similar to what we do for (i32 (and (srl X, Y), 1)).	2022-11-16 19:07:33 -08:00
Craig Topper	5c9b03faef	[RISCV] Remove duplicate setOperationAction. NFC	2022-11-16 16:54:27 -08:00
Yeting Kuo	ed9638c44b	[VP][RISCV] Add vp.nearbyint and RISC-V support. nearbyint has the property to execute without exception. For not modifying fflags, the patch added new machine opcode PseudoVFROUND_NOEXCEPT_V that expands vfcvt.x.f.v and vfcvt.f.x.v between a pair of frflags and fsflags. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D137685	2022-11-16 14:05:35 +08:00
Yeting Kuo	5c3ca10b09	[VP][RISCV] Add vp.bswap and RISC-V support. The patch also added function expandVPBSWAP to expand ISD::VP_BSWAP nodes. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D137928	2022-11-16 11:36:38 +08:00
wangpc	a214c521f8	[RISCV] Don't use zero-stride vector load for gather if not optimized We may form a zero-stride vector load when lowering gather to strided load. As what D137699 has done, we use `load+splat` for this form if there is no optimized implementation. We restrict this to unmasked loads currently in consideration of the complexity of hanlding all falses masks. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D137931	2022-11-16 10:43:10 +08:00
Han-Kuan Chen	aa47bfa9bc	[RISCV] Refactor getDefaultVLOps. NFC. Current getDefaultVLOps can only deduce VL from a MVT. However, sometimes users have already known VL value. This commit will provide a uniform interface to get VL instead of calling DAG.getConstant. Differential Revision: https://reviews.llvm.org/D138003	2022-11-15 18:11:11 -08:00
Craig Topper	25dcca60f4	[RISCV] Teach shouldSinkOperands that vp.add and friends are commutative. We previously had a bug that our isel patterns weren't commutative, but that has been fixed for a while.	2022-11-14 22:01:59 -08:00
Craig Topper	dde8423f21	[RISCV] Expand i32 abs to negw+max at isel. This adds a RISCVISD::ABSW to remember that we started with an i32 abs. Previously we used a DAG combine of (sext_inreg (abs)) to delay emitting a freeze from type legalization in order to make ComputeNumSignBits optimizations work on other promoted nodes. This new approach always uses negw+max even if the result doesn't need to be sign extended. This helps the RISCVSExtWRemoval pass if the sext.w is in another basic block.	2022-11-14 19:44:05 -08:00
Yeting Kuo	0c0681b741	[RISCV][NFC] Remove dead code. All ISD::BSWAP nodes are not customized lowered in RISC-V now, so the patch removed dead code for ISD::BSWAP in LowerOperation. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D137907	2022-11-14 10:08:48 +08:00
Yeting Kuo	06a7e04be4	[RISCV][NFC] Fix unused variable warning. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D137633	2022-11-10 20:23:09 +08:00

1 2 3 4 5 ...

889 Commits