clang-p2996

Author	SHA1	Message	Date
Noah Goldstein	809b1d834d	[KnownBits] Return `0` for poison {s,u}div inputs It seems consistent to always return zero for known poison rather than varying the value. We do the same elsewhere. Differential Revision: https://reviews.llvm.org/D150922	2023-06-06 15:14:10 -05:00
Paulo Matos	9571a28ee4	[WebAssembly] Add tests ensuring rotates persist Due to the nature of WebAssembly, it's always better to keep rotates instead of trying to optimize it. Commit `9485d983` disabled the generation of fsh for rotates, however these tests ensure that future changes don't change the behaviour for the Wasm backend that tends to have different optimization requirements than other architectures. Also see: https://github.com/llvm/llvm-project/issues/62703 Differential Revision: https://reviews.llvm.org/D152126	2023-06-06 07:48:35 +02:00
Craig Topper	139392c0a5	[LegalizeTypes][ARM][AArch6][RISCV][VE][WebAssembly] Add special case for smin(X, -1) and smax(X, 0) to ExpandIntRes_MINMAX. We can compute a simpler expression for Lo for these cases. This is an alternative for the test cases in D151180 that works for more targets. This is similar to some of the special cases we have for expanding setcc operands. Differential Revision: https://reviews.llvm.org/D151182	2023-05-23 09:19:55 -07:00
Tobias Hieta	f84bac329b	[NFC][Py Reformat] Reformat lit.local.cfg python files in llvm This is a follow-up to `b71edfaa4e` since I forgot the lit.local.cfg files in that one. Reformatting is done with `black`. If you end up having problems merging this commit because you have made changes to a python file, the best way to handle that is to run git checkout --ours <yourfile> and then reformat it with black. If you run into any problems, post to discourse about it and we will try to help. RFC Thread below: https://discourse.llvm.org/t/rfc-document-and-standardize-python-code-style Reviewed By: barannikov88, kwk Differential Revision: https://reviews.llvm.org/D150762	2023-05-17 17:03:15 +02:00
Tobias Hieta	b71edfaa4e	[NFC][Py Reformat] Reformat python files in llvm This is the first commit in a series that will reformat all the python files in the LLVM repository. Reformatting is done with `black`. See more information here: https://discourse.llvm.org/t/rfc-document-and-standardize-python-code-style Reviewed By: jhenderson, JDevlieghere, MatzeB Differential Revision: https://reviews.llvm.org/D150545	2023-05-17 10:48:52 +02:00
Thomas Lively	72a72315b0	[WebAssembly] Mark @llvm.wasm.shuffle lane indices as immediates This intrinsic is meant to lower directly to the i8x16.shuffle instruction, which takes its lane index arguments as immmediates. The ISel for the intrinsic assumed that the lane index arguments were constants, so bitcode that "incorrectly" used this intrinsic with non-immediate arguments caused an assertion failure in the backend. Avoid the crash by defining the lane index arguments to be immediates, matching the underlying instruction. Update ISel accordingly. This change means that the bitcode that previously caused a crash will now fail to validate. Fixes #55559. Reviewed By: dschuff Differential Revision: https://reviews.llvm.org/D149898	2023-05-05 08:12:41 -07:00
Thomas Lively	abdb5e041c	[WebAssembly] Remove incorrect result from wasm64 store_lane instructions The wasm64 versions of the v128.storeX_lane instructions was incorrectly defined as returning a v128 value, which resulted in spurious drop instructions being emitted and causing validation to fail. This was not caught earlier because wasm64 has been experimental and not well tested. Update the relevant test file to test both wasm32 and wasm64. Fixes #62443. Differential Revision: https://reviews.llvm.org/D149780	2023-05-03 16:00:20 -07:00
Heejin Ahn	0e37487df8	[WebAssembly] Fix selection of global calls When selecting calls, currently we unconditionally remove `Wrapper`s of the call target. But we are supposed to do that only when the target is a function, an external symbol (= library function), or an alias of a function. Otherwise we end up directly calling globals that are not functions. Fixes https://github.com/llvm/llvm-project/issues/60003. Reviewed By: tlively, HerrCai0907 Differential Revision: https://reviews.llvm.org/D147397	2023-04-05 01:42:36 -07:00
Heejin Ahn	47fc0186e6	[WebAssembly] Move call_indirect_alloca to call.ll Not sure the distinction between `call.ll` and `call-indirect.ll`, because `call.ll` also seems to contain many `call_indirect` tests. Also before D147033 `call-indirect.ll` only contained a single test and it also tests it with `obj2yaml`, so I guess that file was created for testing functionalities for object files as well. We can probably merge these two someday. But anyway, this moves `call_indirect_alloca` I added in D147033 to `call.ll`, given that that file contains more `call_indirect` tests and I'm planning to add more `call_indirect` tests in a followup CL. Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D147396	2023-04-05 01:41:59 -07:00
Thomas Lively	62bfb0b14c	[WebAssembly] Add libcall signatures for roundeven Since clang started emitting roundeven intrinsics in `a7d6593a0a`, they would cause a crash in the WebAssembly backend because it did not know the roundeven library function signatures. Fix the crash by adding the signatures. Differential Revision: https://reviews.llvm.org/D147476	2023-04-04 08:32:26 -07:00
Nikita Popov	1b16c70299	[WebAssembly] Convert tests to opaque pointers (NFC)	2023-04-04 12:16:50 +02:00
Peter Rong	3b2476910b	[WASM] Prevent casting `undef` to `CosntantSDNode` WebAssembly tries to cast an `undef` to `CosntantSDNode` during `LowerAccessVectorElement`. These operations will trigger an assertion error in cast. To avoid this issue, we prevent casting, and abort the lowering operation. A unit test is also included. This patch fixes [pr61828](https://github.com/llvm/llvm-project/issues/61828) Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D147198	2023-03-30 20:14:11 -07:00
Peter Rong	51a93828d7	[WASM] Fix legalizer for LowerBUILD_VECTOR. Constants in BUILD_VECTOR may be down cast into a smaller value that fits LaneBits, i.e., the bit width of elements in the vector. This cast didn't consider 2^N where it would be cast into -2^N, which still doesn't fit into LaneBits after casting. This will cause an assertion in later legalization. 2^N should be cast into 0, and this patch reflects such behavior. This patch also includes a test to reflect the fix. This patch fixes [issue 61780](https://github.com/llvm/llvm-project/issues/61780) Related patch: https://reviews.llvm.org/D108669 Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D147208	2023-03-30 19:20:04 -07:00
Heejin Ahn	d91c9aef9b	[WebAssembly] Select call_indirect for alloca calls Currently calling stack locations is selected using `CALL` in ISel, resulting in an invalid code and crashing in AsmPrinter. FastISel correctly selects it will `CALL_INDIRECT`. Fixes the problem reported in D146781. Reviewed By: tlively, HerrCai0907 Differential Revision: https://reviews.llvm.org/D147033	2023-03-29 12:46:58 -07:00
Thomas Lively	dd0bbae5ef	[WebAssembly] Fix epilogue insertion for indirect tail calls Previously epilogues were incorrectly inserted after indirect tail calls because they did not have the `isTerminator` property. Add that property and test that they get correct epilogues. To be safe, also add other properties that were defined for direct tail calls. Differential Revision: https://reviews.llvm.org/D146569	2023-03-22 09:28:48 -07:00
Thomas Lively	0528087663	[NFC][WebAssembly] Autogenerate test expectations for tailcall.ll A follow-on commit will add tests to this file and using the update_llc_test_checks script will make that easier. Differential Revision: https://reviews.llvm.org/D146568	2023-03-22 09:21:12 -07:00
Congcong Cai	696fdece49	[WebAssembly] Fix i64_i64_func_i64_i64_i32 type signature when multivalue feature is enabled Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D146533	2023-03-22 06:53:54 +08:00
Congcong Cai	ec2a726a63	[Webassembly][multivalue] update libcall signature for f128 when multivalue feature enabled further update for [D146271](https://reviews.llvm.org/D146271) Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D146499	2023-03-22 00:16:29 +08:00
Congcong Cai	d9661d79f4	[Webassembly][multivalue] update libcall signature when multivalue feature enabled fixed: #59095 Update libcall signatures to use multivalue return rather than returning via a pointer when the multivalue features is enabled in the WebAssembly backend. Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D146271	2023-03-21 12:10:51 +08:00
Heejin Ahn	4e844a1498	[WebAssembly] Replace Bugzilla links with Github issues Reviewed By: dschuff, asb Differential Revision: https://reviews.llvm.org/D145966	2023-03-17 20:13:00 -07:00
Nikita Popov	bbfb13a5ff	[ConstExpr] Remove select constant expression This removes the select constant expression, as part of https://discourse.llvm.org/t/rfc-remove-most-constant-expressions/63179. Uses of this expressions have already been removed in advance, so this just removes related infrastructure and updates tests. Differential Revision: https://reviews.llvm.org/D145382	2023-03-16 10:32:08 +01:00
Jun Ma	00eef4f7c3	[SelectionDAG] Fix mismatched truncate when combine BUILD_VECTOR with EXTRACT_SUBVECTOR Just use correct type for truncation. Fixes PR59625 Differential Revision: https://reviews.llvm.org/D145757	2023-03-13 08:59:52 +08:00
Jun Ma	403926aefe	[WebAssembly] Skip implied bitmask operation in LowerShift This patch skips redundant explicit masks of the shift count since it is implied inside wasm shift instruction. Differential Revision: https://reviews.llvm.org/D144619	2023-03-02 09:37:25 +08:00
Alex Bradbury	1ae859753c	[WebAssembly][test] Clean up ir-locals.ll after opaque pointer conversion The `tyname_cell` definitions at the top are now all the same, so replace them with a single `alloca_cell` type.	2023-02-26 19:17:06 +00:00
Alex Bradbury	771261ff01	[Webassembly][test] Regenerate ir-locals.ll using update_llc_test_checks.py Preparation for further additions.	2023-02-26 19:12:36 +00:00
Samuel Parker	f48d3b6f46	Revert "[DAGCombine] Fold redundant select" This reverts commit `c7f9344d0f`.	2023-02-23 17:59:41 +00:00
Samuel Parker	28ee604071	[WebAssembly] pmin/pmax fixes Reverse the operand ordering to ? rhs : lhs. Differential Revision: https://reviews.llvm.org/D144466	2023-02-22 10:02:16 +00:00
Jun Ma	e9d7f96a11	[WebAssembly] Add more combine pattern for vector shift After change with D144169, the codegen generates redundant instructions like and and wrap. This fixes it. Differential Revision: https://reviews.llvm.org/D144360	2023-02-22 09:53:00 +08:00
Samuel Parker	c7f9344d0f	[DAGCombine] Fold redundant select Recommit `bbdf243579`. Original commit message: If a chain of two selects share a true/false value and are controlled by two setcc nodes, that are never both true, we can fold away one of the selects. So, the following: (select (setcc X, const0, eq), Y, (select (setcc X, const1, eq), Z, Y)) Can be combined to: select (setcc X, const1, eq) Z, Y Differential Revision: https://reviews.llvm.org/D142535	2023-02-15 10:32:16 +00:00
Jake Egan	08533f8b86	Revert "[CGP] Add generic TargetLowering::shouldAlignPointerArgs() implementation" These commits are causing a test-suite build failure on AIX. Revert for now for time to investigate. https://lab.llvm.org/buildbot/#/builders/214/builds/5779/steps/9/logs/stdio This reverts commit `bd87a2449d` and `4c72266830`.	2023-02-14 15:20:06 -05:00
Samuel Parker	a674a12dd5	[WebAssembly] Additional patterns for pmin/pax Each operation was missing their inverted condition using olt or ogt. Also, as we don't need to discern +/-0, I think we should also be able to use ole and oge. Differential Revision: https://reviews.llvm.org/D143581	2023-02-10 09:54:45 +00:00
Andrew Savonichev	c65b4d64d4	[SelectionDAG] Do not second-guess alignment for alloca Alignment of an alloca in IR can be lower than the preferred alignment on purpose, but this override essentially treats the preferred alignment as the minimum alignment. The patch changes this behavior to always use the specified alignment. If alignment is not set explicitly in LLVM IR, it is set to DL.getPrefTypeAlign(Ty) in computeAllocaDefaultAlign. Tests are changed as well: explicit alignment is increased to match the preferred alignment if it changes output, or omitted when it is hard to determine the right value (e.g. for pointers, some structs, or weird types). Differential Revision: https://reviews.llvm.org/D135462	2023-02-09 18:45:20 +03:00
Alex Richardson	bd87a2449d	[CGP] Add generic TargetLowering::shouldAlignPointerArgs() implementation This function was added for ARM targets, but aligning global/stack pointer arguments passed to memcpy/memmove/memset can improve code size and performance for all targets that don't have fast unaligned accesses. This adds a generic implementation that adjusts the alignment to pointer size if unaligned accesses are slow. Review D134168 suggests that this significantly improves performance on synthetic benchmarks such as Dhrystone on RV32 as it avoids memcpy() calls. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D134282	2023-02-09 10:11:40 +00:00
Alex Bradbury	3a80dc27ed	[WebAssembly][test][NFC] Add coverage of non-void funcref calls This is trickier to handle in some other representations of funcrefs that are being explored, so it makes sense to ensure we have some coverage of this requirement.	2023-02-07 15:56:10 +00:00
Alex Bradbury	604c9a07f3	[WebAssembly][test][NFC] Regenerate funcref-call.ll using update_llc_test_checks.py In preparation for some slight expansion of the tests.	2023-02-07 15:44:24 +00:00
Samuel Parker	7bff37783f	[SDAG] Check fminnum/fmaxnum for non-zero operand. Currently, in TargetLowering, if the target does not support fminnum, we lower to fminimum if neither operand could be a NaN. But this isn't quite correct because fminnum and fminimum treat +/-0 differently; so, we need to prove that one of the operands isn't a zero, or we don't have signed zeros. Differential Revision: https://reviews.llvm.org/D143256	2023-02-07 10:54:23 +00:00
Samuel Parker	a7de5c82bb	[NFC] minnum/maxnum intrinsic tests ARM and WebAssembly tests.	2023-02-07 10:47:40 +00:00
Samuel Parker	91f8289ff0	Revert "[DAGCombine] Fold redundant select" This reverts commit `bbdf243579`.	2023-02-07 10:37:20 +00:00
Sanjay Patel	fb3e3ef62e	[SDAG] fix miscompiles caused by using ValueTracking matchSelectPattern to create FMINIMUM/FMAXIMUM ValueTracking attempts to match compare+select patterns to FP min/max operations, but it was created before the newer IEEE-754-2019 minimum/maximum ops were defined. Ie, matchSelectPattern() does not account for the -0.0/+0.0 behavior that is specified in the newer standard. FMINIMUM/FMAXIMUM nodes were created to map to the newer standard: /// FMINIMUM/FMAXIMUM - NaN-propagating minimum/maximum that also treat -0.0 /// as less than 0.0. While FMINNUM_IEEE/FMAXNUM_IEEE follow IEEE 754-2008 /// semantics, FMINIMUM/FMAXIMUM follow IEEE 754-2018 draft semantics. We could adjust ValueTracking to deal with signed zero, but it seems like a moot point given the divergent NaN behavior discussed in D143056, so just delete this possibility to avoid bugs when converting IR to SDAG. Differential Revision: https://reviews.llvm.org/D143106	2023-02-03 09:53:47 -05:00
Samuel Parker	bbdf243579	[DAGCombine] Fold redundant select If a chain of two selects share a true/false value and are controlled by two setcc nodes, that are never both true, we can fold away one of the selects. So, the following: (select (setcc X, const0, eq), Y, (select (setcc X, const1, eq), Z, Y)) Can be combined to: select (setcc X, const1, eq) Z, Y Differential Revision: https://reviews.llvm.org/D142535	2023-02-02 09:43:21 +00:00
Nikita Popov	78f88082de	[ConstantFold] Fix incorrect inbounds inference for [0 x T] GEPs Previously all indices into [0 x T] arrays were considered in range, which resulted in us incorrectly inferring inbounds for all GEPs of that form. We should not consider them in range here, and instead bail out of the rewriting logic (which would divide by zero). Do continue to consider 0 always in range, to avoid changing behavior for zero-index GEPs.	2023-02-01 15:14:11 +01:00
Samuel Parker	038f7debfd	[DAGCombine] fp_to_sint isSaturatingMinMax Recommitting after fixing scalable vector crash. Check for single smax pattern against zero when converting from a small enough float. Differential Revision: https://reviews.llvm.org/D142481	2023-01-30 12:25:25 +00:00
Sergei Barannikov	6594d058b9	[WebAssembly] Convert some tests to opaque pointers (NFC)	2023-01-30 07:08:42 +03:00
Samuel Parker	e60b91df13	Revert "[DAGCombine] fp_to_sint isSaturatingMinMax" This reverts commit `85395af272`. This is causing trouble with scalable vectors.	2023-01-27 15:42:12 +00:00
Samuel Parker	79649eacbc	[WebAssembly] Trying to fix expensive buildbot	2023-01-26 14:26:02 +00:00
Samuel Parker	85395af272	[DAGCombine] fp_to_sint isSaturatingMinMax Check for single smax pattern against zero when converting from a small enough float. Differential Revision: https://reviews.llvm.org/D142481	2023-01-26 12:37:43 +00:00
Samuel Parker	41080b2fdd	[NFC][WebAssembly] Updated tests Run update_llc_test_checks on a number of codegen tests.	2023-01-26 10:26:24 +00:00
Samuel Parker	430bdb1215	[NFC][WebAssembly] More fpclamptosat tests	2023-01-25 10:25:23 +00:00
Matt Arsenault	778cf5431c	IR: Add atomicrmw uinc_wrap and udec_wrap These are essentially add/sub 1 with a clamping value. AMDGPU has instructions for these. CUDA/HIP expose these as atomicInc/atomicDec. Currently we use target intrinsics for these, but those do no carry the ordering and syncscope. Add these to atomicrmw so we can carry these and benefit from the regular legalization processes.	2023-01-24 17:55:11 -04:00
Samuel Parker	32af267447	[NFC][WebAssembly] Add tests Add more variations to fpclamptosat.	2023-01-18 13:30:53 +00:00

1 2 3 4 5 ...

1060 Commits