clang-p2996

Author	SHA1	Message	Date
Peter Collingbourne	feea10bcdf	Recognise 32-bit ror-based bswap implementation used by uclibc llvm-svn: 119007	2010-11-13 19:54:30 +00:00
Evan Cheng	2bcb8daa44	Add conditional move of large immediate. llvm-svn: 118968	2010-11-13 02:25:14 +00:00
Evan Cheng	8ce967e393	Fix an obvious typo which inverted an immediate. llvm-svn: 118951	2010-11-13 00:27:47 +00:00
Eric Christopher	a08ccc8cb9	This should be still failing, but is. Disable it with the forget-me-stick for now. llvm-svn: 118950	2010-11-13 00:25:06 +00:00
Evan Cheng	0fc8084a64	Add conditional mvn instructions. llvm-svn: 118935	2010-11-12 22:42:47 +00:00
Evan Cheng	2d59ee34f1	Add some missing isel predicates on def : pat patterns to avoid generating VFP vmla / vmls (they cause stalls). Disabling them in isel is probably not the right solution; I'll look into a proper solution next. llvm-svn: 118922	2010-11-12 20:32:20 +00:00
Andrew Trick	b709ec6345	Emacs auto-fill bug. llvm-svn: 118908	2010-11-12 18:17:46 +00:00
Andrew Trick	ff5f8680d8	Test case for PR8287: SD scheduling time. Fixed in r118904. llvm-svn: 118906	2010-11-12 17:57:22 +00:00
Kalle Raiskila	0a9dd405a5	Fix memory access lowering on SPU, adding support for the case where alignment<value size. These cases were silently miscompiled before this patch. Now they are overly verbose -especially storing is- and any front-end should still avoid misaligned memory accesses as much as possible. The bit juggling algorithm added here probably has some room for improvement still. llvm-svn: 118889	2010-11-12 10:14:03 +00:00
Bruno Cardoso Lopes	03c0330176	Enable mips32 mul instruction. Patch by Akira Hatanaka <ahatanaka@mips.com> llvm-svn: 118864	2010-11-12 00:38:32 +00:00
Dan Gohman	6cf9bb45ad	Remove the memmove->memcpy optimization from CodeGen. MemCpyOpt does this. llvm-svn: 118789	2010-11-11 16:24:49 +00:00
Bruno Cardoso Lopes	a4ceea8cd8	Add a test for the previously added clo instruction. Patch by Akira again. llvm-svn: 118668	2010-11-10 02:22:44 +00:00
Bob Wilson	193722ebc8	Do not use MEMBARRIER_MCR for any Thumb code. It is only supported for ARM code. Normally Thumb2 code would use DMB instead, but depending on how the compiler is invoked (e.g., -mattr=-db) that might be disabled. This prevents a "cannot select MEMBARRIER_MCR" error in that situation. Radar 8644195 llvm-svn: 118642	2010-11-09 22:50:44 +00:00
Duncan Sands	e5276f11ee	Testcase for PR8211 (llc crash at -O0). llvm-svn: 118509	2010-11-09 16:22:27 +00:00
Dan Gohman	5db8921422	Fix DAGCombiner to avoid folding a sext-in-reg or similar through a shl in order to fold it into a load. llvm-svn: 118471	2010-11-09 01:54:35 +00:00
Dan Gohman	4677bafd85	Delete an extraneous svn:executable property. llvm-svn: 118470	2010-11-09 01:51:06 +00:00
Dale Johannesen	f11ea9ce61	Fix an inline asm pasto from 117667; was preventing {i64, i64} from matching i128. llvm-svn: 118465	2010-11-09 01:15:07 +00:00
Owen Anderson	c7baee31ad	Add support for ARM's specialized vector-compare-against-zero instructions. llvm-svn: 118453	2010-11-08 23:21:22 +00:00
Dale Johannesen	0ef474730f	Revert 118422 in search of bot verdancy. llvm-svn: 118429	2010-11-08 19:17:22 +00:00
Jason W Kim	f3e224f830	Support -mcpu=cortex-a8 in ARM attributes - Has Fixme. 1 Test modified. llvm-svn: 118422	2010-11-08 17:58:07 +00:00
Chris Lattner	ca7801e472	go to great lengths to work around a GAS bug my previous patch exposed: GAS doesn't accept "fcomip %st(1)", it requires "fcomip %st(1), %st(0)" even though st(0) is implicit in all other fp stack instructions. Fortunately, there is an alias for fcomip named "fcompi" and gas does accept the default argument for the alias (boggle!). As such, switch the canonical form of this instruction to "pi" instead of "ip". This makes the code generator and disassembler generate pi, avoiding the gas bug. llvm-svn: 118356	2010-11-06 21:37:06 +00:00
Owen Anderson	30c4892ea5	Add codegen and encoding support for the immediate form of vbic. llvm-svn: 118291	2010-11-05 19:27:46 +00:00
Duncan Sands	98512315f7	When passing a huge parameter using the byval mechanism, a long sequence of loads and stores was being generated to perform the copy on the x86 targets if the parameter was less than 4 byte aligned, causing llc to use up vast amounts of memory and time. Use a "rep movs" form instead. PR7170. llvm-svn: 118260	2010-11-04 21:16:46 +00:00
Evan Cheng	21acf9fb38	Fix @llvm.prefetch isel. Selecting between pld / pldw using the first immediate rw. There is currently no intrinsic that matches to pli. llvm-svn: 118237	2010-11-04 05:19:35 +00:00
Owen Anderson	bc9b31c493	Convert VORRIMM to be produced via early target-specific DAG combining, rather than legalization. This is the conceptually correct place for it, and it also allows the combine to be more aggressive. llvm-svn: 118204	2010-11-03 23:15:26 +00:00
Owen Anderson	0747307049	Add support for code generation of the one register with immediate form of vorr. We could be more aggressive about making this work for a larger range of constants, but this seems like a good start. llvm-svn: 118201	2010-11-03 22:44:51 +00:00
Evan Cheng	3ad8df65c5	Fix test. llvm-svn: 118187	2010-11-03 18:21:33 +00:00
Dale Johannesen	c7d82d58b5	This test assumes SSE is present; that is not the default on non-X86 hosts. Hopefully fixes ppc-host buildbot. llvm-svn: 118182	2010-11-03 18:08:41 +00:00
Bob Wilson	7d0ac84abd	Add codegen patterns for VST1-lane instructions. Radar 8599955. llvm-svn: 118176	2010-11-03 16:24:53 +00:00
Bob Wilson	ceb49296ef	Check for extractelement with a variable operand for the element number. For NEON we had been assuming this was always an immediate constant. llvm-svn: 118175	2010-11-03 16:24:50 +00:00
Evan Cheng	8740ee3637	Fix preload instruction isel. Only v7 supports pli, and only v7 with mp extension supports pldw. Add subtarget attribute to denote mp extension support and legalize illegal ones to nothing. llvm-svn: 118160	2010-11-03 06:34:55 +00:00
Evan Cheng	6f36042557	Add support to match @llvm.prefetch to pld / pldw / pli. rdar://8601536. llvm-svn: 118152	2010-11-03 05:14:24 +00:00
Dan Gohman	68fb004616	Fix DAGCombiner to avoid going into an infinite loop when it encounters (and:i64 (shl:i64 (load:i64), 1), 0xffffffff). This fixes rdar://8606584. llvm-svn: 118143	2010-11-03 01:47:46 +00:00
Evan Cheng	debf9c502a	Two sets of changes. Sorry they are intermingled. 1. Fix pre-ra scheduler so it doesn't try to push instructions above calls to "optimize for latency". Call instructions don't have the right latency and this is more likely to introduce spills. 2. Fix if-converter cost function. For ARM, it should use instruction latencies, not # of micro-ops, since multi-latency instructions are completely executed even when the predicate is false. Also, some instructions will be "slower" when they are predicated due to the register def becoming an implicit input. rdar://8598427 llvm-svn: 118135	2010-11-03 00:45:17 +00:00
John Thompson	beffa5bef1	Inline asm mult-alt constraint tests. llvm-svn: 118107	2010-11-02 23:01:44 +00:00
Jim Grosbach	0b7fda23cc	Revert r114340 (improvements in Darwin function prologue/epilogue), as it broke assumptions about stack layout. Specifically, LR must be saved next to FP. llvm-svn: 118026	2010-11-02 17:35:25 +00:00
Devang Patel	94f2a2578c	Use frameindex, if available, as a last resort to emit debug info for a parameter. llvm-svn: 118020	2010-11-02 17:01:30 +00:00
Bob Wilson	dd9fbaa9c0	Add support for alignment operands on VLD1-lane instructions. This is another part of the fix for Radar 8599955. llvm-svn: 117976	2010-11-01 23:40:51 +00:00
Bob Wilson	7e57573844	Add VLD1-lane testcases for quad-register types. llvm-svn: 117975	2010-11-01 23:40:46 +00:00
Bob Wilson	dc44990c7d	Add NEON VLD1-lane instructions. Partial fix for Radar 8599955. llvm-svn: 117964	2010-11-01 22:04:05 +00:00
Bill Wendling	c6627eec13	When we look at instructions to convert to setting the 's' flag, we need to look at more than those which define CPSR. You can have this situation: (1) subs ... (2) sub r6, r5, r4 (3) movge ... (4) cmp r6, 0 (5) movge ... We cannot convert (2) to "subs" because (3) is using the CPSR set by (1). There's an analogous situation here: (1) sub r1, r2, r3 (2) sub r4, r5, r6 (3) cmp r4, ... (5) movge ... (6) cmp r1, ... (7) movge ... We cannot convert (1) to "subs" because of the intervening use of CPSR. llvm-svn: 117950	2010-11-01 20:41:43 +00:00
Bob Wilson	44be217af1	NEON does not support truncating vector stores. Radar 8598391. llvm-svn: 117940	2010-11-01 18:31:39 +00:00
Bill Wendling	359dd0c6bd	More tests to XFAIL. The arm-and-tst-peephole.ll test passes even when the peephole optimizer is disabled. That's not good at all. llvm-svn: 117905	2010-11-01 05:59:43 +00:00
Bill Wendling	cd4750cb4d	Disable because peephole is disabled. llvm-svn: 117903	2010-11-01 05:48:44 +00:00
Bob Wilson	7ed597149b	Overhaul memory barriers in the ARM backend. Radar 8601999. There were a number of issues to fix up here: * The "device" argument of the llvm.memory.barrier intrinsic should be used to distinguish the "Full System" domain from the "Inner Shareable" domain. It has nothing to do with using DMB vs. DSB instructions. * The compiler should never need to emit DSB instructions. Remove the ARMISD::SYNCBARRIER node and also remove the instruction patterns for DSB. * Merge the separate DMB/DSB instructions for options only used for the disassembler with the default DMB/DSB instructions. Add the default "full system" option ARM_MB::SY to the ARM_MB::MemBOpt enum. * Add a separate ARMISD::MEMBARRIER_MCR node for subtargets that implement a data memory barrier using the MCR instruction. * Fix up encodings for these instructions (except MCR). I also updated the tests and added a few new ones to check for DMB options that were not currently being exercised. llvm-svn: 117756	2010-10-30 00:54:37 +00:00
Evan Cheng	2b3f25e031	Teach machine cse to eliminate instructions with multiple physreg uses and defs. rdar://8610857. llvm-svn: 117745	2010-10-29 23:36:03 +00:00
Bob Wilson	08882be86c	Remove DAG combiner patch to fold vector splats. Instcombiner does it now. llvm-svn: 117720	2010-10-29 22:03:02 +00:00
Evan Cheng	6c1414f9c2	Avoiding overly aggressive latency scheduling. If the two nodes share an operand and one of them has a single use that is a live out copy, favor the one that is live out. Otherwise it will be difficult to eliminate the copy if the instruction is a loop induction variable update. e.g. BB: sub r1, r3, #1 str r0, [r2, r3] mov r3, r1 cmp bne BB => BB: str r0, [r2, r3] sub r3, r3, #1 cmp bne BB This fixed the recent 256.bzip2 regression. llvm-svn: 117675	2010-10-29 18:09:28 +00:00
Bob Wilson	f63da12be9	Teach the DAG combiner to fold a splat of a splat. Radar 8597790. Also do some minor refactoring to reduce indentation. llvm-svn: 117558	2010-10-28 17:06:14 +00:00
Evan Cheng	ff310737e5	Re-commit 117518 and 117519 now that ARM MC test failures are out of the way. llvm-svn: 117531	2010-10-28 06:47:08 +00:00

3787 Commits
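
Note on commit feea10bcdf (ror-based bswap): the commit message only says that a 32-bit byte swap built from rotates is now recognised. As an illustration of why such a sequence is a byte swap, here is a minimal sketch in Python. It assumes the sequence is "rotate the low 16 bits by 8, rotate the full 32 bits by 16, rotate the low 16 bits by 8 again"; that exact sequence is an assumption and is not stated in the log. The helper names (ror16, ror32, bswap32_ror) are hypothetical and exist only for this example.

```python
MASK32 = 0xFFFFFFFF


def ror16(x, n):
    """Rotate a 16-bit value right by n bits."""
    x &= 0xFFFF
    n %= 16
    return ((x >> n) | (x << (16 - n))) & 0xFFFF


def ror32(x, n):
    """Rotate a 32-bit value right by n bits."""
    x &= MASK32
    n %= 32
    return ((x >> n) | (x << (32 - n))) & MASK32


def bswap32_ror(x):
    """Byte-swap a 32-bit value using only rotates (assumed sequence)."""
    x &= MASK32
    # Step 1: rotate the low half by 8, swapping its two bytes.
    x = (x & 0xFFFF0000) | ror16(x, 8)
    # Step 2: rotate the whole word by 16, swapping the two halves.
    x = ror32(x, 16)
    # Step 3: rotate the (new) low half by 8, swapping its two bytes.
    x = (x & 0xFFFF0000) | ror16(x, 8)
    return x


def bswap32_reference(x):
    """Reference byte swap via the standard int <-> bytes conversions."""
    return int.from_bytes((x & MASK32).to_bytes(4, "little"), "big")
```

For 0x11223344 the three steps give 0x11224433, then 0x44331122, then 0x44332211, which matches the reference byte swap.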