clang-p2996

Author	SHA1	Message	Date
Chris Lattner	32c01df299	Canonicalize (X+C1)*C2 -> X*C2+C1*C2 This implements Transforms/InstCombine/add.ll:test31 llvm-svn: 26519	2006-03-04 06:04:02 +00:00
Chris Lattner	681ef2f083	Change this to work with renamed intrinsics. llvm-svn: 26484	2006-03-03 01:34:17 +00:00
Chris Lattner	85dda9a2bd	Generalize the REM folding code to handle another case Nick Lewycky pointed out: realize the AND can provide factors and look through Casts. llvm-svn: 26469	2006-03-02 06:50:58 +00:00
Chris Lattner	c5b6c9a12a	Fix a regression in a patch from a couple of days ago. This fixes Transforms/InstCombine/2006-02-28-Crash.ll llvm-svn: 26427	2006-02-28 19:47:20 +00:00
Chris Lattner	b70f141893	Implement rem.ll:test[7-9] and PR712 llvm-svn: 26415	2006-02-28 05:49:21 +00:00
Chris Lattner	2a7c7b8bab	Simplify some code now that the RHS of a rem can't be 0 llvm-svn: 26413	2006-02-28 05:40:55 +00:00
Chris Lattner	0de4a8d7b7	Rearrange some code, fold "rem X, 0", implementing rem.ll:test6 llvm-svn: 26411	2006-02-28 05:30:45 +00:00
Chris Lattner	c7bfed0f7b	Merge two almost-identical pieces of code. Make this code more powerful by using ComputeMaskedBits instead of looking for an AND operand. This lets us fold this: int %test23(int %a) { %tmp.1 = and int %a, 1 %tmp.2 = seteq int %tmp.1, 0 %tmp.3 = cast bool %tmp.2 to int ;; xor tmp1, 1 ret int %tmp.3 } into: xor (and a, 1), 1 llvm-svn: 26396	2006-02-27 02:38:23 +00:00
Chris Lattner	f5c8a0b83f	Fold (A^B) == A -> B == 0 and (A-B) == A -> B == 0 llvm-svn: 26394	2006-02-27 01:44:11 +00:00
Chris Lattner	f78df7c14d	Fold (X|C1)^C2 -> X^(C1|C2) when possible. This implements InstCombine/or.ll:test23. llvm-svn: 26385	2006-02-26 19:57:54 +00:00
Chris Lattner	b580d26e7d	Fix a problem that Nate noticed that boils down to an over conservative check in the code that does "select C, (X+Y), (X-Y) --> (X+(select C, Y, (-Y)))". We now compile this loop: LBB1_1: ; no_exit add r6, r2, r3 subf r3, r2, r3 cmpwi cr0, r2, 0 addi r7, r5, 4 lwz r2, 0(r5) addi r4, r4, 1 blt cr0, LBB1_4 ; no_exit LBB1_3: ; no_exit mr r3, r6 LBB1_4: ; no_exit cmpwi cr0, r4, 16 mr r5, r7 bne cr0, LBB1_1 ; no_exit into this instead: LBB1_1: ; no_exit srawi r6, r2, 31 add r2, r2, r6 xor r6, r2, r6 addi r7, r5, 4 lwz r2, 0(r5) addi r4, r4, 1 add r3, r3, r6 cmpwi cr0, r4, 16 mr r5, r7 bne cr0, LBB1_1 ; no_exit llvm-svn: 26356	2006-02-24 18:05:58 +00:00
Jeff Cohen	0add83e969	Fix bugs identified by VC++. llvm-svn: 26287	2006-02-18 03:20:33 +00:00
Nate Begeman	8a77efe4f7	Rework the SelectionDAG-based implementations of SimplifyDemandedBits and ComputeMaskedBits to match the new improved versions in instcombine. Tested against all of multisource/benchmarks on ppc. llvm-svn: 26238	2006-02-16 21:11:51 +00:00
Chris Lattner	8b10ab3002	Implement Instcombine/and.ll:test34 llvm-svn: 26155	2006-02-13 23:07:23 +00:00
Chris Lattner	7d8522884b	If any of the sign extended bits are demanded, the input sign bit is demanded for a sign extension. This fixes InstCombine/2006-02-13-DemandedMiscompile.ll and Ptrdist/bc. llvm-svn: 26152	2006-02-13 22:41:07 +00:00
Chris Lattner	68e7475777	Be careful not to request or look at bits shifted in from outside the size of the input. This fixes the mediabench/gsm/toast failure last night. llvm-svn: 26138	2006-02-13 06:09:08 +00:00
Chris Lattner	f5b4ef7f58	remove some more dead special case code llvm-svn: 26135	2006-02-12 08:07:37 +00:00
Chris Lattner	5b2edb1fca	Eliminate special case hacks that are superseded by general purpose hacks llvm-svn: 26134	2006-02-12 08:02:11 +00:00
Chris Lattner	ee0f280743	Three changes: 1. Teach GetConstantInType to handle boolean constants. 2. Teach instcombine to fold (compare X, CST) when X has known 0/1 bits. Testcase here: set.ll:test22 3. Improve the "(X >> c1) & C2 == 0" folding code to allow a noop cast between the shift and and. More aggressive bitfolding for other reasons was turning signed shr's into unsigned shr's, leaving the noop cast in the way. llvm-svn: 26131	2006-02-12 02:07:56 +00:00
Chris Lattner	0157e7f55b	Port the recent innovations in ComputeMaskedBits to SimplifyDemandedBits. This allows us to simplify on conditions where bits are not known, but they are not demanded either! This also fixes a couple of bugs in ComputeMaskedBits that were exposed during this work. In the future, swaths of instcombine should be removed, as this code subsumes a bunch of ad-hockery. llvm-svn: 26122	2006-02-11 09:31:47 +00:00
Chris Lattner	24cd2fa269	Fix 80-column violations llvm-svn: 26088	2006-02-09 07:41:14 +00:00
Chris Lattner	4534dd59a3	Enhance MVIZ in three ways: 1. Teach it new tricks: in particular how to propagate through signed shr and sexts. 2. Teach it to return a bitset of known-1 and known-0 bits, instead of just zero. 3. Teach instcombine (AND X, C) to fold when we know all C bits of X. This implements Regression/Transforms/InstCombine/bittest.ll, and allows future things to be simplified. llvm-svn: 26087	2006-02-09 07:38:58 +00:00
Chris Lattner	ab2dc4d70d	Simplify some code, reducing calls to MaskedValueIsZero. Implement a minor optimization where we reduce the number of bits in AND masks when possible. llvm-svn: 26056	2006-02-08 07:34:50 +00:00
Chris Lattner	5997cf9381	Use EraseInstFromFunction in a few cases to put the uses of the removed instruction onto the worklist (in case they are now dead). Add a really trivial local DSE implementation to help out bitfield code. We now fold this: struct S { unsigned char a : 1, b : 1, c : 1, d : 2, e : 3; S(); }; S::S() : a(0), b(0), c(1), d(0), e(6) {} to this: void %_ZN1SC1Ev(%struct.S* %this) { entry: %tmp.1 = getelementptr %struct.S* %this, int 0, uint 0 store ubyte 38, ubyte* %tmp.1 ret void } much earlier (in gccas instead of only in gccld after DSE runs). llvm-svn: 26050	2006-02-08 03:25:32 +00:00
Chris Lattner	ddba3289b5	Fix a problem in my patch yesterday, causing a miscompilation of 176.gcc llvm-svn: 26045	2006-02-08 01:20:23 +00:00
Chris Lattner	44314827d6	Fix Transforms/InstCombine/2006-02-07-SextZextCrash.ll llvm-svn: 26040	2006-02-07 19:07:40 +00:00
Chris Lattner	92a6865321	Generalize MaskedValueIsZero into a ComputeMaskedNonZeroBits function, which is just as efficient as MVIZ and is also more general. Fix a few minor bugs introduced in recent patches llvm-svn: 26036	2006-02-07 08:05:22 +00:00
Chris Lattner	c3ebf40031	Make MaskedValueIsZero take a uint64_t instead of a ConstantIntegral as a mask. This allows the code to be simpler and more efficient. Also, generalize some of the cases in MVIZ a bit, making it slightly more aggressive. llvm-svn: 26035	2006-02-07 07:27:52 +00:00
Chris Lattner	77defbae0a	Use Type::getIntegralTypeMask() to simplify some code llvm-svn: 26034	2006-02-07 07:00:41 +00:00
Chris Lattner	2590e511d8	Implement the beginnings of a facility for simplifying expressions based on 'demanded bits', inspired by Nate's work in the dag combiner. This isn't complete, but needs two unrelated instcombiner changes to continue. llvm-svn: 26033	2006-02-07 06:56:34 +00:00
Chris Lattner	2e90b732fa	Turn A % (C << N), where C is 2^k, into A & ((C << N)-1) [urem only]. Turn A / (C1 << N), where C1 is "1<<C2" into A >> (N+C2) [udiv only]. Tested with: rem.ll:test5, div.ll:test10 llvm-svn: 26003	2006-02-05 07:54:04 +00:00
Chris Lattner	c597b8a55e	Make iostream #inclusion explicit llvm-svn: 25514	2006-01-22 23:32:06 +00:00
Chris Lattner	e154abf9b3	Implement casts.ll:test26: a cast from float -> double -> integer, doesn't need the float->double part. llvm-svn: 25452	2006-01-19 07:40:22 +00:00
Chris Lattner	307b7ea15f	fix a crash due to missing parens llvm-svn: 25363	2006-01-16 19:47:21 +00:00
Robert Bocchino	a83529678e	Added instcombine support for extractelement. llvm-svn: 25299	2006-01-13 22:48:06 +00:00
Chris Lattner	503221f5c5	Do a simple instcombine xforms to delete llvm.stackrestore cases. llvm-svn: 25294	2006-01-13 21:28:09 +00:00
Chris Lattner	c66b223b28	Simplify this a tiny bit by using the new IntrinsicInst functionality. llvm-svn: 25292	2006-01-13 20:11:04 +00:00
Chris Lattner	9cbfbc21bb	fix some 176.gcc miscompilation from my previous patch. llvm-svn: 25137	2006-01-07 01:32:28 +00:00
Chris Lattner	330628a6d8	silence some bogus gcc warnings on fenris llvm-svn: 25130	2006-01-06 17:59:59 +00:00
Chris Lattner	eb372a0276	Enhance the shift-shift folding code to allow a no-op cast to occur in between the shifts. This allows us to fold this (which is the 'integer add a constant' sequence from cozmic's scheme compiler): int %x(uint %anf-temporary776) { %anf-temporary777 = shr uint %anf-temporary776, ubyte 1 %anf-temporary800 = cast uint %anf-temporary777 to int %anf-temporary804 = shl int %anf-temporary800, ubyte 1 %anf-temporary805 = add int %anf-temporary804, -2 %anf-temporary806 = or int %anf-temporary805, 1 ret int %anf-temporary806 } into this: int %x(uint %anf-temporary776) { %anf-temporary776 = cast uint %anf-temporary776 to int %anf-temporary776.mask1 = add int %anf-temporary776, -2 %anf-temporary805 = or int %anf-temporary776.mask1, 1 ret int %anf-temporary805 } note that instcombine already knew how to eliminate the AND that the two shifts fold into. This is tested by InstCombine/shift.ll:test26 -Chris llvm-svn: 25128	2006-01-06 07:52:12 +00:00
Chris Lattner	b330939d90	Simplify the code a bit more llvm-svn: 25126	2006-01-06 07:22:22 +00:00
Chris Lattner	145539343f	Extract a bunch of code out of visitShiftInst into FoldShiftByConstant. No functionality changes. llvm-svn: 25125	2006-01-06 07:12:35 +00:00
Nate Begeman	848622f87f	Add support for alignment of allocation instructions. Add support for specifying alignment and size of setjmp jmpbufs. No targets currently do anything with this information, nor is it preserved in the bytecode representation. That's coming up next. llvm-svn: 24196	2005-11-05 09:21:28 +00:00
Chris Lattner	dd0c174082	Turn sdiv into udiv if both operands have a clear sign bit. This occurs a few times in crafty: OLD: %tmp.36 = div int %tmp.35, 8 ; <int> [#uses=1] NEW: %tmp.36 = div uint %tmp.35, 8 ; <uint> [#uses=0] OLD: %tmp.19 = div int %tmp.18, 8 ; <int> [#uses=1] NEW: %tmp.19 = div uint %tmp.18, 8 ; <uint> [#uses=0] OLD: %tmp.117 = div int %tmp.116, 8 ; <int> [#uses=1] NEW: %tmp.117 = div uint %tmp.116, 8 ; <uint> [#uses=0] OLD: %tmp.92 = div int %tmp.91, 8 ; <int> [#uses=1] NEW: %tmp.92 = div uint %tmp.91, 8 ; <uint> [#uses=0] Which all turn into shrs. llvm-svn: 24190	2005-11-05 07:40:31 +00:00
Chris Lattner	e9ff0eaf5b	Turn srem -> urem when neither input has their sign bit set. This triggers 8 times in vortex, allowing the srems to be turned into shrs: OLD: %tmp.104 = rem int %tmp.5.i37, 16 ; <int> [#uses=1] NEW: %tmp.104 = rem uint %tmp.5.i37, 16 ; <uint> [#uses=0] OLD: %tmp.98 = rem int %tmp.5.i24, 16 ; <int> [#uses=1] NEW: %tmp.98 = rem uint %tmp.5.i24, 16 ; <uint> [#uses=0] OLD: %tmp.91 = rem int %tmp.5.i19, 8 ; <int> [#uses=1] NEW: %tmp.91 = rem uint %tmp.5.i19, 8 ; <uint> [#uses=0] OLD: %tmp.88 = rem int %tmp.5.i14, 8 ; <int> [#uses=1] NEW: %tmp.88 = rem uint %tmp.5.i14, 8 ; <uint> [#uses=0] OLD: %tmp.85 = rem int %tmp.5.i9, 1024 ; <int> [#uses=2] NEW: %tmp.85 = rem uint %tmp.5.i9, 1024 ; <uint> [#uses=0] OLD: %tmp.82 = rem int %tmp.5.i, 512 ; <int> [#uses=2] NEW: %tmp.82 = rem uint %tmp.5.i1, 512 ; <uint> [#uses=0] OLD: %tmp.48.i = rem int %tmp.5.i.i161, 4 ; <int> [#uses=1] NEW: %tmp.48.i = rem uint %tmp.5.i.i161, 4 ; <uint> [#uses=0] OLD: %tmp.20.i2 = rem int %tmp.5.i.i, 4 ; <int> [#uses=1] NEW: %tmp.20.i2 = rem uint %tmp.5.i.i, 4 ; <uint> [#uses=0] it also occurs 9 times in gcc, but with odd constant divisors (1009 and 61) so the payoff isn't as great. llvm-svn: 24189	2005-11-05 07:28:37 +00:00
Andrew Lenharth	662295587d	make this 64 bit clean, fixed test30 of /Regression/Transforms/InstCombine/add.ll llvm-svn: 24158	2005-11-02 18:35:40 +00:00
Chris Lattner	09efd4e5b6	Limit the search depth of MaskedValueIsZero to 6 instructions, to avoid bad cases. This fixes Markus's second testcase in PR639, and should seal it for good. llvm-svn: 24123	2005-10-31 18:35:52 +00:00
Chris Lattner	8f663e8bbc	Pull some code out into a function, give it the ability to see through +. This allows us to turn code like malloc(4*x+4) -> malloc int, (x+1) llvm-svn: 24081	2005-10-29 04:36:15 +00:00
Chris Lattner	8270c33606	Remove a special case, allowing the general case to handle it. No functionality change. llvm-svn: 24076	2005-10-29 03:19:53 +00:00
Chris Lattner	b9d3ca5c3c	Fix a bit of backwards logic that broke exptree and smg2000 llvm-svn: 24056	2005-10-28 16:27:35 +00:00
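
Several of the messages above describe algebraic folds (the urem/udiv power-of-two rewrites, the (X+C1)*C2 canonicalization, the select of add/sub rewrite, and the (A^B) == A and (A-B) == A comparisons). The sketch below is not LLVM code; it only models those identities on 32-bit values in Python so they can be checked by hand. Every function name in it is hypothetical, and it assumes the shifted divisor fits in 32 bits.

```python
# Illustrative model of a few folds named in the log, on 32-bit values.
# All names here are hypothetical; none of them come from LLVM.
MASK = 0xFFFFFFFF  # models 32-bit wraparound arithmetic


def urem_pow2(a, c, n):
    """A % (C << N), with C a power of two, rewritten as A & ((C << N) - 1).

    Unsigned only. Assumes C << N does not overflow 32 bits.
    """
    return a & ((c << n) - 1)


def udiv_pow2(a, c2, n):
    """A / ((1 << C2) << N) rewritten as A >> (N + C2). Unsigned only."""
    return a >> (n + c2)


def mul_distribute(x, c1, c2):
    """(X + C1) * C2 rewritten as X*C2 + C1*C2, modulo 2**32."""
    return (x * c2 + c1 * c2) & MASK


def select_add_sub(cond, x, y):
    """select C, (X+Y), (X-Y) rewritten as X + (select C, Y, -Y)."""
    return (x + (y if cond else -y)) & MASK


def check_all():
    """Compare each rewritten form with the direct computation."""
    samples = [0, 1, 7, 77, 1024, 0x7FFFFFFF, 0x80000000, MASK]
    for a in samples:
        for k in range(4):
            for n in range(4):
                d = (1 << k) << n
                assert a % d == urem_pow2(a, 1 << k, n)
                assert a // d == udiv_pow2(a, k, n)
        for b in samples:
            # (X + C1) * C2 and its distributed form agree modulo 2**32.
            assert (((a + b) & MASK) * 5) & MASK == mul_distribute(a, b, 5)
            # The select form agrees with computing the add or sub directly.
            assert (a + b) & MASK == select_add_sub(True, a, b)
            assert (a - b) & MASK == select_add_sub(False, a, b)
            # (A ^ B) == A and (A - B) == A both reduce to B == 0.
            assert ((a ^ b) == a) == (b == 0)
            assert (((a - b) & MASK) == a) == (b == 0)
    return True


check_all()
```

The checks run over a small sample of values only, so they illustrate the identities rather than prove them. The signed variants (sdiv, srem) are left out because the log applies those rewrites only when the sign bits are known to be clear.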


748 Commits