clang-p2996

Author	SHA1	Message	Date
Chris Lattner	76b2ff4ded	Adjustments to support the new ConstantAggregateZero class llvm-svn: 11474	2004-02-15 05:55:15 +00:00
Chris Lattner	283ffdfac5	Fix compilation of 126.gcc: intrinsic functions cannot throw, so they are not allowed in invoke instructions. Thus, if we are inlining a call to an intrinsic function into an invoke site, we don't need to turn the call into an invoke! llvm-svn: 11384	2004-02-13 16:47:35 +00:00
Chris Lattner	18d1f19fba	Implement SimplifyCFG/PhiEliminate.ll Having a proper 'select' instruction would allow the elimination of a lot of the special case cruft in this patch, but we don't have one yet. llvm-svn: 11307	2004-02-11 03:36:04 +00:00
Chris Lattner	838b845781	The hasConstantReferences predicate always returns false. llvm-svn: 11301	2004-02-11 01:17:07 +00:00
Chris Lattner	fae8ab3088	rename the "exceptional" destination of an invoke instruction to the 'unwind' dest llvm-svn: 11202	2004-02-08 21:44:31 +00:00
Chris Lattner	39ad6f2772	Minor speedup, don't query ValueMap each time through the loop llvm-svn: 11123	2004-02-04 21:44:26 +00:00
Chris Lattner	6f8865bf9f	Two changes: 1. Don't scan to the end of alloca instructions in the caller function to insert inlined allocas, just insert at the top. This saves a lot of time inlining into functions with a lot of allocas. 2. Use splice to move the alloca instructions over, instead of remove/insert. This allows us to transfer a block at a time, and eliminates a bunch of silly symbol table manipulations. This speeds up the inliner on the testcase in PR209 from 1.73s -> 1.04s (67%) llvm-svn: 11118	2004-02-04 21:33:42 +00:00
Chris Lattner	0fa8c7c321	Optimize the case where we are inlining a function that contains only one basic block, and that basic block ends with a return instruction. In this case, we can just splice the cloned "body" of the function directly into the source basic block, avoiding a lot of rearrangement and splitBasicBlock's linear scan over the split block. This speeds up the inliner on the testcase in PR209 from 2.3s to 1.7s, a 35% reduction. llvm-svn: 11116	2004-02-04 04:17:06 +00:00
Chris Lattner	18ef3fda57	More refactoring. Move alloca instructions and handle invoke instructions before we delete the original call site, allowing slight simplifications of code, but nothing exciting. llvm-svn: 11109	2004-02-04 02:51:48 +00:00
Chris Lattner	9fc977eac4	Move the cloning of the function body much earlier in the inlinefunction process. The only optimization we did so far is to avoid creating a PHI node, then immediately destroying it in the common case where the callee has one return statement. Instead, we just don't create the return value. This has no noticable performance impact, but paves the way for future improvements. llvm-svn: 11108	2004-02-04 01:41:09 +00:00
Chris Lattner	a6578ef318	Give CloneBasicBlock an optional function argument to specify which function to add the cloned block to. This allows the block to be added to the function immediately, and all of the instructions to be immediately added to the function symbol table, which speeds up the inliner from 3.7 -> 3.38s on the PR209. llvm-svn: 11107	2004-02-04 01:19:43 +00:00
Chris Lattner	ae51cae111	Bunch up all locally used allocas by the block they are allocated in, and process them all as a group. This speeds up SRoA/mem2reg from 28.46s to 0.62s on the testcase from PR209. llvm-svn: 11100	2004-02-03 22:34:12 +00:00
Chris Lattner	3784188620	Handle extremely trivial cases extremely efficiently. This speeds up SRoA/mem2reg from 41.2s to 27.5s on the testcase in PR209. llvm-svn: 11099	2004-02-03 22:00:33 +00:00
Chris Lattner	6b052f2154	Clean up #includes llvm-svn: 10799	2004-01-12 19:56:36 +00:00
Chris Lattner	429963742e	Remove use of ConstantExpr::getShift llvm-svn: 10792	2004-01-12 19:10:58 +00:00
Chris Lattner	2853a7ed22	Remove use of ConstantHandling llvm-svn: 10789	2004-01-12 18:35:03 +00:00
Chris Lattner	fc6c859a0c	Move llvm::ConstantFoldInstruction from VMCore to here, next to ConstantFoldTerminator llvm-svn: 10785	2004-01-12 18:25:22 +00:00
Chris Lattner	fafa2ff2d6	Implement Transforms/ScalarRepl/phinodepromote.ll, which is an important case that the C/C++ front-end generates. llvm-svn: 10761	2004-01-12 01:18:32 +00:00
Chris Lattner	df3c342a4c	Finegrainify namespacification llvm-svn: 10727	2004-01-09 06:12:26 +00:00
Chris Lattner	04efa4b155	Add new function llvm-svn: 10529	2003-12-19 05:56:28 +00:00
Chris Lattner	a29600046d	Minor cleanups and simplifications llvm-svn: 10127	2003-11-21 16:52:05 +00:00
Chris Lattner	2af517281d	Start using the nicer terminator auto-insertion API llvm-svn: 10111	2003-11-20 18:25:24 +00:00
Chris Lattner	63a0ccff44	Spew symbolic types! llvm-svn: 10110	2003-11-20 18:23:14 +00:00
Brian Gaeke	960707c335	Put all LLVM code into the llvm namespace, as per bug 109. llvm-svn: 9903	2003-11-11 22:41:34 +00:00
Chris Lattner	1e6d3053f2	Reorganize code for locality, improve comments llvm-svn: 9857	2003-11-10 04:42:42 +00:00
Chris Lattner	4474336166	Adjust to new critical edge interface llvm-svn: 9853	2003-11-10 04:10:50 +00:00
Chris Lattner	38cd27e450	Various cleanups and efficiency improvements llvm-svn: 9753	2003-11-06 19:46:29 +00:00
Chris Lattner	8055fb3afa	Yet more fixes for constant expr shifts llvm-svn: 9739	2003-11-05 20:43:58 +00:00
Chris Lattner	ba55bd37fe	Further fixes for PR93 llvm-svn: 9738	2003-11-05 20:37:01 +00:00
John Criswell	81587e798a	Checking in Chris's suggestions: Added assert() to ensure symbol table is well formed. Added code to remember the value that was found; resolving types can change the symbol table and invalidate the value of the iterator. Added comments to the ResolveTypes() function (mainly for my own benefit). Please feel free to correct the comments if they are not accurate. llvm-svn: 9693	2003-11-04 15:22:26 +00:00
Chris Lattner	b727fb2663	Fix test: Linker/2003-10-27-LinkOncePromote.ll Fix PR58 llvm-svn: 9530	2003-10-27 16:39:39 +00:00
Chris Lattner	d9f4ffdf5e	Get the list of PHI node values before the basic block is split. Also, add PHI node entries for unwind instructions just like for call instructions which became invokes! This fixes PR57, tested by Inline/2003-10-26-InlineInvokeExceptionDestPhi.ll llvm-svn: 9526	2003-10-27 05:33:09 +00:00
Chris Lattner	4f2581f828	Fix bug: Linker/2003-10-21-ConflictingTypesTolerance.ll llvm-svn: 9357	2003-10-21 22:46:38 +00:00
Chris Lattner	9bc22b7439	Fix message to make more sense and confuse Chris less llvm-svn: 9354	2003-10-21 21:52:20 +00:00
John Criswell	29265fe981	Added LLVM copyright header. llvm-svn: 9321	2003-10-21 15:17:13 +00:00
John Criswell	4436c49787	Added LLVM copyright notice to Makefiles. llvm-svn: 9312	2003-10-20 22:26:57 +00:00
John Criswell	482202a601	Added LLVM project notice to the top of every C++ source file. Header files will be on the way. llvm-svn: 9298	2003-10-20 19:43:21 +00:00
Chris Lattner	b32f5748b7	Fix PR#50 llvm-svn: 9227	2003-10-18 06:14:59 +00:00
Chris Lattner	068ad84038	Add support for 'weak' linkage. llvm-svn: 9171	2003-10-16 18:29:00 +00:00
Chris Lattner	f77a856f3b	Cleanup llvm-svn: 9133	2003-10-15 16:42:21 +00:00
Chris Lattner	b4778c73c9	Do not move variable sized allocations to the top of the caller, which might break dominance relationships, and is otherwise bad. This fixes bug: Inline/2003-10-13-AllocaDominanceProblem.ll. This also fixes miscompilation of 3 176.gcc source files (reload1.c, global.c, flow.c) llvm-svn: 9109	2003-10-14 01:11:07 +00:00
Chris Lattner	72272a70b8	Rename loop preheaders pass to loop simplify llvm-svn: 9061	2003-10-12 21:52:28 +00:00
Misha Brukman	8b2bd4ed47	Fix spelling. llvm-svn: 9027	2003-10-10 17:57:28 +00:00
Chris Lattner	6aa34b0d0b	Avoid doing pointless work. Amazingly, this makes us go faster. Running the inliner on 252.eon used to take 48.4763s, now it takes 14.4148s. In release mode, it went from taking 25.8741s to taking 11.5712s. This also fixes a FIXME. llvm-svn: 8890	2003-10-06 15:23:43 +00:00
Chris Lattner	c30f22f57c	This changes the PromoteMemToReg function to create "pruned" SSA form, not "minimal" SSA form (in other words, it doesn't insert dead PHIs). This speeds up the mem2reg pass very significantly because it doesn't have to do a lot of frivolous work in many common cases. In the 252.eon function I have been playing with, this doesn't even insert the 120 PHI nodes that it used to which were trivially dead (in the process of promoting 356 alloca instructions overall). This speeds up the mem2reg pass from 1.2459s to 0.1284s. More significantly, the DCE pass used to take 2.4138s to remove the 120 dead PHI nodes that mem2reg constructed, now it takes 0.0134s (which is the time to scan the function and decide that there is nothing dead). So overall, on this one function, we speed things up a total of 3.5179s, which is a 24.8x speedup! :) This change is tested by the Mem2Reg/2003-10-05-DeadPHIInsertion.ll test, which now passes. llvm-svn: 8884	2003-10-05 22:19:20 +00:00
Chris Lattner	a906bacfdd	Change the interface to PromoteMemToReg to also take a DominatorTree llvm-svn: 8883	2003-10-05 21:20:13 +00:00
Chris Lattner	8047152977	Speed up the mem2reg transform for allocas which are only read/written in a single basic block. This is amazingly common in code generated by the C/C++ front-ends. This change makes it not have to insert ANY phi nodes, whereas before it would insert a ton of dead ones which DCE would have to clean up. Thus, this fix improves compile-time performance of these trivial allocas in two ways: 1. It doesn't have to do the walking and book-keeping for renaming 2. It does not insert dead phi nodes for them which would have to subsequently be cleaned up. On my favorite testcase from 252.eon, this special case handles 305 out of 356 promoted allocas in the function. It speeds up the mem2reg pass from 7.5256s to 1.2505s. It inserts 677 fewer dead PHI nodes, which speeds up a subsequent -dce pass from 18.7524s to 2.4806s. There are still 120 trivially dead PHI nodes being inserted for variables used in multiple basic blocks, but they are not handled by this patch. llvm-svn: 8881	2003-10-05 20:54:03 +00:00
Chris Lattner	a5721d3d03	The first PHI node may be null, scan for the first non-null one llvm-svn: 8865	2003-10-05 05:34:39 +00:00
Chris Lattner	203bc011e5	The VersionNumbers vector is only used during PHI placement. Turn it into an argument, allowing us to get rid of the vector. llvm-svn: 8864	2003-10-05 04:33:22 +00:00
Chris Lattner	7d9692df22	* Update file header comment *** Revamp the code which handled unreachable code in the function. Now the code is much more efficient for high-degree basic blocks, such as those that occur in the 252.eon SPEC benchmark. For the interested, the time to promote a SINGLE alloca in _ZN7mrScene4ReadERSi function used to be > 3.5s. Now it is < .075s. The function has a LOT of allocas in it, so it appeared to be infinite looping, this should make it much nicer. :) llvm-svn: 8863	2003-10-05 04:26:39 +00:00

1 2 3 4 5

239 Commits