clang-p2996

Author	SHA1	Message	Date
Daniel Berlin	43dab5a5fb	Make PredIteratorCache size() logically const. Do not require copying predecessors to get size. Summary: Every single benchmark i can run, on large and small cfgs, fully connected, etc, across 3 different platforms (x86, arm., and PPC) says that the current pred iterator cache is a losing proposition. I can't find a case where it's faster than just walking preds, and in some cases, it's 5-10% slower. This is due to copying the preds. It also degrades into copying the entire cfg. The one operation that is occasionally faster is the cached size. This makes that operation faster by not relying on having the copies available. I'm not even sure that is faster enough to be worth it. I, again, have trouble finding cases where this takes long enough in a pass to be worth caching compared to a million other things they could cache or improve. My suggestion: We next remove the get() interface. We do stronger benchmarking of size(). We probably end up killing this entire cache. / Reviewers: chandlerc Subscribers: aemerson, llvm-commits, trentxintong Differential Revision: https://reviews.llvm.org/D30873 llvm-svn: 297733	2017-03-14 11:25:45 +00:00
Sam Parker	916b1ba617	[ARM] Move SMULW[B\|T] isel to DAG Combine Create nodes for smulwb and smulwt and move their selection from DAGToDAG to DAG combine. smlawb and smlawt can then be selected using tablegen. Added some helper functions to detect shift patterns as well as a wrapper around SimplifyDemandBits. Added a couple of extra tests. Differential Revision: https://reviews.llvm.org/D30708 llvm-svn: 297716	2017-03-14 09:13:22 +00:00
Oren Ben Simhon	fe34c5e429	Disable Callee Saved Registers Each Calling convention (CC) defines a static list of registers that should be preserved by a callee function. All other registers should be saved by the caller. Some CCs use additional condition: If the register is used for passing/returning arguments – the caller needs to save it - even if it is part of the Callee Saved Registers (CSR) list. The current LLVM implementation doesn’t support it. It will save a register if it is part of the static CSR list and will not care if the register is passed/returned by the callee. The solution is to dynamically allocate the CSR lists (Only for these CCs). The lists will be updated with actual registers that should be saved by the callee. Since we need the allocated lists to live as long as the function exists, the list should reside inside the Machine Register Info (MRI) which is a property of the Machine Function and managed by it (and has the same life span). The lists should be saved in the MRI and populated upon LowerCall and LowerFormalArguments. The patch will also assist to implement future no_caller_saved_regsiters attribute intended for interrupt handler CC. Differential Revision: https://reviews.llvm.org/D28566 llvm-svn: 297715	2017-03-14 09:09:26 +00:00
Jonas Paulsson	a48ea231c0	[TargetTransformInfo] getIntrinsicInstrCost() scalarization estimation improved getIntrinsicInstrCost() used to only compute scalarization cost based on types. This patch improves this so that the actual arguments are checked when they are available, in order to handle only unique non-constant operands. Tests updates: Analysis/CostModel/X86/arith-fp.ll Transforms/LoopVectorize/AArch64/interleaved_cost.ll Transforms/LoopVectorize/ARM/interleaved_cost.ll The improvement in getOperandsScalarizationOverhead() to differentiate on constants made it necessary to update the interleaved_cost.ll tests even though they do not relate to intrinsics. Review: Hal Finkel https://reviews.llvm.org/D29540 llvm-svn: 297705	2017-03-14 06:35:36 +00:00
Nirav Dave	54e22f33d9	In visitSTORE, always use FindBetterChain, rather than only when UseAA is enabled. Recommiting with compiler time improvements Recommitting after fixup of 32-bit aliasing sign offset bug in DAGCombiner. * Simplify Consecutive Merge Store Candidate Search Now that address aliasing is much less conservative, push through simplified store merging search and chain alias analysis which only checks for parallel stores through the chain subgraph. This is cleaner as the separation of non-interfering loads/stores from the store-merging logic. When merging stores search up the chain through a single load, and finds all possible stores by looking down from through a load and a TokenFactor to all stores visited. This improves the quality of the output SelectionDAG and the output Codegen (save perhaps for some ARM cases where we correctly constructs wider loads, but then promotes them to float operations which appear but requires more expensive constant generation). Some minor peephole optimizations to deal with improved SubDAG shapes (listed below) Additional Minor Changes: 1. Finishes removing unused AliasLoad code 2. Unifies the chain aggregation in the merged stores across code paths 3. Re-add the Store node to the worklist after calling SimplifyDemandedBits. 4. Increase GatherAllAliasesMaxDepth from 6 to 18. That number is arbitrary, but seems sufficient to not cause regressions in tests. 5. Remove Chain dependencies of Memory operations on CopyfromReg nodes as these are captured by data dependence 6. Forward loads-store values through tokenfactors containing {CopyToReg,CopyFromReg} Values. 7. Peephole to convert buildvector of extract_vector_elt to extract_subvector if possible (see CodeGen/AArch64/store-merge.ll) 8. Store merging for the ARM target is restricted to 32-bit as some in some contexts invalid 64-bit operations are being generated. This can be removed once appropriate checks are added. This finishes the change Matt Arsenault started in r246307 and jyknight's original patch. Many tests required some changes as memory operations are now reorderable, improving load-store forwarding. One test in particular is worth noting: CodeGen/PowerPC/ppc64-align-long-double.ll - Improved load-store forwarding converts a load-store pair into a parallel store and a memory-realized bitcast of the same value. However, because we lose the sharing of the explicit and implicit store values we must create another local store. A similar transformation happens before SelectionDAG as well. Reviewers: arsenm, hfinkel, tstellarAMD, jyknight, nhaehnle llvm-svn: 297695	2017-03-14 00:34:14 +00:00
Adrian Prantl	19aadf57c8	Revert "Debug Info: Add basic support for external types references." This reverts commit r242302. External type refs of this form were never used by any LLVM frontend so this is effectively dead code. (They were introduced to support clang module debug info, but in the end we came up with a better design that doesn't use this feature at all.) rdar://problem/25897929 Differential Revision: https://reviews.llvm.org/D30917 llvm-svn: 297684	2017-03-13 22:56:14 +00:00
David Blaikie	1914c82d6c	Fix llvm-symbolizer to navigate both DW_AT_abstract_origin and DW_AT_specification in a single chain In a recent refactoring (r291959) this regressed to only following one or the other, not both, in a single chain. llvm-svn: 297676	2017-03-13 21:46:37 +00:00
Tim Northover	55e6f10d69	Revert "GlobalISel: move vector extract/insert inside generic opcode region." I was writing against an earlier branch and Volkan had already fixed this. llvm-svn: 297668	2017-03-13 21:25:10 +00:00
Tim Northover	0f1d32d557	GlobalISel: move vector extract/insert inside generic opcode region. Otherwise they won't be legalized or selected, causing instruction selection to fail horribly. llvm-svn: 297666	2017-03-13 21:18:59 +00:00
Volkan Keles	fc3f501498	[GlobalISel] Update PRE_ISEL_GENERIC_OPCODE_END marker llvm-svn: 297663	2017-03-13 20:31:45 +00:00
Jessica Paquette	c984e21394	[Outliner] Add tail call support This commit adds tail call support to the MachineOutliner pass. This allows the outliner to insert jumps rather than calls in areas where tail calling is possible. Outlined tail calls include the return or terminator of the basic block being outlined from. Tail call support allows the outliner to take returns and terminators into consideration while finding candidates to outline. It also allows the outliner to save more instructions. For example, in the X86-64 outliner, a tail called outlined function saves one instruction since no return has to be inserted. llvm-svn: 297653	2017-03-13 18:39:33 +00:00
Jonas Devlieghere	5eb9c81d82	[Linker] Provide callback for internalization Differential Revision: https://reviews.llvm.org/D30738 llvm-svn: 297649	2017-03-13 18:08:11 +00:00
Craig Topper	13a3af1931	[SelectionDAG] Enhance SDTCisSameNumEltsAs to work with scalar types and use it on extend/trunc/round operations. Currently we don't enforce that ISD::ANY_EXTEND, ZERO_EXTEND, SIGN_EXTEND, TRUNC, FP_ROUND, FP_EXTEND have the same number of elements(including scalar) between their input and output. Though we have them documented as such. Up until a few months ago x86 created nodes that violated this rule. That's all been fixed now, and we should enforce the rule going forward. In order to do this we need to allow SDTCisSameNumEltsAs to support scalar types and not enforce being a vector. If one type is scalar we will force the other type to also be scalar. Differential Revision: https://reviews.llvm.org/D30878 llvm-svn: 297648	2017-03-13 17:37:14 +00:00
Zachary Turner	8f6bdf69a2	Add missing include on <limits>. llvm-svn: 297646	2017-03-13 17:25:47 +00:00
Adrian Prantl	140a8569ce	API gardening: Rename FindAllocaDbgValue to findDbgValue (NFC) and use have it use SmallVectorImpl. There is nothing specific about allocas in this function. llvm-svn: 297643	2017-03-13 17:20:47 +00:00
Zachary Turner	326b4fe7f7	Use numeric_limits<size_t>::max() instead of size_t(-1). llvm-svn: 297641	2017-03-13 17:12:37 +00:00
Zachary Turner	e677a92241	Fix a warning due to signed/unsigned comparison. llvm-svn: 297639	2017-03-13 16:41:49 +00:00
Zachary Turner	309a0889a4	[ADT] Improve the genericity of llvm::enumerate(). There were some issues in the implementation of enumerate() preventing it from being used in various contexts. These were all related to the fact that it did not supporter llvm's iterator_facade_base class. So this patch adds support for that and additionally exposes a new helper method to_vector() that will evaluate an entire range and store the results in a vector. Differential Revision: https://reviews.llvm.org/D30853 llvm-svn: 297633	2017-03-13 16:24:10 +00:00
Zachary Turner	37b91512d3	Remove an unused variable. llvm-svn: 297632	2017-03-13 16:18:08 +00:00
Zachary Turner	407dec59a4	[llvm-pdbdump] Add support for dumping symbols from Yaml -> PDB. Previously we could round-trip type records from PDB -> Yaml -> PDB, but for symbols we could only go from PDB -> Yaml. This completes the round-tripping for symbols as well. llvm-svn: 297625	2017-03-13 14:57:45 +00:00
Aaron Ballman	345012dfa0	Reverting r297617 because it broke some bots: http://bb.pgr.jp/builders/cmake-llvm-x86_64-linux/builds/49970 llvm-svn: 297618	2017-03-13 12:24:51 +00:00
Aaron Ballman	f5cba91591	Add support for getting file system permissions and implement sys::fs::permissions to set them. Patch by James Henderson. llvm-svn: 297617	2017-03-13 12:17:14 +00:00
Konstantin Zhuravlyov	1867a87c3d	AMDGPU/RelocVisitor: Handle R_AMDGPU_ABS64 Test is in the separate patch. Differential Revision: https://reviews.llvm.org/D30027 llvm-svn: 297604	2017-03-13 06:03:11 +00:00
Anna Thomas	a10e3e4c34	[LVI] Add Datalayout to the class LazyValueInfo since all its Impls require it. NFC llvm-svn: 297583	2017-03-12 14:06:41 +00:00
Daniel Berlin	64e689938d	Split NewGVN class into a legacy pass and an impl, instead of a merged class. llvm-svn: 297576	2017-03-12 04:46:45 +00:00
Zachary Turner	6023fb58cc	[ADT] Add a DenseMapInfo<T> for shorts. Differential Revision: https://reviews.llvm.org/D30857 llvm-svn: 297552	2017-03-11 02:52:48 +00:00
Zachary Turner	de042776d8	Fix line endings of DenseMapInfo.h llvm-svn: 297550	2017-03-11 02:50:18 +00:00
Zachary Turner	dc41e69d4c	Remove eol-style:native from DenseMapInfo.h llvm-svn: 297549	2017-03-11 02:47:59 +00:00
Zachary Turner	d2efbae8e8	[Support] Add a formatv provider for Twine. llvm-svn: 297548	2017-03-11 02:45:50 +00:00
Daniel Berlin	14f5c47c1d	Remove opt-bisect support for "cases" in favor of debug counters Summary: Ths "cases" support was not quite finished, is unused, and is really just debug counters. (well, almost, debug counters are slightly more powerful, in that they can skip things at the start, too). Note, opt-bisect itself could also be implemented as a wrapper around debug counters, but not sure it's worth it ATM. I'll shove it on a todo list if we think it is. Reviewers: MatzeB, chandlerc Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D30856 llvm-svn: 297542	2017-03-11 01:41:03 +00:00
Sanjoy Das	3f1e8e0102	Use a WeakVH for UnknownInstructions in AliasSetTracker Summary: This change solves the same problem as D30726, except that this only throws out the bathwater. AST was not correctly tracking and deleting UnknownInstructions via handles. The existing code only tracks "pointers" in its `ASTCallbackVH`, so an UnknownInstruction (that isn't also def'ing a pointer used by another memory instruction) never gets a `ASTCallbackVH`. There are two other ways to solve this problem: - Use the `PointerRec` scheme for both known and unknown instructions. - Use a `CallbackVH` that erases the offending Instruction from the UnknownInstruction list. Both of the above changes seemed to be significantly (and unnecessarily IMO) more complex than this. Reviewers: chandlerc, dberlin, hfinkel, reames Subscribers: mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D30849 llvm-svn: 297539	2017-03-11 01:15:48 +00:00
Daniel Berlin	cd07a0f685	VNCoercion: Make the function signatures all consistent llvm-svn: 297537	2017-03-11 00:51:01 +00:00
Peter Collingbourne	780a4dd35f	LTO: Hash type identifier resolutions for LowerTypeTests. Differential Revision: https://reviews.llvm.org/D30553 llvm-svn: 297513	2017-03-10 21:35:17 +00:00
Daniel Berlin	47d7e1f9cb	Fix all these headers to properly mark the doxygen comments. llvm-svn: 297505	2017-03-10 20:44:39 +00:00
Dehao Chen	c2048155a0	Refactor the PSI to extract getCallSiteCount and remove checks for profile type. Summary: There is no need to check profile count as only CallInst will have metadata attached. Reviewers: eraman Reviewed By: eraman Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D30799 llvm-svn: 297500	2017-03-10 19:45:16 +00:00
Volkan Keles	04cb08cc83	[GlobalISel] Translate insertelement and extractelement Reviewers: qcolombet, aditya_nandakumar, dsanders, ab, t.p.northover, javed.absar Reviewed By: qcolombet Subscribers: dberris, rovka, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D30761 llvm-svn: 297495	2017-03-10 19:08:28 +00:00
Michael Kuperstein	5fb39a7966	[SLP] Revert everything that has to do with memory access sorting. This reverts r293386, r294027, r294029 and r296411. Turns out the SLP tree isn't actually a "tree" and we don't handle accessing the same packet of loads in several different orders well, causing miscompiles. Revert until we can fix this properly. llvm-svn: 297493	2017-03-10 18:59:07 +00:00
Volkan Keles	685fbda217	[GlobalISel] Make LegalizerInfo accessible in LegalizerHelper Summary: We don’t actually use LegalizerInfo in Legalizer pass, it’s just passed as an argument. In order to check if an instruction is legal or not, we need to get LegalizerInfo by calling `MI.getParent()->getParent()->getSubtarget().getLegalizerInfo()`. Instead, make LegalizerInfo accessible in LegalizerHelper. Reviewers: qcolombet, aditya_nandakumar, dsanders, ab, t.p.northover, kristof.beyls Reviewed By: qcolombet Subscribers: dberris, llvm-commits, rovka Differential Revision: https://reviews.llvm.org/D30838 llvm-svn: 297491	2017-03-10 18:34:57 +00:00
Zachary Turner	e48ace6a65	Add llvm::sys::fs::real_path. LLVM already has real_path like functionality, but it is cumbersome to use and involves clean up after (e.g. you have to call openFileForRead, then close the resulting FD). Furthermore, on Windows it doesn't work for directories since opening a directory and opening a file require slightly different flags. So I add a simple function `real_path` which works for all paths on all platforms and has a simple to use interface. In doing so, I add the ability to opt in to resolving tilde expressions (e.g. ~/foo), which are normally handled by the shell. Differential Revision: https://reviews.llvm.org/D30668 llvm-svn: 297483	2017-03-10 17:39:21 +00:00
Simon Pilgrim	debcbef038	Fix Wdocumentation warning llvm-svn: 297459	2017-03-10 14:01:50 +00:00
Simon Pilgrim	b02667c469	[APInt] Add APInt::insertBits() method to insert an APInt into a larger APInt We currently have to insert bits via a temporary variable of the same size as the target with various shift/mask stages, resulting in further temporary variables, all of which require the allocation of memory for large APInts (MaskSizeInBits > 64). This is another of the compile time issues identified in PR32037 (see also D30265). This patch adds the APInt::insertBits() helper method which avoids the temporary memory allocation and masks/inserts the raw bits directly into the target. Differential Revision: https://reviews.llvm.org/D30780 llvm-svn: 297458	2017-03-10 13:44:32 +00:00
Sanne Wouda	9dfa6ade4f	[Assembler] Add location info to unary expressions. Summary: This is a continuation of D28861. Add an SMLoc to MCUnaryExpr such that a better diagnostic can be given in case of an error in later stages of assembling. Reviewers: rengolin, grosbach, javed.absar, olista01 Reviewed By: olista01 Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D30581 llvm-svn: 297454	2017-03-10 13:08:20 +00:00
Simon Atanasyan	ec8dfb1ca7	[MC] Set SHT_MIPS_DWARF section type for all .debug_* sections on MIPS All MIPS .debug_* sections should be marked with ELF type SHT_MIPS_DWARF accordingly the specification [1]. Also the same section type is assigned to these sections by GNU tools. [1] ftp.software.ibm.com/software/os390/czos/dwarf/mips_extensions.pdf Differential Revision: https://reviews.llvm.org/D29789 llvm-svn: 297447	2017-03-10 08:22:20 +00:00
Daniel Berlin	5ac9179f6c	Move memory coercion functions from GVN.cpp to VNCoercion.cpp so they can be shared between GVN and NewGVN. Summary: These are the functions used to determine when values of loads can be extracted from stores, etc, and to perform the necessary insertions to do this. There are no changes to the functions themselves except reformatting, and one case where memdep was informed of a removed load (which was pushed into the caller). Reviewers: davide Subscribers: mgorny, llvm-commits, Prazek Differential Revision: https://reviews.llvm.org/D30478 llvm-svn: 297438	2017-03-10 04:54:10 +00:00
Matt Arsenault	f03bf9bdd7	PatternMatch; Add m_ZExtOrSExt matcher llvm-svn: 297432	2017-03-10 00:47:35 +00:00
Daniel Berlin	e3e69e1680	NewGVN: Rewrite DCE during elimination so we do it as well as old GVN did. llvm-svn: 297428	2017-03-10 00:32:33 +00:00
Ahmed Bougacha	d22b84b9d0	[GlobalISel] Use ImmutableCallSite instead of templates. NFC. ImmutableCallSite abstracts away CallInst and InvokeInst. Use it! llvm-svn: 297426	2017-03-10 00:25:44 +00:00
Daniel Berlin	04d9e746f1	Add support for DenseMap/DenseSet count and find using const pointers Summary: Similar to SmallPtrSet, this makes find and count work with both const referneces and const pointers. Reviewers: dblaikie Subscribers: llvm-commits, mzolotukhin Differential Revision: https://reviews.llvm.org/D30713 llvm-svn: 297424	2017-03-10 00:25:26 +00:00
Tim Northover	aa995c98f4	GlobalISel: support trivial inlineasm calls. They're used for nefarious purposes by ObjC. llvm-svn: 297422	2017-03-09 23:36:26 +00:00
Eli Friedman	93f47e5ffb	Refactor alias check from MISched into common helper. NFC. Differential Revision: https://reviews.llvm.org/D30598 llvm-svn: 297421	2017-03-09 23:33:36 +00:00

1 2 3 4 5 ...

30620 Commits