clang-p2996

Author	SHA1	Message	Date
Johannes Doerfert	066dbf3f8e	Track assumptions and restrictions separatly In order to speed up compile time and to avoid random timeouts we now separately track assumptions and restrictions. In this context assumptions describe parameter valuations we need and restrictions describe parameter valuations we do not allow. During AST generation we create a runtime check for both, whereas the one for the restrictions is negated before a conjunction is build. Except the In-Bounds assumptions we currently only track restrictions. Differential Revision: http://reviews.llvm.org/D17247 llvm-svn: 262328	2016-03-01 13:06:28 +00:00
Johannes Doerfert	abadd71da1	[FIX] Prevent compile time problems due to complex invariant loads This cures the symptoms we see in h264 of SPEC2006 but not the cause. llvm-svn: 262327	2016-03-01 13:05:14 +00:00
Tobias Grosser	64ca00c344	IslAst: Expose run-time check generation as individual function This allows to construct run-time checks for a scop without having to generate a full AST. This is currently not taken advantage of in Polly itself, but external users may benefit from this feature. llvm-svn: 262009	2016-02-26 12:59:38 +00:00
Hongbin Zheng	defd098612	Adapt to LLVM head, again llvm-svn: 261905	2016-02-25 17:54:42 +00:00
Hongbin Zheng	566c614525	Revert "Adapt to LLVM head. NFC" This reverts commit 4d3753b9646a69c00d234ccd6e91dc3d0ea5d643. llvm-svn: 261892	2016-02-25 16:46:17 +00:00
Hongbin Zheng	f4e35f9cb9	Adapt to LLVM head. NFC llvm-svn: 261886	2016-02-25 16:36:09 +00:00
Michael Kruse	8f25b0cb4d	Use inline local variable declaration. NFC. llvm-svn: 261876	2016-02-25 15:52:43 +00:00
Johannes Doerfert	a792098047	Support calls with known ModRef function behaviour Check the ModRefBehaviour of functions in order to decide whether or not a call instruction might be acceptable. Differential Revision: http://reviews.llvm.org/D5227 llvm-svn: 261866	2016-02-25 14:08:48 +00:00
Michael Kruse	f33c125dd2	Fix DomTree preservation for generated subregions. The generated dedicated subregion exit block was assumed to have the same dominance relation as the original exit block. This is incorrect if the exit block receives other edges than only from the subregion, which results in that e.g. the subregion's entry block does not dominate the exit block. llvm-svn: 261865	2016-02-25 14:08:48 +00:00
Michael Kruse	375cb5fe0a	Introduce ScopStmt::getEntryBlock(). NFC. This replaces an ungly inline ternary operator pattern. llvm-svn: 261792	2016-02-24 22:08:24 +00:00
Michael Kruse	6f7721f02b	Introduce Scop::getStmtFor. NFC. Replace Scop::getStmtForBasicBlock and Scop::getStmtForRegionNode, and add overloads for llvm::Instruction and llvm::RegionNode. getStmtFor and overloads become the common interface to get the Stmt that contains something. Named after LoopInfo::getLoopFor and RegionInfo::getRegionFor. llvm-svn: 261791	2016-02-24 22:08:19 +00:00
Michael Kruse	eac9726e8c	Add assertions checking def dominates use. NFC. This is also be caught by the function verifier, but disconnected from the place that produced it. Catch it already at creation to be able to reason more directly about the cause. llvm-svn: 261790	2016-02-24 22:08:14 +00:00
Roman Gareev	11001e1534	Annotation of SIMD loops Use 'mark' nodes annotate a SIMD loop during ScheduleTransformation and skip parallelism checks. The buildbot shows the following compile/execution time changes: Compile time: Improvements Δ Previous Current σ …/gesummv -6.06% 0.2640 0.2480 0.0055 …/gemver -4.46% 0.4480 0.4280 0.0044 …/covariance -4.31% 0.8360 0.8000 0.0065 …/adi -3.23% 0.9920 0.9600 0.0065 …/doitgen -2.53% 0.9480 0.9240 0.0090 …/3mm -2.33% 1.0320 1.0080 0.0087 Execution time: Regressions Δ Previous Current σ …/viterbi 1.70% 5.1840 5.2720 0.0074 …/smallpt 1.06% 12.4920 12.6240 0.0040 Reviewed-by: Tobias Grosser <tobias@grosser.es> Differential Revision: http://reviews.llvm.org/D14491 llvm-svn: 261620	2016-02-23 09:00:13 +00:00
Tobias Grosser	820cf20a98	IslAst: Expose IslAst class in header file [NFC] This allows other passes and transformations to use some of the existing AST building infrastructure. This is not yet used in Polly itself. llvm-svn: 261496	2016-02-21 20:01:28 +00:00
Tobias Grosser	2b809d1390	BlockGenerator: Drop unnecessary return value llvm-svn: 261473	2016-02-21 15:44:34 +00:00
Tobias Grosser	58e585444a	Codegen: Print error in Polly code verification and allow to disable verfication. We now always print the reason why the code did not pass the LLVM verifier and we also allow to disable verfication with -polly-codegen-verify=false. Before this change the first assertion had generally no information why or what might have gone wrong and it was also impossible to -view-cfg without recompile. This change makes debugging bugs that result in incorrect IR a lot easier. llvm-svn: 261320	2016-02-19 11:07:12 +00:00
Hongbin Zheng	8831eb7db4	[Refactor] Move isl_ctx into Scop. After we moved isl_ctx into Scop, we need to free the isl_ctx after freeing all isl objects, which requires the ScopInfo pass to be freed at last. But this is not guaranteed by the PassManager, and we need extra code to free the isl_ctx at the right time. We introduced a shared pointer to manage the isl_ctx, and distribute it to all analyses that create isl objects. As such, whenever we free an analyses with the shared_ptr (and also free the isl objects which are created by the analyses), we decrease the (shared) reference counter of the shared_ptr by 1. Whenever the reference counter reach 0 in the releaseMemory function of an analysis, that analysis will be the last one that hold any isl objects, and we can safely free the isl_ctx with that analysis. Differential Revision: http://reviews.llvm.org/D17241 llvm-svn: 261100	2016-02-17 15:49:21 +00:00
Johannes Doerfert	2c3ffc04f3	Replace getLoopForInst by getLoopForStmt This patch was extracted from http://reviews.llvm.org/D13611. llvm-svn: 260958	2016-02-16 12:36:14 +00:00
Johannes Doerfert	6a7c3e4bac	Set AST Build for all statements [NFC] llvm-svn: 260956	2016-02-16 12:11:03 +00:00
Tobias Grosser	652f780894	CodeGeneration: Add back verification of generated code This got accidentally dropped in r260025 llvm-svn: 260857	2016-02-14 20:56:49 +00:00
Johannes Doerfert	96e5471139	Separate invariant equivalence classes by type We now distinguish invariant loads to the same memory location if they have different types. This will cause us to pre-load an invariant location once for each type that is used to access it. However, we can thereby avoid invalid casting, especially if an array is accessed though different typed/sized invariant loads. This basically reverts the changes in r260023 but keeps the test cases. llvm-svn: 260045	2016-02-07 17:30:13 +00:00
Johannes Doerfert	adeab372ca	Simplify code [NFC] llvm-svn: 260030	2016-02-07 13:57:32 +00:00
Tobias Grosser	8ebdc2dd53	Make memory accesses with different element types optional We also disable this feature by default, as there are still some issues in combination with invariant load hoisting that slipped through my initial testing. llvm-svn: 260025	2016-02-07 08:48:57 +00:00
Tobias Grosser	107cd5f5f6	IslNodeBuilder: Invariant load hoisting of elements with differing sizes Always use access-instruction pointer type to load the invariant values. Otherwise mismatches between ScopArrayInfo element type and memory access element type will result in invalid casts. These type mismatches are after r259784 a lot more common and also arise with types of different size, which have not been handled before. Interestingly, this change actually simplifies the code, as we now have only one code path that is always taken, rather then a standard code path for the common case and a "fixup" code path that replaces the standard code path in case of mismatching types. llvm-svn: 260009	2016-02-06 21:23:39 +00:00
Tobias Grosser	d840fc7277	Support accesses with differently sized types to the same array This allows code such as: void multiple_types(char Short, char Float, char Double) { for (long i = 0; i < 100; i++) { Short[i] = (short )&Short[2 i]; Float[i] = (float )&Float[4 * i]; Double[i] = (double )&Double[8 * i]; } } To model such code we use as canonical element type of the modeled array the smallest element type of all original array accesses, if type allocation sizes are multiples of each other. Otherwise, we use a newly created iN type, where N is the gcd of the allocation size of the types used in the accesses to this array. Accesses with types larger as the canonical element type are modeled as multiple accesses with the smaller type. For example the second load access is modeled as: { Stmt_bb2[i0] -> MemRef_Float[o0] : 4i0 <= o0 <= 3 + 4i0 } To support code-generating these memory accesses, we introduce a new method getAccessAddressFunction that assigns each statement instance a single memory location, the address we load from/store to. Currently we obtain this address by taking the lexmin of the access function. We may consider keeping track of the memory location more explicitly in the future. We currently do _not_ handle multi-dimensional arrays and also keep the restriction of not supporting accesses where the offset expression is not a multiple of the access element type size. This patch adds tests that ensure we correctly invalidate a scop in case these accesses are found. Both types of accesses can be handled using the very same model, but are left to be added in the future. We also move the initialization of the scop-context into the constructor to ensure it is already available when invalidating the scop. Finally, we add this as a new item to the 2.9 release notes Reviewers: jdoerfert, Meinersbur Differential Revision: http://reviews.llvm.org/D16878 llvm-svn: 259784	2016-02-04 13:18:42 +00:00
Tobias Grosser	e2c31210b2	Revert "Support loads with differently sized types from a single array" This reverts commit (@259587). It needs some further discussions. llvm-svn: 259629	2016-02-03 05:53:27 +00:00
Tobias Grosser	5d3fc1ea43	Support loads with differently sized types from a single array We support now code such as: void multiple_types(char Short, char Float, char Double) { for (long i = 0; i < 100; i++) { Short[i] = (short )&Short[2 i]; Float[i] = (float )&Float[4 * i]; Double[i] = (double )&Double[8 * i]; } } To support such code we use as element type of the modeled array the smallest element type of all original array accesses. Accesses with larger types are modeled as multiple accesses with the smaller type. For example the second load access is modeled as: { Stmt_bb2[i0] -> MemRef_Float[o0] : 4i0 <= o0 <= 3 + 4i0 } To support jscop-rewritable memory accesses we need each statement instance to only be assigned a single memory location, which will be the address at which we load the value. Currently we obtain this address by taking the lexmin of the access function. We may consider keeping track of the memory location more explicitly in the future. llvm-svn: 259587	2016-02-02 22:05:29 +00:00
Johannes Doerfert	800e17a75c	Add const keyword to MemoryAccess argument [NFC] llvm-svn: 259504	2016-02-02 14:16:01 +00:00
Michael Kruse	70131d3416	Introduce MemAccInst helper class; NFC MemAccInst wraps the common members of LoadInst and StoreInst. Also use of this class in: - ScopInfo::buildMemoryAccess - BlockGenerator::generateLocationAccessed - ScopInfo::addArrayAccess - Scop::buildAliasGroups - Replace every use of polly::getPointerOperand Reviewers: jdoerfert, grosser Differential Revision: http://reviews.llvm.org/D16530 llvm-svn: 258947	2016-01-27 17:09:17 +00:00
Michael Kruse	ee6a4fc680	Unique phi write accesses Ensure that there is at most one phi write access per PHINode and ScopStmt. In particular, this would be possible for non-affine subregions with multiple exiting blocks. We replace multiple MAY_WRITE accesses by one MUST_WRITE access. The written value is constructed using a PHINode of all exiting blocks. The interpretation of the PHI WRITE's "accessed value" changed from the incoming value to the PHI like for PHI READs since there is no unique incoming value. Because region simplification shuffles around PHI nodes -- particularly with exit node PHIs -- the PHINodes at analysis time does not always exist anymore in the code generation pass. We instead remember the incoming block/value pair in the MemoryAccess. Differential Revision: http://reviews.llvm.org/D15681 llvm-svn: 258809	2016-01-26 13:33:27 +00:00
Tobias Grosser	f2cdd144e5	BlockGenerators: Replace getNewScalarValue with getNewValue Both functions implement the same functionality, with the difference that getNewScalarValue assumes that globals and out-of-scop scalars can be directly reused without loading them from their corresponding stack slot. This is correct for sequential code generation, but causes issues with outlining code e.g. for OpenMP code generation. getNewValue handles such cases correctly. Hence, we can replace getNewScalarValue with getNewValue. This is not only more future proof, but also eliminates a bunch of code. The only functionality that was available in getNewScalarValue that is lost is the on-demand creation of scalar values. However, this is not necessary any more as scalars are always loaded at the beginning of each basic block and will consequently always be available when scalar stores are generated. As this was not the case in older versions of Polly, it seems the on-demand loading is just some older code that has not yet been removed. Finally, generateScalarLoads also generated loads for values that are loop invariant, available in GlobalMap and which are preferred over the ones loaded in generateScalarLoads. Hence, we can just skip the code generation of such scalar values, avoiding the generation of dead code. Differential Revision: http://reviews.llvm.org/D16522 llvm-svn: 258799	2016-01-26 10:01:35 +00:00
Tobias Grosser	5c7f16be6b	BlockGenerators: Avoid redundant map lookup [NFC] llvm-svn: 258660	2016-01-24 14:16:59 +00:00
Johannes Doerfert	370cf00c9f	Make sure we preserve alignment information after hoisting invariant load In Polly, after hoisting loop invariant loads outside loop, the alignment information for hoisted loads are missing, this patch restore them. Contributed-by: Lawrence Hu <lawrence@codeaurora.org> Differential Revision: http://reviews.llvm.org/D16160 llvm-svn: 258105	2016-01-19 00:17:21 +00:00
Roman Gareev	b0c4e49a37	Fix of r257495. Remove redundant "FPM->add(createDemoteRegisterToMemoryPass());" llvm-svn: 257514	2016-01-12 20:47:48 +00:00
Roman Gareev	6ebc01c973	We do not need to schedule another loop interchange pass after Polly, as Polly should perform loop interchanges itself. This also fixes a bug we see due to the "loop-interchange" pass producing incorrect IR when compiling linpack-pc.c from the LLVM test-suite with "-polly-position=before-vectorizer". Reviewed-by: Tobias Grosser <tobias@grosser.es> llvm-svn: 257495	2016-01-12 17:59:06 +00:00
Johannes Doerfert	5dced2693e	Refactor canSynthesize in the BlockGenerators [NFC] llvm-svn: 256269	2015-12-22 19:08:49 +00:00
Johannes Doerfert	28f8ac1db2	Treat inline assembly as a constant in the code generation. llvm-svn: 256267	2015-12-22 19:08:24 +00:00
Johannes Doerfert	42df8d1db6	Reduce indention in BlockGenerator::trySynthesizeNewValue [NFC] llvm-svn: 256266	2015-12-22 19:08:01 +00:00
Tobias Grosser	fcabb155c1	BlockGenerators: Remove unnecessary const_cast llvm-svn: 256227	2015-12-22 01:41:25 +00:00
Tobias Grosser	5624d3c978	Adjust formatting to clang-format changes in 256149 llvm-svn: 256151	2015-12-21 12:38:56 +00:00
Tobias Grosser	184a4926b3	BlockGenerator: Use getArrayAccessFor for vector code generation getAccessFor does not guarantee a certain access to be returned in case an instruction is related to multiple accesses. However, in the vector code generation we want to know the stride of the array access of a store instruction. By using getArrayAccessFor we ensure we always get the correct memory access. This patch fixes a potential bug, but I was unable to produce a failing test case. Several existing test cases cover this code, but all of them already passed out of luck (or the specific but not-guaranteed order in which we build memory accesses). llvm-svn: 255715	2015-12-15 23:50:01 +00:00
Tobias Grosser	a69d4f0d83	VectorBlockGenerator: Generate scalar loads for vector statements When generating scalar loads/stores separately the vector code has not been updated. This commit adds code to generate scalar loads for vector code as well as code to assert in case scalar stores are encountered within a vector loop. llvm-svn: 255714	2015-12-15 23:49:58 +00:00
Tobias Grosser	0921477248	ScopInfo: Look up first (and only) array access When rewriting the access functions of load/store statements, we are only interested in the actual array memory location. The current code just took the very first memory access, which could be a scalar or an array access. As a result, we failed to update access functions even though this was requested via .jscop. llvm-svn: 255713	2015-12-15 23:49:53 +00:00
Michael Kruse	5bbc0e1888	Fix typos; NFC llvm-svn: 255580	2015-12-14 23:41:32 +00:00
Tobias Grosser	9bd0dad926	BlockGenerator: Do not use fast-path for external constants This change should not change the behavior of Polly today, but it allows external constants to be remapped e.g. when targetting multiple LLVM modules. llvm-svn: 255506	2015-12-14 16:19:59 +00:00
Tobias Grosser	6f764bbd9c	BlockGenerator: Drop unneeded const_casts llvm-svn: 255505	2015-12-14 16:19:54 +00:00
Tobias Grosser	a535dff471	ScopInfo: Harmonize the different array kinds Over time different vocabulary has been introduced to describe the different memory objects in Polly, resulting in different - often inconsistent - naming schemes in different parts of Polly. We now standartize this to the following scheme: KindArray, KindValue, KindPHI, KindExitPHI \| ------- isScalar -----------\| In most cases this naming scheme has already been used previously (this minimizes changes and ensures we remain consistent with previous publications). The main change is that we remove KindScalar to clearify the difference between a scalar as a memory object of kind Value, PHI or ExitPHI and a value (former KindScalar) which is a memory object modeling a llvm::Value. We also move all documentation to the Kind* enum in the ScopArrayInfo class, remove the second enum in the MemoryAccess class and update documentation to be formulated from the perspective of the memory object, rather than the memory access. The terms "Implicit"/"Explicit", formerly used to describe memory accesses, have been dropped. From the perspective of memory accesses they described the different memory kinds well - especially from the perspective of code generation - but just from the perspective of a memory object it seems more straightforward to talk about scalars and arrays, rather than explicit and implicit arrays. The last comment is clearly subjective, though. A less subjective reason to go for these terms is the historic use both in mailing list discussions and publications. llvm-svn: 255467	2015-12-13 19:59:01 +00:00
Tobias Grosser	2d3d4ec860	executeScopConditionally: Introduce special exiting block When introducing separate control flow for the original and optimized code we introduce now a special 'ExitingBlock': \ / EnteringBB \| SplitBlock---------\ _____\|_____ \| / EntryBB \ StartBlock \| (region) \| \| \_ExitingBB_/ ExitingBlock \| \| MergeBlock---------/ \| ExitBB / \ This 'ExitingBlock' contains code such as the final_reloads for scalars, which previously were just added to whichever statement/loop_exit/branch-merge block had been generated last. Having an explicit basic block makes it easier to find these constructs when looking at the CFG. llvm-svn: 255107	2015-12-09 11:38:22 +00:00
Tobias Grosser	2fd89da90d	Remove non-debug printing of domain set Contributed-by: Chris Jenneisch <chrisj@codeaurora.org> Differential Revision: http://reviews.llvm.org/D15094 llvm-svn: 254343	2015-11-30 22:59:41 +00:00
Michael Kruse	4c86a1d97b	IR cleanup after CodeGeneration Re-run canonicalization passes after Polly's code generation. The set of passes currently added here are nearly all the passes between --polly-position=early and --polly-position=before-vectorizer, i.e. all passes that would usually run after Polly. In order to run these only if Polly actually modified the code, we add a function attribute "polly-optimzed" to a function that contains generated code. The cleanup pass is skipped if the function does not have this attribute. There is no support by the (legacy) PassManager to run passes only under some conditions. One could have wrapped all transformation passes to run only when CodeGeneration changed the code, but the analyses would run anyway. This patch creates an independent pass manager. The disadvantages are that all analyses have to re-run even if preserved and it does not honor compiler switches like the PassManagerBuilder does. Differential Revision: http://reviews.llvm.org/D14333 llvm-svn: 254150	2015-11-26 12:36:25 +00:00

1 2 3 4 5 ...

483 Commits