clang-p2996

Author	SHA1	Message	Date
Nikita Popov	ae1cf4577c	[OpenMP] Convert some tests to opaque pointers (NFC)	2023-01-04 17:03:10 +01:00
Matt Arsenault	2e7640e6dc	OpenMPOpt: Fix null dereference on missing declaration cache Found by llvm-reduce fuzzing.	2023-01-03 16:26:37 -05:00
Matt Arsenault	c3054aeb5a	OpenMPOpt: Fix using wrong address space for alloca Using the function's address space makes no sense. Copied from the existing test, with more addrspace variation. Could just replace the existing one with this version if it's redundant.	2023-01-03 16:26:37 -05:00
Matt Arsenault	a87de3a6dc	OpenMPOpt: Fix introducing empty nvvm.annotations into module	2023-01-03 10:32:10 -05:00
Nikita Popov	aa8e9fac2a	[OpenMP] Convert some tests to opaque pointers (NFC)	2023-01-03 15:03:14 +01:00
Joseph Huber	bb4c6e7a06	[OpenMP] Remove folding logic for removed runtime function This function was removed from the device runtime at some point but we still have specialized code for it and an entry in the runtime kinds. Remove it as it is no longer necessary. Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D140402	2022-12-20 13:37:38 -06:00
Johannes Doerfert	4e0f464ce2	Reapply "[OpenMP][FIX] Restrict more unsound assmptions about threading" This reverts commit `3b05255812`. This patch got reverted due to an unrelated memory leak that has been fixed.	2022-12-19 18:27:52 -08:00
Johannes Doerfert	d4f3d8212a	[OpenMP][FIX] Ensure to inline `ompx::` functions after the rename in D140334	2022-12-19 16:41:49 -08:00
Mitch Phillips	3b05255812	Revert "[OpenMP][FIX] Restrict more unsound assmptions about threading" This reverts commit `07c3753480`. Reason: This change is dependent on a commit that needs to be rolled back because it broke the ASan buildbot. See https://reviews.llvm.org/rGfc21f2d7bae2e0be630470cc7ca9323ed5859892 for more information.	2022-12-16 17:56:38 -08:00
Johannes Doerfert	07c3753480	[OpenMP][FIX] Restrict more unsound assmptions about threading Even if all loads and stores are in `nosync` functions we cannot guarantee there is no synchronization going on between them. As such, we cannot use CFG reasoning. We could check the entire module, or, what happens now to minimize test churn, is to check if all accesses are in the same function that is `nosync`. A follow up will undo some of the regressions where possible. Similarly, reachability cannot be used to exclude an access if the access is not known to be executed by the same thread as the given instruction. The OpenMP-opt test was added for the latter problem.	2022-12-13 22:58:33 -08:00
Johannes Doerfert	23333bb6b7	[NFC] Rerun update test checks on Attributor and OpenMP-Opt tests	2022-12-13 18:44:19 -08:00
Johannes Doerfert	90609fb68f	[OpenMP][NFCI] Remove effectively dead code in clang and the runtime Differential Revision: https://reviews.llvm.org/D136903	2022-12-13 18:44:19 -08:00
Johannes Doerfert	f9c29878b0	Revert "[OpenMP][NFCI] Remove effectively dead code in clang and the runtime" This reverts commit `c1c8cbbf5f`. One of the tests seems to be flaky/non-deterministic.	2022-12-12 22:08:28 -08:00
Johannes Doerfert	f622446769	[OpenMP][FIX] Ensure combing accesses does not violate invariants	2022-12-12 20:55:36 -08:00
Johannes Doerfert	c1c8cbbf5f	[OpenMP][NFCI] Remove effectively dead code in clang and the runtime	2022-12-12 20:55:36 -08:00
Bjorn Pettersson	3528e63d89	[test] Remove duplicate RUN lines in Transform tests	2022-12-08 11:47:16 +01:00
Johannes Doerfert	2dd158d655	[OpenMP] Make barrier elimination work in the presence of llvm.assume Assumptions are droppable and eliminating them to eliminate barriers seems reasonable.	2022-12-07 22:37:57 -08:00
Johannes Doerfert	f6e3a89cc0	[AMDGPU] Annotate the intrinsics to be default and nocallback Differential Revision: https://reviews.llvm.org/D135155	2022-12-07 14:25:25 -08:00
Matt Arsenault	0a67e771f6	CallGraph: Fix IgnoreAssumeLikeCalls option to Function::hasAddressTaken This was added in `29e2d9461a` and likely never worked in a useful way. The test added for it fails when converted to opaque pointers, since the lifetime intrinsic now directly uses the address. The code was only trying to handle a user indirectly through a bitcast instruction. That would never have been useful; a bitcast of a global value would be folded to a ConstantExpr cast. I also don't understand why it was special casing use_empty on the cast. Relax the check to be either BitCastOperator or AddrSpaceCastOperator. In practice, BitCastOperator won't appear today. I believe the change in parallel_deletion_cg_update is a correct improvement but I didn't fully follow it. .omp_outlined..0 is used in a constant expression cast to a call which ends up getting deleted.	2022-12-05 21:41:59 -05:00
LiaoChunyu	2c2c9688f0	[OpenMP][LegacyPM] Remove OpenMPOptCGSCCLegacyPass Using the legacy pass manager for the optimization pipeline is deprecated. I see the new PM is available. Reviewed By: aeubanks, jdoerfert Differential Revision: https://reviews.llvm.org/D139004	2022-12-01 09:21:10 +08:00
Nikita Popov	304f1d59ca	[IR] Switch everything to use memory attribute This switches everything to use the memory attribute proposed in https://discourse.llvm.org/t/rfc-unify-memory-effect-attributes/65579. The old argmemonly, inaccessiblememonly and inaccessiblemem_or_argmemonly attributes are dropped. The readnone, readonly and writeonly attributes are restricted to parameters only. The old attributes are auto-upgraded both in bitcode and IR. The bitcode upgrade is a policy requirement that has to be retained indefinitely. The IR upgrade is mainly there so it's not necessary to update all tests using memory attributes in this patch, which is already large enough. We could drop that part after migrating tests, or retain it longer term, to make it easier to import IR from older LLVM versions. High-level Function/CallBase APIs like doesNotAccessMemory() or setDoesNotAccessMemory() are mapped transparently to the memory attribute. Code that directly manipulates attributes (e.g. via AttributeList) on the other hand needs to switch to working with the memory attribute instead. Differential Revision: https://reviews.llvm.org/D135780	2022-11-04 10:21:38 +01:00
Johannes Rudolf Doerfert	41a278f56a	[OpenMP][FIX] Do not add custom state machine eagerly in LTO runs If we run LTO optimization we might end up introducing a custom state machine and later transforming the region into SPMD. This is a problem. While a follow up will introduce a check for the SPMD conversion, this already prevents the eager custom state machine generation. Only if the kernel init function is defined, rather than declared, we will emit a custom state machine. SPMD-zation can happen eagerly though. Tests are adjusted via a weak definition. The LTO test was added to verify this works as expected. Differential Revision: https://reviews.llvm.org/D136740	2022-10-26 10:40:11 -07:00
Arthur Eubanks	e23aee7175	[test] Update some legacy PM tests	2022-09-30 11:31:02 -07:00
Doru Bercea	c9adeca501	Move allocas converted from __kmpc_alloc_shared to entry block.	2022-09-27 17:16:58 +00:00
Sebastian Peryt	99c9b37d11	[NFC][1/n] Remove -enable-new-pm=0 flags from lit tests This is the first patch in a series intended for removing flag -enable-new-pm=0 from lit tests. This is part of a bigger effort of completely removing legacy code related to legacy pass manager in favor of currently default new pass manager. In this patch the flag has been removed only from tests where no significant change has been required because checks have been duplicated for both PMs. Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D134150	2022-09-19 09:57:37 -07:00
Johannes Doerfert	c922cac868	Revert "[Attributor] AAPointerInfo should allow "harmless" uses" Revert "[Attributor] Teach AAPointerInfo to look into aggregates" This reverts commit `844f6c5d03` and `4ed0a88cd8` as they broke the buildbots that run openmp/libomptarget/test/offloading/bug49021.cpp.	2022-09-11 21:37:54 -07:00
Johannes Doerfert	844f6c5d03	[Attributor] AAPointerInfo should allow "harmless" uses If a call base use will not capture a pointer we can approximate the effects. This is important especially for readnone/only uses.	2022-09-11 20:16:11 -07:00
Johannes Doerfert	4ed0a88cd8	[Attributor] Teach AAPointerInfo to look into aggregates If we have a constant aggregate, e.g., as an initializer, we usually failed to extract the proper value/type from it. This patch provides the size and offset information necessary to extract the right part of the constant.	2022-09-11 20:16:11 -07:00
Johannes Doerfert	b046ebdc01	[Attributor][FIX] Conservatively handle ptr2int, don't crash If a pointer-2-int cast is found we give up on AAPointerInfo for now. This caused a crash before. Reported by John Tramm (@jtramm).	2022-09-11 20:16:11 -07:00
Johannes Doerfert	21711039e3	[OpenMP] Allow the Attributor to look at functions we also internalized This is important as we have accesses to globals in those which we need to categorize.	2022-09-11 20:16:11 -07:00
Augie Fackler	4fea8ee540	OpenMP: mark allocptr attribute on __kmpc_free_shared Differential Revision: https://reviews.llvm.org/D124491	2022-09-09 14:09:18 -04:00
Joseph Huber	58645d3252	[OpenMP] Fix `omp_get_wtime` function being marked incorrectly as readonly OpenMP has a list of optimistic attributes that can be attached to known runtime functions to aid some analysis. The `omp_get_wtime` function incorrectly used the `readonly` attribute. This is not correct as the `omp_get_wtime` function changes values depending on some external state. This is more correctly modeled with `inaccessiblememonly` meaning that the value does not depend on anything within the module, but cannot be removed as it depends on external state. Fixes #57578 Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D133360	2022-09-06 12:59:00 -05:00
Doru Bercea	0b1160fdeb	Fix OpenMP Opt for target without a parallel region. Remove ctx redeclaration. Format code. Remove parallel check. Modify tests. Clean-up code. Fix another test. Move code to helper functions. Format file. Minor fixes.	2022-09-06 16:04:53 +00:00
Fangrui Song	c2a3888793	[IR] Use Min behavior for module flag "PIC Level" Using Max for both "PIC Level" and "PIE Level" is inconsistent. PIC imposes less restriction while PIE imposes more restriction. The result generally picks the more restrictive behavior: Min for PIC. This choice matches `ld -r`: a non-pic object and a pic object merge into a result which should be treated as non-pic. To allow linking "PIC Level" using Error/Max from old bitcode files, upgrade Error/Max to Min. Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D130531	2022-08-18 16:28:55 -07:00
Joseph Huber	b08369f7f2	Revert "[OpenMP] Remove noinline attributes in the device runtime" The behaviour of this patch is not great, but it has some side-effects that are required for OpenMPOpt to work. The problem is that when we use `-mlink-builtin-bitcode` we only import used symbols from the runtime. Then OpenMPOpt will insert calls to symbols that were not previously included. This patch removed this implicit behaviour as these functions were kept alive by the `noinline` simply because it kept calls to them in the module. This caused regression in some tests that relied on some OpenMPOpt passes without using LTO. Reverting for the LLVM15 release but will try to fix it more correctly on main. This reverts commit `d61d72dae6`. Fixes #56752	2022-07-27 11:09:18 -04:00
Joseph Huber	d61d72dae6	[OpenMP] Remove noinline attributes in the device runtime We previously used the `noinline` attributes to specify some definitions which should be kept alive in the runtime. These were then stripped immediately in the OpenMPOpt module pass. However, since the changes in D130298, we now explicitly state which functions will have external visibility in the bitcode library. Additionally the OpenMPOpt module pass should run before the inliner pass, so this shouldn't make a difference in whether or not the functions will be alive for the initial pass of OpenMPOpt. This should simplify the interface, and additionally save time spent on scanning function names for noinline. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D130368	2022-07-25 15:44:50 -04:00
Shilei Tian	0c86c4f50c	[OpenMP] Fix test error introduced in D130179	2022-07-22 14:16:47 -04:00
Johannes Doerfert	bf789b1957	[Attributor] Replace AAValueSimplify with AAPotentialValues For the longest time we used `AAValueSimplify` and `genericValueTraversal` to determine "potential values". This was problematic for many reasons: - We recomputed the result a lot as there was no caching for the 9 locations calling `genericValueTraversal`. - We added the idea of "intra" vs. "inter" procedural simplification only as an afterthought. `genericValueTraversal` did offer an option but `AAValueSimplify` did not. Thus, we might end up with "too much" simplification in certain situations and then gave up on it. - Because `genericValueTraversal` was not a real `AA` we ended up with problems like the infinite recursion bug (#54981) as well as code duplication. This patch introduces `AAPotentialValues` and replaces the `AAValueSimplify` uses with it. `genericValueTraversal` is folded into `AAPotentialValues` as are the instruction simplifications performed in `AAValueSimplify` before. We further distinguish "intra" and "inter" procedural simplification now. `AAValueSimplify` was not deleted as we haven't ported the re-materialization of instructions yet. There are other differences over the former handling, e.g., we may not fold trivially foldable instructions right now, e.g., `add i32 1, 1` is not folded to `i32 2` but if an operand would be simplified to `i32 1` we would fold it still. We are also even more aware of function/SCC boundaries in CGSCC passes, which is good even if some tests look like they regress. Fixes: https://github.com/llvm/llvm-project/issues/54981 Note: A previous version was flawed and consequently reverted in `6555558a80`.	2022-07-19 16:24:42 -05:00
Johannes Doerfert	f6e0c05e3d	Revert "[Attributor] Replace AAValueSimplify with AAPotentialValues" This reverts commit `f17639ea0c` as three AMDGPU tests haven't been updated. Will need to verify the changes are not regressions we should avoid.	2022-07-08 00:53:38 -05:00
Johannes Doerfert	f17639ea0c	[Attributor] Replace AAValueSimplify with AAPotentialValues For the longest time we used `AAValueSimplify` and `genericValueTraversal` to determine "potential values". This was problematic for many reasons: - We recomputed the result a lot as there was no caching for the 9 locations calling `genericValueTraversal`. - We added the idea of "intra" vs. "inter" procedural simplification only as an afterthought. `genericValueTraversal` did offer an option but `AAValueSimplify` did not. Thus, we might end up with "too much" simplification in certain situations and then gave up on it. - Because `genericValueTraversal` was not a real `AA` we ended up with problems like the infinite recursion bug (#54981) as well as code duplication. This patch introduces `AAPotentialValues` and replaces the `AAValueSimplify` uses with it. `genericValueTraversal` is folded into `AAPotentialValues` as are the instruction simplifications performed in `AAValueSimplify` before. We further distinguish "intra" and "inter" procedural simplification now. `AAValueSimplify` was not deleted as we haven't ported the re-materialization of instructions yet. There are other differences over the former handling, e.g., we may not fold trivially foldable instructions right now, e.g., `add i32 1, 1` is not folded to `i32 2` but if an operand would be simplified to `i32 1` we would fold it still. We are also even more aware of function/SCC boundaries in CGSCC passes, which is good even if some tests look like they regress. Fixes: https://github.com/llvm/llvm-project/issues/54981 Note: A previous version was flawed and consequently reverted in `6555558a80`.	2022-07-08 00:38:27 -05:00
Johannes Doerfert	cb26b01d57	[Attributor] Make heap2stack record alloca placement We recently learned to place the alloca during the heap2stack transformation in the entry block but we did not account for other concurrent modifications. We need to record our decision rather than checking (then outdated) passes during the manifest stage. This will also allow us to use a custom (=optimistic) "loop info" in the future.	2022-07-07 16:49:22 -05:00
Johannes Doerfert	c771eaf07e	[OpenMP] Ensure to not use SPMD mode in the absence of parallel regions	2022-07-07 16:49:22 -05:00
Johannes Doerfert	07766f4070	[Attributor] Move heap2stack allocas to the entry block if possible If we are certainly not in a loop we can directly emit the heap2stack allocas in the function entry block. This will help to get rid of them (SROA) and avoid stacksave/restore intrinsics when the function is inlined.	2022-07-01 21:34:12 -05:00
Joseph Huber	c7243f21d3	[OpenMP] Only strip runtime attributes if needed Summary: Currently in OpenMPOpt we strip `noinline` attributes from runtime functions. This is here because the device bitcode library that we link has problems with needed definitions getting prematurely optimized out. This is only necessary for OpenMP offloading to GPUs so we should narrow the scope for where we spend time doing this. In the future this shouldn't be necessary as we move to using a linked library rather than pulling in a bitcode library in Clang.	2022-06-27 13:35:41 -04:00
Johannes Doerfert	6555558a80	Revert "[Attributor] Replace AAValueSimplify with AAPotentialValues" This reverts commit `da50dab1ae`. Patch broke AMD GPU OpenMP offload buildbots. https://lab.llvm.org/buildbot/#/builders/193/builds/13246	2022-06-09 17:04:01 +02:00
Johannes Doerfert	da50dab1ae	[Attributor] Replace AAValueSimplify with AAPotentialValues For the longest time we used `AAValueSimplify` and `genericValueTraversal` to determine "potential values". This was problematic for many reasons: - We recomputed the result a lot as there was no caching for the 9 locations calling `genericValueTraversal`. - We added the idea of "intra" vs. "inter" procedural simplification only as an afterthought. `genericValueTraversal` did offer an option but `AAValueSimplify` did not. Thus, we might end up with "too much" simplification in certain situations and then gave up on it. - Because `genericValueTraversal` was not a real `AA` we ended up with problems like the infinite recursion bug (#54981) as well as code duplication. This patch introduces `AAPotentialValues` and replaces the `AAValueSimplify` uses with it. `genericValueTraversal` is folded into `AAPotentialValues` as are the instruction simplifications performed in `AAValueSimplify` before. We further distinguish "intra" and "inter" procedural simplification now. `AAValueSimplify` was not deleted as we haven't ported the re-materialization of instructions yet. There are other differences over the former handling, e.g., we may not fold trivially foldable instructions right now, e.g., `add i32 1, 1` is not folded to `i32 2` but if an operand would be simplified to `i32 1` we would fold it still. We are also even more aware of function/SCC boundaries in CGSCC passes, which is good. Fixes: https://github.com/llvm/llvm-project/issues/54981	2022-06-09 16:48:53 +02:00
Johannes Doerfert	94841c713f	[Attributor] Try to delete stores and simplify stored values By default we should try to eliminate unused stores and simplify values stored while we are at it.	2022-06-09 16:48:53 +02:00
Johannes Doerfert	ae10b8a582	[Attributor][FIX] Give registered simplification callbacks precedence We accidentally checked for constants before we looked for registered simplification callbacks. The latter needs to take precedence though.	2022-06-09 15:31:53 +02:00
Johannes Doerfert	cb8adf76f7	[Attributor] Simplify loads from constant globals If a global is constant and the initializer is known we can simplify loads from it as the value has to be the initializer.	2022-06-09 13:41:23 +02:00
Joseph Huber	dbffa4073c	[NVVM] Update intrinsic definitions to include the `nocallback` attribute This patch adds the `nocallback` attribute to the NVVM intrinsics that did not use the `DefaultAttrsIntrinsic` method that includes it already. The `nocallback` attribute states that the intrinsic function cannot enter back into the caller's translation-unit. This allows us to determine that a function calling a `nocallback` function can have the `norecurse` attribute. This should be safe for all the NVVM intrinsics because they do not call other functions within the translation unit. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D125937	2022-05-19 12:30:35 -04:00

255 Commits