_InterlockedIncrement and _InterlockedDecrement have 'long' in their
prototypes. We assumed 'long' was the same size as an i32, which is
incorrect for targets where 'long' is not 32 bits.
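For reference, the MSVC-style prototypes in question look like this
(a sketch, not the exact intrin.h text):
  // 'long' here is 32 bits on Windows, but 64 bits on typical LP64 targets.
  long _InterlockedIncrement(long volatile *Addend);
  long _InterlockedDecrement(long volatile *Addend);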
This fixes PR27892.
llvm-svn: 270953
OpenCL builtin functions to_{global|local|private} accept an argument of pointer type with an arbitrary pointee type, and return a pointer to the same pointee type in a different address space, i.e.
global gentype *to_global(gentype *p);
It is not desirable to declare them as
global void *to_global(void *);
in the OpenCL header file, since that would miss diagnostics.
This patch implements these builtin functions as Clang builtin functions. In the builtin def file they are defined to have the signature void*(void*). When handling call expressions, their declarations are re-written to have the correct parameter and return types corresponding to the call argument.
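For example (an illustrative instantiation of the generic prototype above, not declarations added by the patch), a call whose argument has type float * is handled as if the builtin were declared
global float *to_global(float *p);
while a call whose argument has type long * is handled as if it were declared
global long *to_global(long *p);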
In codegen, a call to void *to_addr(void *) is generated, with addrspacecasts or bitcasts inserted to facilitate the implementation in the builtin library.
Differential Revision: http://reviews.llvm.org/D19932
llvm-svn: 270261
This matches what trunk gcc accepts. Also adds a missing ssse3
case. PR27779. The amount of duplication here is annoying; maybe it
should be factored into a separate .def file?
llvm-svn: 270224
Summary:
Previously it was implemented as inline asm in the CUDA headers.
This change allows us to use the [addr+imm] addressing mode when
executing ld.global.nc instructions. This translates into a 1.3x
speedup on some benchmarks that call this instruction from within an
unrolled loop.
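A hypothetical device-code sketch of the pattern that benefits (assuming
the builtin in question is CUDA's __ldg, which lowers to ld.global.nc;
the function name below is made up):
  __device__ float sum4(const float *p) {
    float s = 0.0f;
  #pragma unroll
    for (int i = 0; i < 4; ++i)
      s += __ldg(p + i);  // each load can now fold "+ i" into an [addr+imm] operand
    return s;
  }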
Reviewers: tra, rsmith
Subscribers: jhen, cfe-commits, jholewinski
Differential Revision: http://reviews.llvm.org/D19990
llvm-svn: 270150
The intrinsic is now called llvm.thread.pointer, not
llvm.aarch64.thread.pointer. Also, the code handling it in CGBuiltin.cpp
is dead - it's already covered by GCCBuiltin. Remove it.
Differential Revision: http://reviews.llvm.org/D19099
llvm-svn: 266817
r259537 added vfma/vfms to armv7, but the builtin was only lowered
on the AArch64 side. Instead of supporting it on ARM, get rid of it.
The vfms builtin lowered to:
%nb = fsub float -0.0, %b
%r = @llvm.fma.f32(%a, %nb, %c)
Instead, define the operation in terms of vfma, and swap the
multiplicands. It now lowers to:
%na = fsub float -0.0, %a
%r = @llvm.fma.f32(%na, %b, %c)
This matches the instruction more closely, and lets current LLVM
generate the "natural" operand ordering:
fmls.2s v0, v1, v2
instead of the crooked (but equivalent):
fmls.2s v0, v2, v1
Except for these changes, the assembly is identical.
LLVM accepts both commutations, and the LLVM tests in:
test/CodeGen/AArch64/arm64-fmadd.ll
test/CodeGen/AArch64/fp-dp3.ll
test/CodeGen/AArch64/neon-fma.ll
test/CodeGen/ARM/fusedMAC.ll
already check either the new one only, or both.
Also verified against the test-suite unittests.
llvm-svn: 266807
isinf (is infinite) and isfinite should be implemented with the same function,
except that we change the comparison operator.
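Roughly, in C terms (a sketch of the shared shape, not the actual CGBuiltin
code; islessgreater is used so that NaN compares false, matching the ordered
compare the codegen emits):
  #include <math.h>
  static int sketch_isinf(double x)    { return fabs(x) == INFINITY; }
  static int sketch_isfinite(double x) { return islessgreater(fabs(x), INFINITY); }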
See PR27145 for more details:
https://llvm.org/bugs/show_bug.cgi?id=27145
Ref: forked off of the discussion in D18513.
Differential Revision: http://reviews.llvm.org/D18648
llvm-svn: 265675
Summary: See LLVM change D18775 for details; this change depends on it.
Reviewers: jyknight, reames
Subscribers: cfe-commits
Differential Revision: http://reviews.llvm.org/D18776
llvm-svn: 265569
These functions cannot be implemented as atomicrmw or cmpxchg
instructions, so they are implemented as calls to the NVVM intrinsics
@llvm.nvvm.atomic.load.inc.32.p0i32 and
@llvm.nvvm.atomic.load.dec.32.p0i32.
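The reason a plain atomicrmw does not work is the wrapping semantics; a
sketch, assuming these builtins back CUDA's atomicInc/atomicDec helpers:
  // atomicInc(p, limit): old = *p; *p = (old >= limit) ? 0 : old + 1;
  // atomicDec(p, limit): old = *p; *p = (old == 0 || old > limit) ? limit : old - 1;
  __global__ void take_ticket(unsigned *slot, unsigned *out) {
    *out = atomicInc(slot, 9u);  // cycles the counter through 0..9
  }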
Patch by Jason Henline.
Reviewers: jlebar
Differential Revision: http://reviews.llvm.org/D18322
llvm-svn: 264009
Includes a new built-in, conversion of the built-in to a target-independent intrinsic,
and an update to the header file. Tests are also updated. There is a second part in
the backend, for which I will post a separate code review. THE BACKEND PART SHOULD BE
COMMITTED FIRST.
Phabricator: http://reviews.llvm.org/D17816
llvm-svn: 263051
Fix arc patch fuzz error.
Summary:
Support for the pipe built-in functions for OpenCL 2.0.
The pipe builtin functions may take infinitely many element types; one approach
is to just generate calls that always use generic types such as void*.
This patch is based on bader's opencl support patch on SPIR-V branch.
Reviewers: Anastasia, pekka.jaaskelainen
Subscribers: keryell, bader, cfe-commits
Differential Revision: http://reviews.llvm.org/D15914
llvm-svn: 258782
Summary:
Support for the pipe built-in functions for OpenCL 2.0.
The pipe builtin functions may take infinitely many element types; one approach
is to just generate calls that always use generic types such as void*.
This patch is based on bader's opencl support patch on SPIR-V branch.
Reviewers: Anastasia, pekka.jaaskelainen
Subscribers: keryell, bader, cfe-commits
Differential Revision: http://reviews.llvm.org/D15914
llvm-svn: 258773
Keep the ones still used by libclc around for now.
Emit the new amdgcn intrinsic name when not targeting r600;
for r600 the old AMDGPU name is still used.
llvm-svn: 258560
`pass_object_size` is our way of enabling `__builtin_object_size` to
produce high quality results without requiring inlining to happen
everywhere.
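A minimal usage sketch (made-up function name, loosely following the style
of the attribute documentation's example):
  // Each caller evaluates __builtin_object_size for its argument and passes
  // the result along, so the query below is useful without any inlining.
  int zero_fill(char *const p __attribute__((pass_object_size(0)))) {
    int n = (int)__builtin_object_size(p, 0);
    for (int i = 0; i < n; ++i)
      p[i] = 0;
    return n;
  }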
A link to the design doc for this attribute is available at the
Differential review link below.
Differential Revision: http://reviews.llvm.org/D13263
llvm-svn: 254554
Function __builtin_signbit returns the wrong value for type ppcf128 on big endian
machines. This patch fixes how the value is generated in that case.
Patch by Aleksandar Beserminji.
Differential Revision: http://reviews.llvm.org/D14149
llvm-svn: 252307
Require all of a builtin's comma-separated target features rather than
only one of a group of possibilities.
This changes the syntax in the builtin files to represent:
',' as the and operator
'|' as the or operator
The former syntax matches how the backend tablegen files represent
multiple subtarget features being required.
Updated the builtin and intrinsic headers accordingly for the new
syntax.
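For illustration (hypothetical builtin names, using the Builtins*.def
TARGET_BUILTIN form):
  TARGET_BUILTIN(__builtin_ia32_foo, "V4fV4fV4f", "", "sse3,ssse3")  // requires sse3 AND ssse3
  TARGET_BUILTIN(__builtin_ia32_bar, "V8fV8fV8f", "", "avx|avx512f") // requires avx OR avx512f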
llvm-svn: 251388
Add an error when calling target builtins that require features that don't
match the feature set of the function that they're being called from.
This ensures that we can effectively diagnose some[1] code that would
instead ICE in the backend with a failure-to-select message.
Example:
#include <xmmintrin.h>  /* for the __m128 type */
__m128 foo(__m128 a, __m128 b) {
  return __builtin_ia32_addsubps(b, a);
}
compiled for normal x86_64 via:
clang -target x86_64-linux-gnu -c
would fail to compile in the back end because the normal subtarget
features for x86_64 only include sse2 and the builtin requires sse3.
[1] We're still not erroring on:
__m128i bar(__m128i const *p) { return _mm_lddqu_si128(p); }
where we should fail and error on an always_inline function being
inlined into a function that doesn't support the subtarget features
required.
llvm-svn: 250473