clang-p2996

Author	SHA1	Message	Date
Krzysztof Parzyszek	f608812bde	[Hexagon] Handle VACOPY in isel lowering llvm-svn: 326599	2018-03-02 18:35:57 +00:00
Simon Pilgrim	8cbc1d232b	[X86][BTVER2] Fix throughput of YMM bitwise instructions These instructions are double-pumped, split into 2 128-bit ops and then passing through either FPU pipe. Found while testing llvm-mca (D43951) llvm-svn: 326597	2018-03-02 18:20:35 +00:00
Craig Topper	6b1419b547	[X86] Reject xmm16-31 in inline asm constraints when AVX512 is disabled Fixes PR36532 Differential Revision: https://reviews.llvm.org/D43960 llvm-svn: 326596	2018-03-02 18:19:40 +00:00
Derek Schuff	57feeed307	[X86][x32] Save callee-save register used as base pointer for x32 ABI For the x32 ABI, since the base pointer register (EBX) is a callee save register it should be saved before use. This fixes https://bugs.llvm.org/show_bug.cgi?id=36011 Differential Revision: https://reviews.llvm.org/D42358 Patch by Pratik Bhatu llvm-svn: 326593	2018-03-02 17:46:39 +00:00
Benjamin Kramer	4925653555	[ARM] Fold variable into assert. Avoids unused variable warnings in Release mode. llvm-svn: 326592	2018-03-02 17:39:20 +00:00
Matt Arsenault	b9699c009d	AMDGPU/GlobalISel: InstrMapping for G_ZEXT llvm-svn: 326589	2018-03-02 16:55:37 +00:00
Matt Arsenault	1c1aab99ae	AMDGPU/GlobalISel: InstrMapping for G_TRUNC llvm-svn: 326588	2018-03-02 16:55:33 +00:00
Matt Arsenault	ef8db767d7	AMDGPU/GlobalISel: Define InstrMappings for G_FCMP Patch by Tom Stellard llvm-svn: 326587	2018-03-02 16:53:15 +00:00
Matt Arsenault	2607dc60de	AMDGPU/GlobalISel: Define instruction mapping for @llvm.minnum Patch by Tom Stellard llvm-svn: 326586	2018-03-02 16:40:17 +00:00
Momchil Velikov	505614bb4f	[ARM] Fix access to stack arguments when re-aligning SP in Armv6m When an Armv6m function dynamically re-aligns the stack, access to incoming stack arguments (and to stack area, allocated for register varargs) is done via SP, which is incorrect, as the SP is offset by an unknown amount relative to the value of SP upon function entry. This patch fixes it, by making access to "fixed" frame objects be done via FP when the function needs stack re-alignment. It also changes the access to "fixed" frame objects be done via FP (instead of using R6/BP) also for the case when the stack frame contains variable sized objects. This should allow more objects to fit within the immediate offset of the load instruction. All of the above via a small refactoring to reuse the existing `ARMFrameLowering::ResolveFrameIndexReference.` Differential Revision: https://reviews.llvm.org/D43566 llvm-svn: 326584	2018-03-02 15:47:14 +00:00
Stefan Pintilie	b5a9440a80	[Power9] Add missing instructions to the Power 9 scheduler Adding more instructions using InstRW so that we can move away from ItinRW and ultimately have a complete Power 9 scheduler. llvm-svn: 326578	2018-03-02 14:41:38 +00:00
David Stenberg	3fb8c324b3	Test commit: Remove an extraneous space. NFC Test commit access. llvm-svn: 326573	2018-03-02 14:28:56 +00:00
Nicholas Wilson	be28e61a03	Revert "[WebAssembly] More uses of uint8_t" and "[WebAssembly] Update tests" This reverts commits r326541 and r326571. The tests were correct, and were updated with incorrect expectations. The original commit was broken and should be reverted to get things back to a working state. llvm-svn: 326572	2018-03-02 14:07:39 +00:00
Florian Hahn	9deef20b6c	[ARM] Fix codegen for VLD3/VLD4/VST3/VST4 with WB Code generation of VLD3, VLD4, VST3 and VST4 with register writeback is broken due to 2 separate bugs: 1) VLD1d64TPseudoWB_register and VLD1d64QPseudoWB_register are missing rules to expand them to non pseudo MIR. These are selected for ARMISD::VLD3_UPD/VLD4_UPD with v1i64 vectors in SelectVLD. 2) Selection of the right VLD/VST instruction is broken for load and store of 3 and 4 v1i64 vectors. SelectVLD and SelectVST are called with MIR opcode for fixed writeback (ie increment is access size) and call getVLDSTRegisterUpdateOpcode() to select an opcode with register writeback if base register update is of a different size. Since getVLDSTRegisterUpdateOpcode() only knows about VLD1/VLD2/VST1/VST2 the call is currently conditional on the number of element in the vector. However, VLD1/VST1 is selected by SelectVLD/SelectVST's caller for load and stores of 3 or 4 v1i64 vectors. Therefore the opcode is not updated which later lead to a fixed writeback instruction being constructed with an extra operand for the register writeback. This patch addresses the two issues as follows: - it adds the necessary mapping from VLD1d64TPseudoWB_register and VLD1d64QPseudoWB_register to VLD1d64Twb_register and VLD1d64Qwb_register respectively. Like for the existing _fixed variants, the cost of these is bumped for unaligned access. - it changes the logic in SelectVLD and SelectVSD to call isVLDfixed and isVSTfixed respectively to decide whether the opcode should be updated. It also reworks the logic and comments for pushing the writeback offset operand and r0 operand to clarify the logic: writeback offset needs to be pushed if it's a register writeback, r0 needs to be pushed if not and the instruction is a VLD1/VLD2/VST1/VST2. Reviewers: rengolin, t.p.northover, samparker Reviewed By: samparker Patch by Thomas Preud'homme <thomas.preudhomme@arm.com> Differential Revision: https://reviews.llvm.org/D42970 llvm-svn: 326570	2018-03-02 13:02:55 +00:00
Matt Arsenault	b46c191c49	AMDGPU/GlobalISel: Define instruction mapping for @llvm.maxnum Patch by Tom Stellard llvm-svn: 326567	2018-03-02 12:23:00 +00:00
Simon Pilgrim	c879aa7eab	[X86] Remove old UNIMPLEMENTED list All of these are implemented and have appropriate test coverage llvm-svn: 326553	2018-03-02 11:59:37 +00:00
Heejin Ahn	d684cb57f4	[WebAssembly] More uses of uint8_t for single byte values Summary: It looks like this was missing from D43921. Reviewers: sbc100 Subscribers: jfb, dschuff, jgravelle-google, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D43991 llvm-svn: 326541	2018-03-02 06:51:35 +00:00
Jan Vesely	b283ea0f0f	AMDGPU/GCN: Promote i16 ctpop i16 capable ASICs do not support i16 operands for this instruction. Add tablegen pattern to merge chained i16 additions. Differential Revision: https://reviews.llvm.org/D43985 llvm-svn: 326535	2018-03-02 02:50:22 +00:00
Matt Arsenault	41d2e3d98e	AMDGPU/GlobalISel: Define instruction mapping for G_FPTOSI Patch by Tom Stellard llvm-svn: 326534	2018-03-02 02:19:16 +00:00
Matt Arsenault	b23041ad4d	AMDGPU/GlobalISel: Define instruction mapping for G_FPTOUI Patch by Tom Stellard llvm-svn: 326533	2018-03-02 02:19:11 +00:00
Matt Arsenault	327d5fb2e5	AMDGPU/GlobalISel: Define instruction mapping for G_FMUL llvm-svn: 326532	2018-03-02 02:17:01 +00:00
Matt Arsenault	5a9e834eac	AMDGPU/GlobalISel: Define instruction mapping for G_FADD Patch by Tom Stellard llvm-svn: 326526	2018-03-02 01:22:13 +00:00
Matt Arsenault	d99317f1b3	AMDGPU/GlobalISel: Define instruction mapping for G_SHL Patch by Tom Stellard llvm-svn: 326525	2018-03-02 01:22:10 +00:00
Matt Arsenault	3c7a123ccc	AMDGPU/GlobalISel: Define instruction mapping for G_XOR llvm-svn: 326524	2018-03-02 01:22:06 +00:00
Matt Arsenault	c0f34c9e36	AMDGPU/GlobalISel: Define instruction mapping for G_AND Patch by Tom Stellard llvm-svn: 326523	2018-03-02 01:22:01 +00:00
Heejin Ahn	e4a8deea84	[WebAssembly] Gather EH instructions in one place. NFC. Summary: - Gather EH instructions in one place for easy tracking (more will be added later) - Variable name change Reviewers: dschuff Subscribers: jfb, sbc100, jgravelle-google, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D43742 llvm-svn: 326522	2018-03-02 01:03:40 +00:00
Yonghong Song	03e1c8b8f9	bpf: introduce -mattr=dwarfris to disable DwarfUsesRelocationsAcrossSections Commit e4507fb8c94b ("bpf: disable DwarfUsesRelocationsAcrossSections") disables MCAsmInfo DwarfUsesRelocationsAcrossSections unconditionally so that dwarf will not use cross section (between dwarf and symbol table) relocations. This new debug format enables pahole to dump structures correctly as libdwarves.so does not have BPF backend support yet. This new debug format, however, breaks bcc (https://github.com/iovisor/bcc) source debug output as llvm in-memory Dwarf support has some issues to handle it. More specifically, with DwarfUsesRelocationsAcrossSections disabled, JIT compiler does not generate .debug_abbrev and Dwarf DIE (debug info entry) processing is not happy about this. This patch introduces a new flag -mattr=dwarfris (dwarf relocation in section) to disable DwarfUsesRelocationsAcrossSections. DwarfUsesRelocationsAcrossSections is true by default. Signed-off-by: Yonghong Song <yhs@fb.com> llvm-svn: 326505	2018-03-01 23:04:59 +00:00
Simon Pilgrim	90fd0622b6	[X86][MMX] Improve handling of 64-bit MMX constants 64-bit MMX constant generation usually ends up lowering into SSE instructions before being spilled/reloaded as a MMX type. This patch bitcasts the constant to a double value to allow correct loading directly to the MMX register. I've added MMX constant asm comment support to improve testing, it's better to always print the double values as hex constants as MMX is mainly an integer unit (and even with 3DNow! its just floats). Differential Revision: https://reviews.llvm.org/D43616 llvm-svn: 326497	2018-03-01 22:22:31 +00:00
Krzysztof Parzyszek	c5e0ed109d	[Hexagon] Add trap1 instruction llvm-svn: 326492	2018-03-01 21:54:08 +00:00
Matt Arsenault	364f12e8f9	AMDGPU/GlobalISel: Define instruction mapping for @llvm.amdgcn.cvt.pkrtz Patch by Tom Stellard llvm-svn: 326490	2018-03-01 21:25:30 +00:00
Matt Arsenault	5320ee4a05	AMDGPU/GlobalISel: Define instruction mapping for G_OR Patch by Tom Stellard llvm-svn: 326489	2018-03-01 21:25:25 +00:00
Matt Arsenault	e65404f5c5	AMDGPU/GlobalISel: Remove default register mapping This crashes for some opcodes, which prevents the SelectionDAG fallback from working. Patch by Tom Stellard llvm-svn: 326487	2018-03-01 21:20:44 +00:00
Evandro Menezes	2bbb4a7c93	[AArch64] Clean up code (NFC) Clean up a couple of functions in `AArch64TargetLowering` by removing redundant statements. llvm-svn: 326486	2018-03-01 21:17:36 +00:00
Matt Arsenault	1422a19a88	AMDGPU/GlobalISel: Use a more correct getValueMapping This was finding the wrong size registers for anything with more than 2 components. Patch by Tom Stellard llvm-svn: 326483	2018-03-01 21:08:51 +00:00
Matt Arsenault	62669ede94	AMDGPU/GlobalISel: Define instruction mapping for G_BITCAST Patch by Tom Stellard llvm-svn: 326482	2018-03-01 20:59:44 +00:00
Matt Arsenault	0529a8e2de	AMDGPU/GlobalISel: Mark i32->i64 zext as legal llvm-svn: 326481	2018-03-01 20:56:21 +00:00
Martin Storsjo	c61ff3bef1	[AArch64] Add support for secrel add/load/store relocations for COFF Differential Revision: https://reviews.llvm.org/D43288 llvm-svn: 326480	2018-03-01 20:42:28 +00:00
Matt Arsenault	36b99e1937	AMDGPU/GlobalISel: InstrMapping for llvm.amdgcn.exp.compr Patch by Tom Stellard llvm-svn: 326479	2018-03-01 20:40:55 +00:00
Matt Arsenault	8931bbf8df	AMDGPU/GlobalISel: Define instruction mapping for @llvm.amdgcn.exp Patch by Tom Stellard llvm-svn: 326477	2018-03-01 20:24:37 +00:00
Matt Arsenault	50721ab325	AMDGPU/GlobalISel: Define InstrMappings for G_ICMP Patch by Tom Stellard llvm-svn: 326472	2018-03-01 19:27:10 +00:00
Matt Arsenault	dc14ec05d4	AMDGPU/GlobalISel: Make i32 mul legal llvm-svn: 326471	2018-03-01 19:22:05 +00:00
Matt Arsenault	06cbb27a79	AMDGPU/GlobalISel: Define instruction mapping for G_IMPLICIT_DEF Patch by Tom Stellard llvm-svn: 326470	2018-03-01 19:16:52 +00:00
Matt Arsenault	e3d9ecf2b9	AMDGPU/GlobalISel: Define instruction mapping for G_FCONSTANT Patch by Tom Stellard llvm-svn: 326468	2018-03-01 19:13:30 +00:00
Matt Arsenault	51b0b20023	AMDGPU/GlobalISel: Add copyCost for VGPR->SGPR copies Patch by Tom Stellard llvm-svn: 326467	2018-03-01 19:09:25 +00:00
Matt Arsenault	3f6a204eaa	AMDGPU/GlobalISel: Make i32 xor legal llvm-svn: 326466	2018-03-01 19:09:21 +00:00
Matt Arsenault	8e80a5fbca	AMDGPU/GlobalISel: Mark 32/64-bit G_FCMP as legal Patch by Tom Stellard llvm-svn: 326465	2018-03-01 19:09:16 +00:00
Matt Arsenault	dd022ce064	AMDGPU/GlobalISel: Mark 32-bit G_FPTOSI as legal Patch by Tom Stellard llvm-svn: 326464	2018-03-01 19:04:25 +00:00
Sam Clegg	503fdea3cb	[WebAssembly] Fix broken gcc build after rL326454 The gcc builders were broken by rL326454 See: https://reviews.llvm.org/D43921 llvm-svn: 326460	2018-03-01 18:48:08 +00:00
Artem Belevich	8c9749b1dc	[NVPTX] use pattern matching to lower int_nvvm_match_all_sync*. Now that patterns can handle intrinsics returning multiple results, use tablegen'ed pattern matching instead of custom lowering. Differential Revision: https://reviews.llvm.org/D43890 llvm-svn: 326457	2018-03-01 18:28:45 +00:00
Sam Clegg	03e101f1b0	[WebAssembly] Use uint8_t for single byte values to match the spec The original BinaryEncoding.md document used to specify that these values were `varint7`, but the official spec lists them explicitly as single byte values and not LEB. A similar change for wabt is in flight: https://github.com/WebAssembly/wabt/pull/782 Differential Revision: https://reviews.llvm.org/D43921 llvm-svn: 326454	2018-03-01 18:06:21 +00:00

1 2 3 4 5 ...

46336 Commits