clang-p2996

Author	SHA1	Message	Date
Greg McGary	93c8559baf	[lld-macho] Implement branch-range-extension thunks Extend the range of calls beyond an architecture's limited branch range by first calling a thunk, which loads the far address into a scratch register (x16 on ARM64) and branches through it. Other ports (COFF, ELF) use multiple passes with successively-refined guesses regarding the expansion of text-space imposed by thunk-space overhead. This MachO algorithm places thunks during MergedOutputSection::finalize() in a single pass using exact thunk-space overheads. Thunks are kept in a separate vector to avoid the overhead of inserting into the `inputs` vector of `MergedOutputSection`. FIXME: * arm64-stubs.s test is broken * add thunk tests * Handle thunks to DylibSymbol in MergedOutputSection::finalize() Differential Revision: https://reviews.llvm.org/D100818	2021-05-12 09:44:58 -07:00
Jez Ng	1aa29dffce	[lld-macho] Support subtractor relocations that reference sections The minuend (but not the subtrahend) can reference a section. Note that we do not yet properly validate that the subtrahend isn't referencing a section; I've filed PR50034 to track that. I've also extended the reloc-subtractor.s test to reorder symbols, to make sure that the addends are being associated with the minuend (and not the subtrahend) relocation. Fixes PR49999. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D100804	2021-04-20 16:58:57 -04:00
Jez Ng	4938b090cf	[lld-macho] Don't use arrays as template parameters MSVC from VSCode 2017 appears unhappy with it (causes an internal compiler error.) This also means that we need to avoid doing `sizeof(stubCode)` as `sizeof(int[N])` on function array parameters decays into `sizeof(int *)`. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D100605	2021-04-15 21:16:34 -04:00
Jez Ng	3bc88eb392	[lld-macho] Add support for arm64_32 From what I can tell, it's pretty similar to arm64. The two main differences are: 1. No 64-bit relocations 2. Stub code writes to 32-bit registers instead of 64-bit Plus of course the various on-disk structures like `segment_command` are using the 32-bit instead of the 64-bit variants. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D99822	2021-04-15 21:16:33 -04:00
Jez Ng	8ca366935b	Revert "[lld-macho] Add support for arm64_32" and other stacked diffs This reverts commits: * `8914902b01` * `35a745d814` * `682d1dfe09`	2021-04-13 12:40:58 -04:00
Jez Ng	8914902b01	[lld-macho] Add support for arm64_32 From what I can tell, it's pretty similar to arm64. The two main differences are: 1. No 64-bit relocations 2. Stub code writes to 32-bit registers instead of 64-bit Plus of course the various on-disk structures like `segment_command` are using the 32-bit instead of the 64-bit variants. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D99822	2021-04-13 10:43:28 -04:00
Jez Ng	88cb786ec2	[lld-macho][nfc] Remove DYSYM8 reloc attribute It's likely redundant, per discussion with @gkm. The BYTE8 attribute covers the bit width requirement already. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D100133	2021-04-09 19:48:08 -04:00
Jez Ng	817d98d841	[lld-macho][nfc] Refactor in preparation for 32-bit support The main challenge was handling the different on-disk structures (e.g. `mach_header` vs `mach_header_64`). I tried to strike a balance between sprinkling `target->wordSize == 8` checks everywhere (branchy = slow, and ugly) and templatizing everything (causes code bloat, also ugly). I think I struck a decent balance by judicious use of type erasure. Note that LLD-ELF has a similar architecture, though it seems to use more templating. Linking chromium_framework takes about the same time before and after this change: N Min Max Median Avg Stddev x 20 4.52 4.67 4.595 4.5945 0.044423204 + 20 4.5 4.71 4.575 4.582 0.056344803 No difference proven at 95.0% confidence Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D99633	2021-04-02 18:46:39 -04:00
Greg McGary	427d359721	[lld-macho][NFC] Drop unnecessary macho:: namespace prefix on unambiguous references to Symbol Within `lld/macho/`, only `InputFiles.cpp` and `Symbols.h` require the `macho::` namespace qualifier to disambiguate references to `class Symbol`. Add braces to outer `for` of a 5-level single-line `if`/`for` nest. Differential Revision: https://reviews.llvm.org/D99555	2021-03-30 14:58:35 -07:00
Jez Ng	dc8bee9265	[lld-macho] Check address ranges when applying relocations This diff required fixing `getEmbeddedAddend` to apply sign extension to 32-bit values. We were previously passing around wrong 64-bit addend values that became "right" after being truncated back to 32-bit. I've also made `getEmbeddedAddend` return a signed int, which is similar to what LLD-ELF does for its `getImplicitAddend`. `reportRangeError`, `checkUInt`, and `checkInt` are counterparts of similar functions in LLD-ELF. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D98387	2021-03-12 17:26:27 -05:00
Jez Ng	a723db92d8	[lld-macho][nfc] Refactor subtractor reloc handling SUBTRACTOR relocations are always paired with UNSIGNED relocations to indicate a pair of symbols whose address difference we want. Functionally they are like a single relocation: only one pointer gets written / relocated. Previously, we would handle these pairs by skipping over the SUBTRACTOR relocation and writing the pointer when handling the UNSIGNED reloc. This diff reverses things, so we write while handling SUBTRACTORs and skip over the UNSIGNED relocs instead. Being able to distinguish between SUBTRACTOR and UNSIGNED relocs in the write phase (i.e. inside `relocateOne`) is useful for the upcoming range check diff: we want to check that SUBTRACTOR relocs write signed values, but UNSIGNED relocs (naturally) write unsigned values. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D98386	2021-03-11 13:28:13 -05:00
Jez Ng	5433a79176	[lld-macho][nfc] Create Relocations.{h,cpp} for relocation-specific code This more closely mirrors the structure of lld-ELF. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D98384	2021-03-11 13:28:09 -05:00
Nico Weber	7d26916859	[lld/mac] tweak comment based on feedback on D98053	2021-03-05 16:28:38 -05:00
Nico Weber	210cc0738b	[mac/lld] Fix scale computation for vector ops in PAGEOFF12 relocations With this, llvm-tblgen no longer tries and fails to allocate 7953 petabyte when it runs during the build. Instead, `check-llvm` with lld/mac as host linker now completes without any failures on an m1 mac. This vector op handling code matches what happens in: - ld64's OutputFile::applyFixUps() in OutputFile.cpp for kindStoreARM64PageOff12 - lld.ld64.darwinold's offset12KindFromInstruction() in lld/lib/ReaderWriter/MachO/ArchHandler_arm64.cpp for offset12scale16 - RuntimeDyld's decodeAddend() in llvm/lib/ExecutionEngine/RuntimeDyld/Targets/RuntimeDyldMachOAArch64.h for ARM64_RELOC_PAGEOFF12 Fixes PR49444. Differential Revision: https://reviews.llvm.org/D98053	2021-03-05 12:24:37 -05:00
Jez Ng	82b3da6f6f	[lld-macho] Extract embedded addends for arm64 UNSIGNED relocations On arm64, UNSIGNED relocs are the only ones that use embedded addends instead of the ADDEND relocation. Also ensure that the addend works when UNSIGNED is part of a SUBTRACTOR pair. Reviewed By: #lld-macho, alexshap Differential Revision: https://reviews.llvm.org/D97105	2021-02-27 12:31:34 -05:00
Jez Ng	541390131e	[lld-macho] Don't emit rebase opcodes for subtractor minuend relocs Also add a few asserts to verify that we are indeed handling an UNSIGNED relocation as the minued. I haven't made it an actual user-facing error since I don't think llvm-mc is capable of generating SUBTRACTOR relocations without an associated UNSIGNED. Reviewed By: #lld-macho, smeenai Differential Revision: https://reviews.llvm.org/D97103	2021-02-27 12:31:34 -05:00
Jez Ng	cc5c03e109	[lld-macho] Properly test subtractor relocations & fix their attributes `llvm-mc` doesn't generate any relocations for subtractions between local symbols -- they must be global -- so the previous test wasn't actually testing any relocation logic. I've fixed that and extended the test to cover r_length=3 relocations as well as both x86_64 and arm64. Reviewed By: #lld-macho, smeenai Differential Revision: https://reviews.llvm.org/D97057	2021-02-27 12:31:34 -05:00
Jez Ng	5e851733c5	[lld-macho] Fix semantics & add tests for ARM64 GOT/TLV relocs I've adjusted the RelocAttrBits to better fit the semantics of the relocations. In particular: 1. _UNSIGNED relocations are no longer marked with the `TLV` bit, even though they can occur within TLV sections. Instead the `TLV` bit is reserved for relocations that can reference thread-local symbols, and _UNSIGNED relocations have their own `UNSIGNED` bit. The previous implementation caused TLV and regular UNSIGNED semantics to be conflated, resulting in rebase opcodes being incorrectly emitted for TLV relocations. 2. I've added a new `POINTER` bit to denote non-relaxable GOT relocations. This distinction isn't important on x86 -- the GOT relocations there are either relaxable or non-relaxable loads -- but arm64 has `GOT_LOAD_PAGE21` which loads the page that the referent symbol is in (regardless of whether the symbol ends up in the GOT). This relocation must reference a GOT symbol (so must have the `GOT` bit set) but isn't itself relaxable (so must not have the `LOAD` bit). The `POINTER` bit is used for relocations that must reference a GOT slot. 3. A similar situation occurs for TLV relocations. 4. ld64 supports both a pcrel and an absolute version of ARM64_RELOC_POINTER_TO_GOT. But the semantics of the absolute version are pretty weird -- it results in the value of the GOT slot being written, rather than the address. (That means a reference to a dynamically-bound slot will result in zeroes being written.) The programs I've tried linking don't use this form of the relocation, so I've dropped our partial support for it by removing the relevant RelocAttrBits. Reviewed By: alexshap Differential Revision: https://reviews.llvm.org/D97031	2021-02-23 22:02:38 -05:00
Mikael Holmen	caff023b77	[lld] Silence compiler warnings by removing always true/false comparisons type is an uint8_t so type >= 0 is always true and type < 0 is always false.	2021-02-17 08:16:02 +01:00
Greg McGary	87104faac4	[lld-macho] Add ARM64 target arch This is an initial base commit for ARM64 target arch support. I don't represent that it complete or bug-free, but wish to put it out for review now that some basic things like branch target & load/store address relocs are working. I can add more tests to this base commit, or add them in follow-up commits. It is not entirely clear whether I use the "ARM64" (Apple) or "AArch64" (non-Apple) naming convention. Guidance is appreciated. Differential Revision: https://reviews.llvm.org/D88629	2021-02-08 18:14:07 -07:00

20 Commits