clang-p2996

Author	SHA1	Message	Date
Fangrui Song	42e4967140	[ELF] Don't create copy relocation/canonical PLT entry for a defined symbol (#75095 ) Copy relocations and canonical PLT entries are for symbols defined in a DSO. Currently we create them even for a `Defined`, possibly leading to an output that won't work at run-time (e.g. R_X86_64_JUMP_SLOT referencing a null symbol). ``` % cat a.s .globl _start, main .type main, @function _start: main: ret .rodata .quad main % clang -fuse-ld=lld -pie -nostdlib a.s % readelf -Wr a.out Relocation section '.rela.plt' at offset 0x290 contains 1 entry: Offset Info Type Symbol's Value Symbol's Name + Addend 00000000000033b8 0000000000000007 R_X86_64_JUMP_SLOT 12b0 ``` Report an error instead for the default `-z text` mode. GNU ld reports an error in `-z text` mode as well.	2023-12-12 10:14:36 -08:00
Fangrui Song	255ea48608	[ELF] Merge verdefIndex into versionId. NFC (#72208 ) The two fields are similar. `versionId` is the Verdef index in the output file. It is set for `--exclude-id=`, version script patterns, and `sym@ver` symbols. `verdefIndex` is the Verdef index of a Sharedfile (SharedSymbol or a copy-relocated Defined), the default value -1 is also used to indicate that the symbol has not been matched by a version script pattern (https://reviews.llvm.org/D65716). It seems confusing to have two fields. Merge them so that we can allocate one bit for #70130 (suppress --no-allow-shlib-undefined error in the presence of a DSO definition).	2023-11-16 01:03:52 -08:00
Fangrui Song	e84575449f	Revert "[ELF] Merge verdefIndex into versionId. NFC" #72208 (#72484 ) Reverts llvm/llvm-project#72208 If an unversioned Defined preempts a versioned DSO definition, the version ID will not be reset.	2023-11-15 23:14:07 -08:00
Fangrui Song	667ea2ca40	[ELF] Merge verdefIndex into versionId. NFC (#72208 ) The two fields are similar. `versionId` is the Verdef index in the output file. It is set for version script patterns and `sym@ver` symbols. `verdefIndex` is the Verdef index of a SharedSymbol. The default value -1 is also used to indicate that the symbol has not been matched by a version script pattern (https://reviews.llvm.org/D65716). It seems confusing to have two fields. Merge them so that we can allocate one bit for #70130 (suppress --no-allow-shlib-undefined error in the presence of a DSO definition).	2023-11-14 10:20:21 -08:00
Fangrui Song	b169e7fedd	[ELF] Improve undefined symbol message w/ DW_TAG_variable of the enclosing symbol but w/o line number information (#70854 ) The undefined symbol message suggests the source line when line number information is available (see https://reviews.llvm.org/D31481). When the undefined symbol is from a global variable, we won't get the line information. ``` extern int undef; namespace ns { int *var[] = { &undef }; // DW_TAG_variable(DW_AT_decl_file/DW_AT_decl_line) is available while // line number information is unavailable. } ld.lld: error: undefined symbol: undef >>> referenced by undef-debug2.cc >>> undef-debug2.o:(ns::var) ``` This patch utilizes `getEnclosingSymbol` to locate `var` and find DW_TAG_variable for `var`: ``` ld.lld: error: undefined symbol: undef >>> referenced by undef-debug2.cc:3 (/tmp/c/undef-debug2.cc:3) >>> undef-debug2.o:(ns::var) ```	2023-11-03 13:53:36 -07:00
Fangrui Song	1981b1b6b9	[ELF] Demote symbols in /DISCARD/ discarded sections to Undefined (#69295 ) When an input section is matched by /DISCARD/ in a linker script, GNU ld reports errors for relocations referencing symbols defined in the section: `.aaa' referenced in section `.bbb' of a.o: defined in discarded section `.aaa' of a.o Implement the error by demoting eligible symbols to `Undefined` and changing STB_WEAK to STB_GLOBAL. As a side benefit, in relocatable links, relocations referencing symbols defined relative to /DISCARD/ discarded sections no longer set symbol/type to zeros. It's arguable whether a weak reference to a discarded symbol should lead to errors. GNU ld reports an error and our demoting approach reports an error as well. Close #58891 Co-authored-by: Bevin Hansson <bevin.hansson@ericsson.com>	2023-10-17 14:10:52 -07:00
Arthur Eubanks	9d6ec280fc	[lld/ELF] Don't relax R_X86_64_(REX_)GOTPCRELX when offset is too far For each R_X86_64_(REX_)GOTPCRELX relocation, check that the offset to the symbol is representable with 2^32 signed offset. If not, add a GOT entry for it and set its expr to R_GOT_PC so that we emit the GOT load instead of the relaxed lea. Do this in finalizeAddressDependentContent() where we iteratively attempt this (e.g. RISCV uses this for relaxation, ARM uses this to insert thunks). Decided not to do the opposite of inserting GOT entries initially and removing them when relaxable because removing GOT entries isn't simple. One drawback of this approach is that if we see any GOTPCRELX relocation, we'll create an empty .got even if it's not required in the end. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D157020	2023-10-04 13:03:56 -07:00
Mitch Phillips	ca35a19aca	[lld] Synthesize metadata for MTE globals As per the ABI at https://github.com/ARM-software/abi-aa/blob/main/memtagabielf64/memtagabielf64.rst, this patch interprets the SHT_AARCH64_MEMTAG_GLOBALS_STATIC section, which contains R_NONE relocations to tagged globals, and emits a SHT_AARCH64_MEMTAG_GLOBALS_DYNAMIC section, with the correct DT_AARCH64_MEMTAG_GLOBALS and DT_AARCH64_MEMTAG_GLOBALSSZ dynamic entries. This section describes, in a uleb-encoded stream, global memory ranges that should be tagged with MTE. We are also out of bits to spare in the LLD Symbol class. As a result, I've reused the 'needsTocRestore' bit, which is a PPC64 only feature. Now, it's also used for 'isTagged' on AArch64. An entry in SHT_AARCH64_MEMTAG_GLOBALS_STATIC is practically a guarantee from an objfile that all references to the linked symbol are through the GOT, and meet correct alignment requirements. As a result, we go through all symbols and make sure that, for all symbols $SYM, all object files that reference $SYM also have a SHT_AARCH64_MEMTAG_GLOBALS_STATIC entry for $SYM. If this isn't the case, we demote the symbol to being untagged. Symbols that are imported from other DSOs should always be fine, as they're GOT-referenced (and thus the GOT entry either has the correct tag or not, depending on whether it's tagged in the defining DSO or not). Additionally hand-tested by building {libc, libm, libc++, libm, and libnetd} on Android with some experimental MTE globals support in the linker/libc. Reviewed By: MaskRay, peter.smith Differential Revision: https://reviews.llvm.org/D152921	2023-07-31 17:07:42 +02:00
WANG Xuerui	6084ee7420	[lld][ELF] Support LoongArch This adds support for the LoongArch ELF psABI v2.00 [1] relocation model to LLD. The deprecated stack-machine-based psABI v1 relocs are not supported. The code is tested by successfully bootstrapping a Gentoo/LoongArch stage3, complete with common GNU userland tools and both the LLVM and GNU toolchains (GNU toolchain is present only for building glibc, LLVM+Clang+LLD are used for the rest). Large programs like QEMU are tested to work as well. [1]: https://loongson.github.io/LoongArch-Documentation/LoongArch-ELF-ABI-EN.html Reviewed By: MaskRay, SixWeining Differential Revision: https://reviews.llvm.org/D138135	2023-07-25 17:06:07 +08:00
Fangrui Song	8d85c96e0e	[lld] StringRef::{starts,ends}with => {starts,ends}_with. NFC The latter form is now preferred to be similar to C++20 starts_with. This replacement also removes one function call when startswith is not inlined.	2023-06-05 14:36:19 -07:00
Alexey Lapshin	85c2768ce9	[Support][Parallel] Initialize threadIndex and add assertion checking its usage. This patch adds a check that threadIndex is used only by threads created by ThreadPoolExecutor. This helps catch two types of errors: 1. If a thread is not created by ThreadPoolExecutor, its index may clash with the index of another thread. Using threadIndex in that case may lead to a data race. 2. The index of the main thread (threadIndex == 0) currently clashes with the index of thread0 in ThreadPoolExecutor threads. That may lead to a data race if the main thread and thread0 are executed concurrently. This patch allows executing tasks on the main thread only if parallel::strategy.ThreadsRequested == 1. In all other cases, assertions check that threadIndex != UINT_MAX (i.e. that the task is executed on a thread created by ThreadPoolExecutor). Differential Revision: https://reviews.llvm.org/D148916	2023-05-02 18:44:15 +02:00
Alexey Lapshin	fea8c07356	[Support][Parallel] Add sequential mode to TaskGroup::spawn(). This patch allows specifying that some tasks should be done in sequential order. It makes it possible to avoid a conditional for separating sequential tasks: inside a `for` loop over a `TaskGroup tg`, `if (condition) fn(); else tg.spawn([](){fn();});` becomes `tg.spawn([](){fn();}, condition)`. It also prevents execution on the main thread, which allows adding checks for the getThreadIndex() function discussed in D142318. The patch also replaces std::stack with std::deque in the ThreadPoolExecutor to have natural execution order in case (parallel::strategy.ThreadsRequested == 1). Differential Revision: https://reviews.llvm.org/D148728	2023-04-26 13:52:26 +02:00
Fangrui Song	b30b1f173c	[ELF] Add single quotes around out of range errors to match the convention we use for other diagnostics.	2023-03-03 12:48:16 -08:00
Fangrui Song	ffa1118330	[ELF] Mention section name for STT_SECTION in reportRangeError() D73518 mentioned non-STT_SECTION symbol names. This patch extends the code to handle STT_SECTION symbols, where we report the section name. This change helps at least the following cases with very little code. * Whether an out-of-range relocation is due to code or data. * For a relocation in .debug_info, which referenced `.debug_*` section (due to DWARF32 limitation) causes the problem. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D145199	2023-03-03 12:35:05 -08:00
Fangrui Song	08c915fa76	[ELF] -z notext: avoid dynamic relocations in .eh_frame Fix https://github.com/llvm/llvm-project/issues/60392 ``` // a.cc void raise() { throw 42; } bool foo() { try { raise(); } catch (int) { return true; } return false; } int main() { foo(); } ``` ``` clang++ --target=x86_64-linux-gnu -fno-pic -mcmodel=large -no-pie -fuse-ld=lld -z notext a.cc -o a && ./a clang++ --target=aarch64-linux-gnu -fno-pic -no-pie -fuse-ld=lld -Wl,--dynamic-linker=/usr/aarch64-linux-gnu/lib/ld-linux-aarch64.so.1 -Wl,-rpath=/usr/aarch64-linux-gnu/lib -z notext a.cc -o a && ./a ``` Both commands fail because we produce a dynamic relocation for R_X86_64_64/R_AARCH64_ABS64 in .eh_frame which will be adjusted to a wrong offset by `SectionBase::getOffset` after D122459. Since GNU ld uses a canonical PLT entry instead of a dynamic relocation for .eh_frame, we follow suit as well to avoid the issue. Mips has an ABI issue (https://github.com/llvm/llvm-project/issues/5837) and we don't implement GNU ld's DW_EH_PE_absptr conversion. mips64-eh-abs-reloc.s wants a dynamic relocation, so keep the original behavior for EM_MIPS. Differential Revision: https://reviews.llvm.org/D143136	2023-02-03 10:27:33 -08:00
Shivam Gupta	e8e0e5f3ee	[NFC] Small indentation fix in lld/ELF/Relocations.cpp	2023-01-22 18:40:58 +05:30
Guillaume Chatelet	08e2a76381	[lld][NFC] rename ELF alignment into addralign	2022-12-01 16:20:12 +00:00
Fangrui Song	4191fda69c	[ELF] Change most llvm::Optional to std::optional https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-26 19:19:15 -08:00
Fangrui Song	6bca3ad379	[ELF] Simplify postScanRelocations with in.got	2022-11-21 06:04:18 +00:00
Fangrui Song	c3c9e45312	[ELF] Add InputSectionBase::{addRelocs,relocs} and GotSection::addConstant to add/access relocations to prepare for changing `relocations` from a SmallVector to a pointer. Also change the `isec` parameter in `addAddendOnlyRelocIfNonPreemptible` to `GotSection &`.	2022-11-21 04:12:03 +00:00
Fangrui Song	2bf5d86422	[ELF] Change rawData to content() and data() to contentMaybeDecompress() Clarify data() which may trigger decompression and make it feasible to refactor the member variable rawData.	2022-11-20 22:43:22 +00:00
Fangrui Song	4eda362539	[ELF] Inline computeAddend. NFC	2022-10-17 13:09:39 -07:00
Fangrui Song	d8af31eced	[ELF] Move ELFT-agnostic relocation code to processAux	2022-10-17 11:57:17 -07:00
Fangrui Song	874fc6bd78	[ELF] Move ELFT-agnostic relocation code to processAux. NFC	2022-10-17 11:44:28 -07:00
Fangrui Song	dc884f0f43	[ELF] Remove RelocationScanner::target. NFC	2022-10-16 12:39:37 -07:00
Fangrui Song	e983109876	[ELF] Move R_TPREL/R_TPREL_NEG check into handleTlsRelocation	2022-10-16 12:19:58 -07:00
Fangrui Song	2b153088be	[ELF] Set DF_STATIC_TLS for AArch64/PPC32/PPC64	2022-10-16 12:08:08 -07:00
Fangrui Song	9c626d4a0d	[ELF] Remove symtab indirection. NFC Add LLVM_LIBRARY_VISIBILITY to remove unneeded GOT and unique_ptr indirection.	2022-10-01 14:46:49 -07:00
Fangrui Song	34fa860048	[ELF] Remove ctx indirection. NFC Add LLVM_LIBRARY_VISIBILITY to remove unneeded GOT and unique_ptr indirection. We can move other global variables into ctx without indirection concern. In the long term we may consider passing Ctx as a parameter to various functions and eliminate global state as much as possible and then remove `Ctx::reset`.	2022-10-01 12:06:33 -07:00
Fangrui Song	a623a4c8b4	[ELF] Remove elf::config indirection. NFC `config` has 1000+ uses so we try to avoid changing `config->foo`. Define a wrapper with LLVM_LIBRARY_VISIBILITY to remove unneeded GOT and unique_ptr indirection. My x86-64 lld executable is 11+KiB smaller.	2022-10-01 11:39:45 -07:00
Fangrui Song	ab11ed5249	[ELF] Reset verdefIndex for Defined preempting SharedSymbol to avoid spurious "attempt to reassign symbol '...'" warning after `7a58dd1046`	2022-09-29 21:26:53 -07:00
Fangrui Song	e3ecc6a912	[ELF] Make symAux[0] a sentinel And default auxIdx to 0.	2022-09-29 00:50:19 -07:00
Fangrui Song	7a58dd1046	[ELF] Refactor Symbol initialization and overwriting Symbol::replace intends to overwrite a few fields (mostly Elf{32,64}_Sym fields), but the implementation copies all fields then restores some old fields. This is error-prone and wasteful. Add Symbol::overwrite to copy just the needed fields and add other overwrite member functions to copy the extra fields.	2022-09-28 13:11:31 -07:00
Fangrui Song	62e7c5b4e2	Revert "[ELF] --pack-dyn-relocs=android: scan relocation serially after D133003" This reverts commit `bce6416775`. The workaround is unneeded after `7dac9f4e48`.	2022-09-28 07:06:49 +00:00
Fangrui Song	bce6416775	[ELF] --pack-dyn-relocs=android: scan relocation serially after D133003 https://reviews.llvm.org/D133003#3806508 can reproduce a non-determinism with --threads=4. Making the config serial fixes non-determinism (by running the link many times and compare output).	2022-09-21 11:43:13 -07:00
Martin Storsjö	e280940bfb	[Support] Access threadIndex via a wrapper function On Unix platforms, this wrapper function is inline, so it should expand to the same direct access to the thread local variable. On Windows, it's a non-inline function within Parallel.cpp, allowing making the thread_local variable static. Windows Native TLS doesn't support direct access to thread local variables in a different DLL, and GCC/binutils on Windows occasionally has problems with non-static thread local variables too. This fixes mingw dylib builds with native TLS after `e6aebff674`. At the same time, move the whole thread local variable within #if LLVM_ENABLE_THREADS to fix builds without threading support. Differential Revision: https://reviews.llvm.org/D133759	2022-09-14 09:19:27 +03:00
Fangrui Song	e6aebff674	[ELF] Parallelize relocation scanning * Change `Symbol::flags` to a `std::atomic<uint16_t>` * Add `llvm::parallel::threadIndex` as a thread-local non-negative integer * Add `relocsVec` to part.relaDyn and part.relrDyn so that relative relocations can be added without a mutex * Arbitrarily change -z nocombreloc to move relative relocations to the end. Disable parallelism for deterministic output. MIPS and PPC64 use global states for relocation scanning. Keep serial scanning. Speed-up with mimalloc and --threads=8 on an Intel Skylake machine: * clang (Release): 1.27x as fast * clang (Debug): 1.06x as fast * chrome (default): 1.05x as fast * scylladb (default): 1.04x as fast Speed-up with glibc malloc and --threads=16 on a ThunderX2 (AArch64): * clang (Release): 1.31x as fast * scylladb (default): 1.06x as fast Reviewed By: andrewng Differential Revision: https://reviews.llvm.org/D133003	2022-09-12 12:56:35 -07:00
Fangrui Song	bd16ffb389	[ELF] Merge Symbol::needs* into uint16_t flags. NFC Split off from D133003 ([ELF] Parallelize relocation scanning) to make its diff smaller.	2022-09-09 14:37:18 -07:00
Fangrui Song	50b7eb91f0	[ELF] Reuse one RelocationScanner to scan all sections. NFC	2022-09-04 23:12:27 -07:00
Fangrui Song	94ca041905	[ELF] Move scanRelocations into Relocations.cpp. NFC	2022-09-04 21:31:18 -07:00
Fangrui Song	c8d9d0000b	[ELF] Relocations: set hasDirectReloc only if not ifunc. NFC	2022-09-04 21:30:19 -07:00
Fangrui Song	82ed93ea05	[ELF] Use stOther to track visibility This simplifies SymbolTableSection<ELFT>::writeTo. Add dsoProtected to be used in canDefineSymbolInExecutable and get the side benefit that the protected DSO preemption diagnostic is clearer.	2022-09-04 17:27:35 -07:00
Sam Clegg	2cd4cd9a32	[lld][ELF] Rename SymbolTable::symbols() to SymbolTable::getSymbols(). NFC This change renames this method to match its original name and the name used in the wasm linker. Back in `d8f8abbd4a` the ELF SymbolTable method `getSymbols()` was replaced with `forEachSymbol`. Then in `a2fc964417` `forEachSymbol` was replaced with a `llvm::iterator_range`. Then in `e9262edf0d` we came full circle and the `llvm::iterator_range` was replaced with a `symbols()` accessor that was identical to the original `getSymbols()`. `getSymbols` also matches the name used elsewhere in the ELF linker as well as in both COFF and wasm backend (e.g. `InputFiles.h` and `SyntheticSections.h`) Differential Revision: https://reviews.llvm.org/D130787	2022-08-19 14:56:08 -07:00
Fangrui Song	e3fcf2e06f	[ELF] Simplify llvm::enumerate with structured binding. NFC	2022-08-09 21:52:08 -07:00
Fangrui Song	abd9807590	[ELF] mergeCmp: work around irreflexivity bug Some tests (e.g. aarch64-feature-pac.s) segfault in libstdc++ _GLIBCXX_DEBUG builds (enabled by LLVM_ENABLE_EXPENSIVE_CHECKS). dyn_cast<ThunkSection> is incorrectly true for any SyntheticSection. std::merge transitively calls mergeCmp(x, x) (due to __glibcxx_requires_irreflexive_pred) and will segfault in `ta->getTargetInputSection()`. The dyn_cast<ThunkSection> issue should be eventually fixed properly, but `a != b` is robust enough for now.	2022-08-05 17:08:37 -07:00
Fangrui Song	3e9adff456	[ELF] Split EhInputSection::pieces into cies and fdes This simplifies code, removes a read32 (for id==0 check), and makes it feasible to combine some operations in EhInputSection::split and EhFrameSection::addRecords. Mostly NFC, but fixes "Relocation not in any piece" assertion failure in an erroneous case when a relocation offset precedes all CIE/FDE pieces.	2022-07-31 16:16:10 -07:00
Fangrui Song	6611d58f5b	[ELF] Relax R_RISCV_ALIGN Alternative to D125036. Implement R_RISCV_ALIGN relaxation so that we can handle -mrelax object files (i.e. -mno-relax is no longer needed) and create a framework for future relaxation. `relaxAux` is placed in a union with InputSectionBase::jumpInstrMod, storing auxiliary information for relaxation. In the first pass, `relaxAux` is allocated. The main data structure is `relocDeltas`: when referencing `relocations[i]`, the actual offset is `r_offset - (i ? relocDeltas[i-1] : 0)`. `relaxOnce` performs one relaxation pass. It computes `relocDeltas` for all text sections. Then, adjust st_value/st_size for symbols relative to this section based on `SymbolAnchor`. `bytesDropped` is set so that `assignAddresses` knows that the size has changed. Run `relaxOnce` in the `finalizeAddressDependentContent` loop to wait for convergence of text sections and other address dependent sections (e.g. SHT_RELR). Note: extracting `relaxOnce` into a separate loop works for many cases but has issues in some linker script edge cases. After convergence, compute section contents: shrink the NOP sequence of each R_RISCV_ALIGN as appropriate. Instead of deleting bytes, we run a sequence of memcpy on the content delimited by relocation locations. For R_RISCV_ALIGN let the next memcpy skip the desired number of bytes. Section content computation is parallelizable, but let's ensure the implementation is mature before optimizations. Technically we can save a copy if we interleave some code with `OutputSection::writeTo`, but let's not pollute the generic code (we don't have templated relocation resolving, so using conditions can impose overhead to non-RISCV.) Tested: `make ARCH=riscv CROSS_COMPILE=riscv64-linux-gnu- LLVM=1 defconfig all` built Linux kernel using -mrelax is bootable. FreeBSD RISCV64 system using -mrelax is bootable. bash/curl/firefox/libevent/vim/tmux using -mrelax works. Differential Revision: https://reviews.llvm.org/D127581	2022-07-07 10:16:09 -07:00
Fangrui Song	9a572164d5	[ELF] Move InputFiles global variables (memoryBuffers, objectFiles, etc) into Ctx. NFC	2022-06-29 18:53:38 -07:00
Daniel Bertalan	0eec7e2a89	Reland "[lld-macho] Group undefined symbol diagnostics by symbol". This reverts commit `36e7c9a450`. This relands `d61341768c` with the fix described in https://reviews.llvm.org/D127753#3587390	2022-06-15 19:22:39 -04:00
Xiaodong Liu	36d4f42c36	[lld] Fix typo for processAux; NFC Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D125163	2022-05-09 10:21:47 +08:00

616 Commits