clang-p2996

Author	SHA1	Message	Date
Fangrui Song	39c20a63b1	[ELF] Add --remap-inputs= and --remap-inputs-file= --remap-inputs-file= can be specified multiple times, each naming a remap file that contains `from-glob=to-file` lines or `#`-led comments. ('=' is used as a separator a la -fdebug-prefix-map=) --remap-inputs-file= can be used to: * replace an input file. E.g. `"/libz.so=exp/libz.so"` can replace a resolved `-lz` without updating the input file list or (if used) a response file. When debugging an application where a bug is isolated to one single input file, this option gives a convenient way to test fixes. * remove an input file with `/dev/null` (changed to `NUL` on Windows), e.g. `"a.o=/dev/null"`. A build system may add unneeded dependencies. This option gives a convenient way to test the result of removing some inputs. `--remap-inputs=a.o=aa.o` can be specified to provide one pattern without using an extra file. (bash/zsh process substitution is handy for specifying a pattern without using a remap file, e.g. `--remap-inputs-file=<(printf 'a.o=aa.o')`, but it may be unavailable in some systems. An extra file can be inconvenient for a build system.) Exact patterns are tested before wildcard patterns. In case of a tie, the first pattern wins. This is an implementation detail that users should not rely on. Co-authored-by: Marco Elver <elver@google.com> Link: https://discourse.llvm.org/t/rfc-support-exclude-inputs/70070 Reviewed By: melver, peter.smith Differential Revision: https://reviews.llvm.org/D148859	2023-04-26 13:18:55 -07:00
Fangrui Song	a8788de1c3	[ELF] Cap parallel::strategy to 16 threads when --threads= is unspecified When --threads= is unspecified, we set it to `parallel::strategy.compute_thread_count()`, which uses sched_getaffinity (Linux)/cpuset_getaffinity (FreeBSD)/std::thread::hardware_concurrency (others). With extensive testing on many machines (many configurations from {aarch64,x86-64} x {Linux,FreeBSD,Windows} x allocators(native,mimalloc,rpmalloc) combinations) with varying workloads, we discovered that when the concurrency is larger than 16, the linking process is slower than using --threads=16 because parallelism overhead outweighs optimizations. This is particularly harmful for machines with many cores or when the link job competes with other jobs. Cap parallel::strategy when --threads= is unspecified. For some workloads changing the concurrency from 8 to 16 has nearly no improvement. --thinlto-jobs= is unchanged since ThinLTO backend compiles are embarrassingly parallel. Link: https://discourse.llvm.org/t/avoidable-overhead-from-threading-by-default/69160 Reviewed By: peter.smith, andrewng Differential Revision: https://reviews.llvm.org/D147493	2023-04-20 12:17:26 -07:00
Fangrui Song	187a35527d	Revert D147493 "[ELF] Cap parallel::strategy to 16 threads when --threads= is unspecified" This reverts commit `da68d2164e`. This change is correct, but left a `config->threadCount` use that is error-prone and may harm performance when parallel::strategy.compute_thread_count() > 16.	2023-04-20 09:49:22 -07:00
Craig Topper	85444794cd	[lld][RISCV] Implement GP relaxation for R_RISCV_HI20/R_RISCV_LO12_I/R_RISCV_LO12_S. This implements support for relaxing these relocations to use the GP register to compute addresses of globals in the .sdata and .sbss sections. This feature is off by default and must be enabled by passing --relax-gp to the linker. The GP register might not always be the "global pointer". It can be used for other purposes. See discussion here https://github.com/riscv-non-isa/riscv-elf-psabi-doc/pull/371 Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D143673	2023-04-13 10:52:15 -07:00
Fangrui Song	da68d2164e	[ELF] Cap parallel::strategy to 16 threads when --threads= is unspecified When --threads= is unspecified, we set it to `parallel::strategy.compute_thread_count()`, which uses sched_getaffinity (Linux)/cpuset_getaffinity (FreeBSD)/std::thread::hardware_concurrency (others). With extensive testing on many machines (many configurations from {aarch64,x86-64} x {Linux,FreeBSD,Windows} x allocators(native,mimalloc,rpmalloc) combinations) with varying workloads, we discovered that when the concurrency is larger than 16, the linking process is slower than using --threads=16 because parallelism overhead outweighs optimizations. This is particularly harmful for machines with many cores or when the link job competes with other jobs. Cap parallel::strategy when --threads= is unspecified. For some workloads changing the concurrency from 8 to 16 has nearly no improvement. --thinlto-jobs= is unchanged since ThinLTO backend compiles are embarrassingly parallel. Link: https://discourse.llvm.org/t/avoidable-overhead-from-threading-by-default/69160 Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D147493	2023-04-12 13:13:38 -07:00
Ivan Tadeu Ferreira Antunes Filho	73fd9d310f	[lld] Support separate native object file path in --thinlto-prefix-replace Currently, the --thinlto-prefix-replace="oldpath;newpath" option is used during distributed ThinLTO thin links to specify the mapping of the input bitcode object files' directory tree (oldpath) to the directory tree (newpath) used for both: 1) the output files of the thin link itself (the .thinlto.bc index files and the optional .imports files) 2) the specified object file paths written to the response file given in the --thinlto-index-only=${response} option, which is used by the final native link and must match the paths of the native object files that will be produced by ThinLTO backend compiles. This patch expands the --thinlto-prefix-replace option to allow a separate directory tree mapping to be specified for the object file paths written to the response file (number 2 above). This is important to support builds and build systems where the same output directory may not be written by multiple build actions (e.g. the thin link and the ThinLTO backend compiles). The new format is: --thinlto-prefix-replace="origpath;outpath[;objpath]" This replaces the origpath directory tree of the thin link input files with outpath when writing the thin link index and imports outputs (number 1 above). If objpath is specified it replaces origpath of the input files with objpath when writing the response file (number 2 above), otherwise it falls back to the old behavior of using outpath for this as well. Reviewed By: tejohnson, MaskRay Differential Revision: https://reviews.llvm.org/D144596	2023-04-04 11:24:51 -07:00
Simi Pallipurath	2f68ddc604	[lld][ARM][2/3]Big Endian support - Word invariant support Changes: - Adding BE32 big endian Support for Arm. - Replace the writele and readle with their endian-aware versions. - Adding test cases for the big-endian be32 arm configuration. Patch by: Milosz Plichta. This patch merges all the changes from this patch https://reviews.llvm.org/D140203 as well. Reviewed By: peter.smith, MaskRay Differential Revision: https://reviews.llvm.org/D140202	2023-03-29 10:21:00 +01:00
Mitch Phillips	c574e93afd	[lld] [MTE] Add DT_AARCH64_MEMTAG_* dynamic entries, and small cleanup Adds the new AArch64-ABI dynamic entry generation to LLD. This will allow Android to move from the Android-specific ELF note onto the dynamic entries. Change the behaviour of an unspecified --android-memtag-mode. Now, when unspecified, this will print a warning that you're doing a no-op, rather than implicitly turning on sync mode. This is important for MTE globals later, where a binary containing static tagged global descriptors shouldn't have MTE turned on without specific intent being passed to the linker. For now, continue to emit the Android ELF note by default. In future, we can probably make it only emit the note when provided a flag. Do a quick NFC-cleanup of the ELF note while we're here. It doesn't change anything about the ELF note itself, but makes it more clear to the reader of the code what alignment requirements are being (previously implicitly) met. Reviewed By: fmayer, MaskRay Differential Revision: https://reviews.llvm.org/D143769	2023-03-01 11:14:05 -08:00
Scott Linder	45ee0a9afc	[LLD] Add --lto-CGO[0-3] option Allow controlling the CodeGenOpt::Level independent of the LTO optimization level in LLD via new options for the COFF, ELF, MachO, and wasm frontends to lld. Most are spelled as --lto-CGO[0-3], but COFF is spelled as -opt:lldltocgo=[0-3]. See D57422 for discussion surrounding the issue of how to set the CG opt level. The ultimate goal is to let each function control its CG opt level, but until then the current default means it is impossible to specify a CG opt level lower than 2 while using LTO. This option gives the user a means to control it for as long as it is not handled on a per-function basis. Reviewed By: MaskRay, #lld-macho, int3 Differential Revision: https://reviews.llvm.org/D141970	2023-02-15 17:34:35 +00:00
Mikael Holmen	46dc3d0b9b	[lld] Fix gcc compiler warnings related to variadic macro [NFC] gcc warned like ../../lld/ELF/InputSection.cpp:75:37: warning: ISO C++11 requires at least one argument for the "..." in a variadic macro 75 \| invokeELFT(parseCompressedHeader); \| ^	2023-02-02 10:26:56 +01:00
Nikita Popov	d7cf7ab61c	[LLD] Remove no-opaque-pointers plugin option We always use opaque pointers. The opaque-pointers option is retained as a no-op, same as no-lto-legacy-pass-manager.	2023-01-25 12:29:59 +01:00
serge-sans-paille	984b800a03	Move from llvm::makeArrayRef to ArrayRef deduction guides - last part This is a follow-up to https://reviews.llvm.org/D140896, split into several parts as it touches a lot of files. Differential Revision: https://reviews.llvm.org/D141298	2023-01-10 11:47:43 +01:00
Fangrui Song	6b9a80de49	[lld] Fix iwyu problems after `83d59e05b2` The commit transitively includes lld/include/lld/Common/ErrorHandler.h into lld/include/lld/Common/Driver.h, which is not intended.	2022-12-28 10:46:45 -08:00
Ties Stuij	62c605771a	[lld][ARM] support absolute thunks for Armv4T Thumb and interworking changes: - BLX: The Arm architecture versions that support the branch and link instruction (BLX), can rewrite BLs in place when a state change from Arm<->Thumb is required. Armv4T does not have BLX and so needs thunks for state changes. - v4T Thumb long branches needed their own thunk. We could have used the v6M implementation, but v6M doesn't have Arm state and must resolve to rather inefficient stack reshuffling. We also can't reuse v7 thumb thunks as they use MOVW/MOVT, which wasn't available yet for v4T. - Remove the `lack of BLX' warning. LLVM only supports Arm Architecture versions upwards of v4, which we now all support in LLD. - renamed existing thunks to better reflect their use: ARMV5ABSLongThunk -> ARMV5LongLdrPcThunk, ARMV5PILongThunk -> ARMV4PILongThunk - removed isCompatibleWith method from ARMV5ABSLongThunk and ARMV5PILongThunk, as they were identical to the ARMThunk parent class implementation. Support for (efficient) position independent thunks for v4T will be added in a follow-up patch, including possible related thunk renaming and code comment cleanup. Reviewed By: MaskRay, peter.smith Differential Revision: https://reviews.llvm.org/D139888	2022-12-21 11:04:32 +00:00
Fangrui Song	8a900f2438	[ELF] Merge SHT_RISCV_ATTRIBUTES sections Currently we take the first SHT_RISCV_ATTRIBUTES (.riscv.attributes) as the output. If we link an object without an extension with an object with the extension, the output Tag_RISCV_arch may not contain the extension and some tools like objdump -d will not decode the related instructions. This patch implements Tag_RISCV_stack_align/Tag_RISCV_arch/Tag_RISCV_unaligned_access merge as specified by https://github.com/riscv-non-isa/riscv-elf-psabi-doc/blob/master/riscv-elf.adoc#attributes For the deprecated Tag_RISCV_priv_spec{,_minor,_revision}, dump the attribute to the output iff all input agree on the value. This is different from GNU ld but our simple approach should be ok for deprecated tags. `RISCVAttributeParser::handler` currently warns about unknown tags. This behavior is retained. In GNU ld arm, tags >= 64 (mod 128) are ignored with a warning. If RISC-V ever wants to do something similar (https://github.com/riscv-non-isa/riscv-elf-psabi-doc/issues/352), consider documenting it in the psABI and changing RISCVAttributeParser. Like GNU ld, zero value integer attributes and empty string attributes are not dumped to the output. Reviewed By: asb, kito-cheng Differential Revision: https://reviews.llvm.org/D138550	2022-12-08 09:53:40 +00:00
Dan Albert	241dbd3105	[ELF] Enable --no-undefined-version by default Allowing incorrect version scripts is not a helpful default. Flip that to help users find their bugs at build time rather than at run time. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D135402	2022-12-08 01:41:18 +00:00
Fangrui Song	4191fda69c	[ELF] Change most llvm::Optional to std::optional https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-11-26 19:19:15 -08:00
Yaxun (Sam) Liu	056ebadf5c	[HIP] Fix lld failure when device object is empty When -fgpu-rdc is used for linking relocatable objects, clang driver launches clang-offload-bundler to extract a device relocatable object from each input relocatable object file and passes the extracted files to lld. The input relocatable object file could either come from HIP program or C++ program. The relocatable object file from C++ program does not contain device relocatable objects, therefore clang-offload-bundler extracts an empty file and passes it to lld. lld treats an empty file as a linker script. When there is no object input file to lld, lld will emit error: target emulation unknown: -m or at least one .o file required This patch adds "elf64_amdgpu" to lld so that lld always knows the target no matter whether there are object input files or not. Reviewed by: Artem Belevich, Fangrui Song Differential Revision: https://reviews.llvm.org/D138221	2022-11-22 10:38:42 -05:00
Nico Weber	4ba3c5d92f	[lld/ELF] Rename OPT_thinlto_jobs to OPT_thinlto_jobs_eq The flag ends with a `=`, so its name should end with `_eq`. No behavior change.	2022-11-21 10:54:12 -05:00
Fangrui Song	2bf5d86422	[ELF] Change rawData to content() and data() to contentMaybeDecompress() Clarify data() which may trigger decompression and make it feasible to refactor the member variable rawData.	2022-11-20 22:43:22 +00:00
Brett Werling	cf4f35b788	[ELF] Handle GCC collect2 -plugin-opt= on Windows Follows up on commit `cd5d5ce235` by additionally ignoring relative paths ending in "lto-wrapper.exe" as can be the case for GCC cross-compiled for Windows. Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D138065	2022-11-16 09:58:50 -08:00
Nico Weber	67d311a5fd	[COFF, Mach-O] Include -mllvm options in thinlto cache key Like D134013, but for COFF and Mach-O. Also expand the ELF test a bit. I at first didn't realize that `getValue()` for `-mllvm -foo=bar` would return `-foo=bar` instead of just `bar`, and so I wrote the test to check if we indeed get this wrong. We don't, but having the test for it seems nice, so I'm including it. Differential Revision: https://reviews.llvm.org/D137971	2022-11-14 15:18:09 -05:00
Jon Chesterfield	b1fdeeec1a	Revert "[LLD] Enable --no-undefined-version by default." This reverts commit `eedbe44b87`.	2022-11-09 16:17:33 +00:00
Dan Albert	eedbe44b87	[LLD] Enable --no-undefined-version by default. Allowing incorrect version scripts is not a helpful default. Flip that to help users find their bugs at build time rather than at run time. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D135402	2022-11-07 22:15:28 +00:00
Fangrui Song	26fcee601f	[ELF] Add --no-warnings/-w Mach-O ld64 supports -w to suppress warnings. GNU ld 2.40 will support the option as well (https://sourceware.org/bugzilla/show_bug.cgi?id=29654). This feature has some small value. E.g. when analyzing a large executable with relocation overflow issues, we may use --noinhibit-exec --emit-relocs to get an output file with static relocations despite relocation overflow issues. -w can significantly improve the link time as printing the massive warnings is slow. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D136569	2022-10-24 10:54:09 -07:00
Ben Dunbobbin	13816e0358	[LLD][ELF] --wrap: __real_foo references should trigger archive extraction for foo A reference to __real_foo should trigger archive extraction of the input file that defines foo, otherwise a link using --wrap=foo might fail to link with an undefined reference to foo. This matches bfd linker behaviour. Differential Revision: https://reviews.llvm.org/D135897	2022-10-18 12:53:06 +01:00
Fangrui Song	14f996dca8	[ELF] Move inputSections/ehInputSections into Ctx. NFC	2022-10-16 00:49:48 -07:00
Slava Zakharin	88da0de14f	Revert "[Libomp] Do not error on undefined version script symbols" This reverts commit `096f93e73d`. Revert "[Libomptarget] Make the plugins ingore undefined exported symbols" This reverts commit `3f62314c23`. Revert "[LLD] Enable --no-undefined-version by default." This reverts commit `7ec8b0d162`. Three commits are reverted because of the current omp build fail with GNU ld. See discussion here: https://reviews.llvm.org/rG096f93e73dc3	2022-10-13 14:12:07 -07:00
Dan Albert	7ec8b0d162	[LLD] Enable --no-undefined-version by default. Allowing incorrect version scripts is not a helpful default. Flip that to help users find their bugs at build time rather than at run time. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D135402	2022-10-12 12:29:38 -07:00
Fangrui Song	9f9bab19e3	[ELF] Replace some config->ekind with file->ekind. NFC	2022-10-02 21:27:41 -07:00
Fangrui Song	d9dbf9e30a	[ELF] Move init from ELFFileBase constructor to a separate function. NFC	2022-10-02 21:10:28 -07:00
Fangrui Song	c171250e38	[ELF] Simplify addFile. NFC	2022-10-02 19:49:17 -07:00
Fangrui Song	f596d82385	[ELF] Move driver into ctx and remove indirection. NFC This removes one global variable and removes GOT and unique_ptr indirection.	2022-10-01 15:12:50 -07:00
Fangrui Song	9c626d4a0d	[ELF] Remove symtab indirection. NFC Add LLVM_LIBRARY_VISIBILITY to remove unneeded GOT and unique_ptr indirection.	2022-10-01 14:46:49 -07:00
Fangrui Song	34fa860048	[ELF] Remove ctx indirection. NFC Add LLVM_LIBRARY_VISIBILITY to remove unneeded GOT and unique_ptr indirection. We can move other global variables into ctx without indirection concern. In the long term we may consider passing Ctx as a parameter to various functions and eliminate global state as much as possible and then remove `Ctx::reset`.	2022-10-01 12:06:33 -07:00
Fangrui Song	a623a4c8b4	[ELF] Remove elf::config indirection. NFC `config` has 1000+ uses so we try to avoid changing `config->foo`. Define a wrapper with LLVM_LIBRARY_VISIBILITY to remove unneeded GOT and unique_ptr indirection. My x86-64 lld executable is 11+KiB smaller.	2022-10-01 11:39:45 -07:00
Fangrui Song	e3ecc6a912	[ELF] Make symAux[0] a sentinel And default auxIdx to 0.	2022-09-29 00:50:19 -07:00
Fangrui Song	9e6840ccba	[ELF] Remove resolve => resolve{Defined,Common,Shared,Lazy,Undefined} indirection. NFC	2022-09-28 20:01:41 -07:00
Fangrui Song	7a58dd1046	[ELF] Refactor Symbol initialization and overwriting Symbol::replace intends to overwrite a few fields (mostly Elf{32,64}_Sym fields), but the implementation copies all fields then restores some old fields. This is error-prone and wasteful. Add Symbol::overwrite to copy just the needed fields and add other overwrite member functions to copy the extra fields.	2022-09-28 13:11:31 -07:00
Mircea Trofin	c625c17b88	[lld][thinlto] Include -mllvm options in the thinlto cache key They may modify thinlto optimization. This patch only extends support for `-mllvm`. There is another way to pass llvm flags, `-plugin-opt`, but its processing is different and will be provided in a subsequent patch. Differential Revision: https://reviews.llvm.org/D134013	2022-09-19 12:04:17 -07:00
Fangrui Song	12607f57da	[ELF] Cache compute_thread_count. NFC	2022-09-12 19:09:08 -07:00
Fangrui Song	449f2ca146	[ELF] Add --compress-debug-sections=zstd `clang -gz=zstd a.o` passes this option to the linker. This option compresses output debug sections with zstd and sets ch_type to ELFCOMPRESS_ZSTD. As of today, very few DWARF consumers recognize ELFCOMPRESS_ZSTD. Use the llvm::zstd::compress API with level llvm::zstd::DefaultCompression (5), which we may tune after we have more experience with zstd output. zstd has built-in parallel compression support (so we don't need to do D117853 for zlib), which is not leveraged yet. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D133548	2022-09-09 10:30:18 -07:00
Sam Clegg	2cd4cd9a32	[lld][ELF] Rename SymbolTable::symbols() to SymbolTable::getSymbols(). NFC This change renames this method to match its original name and the name used in the wasm linker. Back in `d8f8abbd4a` the ELF SymbolTable method `getSymbols()` was replaced with `forEachSymbol`. Then in `a2fc964417` `forEachSymbol` was replaced with a `llvm::iterator_range`. Then in `e9262edf0d` we came full circle and the `llvm::iterator_range` was replaced with a `symbols()` accessor that was identical to the original `getSymbols()`. `getSymbols` also matches the name used elsewhere in the ELF linker as well as in both COFF and wasm backend (e.g. `InputFiles.h` and `SyntheticSections.h`) Differential Revision: https://reviews.llvm.org/D130787	2022-08-19 14:56:08 -07:00
Alex Brachet	dbd04b853b	[ELF] Support --package-metadata This was recently introduced in GNU linkers and it makes sense for ld.lld to have the same support. This implementation omits checking if the input string is valid json to reduce size bloat. Differential Revision: https://reviews.llvm.org/D131439	2022-08-08 21:31:58 +00:00
Fangrui Song	e89d6d2ac5	[ELF] Keep only getTarget() call. NFC The place from D61712 seems unneeded now. We can just use the place added by D62609 (support AArch64 BTI/PAC).	2022-08-05 18:20:23 -07:00
Fangrui Song	2515cb80cd	[ELF] Parallelize input section initialization This implements the last step of https://discourse.llvm.org/t/parallel-input-file-parsing/60164 for the ELF port. For an ELF object file, we previously did: parse, (parallel) initializeLocalSymbols, (parallel) postParseObjectFile. Now we do: parse, (parallel) initSectionsAndLocalSyms, (parallel) postParseObjectFile. initSectionsAndLocalSyms does most of input section initialization. The sequential `parse` does SHT_ARM_ATTRIBUTES/SHT_RISCV_ATTRIBUTES/SHT_GROUP initialization for now. Performance linking some programs with --threads=8 (glibc 2.33 malloc and mimalloc): * clang: 1.05x as fast with glibc malloc, 1.03x as fast with mimalloc * chrome: 1.04x as fast with glibc malloc, 1.03x as fast with mimalloc * internal search program: 1.08x as fast with glibc malloc, 1.05x as fast with mimalloc Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D130810	2022-08-04 11:47:52 -07:00
Gabriel Ravier	5dbd8faad5	[lld] Fixed a number of typos I went over the output of the following mess of a command: `(ulimit -m 2000000; ulimit -v 2000000; git ls-files -z \| parallel --xargs -0 cat \| aspell list --mode=none --ignore-case \| grep -E '^[A-Za-z][a-z]*$' \| sort \| uniq -c \| sort -n \| grep -vE '.{25}' \| aspell pipe -W3 \| grep : \| cut -d' ' -f2 \| less)` and proceeded to spend a few days looking at it to find probable typos and fixed a few hundred of them in all of the llvm project (note, the ones I found are not anywhere near all of them, but it seems like a good start). Differential Revision: https://reviews.llvm.org/D130982	2022-08-02 09:52:31 -04:00
Fangrui Song	b3452f8f13	[ELF] redirectSymbols: skip versioned symbol combine if config->versionDefinitions.size() == 2	2022-07-31 17:47:09 -07:00
Fangrui Song	c09d323599	[ELF] Move EhInputSection out of inputSections. NFC inputSections temporarily contains EhInputSection objects mainly for combineEhSections. Place EhInputSection objects into a new vector ehInputSections instead of inputSections.	2022-07-31 11:58:08 -07:00
Fangrui Song	4b2b68d5ab	[lld] Change vector to SmallVector. NFC My lld executable is 1.6KiB smaller and some functions are now more efficient.	2022-07-30 18:11:21 -07:00
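
The lookup order described for `--remap-inputs-file=` in commit 39c20a63b1 (exact patterns tested before wildcard patterns, first pattern wins on a tie) can be illustrated with a rough sketch. This is not lld's implementation: the helper names `parse_remap_lines` and `remap` are hypothetical, Python's `fnmatch` stands in for lld's glob matcher (whose semantics may differ), and the `*/libz.so` pattern is an assumed example.

```python
import fnmatch


def parse_remap_lines(lines):
    """Split `from-glob=to-file` lines into exact and wildcard patterns.

    Hypothetical helper: blank lines and `#`-led comments are skipped.
    """
    exact = {}      # pattern -> target, for patterns without glob characters
    wildcard = []   # (pattern, target) pairs, kept in input order
    for raw in lines:
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        pattern, sep, target = line.partition("=")
        if not sep:
            raise ValueError(f"missing '=' in remap line: {line!r}")
        if any(ch in pattern for ch in "*?["):
            wildcard.append((pattern, target))
        else:
            # On a duplicate exact pattern the first one is kept.
            exact.setdefault(pattern, target)
    return exact, wildcard


def remap(path, exact, wildcard):
    """Return the remapped path, or `path` unchanged if nothing matches."""
    if path in exact:                       # exact patterns are tried first
        return exact[path]
    for pattern, target in wildcard:        # then wildcards, first match wins
        if fnmatch.fnmatchcase(path, pattern):
            return target
    return path


exact, wildcard = parse_remap_lines([
    "# replace one object and any resolved libz.so",
    "a.o=aa.o",
    "*/libz.so=exp/libz.so",
    "*.o=other.o",
])
```

With these assumed inputs, `remap("a.o", exact, wildcard)` picks the exact entry even though `*.o` also matches, which mirrors the ordering the commit message calls an implementation detail that users should not rely on.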


1055 Commits