clang-p2996

Author	SHA1	Message	Date
Fangrui Song	8a900f2438	[ELF] Merge SHT_RISCV_ATTRIBUTES sections Currently we take the first SHT_RISCV_ATTRIBUTES (.riscv.attributes) as the output. If we link an object without an extension with an object with the extension, the output Tag_RISCV_arch may not contain the extension and some tools like objdump -d will not decode the related instructions. This patch implements Tag_RISCV_stack_align/Tag_RISCV_arch/Tag_RISCV_unaligned_access merge as specified by https://github.com/riscv-non-isa/riscv-elf-psabi-doc/blob/master/riscv-elf.adoc#attributes For the deprecated Tag_RISCV_priv_spec{,_minor,_revision}, dump the attribute to the output iff all input agree on the value. This is different from GNU ld but our simple approach should be ok for deprecated tags. `RISCVAttributeParser::handler` currently warns about unknown tags. This behavior is retained. In GNU ld arm, tags >= 64 (mod 128) are ignored with a warning. If RISC-V ever wants to do something similar (https://github.com/riscv-non-isa/riscv-elf-psabi-doc/issues/352), consider documenting it in the psABI and changing RISCVAttributeParser. Like GNU ld, zero value integer attributes and empty string attributes are not dumped to the output. Reviewed By: asb, kito-cheng Differential Revision: https://reviews.llvm.org/D138550	2022-12-08 09:53:40 +00:00
Dan Albert	241dbd3105	[ELF] Enable --no-undefined-version by default Allowing incorrect version scripts is not a helpful default. Flip that to help users find their bugs at build time rather than at run time. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D135402	2022-12-08 01:41:18 +00:00
Fangrui Song	b067aa56ce	[ELF] Produce DT_RISCV_VARIANT_CC https://github.com/riscv/riscv-elf-psabi-doc/pull/190 introduced STO_RISCV_VARIANT_CC. The linker should: * Copy the STO_RISCV_VARIANT_CC bit to .symtab/.dynsym: already fulfilled after `82ed93ea05` * Produce DT_RISCV_VARIANT_CC if at least one R_RISCV_JUMP_SLOT relocation references a symbol with the STO_RISCV_VARIANT_CC bit. Done by this patch. Reviewed By: kito-cheng Differential Revision: https://reviews.llvm.org/D107951	2022-12-04 21:08:37 -08:00
Alexandre Ganea	242a9cf7e6	[LLD][COFF] Survive empty and invalid PCH signature Solve two issues that showed up when using LLD with Unreal Engine & FASTBuild: 1. It seems the S_OBJNAME record doesn't always record the "precomp signature". We were relying on that to match the PCH.OBJ with their dependent-OBJ. 2. MSVC link.exe is able to link a PCH.OBJ when the "precomp signatureÈ doesn't match, but LLD was failing. This was occuring since the Unreal Engine Build Tool was compiling the PCH.OBJ, but the dependent-OBJ were compiled & cached through FASTBuild. Upon a clean rebuild, the PCH.OBJs were recompiled by the Unreal Build Tool, thus the "precomp signatures" were changing; however the OBJs were already cached by FASTBuild, thus having an old "precomp signatures". We now ignore "precomp signatures" and properly fallback to cmd-line name lookup, like MSVC link.exe does, and only fail if the PCH.OBJ type stream doesn't match the count expected by the dependent-OBJ. Differential Revision: https://reviews.llvm.org/D136762	2022-11-20 10:40:50 -05:00
Alexandre Ganea	49e483d3d6	[CodeView] Replace GHASH hasher by BLAKE3 Previously, we used SHA-1 for hashing the CodeView type records. SHA-1 in `GloballyHashedType::hashType()` is coming top in the profiles. By simply replacing with BLAKE3, the link time is reduced in our case from 15 sec to 13 sec. I am only using MSVC .OBJs in this case. As a reference, the resulting .PDB is approx 2.1GiB and .EXE is approx 250MiB. Differential Revision: https://reviews.llvm.org/D137101	2022-11-19 15:17:42 -05:00
Sylvain Audi	df2f9e57f8	[PDB] Don't include input files in the 'cmd' entry of S_ENVBLOCK MSVC records the command line arguments in S_ENVBLOCK, skipping the input file arguments. This patch adds this filtering on lld-link side. Differential Revision: https://reviews.llvm.org/D137723	2022-11-14 11:31:01 -05:00
Jon Chesterfield	b1fdeeec1a	Revert "[LLD] Enable --no-undefined-version by default." This reverts commit `eedbe44b87`.	2022-11-09 16:17:33 +00:00
Dan Albert	eedbe44b87	[LLD] Enable --no-undefined-version by default. Allowing incorrect version scripts is not a helpful default. Flip that to help users find their bugs at build time rather than at run time. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D135402	2022-11-07 22:15:28 +00:00
Fangrui Song	26fcee601f	[ELF] Add --no-warnings/-w Mach-O ld64 supports -w to suppress warnings. GNU ld 2.40 will support the option as well (https://sourceware.org/bugzilla/show_bug.cgi?id=29654). This feature has some small value. E.g. when analyzing a large executable with relocation overflow issues, we may use --noinhibit-exec --emit-relocs to get an output file with static relocations despite relocation overflow issues. -w can significantly improve the link time as printing the massive warnings is slow. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D136569	2022-10-24 10:54:09 -07:00
Slava Zakharin	88da0de14f	Revert "[Libomp] Do not error on undefined version script symbols" This reverts commit `096f93e73d`. Revert "[Libomptarget] Make the plugins ingore undefined exported symbols" This reverts commit `3f62314c23`. Revert "[LLD] Enable --no-undefined-version by default." This reverts commit `7ec8b0d162`. Three commits are reverted because of the current omp build fail with GNU ld. See discussion here: https://reviews.llvm.org/rG096f93e73dc3	2022-10-13 14:12:07 -07:00
Dan Albert	7ec8b0d162	[LLD] Enable --no-undefined-version by default. Allowing incorrect version scripts is not a helpful default. Flip that to help users find their bugs at build time rather than at run time. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D135402	2022-10-12 12:29:38 -07:00
Fangrui Song	449f2ca146	[ELF] Add --compress-debug-sections=zstd `clang -gz=zstd a.o` passes this option to the linker. This option compresses output debug sections with zstd and sets ch_type to ELFCOMPRESS_ZSTD. As of today, very few DWARF consumers recognize ELFCOMPRESS_ZSTD. Use the llvm::zstd::compress API with level llvm::zstd::DefaultCompression (5), which we may tune after we have more experience with zstd output. zstd has built-in parallel compression support (so we don't need to do D117853 for zlib), which is not leveraged yet. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D133548	2022-09-09 10:30:18 -07:00
Fangrui Song	5e0464e38b	[ELF] Support ELFCOMPRESS_ZSTD input so that lld accepts relocatable object files produced by `clang -c -g -gz=zstd`. We don't want to increase the size of InputSection, so do redundant but cheap ch_type checks instead. Differential Revision: https://reviews.llvm.org/D129406	2022-09-09 10:25:37 -07:00
Alvin Wong	bf7c5f1fae	[LLD][MinGW] Add --[no-]guard-cf and --[no-]guard-longjmp These will be LLD-specific options to support Control Flow Guard for the MinGW target. They are disabled by default, but enabling `--guard-cf` will also enable `--guard-longjmp` unless `--no-guard-longjmp` is also specified. These options maps to `-guard:cf,[no]longjmp`. Note that these features require the `_load_config_used` symbol to contain the load config directory and be filled with the required symbols. While current versions of mingw-w64 do not supply this symbol, the user can provide their own version of it. Reviewed By: MaskRay, rnk Differential Revision: https://reviews.llvm.org/D132808	2022-09-09 09:55:40 +03:00
Fangrui Song	e0f64b319e	[docs] Remove lld/ELF release note about --package-metadata= which is in release/15.x	2022-08-19 19:08:29 -07:00
Martin Storsjö	e6db064394	[doc] Remove release notes from the main branch for changes that were backported to 15.x	2022-08-12 11:21:51 +03:00
Martin Storsjö	5d513ef6cf	[LLD] [COFF] Add support for a new, mingw specific embedded directive -exclude-symbols: This is an entirely new embedded directive - extending the GNU ld command line option --exclude-symbols to be usable in embedded directives too. (GNU ld.bfd also got support for the same new directive, currently in the latest git version, after the 2.39 branch.) This works as an inverse to the regular embedded dllexport directives, for cases when autoexport of all eligible symbols is performed. Differential Revision: https://reviews.llvm.org/D130120	2022-08-11 11:59:48 +03:00
Martin Storsjö	d1da6469f9	[LLD] [MinGW] Implement the --exclude-symbols option This adds support for the existing GNU ld command line option, which allows excluding individual symbols from autoexport (when linking a DLL and no symbols are marked explicitly as dllexported). Differential Revision: https://reviews.llvm.org/D130118	2022-08-11 11:59:47 +03:00
Alex Brachet	dbd04b853b	[ELF] Support --package-metadata This was recently introduced in GNU linkers and it makes sense for ld.lld to have the same support. This implementation omits checking if the input string is valid json to reduce size bloat. Differential Revision: https://reviews.llvm.org/D131439	2022-08-08 21:31:58 +00:00
Tom Stellard	809855b56f	Bump the trunk major version to 16	2022-07-26 21:34:45 -07:00
Jez Ng	403d61aedd	[lld-macho] Enable EH frame relocation / pruning This just removes the code that gates the logic. The main issue here is perf impact: without {D122258}, LLD takes a significant perf hit because it now has to do a lot more work in the input parsing phase. But with that change to eliminate unnecessary EH frames from input object files, the perf overhead here is minimal. Concretely, here are the numbers for some builds as measured on my 16-core Mac Pro: chromium_framework This is without the use of `-femit-dwarf-unwind=no-compact-unwind`: base diff difference (95% CI) sys_time 1.826 ± 0.019 1.962 ± 0.034 [ +6.5% .. +8.4%] user_time 9.306 ± 0.054 9.926 ± 0.082 [ +6.2% .. +7.1%] wall_time 8.225 ± 0.068 8.947 ± 0.128 [ +8.0% .. +9.6%] samples 15 22 With that flag enabled, the regression mostly disappears, as hoped: base diff difference (95% CI) sys_time 1.839 ± 0.062 1.866 ± 0.068 [ -0.9% .. +3.8%] user_time 9.452 ± 0.068 9.490 ± 0.067 [ -0.1% .. +0.9%] wall_time 8.383 ± 0.127 8.452 ± 0.114 [ -0.1% .. +1.8%] samples 17 21 Unnamed internal app Without `-femit-dwarf-unwind`, this is the perf hit: base diff difference (95% CI) sys_time 1.372 ± 0.029 1.317 ± 0.024 [ -4.6% .. -3.5%] user_time 2.835 ± 0.028 2.980 ± 0.027 [ +4.8% .. +5.4%] wall_time 3.205 ± 0.079 3.383 ± 0.066 [ +4.9% .. +6.2%] samples 102 83 With `-femit-dwarf-unwind`, the perf hit almost disappears: base diff difference (95% CI) sys_time 1.274 ± 0.026 1.270 ± 0.025 [ -0.9% .. +0.3%] user_time 2.812 ± 0.023 2.822 ± 0.035 [ +0.1% .. +0.7%] wall_time 3.166 ± 0.047 3.174 ± 0.059 [ -0.2% .. +0.7%] samples 95 97 Just for fun, I measured the impact of `-femit-dwarf-unwind` on ld64 (`base` has the extra DWARF unwind info in the input object files, `diff` doesn't): base diff difference (95% CI) sys_time 1.128 ± 0.010 1.124 ± 0.023 [ -1.3% .. +0.6%] user_time 7.176 ± 0.030 7.106 ± 0.094 [ -1.5% .. -0.4%] wall_time 7.874 ± 0.041 7.795 ± 0.121 [ -1.7% .. -0.3%] samples 16 25 And for LLD: base diff difference (95% CI) sys_time 1.315 ± 0.019 1.280 ± 0.019 [ -3.2% .. -2.0%] user_time 2.980 ± 0.022 2.822 ± 0.016 [ -5.5% .. -5.0%] wall_time 3.369 ± 0.038 3.175 ± 0.033 [ -6.2% .. -5.3%] samples 47 47 So parsing the extra EH frames is a lot more expensive for us than for ld64. But given that we are quite a lot faster than ld64 to begin with, I guess this isn't entirely unexpected... Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D129540	2022-07-13 21:14:05 -04:00
Ben Dunbobbin	dfb77f2e99	[LLD][ELF] Add FORCE_LLD_DIAGNOSTICS_CRASH to force LLD to crash Add FORCE_LLD_DIAGNOSTICS_CRASH inspired by the existing FORCE_CLANG_DIAGNOSTICS_CRASH. This is particularly useful for people customizing LLD as they may want to modify the crash reporting behavior. Differential Revision: https://reviews.llvm.org/D128195	2022-07-05 09:43:09 +01:00
Nico Weber	a2c1f7c90d	[lld, ELF and mac] Add --time-trace=<file>, remove --time-trace-file=<file> `--time-trace=foo` has the same behavior as `--time-trace --time-trace-file=<file>` had previously. Also, for mac, make --time-trace-granularity not imply --time-trace, to match behavior of the ELF port. Differential Revision: https://reviews.llvm.org/D128451	2022-06-23 15:46:22 -04:00
Fangrui Song	e09f77d394	[ELF] Remove support for legacy .zdebug sections .zdebug is unlikely used any longer: gcc -gz switched from legacy .zdebug to SHF_COMPRESSED with binutils 2.26 (2016), which has been several years. clang 14 dropped -gz=zlib-gnu support. According to Debian Code Search (`gz=zlib-gnu`), no project uses -gz=zlib-gnu. Remove .zdebug support to (a) simplify code and (b) allow removal of llvm-mc's --compress-debug-sections=zlib-gnu. In case the old object file `a.o` uses .zdebug, run `objcopy --decompress-debug-sections a.o` Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D126793	2022-06-02 13:37:19 -07:00
Fangrui Song	2ac8ce5d56	Revert D125410 "[ELF] Align the end of PT_GNU_RELRO to max-page-size instead of common-page-size" This reverts commit `ebdb9d635a`. Changing p_memsz is insufficient and may make PT_GNU_RELRO extend beyond the PT_LOAD.	2022-05-12 20:41:22 -07:00
Fangrui Song	ebdb9d635a	[ELF] Align the end of PT_GNU_RELRO to max-page-size instead of common-page-size We picked common-page-size to match GNU ld. Recently, the resolution to GNU ld https://sourceware.org/bugzilla/show_bug.cgi?id=28824 (milestone: 2.39) switched to max-page-size so that the last page can be protected by RELRO in case the system page size is larger than common-page-size. Thanks to our two RW PT_LOAD scheme (D58892), switching to max-page-size does not change file size (while GNU ld's scheme may increase file size). Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D125410	2022-05-12 11:03:12 -07:00
Nico Weber	9c00e3d49e	[lld/win] Mention in release notes that /winsysroot: currently requires /machine: Differential Revision: https://reviews.llvm.org/D124254	2022-04-22 09:40:39 -04:00
Fangrui Song	c0065f1182	[ELF] Default to --no-fortran-common D86142 introduced --fortran-common and defaulted it to true (matching GNU ld but deviates from gold/macOS ld64). The default state was motivated by transparently supporting some FORTRAN 77 programs (Fortran 90 deprecated common blocks). Now I think it again. I believe we made a mistake to change the default: * this is a weird and legacy rule, though the breakage is very small * --fortran-common introduced complexity to parallel symbol resolution and will slow down it * --fortran-common more likely causes issues when users mix COMMON and STB_GLOBAL definitions (see https://github.com/llvm/llvm-project/issues/48570 and https://maskray.me/blog/2022-02-06-all-about-common-symbols). I have seen several issues in our internal projects and Android. On the other hand, --no-fortran-common is safer since COMMON/STB_GLOBAL have the same semantics related to archive member extraction. Therefore I think we should switch back, not punishing the common uage. A platform wanting --fortran-common can implement ld.lld as a shell script wrapper around `lld -flavor gnu --fortran-common "$@"`. Reviewed By: ikudrin, sfertile Differential Revision: https://reviews.llvm.org/D122450	2022-03-30 09:12:09 -07:00
Fangrui Song	7c7702b318	[ELF] Move section assignment from initializeSymbols to postParse https://discourse.llvm.org/t/parallel-input-file-parsing/60164 initializeSymbols currently sets Defined::section and handles non-prevailing COMDAT groups. Move the code to the parallel postParse to reduce work from the single-threading code path and make parallel section initialization infeasible. Postpone reporting duplicate symbol errors so that the messages have the section information. (`Defined::section` is assigned in postParse and another thread may not have the information). * duplicated-synthetic-sym.s: BinaryFile duplicate definition (very rare) now has no section information * comdat-binding: `%t/w.o %t/g.o` leads to an undesired undefined symbol. This is not ideal but we report a diagnostic to inform that this is unsupported. (See release note) * comdat-discarded-lazy.s: %tdef.o is unextracted. The new behavior (discarded section error) makes more sense * i386-comdat.s: switched to a better approach working around .gnu.linkonce.t.__x86.get_pc_thunk.bx in glibc<2.32 for x86-32. Drop the ancient no-longer-relevant workaround for __i686.get_pc_thunk.bx Depends on D120640 Differential Revision: https://reviews.llvm.org/D120626	2022-03-15 19:24:41 -07:00
Fangrui Song	9b61fff0eb	Revert D120626 "[ELF] Move section assignment from initializeSymbols to postParse" This reverts commit `c30e6447c0`. It exposed brittle support for __x86.get_pc_thunk.bx. Need to think a bit how to support __x86.get_pc_thunk.bx.	2022-03-15 19:00:54 -07:00
Fangrui Song	c30e6447c0	[ELF] Move section assignment from initializeSymbols to postParse https://discourse.llvm.org/t/parallel-input-file-parsing/60164 initializeSymbols currently sets Defined::section and handles non-prevailing COMDAT groups. Move the code to the parallel postParse to reduce work from the single-threading code path and make parallel section initialization infeasible. Postpone reporting duplicate symbol errors so that the messages have the section information. (`Defined::section` is assigned in postParse and another thread may not have the information). * duplicated-synthetic-sym.s: BinaryFile duplicate definition (very rare) now has no section information * comdat-binding: `%t/w.o %t/g.o` leads to an undesired undefined symbol. This is not ideal but we report a diagnostic to inform that this is unsupported. (See release note) * comdat-discarded-lazy.s: %tdef.o is unextracted. The new behavior (discarded section error) makes more sense Depends on D120640 Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D120626	2022-03-14 14:13:41 -07:00
Fangrui Song	4a8de2832a	[ELF] Add -z pack-relative-relocs GNU ld 2.38 added -z pack-relative-relocs which is similar to --pack-dyn-relocs=relr but synthesizes the `GLIBC_ABI_DT_RELR` version dependency if a shared object named `libc.so.` has a `GLIBC_2.` version dependency. This is used to implement the (as some glibc folks call) version lockout mechanism. Add this option, because glibc does not want to support --pack-dyn-relocs=relr which does not add `GLIBC_ABI_DT_RELR`. See https://maskray.me/blog/2021-10-31-relative-relocations-and-relr for detail. Close https://github.com/llvm/llvm-project/issues/53775 Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D120701	2022-03-10 19:54:21 -08:00
Peter Kasting	c5fb05f663	Reland: Make lld-link work in a non-MSVC shell, add /winsysroot: This relands `73e585e44d` (and `0574b5fc65`), with a fix for the failing test (by using Optional<StringRef>s instead of making StringRef::empty() mean absence of value). Differential Revision: https://reviews.llvm.org/D118070	2022-02-16 09:22:39 -05:00
Douglas Yung	437d4e01fe	Revert "try to fix windows build after 73e585e44d" and Revert "Reland "[lld/coff] Make lld-link work in a non-MSVC shell, add /winsysroot:"" This reverts commit `0574b5fc65` and `73e585e44d`. This change is causing the test Driver/cl-options.c to fail on Windows buildbots. https://lab.llvm.org/staging/#/builders/204/builds/1343	2022-02-11 23:47:53 -08:00
Nico Weber	73e585e44d	Reland "[lld/coff] Make lld-link work in a non-MSVC shell, add /winsysroot:" This relands commit `b3b2538df1`, except that the new files in Support are instead in a new library WindowsDriver.	2022-02-11 17:07:33 -05:00
Adrian Prantl	baac665adf	Revert "[lld/coff] Make lld-link work in a non-MSVC shell, add /winsysroot:" This reverts commit `b3b2538df1`, it introduced a cycklic module depenency that broke the -DLLVM_ENABLE_MODULES=1 build.	2022-02-11 13:07:23 -08:00
Peter Kasting	b3b2538df1	[lld/coff] Make lld-link work in a non-MSVC shell, add /winsysroot: Makes lld-link work in a non-MSVC shell by autodetecting MSVC toolchain. Also adds support for /winsysroot and a few other switches. All this is done by refactoring to share code with clang-cl's existing support for the same. Differential Revision: https://reviews.llvm.org/D118070	2022-02-11 13:55:18 -05:00
Fangrui Song	ce45c95694	[ELF] Remove obscure -dp and GNU ld incompatible --[no-]define-common, ignore -d/-dc https://maskray.me/blog/2022-02-06-all-about-common-symbols#no-define-common In GNU ld, -dc only affects -r links and causes COMMON symbols to be allocated. --no-define-common is defined to make COMMON symbols undefined for -shared. AIUI --no-define-common is a workaround around glibc 2.1 time and not really useful. gold confuses --define-common with -d/FORCE_COMMON_ALLOCATION and implements --define-common with -d semantics. Its --no-define-common is incompatible with GNU ld. In ld.lld, `b2a23cf3c0` fixed the default -r behavior for COMMON symbols but ported the incompatible gold --[no-]define-common. To the best of my knowledge, no project uses -dp --[no-]define-common. So just remove these options. -d/-dc are used by the following projects: * grub grub-core/genmod.sh.in uses -Wl,-r,-d (https://lists.gnu.org/archive/html/grub-devel/2022-02/msg00088.html) * FreeBSD crunchgen uses -Wl,-dc (https://reviews.freebsd.org/D34215) A no-op implementation works for them. Only when a program inspects relocatable output by itself and does not recognize COMMON symbols, there may be a problem. This is an extremely unlikely case. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D119108	2022-02-09 10:35:53 -08:00
Tom Stellard	a2601c9887	Bump the trunk major version to 15	2022-02-01 23:54:52 -08:00
Tom Stellard	e80c52986e	[docs] Remove hard-coded version numbers from sphinx configs This updates all the non-runtime project release notes to use the version number from CMake instead of the hard-coded version numbers in conf.py. It also hides warnings about pre-releases when the git suffix is dropped from the LLVM version in CMake. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D112181	2022-02-01 23:14:12 -08:00
Fangrui Song	0c3704fdbd	[ELF] Deduplicate names of local symbols only with -O2 The deduplication requires a DenseMap of the same size of the local part of .strtab . I optimized it in `e205445434` but it is still quite slow. For Release build of clang, deduplication makes .strtab 1.1% smaller and makes the link 3% slower. For chrome, deduplication makes .strtab 0.1% smaller and makes the link 6% slower. I suggest that we only perform the optimization with -O2 (default is -O1). Not deduplicating local symbol names will simplify parallel symbol table write. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D118577	2022-02-01 10:10:22 -08:00
Stephan T. Lavavej	8bd106a891	[NFC] Fix typos in release notes. Reviewed By: ldionne, Mordante, MaskRay Differential Revision: https://reviews.llvm.org/D115685	2021-12-14 14:19:42 -08:00
Fangrui Song	1ce51a5f35	[ELF] --cref: If -Map is specified, print to the map file PR48282: This behavior matches GNU ld and gold. Reviewed By: markj Differential Revision: https://reviews.llvm.org/D114663	2021-11-29 14:14:53 -08:00
Fangrui Song	38ed1db7e8	[ELF] Support non-RAX/non-adjacent R_X86_64_GOTPC32_TLSDESC/R_X86_64_TLSDESC_CALL The current TLSDESC optimization code assumes: ``` leaq x@tlsdesc(%rip), %rax call x@tlscall(%rax) # adjacent ``` From https://gitlab.freedesktop.org/mesa/mesa/-/issues/5665 , it seems that the two instructions may not be adjacent in GCC 10's output: ``` leaq x@tlsdesc(%rip), %rax something else call x@tlscall(%rax) ``` This patch supports the case. While here, support non-RAX registers for R_X86_64_GOTPC32_TLSDESC, in case the compiler generates inefficient: ``` leaq x@tlsdesc(%rip), %rcx # or %rdx, %rbx, %rdi, ... movq %rcx, %rax call *x@tlscall(%rax) # GNU ld/gold error for non-RAX ``` Differential Revision: https://reviews.llvm.org/D114416	2021-11-23 10:30:11 -08:00
Fangrui Song	a05384dc89	[ELF] Make --no-relax disable R_X86_64_GOTPCRELX and R_X86_64_REX_GOTPCRELX GOT optimization This brings back the original version of D81359. I have found several use cases now. * Unlike GNU ld, LLD's relocation processing is one pass. If we decide to optimize(relax) R_X86_64_{,REX_}GOTPCRELX, we will suppress GOT generation and cannot undo the decision later. Optimizing R_X86_64_REX_GOTPCRELX can usually make it easy to hit `relocation R_X86_64_REX_GOTPCRELX out of range` because the distance to GOT is usually shorter. Without --no-relax, the user has to recompile with `-Wa,-mrelax-relocations=no`. * The option would help during my investigationg of the root cause of https://git.kernel.org/linus/09e43968db40c33a73e9ddbfd937f46d5c334924 * There is need for relaxation for AArch64 & RISC-V. Implementing this for x86-64 improves consistency with little target-specific cost (two-line X86_64.cpp change). Reviewed By: alexander-shaposhnikov Differential Revision: https://reviews.llvm.org/D113615	2021-11-12 09:47:31 -08:00
Fangrui Song	6e04ec801b	[docs] Fix docs-lld-html	2021-10-28 18:44:44 -07:00
Fangrui Song	e39c138f45	[ELF] Implement TLSDESC for x86-32 `-z rela` is also supported. Tested with: ``` cat > ./a.c <<eof #include <assert.h> int foo(); int bar(); int main() { assert(foo() == 2); assert(foo() == 4); assert(bar() == 2); assert(bar() == 4); } eof cat > ./b.c <<eof #include <stdio.h> __thread int tls0; extern __thread int tls1; int foo() { return ++tls0 + ++tls1; } static __thread int tls2, tls3; int bar() { return ++tls2 + ++tls3; } eof echo '__thread int tls1;' > ./c.c sed 's/ /\t/' > ./Makefile <<'eof' .MAKE.MODE = meta curDirOk=true CC := gcc -m32 -g -fpic -mtls-dialect=gnu2 LDFLAGS := -m32 -Wl,-rpath=. all: a0 a1 a2 run: all ./a0 && ./a1 && ./a2 c.so: c.o; ${LINK.c} -shared $> -o $@ bc.so: b.o c.o; ${LINK.c} -shared $> -o $@ b.so: b.o c.so; ${LINK.c} -shared $> -o $@ a0: a.o b.o c.o; ${LINK.c} $> -o $@ a1: a.o b.so; ${LINK.c} $> -o $@ a2: a.o bc.so; ${LINK.c} $> -o $@ eof ``` and glibc `elf/tst-gnu2-tls1`. `/usr/local/bin/ld` points to the freshly built `lld`. `bmake run && bmake CFLAGS=-O1 run` => ok. Differential Revision: https://reviews.llvm.org/D112582	2021-10-28 17:52:03 -07:00
Fangrui Song	a954bb18b1	[ELF] Add --why-extract= to query why archive members/lazy object files are extracted Similar to D69607 but for archive member extraction unrelated to GC. This patch adds --why-extract=. Prior art: GNU ld -M prints ``` Archive member included to satisfy reference by file (symbol) a.a(a.o) main.o (a) b.a(b.o) (b()) ``` -M is mainly for input section/symbol assignment <-> output section mapping (often huge output) and the information may appear ad-hoc. Apple ld64 ``` __Z1bv forced load of b.a(b.o) _a forced load of a.a(a.o) ``` It doesn't say the reference file. Arm's proprietary linker ``` Selecting member vsnprintf.o(c_wfu.l) to define vsnprintf. ... Loading member vsnprintf.o from c_wfu.l. definition: vsnprintf reference : _printf_a ``` --- --why-extract= gives the user the full data (which is much shorter than GNU ld -Map). It is easy to track a chain of references to one archive member with a one-liner, e.g. ``` % ld.lld main.o a_b.a b_c.a c.a -o /dev/null --why-extract=- \| tee stdout reference extracted symbol main.o a_b.a(a_b.o) a a_b.a(a_b.o) b_c.a(b_c.o) b() b_c.a(b_c.o) c.a(c.o) c() % ruby -ane 'BEGIN{p={}}; p[$F[1]]=[$F[0],$F[2]] if $.>1; END{x="c.a(c.o)"; while y=p[x]; puts "#{y[0]} extracts #{x} to resolve #{y[1]}"; x=y[0] end}' stdout b_c.a(b_c.o) extracts c.a(c.o) to resolve c() a_b.a(a_b.o) extracts b_c.a(b_c.o) to resolve b() main.o extracts a_b.a(a_b.o) to resolve a ``` Archive member extraction happens before --gc-sections, so this may not be a live path under --gc-sections, but I think it is a good approximation in practice. * Specifying a file avoids output interleaving with --verbose. * Required `=` prevents accidental overwrite of an input if the user forgets `=`. (Most of compiler drivers' long options accept `=` but not ` `) Differential Revision: https://reviews.llvm.org/D109572	2021-09-20 09:52:30 -07:00
Fangrui Song	d001ab82e4	[ELF] Don't fall back to .text for e_entry We have the rule to simulate (https://sourceware.org/binutils/docs/ld/Entry-Point.html), but the behavior is questionable (https://sourceware.org/pipermail/binutils/2021-September/117929.html). gold doesn't fall back to .text. The behavior is unlikely relied by projects (there is even a warning for executable links), so let's just delete this fallback path. Reviewed By: jhenderson, peter.smith Differential Revision: https://reviews.llvm.org/D110014	2021-09-20 09:35:12 -07:00
Fangrui Song	44361e5b90	[ELF] Add --export-dynamic-symbol-list This is available in GNU ld 2.35 and can be seen as a shortcut for multiple --export-dynamic-symbol, or a --dynamic-list variant without the symbolic intention. In the long term, this option probably should be preferred over --dynamic-list. Reviewed By: peter.smith Differential Revision: https://reviews.llvm.org/D107317	2021-08-03 09:01:03 -07:00

1 2

93 Commits