clang-p2996

Author	SHA1	Message	Date
Nico Weber	d782f198a6	lld/MachO: Fix two typos to cycle bots	2023-12-22 12:06:38 -05:00
Fangrui Song	26ddf4eee2	[ELF] Change .debug_names tombstone value to UINT32_MAX/UINT64_MAX (#74686 ) `clang -g -gpubnames -fdebug-types-section` now emits .debug_names section with references to local type unit entries defined in COMDAT .debug_info sections. ``` .section .debug_info,"G",@progbits,5657452045627120676,comdat .Ltu_begin0: ... .section .debug_names,"",@progbits ... // DWARF32 .long .Ltu_begin0 # Type unit 0 // DWARF64 // .long .Ltu_begin0 # Type unit 0 ``` When `.Ltu_begin0` is relative to a non-prevailing .debug_info section, the relocation resolves to 0, which is a valid offset within the .debug_info section. ``` cat > a.cc <<e struct A { int x; }; inline A foo() { return {1}; } int main() { foo(); } e cat > b.cc <<e struct A { int x; }; inline A foo() { return {1}; } void use() { foo(); } e clang++ -g -gpubnames -fdebug-types-section -fuse-ld=lld a.cc b.cc -o old ``` ``` % llvm-dwarfdump old ... Local Type Unit offsets [ LocalTU[0]: 0x00000000 ] ... Local Type Unit offsets [ LocalTU[0]: 0x00000000 // indistinguishable from a valid offset within .debug_info ] ``` https://dwarfstd.org/issues/231013.1.html proposes that we use a tombstone value instead to inform consumers. This patch implements the idea. The second LocalTU entry will now use 0xffffffff. https://reviews.llvm.org/D84825 has a TODO that we should switch the tombstone value for most `.debug_*` sections to UINT64_MAX. We have postponed the change for more than three years for consumers to migrate. At some point we shall make the change, so that .debug_names is no long different from other debug section that is not .debug_loc/.debug_ranges. Co-authored-by: Alexander Yermolovich <ayermolo@meta.com>	2023-12-21 18:59:11 -08:00
Shoaib Meenai	c77c3663db	[MachO] Fix test on llvm-x86_64-debian-dylib builder Make the `--implicit-check-not` account for the file extension as well. This will still fail if we ever have a builder with `.dylib` in its name, and we probably want a more robust solution, but this addresses the immediate issue. Fixes https://github.com/llvm/llvm-project/issues/75850 Fixes https://github.com/llvm/llvm-project/issues/75910	2023-12-19 11:42:26 -08:00
Fangrui Song	96aca7c517	[LTO] Improve diagnostics handling when parsing module-level inline assembly (#75726 ) Non-LTO compiles set the buffer name to "<inline asm>" (`AsmPrinter::addInlineAsmDiagBuffer`) and pass diagnostics to `ClangDiagnosticHandler` (through the `MCContext` handler in `MachineModuleInfoWrapperPass::doInitialization`) to ensure that the exit code is 1 in the presence of errors. In contrast, LTO compiles spuriously succeed even if error messages are printed. ``` % cat a.c void _start() {} asm("unknown instruction"); % clang -c a.c <inline asm>:1:1: error: invalid instruction mnemonic 'unknown' 1 \| unknown instruction \| ^ 1 error generated. % clang -c -flto a.c; echo $? # -flto=thin is the same error: invalid instruction mnemonic 'unknown' unknown instruction ^~~~~~~ error: invalid instruction mnemonic 'unknown' unknown instruction ^~~~~~~ 0 ``` `CollectAsmSymbols` parses inline assembly and is transitively called by both `ModuleSummaryIndexAnalysis::run` and `WriteBitcodeToFile`, leading to duplicate diagnostics. This patch updates `CollectAsmSymbols` to be similar to non-LTO compiles. ``` % clang -c -flto=thin a.c; echo $? <inline asm>:1:1: error: invalid instruction mnemonic 'unknown' 1 \| unknown instruction \| ^ 1 errors generated. 1 ``` The `HasErrors` check does not prevent duplicate warnings but assembler warnings are very uncommon.	2023-12-18 09:46:58 -08:00
Fangrui Song	01c8af5739	[ELF,test] Improve duplicate "symbol not found" error tests	2023-12-16 13:12:17 -08:00
SingleAccretion	b2cdf3cc4c	[lld][WebAssembly] Add an `--initial-heap` option (#75594 ) It is beneficial to preallocate a certain number of pages in the linear memory (i. e. use the "minimum" field of WASM memories) so that fewer "memory.grow"s are needed at startup. So far, the way to do that has been to pass the "--initial-memory" option to the linker. It works, but has the very significant downside of requiring the user to know the size of static data beforehand, as it must not exceed the number of bytes passed-in as "--initial-memory". The new "--initial-heap" option avoids this downside by simply appending the specified number of pages to static data (and stack), regardless of how large they already are. Ref: https://github.com/emscripten-core/emscripten/issues/20888.	2023-12-15 10:16:38 -08:00
Martin Storsjö	7dcd8ef135	[LLD] [MinGW] Respect the -S/-s options when writing PDB files (#75181 ) This allows avoiding including some stray DWARF sections (e.g. from toolchain provided files), when writing a PDB file. While that probably could be considered reasonable default behaviour, PDB writing and including DWARF sections are two entirely orthogonal concepts, and GNU ld (which can generate PDB files these days) does include DWARF unless -S/-s is passed, when creating a PDB.	2023-12-15 20:12:03 +02:00
Martin Storsjö	e36535d4be	[LLD] [COFF] Add /debug: options nodwarf and nosymtab (#75180 ) These allow tweaking what gets implied by /debug and /debug:dwarf.	2023-12-15 20:10:41 +02:00
Martin Storsjö	efe017f8f0	[LLD] [COFF] Parse all /debug: options, like /opt: (#75178 ) Most option handling is like it was before; the last /debug: option takes effect. However, the options /debug:dwarf or /debug:symtab don't reset all flags into the specific behaviour they chose before - e.g. if an earlier option enables writing a PDB, a later /debug:dwarf or /debug:symtab doesn't disable that. This allows combining these options with options for controlling PDB writing, for finetuning what is done.	2023-12-15 20:09:24 +02:00
Martin Storsjö	e6e615cade	[LLD] [COFF] Rewrite handling of the /debug: option. NFC. (#75175 ) Don't treat the options as unique enum items, but more as flags that can be composed, like the /opt: options. This still only processes the last option on the command line though, so the behaviour should still remain exactly as it was, in all corner cases.	2023-12-15 20:07:22 +02:00
Martin Storsjö	23e6e88187	[LLD] [COFF] Rewrite the config flags for dwarf debug info or symtab. NFC. (#75172 ) This shouldn't have any user visible effect, but makes the logic within the linker implementation more explicit. Note how DWARF debug info sections were retained even if enabling a link with PDB info only; that behaviour is preserved.	2023-12-15 20:01:13 +02:00
Wang Yaduo	c532ba4edd	[RISCV] Support printing immediate of RISCV MCInst in hexadecimal format (#74053 ) Enable the llvm-objdump to disassemble the immediate of RISCV instruction in hexadecimal format with --print-imm-hex flag.	2023-12-14 22:42:11 -08:00
Vitaly Buka	fc3adf74d3	Revert "[RISCV] Support printing immediate of RISCV MCInst in hexadecimal format" (#75561 ) Reverts llvm/llvm-project#74053 Breaks https://lab.llvm.org/buildbot/#/builders/5/builds/39291 Co-authored-by: Wang Yaduo <wangyaduo@linux.alibaba.com> Issue #75563	2023-12-14 22:05:47 -08:00
Wang Yaduo	3dde0d0256	[RISCV] Support printing immediate of RISCV MCInst in hexadecimal format (#74053 ) Enable the llvm-objdump to disassemble the immediate of RISCV instruction in hexadecimal format with --print-imm-hex flag.	2023-12-15 10:13:20 +08:00
Zequan Wu	47b4bbfe52	[LLD][COFF] add __buildid symbol. (#74652 ) After #71433, lld-link is able to always generate build id even when PDB is not generated. This adds the `__buildid` symbol to points to the start of 16 bytes guid (which is after `RSDS`) and allows profile runtime to access it and dump it to raw profile.	2023-12-14 17:43:10 -05:00
Jacek Caban	b1cc6f778d	[LLD][COFF] Fix ARM64 EC chunks comparator. (#75495 ) Spotted by Alexandre Ganea in #75407.	2023-12-14 23:05:29 +01:00
Kazu Hirata	732bccb8c1	Use StringRef::{starts,ends}_with (NFC) This patch replaces uses of StringRef::{starts,ends}with with StringRef::{starts,ends}_with for consistency with std::{string,string_view}::{starts,ends}_with in C++20. I'm planning to deprecate and eventually remove StringRef::{starts,ends}with.	2023-12-14 07:53:20 -08:00
stephenpeckham	2fd7657b66	[XCOFF] Display branch-absolute targets in hex. (#72532 ) Branch-absolute instructions are currently printed in decimal, and negative addresses are printed as positive numbers. With this change, addresses are printed in hex and negative addresses are converted to an unsigned 32- or 64-bit address.	2023-12-13 12:55:47 -06:00
Jacek Caban	f78024c855	[lld][COFF] Merge .00cfg section into .rdata. (#75207 ) .00cfg section is used by crt for load config and is merged by MS link.exe into .rdata.	2023-12-13 11:43:38 +01:00
Fangrui Song	215c565644	[ELF,test] Improve PROVIDE tests	2023-12-12 22:29:36 -08:00
Sam Clegg	97b25d91df	[lld][WebAssembly] Don't set importUndefined when -shared is used. NFC (#75241 ) `importUndefined` is only used a couple of places and both of those already handle `isPIC` separately.	2023-12-12 13:19:42 -08:00
Fangrui Song	42e4967140	[ELF] Don't create copy relocation/canonical PLT entry for a defined symbol (#75095 ) Copy relocations and canonical PLT entries are for symbols defined in a DSO. Currently we create them even for a `Defined`, possibly leading to an output that won't work at run-time (e.g. R_X86_64_JUMP_SLOT referencing a null symbol). ``` % cat a.s .globl _start, main .type main, @function _start: main: ret .rodata .quad main % clang -fuse-ld=lld -pie -nostdlib a.s % readelf -Wr a.out Relocation section '.rela.plt' at offset 0x290 contains 1 entry: Offset Info Type Symbol's Value Symbol's Name + Addend 00000000000033b8 0000000000000007 R_X86_64_JUMP_SLOT 12b0 ``` Report an error instead for the default `-z text` mode. GNU ld reports an error in `-z text` mode as well.	2023-12-12 10:14:36 -08:00
Jacek Caban	ccec22b675	[lld][NFC] Silence -Wuninitialized GCC 11 warnings. (#75183 ) Use of those variables is guarded by lastType, so they are not actually used uninitialized.	2023-12-12 14:45:42 +01:00
Fangrui Song	a439ef518d	[ELF][test] Improve copy relocation/canonical PLT entry tests	2023-12-11 12:32:13 -08:00
Kazu Hirata	cc4ecfd68b	[ADT] Rename SmallString::{starts,ends}with to {starts,ends}_with (#74916 ) This patch renames {starts,ends}with to {starts,ends}_with for consistency with std::{string,string_view}::{starts,ends}_with in C++20. Since there are only a handful of occurrences, this patch skips the deprecation phase and simply renames them.	2023-12-09 14:28:45 -08:00
Saiyedul Islam	5c4c199fe3	[AMDGPU][NFC] Improve testing for AMDHSA ABI Version (#74300 ) Add tests for COV4 as well as COV5 instead of only testing for the default version.	2023-12-08 18:09:45 +05:30
Fangrui Song	3fd1d6953d	[ELF] relocateNonAlloc: clean up workaround code relocateNonAlloc is costly for .debug_* section relocating. We don't want to burn CPU cycles on other targets' workarounds. Remove a temporary workaround for Linux objtool after a proper fix https://git.kernel.org/linus/b8ec60e1186cdcfce41e7db4c827cb107e459002 Move the R_386_GOTPC workaround for GCC<8 beside the R_PC workaround.	2023-12-07 12:43:40 -08:00
Fangrui Song	a4d4b45aef	[ELF] relocateNonAlloc: move likely expr == R_ABS before unlikely R_SIZE. NFC	2023-12-07 12:10:42 -08:00
Fangrui Song	0e6685ab1a	[ELF,test] Improve tombstone value tests Add 32-bit test for -z dead-reloc-in-nonalloc= and add tests for a non-x86 64-bit (x86-64 is unique in discerning signed/unsigned 32-bit absolute relocations (R_X86_64_32/R_X86_64_32S). AArch64/PPC64/RISC-V/etc don't have the distinction). Having a test will improve coverage for #74686	2023-12-07 10:49:57 -08:00
Fangrui Song	23d402e5b7	[ELF] IWYU <optional> NFC	2023-12-06 15:18:46 -08:00
Zequan Wu	aaf3a8ded4	[LLD][COFF] Add -build-id flag to generate .buildid section. (#71433 ) [RFC](https://discourse.llvm.org/t/rfc-add-build-id-flag-to-lld-link/74661) Before, lld-link only generate the debug directory containing guid when generating PDB with the hash of PDB content. With this change, lld-link can generate the debug directory when only `/build-id` is given: 1. If generating PDB, `/build-id` is ignored. Same behaviour as before. 2. Not generating PDB, using hash of the binary. - Not under MinGW, the debug directory is still in `.rdata` section. - Under MinGW, place the debug directory into new `.buildid` section.	2023-12-05 14:57:45 -05:00
Jacek Caban	72c6ca6943	[lld][COFF] Support .pdata section on ARM64EC targets. (#72521 ) ARM64EC needs to handle both ARM and x86_64 exception tables. This is achieved by separating their chunks and sorting them separately. EXCEPTION_TABLE directory references x86_64 variant, while ARM variant is exposed using CHPE metadata, which references __arm64x_extra_rfe_table and __arm64x_extra_rfe_table_size symbols.	2023-12-05 11:59:43 +01:00
Martin Storsjö	143133fe68	[LLD] [COFF] Don't preserve unnecessary __imp_ prefixed symbols (#72989 ) This redoes the fix from `3ab6209a3f` differently, without the unwanted effect of preserving unnecessary `__imp_` prefixed symbols. If the referencing object is a regular object, the `__imp_` symbol will have `isUsedInRegularObj` set on it from that already. If the referencing object is an LTO object, we set `isUsedInRegularObj` for any symbol starting with `__imp_`. If the object file defining the `__imp_` symbol is a regular object, the `isUsedInRegularObj` flag has no effect. If it is an LTO object, it causes the symbol to be preserved.	2023-12-04 23:38:46 +02:00
Jacek Caban	708158529b	[lld][COFF][NFC] Store pdata range as ChunkRange. (#74024 )	2023-12-02 13:09:51 +01:00
Philip Reames	62213be872	[LLD][RISCV] Fix incorrect call relaxation when mixing +c and -c objects (#73977 ) This fixes a mis-link when mixing compressed and non-compressed input to LLD. When relaxing calls, we must respect the source file that the section came from when deciding whether it's legal to use compressed instructions. If the call in question comes from a non-rvc source, then it will not expect 2-byte alignments and cascading failures may result. This fixes https://github.com/llvm/llvm-project/issues/63964. The symptom seen there is that a latter RISCV_ALIGN can't be satisfied and we either fail an assert or produce a totally bogus link result. (It can be easily reproduced by putting .p2align 5 right before the nop in the reduced test case and running check-lld on an assertions enabled build.) However, it's important to note this is just one possible symptom of the problem. If the resulting binary has a runtime switch between rvc and non-rvc routines (via e.g. ifuncs), then even if we manage to link we may execute invalid instructions on a machine which doesn't implement compressed instructions.	2023-12-01 11:02:53 -08:00
eleviant	28ad007927	[ThinLTO] Don't mark calloc function dead (#72673 ) Dead store elimination pass may fold malloc + memset calls into a single call to calloc. If calloc is not preserved and is not being called it can be marked dead which results in link error.	2023-11-30 19:02:25 +01:00
Sam Clegg	a0bd6361d4	[lld][WebAssembly] Convert bitcode test to assembly. NFC (#73716 )	2023-11-29 15:44:58 -08:00
Adrian Prantl	2c07181424	[LEB128] Don't initialize error on success This change removes an unnecessary branch from a hot path. It's also questionable API to override any previous error unconditonally.	2023-11-29 12:47:27 -08:00
Adrian Prantl	69b0cb9c56	Revert "[LEB128] Don't initialize error on success" This reverts commit `545c8e009e`.	2023-11-29 12:40:37 -08:00
Adrian Prantl	545c8e009e	[LEB128] Don't initialize error on success This change removes an unnecessary branch from a hot path. It's also questionable API to override any previous error unconditonally.	2023-11-29 12:16:32 -08:00
Fangrui Song	2212e90059	[ELF,LTO] Test calloc defined in a lazy bitcode file for (malloc+memset => calloc) libcall optimization Similar to https://reviews.llvm.org/D50017: malloc+memset references can be combined to a calloc reference, which is not explicit in the referencer's IR symbol table. If calloc is defined in a lazy bitcode file, we should extract the archive member to satisfy possible references from LTO generated object files; otherwise (current status, which will be fixed by #72673), `calloc` as a LazyObject symbol will be resolved by compileBitcodeFiles generated Undefined, leading to an incorrectly-extracted Defined symbol without section, which will lower to an SHN_ABS symbol at address 0.	2023-11-29 07:39:43 -08:00
Heejin Ahn	83305faeb5	[lld][WebAssembly] Fix bitcode LTO order in archive parsing (#73095 ) When doing LTO on multiple archives, the order with which bitcodes are linked to the LTO module is hard to control, given that processing undefined symbols can lead to parsing of an object file, which in turn lead to parsing of another object file before finishing parsing of the previous file. This can result in encountering a non-prevailing comdat first when linking, which can make the the symbol undefined, and the real definition is added later with an additional prefix to avoid duplication (e.g. `__cxx_global_var_init` and `__cxx_global_var_init.2`) So this one-line fix ensures we compile bitcodes in the order that we process comdats, so that when multiple archived bitcode files have the same variable with the same comdat, we make sure that the prevailing comdat will be linked first in the LTO. Fixes #62243.	2023-11-28 17:44:32 -08:00
Weining Lu	84a20989c6	[lld][LoongArch] Add a another corner testcase for elf::getLoongArchPageDelta Similar to `e752b58e0d`.	2023-11-25 20:38:45 +08:00
GongMengyao	c0722478d5	[lld][ELF] Add getBitcodeMachineKind test for LoongArch (#71931 ) Test that getBitcodeMachineKind can get right e_machine for Triple::loongarch64 and Triple::loongarch32 during LTO.	2023-11-24 14:36:47 +08:00
Jon Roelofs	d506aa4edf	Reland "[MC][AsmParser] Diagnose improperly nested .cfi frames" This showed up when simplifying some large testcase, where the cfi directives became out of sync with the proc's they enclose. Now restricted to platforms that support .subsections_via_symbols. This reverts commit `797b68c0ba`. Fixes: #72802 Differential revision: https://reviews.llvm.org/D153167 rdar://111459507	2023-11-21 10:33:11 -08:00
Fangrui Song	7ffabb61a5	[ELF] Support R_RISCV_SET_ULEB128/R_RISCV_SUB_ULEB128 in non-SHF_ALLOC sections (#72610 ) For a label difference like `.uleb128 A-B`, MC generates a pair of R_RISCV_SET_ULEB128/R_RISCV_SUB_ULEB128 if A-B cannot be folded as a constant. GNU assembler generates a pair of relocations in more cases (when A or B is in a code section with linker relaxation). `.uleb128 A-B` is primarily used by DWARF v5 .debug_loclists/.debug_rnglists (DW_LLE_offset_pair/DW_RLE_offset_pair entry kinds) implemented in Clang and GCC. `.uleb128 A-B` can be used in SHF_ALLOC sections as well (e.g. `.gcc_except_table`). This patch does not handle SHF_ALLOC. `-z dead-reloc-in-nonalloc=` can be used to change the relocated value, if the R_RISCV_SET_ULEB128 symbol is in a discarded section. We don't check the R_RISCV_SUB_ULEB128 symbol since for the expected cases A and B should be defined in the same input section.	2023-11-21 07:43:29 -08:00
Martin Storsjö	89efffd463	[LTO] [LLD] Don't alias the __imp_func and func symbol resolutions (#71376 ) Commit `b963c0b658` fixed LTO compilation of cases where one translation unit is calling a function with the dllimport attribute, and another translation unit provides this function locally within the same linked module (i.e. not actually dllimported); see https://github.com/llvm/llvm-project/issues/37453 or https://bugs.llvm.org/show_bug.cgi?id=38105 for full context. This was fixed by aliasing their GlobalResolution structs, for the `__imp_` prefixed and non prefixed symbols. I believe this fix to be wrong. This patch reverts that fix, and fixes the same issue differently, within LLD instead. The fix assumed that one can treat the `__imp_` prefixed and unprefixed symbols as equal, referencing SVN r240620 (`d766653534`). However that referenced commit had mistaken how this logic works, which was corrected later in SVN r240622 (`88e0f9206b`); those symbols aren't direct aliases for each other - but if there's a need for the `__imp_` prefixed one and the other one exists, the `__imp_` prefixed one is created, as a pointer to the other one. However this fix only works if both translation units are compiled as LTO; if the caller is compiled as a regular object file and the callee is compiled as LTO, the fix fails, as the LTO compilation doesn't know that the unprefixed symbol is needed. The only level that knows of the potential relationship between the `__imp_` prefixed and unprefixed symbol, across regular and bitcode object files, is LLD itself. Therefore, revert the original fix from `b963c0b658`, and fix the issue differently - when concluding that we can fulfill an undefined symbol starting with `__imp_`, mark the corresponding non prefixed symbol as used in a regular object for the LTO compilation, to make sure that this non prefixed symbol exists after the LTO compilation, to let LLD do the fixup of the local import. Extend the testcase to test a regular object file calling an LTO object file, which previously failed. This change also fixes another issue; an object file can provide both unprefixed and prefixed versions of the same symbol, like this: void importedFunc(void) { } void (*__imp_importedFunc)(void) = importedFunc; That allows the function to be called both with and without dllimport markings. (The concept of automatically resolving a reference to `__imp_func` to a locally defined `func` only is done in MSVC style linkers, but not in GNU ld, therefore MinGW mode code often uses this construct.) Previously, the aliasing of global resolutions at the LTO level would trigger a failed assert with "Multiple prevailing defs are not allowed" for this case, as both `importedFunc` and `__imp_importedFunc` could be prevailing. Add a case to the existing LLD test case lto-imp-prefix.ll to test this as well. This change (together with previous change in `3ab6209a3f`) completes LLD to work with mingw-w64-crt files (the base glue code for a mingw-w64 toolchain) built with LTO.	2023-11-21 15:06:00 +02:00
Martin Storsjö	6a323e7ec3	[LLD] [COFF] Add tests to observe details about LTO and __imp_ prefixes. NFC. (#72764 ) The commit `3ab6209a3f` had the undesired effect of retaining every `__imp_` symbol, even if it isn't referenced in any way. Add a testcase to observe this behaviour, to serve as a reference point if this behaviour were to be improved later. Port the testcase from `b963c0b658` from the llvm/LTO testsuite into LLD as a preparation for changing that fix; the moved testcase has a comment for one case which doesn't work currently. The testcase is ported mostly as is, but with symbol mangling simplified, rewriting function names from MSVC C++ mangling to plain C, and unnecessary debug info is removed. Add a case of a dllimported data symbol, in addition to the existing call of a dllimported function. Also extend the testcase to test combinations of both regular object files and LTO objects. Leave out one combination which currently fails, with a comment.	2023-11-21 12:40:11 +02:00
dong jianqiang	89f095d204	[lld][ELF] Add armeb support when incoming bc is arm big endian (#72604 ) Add armeb support when incoming bc is arm big endian: Fix error: could not infer e_machine from bitcode target triple armebv7-linux-gnueabi.	2023-11-20 21:12:06 -08:00
Martin Storsjö	797b68c0ba	Revert "[MC][AsmParser] Diagnose improperly nested .cfi frames" This reverts commit `4323da926f`. This broke building libffi for ARM on Windows (and probably Darwin), where one extern function intentionally falls through to another one, while sharing one CFI region. As long as one isn't using .subsections_via_symbols on MachO, this probably shouldn't be a hard error. Secondly, the tested pattern only produces an error on MachO and COFF targets, but not for ELF, making the error case even more inconsistent. Reverting this commit for now, to figure out the best way forward.	2023-11-19 23:22:03 +02:00

1 2 3 4 5 ...

16410 Commits