clang-p2996

Author	SHA1	Message	Date
Alexander Yermolovich	e22ff52c10	[BOLT][DWARF] Change rangelists to use DW_RLE_offset_pair Before we always used DW_RLE_startx_length. This is not very efficient and leads to bigger .debug_addr section. Changed it to use DW_RLE_base_addressx/DW_RLE_offset_pair. clang-16 build in debug mode llvm-bolt ran on it with --update-debug-sections \| section \| before \| after \| diff \| % decrease \| \| .debug_rnglists \| 32732292 \| 31986051 \| -746241 \| 2.3% \| \| .debug_addr \| 14415808 \| 14184128 \| -231680 \| 1.6% \| Reviewed By: maksfb Differential Revision: https://reviews.llvm.org/D140439	2023-01-06 13:45:43 -08:00
Amir Ayupov	f40d25dd8d	[BOLT][NFC] Use llvm::reverse Use llvm::reverse instead of `for (auto I = rbegin(), E = rend(); I != E; ++I)` Reviewed By: #bolt, rafauler Differential Revision: https://reviews.llvm.org/D140516	2023-01-03 17:32:11 -08:00
Kazu Hirata	e8d6c537ac	[BOLT] Use std::optional instead of llvm::Optional (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2023-01-02 18:40:21 -08:00
Amir Ayupov	703d94d8f0	[BOLT] Respect -function-order in lite mode Process functions listed in -function-order file even in lite mode. Reviewed By: #bolt, maksfb Differential Revision: https://reviews.llvm.org/D140435	2022-12-28 20:50:20 -08:00
Vladislav Khmelevsky	17ed8f2928	[BOLT][AArch64] Handle adrp+ld64 linker relaxations The linker might relax adrp + ldr GOT address loading to adrp + add for local non-preemptible symbols (e.g. hidden/protected symbols in an executable). The linker usually doesn't update relocations properly after relaxation, so we have to handle such cases ourselves. To do that, while reading relocations we change the LD64 reloc to ADD if an instruction mismatch is found, and introduce FixRelaxationPass, which searches for ADRP+ADD pairs and, after performing some checks, replaces the ADRP target symbol with the already fixed ADD's one. Vladislav Khmelevsky, Advanced Software Technology Lab, Huawei Differential Revision: https://reviews.llvm.org/D138097	2022-12-23 01:20:18 +04:00
Amir Ayupov	5bcd980137	[BOLT][NFC] Make DWOId std::optional Reviewed By: #bolt, ayermolo Differential Revision: https://reviews.llvm.org/D140450	2022-12-21 10:40:08 -08:00
Maksim Panchenko	be9d3edee8	[BOLT][NFC] Remove unused PrintInstructions argument PrintInstructions was unused in BinaryFunction::print() and dump(). Reviewed By: Amir Differential Revision: https://reviews.llvm.org/D140440	2022-12-20 15:57:13 -08:00
Guillaume Chatelet	828ce42a59	[Alignment] Use Align in SectionRef::getAlignment() Differential Revision: https://reviews.llvm.org/D139110	2022-12-16 12:09:57 +00:00
Amir Ayupov	1628daf6e7	[BOLT][NFC] Use std::optional in ShrinkWrapping	2022-12-11 22:13:47 -08:00
Amir Ayupov	76cfea0c47	[BOLT][NFC] Use std::optional for readDWARFExpressionTargetReg	2022-12-11 22:13:47 -08:00
Amir Ayupov	34e7d65f79	[BOLT][NFC] Use std::optional in DWARFRewriter	2022-12-11 22:13:47 -08:00
Amir Ayupov	72528ee4b4	[BOLT][NFC] Use std::optional in has*NameRegex	2022-12-11 22:13:47 -08:00
Amir Ayupov	6e5b4dacf3	[BOLT][NFC] Use std::optional in RI	2022-12-11 22:13:46 -08:00
Amir Ayupov	15d1e51750	[BOLT][NFC] Use std::optional for getLTOCommonName	2022-12-11 22:13:46 -08:00
Amir Ayupov	e8f5743e86	[BOLT][NFC] Use std::optional in BC	2022-12-11 22:13:46 -08:00
Amir Ayupov	835a9c2801	[BOLT][NFC] Use std::optional in DataAggregator	2022-12-11 22:13:46 -08:00
Amir Ayupov	3d573fdbb4	[BOLT][NFC] Use std::optional in BAT	2022-12-11 22:13:46 -08:00
Kazu Hirata	3c700cf754	[BOLT] Use std::optional instead of None in comments (NFC) This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-10 16:57:33 -08:00
Vladislav Khmelevsky	9556b67840	[BOLT] Fix blocks layout reverse iterators Use container's reverse iterators, fix iterators types. Reviewed By: Amir Differential Revision: https://reviews.llvm.org/D139335	2022-12-09 13:32:34 +04:00
Alexander Yermolovich	f7a2131766	[BOLT][DWARF] Don't create extra .debug_str_offsets contributions With ThinLTO multiple CUs can share the same .debug_str_offsets contribution. We were creating a new one for each CU. This led to a binary size increase. Reviewed By: maksfb Differential Revision: https://reviews.llvm.org/D139214	2022-12-07 13:08:35 -08:00
Amir Ayupov	2563fd63c6	[BOLT][NFC] Use std::optional in MCPlusBuilder Reviewed By: maksfb, #bolt Differential Revision: https://reviews.llvm.org/D139260	2022-12-06 14:51:38 -08:00
Amir Ayupov	370e4761bc	[BOLT][NFC] Use std::optional for findAttributeInfo LLVM started switching from `llvm::Optional` to `std::optional`: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716/11 Reviewed By: #bolt, maksfb Differential Revision: https://reviews.llvm.org/D139259	2022-12-06 14:51:35 -08:00
Nico Weber	b2dc1a2ee1	Revert "[BOLT] Fix blocks layout reverse iterators" This reverts commit `7bb0cbfc32`. Doesn't build at least on macOS, see https://reviews.llvm.org/D139335#3974169	2022-12-06 08:40:36 -05:00
Vladislav Khmelevsky	7bb0cbfc32	[BOLT] Fix blocks layout reverse iterators Use container's reverse iterators Differential Revision: https://reviews.llvm.org/D139335	2022-12-06 12:00:38 +04:00
Fangrui Song	89fab98e88	[DebugInfo] llvm::Optional => std::optional https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-05 00:09:22 +00:00
Fangrui Song	b0df70403d	[Target] llvm::Optional => std::optional The updated functions are mostly internal with a few exceptions (virtual functions in TargetInstrInfo.h, TargetRegisterInfo.h). To minimize changes to LLVMCodeGen, GlobalISel files are skipped. https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-04 22:43:14 +00:00
Fangrui Song	f4c16c4473	[MC] llvm::Optional => std::optional https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-04 21:36:08 +00:00
Fangrui Song	ea47ccc78f	[BOLT] Fix after DebugInfoMetadata change `0ca43d4488`	2022-12-04 18:57:52 +00:00
Kazu Hirata	e324a80fab	[BOLT] Use std::nullopt instead of None (NFC) This patch mechanically replaces None with std::nullopt where the compiler would warn if None were deprecated. The intent is to reduce the amount of manual work required in migrating from Optional to std::optional. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-02 23:12:38 -08:00
Guillaume Chatelet	6c09ea3fdd	[Alignment][NFC] Use Align in MCStreamer::emitValueToAlignment Differential Revision: https://reviews.llvm.org/D138674	2022-11-24 16:09:44 +00:00
Guillaume Chatelet	4f17734175	[Alignment][NFC] Use Align in MCStreamer::emitCodeAlignment This patch makes code less readable but it will clean itself after all functions are converted. Differential Revision: https://reviews.llvm.org/D138665	2022-11-24 14:51:46 +00:00
Nico Weber	e8ce5f1ec9	[bolt] Use llvm::sys::RWMutex instead of std::shared_timed_mutex This has the following advantages: - std::shared_timed_mutex is macOS 10.12+ only. llvm::sys::RWMutex automatically switches to a different implementation internally when targeting older macOS versions. - bolt only needs std::shared_mutex, not std::shared_timed_mutex. llvm::sys::RWMutex automatically uses std::shared_mutex internally where available. std::shared_mutex and RWMutex have the same API, so no code changes other than types and includes are needed. Differential Revision: https://reviews.llvm.org/D138423	2022-11-21 19:24:32 -05:00
Kazu Hirata	1fa870b1bd	Use None consistently (NFC) This patch replaces NoneType() and NoneType::None with None in preparation for migration from llvm::Optional to std::optional. In the std::optional world, we are not guaranteed to be able to default-construct std::nullopt_t or peek what's inside it, so neither NoneType() nor NoneType::None has a corresponding expression in the std::optional world. Once we consistently use None, we should even be able to replace the contents of llvm/include/llvm/ADT/None.h with something like: using NoneType = std::nullopt_t; inline constexpr std::nullopt_t None = std::nullopt; to ease the migration from llvm::Optional to std::optional. Differential Revision: https://reviews.llvm.org/D138376	2022-11-20 00:24:40 -08:00
Alexey Moksyakov	1fb186198a	adds huge pages support of PIE/no-PIE binaries This patch adds the huge pages support (-hugify) for PIE/no-PIE binaries. Also returned functionality to support the kernels < 5.10 where there is a problem in a dynamic loader with the alignment of pages addresses. Differential Revision: https://reviews.llvm.org/D129107	2022-11-04 15:14:21 +03:00
serge-sans-paille	f71d32a0ee	Honor LLVM_LIBDIR_SUFFIX Some distribution install libraries under lib64. LLVM supports this through LLVM_LIBDIR_SUFFIX, have bolt do the same. Differential Revision: https://reviews.llvm.org/D137039	2022-11-01 23:54:06 +01:00
Maksim Panchenko	bcc4c90954	[BOLT] Fix instruction encoding validation Always use non-symbolizing disassembler for instruction encoding validation, as symbols will be treated as undefined/zeros by the encoder, causing byte sequence mismatches. Reviewed By: Amir Differential Revision: https://reviews.llvm.org/D136118	2022-10-18 13:50:00 -07:00
Maksim Panchenko	4d3a0cade2	[BOLT] Section-handling refactoring/overhaul Simplify the logic of handling sections in BOLT. This change brings more direct and predictable mapping of BinarySection instances to sections in the input and output files. * Only sections from the input binary will have a non-null SectionRef. When a new section is created as a copy of the input section, its SectionRef is reset to null. * RewriteInstance::getOutputSectionName() is removed as the section name in the output file is now defined by BinarySection::getOutputName(). * Querying BinaryContext for sections by name uses their original name. E.g., getUniqueSectionByName(".rodata") will return the original section even if the new .rodata section was created. * Input file sections (with relocations applied) are emitted via MC with ".bolt.org" prefix. However, their name in the output binary is unchanged unless a new section with the same name is created. * New sections are emitted internally with ".bolt.new" prefix if there's a name conflict with an input file section. Their original name is preserved in the output file. * Section header string table is properly populated with section names that are actually used. Previously we used to include discarded section names as well. * Fix the problem when dynamic relocations were propagated to a new section with a name that matched a section in the input binary. E.g., the new .rodata with jump tables had dynamic relocations from the original .rodata. Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D135494	2022-10-13 23:10:39 -07:00
Rafael Auler	4f158995b9	[BOLT] Add pass to fix ambiguous memory references This adds a round of checks to memory references, looking for incorrect references to jump table objects. Fix them by replacing the jump table reference with another object reference + offset. This solves bugs related to regular data references in code accidentally being bound to a jump table, and this reference being updated to a new (incorrect) location because we moved this jump table. Fixes #55004 Reviewed By: #bolt, maksfb Differential Revision: https://reviews.llvm.org/D134098	2022-10-12 18:39:50 -07:00
Rafael Auler	8d1fc45dc3	[BOLT][NFC] Refactor creation of symbol+addend references Put code that creates references to symbol+addend behind MCPlusBuilder. Will use this later in validate memory references pass. Reviewed By: #bolt, maksfb, yota9 Differential Revision: https://reviews.llvm.org/D134097	2022-10-12 18:39:26 -07:00
Maksim Panchenko	5fca9c5763	[BOLT] Change order of new sections While the order of new sections in the output binary was deterministic in the past (i.e. there was no run-to-run variation), it wasn't always rational as we used size to define the precedence of allocatable sections within "code" or "data" groups (probably unintentionally). Fix that by defining stricter section-ordering rules. Other than the order of sections, this should be NFC. Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D135235	2022-10-07 11:20:42 -07:00
Gabriel Ravier	9966b3e728	[BOLT] Fixed some typos I went over the output of the following mess of a command: `(ulimit -m 2000000; ulimit -v 2000000; git ls-files -z \| parallel --xargs -0 cat \| aspell list --mode=none --ignore-case \| grep -E '^[A-Za-z][a-z]*$' \| sort \| uniq -c \| sort -n \| grep -vE '.{25}' \| aspell pipe -W3 \| grep : \| cut -d' ' -f2 \| less)` and proceeded to spend a few days looking at it to find probable typos and fixed a few hundred of them in all of the llvm project (note, the ones I found are not anywhere near all of them, but it seems like a good start). Reviewed By: Amir, maksfb Differential Revision: https://reviews.llvm.org/D130824	2022-09-30 17:07:04 +02:00
Amir Ayupov	90d87dbf4b	[BOLT] Report BB reordering %-age vs profiled and total number of functions Reviewed By: spupyrev Differential Revision: https://reviews.llvm.org/D134819	2022-09-29 12:35:45 +02:00
Rafael Auler	ba9cc6537c	[PERF2BOLT] Fix unittest failure Fix failure caused by commit `e549ac072b` "Do not issue parsing error on weird build ids".	2022-09-28 16:01:57 -07:00
Amir Ayupov	39336fc09c	[BOLT] Control aggregation mode output profile file format In perf2bolt and `-aggregate-only` BOLT mode, the output profile file is written in fdata format by default. Provide a knob `-profile-format=[fdata,yaml]` to control the format. Note that `-w` option still dumps in YAML format. Reviewed By: #bolt, maksfb Differential Revision: https://reviews.llvm.org/D133995	2022-09-19 13:37:10 -07:00
spupyrev	539b6c68cb	[BOLT] Unifying implementations of ext-tsp After BOLT's merge to LLVM, there are two (almost identical) versions of the code layout algorithm. The diff unifies the implementations by keeping the one in LLVM. There are mild changes in the resulting block orders. I tested the changes extensively both on the clang binary and on prod services. Didn't see stat sig differences on average. Reviewed By: Amir Differential Revision: https://reviews.llvm.org/D129895	2022-09-19 08:29:08 -07:00
Kazu Hirata	ad2449f375	[BOLT] Remove duplicate types (NFC) This patch, a follow-up for `588628de3e`, removes duplicate types like T and PointerT in favor of reference and pointer, respectively.	2022-09-18 16:23:19 -07:00
Kazu Hirata	c9696322bd	[BOLT] Use x.empty() instead of llvm::empty(x) (NFC) I'm planning to deprecate and eventually remove llvm::empty. Note that no use of llvm::empty requires the ability of llvm::empty to determine the emptiness from begin/end only.	2022-09-18 11:01:56 -07:00
Maksim Panchenko	1d5393526c	[BOLT] Change base class of ExecutableFileMemoryManager When we derive EFMM from SectionMemoryManager, it brings into EFMM extra functionality, such as the registry of exception handling sections, page permission management, etc. Such functionality is of no use to llvm-bolt and can even be detrimental (see https://github.com/llvm/llvm-project/issues/56726). Change the base class of ExecutableFileMemoryManager to MemoryManager, avoid registering EH sections, and skip memory finalization. Fixes #56726 Reviewed By: yota9 Differential Revision: https://reviews.llvm.org/D133994	2022-09-16 13:39:12 -07:00
Maksim Panchenko	9742c25b98	[BOLT] Fix empty function emission in non-relocation mode In non-relocation mode, every function is emitted in its own section. If a function is empty, RuntimeDyld will still allocate 1-byte section for the function and initialize it with zero. As a result, we will overwrite the first byte of the original function contents with zero. Such scenario can happen when the input function had only NOP instructions which BOLT removes by default. Even though such functions likely cause undefined behavior, it's better to preserve their contents. Reviewed By: yota9 Differential Revision: https://reviews.llvm.org/D133978	2022-09-16 13:38:32 -07:00
Amir Ayupov	e002523b65	[BOLT] Verify externally referenced blocks against jump table targets For functions with references to internal offsets from data, verify externally referenced blocks against the set of jump table targets. Mark the function as non-simple if there are any unclaimed data to code references. Reviewed By: #bolt, maksfb Differential Revision: https://reviews.llvm.org/D132495	2022-09-16 11:44:33 -07:00


222 Commits