clang-p2996

Author	SHA1	Message	Date
Amir Ayupov	4627446d38	[BOLT] Fix AutoFDO output format after D154120 AutoFDO profile has no leading 0x in hex dumps. Reviewed By: #bolt, rafauler Differential Revision: https://reviews.llvm.org/D159507	2023-09-12 13:58:25 -07:00
Amir Ayupov	ffef4fe0db	[BOLT][NFC] Use formatv in DataAggregator/DataReader prints Reviewed By: #bolt, maksfb Differential Revision: https://reviews.llvm.org/D154120	2023-09-11 16:01:02 -07:00
Amir Ayupov	d796f36fbc	[BOLT][NFC] Simplify DataAggregator Use short loop instead of duplicating the code for setHasProfileAvailable. Reviewed By: #bolt, maksfb Differential Revision: https://reviews.llvm.org/D154749	2023-07-31 14:54:41 -07:00
Amir Ayupov	224e4cc516	[BOLT] Sort BranchData in DataAggregator Align perf reader to fdata behavior by sorting BranchData after reading samples, in the same way as DataReader: `20c66a0c66/bolt/lib/Profile/DataReader.cpp (L1239)` Namely, that order affects CallSiteInfo annotations which determine the construction order of CallGraph, which in turn affects function reordering. Reviewed By: #bolt, rafauler Differential Revision: https://reviews.llvm.org/D152731	2023-06-15 12:08:57 -07:00
Amir Ayupov	5acac7db6e	[BOLT][NFCI] Use StringRef.split in launchPerfProcess Use StringRef method instead of reimplementing the splitting. Incidentally, it also fixes the duplicate printing of the command arguments: ``` PERF2BOLT: spawning perf job to read branch events Launching perf: /usr/bin/perf script^@-F^@pid,ip,brstack -F^@pid,ip,brstack pid,ip,brstack -f -i PERF2BOLT: spawning perf job to read mem events Launching perf: /usr/bin/perf script^@-F^@pid,event,addr,ip -F^@pid,event,addr,ip pid,event,addr,ip -f -i PERF2BOLT: spawning perf job to read process events Launching perf: /usr/bin/perf script^@--show-mmap-events --show-mmap-events -f -i PERF2BOLT: spawning perf job to read task events Launching perf: /usr/bin/perf script^@--show-task-events --show-task-events -f -i ``` Fixes it to: ``` PERF2BOLT: spawning perf job to read branch events Launching perf: /usr/bin/perf script -F pid,ip,brstack -f -i PERF2BOLT: spawning perf job to read mem events Launching perf: /usr/bin/perf script -F pid,event,addr,ip -f -i PERF2BOLT: spawning perf job to read process events Launching perf: /usr/bin/perf script --show-mmap-events -f -i PERF2BOLT: spawning perf job to read task events Launching perf: /usr/bin/perf script --show-task-events -f -i ``` Reviewed By: #bolt, rafauler Differential Revision: https://reviews.llvm.org/D152483	2023-06-09 06:24:17 -07:00
Amir Ayupov	c061f75554	[BOLT] Handle recursive calls as inter-branches in DataAggregator Align yaml and fdata profiles by applying the same treatment to recursive calls (direct, indirect, tail). fdata profile increments entry count when handling recursive calls. Make perf/pre-aggregated perf reader (DataAggregator) do the same. Test Plan: In pre-aggregated-perf.test, add a dummy pre-aggregated branch entry between an indirect call in `frame_dummy` function and its entry point. Check that YAML profile gets incremented entry count for this function. End-to-end test: https://github.com/rafaelauler/bolt-tests/pull/24 Reviewed By: #bolt, maksfb Differential Revision: https://reviews.llvm.org/D152338	2023-06-08 04:17:07 -07:00
Amir Ayupov	713b28532e	[BOLT][NFC] Fix debug messages Fix debug printing, making it easier to compare two debug logs side by side: - `BinaryFunction::addRelocation`: print function name instead of `this` ptr, - `DataAggregator::doTrace`: remove duplicated function name. Reviewed By: #bolt, maksfb Differential Revision: https://reviews.llvm.org/D152314	2023-06-06 15:50:58 -07:00
Amir Ayupov	a478a09131	[BOLT][NFC] Drop MMap events for deleted files Don't parse/handle mmap events with "(deleted)" filename. Reviewed By: #bolt, rafauler Differential Revision: https://reviews.llvm.org/D151948	2023-06-05 13:03:40 -07:00
Amir Ayupov	bce889c8df	[BOLT] Align BranchInfo and FuncBranchData in DataAggregator::recordTrace `DataAggregator::recordTrace` serves two purposes: - Attaching LBR fallthrough ("trace") information to CFG (`getBranchInfo`), which eventually gets emitted as YAML profile. - Populating vector of offsets that gets added to `FuncBranchData`, which eventually gets emitted as fdata profile. `recordTrace` is invoked from `getFallthroughsInTrace` which checks its return status and passes on the collected vector of offsets to `doTrace`. However, if a malformed trace is passed to `recordTrace` it might partially attach the profile to CFG and exit with false, not propagating the vector of offsets to `doTrace`. This leads to a difference between fdata and yaml profile collected from the same binary and the same perf file. (Skylake LBR errata might produce such malformed traces where the last entry is duplicated, resulting in invalid fallthrough path between the last two entries). There are two ways to handle this mismatch: conservative (aligned with fdata), or aggressive (aligned with yaml). Conservative approach would discard the trace entirely, buffering the CFG updates until all fallthroughs are confirmed. Aggressive approach would apply CFG updates and return the matching fallthroughs in the vector even if the trace is invalid (doesn't correspond to a valid fallthrough path). I chose to go with the former (conservative/fdata) approach which produces more accurate profile. We can't rely on pre-filtering such traces early (in LBR sample processing) as DataAggregator is used for both perf samples and pre-aggregated perf information which loses branch stack information. Test Plan: https://github.com/rafaelauler/bolt-tests/pull/22 Reviewed By: #bolt, rafauler Differential Revision: https://reviews.llvm.org/D151614	2023-05-30 18:03:45 -07:00
Amir Ayupov	860543d96e	[BOLT][NFC] Extract DataAggregator::parseLBRSample Reviewed By: #bolt, rafauler Differential Revision: https://reviews.llvm.org/D150986	2023-05-19 17:50:02 -07:00
Amir Ayupov	17f3cbe3af	[BOLT][NFC] Use llvm::make_range Use `llvm::make_range` convenience wrapper from ADT. Reviewed By: #bolt, rafauler Differential Revision: https://reviews.llvm.org/D145887	2023-05-17 10:50:56 -07:00
Amir Ayupov	c7af4f383d	[BOLT][NFC] Simplify preprocessProfile Move out prepareToParse lambda, generalize it to handle mem events perf process. Reviewed By: #bolt, rafauler Differential Revision: https://reviews.llvm.org/D146002	2023-03-15 12:56:06 -07:00
Maksim Panchenko	73b89e3f38	[BOLT] Remove dependency on StringMap iteration order Remove the usage of StringMap in places where the iteration order affects the output since the iteration over StringMap is non-deterministic. Reviewed By: Amir Differential Revision: https://reviews.llvm.org/D145194	2023-03-03 09:21:26 -08:00
Amir Ayupov	4a7966ea1b	[BOLT][NFC] DataAggregator code cleanup Reviewed By: #bolt, rafauler Differential Revision: https://reviews.llvm.org/D139794	2023-01-18 13:44:44 -08:00
Joe Loser	a288d7f937	[llvm][ADT] Replace uses of `makeMutableArrayRef` with deduction guides Similar to how `makeArrayRef` is deprecated in favor of deduction guides, do the same for `makeMutableArrayRef`. Once all of the places in-tree are using the deduction guides for `MutableArrayRef`, we can mark `makeMutableArrayRef` as deprecated. Differential Revision: https://reviews.llvm.org/D141814	2023-01-16 14:49:37 -07:00
Amir Ayupov	6b05a62a6b	[BOLT] Check no-LBR samples in mayHaveProfileData No-LBR mode wasn't tested and slipped when mayHaveProfileData was added for Lite mode. This enables processing of profiles collected without LBR and converted with `perf2bolt -nl` option. Test Plan: bin/llvm-lit -a tools/bolt/test/X86/nolbr.s https://github.com/rafaelauler/bolt-tests/pull/20 Reviewed By: #bolt, rafauler Differential Revision: https://reviews.llvm.org/D140256	2023-01-03 14:43:36 -08:00
Matt Arsenault	765f3cafa1	bolt: Update more sys::Wait calls	2022-12-14 12:00:41 -05:00
Matt Arsenault	6be2db6ca5	bolt: Try to fix build after sys::Program API change Hopefully fixes build after `15a6e3c636`	2022-12-14 11:56:13 -05:00
Amir Ayupov	e8f5743e86	[BOLT][NFC] Use std::optional in BC	2022-12-11 22:13:46 -08:00
Amir Ayupov	835a9c2801	[BOLT][NFC] Use std::optional in DataAggregator	2022-12-11 22:13:46 -08:00
Amir Ayupov	3d573fdbb4	[BOLT][NFC] Use std::optional in BAT	2022-12-11 22:13:46 -08:00
Maksim Panchenko	0f915826cc	[BOLT] Handle access errors while reading profile When the user does not have permissions to access the profile, consume the error contained in Expected<> to avoid dumping stack to the user. Differential Revision: https://reviews.llvm.org/D139480	2022-12-07 17:11:30 -08:00
Krzysztof Parzyszek	3c255f679c	Process: convert Optional to std::optional This applies to GetEnv and FindInEnvPath.	2022-12-06 09:56:14 -08:00
Kazu Hirata	e324a80fab	[BOLT] Use std::nullopt instead of None (NFC) This patch mechanically replaces None with std::nullopt where the compiler would warn if None were deprecated. The intent is to reduce the amount of manual work required in migrating from Optional to std::optional. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716	2022-12-02 23:12:38 -08:00
Kazu Hirata	1028b165ee	[BOLT] Fix a build error This patch fixes: bolt/lib/Profile/DataAggregator.cpp:264:66: error: no viable conversion from 'Optional<llvm::StringRef>[3]' to 'ArrayRef<std::optional<StringRef>>'	2022-12-01 15:48:03 -08:00
Kazu Hirata	34bcadc38c	Use std::nullopt_t instead of NoneType (NFC) This patch replaces those occurrences of NoneType that would trigger an error if the definition of NoneType were missing in None.h. To keep this patch focused, I am deliberately not replacing None with std::nullopt in this patch or updating comments. They will be addressed in subsequent patches. This is part of an effort to migrate from llvm::Optional to std::optional: https://discourse.llvm.org/t/deprecating-llvm-optional-x-hasvalue-getvalue-getvalueor/63716 Differential Revision: https://reviews.llvm.org/D138539	2022-11-23 14:16:04 -08:00
Kazu Hirata	1fa870b1bd	Use None consistently (NFC) This patch replaces NoneType() and NoneType::None with None in preparation for migration from llvm::Optional to std::optional. In the std::optional world, we are not guaranteed to be able to default-construct std::nullopt_t or peek what's inside it, so neither NoneType() nor NoneType::None has a corresponding expression in the std::optional world. Once we consistently use None, we should even be able to replace the contents of llvm/include/llvm/ADT/None.h with something like: using NoneType = std::nullopt_t; inline constexpr std::nullopt_t None = std::nullopt; to ease the migration from llvm::Optional to std::optional. Differential Revision: https://reviews.llvm.org/D138376	2022-11-20 00:24:40 -08:00
Rafael Auler	ba9cc6537c	[PERF2BOLT] Fix unittest failure Fix failure caused by commit `e549ac072b` "Do not issue parsing error on weird build ids".	2022-09-28 16:01:57 -07:00
Rafael Auler	e549ac072b	[PERF2BOLT] Do not issue parsing error on weird build ids In weird entries we were issuing a parse error. For example, in line 5 here: 6862acc063b0aa86595f52ff81628577df4296ff a.so 6862acc063b0aa86595f52ff81628577df4296ff a.so 6862acc063b0aa86595f52ff81628577df4296ff a.so db758cb3c970044e78d5a4c99b011708a9995636 bin1 60326683eab31acfd03435d9ed4ff9a8 bin2 7d448e51851b4bdb33eac84f90e74628a14a5f00 b.so 742aa26e0211794356cc25f415c25230a26aa045 c.so Error reading BOLT data input file: line 89, column 33: malformed field Fix that. Reviewed By: #bolt, Amir Differential Revision: https://reviews.llvm.org/D134822	2022-09-28 14:41:55 -07:00
Amir Ayupov	39336fc09c	[BOLT] Control aggregation mode output profile file format In perf2bolt and `-aggregate-only` BOLT mode, the output profile file is written in fdata format by default. Provide a knob `-profile-format=[fdata,yaml]` to control the format. Note that `-w` option still dumps in YAML format. Reviewed By: #bolt, maksfb Differential Revision: https://reviews.llvm.org/D133995	2022-09-19 13:37:10 -07:00
Kazu Hirata	20f0f15a40	Use StringRef::contains (NFC)	2022-08-28 23:29:02 -07:00
Amir Ayupov	f119a2483d	[BOLT][NFC] Use llvm::any_of Replace the imperative pattern of the following kind ``` bool IsTrue = false; for (Element : Range) { if (Condition(Element)) { IsTrue = true; break; } } ``` with functional style `llvm::any_of`: ``` bool IsTrue = llvm::any_of(Range, [&](Element) { return Condition(Element); }); ``` Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D132276	2022-08-27 21:36:15 -07:00
Fabian Parzefall	d5c03def24	[BOLT] Towards FunctionLayout const-correctness A const-qualified reference to function layout allows accessing non-const qualified basic blocks on a const-qualified function. This patch adds or removes const-qualifiers where necessary to indicate where basic blocks are used in a non-const manner. Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D132049	2022-08-24 16:32:33 -07:00
Fabian Parzefall	f24c299e7d	Revert "[BOLT] Towards FunctionLayout const-correctness" This reverts commit `587d265342`.	2022-08-24 10:51:38 -07:00
Fabian Parzefall	587d265342	[BOLT] Towards FunctionLayout const-correctness A const-qualified reference to function layout allows accessing non-const qualified basic blocks on a const-qualified function. This patch adds or removes const-qualifiers where necessary to indicate where basic blocks are used in a non-const manner. Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D132049	2022-08-24 10:17:17 -07:00
Kazu Hirata	b498a8991e	[bolt] Remove redundaunt control-flow statements (NFC) Identified with readability-redundant-control-flow.	2022-07-30 10:35:49 -07:00
Rafael Auler	fc0ced73dc	Add BAT testing framework This patch refactors BAT to be testable as a library, so we can have open-source tests on it. This further fixes an issue with basic blocks that lack a valid input offset, making BAT omit those when writing translation tables. Test Plan: new testcases added, new testing tool added (llvm-bat-dump) Differential Revision: https://reviews.llvm.org/D129382	2022-07-29 14:55:04 -07:00
Maksim Panchenko	661577b5f4	[BOLT] Add support for the latest perf tool The latest perf tool can return non-empty buffer when executing buildid-list command, even when perf.data was recorded with -B flag. Some binaries will be listed without the ID, while others may have a recorded ID. Allow invalid entries on the input, while checking the valid ones for the match. Reviewed By: Amir Differential Revision: https://reviews.llvm.org/D130223	2022-07-22 07:56:15 -07:00
Fabian Parzefall	8477bc6761	[BOLT] Add function layout class This patch adds a dedicated class to keep track of each function's layout. It also lays the groundwork for splitting functions into multiple fragments (as opposed to a strict hot/cold split). Reviewed By: maksfb Differential Revision: https://reviews.llvm.org/D129518	2022-07-16 17:23:24 -07:00
Amir Ayupov	d2c8769936	[BOLT][NFC] Use range-based STL wrappers Replace `std::` algorithms taking begin/end iterators with `llvm::` counterparts accepting ranges. Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D128154	2022-06-23 22:16:27 -07:00
Fangrui Song	b92436efcb	[bolt] Remove unneeded cl::ZeroOrMore for cl::opt options	2022-06-05 13:29:49 -07:00
Rahman Lavaee	733dc3e50b	[BOLT] Report per-section hotness in bolt-heatmap. This patch adds a new feature to bolt heatmap to print the hotness of each section in terms of the percentage of samples within that section. Sample output generated for the clang binary: Section Name, Begin Address, End Address, Percentage Hotness .text, 0x1a7b9b0, 0x20a2cc0, 1.4709 .init, 0x20a2cc0, 0x20a2ce1, 0.0001 .fini, 0x20a2ce4, 0x20a2cf2, 0.0000 .text.unlikely, 0x20a2d00, 0x431990c, 0.3061 .text.hot, 0x4319910, 0x4bc6927, 97.2197 .text.startup, 0x4bc6930, 0x4c10c89, 0.0058 .plt, 0x4c10c90, 0x4c12010, 0.9974 Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D124412	2022-05-05 11:37:46 -07:00
Rahman Lavaee	e59e580116	[BOLT] Refactor DataAggregator::printLBRHeatMap. This also fixes some logs that were impacted by D123067. Reviewed By: Amir Differential Revision: https://reviews.llvm.org/D124281	2022-04-25 11:39:44 -07:00
Maksim Panchenko	77b75ca53f	[BOLT][perf2bolt] Fix base address calculation for shared objects When processing profile data for shared object or PIE, perf2bolt needs to calculate base address of the binary based on the map info reported by the perf tool. When the mapping data provided is for the second (or any other than the first) segment and the segment's file offset does not match its memory offset, perf2bolt uses wrong assumption about the binary base address. Add a function to calculate binary base address using the reported memory mapping and use the returned base for further address adjustments. Reviewed By: yota9 Differential Revision: https://reviews.llvm.org/D123755	2022-04-14 10:29:53 -07:00
Rahman Lavaee	0c13d97e2b	Allow building heatmaps from basic sampled events with `-nl`. I find that this is useful for finding event hotspots. Reviewed By: rafauler Differential Revision: https://reviews.llvm.org/D123067	2022-04-11 15:04:44 -07:00
serge-sans-paille	290e482342	Cleanup LLVMDWARFDebugInfo As usual with that header cleanup series, some implicit dependencies now need to be explicit: llvm/DebugInfo/DWARF/DWARFContext.h no longer includes: - "llvm/DebugInfo/DWARF/DWARFAcceleratorTable.h" - "llvm/DebugInfo/DWARF/DWARFCompileUnit.h" - "llvm/DebugInfo/DWARF/DWARFDebugAbbrev.h" - "llvm/DebugInfo/DWARF/DWARFDebugAranges.h" - "llvm/DebugInfo/DWARF/DWARFDebugFrame.h" - "llvm/DebugInfo/DWARF/DWARFDebugLoc.h" - "llvm/DebugInfo/DWARF/DWARFDebugMacro.h" - "llvm/DebugInfo/DWARF/DWARFGdbIndex.h" - "llvm/DebugInfo/DWARF/DWARFSection.h" - "llvm/DebugInfo/DWARF/DWARFTypeUnit.h" - "llvm/DebugInfo/DWARF/DWARFUnitIndex.h" Plus llvm/Support/Errc.h not included by a bunch of llvm/DebugInfo/DWARF/DWARF*.h files Preprocessed lines to build llvm on my setup: after: 1065629059 before: 1066621848 Which is a great diff! Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D119723	2022-02-15 09:16:03 +01:00
Vladislav Khmelevsky	5c2ae5f454	[BOLT] Refactor heatmap to be standalone tool Separate heatmap from bolt and build it as standalone tool. Reviewed By: maksfb Differential Revision: https://reviews.llvm.org/D118946	2022-02-07 22:00:44 +03:00
Amir Ayupov	a9cd49d50e	[BOLT][NFC] Move Offset annotation to Group 1 Summary: Move the annotation to avoid dynamic memory allocations. Improves the CPU time of instrumenting a large binary by 1% (+-0.8%, p-value 0.01) Test Plan: NFC Reviewers: maksfb FBD30091656	2022-01-18 13:24:50 -08:00
Amir Ayupov	d914486a9a	[BOLT][NFC] Refactor reset-release to move assignment Summary: Follow the clang-tidy suggestion to replace reset-release with move assignment. Move assignment's effect for unique_ptr: > Effects: Transfers ownership from `u` to `*this` as if by calling `reset(u.release())` followed by an assignment from `std::forward<D>(u.get_deleter())`.	2022-01-13 22:47:15 -08:00
Amir Ayupov	def464aaae	[BOLT][NFC] Fix braces usage in Profile Summary: Refactor bolt/*/Profile to follow the braces rule for if/else/loop from [LLVM Coding Standards](https://llvm.org/docs/CodingStandards.html). (cherry picked from FBD33345741)	2021-12-28 18:29:54 -08:00

54 Commits