clang-p2996

Fangrui Song 3b4d800911 [ELF] Parallelize writes of different OutputSections

We currently process one OutputSection at a time and for each OutputSection
write contained input sections in parallel. This strategy does not leverage
multi-threading well. Instead, parallelize writes of different OutputSections.

The default TaskSize for parallelFor often leads to inferior sharding. We
prepare the task in the caller instead.

* Move llvm::parallel::detail::TaskGroup to llvm::parallel::TaskGroup
* Add llvm::parallel::TaskGroup::execute.
* Change writeSections to declare TaskGroup and pass it to writeTo.
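
The strategy described above can be illustrated with a minimal sketch. It does not use lld's or LLVM's actual code: `InputChunk`, `OutputSectionSketch`, `runAll`, `writeSectionsSketch`, and the `taskBytes` threshold are hypothetical names, and plain `std::thread` workers stand in for `llvm::parallel::TaskGroup`. The sketch assumes that chunks never overlap in the output buffer. The caller shards each section's chunks into tasks by byte count, collects the tasks of all sections into one list, and only then runs them in parallel.

```cpp
#include <algorithm>
#include <atomic>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <functional>
#include <thread>
#include <vector>

// Hypothetical stand-ins for illustration; not lld's real classes.
struct InputChunk {
  std::size_t offset;             // destination offset in the output buffer
  std::vector<std::uint8_t> data; // bytes to copy
};

struct OutputSectionSketch {
  std::vector<InputChunk> chunks;
};

// Run every task on a small pool of threads and wait for all of them.
// Stand-in for a task group; workers pull the next task index atomically.
void runAll(const std::vector<std::function<void()>> &tasks,
            unsigned numThreads) {
  std::atomic<std::size_t> next{0};
  auto worker = [&] {
    for (std::size_t i = next.fetch_add(1); i < tasks.size();
         i = next.fetch_add(1))
      tasks[i]();
  };
  std::vector<std::thread> pool;
  for (unsigned t = 0; t < std::max(1u, numThreads); ++t)
    pool.emplace_back(worker);
  for (std::thread &th : pool)
    th.join();
}

// Shard the chunks of ALL sections into tasks of roughly taskBytes each,
// then run the tasks together, so small sections do not serialize the write.
void writeSectionsSketch(const std::vector<OutputSectionSketch> &sections,
                         std::vector<std::uint8_t> &buf,
                         std::size_t taskBytes, unsigned numThreads) {
  std::vector<std::function<void()>> tasks;
  std::uint8_t *out = buf.data();
  const std::size_t size = buf.size();
  for (const OutputSectionSketch &sec : sections) {
    const OutputSectionSketch *s = &sec;
    std::size_t begin = 0, bytes = 0;
    for (std::size_t i = 0; i < sec.chunks.size(); ++i) {
      bytes += sec.chunks[i].data.size();
      const bool last = i + 1 == sec.chunks.size();
      if (bytes >= taskBytes || last) {
        const std::size_t end = i + 1;
        // One task copies the chunk range [begin, end) of this section.
        tasks.push_back([s, out, size, begin, end] {
          for (std::size_t j = begin; j < end; ++j) {
            const InputChunk &c = s->chunks[j];
            assert(c.offset + c.data.size() <= size);
            std::copy(c.data.begin(), c.data.end(), out + c.offset);
          }
        });
        begin = end;
        bytes = 0;
      }
    }
  }
  runAll(tasks, numThreads);
}
```

Because tasks are sized by bytes in the caller rather than by a default per-item task size, a section with a few large chunks and a section with many tiny chunks both yield tasks of comparable cost in this sketch.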

Speed-up with --threads=8:

* clang -DCMAKE_BUILD_TYPE=Release: 1.11x as fast
* clang -DCMAKE_BUILD_TYPE=Debug: 1.10x as fast
* chrome -DCMAKE_BUILD_TYPE=Release: 1.04x as fast
* scylladb build/release: 1.09x as fast

On M1, many benchmarks are a small fraction of a percent faster. Mozilla showed the largest difference, with the patch being about 1.03x as fast.

Differential Revision: https://reviews.llvm.org/D131247

2022-08-24 09:40:03 -07:00

Inputs

[ELF][test] Clean up linkerscript/{filename-spec.s,group.s}

2022-05-13 11:53:03 -07:00

absolute2.s

…

absolute-expr.test

…

absolute.s

…

addr-zero.test

…

addr.test

…

address-expr-symbols.s

…

align1.test

…

align2.test

…

align3.test

…

align4.test

…

align5.test

…

align-empty.test

…

align-r.test

…

align-section-offset.test

…

align-section.test

…

alignof.test

…

alternate-sections.s

…

arm-exidx-discard-all.s

…

arm-exidx-discard.s

…

arm-exidx-order.test

…

arm-exidx-phdrs.test

…

arm-exidx-sentinel-and-assignment.s

…

arm-lscript.test

…

assert.s

…

at2.test

…

at3.test

[ELF][test] Avoid llvm-readelf/llvm-readobj one-dash long options

2021-07-16 10:02:47 -07:00

at5.test

…

at6.test

[ELF][test] Avoid llvm-readelf/llvm-readobj one-dash long options

2021-07-16 10:02:47 -07:00

at7.test

[ELF][test] Avoid llvm-readelf/llvm-readobj one-dash long options

2021-07-16 10:02:47 -07:00

at8.test

[ELF][test] Avoid llvm-readelf/llvm-readobj one-dash long options

2021-07-16 10:02:47 -07:00

at-addr.s

…

at.s

…

avr5.test

[lld][ELF] Support BFD name elf32-avr

2022-05-18 00:00:14 +00:00

broken-memory-declaration.s

…

bss-fill.test

…

comdat-gc.s

…

common-assign.s

…

common-exclude.s

…

common-filespec.test

…

common.s

[ELF] Fix llvm_unreachable failure when COMMON is placed in SHT_PROGBITS output section

2022-03-28 11:05:52 -07:00

compress-debug-sections-custom.s

…

compress-debug-sections.s

…

constructor.test

…

copy-rel-symbol-value-err.s

…

copy-rel-symbol-value.s

…

custom-section-type.s

[ELF] Change (NOLOAD) type mismatch to use SHT_NOBITS instead of SHT_PROGBITS

2022-05-06 07:49:42 -07:00

data-commands1.test

…

data-commands2.test

…

data-commands-gc.s

…

data-segment-relro.test

[ELF] Support custom sections between DATA_SEGMENT_ALIGN and DATA_SEGMENT_RELRO_END

2022-05-04 01:10:46 -07:00

define.test

…

defsym.s

…

diag1.test

…

diag2.test

…

diag3.test

…

diag4.test

…

diag5.test

…

diag6.test

…

discard-gnu-hash.s

[ELF][test] Improve discard-gnu-hash.s to check DT_HASH and DT_GNU_HASH

2022-01-12 12:43:49 -08:00

discard-gnu-version.s

…

discard-group.s

…

discard-interp.test

…

discard-linkorder.s

…

discard-phdr.s

…

discard-plt.s

[ELF] Support discarding .got.plt

2021-11-19 10:50:53 -08:00

discard-print-gc.s

…

discard-section-dynsym.s

[ELF] Fix spurious GOT/PLT assertion failure when .dynsym is discarded

2022-04-20 22:49:49 -07:00

discard-section-err.s

[ELF] Fix spurious GOT/PLT assertion failure when .dynsym is discarded

2022-04-20 22:49:49 -07:00

discard-section.s

…

dot-is-not-abs.s

…

double-bss.test

…

dynamic-sym.s

…

dynamic.s

…

early-assign-symbol.s

…

edata-etext.s

…

eh-frame-emit-relocs.s

…

eh-frame-hdr.s

…

eh-frame-merge.s

…

eh-frame-reloc-out-of-range.test

…

eh-frame.s

…

ehdr_start.s

…

emit-reloc-section-names.s

…

emit-reloc.s

…

emit-relocs-discard.s

…

emit-relocs-ehframe-discard.s

…

emit-relocs-multiple.s

…

emit-relocs-rela-dyn.s

…

empty-link-order.test

…

empty-load.s

…

empty-relaplt-dyntags.test

…

empty-section-size.test

…

empty-sections-expressions.s

…

empty-sections-expressions.test

…

empty-synthetic-removed-flags.s

…

empty-tls.test

…

entry.s

[ELF] Support quoted symbol in the ENTRY command

2022-06-25 12:19:45 -07:00

exclude-multiple.s

…

excludefile.s

…

exidx-crash.test

…

expr-invalid-sec.test

…

expr-sections.test

…

extend-pt-load1.test

…

extend-pt-load2.test

…

extend-pt-load3.test

…

filename-spec.s

[ELF][test] Add an input section description test with "()" in the filename

2022-05-13 12:02:14 -07:00

fill-exec-sections.s

…

fill.test

…

got-write-offset.s

…

group.s

[ELF][test] Clean up linkerscript/{filename-spec.s,group.s}

2022-05-13 11:53:03 -07:00

header-addr.test

…

header-phdr2.s

…

header-phdr.test

…

huge-temporary-file.s

…

i386-sections-max-va-overflow.s

…

icf-output-sections.s

…

icf.s

…

image-base.s

…

implicit-program-header.test

[ELF] Avoid adding an orphan section to a less suitable segment

2021-10-21 11:38:39 +07:00

include-cycle.s

…

info-section-type.s

…

input-archive.s

…

input-order.s

…

input-relative.s

…

input-sec-dup.s

…

insert-after.test

[ELF] Rename adjustSectionsBeforeSorting to adjustOutputSections and make it affect INSERT commands

2022-02-01 10:16:12 -08:00

insert-before.test

[ELF] Update flag propagation rule to ignore discarded output sections

2022-02-01 10:19:30 -08:00

insert-broken.test

…

insert-duplicate.test

[ELF] Add OVERWRITE_SECTIONS command

2021-06-13 12:41:11 -07:00

insert-multi.test

[ELF] Preserve section order within an INSERT AFTER command

2021-06-30 11:35:50 -07:00

insert-not-exist.test

…

lazy-symbols.test

…

linker-script-in-search-path.s

…

linkerscript.s

…

linkorder2.s

…

linkorder-linked-to.s

…

linkorder.s

…

lma-align2.test

[ELF] Expand LMA region if output section alignment introduces padding

2021-11-19 11:27:21 +01:00

lma-align.test

…

lma-offset2.s

…

lma-offset.s

…

lma-overflow.test

…

loadaddr.s

…

locationcountererr2.s

…

locationcountererr.test

…

map-file2.test

…

map-file.test

…

memory2.s

…

memory3.s

…

memory-at.test

…

memory-attr.test

[ELF] Support the "read-only" memory region attribute

2021-11-24 12:17:09 +07:00

memory-data-commands.test

…

memory-err.s

[ELF][test] Improve test coverage

2021-09-25 11:57:54 -07:00

memory-gap-explicit-expr.test

…

memory-ignored-dot-assign.test

…

memory-ignored-output-address.test

…

memory-include.test

…

memory-loc-counter.test

[ELF][test] Avoid llvm-readelf/llvm-readobj one-dash long options

2021-07-16 10:02:47 -07:00

memory-nonalloc.test

[ELF] Do not try to assign a memory region to a non-allocatable section

2021-11-15 15:59:39 +07:00

memory-region-alignment.test

…

memory.s

…

merge-header-load.s

[ELF][test] Avoid llvm-readelf/llvm-readobj one-dash long options

2021-07-16 10:02:47 -07:00

merge-nonalloc.s

…

merge-output-sections.s

…

merge-sections-reloc.s

…

merge-sections-syms.s

…

merge-sections.s

…

multi-sections-constraint.s

…

multiple-tbss.s

…

nmagic-alignment.test

…

no-filename-spec.s

[ELF] Disallow input section description without a filename

2022-05-13 11:06:01 -07:00

no-pt-load.test

…

no-space.s

…

nobits-offset.s

[ELF] Consider that NOLOAD sections should be placed in a PT_LOAD segment

2021-06-16 12:36:45 +02:00

noload.s

[ELF] Change (NOLOAD) type mismatch to use SHT_NOBITS instead of SHT_PROGBITS

2022-05-06 07:49:42 -07:00

non-absolute2.test

…

non-absolute.s

…

non-alloc-segment.s

…

non-alloc.s

…

numbers.s

…

obj-symbol-value.s

…

openbsd-bootdata.test

…

openbsd-randomize.s

…

openbsd-wxneeded.test

…

operators.test

[ELF] Optimize some non-constant alignTo with alignToPowerOf2. NFC

2022-07-24 11:20:49 -07:00

orphan-align.s

…

orphan-discard.s

[ELF][test] Avoid llvm-readelf/llvm-readobj one-dash long options

2021-07-16 10:02:47 -07:00

orphan-end.s

…

orphan-first-cmd.test

…

orphan-live-only.s

…

orphan-memory.test

[ELF] Better resemble GNU ld when placing orphan sections into memory regions

2021-11-11 15:07:38 +07:00

orphan-phdrs2.test

[ELF] Avoid adding an orphan section to a less suitable segment

2021-10-21 11:38:39 +07:00

orphan-phdrs.s

…

orphan-report.s

…

orphan.s

…

orphans.s

…

out-of-order-section-in-region.test

…

out-of-order.s

…

output-section-include.test

…

output-too-large-32bit.s

…

output-too-large.s

…

outputarch.test

…

outsections-addr.s

…

overlapping-sections.s

[ELF] Parallelize writes of different OutputSections

2022-08-24 09:40:03 -07:00

overlay-reject2.test

…

overlay-reject.test

…

overlay.test

…

overwrite-sections-discard.test

[ELF] Add OVERWRITE_SECTIONS command

2021-06-13 12:41:11 -07:00

overwrite-sections.test

[split-file] Default to --no-leading-lines

2021-08-16 19:23:11 -07:00

page-size-align.test

…

page-size.s

…

parse-section-in-addr.test

…

phdr-check.s

…

phdrs-flags.s

…

phdrs.s

…

plugin.test

[llvm-ar][test] Test that --plugin is ignored

2022-01-12 11:32:31 -08:00

ppc32-got2.s

[ELF][PPC32] Support .got2 in an output section description

2021-12-23 11:32:44 -08:00

preinit-array-empty.test

[ELF] Ensure output section is not discarded in addStartEndSymbols()

2021-11-19 11:45:58 +00:00

provide-empty-section.s

…

provide-shared2.s

…

provide-shared.s

…

pt_gnu_eh_frame.s

…

pt-interp.test

…

quoted-section-name.test

…

region-alias.s

…

relocatable-discard.s

…

repsection-symbol.s

…

repsection-va.s

…

rosegment.test

…

searchdir.s

…

section-address-align.test

…

section-align2.test

…

section-align.s

…

section-include.test

…

sections-constraint2.s

…

sections-constraint3.s

…

sections-constraint4.s

…

sections-constraint5.s

…

sections-constraint.s

…

sections-gc2.s

…

sections-gc.s

…

sections-keep.s

…

sections-max-va-overflow.s

…

sections-nonalloc.s

…

sections-padding.s

…

sections-sort.s

…

sections-va-overflow.test

…

sections.s

…

segment-headers.s

…

segment-none.s

…

segment-start.s

…

sizeof.s

…

sizeofheaders.s

…

sort2.s

…

sort-constructors.test

…

sort-init.s

…

sort-nested.s

…

sort-non-script.s

…

sort.s

…

start-end.test

…

subalign.s

…

symbol-alias-relocation.s

…

symbol-assign-many-passes2.test

…

symbol-assign-many-passes.test

…

symbol-assign-not-converge.test

…

symbol-assign-type.s

…

symbol-assignexpr.s

[ELF][test] Improve expression test

2022-06-25 21:11:32 -07:00

symbol-location.s

[ELF] Remove -Wl,-z,notext hint

2021-10-31 12:10:43 -07:00

symbol-memoryexpr.s

…

symbol-name.test

[ELF] Support symbol names with space in linker script expressions

2021-09-27 09:50:42 -07:00

symbol-only-align.test

…

symbol-only-flags.test

[ELF][test] Improve INSERT [AFTER|BEFORE] and adjustSectionsBeforeSorting tests

2022-01-28 22:21:13 -08:00

symbol-only.test

…

symbol-ordering-file2.s

…

symbol-ordering-file.s

…

symbol-pie.s

…

symbol-reserved.s

…

symbolreferenced.s

[ELF] Support quoted symbols in symbol assignments

2021-07-25 16:26:37 -07:00

symbols-non-alloc.test

…

symbols.s

[ELF] Fix assertion failure when PROVIDE/HIDDEN/PROVIDE_HIDDEN does not have =

2022-06-25 20:26:47 -07:00

synthetic-relsec-layout.s

…

synthetic-symbols1.test

…

synthetic-symbols2.test

…

synthetic-symbols3.test

…

synthetic-symbols4.test

…

target.s

[ELF] Support quoted name in the TARGET command

2022-06-25 12:31:20 -07:00

tbss.s

[ELF] Make dot in .tbss correct

2021-08-04 08:58:50 -07:00

thunk-gen-mips.s

…

tls-nobits-offset.s

[ELF] Align the first section of a PT_TLS even if its type is SHT_NOBITS

2021-07-29 15:14:00 +01:00

ttext-script.s

…

undef.s

…

unused-synthetic2.test

…

unused-synthetic.s

…

va.s

…

version-linker-symbol.s

…

version-script.s

…

visibility.s

…

wildcards2.s

…

wildcards.s

…