clang-p2996

Files

Sam Parker 103119a435 [WebAssembly] Lower wide SIMD i8 muls (#130785 )

Currently, 'wide' i32 simd multiplication, with extended i8 elements,
will perform the multiplication with i32 So, for IR like the following:
```
  %wide.a = sext <8 x i8> %a to <8 x i32>
  %wide.b = sext <8 x i8> %a to <8 x i32>
  %mul = mul <8 x i32> %wide.a, %wide.b
  ret <8 x i32> %mul
```

We would generate the following sequence:
```
  i16x8.extend_low_i8x16_s $push6=, $1
  local.tee $push5=, $3=, $pop6
  i32x4.extmul_low_i16x8_s $push0=, $pop5, $3
  v128.store 0($0), $pop0
  i8x16.shuffle $push1=, $1, $1, 4, 5, 6, 7, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0
  i16x8.extend_low_i8x16_s $push4=, $pop1
  local.tee $push3=, $1=, $pop4
  i32x4.extmul_low_i16x8_s $push2=, $pop3, $1
  v128.store 16($0), $pop2
  return
```

But now we perform the multiplication with i16, resulting in:
```
  i16x8.extmul_low_i8x16_s $push3=, $1, $1
  local.tee $push2=, $1=, $pop3
  i32x4.extend_high_i16x8_s $push0=, $pop2
  v128.store 16($0), $pop0
  i32x4.extend_low_i16x8_s $push1=, $1
  v128.store 0($0), $pop1
  return
```

2025-03-21 06:57:57 +00:00

add-prototypes-conflict.ll

…

add-prototypes.ll

…

address-offsets.ll

…

aliases.ll

…

atomic-fence.ll

…

atomic-fence.mir

…

atomic-mem-consistency.ll

…

atomic-pic.ll

…

atomic-rmw.ll

…

atomicrmw-cond-sub-clamp.ll

Add usub_cond and usub_sat operations to atomicrmw (#105568 )

2024-09-06 16:19:20 +01:00

atomicrmw-uinc-udec-wrap.ll

…

bulk-memory64.ll

[WebAssembly] Protect memory.fill and memory.copy from zero-length ranges. (#112617 )

2024-10-24 14:13:58 -07:00

bulk-memory.ll

[WebAssembly] Protect memory.fill and memory.copy from zero-length ranges. (#112617 )

2024-10-24 14:13:58 -07:00

byval.ll

…

call-indirect.ll

[WebAssembly] Define call-indirect-overlong and bulk-memory-opt features (#117087 )

2024-12-02 17:08:07 -08:00

call-pic.ll

…

call.ll

…

cfg-stackify-dbg-skip.ll

…

cfg-stackify-dbg.mir

…

cfg-stackify-eh-legacy.ll

[WebAssembly] Add -wasm-use-legacy-eh option (#122158 )

2025-01-09 22:36:10 -08:00

cfg-stackify-eh-legacy.mir

[WebAssembly] Add -wasm-use-legacy-eh option (#122158 )

2025-01-09 22:36:10 -08:00

cfg-stackify-eh.ll

[WebAssembly] Add unreachable before catch destinations (#123915 )

2025-01-22 22:39:43 -08:00

cfg-stackify.ll

…

cfi.ll

…

clear-cache.ll

…

comparisons-f32.ll

…

comparisons-f64.ll

…

comparisons-i32.ll

…

comparisons-i64.ll

…

conv-trap.ll

…

conv.ll

…

copysign-casts.ll

…

cpus.ll

…

custom-sections.ll

…

data-align.ll

[WebAssembly] Add -i128:128 to the datalayout string. (#119204 )

2024-12-10 09:21:58 -08:00

dead-vreg.ll

…

debugtrap.ll

…

disable-feature.ll

[WebAssembly] Define call-indirect-overlong and bulk-memory-opt features (#117087 )

2024-12-02 17:08:07 -08:00

divrem-constant.ll

…

eh-lsda.ll

…

eh-option-errors.ll

[WebAssembly] Add -wasm-use-legacy-eh option (#122158 )

2025-01-09 22:36:10 -08:00

exception-legacy.ll

[WebAssembly] Remove wasm-specific findWasmUnwindDestinations (#130374 )

2025-03-10 20:56:38 -07:00

exception-legacy.mir

[WebAssembly] Add unreachable before catch destinations (#123915 )

2025-01-22 22:39:43 -08:00

exception.ll

[WebAssembly] Remove wasm-specific findWasmUnwindDestinations (#130374 )

2025-03-10 20:56:38 -07:00

expand-variadic-call.ll

[IRBuilder] Generate nuw GEPs for struct member accesses (#99538 )

2024-08-09 13:25:04 +01:00

explicit-locals.mir

…

export-name.ll

…

extend-shuffles.ll

[WebAssembly] Recognise EXTEND_HIGH (#123325 )

2025-02-17 09:04:29 +00:00

externref-globalget.ll

[WebAssembly] Fix MIR printing of reference types (#113028 )

2024-10-22 13:48:00 -07:00

externref-globalset.ll

…

externref-inttoptr.ll

…

externref-ptrtoint.ll

…

externref-tableget.ll

…

externref-tableset.ll

…

externref-unsized-load.ll

…

externref-unsized-store.ll

…

f16.ll

…

f32.ll

…

f64.ll

…

fast-isel-br-i1.ll

…

fast-isel-i24.ll

…

fast-isel-i256.ll

…

fast-isel-no-offset.ll

[WebAssembly] Don't fold non-nuw add/sub in FastISel (#111278 )

2024-10-09 14:31:16 -07:00

fast-isel-noreg.ll

…

fast-isel-pr47040.ll

[WebAssembly] Don't fold non-nuw add/sub in FastISel (#111278 )

2024-10-09 14:31:16 -07:00

fast-isel.ll

…

fpclamptosat_vec.ll

[SelectionDAG] Use the nuw flag when expanding loads. (#119288 )

2024-12-10 06:28:09 -08:00

fpclamptosat.ll

[SelectionDAG] Use the nuw flag when expanding loads. (#119288 )

2024-12-10 06:28:09 -08:00

frem.ll

…

fshl.ll

…

func-attr-annotate.ll

…

func.ll

…

funcref-call.ll

…

funcref-globalget.ll

…

funcref-globalset.ll

…

funcref-table_call.ll

…

funcref-tableget.ll

…

funcref-tableset.ll

…

function-addr-offset.ll

…

function-bitcasts-varargs.ll

…

function-bitcasts.ll

…

function-info.mir

[WebAssembly] Rename CATCH/CATCH_ALL to *_LEGACY (#107187 )

2024-09-04 16:14:13 -07:00

function-pointer64.ll

[WebAssembly] Define call-indirect-overlong and bulk-memory-opt features (#117087 )

2024-12-02 17:08:07 -08:00

functype-emission.ll

…

global_dtors.ll

…

global-get-unlowerable.ll

…

global-get.ll

…

global-set-unlowerable.ll

…

global-set.ll

…

global.ll

…

globl.ll

…

half-precision.ll

[WebAssembly] Use the same lowerings for f16x8 as other float vectors. (#127897 )

2025-02-25 11:01:32 -08:00

i32-load-store-alignment.ll

…

i32.ll

…

i64-load-store-alignment.ll

…

i64.ll

…

i128-returned.ll

…

i128.ll

[SelectionDAG] Use the nuw flag when expanding loads. (#119288 )

2024-12-10 06:28:09 -08:00

ident.ll

…

immediates.ll

…

implicit-def.ll

…

import-module.ll

…

indirect-import.ll

…

indirectbr.ll

…

inline-asm-failure.ll

…

inline-asm-m.ll

…

inline-asm-roundtrip.ll

…

inline-asm.ll

…

inlineasm-output-template.ll

…

int-mac-reduction-loops.ll

[WebAssembly] Recognise EXTEND_HIGH (#123325 )

2025-02-17 09:04:29 +00:00

interleave.ll

[WebAssembly] Enable interleaved memory accesses (#125696 )

2025-02-17 09:09:52 +00:00

ir-locals-stackid.ll

…

ir-locals.ll

…

irreducible-cfg-exceptions.ll

…

irreducible-cfg.ll

…

irreducible-cfg.mir

…

legalize.ll

…

libcalls-trig.ll

[SelectionDAG] Use the nuw flag when expanding loads. (#119288 )

2024-12-10 06:28:09 -08:00

libcalls.ll

[WebAssembly] Add Libcall signatures for modf and variants (#130201 )

2025-03-06 15:48:39 -08:00

lit.local.cfg

…

llround-conv-i32.ll

…

load-ext-atomic.ll

…

load-ext.ll

…

load-store-i1.ll

…

load-store-pic.ll

…

load-store-static.ll

…

load.ll

…

lower-em-ehsjlj-multi-return.ll

…

lower-em-ehsjlj-options.ll

[WebAssemblyLowerEmscriptenEHSjLj] Avoid setting import_name where possible (#128564 )

2025-02-26 14:05:00 -08:00

lower-em-ehsjlj.ll

…

lower-em-exceptions-allowed.ll

…

lower-em-exceptions-resume-only.ll

…

lower-em-exceptions.ll

…

lower-em-sjlj-alias.ll

…

lower-em-sjlj-debuginfo.ll

…

lower-em-sjlj-indirect-setjmp.ll

…

lower-em-sjlj-sret.ll

…

lower-em-sjlj.ll

[WebAssemblyLowerEmscriptenEHSjLj] Avoid setting import_name where possible (#128564 )

2025-02-26 14:05:00 -08:00

lower-global-dtors.ll

…

lower-wasm-ehsjlj-phi.ll

…

lower-wasm-ehsjlj.ll

[WebAssemblyLowerEmscriptenEHSjLj] Avoid setting import_name where possible (#128564 )

2025-02-26 14:05:00 -08:00

lower-wasm-sjlj.ll

…

main-declaration.ll

…

main-no-args.ll

…

main-three-args.ll

…

main-with-args.ll

…

masked-shifts.ll

…

mem-intrinsics.ll

…

memory64-feature.ll

…

memory-addr32.ll

…

memory-addr64.ll

…

muloti4.ll

…

multi-return.ll

[WebAssembly] Add -i128:128 to the datalayout string. (#119204 )

2024-12-10 09:21:58 -08:00

multivalue_libcall.ll

[SelectionDAG] Use the nuw flag when expanding loads. (#119288 )

2024-12-10 06:28:09 -08:00

multivalue-dont-move-def-past-use.mir

[win] NFC: Rename EHCatchret to EHCont to allow for EH Continuation targets that aren't catchret instructions (#129953 )

2025-03-06 09:28:44 -08:00

multivalue-stackify.ll

…

multivalue-stackify.py

…

multivalue.ll

…

mutable-globals.ll

…

naked-fn-with-frame-pointer.ll

[SelectionDAG] Not issue TRAP node if naked function (#132147 )

2025-03-20 18:18:03 -07:00

negative-base-reg.ll

…

no-strip.ll

…

null-streamer.ll

…

offset-atomics.ll

…

offset-fastisel.ll

…

offset-folding.ll

…

offset.ll

[WebAssembly] Change half-precision feature name to fp16. (#105434 )

2024-08-22 09:44:33 -07:00

only-data.ll

…

phi.ll

…

pr47375.ll

…

pr51651.ll

…

pr58904.ll

…

pr59625.ll

…

pr59626.ll

…

pr61828.ll

…

pr63817.ll

…

PR40172.ll

…

PR40267.ll

…

PR41149.ll

…

PR41841.ll

…

profile.ll

[Coverage][WebAssembly] Add initial support for WebAssembly/WASI (#111332 )

2024-10-15 02:41:43 +09:00

ref-null.ll

…

ref-type-mem2local.ll

…

reference-types.ll

[WebAssembly] Define call-indirect-overlong and bulk-memory-opt features (#117087 )

2024-12-02 17:08:07 -08:00

reg-argument.mir

…

reg-copy.mir

…

reg-stackify.ll

…

regcoalesce-disable.ll

…

return-address-emscripten.ll

…

return-address-unknown.ll

…

return-int32.ll

…

return-void.ll

…

returned.ll

…

rotate-i3264.ll

…

scmp.ll

…

select.ll

…

signext-arg.ll

…

signext-inreg.ll

…

signext-zeroext-callsite.ll

…

signext-zeroext.ll

…

simd-arith.ll

…

simd-asm-pred.ll

…

simd-bitcasts.ll

…

simd-bitmask-mask.ll

…

simd-bitmask.ll

…

simd-build-pair.ll

…

simd-build-vector.ll

…

simd-comparisons.ll

…

simd-concat.ll

…

simd-conversions.ll

…

simd-extended-extract.ll

…

simd-extending-convert.ll

…

simd-extending.ll

…

simd-extract64.ll

…

simd-illegal-signext.ll

…

simd-intrinsics.ll

[clang][wasm] Replace the target integer sub saturate intrinsics with the equivalent generic __builtin_elementwise_sub_sat intrinsics (#109405 )

2024-09-22 10:12:41 +01:00

simd-load-lane-offset.ll

…

simd-load-promote-wide.ll

…

simd-load-splat.ll

…

simd-load-store-alignment.ll

…

simd-load-zero-offset.ll

…

simd-nested-shuffles.ll

…

simd-offset.ll

…

simd-pr51605.ll

…

simd-pr61780.ll

…

simd-reductions.ll

…

simd-select.ll

…

simd-sext-inreg.ll

…

simd-shift-complex-splats.ll

[SelectionDAG] Scalarize binary ops of splats before legal types (#100749 )

2024-08-15 00:07:00 +08:00

simd-shift-in-loop.ll

[SCEVExpander] Clear flags when reusing GEP (#109293 )

2024-10-01 14:22:54 +02:00

simd-shuffle-bitcast.ll

…

simd-simplify-demanded-vector-elts.ll

…

simd-unsupported.ll

[LegalizeVectorOps] Enable ExpandFABS/COPYSIGN to use integer ops for fixed vectors in some cases. (#109232 )

2024-09-30 11:44:49 -07:00

simd-vecreduce-bool.ll

…

simd-vector-trunc.ll

…

simd.ll

DAG: Fix vector_shuffle -> splat fold defining undef lanes (#123596 )

2025-01-21 23:55:50 +07:00

snan_literal.ll

…

stack-alignment.ll

…

stack-insts.ll

…

stack-protector.ll

…

store-trunc-atomic.ll

…

store-trunc.ll

…

store.ll

…

suboptimal-compare.ll

…

swiftcc.ll

…

switch-in-loop.ll

…

switch-unreachable-default.ll

…

switch.ll

…

table-copy.ll

…

table-fill.ll

…

table-grow.ll

…

table-size.ll

…

table-types.ll

…

tailcall.ll

…

target-features-attrs.ll

[WebAssembly] Define call-indirect-overlong and bulk-memory-opt features (#117087 )

2024-12-02 17:08:07 -08:00

target-features-cpus.ll

[WebAssembly] Support the new "Lime1" CPU (#112035 )

2024-12-03 16:35:23 -08:00

target-features-tls.ll

[WebAssembly] Define call-indirect-overlong and bulk-memory-opt features (#117087 )

2024-12-02 17:08:07 -08:00

thread_pointer.ll

[WebAssembly] Implement %llvm.thread.pointer intrinsic (#117817 )

2024-11-26 17:19:14 -08:00

tls-general-dynamic.ll

…

tls-local-exec.ll

…

ucmp.ll

…

umulo-128-legalisation-lowering.ll

[SelectionDAG] Use the nuw flag when expanding loads. (#119288 )

2024-12-10 06:28:09 -08:00

umulo-i64.ll

…

unreachable.ll

…

unrolled-mem-indices.ll

…

unsupported-function-bitcasts.ll

…

unused-argument.ll

…

userstack.ll

…

vararg-frame.ll

…

varargs.ll

…

vector-reduce.ll

…

vector-sdiv.ll

…

vtable.ll

…

wasm-eh-em-sjlj-error.ll

…

wasm-eh-invalid-personality.ll

…

wasm-eh-prepare.ll

…

wasm-eh-sjlj-setjmp-within-catch.ll

…

weak.ll

…

wide-arithmetic.ll

[WebAssembly] Implement the wide-arithmetic proposal (#111598 )

2024-10-23 11:39:58 -07:00

wide-simd-mul.ll

[WebAssembly] Lower wide SIMD i8 muls (#130785 )

2025-03-21 06:57:57 +00:00

xor_reassociate.ll

…