clang-p2996

Files

Florian Hahn a7c6471a85 [Passes] Run vector-combine early with -fenable-matrix.

IR with matrix intrinsics is likely to also contain large vector
operations, which can benefit from early simplifications.

This is the last step in a series of changes to improve code-gen for
code using matrix subscript operators with the C/C++ matrix extension in
Clang, like

    using matrix_t = double __attribute__((matrix_type(15, 15)));

    void foo(unsigned i, matrix_t &A, matrix_t &B) {
      for (unsigned j = 0; j < 4; ++j)
        for (unsigned k = 0; k < i; k++)
          B[k][j] -= A[k][j] * B[i][j];
    }

https://clang.godbolt.org/z/6dKxK1Ed7
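
As a rough illustration of why such code produces large vector operations: the matrix extension stores a matrix as one flat vector in column-major order, so element access on a 15x15 double matrix is expected to become an element extract or insert on a 225-element vector. The sketch below models that layout in portable C++ without the extension. The names flat_matrix_t, flat_index and foo_flat are hypothetical and appear in neither the commit nor Clang.

```cpp
#include <array>
#include <cstddef>

// Hypothetical stand-in for a 15x15 double matrix: one flat array,
// laid out column-major like the matrix extension's storage.
constexpr std::size_t kRows = 15;
constexpr std::size_t kCols = 15;
using flat_matrix_t = std::array<double, kRows * kCols>;

// Column-major flattening: element (row, col) lives at col * kRows + row.
constexpr std::size_t flat_index(std::size_t row, std::size_t col) {
  return col * kRows + row;
}

// Same computation as foo above, written against the flat storage.
// As in the original, the caller must pass i < kRows.
void foo_flat(unsigned i, const flat_matrix_t &A, flat_matrix_t &B) {
  for (unsigned j = 0; j < 4; ++j)
    for (unsigned k = 0; k < i; k++)
      B[flat_index(k, j)] -= A[flat_index(k, j)] * B[flat_index(i, j)];
}
```

Every B[k][j] update in this model reads and writes a single slot of a 225-element container. In the IR, the analogous extract and insert operations on wide vectors are the kind of pattern that early simplification can target.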

Reviewed By: spatel

Differential Revision: https://reviews.llvm.org/D102496

2021-09-22 12:48:32 +01:00

globals-aa-required-for-vectorization.ll

[opt] Remove some legacy PM flags

2021-09-13 15:50:03 -07:00

hoisting-sinking-required-for-vectorization.ll

[NewPM] Remove SpeculateAroundPHIs pass

2021-06-15 20:35:55 +03:00

lit.local.cfg

…

matrix-extract-insert.ll

[Passes] Run vector-combine early with -fenable-matrix.

2021-09-22 12:48:32 +01:00

peel-multiple-unreachable-exits-for-vectorization.ll

[PhaseOrdering] Add test for missed vectorization with vector::at calls.

2021-08-16 09:43:30 +01:00