clang-p2996/llvm/test/CodeGen at a2f156b84ab124ccfbbe2bd6cbbdb2f3bcbba0ce - clang-p2996 - Caio's Gitea

caio/clang-p2996

Files

History

Durgadoss R c507a0830d [NVPTX] Add TMA Bulk Copy Intrinsics (#138679 )

This patch adds a new variant of TMA Bulk Copy
intrinsics introduced in sm100+. This variant
has an additional byte_mask to select the bytes
for the copy operation.

* Selection is all done through table-gen now.
  So, this patch removes the corresponding
  SelectCpAsyncBulkS2G() function.
* lit tests are verified with a cuda-12.8 ptxas
  executable.

PTX Spec link:

https://docs.nvidia.com/cuda/parallel-thread-execution/#data-movement-and-conversion-instructions-bulk-copy

Signed-off-by: Durgadoss R <durgadossr@nvidia.com>

2025-05-15 16:08:01 +05:30

..

[AArch64] Use vecshiftL64 instead of vecshiftR64 to match scalar SLI imm. (#139904 )

2025-05-15 09:36:06 +01:00

[AMDGPU] Add flag to prevent reruns of LowerModuleLDS (#129520 )

2025-05-15 09:54:21 +02:00

…

CodeGen: Add ISD::AssertNoFPClass (#138839 )

2025-05-15 16:05:15 +08:00

…

…

…

[DirectX] Set shader feature flags MinimumPrecision and NativeLowPrecision, and refactor the logic for setting low-precision-related flags (#139623 )

2025-05-14 10:37:27 -07:00

[LLVM][VecLib] Refactor LIBMVEC integration to be target neutral. (#138262 )

2025-05-07 11:05:25 +01:00

Hexagon: sfmax/sfmin instructions are IEEE754-2019 (#139056 )

2025-05-14 11:55:11 +08:00

…

…

LoongArch: Set FMAXNUM and FMINNUM as Legal (#139010 )

2025-05-09 16:26:08 +08:00

…

CodeGen: Add ISD::AssertNoFPClass (#138839 )

2025-05-15 16:05:15 +08:00

[NVPTX] Vectorize and lower 256-bit global loads/stores for sm_100+/ptx88+ (#139292 )

2025-05-13 13:36:09 -07:00

…

…

[NVPTX] Add TMA Bulk Copy Intrinsics (#138679 )

2025-05-15 16:08:01 +05:30

[PowerPC] catch v2i64 shift left by 1 is add case (#138772 )

2025-05-13 11:26:46 -04:00

[RISCV][MC] Add support for Q extension (#139369 )

2025-05-15 10:51:06 +08:00

…

[SPIR-V] Fix LIT tests, improve ICmpInst's type inference (#139726 )

2025-05-15 10:46:54 +02:00

[MCP] Disable BackwardCopyPropagateBlock for copies with implicit registers. (#137687 )

2025-05-08 16:27:08 -07:00

…

…

…

[WebAssembly] Fix trunc in FastISel (#138479 )

2025-05-06 14:16:35 -07:00

…

[WinEH] Fix asm in catchpad being turned into unreachable (#138392 )

2025-05-08 21:46:51 +02:00

[X86] avgfloors.ll - regenerate test checks for TERNLOG comments

2025-05-15 10:13:50 +01:00

…

…