[MLIR][AMDGPU] Fix bug in GatherToLDSOpLowering, get the correct MemRefType for destination (#142915)

This PR fixes a bug in GatherToLDSOpLowering, we were getting the MemRefType of source for the destination. Additionally, some related typos are corrected. CC: @krzysz00 @umangyadav @lialan
2025-06-13 18:33:51 +02:00
parent bd33eef7f1
commit 68b6f392ed
3 changed files with 14 additions and 12 deletions
--- a/llvm/docs/AMDGPUUsage.rst
+++ b/llvm/docs/AMDGPUUsage.rst
@@ -1215,12 +1215,12 @@ The AMDGPU backend implements the following LLVM IR intrinsics.
                                                   denormalization mode, enabled traps, and floating point exceptions.
                                                   The format is a 64-bit concatenation of the MODE and TRAPSTS registers.

-  :ref:`llvm.set.fpenv<int_set_fpenv>`             Sets the floating point environment to the specifies state.
+  :ref:`llvm.set.fpenv<int_set_fpenv>`             Sets the floating point environment to the specified state.
  llvm.amdgcn.load.to.lds.p<1/7>                   Loads values from global memory (either in the form of a global
                                                   a raw fat buffer pointer) to LDS. The size of the data copied can be 1, 2,
                                                   or 4 bytes (and gfx950 also allows 12 or 16 bytes). The LDS pointer
                                                   argument should be wavefront-uniform; the global pointer need not be.
-                                                   The LDS pointer is implicitly offset by 4 * lane_id bytes for sies <= 4 bytes
+                                                   The LDS pointer is implicitly offset by 4 * lane_id bytes for size <= 4 bytes
                                                   and 16 * lane_id bytes for larger sizes. This lowers to `global_load_lds`,
                                                   `buffer_load_* ... lds`, or `global_load__* ... lds` depending on address
                                                   space and architecture. `amdgcn.global.load.lds` has the same semantics as