[llvm] [NVPTX] fold movs into loads and stores (PR #144581)
Princeton Ferro via llvm-commits
llvm-commits at lists.llvm.org
Tue Jun 17 18:50:46 PDT 2025
================
@@ -138,9 +138,9 @@ define ptx_kernel void @foo13(ptr noalias readonly %from, ptr %to) {
}
; SM20-LABEL: .visible .entry foo14(
-; SM20: ld.global.v4.b16
+; SM20: ld.global.v2.b32
; SM35-LABEL: .visible .entry foo14(
-; SM35: ld.global.nc.v4.b16
+; SM35: ld.global.nc.v2.b32
----------------
Prince781 wrote:
This is happening because of the changes in `getVectorLoweringShape()` to prefer packing all types maximally first, then allowing these new combiner rules to split them later as needed.
https://github.com/llvm/llvm-project/pull/144581
More information about the llvm-commits
mailing list