[llvm] [NVPTX] Use v2.u64 to load/store 128-bit values (PR #136638)
Artem Belevich via llvm-commits
llvm-commits at lists.llvm.org
Tue Apr 22 11:25:26 PDT 2025
================
@@ -35,11 +35,10 @@ define void @test_b128_input_from_load(ptr nocapture readonly %data) {
; CHECK-NEXT: // %bb.0:
; CHECK-NEXT: ld.param.u64 %rd2, [test_b128_input_from_load_param_0];
; CHECK-NEXT: cvta.to.global.u64 %rd3, %rd2;
-; CHECK-NEXT: ld.global.u64 %rd4, [%rd3+8];
-; CHECK-NEXT: ld.global.u64 %rd5, [%rd3];
-; CHECK-NEXT: mov.b128 %rq1, {%rd5, %rd4};
+; CHECK-NEXT: ld.global.v2.u64 {%rd4, %rd5}, [%rd3];
----------------
Artem-B wrote:
^^^ This is where we could've loaded .b128 directly. Probably makes no difference on the SASS, though, just nicer looking PTX.
https://github.com/llvm/llvm-project/pull/136638
More information about the llvm-commits
mailing list