[PATCH] D99352: [AMDGPU] ds_read_*/ds_write_* operations require strict alignment.
Matt Arsenault via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Tue Mar 30 09:38:10 PDT 2021

arsenm requested changes to this revision.
arsenm added inline comments.
This revision now requires changes to proceed.
================
Comment at: llvm/lib/Target/AMDGPU/SIISelLowering.cpp:1407-1408
     if (Size == 64) {
-      // ds_read/write_b64 require 8-byte alignment, but we can do a 4 byte
-      // aligned, 8 byte access in a single operation using ds_read2/write2_b32
-      // with adjacent offsets.
-      bool AlignedBy4 = Alignment >= Align(4);
+      // ds_read/write_b64 always require 8-byte alignment for performance
+      // reasons.
+      bool AlignedBy8 = Alignment >= Align(8);
----------------
They do not *require* the alignment for performance reasons. The return value should report whether the access works at all, and isFast should report whether we want to prefer it.
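
To illustrate the split being asked for, here is a minimal standalone sketch, not the actual SIISelLowering.cpp code. The names `AccessInfo` and `query64BitLDSAccess` are hypothetical. The 4-byte threshold for "works" follows the comment the patch removes (a 4-byte-aligned, 8-byte access via ds_read2/write2_b32); the 8-byte threshold for "fast" is an assumption made only for the example.

```cpp
// Hypothetical sketch: keep "is the access legal" separate from
// "is the access preferred". Not LLVM code.
struct AccessInfo {
  bool Allowed; // the access can be done in a single operation
  bool Fast;    // the access is one we would want to prefer
};

// Models a 64-bit LDS access for a given alignment in bytes.
inline AccessInfo query64BitLDSAccess(unsigned AlignInBytes) {
  AccessInfo Info;
  // Per the removed comment: 4-byte alignment is enough when using
  // ds_read2/write2_b32 with adjacent offsets.
  Info.Allowed = AlignInBytes >= 4;
  // Assumption for illustration: only the 8-byte aligned case is "fast".
  Info.Fast = AlignInBytes >= 8;
  return Info;
}
```

With this shape, a 4-byte-aligned access is still reported as allowed but not fast, instead of being rejected outright for performance reasons.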
Repository:
  rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D99352/new/
https://reviews.llvm.org/D99352
    
    