[PATCH] D157683: [AMDGPU] W/a for gfx940 byte0 fp8 conversion bug

Jay Foad via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Fri Aug 11 02:06:16 PDT 2023


foad accepted this revision.
foad added inline comments.
This revision is now accepted and ready to land.


================
Comment at: llvm/lib/Target/AMDGPU/GCNSubtarget.h:1175
 
+  // \returns true is FP8/BF8 VOP1 form of conversion to F32 in unreliable.
+  bool hasCvtFP8VOP1Bug() const { return true; }
----------------
Typo: this should read "true **if** ... **is** unreliable".


================
Comment at: llvm/lib/Target/AMDGPU/VOP1Instructions.td:589
     (f32 (node i32:$src, index)),
     !if (index,
          (inst_sdwa 0, $src, 0, 0, index),
----------------
This could be simplified: since index is never 0 now, the !if always selects the inst_sdwa branch.


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D157683/new/

https://reviews.llvm.org/D157683


