[PATCH] D147145: [AMDGPU] Consistently use v2i16 for packed bf16 inputs. NFC.
Jay Foad via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Mar 29 05:16:14 PDT 2023
foad added inline comments.
================
Comment at: llvm/lib/Target/AMDGPU/FLATInstructions.td:735
defm FLAT_ATOMIC_PK_ADD_F16 : FLAT_Atomic_Pseudo<"flat_atomic_pk_add_f16", VGPR_32, v2f16>;
- defm FLAT_ATOMIC_PK_ADD_BF16 : FLAT_Atomic_Pseudo<"flat_atomic_pk_add_bf16", VGPR_32, v2f16>;
+ let FPAtomic = 1 in
+ defm FLAT_ATOMIC_PK_ADD_BF16 : FLAT_Atomic_Pseudo<"flat_atomic_pk_add_bf16", VGPR_32, v2i16>;
----------------
FPAtomic is usually deduced from the type, so I override it here to ensure NFC.
//However// it is only used by the hazard detection code for "FPAtomicToDenormModeHazard" which is GFX10-only, and GFX10 does not have these instructions, so perhaps I should not bother to override FPAtomic here?
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D147145/new/
https://reviews.llvm.org/D147145
More information about the llvm-commits
mailing list