[PATCH] D158059: [AMDGPU/wmma] - Disable 3-address syntax for f16
Jessica Del via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Fri Sep 8 08:19:37 PDT 2023
OutOfCache added a comment.
I'd be happy to change the approach, but I can't think of a better way to preserve the old behavior while also guaranteeing the correct initialization of register values.
I assume the current behavior of `wmma` (only writing to one half of the register while leaving the other half untouched) is correct, or should it copy the content of the input accumulator into the other half?
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D158059/new/
https://reviews.llvm.org/D158059
More information about the llvm-commits
mailing list