[PATCH] D112733: [AMDGPU] Fix cvt_f32_ubyte combine with shl
Matt Arsenault via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Thu Oct 28 12:55:11 PDT 2021
arsenm added inline comments.
================
Comment at: llvm/test/CodeGen/AMDGPU/cvt_f32_ubyte_vector.ll:4-6
+%Vec = type { [4 x i8] }
+
+define amdgpu_kernel void @cvt_f32_ubyte0_vector() local_unnamed_addr {
----------------
vangthao wrote:
> arsenm wrote:
> > You can just put this in the existing test
> When I put it with the existing test, I am getting:
>
> > LLVM ERROR: Cannot select: t86: ch = store<(store (s8) into i32* undef + 3), trunc to i8> t97, t51, undef:i64, undef:i64
>
> This is coming from the first check with -mcpu=tahiti.
>
Just change the flat pointers to addrspace(1)
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D112733/new/
https://reviews.llvm.org/D112733
More information about the llvm-commits
mailing list