[PATCH] D98081: [AMDGPU] Improve Codegen for build_vector
Matt Arsenault via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Fri Mar 5 15:09:07 PST 2021
arsenm added inline comments.
================
Comment at: llvm/lib/Target/AMDGPU/SIInstructions.td:2400-2402
+ (v2f16 (build_vector (f16 (bitconvert (i16 (trunc VGPR_32:$src0)))),
+ (f16 (bitconvert (i16 (trunc VGPR_32:$src1)))))),
+ (V_PACK_B32_F16_e64 SRCMODS.NONE, VGPR_32:$src0, SRCMODS.NONE, VGPR_32:$src1)
----------------
This isn't simple bit packing: V_PACK_B32_F16 has FP output effects, such as denormal flushing, so it can change the bits of its inputs.
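
To illustrate the concern, here is a minimal C++ sketch contrasting a pure bit pack with a pack that treats each half as an f16 value and flushes denormals. The function names (packBits, flushDenormF16, packF16Flushing) are hypothetical, and the flushing model is an assumption chosen for illustration, not the documented behavior of the instruction; whether the hardware actually flushes depends on the FP mode in effect.

```cpp
#include <cstdint>

// Plain bit packing: what a build_vector of two truncated i16 values means.
// The low 16 bits of each source pass through unchanged.
constexpr uint32_t packBits(uint32_t src0, uint32_t src1) {
  return (src0 & 0xFFFFu) | ((src1 & 0xFFFFu) << 16);
}

// Assumed model of one FP output effect: an f16 denormal (exponent field 0,
// nonzero mantissa) is flushed to a zero of the same sign.
constexpr uint16_t flushDenormF16(uint16_t h) {
  const bool isDenormal = (h & 0x7C00u) == 0 && (h & 0x03FFu) != 0;
  return isDenormal ? static_cast<uint16_t>(h & 0x8000u) : h;
}

// Hypothetical model of a pack that interprets its inputs as f16 values.
constexpr uint32_t packF16Flushing(uint32_t src0, uint32_t src1) {
  const uint32_t lo = flushDenormF16(static_cast<uint16_t>(src0 & 0xFFFFu));
  const uint32_t hi = flushDenormF16(static_cast<uint16_t>(src1 & 0xFFFFu));
  return lo | (hi << 16);
}
```

With src0 = 0x0001 (the smallest positive f16 denormal) and src1 = 0x3C00 (1.0), the two functions return different words, which is why the pattern is not a safe replacement for an integer-style build_vector under this model.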
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D98081/new/
https://reviews.llvm.org/D98081