[PATCH] D44210: [AMDGPU] Supported ds_read_b128 generation; Widened vector length for local address-space
Farhana Aleen via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Fri Mar 9 08:42:54 PST 2018
FarhanaAleen added inline comments.
================
Comment at: lib/Target/AMDGPU/SIISelLowering.cpp:5434
+ // Use ds_read_b128 if possible.
+ if (Subtarget->useDS128(EnableDS128) && Load->getAlignment() >= 16 &&
+ MemVT.getStoreSize() == 16)
----------------
rampitec wrote:
> You only have pattern for v4i32, but enable operation for all 128 bit. Will it work with v8i16 for example?
Yes, it works for i16/i8.
During dag combine, AMDGPU loadCombiner combines vector types of 8/16/64 to vector types of 32 bit type.
https://reviews.llvm.org/D44210
More information about the llvm-commits
mailing list