[PATCH] D44210: [AMDGPU] Supported ds_read_b128 generation; Widened vector length for local address-space
Stanislav Mekhanoshin via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Mar 7 14:31:35 PST 2018
rampitec added a comment.
Given that the performance benefit is somewhat unclear, can you put it under an option?
================
Comment at: lib/Target/AMDGPU/SIISelLowering.cpp:5428
+ if (Subtarget->getGeneration() >= SISubtarget::SEA_ISLANDS &&
+ isAligned16(Alignment) && MemVT.getStoreSize() == 16)
+ return SDValue();
----------------
arsenm wrote:
> You don't need the isAligned16 helper. You just need to check that the alignment is >= 16, not % 16
Seconded.
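To illustrate why the comparison is enough: a minimal standalone sketch, assuming the alignment value is always a power of two (as memory operand alignments normally are). Under that assumption `Alignment % 16 == 0` and `Alignment >= 16` give the same answer. The function names below are hypothetical and are not taken from the patch.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for a modulo-based helper such as isAligned16.
constexpr bool isMultipleOf16(uint32_t Alignment) {
  return Alignment % 16 == 0;
}

// The simpler form suggested in the review: a plain comparison.
constexpr bool isAtLeast16(uint32_t Alignment) {
  return Alignment >= 16;
}

// Assumption: alignments are powers of two. Walk every power of two that
// fits in 32 bits and assert that both checks agree on each of them.
inline void checkChecksAgree() {
  for (uint32_t Shift = 0; Shift < 32; ++Shift) {
    uint32_t Alignment = uint32_t(1) << Shift;
    assert(isMultipleOf16(Alignment) == isAtLeast16(Alignment));
  }
}
```

The two checks differ only for values that are not powers of two (for example 24), which should not occur as an alignment here.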
https://reviews.llvm.org/D44210