[PATCH] D80032: [AMDGPU] Always expand ext/insertelement with divergent idx
Stanislav Mekhanoshin via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed May 20 12:02:52 PDT 2020
rampitec added inline comments.
================
Comment at: llvm/lib/Target/AMDGPU/SIISelLowering.cpp:9549
// instructions.
- if (VecSize <= 256 && (VecSize > 64 || EltSize >= 32) &&
+ // Always do this if var-idx is divergent, otherwise it will become a loop.
+ if (ExpandDynVectors &&
----------------
arsenm wrote:
> GlobalISel needs the compare and select path implemented
Yes, although that is a separate issue. GlobalISel also needs to handle non-power-of-two vectors for movrel. Another piece of work is tuning the limits, which seem to be suboptimal, at least for doubles.
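
For readers unfamiliar with the "compare and select path" mentioned above, here is a minimal scalar sketch of the idea, not the code from this patch. The names extractBySelect and insertBySelect are hypothetical. Each lane's dynamic index is compared against every constant position and the result is chosen with a select, so no loop over distinct index values is needed when the index is divergent. The result for an out-of-range index is an arbitrary choice in this model.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>

// Hypothetical model of expanding a dynamic extractelement into a chain of
// compares and selects. The loop has a constant trip count, so it stands in
// for a fully unrolled sequence of N-1 compare+select pairs.
template <typename T, std::size_t N>
T extractBySelect(const std::array<T, N> &Vec, std::uint32_t Idx) {
  T Result = Vec[0]; // assumption: start from element 0
  for (std::size_t I = 1; I < N; ++I)
    Result = (Idx == I) ? Vec[I] : Result; // one compare + one select
  return Result;
}

// Hypothetical model of expanding a dynamic insertelement: every element is
// rewritten with a select between the new value and its old value.
template <typename T, std::size_t N>
std::array<T, N> insertBySelect(std::array<T, N> Vec, T Val,
                                std::uint32_t Idx) {
  for (std::size_t I = 0; I < N; ++I)
    Vec[I] = (Idx == I) ? Val : Vec[I]; // one compare + one select
  return Vec;
}
```

The cost grows linearly with the number of elements, which is why the size limits for a uniform index matter and may need tuning.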
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D80032/new/
https://reviews.llvm.org/D80032