[PATCH] D123634: [AMDGPU] Split unaligned 4 DWORD DS operations
Stanislav Mekhanoshin via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Tue Apr 12 15:43:37 PDT 2022
rampitec added inline comments.
================
Comment at: llvm/lib/Target/AMDGPU/SIInstrInfo.td:591-595
+def load_align_less_than_4_local : PatFrag<(ops node:$ptr),
+ (load_local node:$ptr)> {
+ let IsLoad = 1;
+ let IsNonExtLoad = 1;
+}
----------------
arsenm wrote:
> Do you really need these PatFrags? The legalizer made the split decisions and the patterns shouldn't need logic deper than it's able to select (which it can regardless of the alignment)
If I am using just load_local/store_local I need to override selection of the ds_read2_b64 for alignments less than 4. Using AddedComplexity does not help, it shall be selected differently for alignment less than 4 and 4.
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D123634/new/
https://reviews.llvm.org/D123634
More information about the llvm-commits
mailing list