[PATCH] D81638: [AMDGPU][GlobalISel] Fix 96 and 128 local loads and stores

Matt Arsenault via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Mon Aug 10 12:13:01 PDT 2020


arsenm added inline comments.


================
Comment at: llvm/lib/Target/AMDGPU/AMDGPUInstructions.td:507
+def store_align4_local: PatFrag<(ops node:$val, node:$ptr),
+                                (store_local node:$val, node:$ptr)>, Aligned<4> {
+  let IsStore = 1;
----------------
mbrkusanin wrote:
> arsenm wrote:
> > I think the Aligned<> subclasses didn't actually work for some reason, but I only half fixed the patterns maybe?
> I changed it so now Aligned<> subclasses are used for both load and store. They seem to work fine.
Did you double check the generated matcher table? The problem was the current emitter only checks one of these predicates at a time, so it successfully imports but then doesn't actually perform the check. It's silently ignored


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D81638/new/

https://reviews.llvm.org/D81638



More information about the llvm-commits mailing list