[PATCH] D55263: [CodeGen][ExpandMemcmp] Add an option for allowing overlapping loads.
    Sanjay Patel via Phabricator via llvm-commits 
    llvm-commits at lists.llvm.org
       
    Fri Dec 14 06:11:41 PST 2018
    
    
  
spatel added inline comments.
================
Comment at: lib/CodeGen/ExpandMemCmp.cpp:166
+  // is larger than the max allowed.
+  if ((NumNonOverlappingLoads + !!(Size > 0)) > MaxNumLoads)
+    return {};
----------------
I'm still not clear on this: if Size is 0, does that imply that computeGreedy failed?
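
To make the question concrete, here is a minimal standalone sketch of the load count that the check above appears to bound. It is not the code from the patch: countLoadsWithOverlap is a hypothetical helper, and it assumes that Size at this point holds the remainder left over after the full-width non-overlapping loads, which the quoted excerpt does not show. Under that assumption, a remainder of 0 only means that no extra overlapping load is needed, not that an earlier step failed.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper for illustration only; it is not part of
// ExpandMemCmp.cpp. Returns the number of loads needed to cover Size bytes
// using loads of MaxLoadSize bytes plus at most one overlapping load for the
// tail, or 0 if that count exceeds MaxNumLoads (i.e. "bail out").
uint64_t countLoadsWithOverlap(uint64_t Size, uint64_t MaxLoadSize,
                               uint64_t MaxNumLoads) {
  assert(MaxLoadSize > 0 && "load size must be non-zero");
  // An overlapping tail load needs at least one full-width load to overlap.
  if (Size < MaxLoadSize)
    return 0;
  const uint64_t NumNonOverlappingLoads = Size / MaxLoadSize;
  // Assumption: this remainder plays the role of Size in the quoted check.
  const uint64_t Remainder = Size - NumNonOverlappingLoads * MaxLoadSize;
  // Same shape as the quoted condition: one extra load only if bytes remain.
  const uint64_t NumLoads = NumNonOverlappingLoads + (Remainder > 0 ? 1 : 0);
  if (NumLoads > MaxNumLoads)
    return 0;
  return NumLoads;
}
```

For example, with MaxLoadSize = 4 and MaxNumLoads = 2, a 7-byte compare needs one full load plus one overlapping load (bytes 0-3 and 3-6), an 8-byte compare needs two full loads and no overlapping one, and a 9-byte compare would need three loads and is rejected.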
================
Comment at: lib/Target/X86/X86TargetTransformInfo.cpp:2902
     Options.LoadSizes.push_back(1);
+    // All GPR loads can be unaligned, and vector loads too starting form SSE2.
+    Options.AllowOverlappingLoads = true;
----------------
form -> from (as suggested previously)
Repository:
  rL LLVM
CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D55263/new/
https://reviews.llvm.org/D55263
    
    