[PATCH] D98362: [AMDGPU] Fix -amdgpu-inline-arg-alloca-cost

Thu Mar 11 18:07:10 PST 2021

arsenm added inline comments.

================
Comment at: llvm/lib/Target/AMDGPU/AMDGPUTargetTransformInfo.cpp:1192
   if (AllocaSize)
-    return ArgAllocaCost;
+    return ArgAllocaCost * getInliningThresholdMultiplier();
   return 0;
----------------
rampitec wrote:
> dfukalov wrote:
> > rampitec wrote:
> > > arsenm wrote:
> > > > Should the just adjust for the scale then?
> > > I thought about this, but whenever we will adjust the scale the next time we will have to visit it again.
> > Nit: It seems instead of this modification you can just swap two lines
> > ```
> > 1582:  Threshold *= TTI.getInliningThresholdMultiplier();
> > 1583:  Threshold += TTI.adjustInliningThreshold(&Call);
> > 
> > ```
> > [[ https://reviews.llvm.org/source/llvm-github/browse/main/llvm/lib/Analysis/InlineCost.cpp$1582-1583 | in InlineCost.cpp ]] so we'll stay with just one place of `* getInliningThresholdMultiplier()`.
> That would change behavior for all targets.
I thought the point of the multiplier was to just amplify the expense of calls. I don't understand scaling up the cost here

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D98362/new/

https://reviews.llvm.org/D98362