[PATCH] D143731: [AMDGPU] Scalarize some large PHIs for DAGISel

Fri Feb 10 05:23:13 PST 2023

Pierre-vh created this revision.
Pierre-vh added reviewers: foad, arsenm.
Herald added subscribers: kosarev, StephenFan, kerbowa, hiraditya, tpr, dstuttard, yaxunl, jvesely, kzhuravl.
Herald added a project: All.
Pierre-vh requested review of this revision.
Herald added subscribers: llvm-commits, wdng.
Herald added a project: LLVM.

DAGISel uses CopyToReg/CopyFromReg to lower PHI nodes. With large, non 2^n sized PHIs this can result in poor codegen.
This is because it introduces a need to have a build_vector before copying the PHI value, and that build_vector may have many undef elements. This can cause very high register pressure and abnormal stack usage in some cases.

This scalarization can be easily tuned/disabled through CL options in case it's not beneficial for some users. It's also only enabled for DAGIsel and GlobalISel handles PHIs much better (as it works on the whole function).

Fixes SWDEV-321581

Repository:
  rG LLVM Github Monorepo

https://reviews.llvm.org/D143731

Files:
  llvm/lib/Target/AMDGPU/AMDGPUCodeGenPrepare.cpp
  llvm/test/CodeGen/AMDGPU/amdgpu-codegenprepare-scalarize-large-phis.ll
  llvm/test/CodeGen/AMDGPU/tuple-allocation-failure.ll

-------------- next part --------------
A non-text attachment was scrubbed...
Name: D143731.496438.patch
Type: text/x-patch
Size: 100220 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20230210/d40f2640/attachment-0001.bin>