[PATCH] D155050: [AMDGPU] Wave32 CodeGen for amdgcn.ballot.i64

Pierre van Houtryve via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed Jul 12 01:23:47 PDT 2023


Pierre-vh created this revision.
Pierre-vh added reviewers: AMDGPU, arsenm, foad.
Herald added subscribers: StephenFan, kerbowa, hiraditya, tpr, dstuttard, yaxunl, jvesely, kzhuravl.
Herald added a project: All.
Pierre-vh requested review of this revision.
Herald added subscribers: llvm-commits, wdng.
Herald added a project: LLVM.

A recent addition to the device libs, `__ockl_dm_trim`, caused a series of
failures at O0 due to a i64 ballot intrinsic being inlined into a wave32 function.

The quick fix for this is to support codegen for this rare case.
A proper long-term fix for this type of issue is still being discussed.

Fixes SWDEV-408929, SWDEV-408957, SWDEV-409885, SWDEV-410193


Repository:
  rG LLVM Github Monorepo

https://reviews.llvm.org/D155050

Files:
  llvm/lib/Target/AMDGPU/VOPCInstructions.td
  llvm/test/CodeGen/AMDGPU/llvm.amdgcn.ballot.i64.wave32.ll

-------------- next part --------------
A non-text attachment was scrubbed...
Name: D155050.539423.patch
Type: text/x-patch
Size: 5288 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20230712/483b5d9e/attachment.bin>


More information about the llvm-commits mailing list