[all-commits] [llvm/llvm-project] cfa2d0: [AMDGPU] Wave32 CodeGen for amdgcn.ballot.i64
Pierre van Houtryve via All-commits
all-commits at lists.llvm.org
Thu Jul 13 06:21:13 PDT 2023
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: cfa2d0a3aa0beb5422107dc9943cb0eae6d93896
https://github.com/llvm/llvm-project/commit/cfa2d0a3aa0beb5422107dc9943cb0eae6d93896
Author: pvanhout <pierre.vanhoutryve at amd.com>
Date: 2023-07-13 (Thu, 13 Jul 2023)
Changed paths:
M llvm/lib/Target/AMDGPU/AMDGPUInstructionSelector.cpp
M llvm/lib/Target/AMDGPU/VOPCInstructions.td
A llvm/test/CodeGen/AMDGPU/llvm.amdgcn.ballot.i64.wave32.ll
Log Message:
-----------
[AMDGPU] Wave32 CodeGen for amdgcn.ballot.i64
A recent addition to the device libs, `__ockl_dm_trim`, caused a series of
failures at O0 due to a i64 ballot intrinsic being inlined into a wave32 function.
The quick fix for this is to support codegen for this rare case.
A proper long-term fix for this type of issue is still being discussed.
Fixes SWDEV-408929, SWDEV-408957, SWDEV-409885, SWDEV-410193
Reviewed By: #amdgpu, arsenm
Differential Revision: https://reviews.llvm.org/D155050
More information about the All-commits
mailing list