[all-commits] [llvm/llvm-project] 07c592: Reland "[AMDGPU] Wave32 CodeGen for amdgcn.ballot....

Pierre van Houtryve via All-commits all-commits at lists.llvm.org
Thu Jul 13 06:59:07 PDT 2023


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: 07c59204878045b58ea1fe976ac2aac58d2efbb6
      https://github.com/llvm/llvm-project/commit/07c59204878045b58ea1fe976ac2aac58d2efbb6
  Author: pvanhout <pierre.vanhoutryve at amd.com>
  Date:   2023-07-13 (Thu, 13 Jul 2023)

  Changed paths:
    M llvm/lib/Target/AMDGPU/AMDGPUInstructionSelector.cpp
    M llvm/lib/Target/AMDGPU/VOPCInstructions.td
    A llvm/test/CodeGen/AMDGPU/llvm.amdgcn.ballot.i64.wave32.ll

  Log Message:
  -----------
  Reland "[AMDGPU] Wave32 CodeGen for amdgcn.ballot.i64"

This time without the extra `->dump()`

A recent addition to the device libs, `__ockl_dm_trim`, caused a series of
failures at O0 due to a i64 ballot intrinsic being inlined into a wave32 function.

The quick fix for this is to support codegen for this rare case.
A proper long-term fix for this type of issue is still being discussed.

Fixes SWDEV-408929, SWDEV-408957, SWDEV-409885, SWDEV-410193

Reviewed By: #amdgpu, arsenm

Differential Revision: https://reviews.llvm.org/D155050




More information about the All-commits mailing list