[all-commits] [llvm/llvm-project] c1852a: [AMDGPU][SIMemoryLegalizer][GFX12] Correctly inser...

Pierre van Houtryve via All-commits all-commits at lists.llvm.org
Mon Oct 20 03:04:09 PDT 2025


  Branch: refs/heads/main
  Home:   https://github.com/llvm/llvm-project
  Commit: c1852afa4b1ebb17f101b9105d36ace700981dac
      https://github.com/llvm/llvm-project/commit/c1852afa4b1ebb17f101b9105d36ace700981dac
  Author: Pierre van Houtryve <pierre.vanhoutryve at amd.com>
  Date:   2025-10-20 (Mon, 20 Oct 2025)

  Changed paths:
    M llvm/lib/Target/AMDGPU/SIMemoryLegalizer.cpp
    M llvm/test/CodeGen/AMDGPU/memory-legalizer-flat-agent.ll
    M llvm/test/CodeGen/AMDGPU/memory-legalizer-flat-cluster.ll
    M llvm/test/CodeGen/AMDGPU/memory-legalizer-flat-system.ll
    M llvm/test/CodeGen/AMDGPU/memory-legalizer-flat-workgroup.ll
    M llvm/test/CodeGen/AMDGPU/memory-legalizer-global-agent.ll
    M llvm/test/CodeGen/AMDGPU/memory-legalizer-global-cluster.ll
    M llvm/test/CodeGen/AMDGPU/memory-legalizer-global-system.ll
    M llvm/test/CodeGen/AMDGPU/memory-legalizer-global-workgroup.ll

  Log Message:
  -----------
  [AMDGPU][SIMemoryLegalizer][GFX12] Correctly insert sample/bvhcnt (#161637)

The check used was not strong enough to prevent the insertion of sample/bvhcnt when they were not needed.
I assume SIInsertWaitCnts was trimming those away anyway, but this was a bug nonetheless.

We were inserting SAMPLE/BVHcnt waits in places where we only needed to wait on the previous atomic operation. Neither of these counter have any atomics associated with them.



To unsubscribe from these emails, change your notification settings at https://github.com/llvm/llvm-project/settings/notifications


More information about the All-commits mailing list