[PATCH] D123548: AMDGPU: Emit metadata for the hidden_multigrid_sync_arg conditionally

Changpeng Fang via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Mon Apr 11 17:19:10 PDT 2022


cfang marked an inline comment as done.
cfang added inline comments.


================
Comment at: llvm/lib/Target/AMDGPU/AMDGPUAttributor.cpp:545
+      return false;
+    AAPointerInfo::OffsetAndSize OAS(AMDGPU::ImplicitArg::HEAP_PTR_OFFSET, 8);
+    return funcRetrievesImplicitKernelArg(A, OAS);
----------------
b-sumner wrote:
> I expected to see getMultigridSyncArgImplicitArgPosition used here.
Thanks. I was thinking whether we should only limit the change to v5
and not sure how did I end up with the garbage code. Will correct it.

The reason I was not sure for pre-v5 is that printf and hostcall hold the same offset, so it is not possible to use implicitarg_ptr + offset to detect.

To say that, our hostcall_ptr check logic is broken because even though you are doing printf, you may still end up hostcall metadata emission ( for earlier code object versions). 


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D123548/new/

https://reviews.llvm.org/D123548



More information about the llvm-commits mailing list