[PATCH] D120265: AMDGPU: Use the implicit kernargs for code object version 5

Matt Arsenault via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Mon Feb 21 17:46:44 PST 2022


arsenm added inline comments.


================
Comment at: clang/lib/CodeGen/CGBuiltin.cpp:16243
+  Call->addRetAttr(
+      Attribute::getWithDereferenceableBytes(Call->getContext(), 64));
+  Call->addRetAttr(Attribute::getWithAlignment(Call->getContext(), Align(4)));
----------------
Isn't the total size 256?


================
Comment at: clang/lib/CodeGen/CGBuiltin.cpp:16244
+      Attribute::getWithDereferenceableBytes(Call->getContext(), 64));
+  Call->addRetAttr(Attribute::getWithAlignment(Call->getContext(), Align(4)));
+  return Call;
----------------
If I remember correctly we require 8 byte alignment (not that it makes much difference)


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D120265/new/

https://reviews.llvm.org/D120265



More information about the llvm-commits mailing list