[PATCH] D145401: [AMDGPU] Reserve extra SGPR blocks with XNACK "any" TID Setting

Austin Kerbow via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Tue Mar 7 17:11:11 PST 2023


kerbowa added inline comments.


================
Comment at: llvm/test/CodeGen/AMDGPU/tid-kd-xnack-any.ll:3
+; RUN: llc -mtriple=amdgcn-amd-amdhsa -mcpu=gfx90a < %s | FileCheck --check-prefixes=ASM %s
+; RUN: llc -mtriple=amdgcn-amd-amdhsa -mcpu=gfx90a --filetype=obj < %s | llvm-objdump -s -j .rodata - | FileCheck --check-prefixes=OBJ %s
+
----------------
kzhuravl wrote:
> kerbowa wrote:
> > foad wrote:
> > > Could you pipe the binary into `llvm-readelf --notes -` instead of `llvm-objdump`, and then match text instead of hex dumps?
> > The notes section contains metadata that may not match the kernel descriptor (KD) in .rodata. In a failed test, the metadata was the same in both passing and failing cases, but the KD was different. I can add an extra runline though.
> What was the difference in the kernel descriptor between the passing and failing cases?
The difference was the granulated SGPR field: it was 1 in the failing case and 2 in the passing case. I can't really replicate that here, since the only reason it passed with save-temps is that the ASM parser correctly checked the dynamic target ID setting and encoded the correct kernel descriptor. We already have MC tests for that. The metadata, and what the ASM printer is doing, have always been incorrect.
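
To illustrate how a few reserved registers can move that field from 1 to 2, here is a minimal sketch. It assumes the field encodes the number of 8-register blocks minus one; the granule of 8, the register counts, and the helper name are illustrative assumptions, not values taken from this test or from the patch.

```python
def granulated_sgpr_count(num_sgprs, granule=8):
    """Encode an SGPR count as (number of granule-sized blocks) - 1.

    The granule of 8 is an assumption made for this illustration.
    """
    num_sgprs = max(1, num_sgprs)      # at least one block is always encoded
    blocks = -(-num_sgprs // granule)  # ceiling division
    return blocks - 1


# Hypothetical numbers, chosen only to reproduce the 1-vs-2 difference.
used_sgprs = 14   # SGPRs the kernel itself uses (assumed)
extra_sgprs = 4   # extra SGPRs reserved when XNACK may be enabled (assumed)

without_extra = granulated_sgpr_count(used_sgprs)
with_extra = granulated_sgpr_count(used_sgprs + extra_sgprs)
print(without_extra, with_extra)
```

With these assumed numbers the unreserved count falls into the second block and the reserved count spills into a third, which is the kind of off-by-one-block mismatch described above.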


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D145401/new/

https://reviews.llvm.org/D145401


