justinfargnoli wrote: Note: I've only tested this PR with the `ptxas` from public CUDA 13.0 and with an internal top-of-tree (ToT) `ptxas`. I'm trying to see whether I can trigger a build with the public build bot. https://github.com/llvm/llvm-project/pull/154439