choikwa wrote: > Have you done any other benchmarking on this patch? It seems like it could have a big effect on performance, both good and bad. I ran the ROCmValidation suite but didn't observe significant perf delta. https://github.com/llvm/llvm-project/pull/140674