[libc-commits] [PATCH] D158320: [libc] Initial support for microbenchmarking GPU code
Joseph Huber via Phabricator via libc-commits
libc-commits at lists.llvm.org
Fri Aug 18 15:26:52 PDT 2023
jhuber6 added a comment.
In D158320#4600380 <https://reviews.llvm.org/D158320#4600380>, @JonChesterfield wrote:
> You want memory fences to keep the operations inside the profiled region; the asm won't do that unless it has a memory clobber. Inline asm is likely to mess up codegen too.
I experimented with fences but didn't notice any difference in the results. The noinline and the ordering seem to handle that for me.
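For reference, here is a minimal host-side sketch of the two options discussed above, assuming GCC or Clang inline asm syntax. The names `workload`, `compiler_barrier`, and `time_workload` are hypothetical and not taken from the patch, and `std::chrono` stands in for whatever timer the GPU code actually reads.

```cpp
#include <atomic>
#include <chrono>
#include <cstdint>

// Hypothetical workload. noinline keeps the call as a single unit so it
// cannot be folded into or hoisted out of the timed region.
[[gnu::noinline]] std::uint64_t workload(std::uint64_t n) {
  std::uint64_t sum = 0;
  for (std::uint64_t i = 0; i < n; ++i)
    sum += i * i;
  return sum;
}

// Empty asm with a "memory" clobber: emits no instructions, but the
// compiler may not move memory accesses across it. Without the clobber
// the asm places no constraint on surrounding loads and stores.
inline void compiler_barrier() { asm volatile("" ::: "memory"); }

struct Sample {
  std::uint64_t result;
  std::int64_t nanoseconds;
};

Sample time_workload(std::uint64_t n) {
  compiler_barrier();
  auto start = std::chrono::steady_clock::now();
  compiler_barrier();

  std::uint64_t result = workload(n);

  // Alternative to the asm barrier: a language-level fence, which also
  // orders memory operations at the hardware level.
  std::atomic_thread_fence(std::memory_order_seq_cst);
  auto end = std::chrono::steady_clock::now();
  compiler_barrier();

  auto ns =
      std::chrono::duration_cast<std::chrono::nanoseconds>(end - start);
  return Sample{result, static_cast<std::int64_t>(ns.count())};
}
```

Calling `time_workload(1000)` returns both the computed value and the elapsed time; whether the barriers change the measured numbers depends on the target and optimization level, which is consistent with seeing no difference in some configurations.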
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D158320/new/
https://reviews.llvm.org/D158320