[libc-commits] [PATCH] D158320: [libc] Initial support for microbenchmarking GPU code

Joseph Huber via Phabricator via libc-commits libc-commits at lists.llvm.org
Fri Aug 18 15:26:52 PDT 2023


jhuber6 added a comment.

In D158320#4600380 <https://reviews.llvm.org/D158320#4600380>, @JonChesterfield wrote:

> You want memory fences to keep the operations inside the profiled region, the asm won't do that unless it has a memory clobber. Inline asm is likely to mess up codegen too.

I experimented with fences but didn't notice any difference in the results. The noinline attribute and the ordering seem to handle that for me.
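
For reference, the distinction under discussion can be illustrated with a minimal host-side sketch. This is not the code from the patch: the names `workload`, `Sample`, and `time_workload` are hypothetical, the timer is `std::chrono::steady_clock` rather than a GPU clock, and the inline asm uses the GCC/Clang extended syntax. An empty asm statement with a "memory" clobber only stops the compiler from moving memory accesses across it, while `std::atomic_thread_fence` additionally emits a hardware fence where the target needs one. The register constraints tie the input and the result to the asm statements so the call cannot drift out of the timed region.

```cpp
#include <atomic>
#include <chrono>
#include <cstdint>

// Hypothetical workload: sum of squares of 0..n-1. Marked noinline so the
// call stays a distinct unit instead of being folded into the timing code.
__attribute__((noinline)) std::uint64_t workload(std::uint64_t n) {
  std::uint64_t sum = 0;
  for (std::uint64_t i = 0; i < n; ++i)
    sum += i * i;
  return sum;
}

// Hypothetical result type for one timed run.
struct Sample {
  std::uint64_t result;
  std::chrono::nanoseconds elapsed;
};

Sample time_workload(std::uint64_t n) {
  // Memory fence: orders earlier memory operations before the timed region.
  std::atomic_thread_fence(std::memory_order_seq_cst);
  auto start = std::chrono::steady_clock::now();

  // Compiler barrier with a "memory" clobber. The "+r" constraint makes n
  // opaque, so the workload depends on this statement and cannot start
  // before it.
  asm volatile("" : "+r"(n) : : "memory");

  std::uint64_t result = workload(n);

  // Consume the result so the computation must finish before the stop
  // timestamp is taken.
  asm volatile("" : : "r"(result) : "memory");

  auto stop = std::chrono::steady_clock::now();
  std::atomic_thread_fence(std::memory_order_seq_cst);

  return {result,
          std::chrono::duration_cast<std::chrono::nanoseconds>(stop - start)};
}
```

Calling `time_workload(10)` returns the sum of squares together with the elapsed time. Whether the fences change the measurement depends on the target and on what the workload touches, which is consistent with not seeing a difference in some setups.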


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D158320/new/

https://reviews.llvm.org/D158320


