[clang] [compiler-rt] [llvm] [openmp] [PGO][Offload] Add GPU profiling flags to driver (PR #94268)
Joseph Huber via cfe-commits
cfe-commits at lists.llvm.org
Mon Jun 24 03:59:18 PDT 2024
jhuber6 wrote:
> @jhuber6 The clang format errors are mostly due to my local version of `clang-format` disagreeing with the buildbot's version. Its a bit annoying, but it shouldn't be too much of a problem given I plan on squashing and merging once this gets approved.
>
> I added new flags for GPU PGO specifically because I didn't want to modify the PGO flags' existing behavior. PGO has a significant runtime cost, so I figured it would be best for the end user experience to only enable PGO on the GPU when it was specifically requested.
Is this something that specifically requires its own flag? Or could we just do `-Xarch_device -fprofile-generate`.
https://github.com/llvm/llvm-project/pull/94268
More information about the cfe-commits
mailing list