[libc-commits] [PATCH] D159276: [libc][gpu] Thread divergence fix on volta, WIP

Joseph Huber via Phabricator via libc-commits libc-commits at lists.llvm.org
Thu Aug 31 05:57:58 PDT 2023


jhuber6 added a comment.

We have a single test that opens and closes in a divergent state, it was the original one that caught deadlocks on AMDGPU https://github.com/llvm/llvm-project/blob/main/libc/test/integration/startup/gpu/rpc_test.cpp. This is definitely a good idea on AMDGPU, when I originally tested it we end up exchanging about 7  VGPRs for about 6 SGPRs which is a very good trade-off. I think the best we can do is just maintain the divergence that we know of when we open the RPC interface. That is, when we broadcast the value we should write it to the mask we know of, since that's always a subset of the "true mask" right?


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D159276/new/

https://reviews.llvm.org/D159276



More information about the libc-commits mailing list