[PATCH] D125639: [NVPTX] Enable AtomicExpandPass for NVPTX
Shilei Tian via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Mon May 16 11:59:33 PDT 2022
tianshilei1992 added inline comments.
================
Comment at: llvm/lib/Target/NVPTX/NVPTXISelLowering.cpp:5159
+ case 64:
+ return AtomicExpansionKind::None;
+ default:
----------------
tra wrote:
> According to https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#parallel-synchronization-and-communication-instructions-atom
>
> `64-bit atom.{and,or,xor,min,max} require sm_32 or higher.`
>
> This must be conditional on the GPU variant we're compiling for, similar to how we handle f64 above.
Thanks for pointing it out. Will do.
I got a question actually. What's the minimum SM version do we currently support? I did find from one of your patch (https://reviews.llvm.org/D24943) that it had the function `hasAtomAddF32`, but now I can't find it. Was it removed because we support a minimum SM version, like SM20?
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D125639/new/
https://reviews.llvm.org/D125639
More information about the llvm-commits
mailing list