[PATCH] D112718: Add intrinsics and builtins for PTX atomics with semantic orders
Artem Belevich via Phabricator via cfe-commits
cfe-commits at lists.llvm.org
Thu Jan 6 14:16:50 PST 2022
tra added inline comments.
================
Comment at: clang/include/clang/Basic/BuiltinsNVPTX.def:1057
+
+BUILTIN(__nvvm_atom_xchg_global_i, "iiD*i", "n")
+TARGET_BUILTIN(__nvvm_atom_cta_xchg_global_i, "iiD*i", "n", SM_60)
----------------
We need to figure out how address-space-specific builtins are supposed to work.
Right now two competing approaches.
This patch declares builtins with generic pointer as an argument, but, according to the test, expects to be used with the AS-specific pointer.
It probably does not catch a wrong-AS pointer passed as an argument, either.
It does happen to work, but I think it's mostly due to the fact that LLVM intrinsics are overloaded and we happen to end up addrspacecast'ing things correctly if the builtin gets the right kind of pointer.
The other approach is to declare the pointer with the expected AS. E.g:
> TARGET_BUILTIN(__nvvm_mbarrier_init_shared, "vWi*3i", "", AND(SM_80,PTX70))
IMO, this is the correct way to do it, but it is also rather inconvenient to use from CUDA code as it operates on generic pointers.
@jdoerfert - WDYT?
================
Comment at: clang/test/CodeGen/builtins-nvptx.c:557
+ // expected-error at +1 {{'__nvvm_atom_acquire_add_global_i' needs target feature sm_70}}
+ __nvvm_atom_acquire_add_global_i((__attribute__((address_space(1))) int *)ip, i);
+
----------------
What happens if I pass a wrong pointer kind? E.g. a generic or shared pointer?
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D112718/new/
https://reviews.llvm.org/D112718
More information about the cfe-commits
mailing list