[PATCH] D20388: AMDGPU: Fix supported CL features
Jan Vesely via llvm-commits
llvm-commits at lists.llvm.org
Tue May 31 11:50:18 PDT 2016
jvesely added inline comments.
================
Comment at: lib/Basic/Targets.cpp:2024-2025
@@ -2023,2 +2023,4 @@
Builder.defineMacro("__HAS_LDEXPF__");
+ if (hasFP64)
+ Builder.defineMacro("__HAS_FP64__");
}
----------------
arsenm wrote:
> I don't think we need this. I want device macros for other tuning and intrinsic availability reasons. Right now there are builtins that only work on some subtargets but no way to test for that
Why not have macro per feature that determines intrinsic/optimization availability?
it was preferred last year [0]
at least on r600 it works nicer than separating EG/EG+FP64/NI_but_eg_isa/NI+FP64+CM_ISA
and the feature selection is done in two places (llvm+clang) instead of every piece of compiled code.
[0]http://lists.llvm.org/pipermail/cfe-commits/Week-of-Mon-20150420/127643.html
Repository:
rL LLVM
http://reviews.llvm.org/D20388
More information about the llvm-commits
mailing list