[PATCH] D80315: Fix CC1 command line options mapping into fast-math flags.
Michele Scandale via Phabricator via cfe-commits
cfe-commits at lists.llvm.org
Wed May 20 10:54:53 PDT 2020
michele.scandale marked 2 inline comments as done.
michele.scandale added inline comments.
================
Comment at: clang/test/CodeGen/libcalls.c:11-21
+ // CHECK-FAST: call reassoc nsz arcp afn float @llvm.sqrt.f32(float
float l0 = sqrtf(a0);
// CHECK-YES: call double @sqrt
// CHECK-NO: call double @llvm.sqrt.f64(double
- // CHECK-FAST: call double @llvm.sqrt.f64(double
+ // CHECK-FAST: call reassoc nsz arcp afn double @llvm.sqrt.f64(double
double l1 = sqrt(a1);
----------------
For CUDA the default FP contract mode is `fast`, therefore the `contract` FMF is emitted.
================
Comment at: clang/test/CodeGenCUDA/builtins-amdgcn.cu:13
// CHECK-LABEL: @_Z12test_ds_fmaxf(
-// CHECK: call float @llvm.amdgcn.ds.fmax(float addrspace(3)* @_ZZ12test_ds_fmaxfE6shared, float %{{[^,]*}}, i32 0, i32 0, i1 false)
+// CHECK: call contract float @llvm.amdgcn.ds.fmax(float addrspace(3)* @_ZZ12test_ds_fmaxfE6shared, float %{{[^,]*}}, i32 0, i32 0, i1 false)
__global__
----------------
For CUDA the default FP contract mode is `fast`, therefore the `contract` FMF is emitted.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D80315/new/
https://reviews.llvm.org/D80315
More information about the cfe-commits
mailing list