[PATCH] D41599: [X86] Lowering X86 avx512 sqrt intrinsics to IR - LLVM
Uriel Korach via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Mon Jan 1 01:17:10 PST 2018
uriel.k added a comment.
In https://reviews.llvm.org/D41599#964739, @RKSimon wrote:
> Won't this mean that explicit calls to the SSE sqrt intrinsics may be converted to the rsqrt+NR estimates in some cases?
Yes, this is expected as that's what we are aiming by lowering the intrinsics to IR code, we want the compiler to make a better decision, to get better performance.
Correct me if miss something special about this intrinsic.
================
Comment at: test/CodeGen/X86/sse-intrinsics-x86.ll:476
}
-declare <4 x float> @llvm.x86.sse.sqrt.ps(<4 x float>) nounwind readnone
+declare void @llvm.x86.sse.stmxcsr(i8*) nounwind
----------------
RKSimon wrote:
> Why did you move this test?
You are right, my mistake. fixed.
================
Comment at: test/CodeGen/X86/sse2-intrinsics-fast-isel.ll:2954
}
-declare <2 x double> @llvm.x86.sse2.sqrt.pd(<2 x double>) nounwind readnone
+declare <2 x double> @llvm.sqrt.v2f32(<2 x double>) nounwind readnone
----------------
RKSimon wrote:
> Shouldn't that be llvm.sqrt.v2f64?
fixed.
https://reviews.llvm.org/D41599
More information about the llvm-commits
mailing list