[PATCH] D158950: [X86][BF16] Lower FP_EXTEND for vector types under AVX512BF16
Phoebe Wang via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Sun Aug 27 06:46:45 PDT 2023
pengfei added inline comments.
================
Comment at: llvm/test/CodeGen/X86/bfloat.ll:1518
+; SSE2-NEXT: movlhps {{.*#+}} xmm3 = xmm3[0],xmm4[0]
+; SSE2-NEXT: retq
+;
----------------
RKSimon wrote:
> how come this is scalarized given how easy it is to bit-twiddle bfloat to float?
We cannot do vector fptrunc without AVX512BF16, so I deferred to set vXbf16 legal until the feature ready.
I think we may make it asymmetry between fptrunc and fpext. We can do it in a follow up. This is to address #64460 and backport to 17.0 with a low risk.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D158950/new/
https://reviews.llvm.org/D158950
More information about the llvm-commits
mailing list