[llvm] [LLVM][CodeGen][SVE] Improve isel for split vector bfloat conversions. (PR #184357)
Graham Hunter via llvm-commits
llvm-commits at lists.llvm.org
Tue Mar 3 07:43:47 PST 2026
================
@@ -85,16 +85,13 @@ define <vscale x 4 x bfloat> @fadd_nxv4bf16(<vscale x 4 x bfloat> %a, <vscale x
define <vscale x 8 x bfloat> @fadd_nxv8bf16(<vscale x 8 x bfloat> %a, <vscale x 8 x bfloat> %b) {
; NOB16B16-LABEL: fadd_nxv8bf16:
; NOB16B16: // %bb.0:
-; NOB16B16-NEXT: uunpkhi z2.s, z1.h
-; NOB16B16-NEXT: uunpkhi z3.s, z0.h
-; NOB16B16-NEXT: uunpklo z1.s, z1.h
-; NOB16B16-NEXT: uunpklo z0.s, z0.h
+; NOB16B16-NEXT: movi v2.2d, #0000000000000000
----------------
huntergr-arm wrote:
Any reason to prefer the NEON instruction over dup/fmov 0? I guess it shouldn't matter too much either way...
https://github.com/llvm/llvm-project/pull/184357