[PATCH] D152350: [X86][FP16] Do not generate VBROADCAST for fp16

Phoebe Wang via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed Jun 7 03:06:01 PDT 2023


pengfei added inline comments.


================
Comment at: llvm/test/CodeGen/X86/vector-half-conversions.ll:942
+; AVX1-NEXT:    vmovaps %xmm0, {{[-0-9]+}}(%r{{[sb]}}p) # 16-byte Spill
+; AVX1-NEXT:    vbroadcastss 8(%rdi), %xmm0
+; AVX1-NEXT:    callq __extendhfsf2 at PLT
----------------
LuoYuanke wrote:
> vbroadcastss would access 32-bit memory and vpinsrw only access 16-bit memory?
The `vbroadcastss` is an optimization because the high 16-bit is undefined.
`ptr %a0` is a `<8 x i16>` pointer, so we are free to access 32-bit memory without worrying about the size.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D152350/new/

https://reviews.llvm.org/D152350



More information about the llvm-commits mailing list