[clang] [llvm] [X86][AMX-AVX512] Add *i intrinsics for immediate variants (PR #173545)

Feng Zou via llvm-commits llvm-commits at lists.llvm.org
Tue Dec 30 22:11:26 PST 2025


================
@@ -126,6 +197,43 @@
 #define _tile_cvtrowps2bf16l(tsrc, row)                                        \
   __builtin_ia32_tcvtrowps2bf16l(tsrc, row)
 
+/// Moves a row from a tile register to a zmm destination register, converting
+///    the fp32 source elements to bf16. It places the resulting bf16 elements
+///    in the low 16 bits within each dword. The row of the tile is selected
+///    by a 8b immediate value.
+///
+/// \headerfile <x86intrin.h>
+///
+/// \code
+/// __m512i _tile_cvtrowps2bf16li(__tile tsrc, const unsigned int imm8);
+/// \endcode
+///
+/// \code{.operation}
+/// VL := 512
+/// VL_bytes := VL >> 3
+/// row_index := imm8 & 0x3ff
----------------
fzou1 wrote:

Same as above.

https://github.com/llvm/llvm-project/pull/173545

