[Mlir-commits] [mlir] 9a3ece2 - [mlir][sparse] Fix the calling convention of __truncsfbf2 on windows x64

Tue Mar 19 05:49:59 PDT 2024

Author: Benjamin Kramer
Date: 2024-03-19T13:48:10+01:00
New Revision: 9a3ece232ced907f87ed0d5846a57f6d9cbc9832

URL: https://github.com/llvm/llvm-project/commit/9a3ece232ced907f87ed0d5846a57f6d9cbc9832
DIFF: https://github.com/llvm/llvm-project/commit/9a3ece232ced907f87ed0d5846a57f6d9cbc9832.diff

LOG: [mlir][sparse] Fix the calling convention of __truncsfbf2 on windows x64

Like the x86_64 psABI, the Windows x64 calling convention expects the value to be returned in XMM0.
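
As a rough illustration of the idea (not the actual contents of Float16bits.cpp), the sketch below picks a return type based on the same preprocessor check and copies the 16 bfloat16 bits into it, so that on x86-64 targets the value travels in an SSE register. The names Bf16AbiType, floatToBf16Bits, truncToBf16Sketch and bf16BitsOf are hypothetical, the rounding helper is a simplified round-to-nearest-even, and a little-endian target is assumed.

```cpp
#include <cstdint>
#include <cstring>

// Hypothetical alias for illustration only. On x86-64 (GCC/Clang define
// __x86_64__, MSVC defines _M_X64) the result is declared as float so it is
// returned in XMM0; elsewhere a plain 16-bit integer is used.
#if defined(__x86_64__) || defined(_M_X64)
using Bf16AbiType = float;
#else
using Bf16AbiType = std::uint16_t;
#endif

// Simplified float -> bfloat16 conversion with round-to-nearest-even.
// NaNs are mapped to a quiet NaN; this is an assumption of the sketch.
static std::uint16_t floatToBf16Bits(float f) {
  std::uint32_t bits;
  std::memcpy(&bits, &f, sizeof(bits));
  if ((bits & 0x7fffffffu) > 0x7f800000u)
    return static_cast<std::uint16_t>((bits >> 16) | 0x0040u);
  std::uint32_t roundingBias = 0x7fffu + ((bits >> 16) & 1u);
  return static_cast<std::uint16_t>((bits + roundingBias) >> 16);
}

// Places the 16 result bits in the low bytes of the ABI type without any
// numeric conversion (assumes little-endian when the ABI type is float).
Bf16AbiType truncToBf16Sketch(float f) {
  std::uint16_t bf = floatToBf16Bits(f);
  Bf16AbiType ret = 0;
  std::memcpy(&ret, &bf, sizeof(bf));
  return ret;
}

// Recovers the 16 bfloat16 bits from the ABI type, again by bit copy.
std::uint16_t bf16BitsOf(Bf16AbiType v) {
  std::uint16_t bits;
  std::memcpy(&bits, &v, sizeof(bits));
  return bits;
}
```

The bit copies via std::memcpy are deliberate: a numeric cast between float and uint16_t would change the bit pattern instead of preserving it.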

Added: 
    

Modified: 
    mlir/lib/ExecutionEngine/Float16bits.cpp

Removed: 
    


################################################################################
diff --git a/mlir/lib/ExecutionEngine/Float16bits.cpp b/mlir/lib/ExecutionEngine/Float16bits.cpp
index 841610e3c161dc..e5b4f18dd644b8 100644
--- a/mlir/lib/ExecutionEngine/Float16bits.cpp
+++ b/mlir/lib/ExecutionEngine/Float16bits.cpp
@@ -165,7 +165,7 @@ bool operator==(const bf16 &f1, const bf16 &f2) { return f1.bits == f2.bits; }
 #endif
 #endif
 
-#if defined(__x86_64__)
+#if defined(__x86_64__) || defined(_M_X64)
 // On x86 bfloat16 is passed in SSE registers. Since both float and __bf16
 // are passed in the same register we can use the wider type and careful casting
 // to conform to x86_64 psABI. This only works with the assumption that we're