[llvm-branch-commits] [mlir] fbd2950 - [mlir][sparse] Refine f695554a2a55 wording a bit and fix the double conversion I broke with aggressive copy&paste

Thu Aug 11 23:40:51 PDT 2022

Author: Benjamin Kramer
Date: 2022-08-12T08:39:37+02:00
New Revision: fbd2950d8d0dd0ad953b439374237497b47d2ecf

URL: https://github.com/llvm/llvm-project/commit/fbd2950d8d0dd0ad953b439374237497b47d2ecf
DIFF: https://github.com/llvm/llvm-project/commit/fbd2950d8d0dd0ad953b439374237497b47d2ecf.diff

LOG: [mlir][sparse] Refine f695554a2a55 wording a bit and fix the double conversion I broke with aggressive copy&paste

(cherry picked from commit 726719e970abb6d6fe014ed96f5ca838807dd840)

Added: 
    

Modified: 
    mlir/lib/ExecutionEngine/Float16bits.cpp

Removed: 
    


################################################################################
diff  --git a/mlir/lib/ExecutionEngine/Float16bits.cpp b/mlir/lib/ExecutionEngine/Float16bits.cpp
index 5c3fb61cea2e..dd158d7a258c 100644

--- a/mlir/lib/ExecutionEngine/Float16bits.cpp
+++ b/mlir/lib/ExecutionEngine/Float16bits.cpp
@@ -159,10 +159,12 @@ std::ostream &operator<<(std::ostream &os, const bf16 &d) {
 #endif
 
 #if defined(__x86_64__)
-// On x86 bfloat16 is passed in SSE2 registers. Since both float and _Float16
+// On x86 bfloat16 is passed in SSE registers. Since both float and __bf16
 // are passed in the same register we can use the wider type and careful casting
 // to conform to x86_64 psABI. This only works with the assumption that we're
 // dealing with little-endian values passed in wider registers.
+// Ideally this would directly use __bf16, but that type isn't supported by all
+// compilers.
 using BF16ABIType = float;
 #else
 // Default to uint16_t if we have nothing else.
@@ -184,9 +186,5 @@ extern "C" BF16ABIType ATTR_WEAK __truncsfbf2(float f) {
 extern "C" BF16ABIType ATTR_WEAK __truncdfbf2(double d) {
   // This does a double rounding step, but it's precise enough for our use
   // cases.
-  uint16_t bf = __truncsfbf2(static_cast<float>(d));
-  // The output can be a float type, bitcast it from uint16_t.
-  BF16ABIType ret = 0;
-  std::memcpy(&ret, &bf, sizeof(bf));
-  return ret;
+  return __truncsfbf2(static_cast<float>(d));
 }