<html>
    <head>
      <base href="https://bugs.llvm.org/">
    </head>
    <body><table border="1" cellspacing="0" cellpadding="8">
        <tr>
          <th>Bug ID</th>
          <td><a class="bz_bug_link 
          bz_status_NEW "
   title="NEW - extract_symbols.py uses wrong encoding for dumpbin"
   href="https://bugs.llvm.org/show_bug.cgi?id=47258">47258</a>
          </td>
        </tr>

        <tr>
          <th>Summary</th>
          <td>extract_symbols.py uses wrong encoding for dumpbin
          </td>
        </tr>

        <tr>
          <th>Product</th>
          <td>Build scripts
          </td>
        </tr>

        <tr>
          <th>Version</th>
          <td>trunk
          </td>
        </tr>

        <tr>
          <th>Hardware</th>
          <td>PC
          </td>
        </tr>

        <tr>
          <th>OS</th>
          <td>Windows NT
          </td>
        </tr>

        <tr>
          <th>Status</th>
          <td>NEW
          </td>
        </tr>

        <tr>
          <th>Severity</th>
          <td>enhancement
          </td>
        </tr>

        <tr>
          <th>Priority</th>
          <td>P
          </td>
        </tr>

        <tr>
          <th>Component</th>
          <td>cmake
          </td>
        </tr>

        <tr>
          <th>Assignee</th>
          <td>unassignedbugs@nondot.org
          </td>
        </tr>

        <tr>
          <th>Reporter</th>
          <td>dmajor@mozilla.com
          </td>
        </tr>

        <tr>
          <th>CC</th>
          <td>llvm-bugs@lists.llvm.org
          </td>
        </tr></table>
      <p>
        <div>
        <pre>When running extract_symbols.py with Python 3, we often see errors such as
`UnicodeDecodeError: 'charmap' codec can't decode byte 0x81 in position 4045:
character maps to <undefined>`.

This can be fixed for Python 3 by adding `encoding='mbcs'` to
<a href="https://github.com/llvm/llvm-project/blob/b74b80bb2d31dfb09559fcb71a1c474a421c633d/llvm/utils/extract_symbols.py#L35">https://github.com/llvm/llvm-project/blob/b74b80bb2d31dfb09559fcb71a1c474a421c633d/llvm/utils/extract_symbols.py#L35</a>
(only for this one call; the other Popens in the file are fine as is) however
I'm told this breaks Python 2.7 which is still supported for LLVM. Would need
either a more universal fix or a version check.</pre>
        </div>
      </p>


      <hr>
      <span>You are receiving this mail because:</span>

      <ul>
          <li>You are on the CC list for the bug.</li>
      </ul>
    </body>
</html>