<html>
    <head>
      <base href="http://llvm.org/bugs/" />
    </head>
    <body><table border="1" cellspacing="0" cellpadding="8">
        <tr>
          <th>Bug ID</th>
          <td><a class="bz_bug_link 
          bz_status_NEW "
   title="NEW --- - minor display problem with text diagnostic source lines when source contains multi-byte characters"
   href="http://llvm.org/bugs/show_bug.cgi?id=20453">20453</a>
          </td>
        </tr>

        <tr>
          <th>Summary</th>
          <td>minor display problem with text diagnostic source lines when source contains multi-byte characters
          </td>
        </tr>

        <tr>
          <th>Product</th>
          <td>clang
          </td>
        </tr>

        <tr>
          <th>Version</th>
          <td>trunk
          </td>
        </tr>

        <tr>
          <th>Hardware</th>
          <td>PC
          </td>
        </tr>

        <tr>
          <th>OS</th>
          <td>Windows XP
          </td>
        </tr>

        <tr>
          <th>Status</th>
          <td>NEW
          </td>
        </tr>

        <tr>
          <th>Severity</th>
          <td>normal
          </td>
        </tr>

        <tr>
          <th>Priority</th>
          <td>P
          </td>
        </tr>

        <tr>
          <th>Component</th>
          <td>Frontend
          </td>
        </tr>

        <tr>
          <th>Assignee</th>
          <td>unassignedclangbugs@nondot.org
          </td>
        </tr>

        <tr>
          <th>Reporter</th>
          <td>seth.cantrell@gmail.com
          </td>
        </tr>

        <tr>
          <th>CC</th>
          <td>llvmbugs@cs.uiuc.edu
          </td>
        </tr>

        <tr>
          <th>Classification</th>
          <td>Unclassified
          </td>
        </tr></table>
      <p>
        <div>
        <pre>UTF-8 source file 'main.cpp':

int main() {
    int x = 'µ';
}

from cmd.exe run command 'chcp 65001'

run command 'clang++ main.cpp'

Output:

C:\path\main.cpp:2:10: error: character too large for enclosing character
literal type
        int x = 'µ';;
                ^
1 error generated.


There's an extra semicolon printed. Compiling with the console codepage set to
437 produces:

C:\path\main.cpp:2:10: error: character too large for enclosing character
literal type
        int x = '┬╡';
                ^

The output under chcp 65001 is what you would get writing out the string using
one column per byte and then overwriting it using the correct number of columns
for each character.</pre>
        </div>
      </p>
      <hr>
      <span>You are receiving this mail because:</span>
      
      <ul>
          <li>You are on the CC list for the bug.</li>
      </ul>
    </body>
</html>