<table border="1" cellspacing="0" cellpadding="8">
    <tr>
        <th>Issue</th>
        <td>
            <a href=https://github.com/llvm/llvm-project/issues/54338>54338</a>
        </td>
    </tr>

    <tr>
        <th>Summary</th>
        <td>
            Invalid macro identifiers cause weird diagnostic output
        </td>
    </tr>

    <tr>
      <th>Labels</th>
      <td>
            clang
      </td>
    </tr>

    <tr>
      <th>Assignees</th>
      <td>
      </td>
    </tr>

    <tr>
      <th>Reporter</th>
      <td>
          cor3ntin
      </td>
    </tr>
</table>

<pre>
    ```cpp
#define ðŸ¦„ &
int ðŸ¦„;```

Produces

```cpp
<source>:2:5: error: unexpected character <U+1F984>
int Unknown code<U+1F984>;
    ^~~~~~~~~
```

The **Unknowm code** in the diagnostic here is is unexpected an confusing, we'd expect the source line to be reproduced accurately
</pre>
<img width="1px" height="1px" alt="" src="http://email.email.llvm.org/o/eJx1kcuS2yAQRb-mtaHikkGyrIUW9oxdNbssMh-AoCWRYFDxmMffpyUlGU8qUTUIaOh7uPRev3dwKLdQ8wzlI5Qn4ELjYBwyuJbQXuF0gGPFgB-2vHHpUwbE-U-RXxXW_mvwOiuM92v_UBMP0eegEMQFxIlTq6kxDMGHZZAdvs2oEmqmJhkkjQKjU8_Az_tru-hfPsCe3Q_nXx1TXuPfm4hz3cfog_oCzX_iM-k9_reJTOHk0GnTuW066wozjiXKayNH52Myik0YkJm4xN0t5ELnhhyNG4E_sFcq0Gi25dcSmyHMLm-QPOuRBZw3N-m4UjnIhPa90J3QrWhlkUyy2D25F2mNZjepgmdGo0tmMBgiUzJHJCET9D2fz2nOqcjBdlNKcyS7gV8pRpOm3O-Uv9HE2pffvy9E8Z0oaWpizPS2_FpXQhyLqRuqepClrJvjoUdVVkMlW-QcB9EKsd_rwsoebeygPgPnysrl9hzqx8J0vOS8pE37RtTlcdfwGkUj-mo_tDXnR6hKvEljdwvDzoexCN2K0-cxUtKamOJHUsZoRoe4SlF9mdPkQ6d8EOSIK1b0buX-CSyn5p8">