<div dir="ltr">Hello.<div><br></div><div><font face="monospace">> <span style="color:rgb(0,0,0);white-space:pre-wrap">2) Add patches to Clang to allow EBCDIC and ASCII (ISO-8859-1) encoded </span></font></div><pre style="white-space:pre-wrap;color:rgb(0,0,0)">input source files. This would be done at the file open time to allow the 

rest of Clang to operate as if the source was UTF-8 and so require no 

changes downstream. Feedback on this plan is welcome from the Clang 

community.</pre><div>Would it be correct to assume that this EBCDIC -> UTF-8 mapping would be as prescribed by<br>UTF-EBCDIC / IBM CDRA, notably for the control characters that do not map exactly?<br>Notably, if the execution encoding is EBCDIC, is '0x06' equivalent to '0086', etc?</div><div><br></div><div>The question "Is Unicode sufficient to represent all characters present in the input source without using the Private Use Area?" is one that</div><div>is relevant to both Clang and the C/C++ standard. ( I do hope that it is the case!)  <br><div><font face="arial, sans-serif"><br></font></div><div><font face="arial, sans-serif">Thanks,</font></div><div><div><div><pre style="white-space:pre-wrap;color:rgb(0,0,0)"><font face="arial, sans-serif">Corentin</font></pre></div></div></div></div></div>