[cfe-dev] Assertion on valid use of highlighting whitespace

Douglas Gregor dgregor at apple.com
Fri Jun 18 13:26:35 PDT 2010


On Jun 17, 2010, at 7:47 PM, Tom Care wrote:

> I think there might be other things broken in this function too.
> 
> I'm having problems trying to get highlighting/fixits working for parts of printf statements (rather than the whole thing).
> 
> Currently, if I give a SourceLocation within a printf formatting string, it will highlight it from the beginning until the end of the format string (rather than the end I specified). For example:
> 
> /tmp/fixit.c:13:15: warning: precision used with 'n' conversion specifier,
>       resulting in undefined behavior [-Wformat]
>   printf("%100.100n", (int*) 0);
>               ^~~~~
> 
> When what I expect is:
> /tmp/fixit.c:13:15: warning: precision used with 'n' conversion specifier,
>       resulting in undefined behavior [-Wformat]
>   printf("%100.100n", (int*) 0);
>               ^~~~
> 
> This ends up breaking the fixits, which delete the rest of the format string rather than the invalid part.
> 
> The offending line is lib/Frontend/TextDiagnosticPrinter.cpp:116
>       // Add in the length of the token, so that we cover multi-char tokens.
>       EndColNo += Lexer::MeasureTokenLength(End, SM, *LangOpts);
> 
> I'm sure this line has importance, but I don't think it is correct.
> 
> I'm a bit lost in this section! Does anyone have any ideas?

The mental model for a source range is that it is a pair of source locations [B, E), where both B and E are expected to point at the beginning of the token. That line above is relexing the token at the location E to find the end of the token. Since you've adjusted the ranges to point at specific characters, when you're getting is E adjusted to the end of the "token" where you're pointing... which is in the middle of a string literal, hence the somewhat odd highlighting behavior.

We would need to introduce a new notion (e.g., a new "CharacterSourceRange") that forgoes this adjustment and treats the ending source location in the range as character-level positions in the source rather than token-level positions. CharacterSourceRange would only be used in limited places when needed, such as FixItHint and perhaps Diagnostic, and all affected clients would need to be updated.

  - Doug

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/cfe-dev/attachments/20100618/43fd6452/attachment.html>


More information about the cfe-dev mailing list