[clang] [Clang][Lexer][Performance] Optimize Lexer whitespace skipping logic (PR #180819)

Thibault Monnier via cfe-commits cfe-commits at lists.llvm.org
Sat Feb 14 11:33:15 PST 2026


Thibault-Monnier wrote:

> We should also check whether a loop that simply checks for all whitespaces is faster. All the non-' ' whitespaces are consecutive numbers together, so it boils down to a `(c >= '\t' && c <= '\r') || c == ' '` (though `\n` has special handling, so it might not be as perfect).

`isHorizontalWhitespace` only matches `' '`, `'\t'`, `'\f'`, `'\v'`. I tried replacing the conditions by
```cpp
*CurPtr == ' ' || *CurPtr == '\t' || *CurPtr == '\f' || *CurPtr == '\v'
```
but it is slower, which is understandable seeing [what it optimizes to](https://godbolt.org/z/P6coT8nn5).

https://github.com/llvm/llvm-project/pull/180819


More information about the cfe-commits mailing list