[PATCH] D34581: Fix missing/mismatched html tags

Matthias Braun via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed Aug 30 13:23:11 PDT 2017


MatzeB added a comment.

In https://reviews.llvm.org/D34581#856259, @kristof.beyls wrote:

> In https://reviews.llvm.org/D34581#856055, @MatzeB wrote:
>
> > For the record: I just added an optional integration to pytidylib/tidy-html5 that checks lnt pages for html problems (r312061). It can be used with `lnt -Dtidylib=1`.
>
>
> Very nice! Thanks for all the cleanups and improvements you've been making to LNT lately!
>
> I guess there's a chance tidylib might be better than just aiming to parse XHTML as proper XML, as it may go beyond what an XML validator is capable of, by being written specifically for HTML?


I would have preferred simpler xml validation too, in fact that is what I tried first. In the end I failed with that approach because the WTForms library that we use only outputs HTML5 and has no way to produce XHTML, thus resulting in unclosed <input> tags without an good way to fix it. So various pages with forms on them will fail xml validation right now.

So the best thing I could find was the tidylib/tidy-html5 combination.


Repository:
  rL LLVM

https://reviews.llvm.org/D34581





More information about the llvm-commits mailing list