  • VersionDude
  • Status: Maintained elsewhere

The htmlparser project is an HTML5 parsing engine written in Java that provides SAX and DOM interfaces. Early development repositories were hosted here under /whattf/htmlparser/trunk/.
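As a rough illustration of the difference between the two interface styles, the sketch below drives the standard org.xml.sax and org.w3c.dom APIs with the JDK's built-in XML parser as a stand-in, so it runs without the htmlparser jar and only accepts well-formed markup. The class and method names (SaxDomSketch, elementNamesViaSax, rootNameViaDom) are hypothetical. With htmlparser on the classpath, the expectation is that its own XMLReader and DocumentBuilder implementations (understood to be nu.validator.htmlparser.sax.HtmlParser and nu.validator.htmlparser.dom.HtmlDocumentBuilder; check this against the project's documentation) would be swapped in where the comments indicate.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.SAXParserFactory;
import org.w3c.dom.Document;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.XMLReader;
import org.xml.sax.helpers.DefaultHandler;

public class SaxDomSketch {

    // SAX style: the parser pushes events to a handler while it reads the input.
    public static List<String> elementNamesViaSax(String markup) throws Exception {
        List<String> names = new ArrayList<>();
        // Stand-in: the JDK XML reader. Assumption: an HTML5-aware XMLReader
        // from htmlparser would be constructed here instead.
        XMLReader reader = SAXParserFactory.newInstance().newSAXParser().getXMLReader();
        reader.setContentHandler(new DefaultHandler() {
            @Override
            public void startElement(String uri, String localName,
                                     String qName, Attributes atts) {
                names.add(qName); // record each start tag in document order
            }
        });
        reader.parse(new InputSource(new StringReader(markup)));
        return names;
    }

    // DOM style: the parser builds a whole tree that is inspected afterwards.
    public static String rootNameViaDom(String markup) throws Exception {
        // Stand-in: the JDK DocumentBuilder. Assumption: an HTML5-aware
        // DocumentBuilder from htmlparser would be constructed here instead.
        DocumentBuilder builder = DocumentBuilderFactory.newInstance().newDocumentBuilder();
        Document doc = builder.parse(new InputSource(new StringReader(markup)));
        return doc.getDocumentElement().getNodeName();
    }

    public static void main(String[] args) throws Exception {
        String markup = "<html><body><p>Hello</p></body></html>";
        System.out.println(elementNamesViaSax(markup)); // [html, body, p]
        System.out.println(rootNameViaDom(markup));     // html
    }
}
```

The SAX route suits streaming checks where memory matters, while the DOM route is more convenient when the whole tree has to be queried or modified after parsing.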

The parser became the engine behind the Nu HTML Checker (validator.nu). It implements the HTML5 tokenization and tree-construction algorithms, which define how conforming browsers turn markup, including malformed markup, into the same document tree.

Current home
