Download of htmlcleaner-2.7-src.zip (htmlcleaner-2.7-src.zip ( external link: SF.net): 272,446 bytes) will begin shortly. If not so, click link on the left.
HtmlCleaner is HTML parser written in Java. It transforms dirty HTML to well-formed XML following the same rules that the most web-browsers use.