Download of htmlcleaner-2.26-src.zip (htmlcleaner-2.26-src.zip ( external link: SF.net): 414,195 bytes) will begin shortly. If not so, click link on the left.
HtmlCleaner is HTML parser written in Java. It transforms dirty HTML to well-formed XML following the same rules that the most web-browsers use.