Bug#382995: ITP: nekohtml -- HTML parser for Java

marcus at better.se marcus at better.se
Mon Aug 14 14:55:27 UTC 2006


Package: wnpp
Severity: wishlist

* Package name    : nekohtml
  Version         : 0.9.5
  Upstream Author : Andy Clark
* URL or Web page : http://people.apache.org/~andyc/neko/doc/html/
* License         : CyberNeko Software License, Version 1.0
  Description     : HTML parser for Java

 This is a simple HTML scanner and tag balancer that enables
 application programmers to parse HTML documents and access the
 information using standard XML interfaces. The parser can scan HTML
 files and "fix up" many common mistakes that human (and computer)
 authors make in writing HTML documents. NekoHTML adds missing parent
 elements; automatically closes elements with optional end tags; and
 can handle mismatched inline element tags.





More information about the pkg-java-maintainers mailing list