Hpricot - HTML parser for Ruby
Hpricot is a fast, flexible HTML parser. Hpricot can be handy for reading broken XML files, since many of the same techniques can be used. If a quote is missing, Hpricot tries to figure it out. If tags overlap, Hpricot works on sorting them out. Source code location: http://github.com/hpricot/hpricot
- Hpricot is a standalone library. It requires no other libraries. Just Ruby!
- Hpricot works hard to sort out bad HTML and pays a small penalty in order to get that right.
- If you can see it in Firefox, then Hpricot should parse it.
- Primarily, Hpricot is used for reading HTML and tries to sort out troubled HTML by having some idea of what good HTML is.
http://hpricot.com/
License:
Tech:
Tags: