tagsoup-0.11.1: Parsing and extracting information from (possibly malformed) HTML/XML documentsContentsIndex
tagsoup-0.11.1: Parsing and extracting information from (possibly malformed) HTML/XML documents

TagSoup is a library for parsing HTML/XML. It supports the HTML 5 specification, and can be used to parse either well-formed XML, or unstructured and malformed HTML from the web. The library also provides useful functions to extract information from an HTML document, making it ideal for screen-scraping.

Users should start from the Text.HTML.TagSoup module.

Modules
show/hideText
show/hideHTML
Text.HTML.Download
show/hideText.HTML.TagSoup
Text.HTML.TagSoup.Entity
Text.HTML.TagSoup.Match
Text.HTML.TagSoup.Tree
Text.StringLike
Produced by Haddock version 2.6.1