The elementtree.TidyHTMLTreeBuilder Module

Tools to build well-formed element trees from arbitrary HTML or XHTML files, based on the TIDY parser library.

This tree builder requires the _elementtidy extension module (available from http://effbot.org/downloads).

Module Contents

parse(source) [#]

Parses an HTML document into an XHTML-style element tree.

source
A filename or file object containing HTML data.
Returns:
An ElementTree instance

TidyHTMLTreeBuilder() (class) [#]

ElementTree builder for HTML source code.

Raises RuntimeError:
If the _elementtidy processor is not available.

For more information about this class, see The TidyHTMLTreeBuilder Class.

TreeBuilder (variable) [#]

An alias for the TidyHTMLTreeBuilder class.

The TidyHTMLTreeBuilder Class

TidyHTMLTreeBuilder() (class) [#]

ElementTree builder for HTML source code. This builder converts an HTML document or fragment to an XHTML ElementTree, by running it through the _elementtidy processor.

Raises RuntimeError:
If the _elementtidy processor is not available.

close() ⇒ Element [#]

Finishes feeding data to the parser.

Returns:
An element structure.

feed(text) [#]

Feeds data to the parser.

data
Encoded data.