Class Scrubyt::PreFilterDocument
In: lib/scrubyt/core/scraping/pre_filter_document.rb
Parent: Object

Apply different functions on the input document

Before the document is passed to Hpricot for parsing, we may need to do different stuff with it which are clumsy/not appropriate/impossible to do once the document is loaded.

Methods

Public Class methods

Replace <br/> tags with newlines

[Validate]