7

Hybrid Sweeping: Streamlined Perceptual Structured-Text Refinement

This thesis discusses the KrdWrd Project. The Project goals are to provide tools and infrastructure for acquisition, visual annotation, merging and storage of Web pages as parts of bigger corpora, and to develop a classification engine that learns to …