May 23, 2006

Visually guided bottom-up table detection and segmentation in web documents

Key Points

Key points are not available for this paper at this time.

Abstract

In the AllRight project, we are developing an algorithm for unsupervised table detection and segmentation that uses the visual rendition of a Web page rather than the HTML code. Our algorithm works bottom-up by grouping word bounding boxes into larger groups and uses a set of heuristics. It has already been implemented and a preliminary evaluation on about 6000 Web documents has been carried out.

Mark Helpful

Bookmark

Relay

Cite This Study

Krüpl et al. (Tue,) studied this question.

synapsesocial.com/papers/6a218cea5c0c8498e2582042 https://doi.org/https://doi.org/10.1145/1135777.1135951

Mark Helpful

Bookmark

Relay