Talks and Poster Presentations (with Proceedings-Entry):
B. Krüpl, M. Herzog:
"Visually Guided Bottom-Up Table Detection and Segmentation in Web Documents";
- 05-26-2006; in: "Conference Proceedings of WWW 2006",
In the AllRight project, we are developing an algorithm for unsupervised table detection and segmentation that uses the visual rendition of a Web page rather than the HTML code. Our algorithm works bottom-up by grouping word bounding boxes into larger groups and uses a set of heuristics. It has already been implemented and a preliminary evaluation on about 6000 Web documents has been carried out.
Online library catalogue of the TU Vienna:
Project Head Reinhard Pichler:
Know it All and Know it Right: High Quality Mining in the Web
Created from the Publication Database of the Vienna University of Technology.