[Back]


Talks and Poster Presentations (with Proceedings-Entry):

W. Gatterbauer, B. Krüpl, W. Holzinger, M. Herzog:
"Web Information Extraction Using Eupeptic Data in Web Tables";
Talk: RAWS 2005, Tocna, Tschechien; 09-14-2005 - 09-16-2005; in: "Proceedings of the 1st International Workshop on Representation and Analysis of Web Space", V. Snásel, V. Svátek (ed.); Faculty of Electrical Engineering and Computer Science, VSB - Technical University of Ostrava, (2005), ISBN: 80-248-0864-1; 41 - 48.



English abstract:
By leveraging on the redundant information on the Web, we are
building an Web information extraction system that concentrates on
eupeptic data in Web tables. We use the term eupeptic to describe
such representations of information that allow for easy
interpretation of the subject--predicate--object nature of
individual data items. The system mimics a human approach to
information gathering. It explicitly uses visual cues on rendered
Web pages to locate eupeptic data; it uses keywords to identify
relevant chunks of data that gets processed later on a deeper
level; and it expands its initial search to include more pages
when it spots new relevant information.


Online library catalogue of the TU Vienna:
http://aleph.ub.tuwien.ac.at/F?base=tuw01&func=find-c&ccl_term=AC05936218



Related Projects:
Project Head Reinhard Pichler:
Know it All and Know it Right: High Quality Mining in the Web


Created from the Publication Database of the Vienna University of Technology.