

Talks and Poster Presentations (with Proceedings-Entry):

B. Krüpl, W. Holzinger, Y. Darmaputra, R. Baumgartner:
"A Flight Meta-Search Engine with Metamorph";
Poster: 18th Int. World Wide Web Conference, Madrid, Spain; 04-20-2009 - 04-24-2009; in: "WWW 2009 Proceedings", J. Quemada, G. Léon (eds.); Association for Computing Machinery, Inc. (ACM), (2009), ISBN: 978-1-60558-487-4; 2 pages.



English abstract:
We demonstrate a flight meta-search engine that is based on the Metamorph framework. Metamorph provides mechanisms to model web forms together with the interactions needed to fulfil a request, and can generate interaction sequences that pose queries through these web forms and collect the results. In this paper, we discuss a new feature that uses the forms themselves as an information source. We show how data can be extracted from web forms (rather than from the data behind them) to generate a graph of flight connections between cities.
The flight connection graph allows us to vastly reduce the number of queries that the engine sends to airline websites in the most interesting search scenarios: those that involve the controversial practice of creative ticketing, in which agencies attempt to find lower-priced fares by using more than one airline for a journey. We describe a system that obtains data from a number of websites to identify promising routes and prune the search tree. Heuristics that make use of geographical information and a cost estimate based on historical data are employed. The results are then made available to improve the quality of future search requests.
Categories and Subject Descriptors: H.3.4 [Information Storage and Retrieval]: Systems and Software
General Terms: Algorithms, Design, Experimentation.
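
Illustrative sketch (not taken from the paper): the idea of building a flight connection graph from web form options and using geography to prune multi-airline routes can be pictured roughly as follows. Everything here is an assumption made for illustration: the airline names, the FORM_OPTIONS data, the approximate airport coordinates, the function names, the one-stopover limit and the max_detour threshold. Metamorph's actual data model and heuristics (including the cost estimate based on historical data) are not described in the abstract and are not reproduced here.

```python
from math import radians, sin, cos, asin, sqrt

# Hypothetical data: destinations listed in each airline's search form
# for a given origin (made-up values for illustration only).
FORM_OPTIONS = {
    "AirA": {"VIE": ["MAD", "FRA"], "FRA": ["MAD", "LIS"]},
    "AirB": {"VIE": ["LIS"], "MAD": ["LIS"]},
}

# Approximate airport coordinates as (latitude, longitude) in degrees.
COORDS = {
    "VIE": (48.11, 16.57),
    "FRA": (50.03, 8.57),
    "MAD": (40.47, -3.56),
    "LIS": (38.77, -9.13),
}


def build_graph(form_options):
    """Merge per-airline form options into origin -> {destination: {airlines}}."""
    graph = {}
    for airline, routes in form_options.items():
        for origin, dests in routes.items():
            for dest in dests:
                graph.setdefault(origin, {}).setdefault(dest, set()).add(airline)
    return graph


def distance_km(a, b):
    """Great-circle distance between two airports (haversine formula)."""
    lat1, lon1, lat2, lon2 = map(radians, (*COORDS[a], *COORDS[b]))
    h = (sin((lat2 - lat1) / 2) ** 2
         + cos(lat1) * cos(lat2) * sin((lon2 - lon1) / 2) ** 2)
    return 2 * 6371.0 * asin(sqrt(h))


def candidate_routes(graph, origin, dest, max_detour=1.5):
    """Return the direct route (if any) plus one-stop routes whose total
    length is at most max_detour times the direct distance."""
    direct = distance_km(origin, dest)
    routes = []
    if dest in graph.get(origin, {}):
        routes.append([origin, dest])
    for stop in graph.get(origin, {}):
        if stop == dest or dest not in graph.get(stop, {}):
            continue  # no onward connection listed in any form
        if distance_km(origin, stop) + distance_km(stop, dest) <= max_detour * direct:
            routes.append([origin, stop, dest])
    return routes


graph = build_graph(FORM_OPTIONS)
routes = candidate_routes(graph, "VIE", "LIS")
```

Only the routes returned by candidate_routes would then need to be queried on the airline websites; lowering max_detour prunes more aggressively at the risk of missing an unusual but cheap connection.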

Keywords:
Hidden Web, Web Data Extraction, Web Form Mapping, Web Form Extraction


Related Projects:
Project Head Reinhard Pichler:
Advanced Barrier-free Browser Accessibility


Created from the Publication Database of the Vienna University of Technology.