Use this link to cite:
http://hdl.handle.net/2183/41925 Maintaining Web Navigation Flows for Wrappers
Loading...
Identifiers
Publication date
Advisors
Other responsabilities
Journal Title
Bibliographic citation
Raposo, J., Álvarez, M., Losada, J., Pan, A. (2006). Maintaining Web Navigation Flows for Wrappers. In: Lee, J., Shim, J., Lee, Sg., Bussler, C., Shim, S. (eds) Data Engineering Issues in E-Commerce and Services. DEECS 2006. Lecture Notes in Computer Science, vol 4055. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11780397_9
Type of academic work
Academic degree
Abstract
[Abstract]: A substantial subset of the web data follows some kind of underlying structure. In order to let software programs gain full benefit from these "semi-structured" web sources, wrapper programs are built to provide a "machine-readable" view over them. A significant problem with wrappers is that, since web sources are autonomous, they may experience changes that invalidate the current wrapper, so automatic maintenance is an important research issue. Web wrappers must perform two kinds of tasks: automatically navigating through websites and automatically extracting structured data from HTML pages. While several previous works have addressed the automatic maintenance of the components performing the data extraction task, the problem of automatically maintaining the required web navigation sequences remains unaddressed to the best of our knowledge. In this paper we propose and expirementally validate a set of novel heuristics and algorithms to fill this gap. © Springer-Verlag Berlin Heidelberg 2006
Description
Conference held on June 26, 2006, San Francisco, CA, USA
This version of the article has been accepted for publication, after peer review and is subject to Springer Nature’s AM terms of use, but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections.
This version of the article has been accepted for publication, after peer review and is subject to Springer Nature’s AM terms of use, but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections.
Editor version
Rights
© 2006 Springer-Verlag Berlin Heidelberg






