Maintaining Web Navigation Flows for Wrappers

UDC.coleccion: Investigación [es_ES]
UDC.conferenceTitle: Data Engineering Issues in E-Commerce and Services: Second International Workshop, DEECS 2006, San Francisco, CA, USA, June 26, 2006 [es_ES]
UDC.departamento: Ciencias da Computación e Tecnoloxías da Información [es_ES]
UDC.endPage: 100-114 [es_ES]
UDC.grupoInv: Telemática [es_ES]
dc.contributor.author: Raposo Santiago, Juan
dc.contributor.author: Álvarez Díaz, Manuel
dc.contributor.author: Losada, José
dc.contributor.author: Pan Bermúdez, Alberto
dc.date.accessioned: 2025-05-07T10:28:26Z
dc.date.available: 2025-05-07T10:28:26Z
dc.date.issued: 2006-06-21
dc.description: Conference held on June 26, 2006, San Francisco, CA, USA [es_ES]
dc.description: This version of the article has been accepted for publication, after peer review and is subject to Springer Nature’s AM terms of use, but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. [es_ES]
dc.description.abstract: [Abstract]: A substantial subset of the web data follows some kind of underlying structure. In order to let software programs gain full benefit from these "semi-structured" web sources, wrapper programs are built to provide a "machine-readable" view over them. A significant problem with wrappers is that, since web sources are autonomous, they may experience changes that invalidate the current wrapper, so automatic maintenance is an important research issue. Web wrappers must perform two kinds of tasks: automatically navigating through websites and automatically extracting structured data from HTML pages. While several previous works have addressed the automatic maintenance of the components performing the data extraction task, the problem of automatically maintaining the required web navigation sequences remains unaddressed to the best of our knowledge. In this paper we propose and experimentally validate a set of novel heuristics and algorithms to fill this gap. © Springer-Verlag Berlin Heidelberg 2006 [es_ES]
dc.description.sponsorship: This research was partially supported by the Spanish Ministry of Education and Science under project TSI2005-07730. Alberto Pan’s work was partially supported by the "Ramón y Cajal" programme of the Spanish Ministry of Education and Science [es_ES]
dc.identifier.citation: Raposo, J., Álvarez, M., Losada, J., Pan, A. (2006). Maintaining Web Navigation Flows for Wrappers. In: Lee, J., Shim, J., Lee, Sg., Bussler, C., Shim, S. (eds) Data Engineering Issues in E-Commerce and Services. DEECS 2006. Lecture Notes in Computer Science, vol 4055. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11780397_9 [es_ES]
dc.identifier.isbn: 978-3-540-35440-6
dc.identifier.isbn: 978-3-540-35441-3
dc.identifier.issn: 0302-9743
dc.identifier.uri: http://hdl.handle.net/2183/41925
dc.language.iso: eng [es_ES]
dc.publisher: Springer Nature [es_ES]
dc.relation.projectID: info:eu-repo/grantAgreement/AEI/Plan Nacional del I+D+I 2004-2007/TSI2005-07730/ES/COMPONENTES TELEMATICOS PARA BUSQUEDA, EXTRACCION Y ESTRUCTURACION EFICIENTE DE INFORMACION EN RED [es_ES]
dc.relation.uri: https://doi.org/10.1007/11780397_9 [es_ES]
dc.rights: © 2006 Springer-Verlag Berlin Heidelberg [es_ES]
dc.rights.accessRights: open access [es_ES]
dc.subject: Data structures [es_ES]
dc.subject: HTML [es_ES]
dc.subject: Maintenance [es_ES]
dc.subject: Navigation [es_ES]
dc.subject: Problem solving [es_ES]
dc.subject: Data extraction [es_ES]
dc.subject: Navigation flows [es_ES]
dc.subject: Software programs [es_ES]
dc.subject: Wrappers [es_ES]
dc.title: Maintaining Web Navigation Flows for Wrappers [es_ES]
dc.type: conference output [es_ES]
dc.type.hasVersion: AM [es_ES]
dspace.entity.type: Publication
relation.isAuthorOfPublication: 76f0a84a-79bb-4d46-8de5-a960191fb925
relation.isAuthorOfPublication: 8fb413a7-b40a-48ad-861f-985d0492628e
relation.isAuthorOfPublication: 400c236a-710a-4526-b9f3-f496a36ccfe0
relation.isAuthorOfPublication: 79d8a555-94f9-4edc-b6d1-ad514f81941d
relation.isAuthorOfPublication.latestForDiscovery: 76f0a84a-79bb-4d46-8de5-a960191fb925

Files

Original bundle

Name: Raposo_Juan_2006_Maintaining_Web_ Navigation_Wrappers.pdf
Size: 713.22 KB
Format: Adobe Portable Document Format
Description: Accepted Manuscript