ITPilot: a toolkit for industrial-strength Web data extraction

UDC.coleccionInvestigaciónes_ES
UDC.conferenceTitleThe 2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05)es_ES
UDC.departamentoCiencias da Computación e Tecnoloxías da Informaciónes_ES
UDC.grupoInvTelemáticaes_ES
dc.contributor.authorPan Bermúdez, Alberto
dc.contributor.authorRaposo Santiago, Juan
dc.contributor.authorÁlvarez Díaz, Manuel
dc.contributor.authorMontoto, Paula
dc.contributor.authorLosada, José
dc.contributor.authorHidalgo, Justo
dc.date.accessioned2025-05-07T09:48:44Z
dc.date.available2025-05-07T09:48:44Z
dc.date.issued2005-10-17
dc.description© 2005 IEEE. This version of the paper has been accepted for publication. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.es_ES
dc.descriptionConference held from 19 to 22 September 2005, Compiègne, Francees_ES
dc.description.abstract[Abstract]: In recent years, many research systems have been proposed to perform data extraction and automation tasks on Web sources. Since most of today's Web sources are "human-readable" but not "machine-readable", these systems must address a number of difficult challenges, such as dealing with complex navigation sequences, extracting data from HTML pages and reacting to source changes. Denodo Corporation has developed ITPilot, an industrial-strength solution that allows complex "wrappers" for Web sources to be graphically generated and automatically maintained. This paper presents the architecture and the basic ideas "behind the scenes" in ITPilot.es_ES
dc.identifier.citationA. Pan, J. Raposo, M. Alvarez, P. Montoto, J. Losada, y J. Hidalgo, «ITPilot: A Toolkit for Industrial-Strength Web Data Extraction», en The 2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI’05), Compiegne, France: IEEE, 2005, pp. 798-801. doi: 10.1109/WI.2005.85es_ES
dc.identifier.isbn0-7695-2415-X
dc.identifier.urihttp://hdl.handle.net/2183/41924
dc.language.isoenges_ES
dc.publisherIEEEes_ES
dc.relation.urihttps://doi.org/10.1109/WI.2005.85es_ES
dc.rightsCopyright © 2005, IEEEes_ES
dc.rights.accessRightsopen accesses_ES
dc.subjectData mininges_ES
dc.subjectBookses_ES
dc.subjectNavigationes_ES
dc.subjectWeb serviceses_ES
dc.subjectHTMLes_ES
dc.subjectJavaes_ES
dc.subjectComputer languageses_ES
dc.subjectAutomationes_ES
dc.subjectComputer architecturees_ES
dc.subjectWorld Wide Webes_ES
dc.titleITPilot: a toolkit for industrial-strength Web data extractiones_ES
dc.typeconference outputes_ES
dc.type.hasVersionAMes_ES
dspace.entity.typePublication
relation.isAuthorOfPublication79d8a555-94f9-4edc-b6d1-ad514f81941d
relation.isAuthorOfPublication76f0a84a-79bb-4d46-8de5-a960191fb925
relation.isAuthorOfPublication8fb413a7-b40a-48ad-861f-985d0492628e
relation.isAuthorOfPublication6711ba39-80ba-4e57-8881-db47fc022efd
relation.isAuthorOfPublication400c236a-710a-4526-b9f3-f496a36ccfe0
relation.isAuthorOfPublication.latestForDiscovery79d8a555-94f9-4edc-b6d1-ad514f81941d

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Pan_Alberto_2005_ITPilot.pdf
Size:
785.46 KB
Format:
Adobe Portable Document Format
Description:
Accepted Manuscript