Buscar
Mostrando ítems 1-3 de 3
Distributed and Collaborative Web Change Detection System
(ComSIS Consortium, 2015)
[Absctract]: Search engines use crawlers to traverse the Web in order to download
web pages and build their indexes. Maintaining these indexes up-to-date is an
essential task to ensure the quality of search results. ...
Twitter: A Good Place to Detect Health Conditions
(PLoS, 2014-01)
[Absctract]: With the proliferation of social networks and blogs, the Internet is increasingly being used to disseminate personal health information rather than just as a source of information. In this paper we exploit the ...
Soft-404 Pages, A Crawling Problem
(Society for Information Organization in India, 2014-04)
[Absctract]: During its traversal of the Web, crawler
systems have to deal with multiple challenges. Some of
them are related with detecting garbage content to avoid
wasting resources processing it. Soft-404 pages are ...