Genealogical trees on the web: A search engine user perspective

Full metadata record
Metadata | Description | Language
Author(s): dc.creator | Yates, Ricardo Baeza | -
Author(s): dc.creator | Pereira Junior, Álvaro Rodrigues | -
Author(s): dc.creator | Ziviani, Nivio | -
Date accepted: dc.date.accessioned | 2019-11-06T13:25:25Z | -
Date available: dc.date.available | 2019-11-06T13:25:25Z | -
Submission date: dc.date.issued | 2012-10-18 | -
Submission date: dc.date.issued | 2008 | -
Full source of the material: dc.identifier | http://hdl.handle.net/123456789/1676 | -
Source: dc.identifier.uri | http://educapes.capes.gov.br/handle/capes/555060 | -
Description: dc.description | This paper presents an extensive study about the evolution of textual content on the Web, which shows how some new pages are created from scratch while others are created using already existing content. We show that a significant fraction of the Web is a byproduct of the latter case. We introduce the concept of Web genealogical tree, in which every page in a Web snapshot is classified into a component. We study in detail these components, characterizing the copies and identifying the relation between a source of content and a search engine, by comparing page relevance measures, documents returned by real queries performed in the past, and click-through data. We observe that sources of copies are more frequently returned by queries and more clicked than other documents. | -
Language: dc.language | en | -
Keywords: dc.subject | Web | -
Keywords: dc.subject | Text | -
Keywords: dc.subject | Content evolution | -
Keywords: dc.subject | Search engine | -
Keywords: dc.subject | Web mining | -
Title: dc.title | Genealogical trees on the web: A search engine user perspective | -
Appears in collections: Repositório Institucional - UFOP

There are no files associated with this item.