A framework to collect and extract publication lists of a given researcher from the web

Registro completo de metadados
MetadadosDescriçãoIdioma
Autor(es): dc.creatorGarcia, Cristiano Mesquita-
Autor(es): dc.creatorPereira, Armando Honorio-
Autor(es): dc.creatorPereira, Denilson Alves-
Data de aceite: dc.date.accessioned2026-02-09T12:10:24Z-
Data de disponibilização: dc.date.available2026-02-09T12:10:24Z-
Data de envio: dc.date.issued2018-07-27-
Data de envio: dc.date.issued2018-07-27-
Data de envio: dc.date.issued2017-
Fonte completa do material: dc.identifierhttps://repositorio.ufla.br/handle/1/29780-
Fonte completa do material: dc.identifierhttps://www.inderscienceonline.com/doi/abs/10.1504/IJWET.2017.088391-
Fonte: dc.identifier.urihttp://educapes.capes.gov.br/handle/capes/1155885-
Descrição: dc.descriptionResearchers usually publish their publication lists on the web. Collecting and extracting them can be of great value to research funding agencies and to applications such as academic network analysis and ranking systems. Because of the wide variety of citation styles and different web page formats, it is not straightforward to develop an automatic system to collect and extract researchers' publication lists. In this paper, we describe the method used by our framework to collect and extract publication lists. It is composed of two tools, named Raposa - Citation Extractor, and Tucano - Publication Lists Collector. Raposa uses a method that identifies regions in the web page containing citations and the delimiters separating them. Tucano collects publication lists by submitting queries to a web search engine. Experimental results show that our framework obtains 93.5% of F1 measure for collecting publication lists, which is a better value when compared to Google Scholar.-
Idioma: dc.languageen-
Publicador: dc.publisherInderscience-
Direitos: dc.rightsrestrictAccess-
???dc.source???: dc.sourceInternational Journal of Web Engineering and Technology-
Palavras-chave: dc.subjectCitation extractor-
Palavras-chave: dc.subjectPublication lists collector-
Palavras-chave: dc.subjectWeb search engine-
Palavras-chave: dc.subjectExtrator de citação-
Palavras-chave: dc.subjectColeta de listas de publicação-
Palavras-chave: dc.subjectMotor de busca-
Título: dc.titleA framework to collect and extract publication lists of a given researcher from the web-
Tipo de arquivo: dc.typeArtigo-
Aparece nas coleções:Repositório Institucional da Universidade Federal de Lavras (RIUFLA)

Não existem arquivos associados a este item.