Dynamic topic hierarchies and segmented rankings in textual OLAP technology.

Registro completo de metadados
MetadadosDescriçãoIdioma
Autor(es): dc.creatorSouza, Adriano Neves de Paula e-
Autor(es): dc.creatorFortes, Reinaldo Silva-
Autor(es): dc.creatorLima, Joubert de Castro-
Data de aceite: dc.date.accessioned2025-08-21T15:44:56Z-
Data de disponibilização: dc.date.available2025-08-21T15:44:56Z-
Data de envio: dc.date.issued2018-01-26-
Data de envio: dc.date.issued2018-01-26-
Data de envio: dc.date.issued2017-
Fonte completa do material: dc.identifierhttp://www.repositorio.ufop.br/handle/123456789/9361-
Fonte completa do material: dc.identifierhttp://www.globalcis.org/jcit/home/index.html-
Fonte: dc.identifier.urihttp://educapes.capes.gov.br/handle/capes/1023186-
Descrição: dc.descriptionThe OLAP technology emerged 20 years ago and recently has been redesigned so that its dimensions, hierarchies and measures can support the particularities of textual data. Organizing textual data hierarchically can be solved with topic hierarchies. Currently, the topic hierarchy is defined only once in the data cube, i.e., for the entire lattice of cuboids. However, such hierarchy is sensitive to the document collection content. Thus, a data cube cell can contain a collection of documents distinct from others in the same cube, causing potential changes in the topic hierarchy. Furthermore, the text segment used in OLAP analysis also changes this hierarchy. In this work, we present a textual data cube with multiple dynamic topic hierarchies for each cube cell. Multiple hierarchies, since the presented approach builds a topic hierarchy per text segment. Another contribution of this work refers to query response. The state-of-the-art normally returns the top-k documents to the topic selected in the query. We go beyond by returning other text segments, such as the most significant titles, abstracts and paragraphs. The approach is designed in four additional steps and each step attenuates a bit more the impact of building multiple topic hierarchies and segmented rankings per cube cell. Experiments using part of the DBLP papers as a document collection reinforce our hypotheses.-
Formato: dc.formatapplication/pdf-
Idioma: dc.languageen-
Direitos: dc.rightsrestrito-
Palavras-chave: dc.subjectData cube-
Palavras-chave: dc.subjectText database-
Palavras-chave: dc.subjectRanking-
Palavras-chave: dc.subjectTopic hierarchy-
Título: dc.titleDynamic topic hierarchies and segmented rankings in textual OLAP technology.-
Aparece nas coleções:Repositório Institucional - UFOP

Não existem arquivos associados a este item.