Double distance-calculation-pruning for similarity search

Registro completo de metadados
MetadadosDescriçãoIdioma
Autor(es): dc.contributorUniversidade Estadual Paulista (UNESP)-
Autor(es): dc.creatorPola, Ives Renê Venturini-
Autor(es): dc.creatorPola, Fernanda Paula Barbosa-
Autor(es): dc.creatorEler, Danilo Medeiros-
Data de aceite: dc.date.accessioned2021-03-11T00:57:21Z-
Data de disponibilização: dc.date.available2021-03-11T00:57:21Z-
Data de envio: dc.date.issued2018-12-11-
Data de envio: dc.date.issued2018-12-11-
Data de envio: dc.date.issued2018-05-17-
Fonte completa do material: dc.identifierhttp://dx.doi.org/10.3390/info9050124-
Fonte completa do material: dc.identifierhttp://hdl.handle.net/11449/179874-
Fonte: dc.identifier.urihttp://educapes.capes.gov.br/handle/11449/179874-
Descrição: dc.descriptionMany modern applications deal with complex data, where retrieval by similarity plays an important role. Complex data main comparison mechanisms are based on similarity predicates. They are usually immersed in metric spaces where distance functions are employed to express the similarity and a lower bound property is usually employed to prevent distance calculations. Retrieval by similarity is implemented by unary and binary operators. Most of the studies aimed at improving the efficiency of unary operators, either by using metric access methods or mathematical properties to prune parts of the search space during query answering. Studies on binary operators to solve similarity joins aim to improve efficiency and most of them use only the metric lower bound property for pruning. However, they are dependent on the query parameters, such as the range radius. In this paper, we propose a generic concept that uses both lower and upper bound properties based on the Metric Spaces Theory to increase the avoidance of element comparisons. The concept can be applied on any existing similarity retrieval method. We analyzed the prunability power increase and show an example of its application on classical join nested loops algorithms. Practical evaluation over both synthetic and real data sets shows that our method reduced the number of distance evaluations on similarity joins.-
Idioma: dc.languageen-
Relação: dc.relationInformation (Switzerland)-
Relação: dc.relation0,222-
Direitos: dc.rightsopenAccess-
Palavras-chave: dc.subjectInformation retrieval-
Palavras-chave: dc.subjectMetric indexing-
Palavras-chave: dc.subjectSimilarity joins-
Título: dc.titleDouble distance-calculation-pruning for similarity search-
Tipo de arquivo: dc.typelivro digital-
Aparece nas coleções:Repositório Institucional - Unesp

Não existem arquivos associados a este item.