Evaluation and Comparison of Concept Based and N-Grams Based Text Clustering Using SOM

Full metadata record
Metadata | Description | Language
Author(s): dc.creator | Amine, Abdelmalek | -
Author(s): dc.creator | Elberrichi, Zakaria | -
Author(s): dc.creator | Simonet, Michel | -
Author(s): dc.creator | Malki, Mimoun | -
Date accessioned: dc.date.accessioned | 2026-02-09T11:57:10Z | -
Date available: dc.date.available | 2026-02-09T11:57:10Z | -
Date issued: dc.date.issued | 2008-03-01 | -
Date issued: dc.date.issued | 2017-08-01 | -
Full source of the material: dc.identifier | http://www.dcc.ufla.br/infocomp/index.php/INFOCOMP/article/view/203 | -
Full source of the material: dc.identifier | https://repositorio.ufla.br/handle/1/14967 | -
Source: dc.identifier.uri | http://educapes.capes.gov.br/handle/capes/1151093 | -
Description: dc.description | With the large and rapidly growing number of documents available in digital form (Internet, libraries, CD-ROM…), the automatic classification of texts has become a significant research field and a fundamental task in document processing. This paper deals with the unsupervised classification of textual documents, also called text clustering, using Kohonen's Self-Organizing Maps in two new settings: a conceptual representation of texts and a representation based on n-grams, instead of a representation based on words. The effects of these combinations are examined in several experiments using four similarity measures. The Reuters-21578 corpus is used for evaluation, which is carried out using the F-measure and entropy. | -
Format: dc.format | application/pdf | -
Publisher: dc.publisher | Universidade Federal de Lavras | -
Relation: dc.relation | http://www.dcc.ufla.br/infocomp/index.php/INFOCOMP/article/view/203/188 | -
Source: dc.source | INFOCOMP; Vol 7 No 1 (2008): March, 2008; 27-35 | -
Source: dc.source | 1982-3363 | -
Source: dc.source | 1807-4545 | -
Keywords: dc.subject | Text clustering | -
Keywords: dc.subject | Self-Organizing Maps of Kohonen | -
Keywords: dc.subject | N-grams | -
Keywords: dc.subject | Concept | -
Keywords: dc.subject | Similarity | -
Keywords: dc.subject | Reuters21578 | -
Title: dc.title | Evaluation and Comparison of Concept Based and N-Grams Based Text Clustering Using SOM | -
File type: dc.type | info:eu-repo/semantics/article | -
File type: dc.type | info:eu-repo/semantics/publishedVersion | -
Appears in collections: Repositório Institucional da Universidade Federal de Lavras (RIUFLA)

There are no files associated with this item.