Corpus-based Methodology for an Online Multilingual Collocations Dictionary: First Steps

Full metadata record
Metadata | Description | Language
Author(s): dc.contributor | Universidade Estadual Paulista (UNESP) | -
Author(s): dc.contributor | Universidade de Santiago de Compostela | -
Author(s): dc.contributor | University of Alcalá | -
Author(s): dc.contributor | Université de Montréal | -
Author(s): dc.contributor | Universidade da Coruña | -
Author(s): dc.creator | Orenha-Ottaiano, Adriane | -
Author(s): dc.creator | Garcia, Marcos | -
Author(s): dc.creator | de Oliveira Silva, Maria Eugênia Olímpio | -
Author(s): dc.creator | L'Homme, Marie-Claude | -
Author(s): dc.creator | Ramos, Margarita Alonso | -
Author(s): dc.creator | Valêncio, Carlos Roberto | -
Author(s): dc.creator | Tenório, William | -
Date accepted: dc.date.accessioned | 2025-08-21T22:43:21Z | -
Date available: dc.date.available | 2025-08-21T22:43:21Z | -
Date submitted: dc.date.issued | 2023-03-02 | -
Date submitted: dc.date.issued | 2020-12-31 | -
Full source of the material: dc.identifier | http://hdl.handle.net/11449/242228 | -
Source: dc.identifier.uri | http://educapes.capes.gov.br/handle/11449/242228 | -
Description: dc.description | This paper describes the first steps of a corpus-based methodology for the development of an online Platform for Multilingual Collocations Dictionaries (PLATCOL). The platform is intended to be customizable for different target audiences according to their needs. It covers various syntactic structures of collocations that fit into the following taxonomy: verbal, adjectival, nominal, and adverbial. Part of its design, layout and methodological procedures are based on the Bilingual Online Collocations Dictionary Platform (Orenha-Ottaiano, 2017). The methodology also relies on combining automatic methods for extracting candidate collocations (Garcia et al., 2019a) with careful post-editing performed by lexicographers. The automatic approaches use NLP tools to annotate large corpora with lemmas, PoS-tags and dependency relations in five languages (English, French, Portuguese, Spanish and Chinese). Using these data, we apply statistical measures (Evert et al., 2017; Garcia et al., 2019b) and distributional semantics strategies to select the candidates (Garcia et al., 2019c) and retrieve corpus-based examples (Kilgarriff et al., 2008). We also rely on automatic definition extraction (Bond & Foster, 2013) so that collocations can be more effectively organized according to their specific senses. | -
Description: dc.description | São Paulo State University (UNESP) | -
Description: dc.description | Universidade de Santiago de Compostela | -
Description: dc.description | University of Alcalá | -
Description: dc.description | OLST Université de Montréal | -
Description: dc.description | Universidade da Coruña | -
Description: dc.description | São Paulo State University (UNESP) | -
Format: dc.format | 1-28 | -
Language: dc.language | en | -
Relation: dc.relation | Proceedings of Electronic Lexicography in the 21st Century Conference | -
Source: dc.source | Scopus | -
Keywords: dc.subject | automatic extraction | -
Keywords: dc.subject | collocations | -
Keywords: dc.subject | collocations dictionary | -
Keywords: dc.subject | lexicography | -
Keywords: dc.subject | online platform | -
Title: dc.title | Corpus-based Methodology for an Online Multilingual Collocations Dictionary: First Steps | -
File type: dc.type | digital lesson | -
Appears in collections: Repositório Institucional - Unesp

There are no files associated with this item.