Atenção: Todas as denúncias são sigilosas e sua identidade será preservada.
Os campos nome e e-mail são de preenchimento opcional
Metadados | Descrição | Idioma |
---|---|---|
Autor(es): dc.contributor | Federal University of Technology-Paraná (UTFPR) | - |
Autor(es): dc.contributor | Universidade de São Paulo (USP) | - |
Autor(es): dc.contributor | Universidade Nove de Julho (UNINOVE) | - |
Autor(es): dc.contributor | Universidade Estadual Paulista (Unesp) | - |
Autor(es): dc.creator | Bonidia, Robson P. | - |
Autor(es): dc.creator | MacHida, Jaqueline Sayuri | - |
Autor(es): dc.creator | Negri, Tatianne C. | - |
Autor(es): dc.creator | Alves, Wonder A.L. | - |
Autor(es): dc.creator | Kashiwabara, André Y. | - |
Autor(es): dc.creator | Domingues, Douglas S. [UNESP] | - |
Autor(es): dc.creator | De Carvalho, André | - |
Autor(es): dc.creator | Paschoal, Alexandre R. | - |
Autor(es): dc.creator | Sanches, Danilo S. | - |
Data de aceite: dc.date.accessioned | 2022-02-22T00:46:04Z | - |
Data de disponibilização: dc.date.available | 2022-02-22T00:46:04Z | - |
Data de envio: dc.date.issued | 2021-06-25 | - |
Data de envio: dc.date.issued | 2021-06-25 | - |
Data de envio: dc.date.issued | 2019-12-31 | - |
Fonte completa do material: dc.identifier | http://dx.doi.org/10.1109/ACCESS.2020.3028039 | - |
Fonte completa do material: dc.identifier | http://hdl.handle.net/11449/206069 | - |
Fonte: dc.identifier.uri | http://educapes.capes.gov.br/handle/11449/206069 | - |
Descrição: dc.description | Machine learning algorithms have been applied to numerous transcript datasets to identify Long non-coding RNAs (lncRNAs). Nevertheless, before these algorithms are applied to RNA data, features must be extracted from the original sequences. As many of these features can be redundant or irrelevant, the predictive performance of the algorithms can be improved by performing feature selection. However, the most current approaches usually select features independently, ignoring possible relations. In this paper, we propose a new model, which identifies the best subsets, removing unnecessary, irrelevant, and redundant predictive features, taking the importance of their co-occurrence into account. The proposed model is based on decomposing solutions and is called k-rounds of decomposition features. In this model, the least relevant features are suppressed according to their contribution to a classification task. To evaluate our proposal, we extract from 5 plant species datasets, a set of features based on sequence structures, using GC content, k-mer (1-6), sequence length, and Open Reading Frame. Next, we apply 5 metaheuristics approaches (Genetic Algorithm, (μ +λ) Evolutionary Algorithm, Artificial Bee Colony, Ant Colony Optimization, and Particle Swarm Optimization) to select the best feature subsets. The main contribution of this work was to include in each metaheuristic a decomposition model that uses round and voting scheme. To investigate its relevance, we select the REPTree classifier to assess the predictive capacity of each subset of features selected in 8 plant species.We identified that the inclusion of the proposed decomposition model significantly reduces the dimensions of the datasets and improves predictive performance, regardless of the metaheuristic. Furthermore, the resulting pipeline has been compared with five approaches in the literature, for lncRNA, when it also showed superior predictive performance. Finally, this study generated a new pipeline to find a minimum number of features in lncRNAs and biological sequences. | - |
Descrição: dc.description | Department of Computer Science Bioinformatics Graduate Program Federal University of Technology-Paraná (UTFPR) | - |
Descrição: dc.description | Institute of Mathematics and Computer Sciences University of São Paulo (USP) | - |
Descrição: dc.description | Universidade Nove de Julho (UNINOVE) | - |
Descrição: dc.description | Department of Botany Institute of Biosciences São Paulo State University (UNESP) | - |
Descrição: dc.description | Department of Botany Institute of Biosciences São Paulo State University (UNESP) | - |
Formato: dc.format | 181683-181697 | - |
Idioma: dc.language | en | - |
Relação: dc.relation | IEEE Access | - |
???dc.source???: dc.source | Scopus | - |
Palavras-chave: dc.subject | Bioinformatics | - |
Palavras-chave: dc.subject | Feature selection | - |
Palavras-chave: dc.subject | LncRNAs | - |
Palavras-chave: dc.subject | Machine learning | - |
Palavras-chave: dc.subject | Metaheuristic | - |
Título: dc.title | A novel decomposing model with evolutionary algorithms for feature selection in long non-coding rnas | - |
Tipo de arquivo: dc.type | livro digital | - |
Aparece nas coleções: | Repositório Institucional - Unesp |
O Portal eduCAPES é oferecido ao usuário, condicionado à aceitação dos termos, condições e avisos contidos aqui e sem modificações. A CAPES poderá modificar o conteúdo ou formato deste site ou acabar com a sua operação ou suas ferramentas a seu critério único e sem aviso prévio. Ao acessar este portal, você, usuário pessoa física ou jurídica, se declara compreender e aceitar as condições aqui estabelecidas, da seguinte forma: