Performance evaluation of data migration methods between the host and the device in CUDA-based programming

Registro completo de metadados
Autor(es): dc.contributorUniversidade Estadual Paulista (UNESP)-
Autor(es): dc.creatorSantos, Rafael Silva-
Autor(es): dc.creatorEler, Danilo Medeiros-
Autor(es): dc.creatorGarcia, Rogério Eduardo-
Data de aceite:
Data de disponibilização:
Data de envio:
Data de envio:
Data de envio:
Fonte completa do material: dc.identifier
Fonte completa do material: dc.identifier
Fonte: dc.identifier.uri
Descrição: dc.descriptionCUDA-based programming model is heterogeneous – composed of two components: host (CPU) and device (GPU). Both components have separated memory spaces and processing units. A great challenge to increase GPU-based application performance is the data migration between these memory spaces. Currently, the CUDA platform supports the following data migration methods: UMA, zero-copy, pageable and pinned memory. In this paper, we compare the zero-copy performance method with the other methods by considering the overall application runtime. Additionally, we investigated the aspects of data migration process to enunciate causes of the performance variations. The obtained results demonstrated in some cases the zero-copy memory can provide an average performance on 19% higher than the pinned memory transfer. In the studied situation, this method was the second most efficient. Finally, we present limitations of zero-copy memory as a resource for improving performance of CUDA applications.-
Formato: dc.format689-700-
Idioma: dc.languageen-
Relação: dc.relationAdvances in Intelligent Systems and Computing-
Direitos: dc.rightsopenAccess-
Título: dc.titlePerformance evaluation of data migration methods between the host and the device in CUDA-based programming-
Tipo de arquivo: dc.typelivro digital-
Aparece nas coleções:Repositório Institucional - Unesp

Não existem arquivos associados a este item.