Web of Science: 2 citations, Scopus: 3 citations, Google Scholar: citations
An approach for an efficient execution of SPMD applications on Multi-core environments
Muresano Cáceres, Ronal Roberto (Universitat Autònoma de Barcelona. Departament d'Arquitectura de Computadors i Sistemes Operatius)
Meyer, Hugo Daniel (Universitat Autònoma de Barcelona. Departament d'Arquitectura de Computadors i Sistemes Operatius)
Rexachs del Rosario, Dolores Isabel (Universitat Autònoma de Barcelona. Departament d'Arquitectura de Computadors i Sistemes Operatius)
Luque, Emilio (Universitat Autònoma de Barcelona. Departament d'Arquitectura de Computadors i Sistemes Operatius)

Date: 2017
Abstract: Executing traditional Message Passing Interface (MPI) applications on multi-core clusters while balancing speed and computational efficiency is a difficult task for parallel programmers. For this reason, communications on multi-core clusters ought to be handled carefully in order to improve performance metrics such as efficiency, speedup, execution time and scalability. In this paper we focus on SPMD (Single Program Multiple Data) applications with high communication volume and synchronicity that are also static, local and regular. This work proposes a method for SPMD applications that manages the communication heterogeneity (different cache levels, RAM, network, etc.) of a homogeneous multi-core computing platform in order to improve application efficiency. The main objective of this work is to determine analytically the ideal number of cores that yields the maximum speedup while the computational efficiency is maintained over a defined threshold (strong scalability). The method also allows us to determine how the problem size must be increased in order to keep the execution time constant while the number of cores is expanded (weak scalability), considering the tradeoff between speed and efficiency. The methodology has been tested with different benchmarks and applications, and it achieved an average efficiency improvement of around 30.35% in the applications tested, using different problem sizes and multi-core clusters. In addition, results show that the maximum speedup with a defined efficiency is located close to the values calculated with our analytical model, with an error rate lower than 5% for the applications tested.
Grants: Ministerio de Ciencia e Innovación TIN2011-24384
Ministerio de Ciencia e Innovación TIN2014-53172-P
Rights: This document is subject to a Creative Commons license. Total or partial reproduction, distribution, and public communication of the work are permitted, provided that it is not for commercial purposes and that the authorship of the original work is acknowledged. The creation of derivative works is not permitted.
Language: English
Document: Article ; research ; Version accepted for publication
Subject: Performance improvements ; Multi-core ; Mapping ; Scheduling ; Scalability analysis ; Tiling applications ; SPMD
Published in: Future generation computer systems, Vol. 66 (January 2017), p. 11-26, ISSN 0167-739X

DOI: 10.1016/j.future.2016.06.016


Post-print
25 p, 2.3 MB

This record appears in the collections:
Research documents > Documents of the UAB research groups > Research centers and groups (scientific production) > Engineering > HPC4EAS (High Performance Computing for Efficient Applications and Simulation Research Group)
Articles > Research articles
Articles > Published articles

Record created 2016-07-11, last modified 2022-02-06


