Per citar aquest document: http://ddd.uab.cat/record/160448
An approach for an efficient execution of SPMD applications on Multi-core environments
Muresano Cáceres, Ronal Roberto (Universitat Autònoma de Barcelona. Departament d'Arquitectura de Computadors i Sistemes Operatius)
Meyer, Hugo Daniel (Universitat Autònoma de Barcelona. Departament d'Arquitectura de Computadors i Sistemes Operatius)
Rexachs del Rosario, Dolores Isabel (Universitat Autònoma de Barcelona. Departament d'Arquitectura de Computadors i Sistemes Operatius)
Luque, Emilio (Universitat Autònoma de Barcelona. Departament d'Arquitectura de Computadors i Sistemes Operatius)

Data: 2017
Resum: Executing traditional Message Passing Interface (MPI) applications on multi-core cluster balancing speed and computational efficiency is a difficult task that parallel programmers have to deal with. For this reason, communications on multi-core clusters ought to be handled carefully in order to improve performance metrics such as efficiency, speedup, execution time and scalability. In this paper we focus our attention on SPMD (Single Program Multiple Data) applications with high communication volume and synchronicity and also following characteristics such as: static, local and regular. This work proposes a method for SPMD applications, which is focused on managing the communication heterogeneity (different cache level, RAM memory, network, etc. ) on homogenous multi-core computing platform in order to improve the application efficiency. In this sense, the main objective of this work is to find analytically the ideal number of cores necessary that allows us to obtain the maximum speedup, while the computational efficiency is maintained over a defined threshold (strong scalability). This method also allows us to determine how the problem size must be increased in order to maintain an execution time constant while the number of cores are expanded (weak scalability) considering the tradeoff between speed and efficiency. This methodology has been tested with different benchmarks and applications and we achieved an average improvement around 30. 35% of efficiency in applications tested using different problems sizes and multi-core clusters. In addition, results show that maximum speedup with a defined efficiency is located close to the values calculated with our analytical model with an error rate lower than 5% for the applications tested.
Nota: Número d'acord de subvenció MICINN/TIN2011-24384
Nota: Número d'acord de subvenció MICINN/TIN2014-53172-P
Drets: Aquest document està subjecte a una llicència d'ús Creative Commons. Es permet la reproducció total o parcial i la comunicació pública de l'obra, sempre que no sigui amb finalitats comercials, i sempre que es reconegui l'autoria de l'obra original. No es permet la creació d'obres derivades. Creative Commons
Llengua: Anglès
Document: article ; recerca ; acceptedVersion
Matèria: Performance improvements ; Multi-core ; Mapping ; Scheduling ; Scalability analysis ; Tiling applications ; SPMD
Publicat a: Future generation computer systems, Vol. 66 (Jan. 2017) , p. 11-26, ISSN 0167-739X

DOI: 10.1016/j.future.2016.06.016


Disponible a partir de: 2019-01-30
Post-print

El registre apareix a les col·leccions:
Documents de recerca > Documents dels grups de recerca de la UAB > Centres i grups de recerca (producció científica) > Enginyeries > HPC4EAS (High Performance Computing for Efficient Applications and Simulation Research Group)
Articles > Articles de recerca
Articles > Articles publicats

 Registre creat el 2016-07-11, darrera modificació el 2016-11-16



   Favorit i Compartir