UAB Digital Repository of Documents 2 records found  Search took 0.03 seconds. 
1.
45 p, 1.8 MB Hybrid Message Pessimistic Logging : improving current pessimistic message logging protocols / Meyer, Hugo Daniel (Universitat Autònoma de Barcelona. Departament d'Arquitectura de Computadors i Sistemes Operatius) ; Muresano Cáceres, Ronal Roberto (Universitat Autònoma de Barcelona. Departament d'Arquitectura de Computadors i Sistemes Operatius) ; Castro León, Marcela (Universitat Autònoma de Barcelona. Departament d'Arquitectura de Computadors i Sistemes Operatius) ; Rexachs del Rosario, Dolores Isabel (Universitat Autònoma de Barcelona. Departament d'Arquitectura de Computadors i Sistemes Operatius) ; Luque, Emilio (Universitat Autònoma de Barcelona. Departament d'Arquitectura de Computadors i Sistemes Operatius)
With the growing scale of HPC applications, there has been an increase in the number of interruptions as a consequence of hardware failures. The remarkable decrease of Mean Time Between Failures (MTBF) in current systems encourages the research of suitable fault tolerance solutions. [...]
2017 - 10.1016/j.jpdc.2017.02.003
Journal of parallel and distributed computing, Vol. 104 (2017) , p. 206-222  
2.
14 p, 2.1 MB Fault tolerance at system level based on RADIC architecture / Castro León, Marcela (Universitat Autònoma de Barcelona. Departament d'Arquitectura de Computadors i Sistemes Operatius) ; Meyer, Hugo Daniel (Universitat Autònoma de Barcelona. Departament d'Arquitectura de Computadors i Sistemes Operatius) ; Rexachs del Rosario, Dolores Isabel (Universitat Autònoma de Barcelona. Departament d'Arquitectura de Computadors i Sistemes Operatius) ; Luque, Emilio (Universitat Autònoma de Barcelona. Departament d'Arquitectura de Computadors i Sistemes Operatius)
The increasing failure rate in High Performance Computing encourages the investigation of fault tolerance mechanisms to guarantee the execution of an application in spite of node faults. This paper presents an automatic and scalable fault tolerant model designed to be transparent for applications and for message passing libraries. [...]
2015 - 10.1016/j.jpdc.2015.08.005
Journal of parallel and distributed computing, Vol. 86 (Dec. 2015) , p. 98-111  

Interested in being notified about new results for this query?
Set up a personal email alert or subscribe to the RSS feed.