4.7 Article

Replication-Based Fault Tolerance for MPI Applications

Related References

Note: only some of the references are listed; download the original article for the complete reference information.
Article Computer Science, Hardware & Architecture

The LAM/MPI checkpoint/restart framework: System-initiated checkpointing

S Sankaran et al.

INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS (2005)

Article Computer Science, Software Engineering

The LINPACK benchmark: past, present and future

JJ Dongarra et al.

CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE (2003)

Article Computer Science, Theory & Methods

A survey of rollback-recovery protocols in message-passing systems

EN Elnozahy et al.

ACM COMPUTING SURVEYS (2002)