Fault tolerance of MPI applications in exascale systems: The ULFM solution

Title
Fault tolerance of MPI applications in exascale systems: The ULFM solution
Authors
Keywords
MPI, Resilience, ULFM, Application-level checkpointing
Publisher
Elsevier BV
Online
2020-01-21
DOI
10.1016/j.future.2020.01.026

Ask authors/readers for more resources

Reprint

Contact the author

Find Funding. Review Successful Grants.

Explore over 25,000 new funding opportunities and over 6,000,000 successful grants.

Explore

Discover Peeref hubs

Discuss science. Find collaborators. Network.

Join a conversation