W. Bland, A. Bouteiller, T. Herault, J. Hursey, G. Bosilca et al., An evaluation of user-level failure mitigation support in MPI, EuroMPI'12, pp.193-203, 2012.

R. D. Blumofe and C. E. Leiserson, Scheduling multithreaded computations by work stealing, Journal of the ACM, vol.46, issue.5, pp.720-748, 1999.

D. Cunningham, D. Grove, B. Herta, A. Iyengar, K. Kawachiya et al., Resilient X10: Efficient failure-aware programming, ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), pp.67-80, 2014.

E. W. Dijkstra and C. S. Scholten, Termination detection for diffusing computations, Information Processing Letters, vol.11, issue.1, pp.1-4, 1980.

G. Kestor, S. Krishnamoorthy, and W. Ma, Localized fault recovery for nested fork-join programs, 2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp.397-408, 2017.

T. H. Lai and L. F. Wu, An (n-1)-resilient algorithm for distributed termination detection, IEEE Transactions on Parallel and Distributed Systems, vol.6, issue.1, pp.63-78, 1995.

J. Lifflander, P. Miller, and L. Kale, Adoption protocols for fanout-optimal fault-tolerant termination detection, ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), 2013.

J. Milthorpe, D. Grove, B. Herta, and O. Tardieu, Exploring the APGAS programming model using the LULESH proxy application, IBM Research, 2015.

R. Stewart, P. Maier, and P. Trinder, Transparent fault tolerance for scalable functional computation, Journal of Functional Programming, vol.26, 2016.

T. The and . Page, TLA+ specification of the optimistic finish protocol and the replication protocol