Journal articles

Resilient Computing on ROS using Adaptive Fault Tolerance

Abstract : Computer-based systems are now expected to evolve during their service life to cope with changes of various kinds, ranging from evolving user needs, e.g., additional features requested by users, to system configuration changes, e.g., modifications to the available hardware resources. For resilient embedded systems that must comply with stringent dependability requirements, the challenge is even greater, as evolution must not impair dependability attributes. Maintaining dependability properties in the face of change is, indeed, the very definition of resilient computing. In this paper, we consider the evolution of systems with respect to their dependability mechanisms and show how such mechanisms can evolve along with the system, in the case of ROS, the Robot Operating System. We provide a synthesis of the concepts required for resilient computing using a component-based approach. We particularly emphasize the process and the techniques needed to implement an adaptation layer for fault tolerance mechanisms. In light of this analysis, we address the implementation of Adaptive Fault Tolerance (AFT) on ROS in two steps: first, we provide an architecture to implement fault tolerance mechanisms in ROS; second, we describe the actual adaptation of fault tolerance mechanisms in ROS. Beyond the implementation details given in the paper, we draw lessons from this work and discuss the limits of this run-time support for implementing AFT features in embedded systems.
Contributor : Jean-Charles Fabre
Submitted on : Friday, February 16, 2018 - 3:11:27 PM
Last modification on : Tuesday, June 23, 2020 - 3:02:02 PM
Archived on : Monday, May 7, 2018 - 2:42:28 AM


Files produced by the author(s)



Michaël Lauer, Matthieu Amy, Jean-Charles Fabre, Matthieu Roy, William Excoffon, et al. Resilient Computing on ROS using Adaptive Fault Tolerance. Journal of Software: Evolution and Process, John Wiley & Sons, Ltd., 2018, 30 (3), pp.e1917. ⟨10.1002/smr.1917⟩. ⟨hal-01703968⟩


