Communication Aware Recovery Configurations for Networks-on-Chip
Résumé
In this paper we propose a set of different configurations of failure recovery schemes, developed for network-on-chip (NoC) based systems. These configurations exploit the fact that communication in NoCs tends to be partitioned and eventually localized. The failure recovery approach is based on checkpoint and rollback and is aimed towards fast recovery from system or application level failures. The proposed recovery configurations and partitions of the NoC enhance the performance/overhead of the recovery mechanism. We analyze the effectiveness of these solutions, depending on the traffic characteristics and the expected failure rate.