Distributed online software monitoring of manycore architectures

Abstract : This paper describes the design principles of a software based on-line testing application used to monitor manycore architectures running multi thread functional applications. The key idea is to have a non intrusive monitoring application running in parallel with the functional one. The monitoring application aims at detecting and reacting to software or hardware malfunctions, and can be seen as a service provided by the operating system. This monitoring method relies on the use of embedded sensors that capture physical values (temperature, ...) from the chip, or software-related indicators like CPU load. A case-study implementing this methodology has been performed and results in terms of memory usage and performance overhead are given.
Document type :
Conference papers
Complete list of metadatas

https://hal.archives-ouvertes.fr/hal-01291845
Contributor : Lip6 Publications <>
Submitted on : Tuesday, March 22, 2016 - 11:05:49 AM
Last modification on : Thursday, March 21, 2019 - 1:04:54 PM

Identifiers

Citation

Etienne Faure, Mounir Benabdenbi, François Pêcheux. Distributed online software monitoring of manycore architectures. 16th IEEE International On-Line Testing Symposium, Jul 2010, Corfou Island, Greece. pp.56-61, ⟨10.1109/IOLTS.2010.5560232⟩. ⟨hal-01291845⟩

Share

Metrics

Record views

147