Adaptive fault tolerance through invasive computing

Beitrag bei einer Tagung
(Konferenzbeitrag)


Details zur Publikation

Autor(en): Witterauf M, Tanase AP, Teich J, Lari V, Zwinkau A, Snelting G
Verlag: Institute of Electrical and Electronics Engineers Inc.
Jahr der Veröffentlichung: 2015
Tagungsband: Proceedings of the 2015 NASA/ESA Conference on Adaptive Hardware and Systems
Seitenbereich: 1-8
ISBN: 9781467375016


Abstract


Fault tolerance is a basic necessity to make today's complex systems reliable. Adequate fault tolerance, however, demands a high degree of redundancy, possibly wasting resources when the fault probability is low or when some applications do not require fault tolerance. Under the term adaptive fault tolerance, we investigate means to instead provide on-demand fault tolerance on multi-core systems dynamically and according to application and environmental needs. Such means are provided on a per-application basis by invasive computing, a recent paradigm for resource-aware programming and design of parallel systems: applications request resources in an invade phase, infect the acquired resources with code and data, and finally release them in a retreat phase. We show how to use these simple but powerful constructs to adaptively tolerate faults and that invasive computing harmonizes well with many existing fault tolerance approaches. Finally, a case study on adaptively providing fault tolerance for loops demonstrates how effective invasive computing is for adapting to a varying soft error rate and handling of faults.



FAU-Autoren / FAU-Herausgeber

Lari, Vahid
Sonderforschungsbereich/Transregio 89 Invasives Rechnen
Tanase, Alexandru-Petru Dr.-Ing.
Lehrstuhl für Informatik 12 (Hardware-Software-Co-Design)
Teich, Jürgen Prof. Dr.-Ing.
Lehrstuhl für Informatik 12 (Hardware-Software-Co-Design)
Witterauf, Michael
Lehrstuhl für Informatik 12 (Hardware-Software-Co-Design)


Autor(en) der externen Einrichtung(en)
Karlsruhe Institute of Technology (KIT)


Zitierweisen

APA:
Witterauf, M., Tanase, A.-P., Teich, J., Lari, V., Zwinkau, A., & Snelting, G. (2015). Adaptive fault tolerance through invasive computing. In Proceedings of the 2015 NASA/ESA Conference on Adaptive Hardware and Systems (pp. 1-8). Montreal, CA: Institute of Electrical and Electronics Engineers Inc..

MLA:
Witterauf, Michael, et al. "Adaptive fault tolerance through invasive computing." Proceedings of the NASA/ESA Conference on Adaptive Hardware and Systems, AHS 2015, Montreal Institute of Electrical and Electronics Engineers Inc., 2015. 1-8.

BibTeX: 

Zuletzt aktualisiert 2018-10-10 um 14:50