Online Library TheLib.net » Resilience Assessment and Evaluation of Computing Systems

The resilience of computing systems includes their dependability as well as their fault tolerance and security. It defines the ability of a computing system to perform properly in the presence of various kinds of disturbances and to recover from any service degradation. These properties are immensely important in a world where many aspects of our daily life depend on the correct, reliable and secure operation of often large-scale distributed computing systems.

Wolter and her co-editors grouped the 20 chapters from leading researchers into seven parts: an introduction and motivating examples, modeling techniques, model-driven prediction, measurement and metrics, testing techniques, case studies, and conclusions. The core is formed by 12 technical papers, which are framed by motivating real-world examples and case studies, thus illustrating the necessity and the application of the presented methods. While the technical chapters are independent of each other and can be read in any order, the reader will benefit more from the case studies if he or she reads them together with the related techniques.

The papers combine topics like modeling, benchmarking, testing, performance evaluation, and dependability, and aim at academic and industrial researchers in these areas as well as graduate students and lecturers in related fields. In this volume, they will find a comprehensive overview of the state of the art in a field of continuously growing practical importance.




The resilience of computing systems includes their dependability as well as their fault tolerance and security. It defines the ability of a computing system to perform properly in the presence of various kinds of disturbances and to recover from any service degradation. These properties are immensely important in a world where many aspects of our daily life depend on the correct, reliable and secure operation of often large-scale distributed computing systems.

Wolter and her co-editors grouped the 20 chapters from leading researchers into seven parts: an introduction and motivating examples, modeling techniques, model-driven prediction, measurement and metrics, testing techniques, case studies, and conclusions. The core is formed by 12 technical papers, which are framed by motivating real-world examples and case studies, thus illustrating the necessity and the application of the presented methods. While the technical chapters are independent of each other and can be read in any order, the reader will benefit more from the case studies if he or she reads them together with the related techniques.

The papers combine topics like modeling, benchmarking, testing, performance evaluation, and dependability, and aim at academic and industrial researchers in these areas as well as graduate students and lecturers in related fields. In this volume, they will find a comprehensive overview of the state of the art in a field of continuously growing practical importance.


The resilience of computing systems includes their dependability as well as their fault tolerance and security. It defines the ability of a computing system to perform properly in the presence of various kinds of disturbances and to recover from any service degradation. These properties are immensely important in a world where many aspects of our daily life depend on the correct, reliable and secure operation of often large-scale distributed computing systems.

Wolter and her co-editors grouped the 20 chapters from leading researchers into seven parts: an introduction and motivating examples, modeling techniques, model-driven prediction, measurement and metrics, testing techniques, case studies, and conclusions. The core is formed by 12 technical papers, which are framed by motivating real-world examples and case studies, thus illustrating the necessity and the application of the presented methods. While the technical chapters are independent of each other and can be read in any order, the reader will benefit more from the case studies if he or she reads them together with the related techniques.

The papers combine topics like modeling, benchmarking, testing, performance evaluation, and dependability, and aim at academic and industrial researchers in these areas as well as graduate students and lecturers in related fields. In this volume, they will find a comprehensive overview of the state of the art in a field of continuously growing practical importance.
Content:
Front Matter....Pages i-xviii
Front Matter....Pages 1-1
Fault Tolerance and Resilience: Meanings, Measures and Assessment....Pages 3-24
Resilience in Mobile Networks: A Need and a Challenge....Pages 25-40
Assessing Dependability and Resilience in Critical Infrastructures: Challenges and Opportunities....Pages 41-63
Providing Dependability and Resilience in the Cloud: Challenges and Opportunities....Pages 65-81
Front Matter....Pages 83-83
Phase-Type Distributions....Pages 85-113
Scalable Stochastic Modelling for Resilience....Pages 115-149
Front Matter....Pages 151-151
Modelling and Model-Based Assessment....Pages 153-165
Software Aging and Rejuvenation for Increased Resilience: Modeling, Analysis and Applications....Pages 167-183
Online Prediction: Four Case Studies....Pages 185-202
Front Matter....Pages 203-203
Foundations of Metrology in the Observation of Critical Systems....Pages 205-211
Field Studies on Resilience: Measurements and Repositories....Pages 213-237
Failure Diagnosis of Complex Systems....Pages 239-261
Fault Injection....Pages 263-281
Resilience Benchmarking....Pages 283-301
Front Matter....Pages 303-303
Resilience Assessment Based on Performance Testing....Pages 305-322
Robustness Testing Techniques and Tools....Pages 323-339
Front Matter....Pages 341-341
Case Study: Mobile Networks....Pages 343-364
Case Study on Critical Infrastructures: Assessment of Electric Power Systems....Pages 365-390
Providing Dependability and Performance in the Cloud: Case Studies....Pages 391-412
Front Matter....Pages 413-413
Future of Resilience Assessment: The AMBER Research Roadmap....Pages 415-439
Back Matter....Pages 441-489


The resilience of computing systems includes their dependability as well as their fault tolerance and security. It defines the ability of a computing system to perform properly in the presence of various kinds of disturbances and to recover from any service degradation. These properties are immensely important in a world where many aspects of our daily life depend on the correct, reliable and secure operation of often large-scale distributed computing systems.

Wolter and her co-editors grouped the 20 chapters from leading researchers into seven parts: an introduction and motivating examples, modeling techniques, model-driven prediction, measurement and metrics, testing techniques, case studies, and conclusions. The core is formed by 12 technical papers, which are framed by motivating real-world examples and case studies, thus illustrating the necessity and the application of the presented methods. While the technical chapters are independent of each other and can be read in any order, the reader will benefit more from the case studies if he or she reads them together with the related techniques.

The papers combine topics like modeling, benchmarking, testing, performance evaluation, and dependability, and aim at academic and industrial researchers in these areas as well as graduate students and lecturers in related fields. In this volume, they will find a comprehensive overview of the state of the art in a field of continuously growing practical importance.
Content:
Front Matter....Pages i-xviii
Front Matter....Pages 1-1
Fault Tolerance and Resilience: Meanings, Measures and Assessment....Pages 3-24
Resilience in Mobile Networks: A Need and a Challenge....Pages 25-40
Assessing Dependability and Resilience in Critical Infrastructures: Challenges and Opportunities....Pages 41-63
Providing Dependability and Resilience in the Cloud: Challenges and Opportunities....Pages 65-81
Front Matter....Pages 83-83
Phase-Type Distributions....Pages 85-113
Scalable Stochastic Modelling for Resilience....Pages 115-149
Front Matter....Pages 151-151
Modelling and Model-Based Assessment....Pages 153-165
Software Aging and Rejuvenation for Increased Resilience: Modeling, Analysis and Applications....Pages 167-183
Online Prediction: Four Case Studies....Pages 185-202
Front Matter....Pages 203-203
Foundations of Metrology in the Observation of Critical Systems....Pages 205-211
Field Studies on Resilience: Measurements and Repositories....Pages 213-237
Failure Diagnosis of Complex Systems....Pages 239-261
Fault Injection....Pages 263-281
Resilience Benchmarking....Pages 283-301
Front Matter....Pages 303-303
Resilience Assessment Based on Performance Testing....Pages 305-322
Robustness Testing Techniques and Tools....Pages 323-339
Front Matter....Pages 341-341
Case Study: Mobile Networks....Pages 343-364
Case Study on Critical Infrastructures: Assessment of Electric Power Systems....Pages 365-390
Providing Dependability and Performance in the Cloud: Case Studies....Pages 391-412
Front Matter....Pages 413-413
Future of Resilience Assessment: The AMBER Research Roadmap....Pages 415-439
Back Matter....Pages 441-489
....
Download the book Resilience Assessment and Evaluation of Computing Systems for free or read online
Read Download
Continue reading on any device:
QR code
Last viewed books
Related books
Comments (0)
reload, if the code cannot be seen