Book Image

Dynamic System Reliability

By : Liudong Xing, Gregory Levitin, Chaonan Wang
Book Image

Dynamic System Reliability

By: Liudong Xing, Gregory Levitin, Chaonan Wang

Overview of this book

This book focuses on hot issues of dynamic system reliability, systematically introducing the reliability modeling and analysis methods for systems with imperfect fault coverage, systems with function dependence, systems subject to deterministic or probabilistic common-cause failures, systems subject to deterministic or probabilistic competing failures, and dynamic standby sparing systems. It presents recent developments of such extensions involving reliability modeling theory, reliability evaluation methods, and features numerous case studies based on real-world examples. The presented dynamic reliability theory can enable a more accurate representation of actual complex system behavior, thus more effectively guiding the reliable design of real-world critical systems. The book begins by describing the evolution from the traditional static reliability theory to the dynamic system reliability theory and provides a detailed investigation of dynamic and dependent behaviors in subsequent chapters. Although written for those with a background in basic probability theory and stochastic processes, the book includes a chapter reviewing the fundamentals that readers need to know in order to understand the contents of other chapters that cover advanced topics in reliability theory and case studies.
Table of Contents (14 chapters)
Preface
Free Chapter
1
Nomenclature
2
1 Introduction
12
Index
13
End User License Agreement

3
Imperfect Fault Coverage

Many systems especially those used in life‐critical or mission‐critical applications such as aerospace, flight controls, nuclear plants, data storage systems and communication systems are fault‐tolerant systems (FTSs) [1, 2]. An FTS can continue to perform its function correctly even in the presence of software errors or hardware failures [3, 4]. Its development typically requires using certain form of redundancy and an automatic reconfiguration and recovery mechanism to restore the system function in the case of the occurrence of a component failure. The mechanism itself (involving fault detection, fault location, fault isolation, and fault recovery) is often not perfect; it can fail such that the system cannot adequately detect, locate, isolate or recover from a component fault happening in the system. The uncovered component fault may propagate through the system and further cause the failure of the entire system or subsystem in spite...