50
A. Bobbio Reggio Emilia, June 17-18 , 2003 1 Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions Andrea Bobbio Dipartimento di Informatica Università del Piemonte Orientale, “A. Avogadro” 15100 Alessandria (Italy) bobbio @ unipmn .it - http://www.mfn.unipmn.it/~bobbio/IFOA IFOA, Reggio Emilia, June 17-18, 2003

Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

  • Upload
    ninon

  • View
    57

  • Download
    0

Embed Size (px)

DESCRIPTION

Andrea Bobbio Dipartimento di Informatica Universit à del Piemonte Orientale, “ A. Avogadro ” 15100 Alessandria (Italy) [email protected] - http://www.mfn.unipmn.it/~bobbio/IFOA. Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions. - PowerPoint PPT Presentation

Citation preview

Page 1: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 1

Dependability & Maintainability Theory and Methods

Part 1: Introduction and definitions

Andrea BobbioDipartimento di Informatica

Università del Piemonte Orientale, “A. Avogadro”15100 Alessandria (Italy)

[email protected] - http://www.mfn.unipmn.it/~bobbio/IFOA

IFOA, Reggio Emilia, June 17-18, 2003

Page 2: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 2

Dependability: DefinitionDependability: Definition

Dependability is the property of a system to be dependable in time, i.e. such that reliance can justifiably be placed on the service it delivers.

Dependability extends the interest on the system from the design and construction phase to the operational phase (life cycle).

Page 3: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 3

What dependability theory and practicewants to avoid

Page 4: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 4

dependability

measures

reliabilityavailabilitymaintainabilitysafetysecurity

means fault forecastingfault tolerancefault removalfault prevention

threats faults errorsfailures

Dependability: TaxonomyDependability: Taxonomy

Page 5: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 5

Quantitative analysisQuantitative analysis

The quantitative analysis aims at numerically evaluating measures to characterize the dependability of an item:

Risk assessment and safety

Design specifications

Technical assistance and maintenance

Life cycle cost

Market competition

Page 6: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 6

Risk assessment and safetyThe risk associated to an activity is given proportional to the probability of occurrence of the activity and to the magnitute of the consequences.

A safety critical system is a system whose incorrect behavior may cause a risk to occur, causing undesirable consequences to the item, to the operators, to the population, to the environment.

R = P M

Page 7: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 7

Design specifications

Technological items must be dependable.

Some times, dependability requirements (both qualitative and quantitative) are part of the design specifications:

Mean time between failures

Total down time

Page 8: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 8

Technical assistance and maintenance

The planning of all the activity related to the technical assistance and maintenance is linked to the system dependability (expected number of failure in time).

planning spare parts and maintenance crews;

cost of the technical assistance (warranty period);

preventive vs reactive maintenance.

Page 9: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 9

Market competition

The choice of the consumers is strongly influenced by the perceived dependability.

advertisement messages stress the dependability;

the image of a product or of a brand may depend on the dependability.

Page 10: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 10

Purpose of evaluation

Understanding a system– Observation– Operational environment– Reasoning

Predicting the behavior of a system– Need a model– A model is a convenient abstraction– Accuracy based on degree of extrapolation

Page 11: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 11

Methods of evaluation

Measurement-Based Most believable, most expensive Not always possible or cost effective during system design

Model-Based Less believable, Less expensive Analytic vs Discrete-Event Simulation Combinatorial vs State-Space Methods

Page 12: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 12

Measurement-BasedMost believable, most expensive;

Data are obtained observing the behavior of physical objects.

field observations; measurements on prototypes; measurements on components (accelerated tests).

Page 13: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 13

Closed-formAnswers

NumericalSolution

Analytic

Simulation

All models are wrong; some models are useful

Models

Page 14: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 14

Methods of evaluation

Measurements + Models data bank

Page 15: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 15

The probabilistic approachThe probabilistic approachThe mechanisms that lead to failure a technological object are very complex and depend on many physical, chemical, technical, human, environmental … factors.

The time to failure cannot be expressed by a determin-istic law.

We are forced to assume the time to failure as a random variable.

The quantitative dependability analysis is based on a probabilistic approach.

Page 16: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 16

ReliabilityReliability

The reliability is a measurable attribute of the dependability and it is defined as:

The reliability R(t) of an item at time t is the probability that the item performs the required function in the interval (0 – t) given the stress and environmental conditions in which it operates.

Page 17: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 17

Basic Definitions: cdfLet X be the random variable representing the time to failure of an item.

The cumulative distribution function (cdf) F(t) of the r.v. X is given by:

F(t) = Pr { X t }

F(t) represents the probability that the item is already failed at time t (unreliability) .

Page 18: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 18

Basic Definitions: cdf

Equivalent terminoloy for F(t) :

CDF (cumulative distribution function)

Probability distribution function

Distribution function

Page 19: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 19

Basic Definitions: cdf

1

0

F(t)

ta

F(b)

F(a)

b

F(0) = 0lim F(t) = 1t

F(t) = non-decreasing

Page 20: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 20

Basic Definitions: ReliabilityLet X be the random variable representing the time to failure of an item.

The survivor function (sf) R(t) of the r.v. X is given by:

R (t) = Pr { X > t } = 1 - F(t)

R(t) represents the probability that the item is correctly working at time t and gives the reliability function .

Page 21: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 21

Basic Definitions

Equivalent terminology for R(t) = 1 -F(t) :

Reliability

Complementary distribution function

Survivor function

Page 22: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 22

Basic Definitions: Reliability

1

0

R(t)

ta b

R(0) = 1lim R(t) = 0t

R(t) = non-increasing

R(a)

Page 23: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 23

Basic Definitions: density

Let X be the random variable representing the time to failure of an item and let F(t) be a derivable cdf:

The density function f(t) is defined as:

d F(t)f (t) = ——— dt

f (t) dt = Pr { t X < t + dt }

Page 24: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 24

Basic Definitions: Density

0

f (t)

ta b

f(x) dx = Pr { a < X b } = F(b) – F(a) a

b

Page 25: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 25

Basic Definitions: Density

1

0

f (t)

t

00

dttRdtttfXEMTTF

Page 26: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 26

Basic Definitions

Equivalent terminology: pdf

probability density function

density function

density

f(t) = dtdF ,)(

)()(

0

t

t

dxxf

dxxftF

For a non-negativerandom variable

Page 27: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 27

Quiz 1:The higher the MTTF is, the higher the

item reliability is.1. Correct2. Wrong

The correct answer is wrong !!!

Page 28: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 28

Hazard (failure) rate

h(t) t = Conditional Prob. system will fail in (t, t + t) given that it is survived until time t

f(t) t = Unconditional Prob. System will fail in (t, t + t)

)(1)(

)()()(

tFtf

tRtfth

Page 29: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 29

is the conditional probability that the unit will fail in the interval given that it is functioning at time t.

is the unconditional probability that the unit will fail in the interval

Difference between the two sentences:– probability that someone will die between 90 and 91, given that he

lives to 90– probability that someone will die between 90 and 91

The Failure Rate of a Distribution

tΔth),( ttt

ttf ),( ttt

Page 30: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

30Reggio Emilia, June 17-18, 2003A. Bobbio

DFR IFR

Decreasing failure rate Increasing fail. rate

h(t)

t

CFRConstant fail. rate

(useful life)

(infant mortality – burn in) (wear-out-phase)

Bathtub curve

Page 31: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 31

Infant mortality (dfr)Also called infant mortality phase or reliability growth phase. The failure rate decreases with time.

Caused by undetected hardware/software defects; Can cause significant prediction errors if steady-state failure rates are used;Weibull Model can be used;

Page 32: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 32

Useful life (cfr)The failure rate remains constant in time (age independent) .

Failure rate much lower than in early-life period.

Failure caused by random effects (as environmental shocks).

Page 33: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 33

Wear-out phase (ifr)The failure rate increases with age.

It is characteristic of irreversible aging phenomena (deterioration, wear-out, fatigue, corrosion etc…)

Applicable for mechanical and other systems.

(Properly qualified electronic parts do not exhibit wear-out failure during its intended service life)

Weibull Failure Model can be used

Page 34: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 34

Cumul. distribution function:

Reliability :

Density Function :

Failure Rate (CFR):

Mean Time to Failure:

0 1 tetF t

0 t tetf

0 ttR e t

tRtfth

1MTTF

Exponential DistributionFailure rate is age-independent (constant).

Page 35: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 35

2.50

The Cumulative Distribution Function of an Exponentially Distributed Random

Variable With Parameter = 1

F(t)1.0

0.5

0 1.25 3.75 5.00 t

F(t) = 1 - e - t

Page 36: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 36

2.50

The Reliability Function of an Exponentially Distributed Random

Variable With Parameter = 1

R(t)1.0

0.5

0 1.25 3.75 5.00 t

R(t) = e - t

Page 37: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 37

Exponential Density Function (pdf)

f(t)

MTTF = 1/

Page 38: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 38

Memoryless Property of the Exponential Distribution

Assume X > t. We have observed that the

component has not failed until time t

Let Y = X - t , the remaining (residual) lifetime

y

t

etXPtyXtPtXtyXP

tXyYPyG

1)(

)()|(

)|()(

Page 39: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 39

Memoryless Property of the Exponential Distribution (cont.)

Thus Gt(y) is independent of t and is identical to the original exponential distribution of X

The distribution of the remaining life does not depend on how long the component has been operating

An observed failure is the result of some suddenly appearing failure, not due to gradual deterioration

Page 40: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 40

Quiz 3: If two components (say, A and B) have independent

identical exponentially distributed times to failure, by the “memoryless” property, which of the following is

true? 1. They will always fail at the same time2. They have the same probability of failing at time

‘t’ during operation3. When these two components are operating

simultaneously, the component which has been operational for a shorter duration of time will survive longer

Page 41: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 41

0

0

0 1

1

tetR

tettf

tetF

t

t

t

Weibull Distribution

Distribution Function:

Density Function:

Reliability:

Page 42: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 42

1

1

0 1

)(

)( ttth

tR

tf

Weibull Distribution : shape parameter;

: scale parameter.

Failure Rate:

1 DfrCfr

Ifr

Page 43: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 43

Failure Rate of the Weibull Distribution with Various Values of

Page 44: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 44

Weibull Distribution for Various Values of

Cdf density

Page 45: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 45

We use a truncated Weibull Model

Infant mortality phase modeled by DFR Weibull and the steady-state phase by the exponential

0 2,190 4,380 6,570 8,760 10,950 13,140 15,330 17,520Operating Times (hrs)

Failu

re-R

ate

Mul

tiplie

r

76543210

Figure 2.34 Weibull Failure-Rate Model

Failure Rate Models

Page 46: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 46

Failure Rate Models (cont.)

This model has the form:

where:steady-state failure rate

is Weibull shape parameter

Failure rate multiplier =

SS

W tCt

1)(760,8760,81

tt

SSWC ,11

SSW t)(

Page 47: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 47

Failure Rate Models (cont.)

There are several ways to incorporate time dependent failure rates in availability modelsThe easiest way is to approximate a continuous function by a piecewise constant step function

2,190 4,380 6,570 10,950 13,140 15,330 17,520Operating Times (hrs)

Failu

re-R

ate

Mul

tiplie

r

76543210

Discrete Failure-Rate Model

8,7600

1

2 SS

Page 48: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 48

Failure Rate Models (cont.)

Here the discrete failure-rate model is defined by:

ss

W t

2

1)(

760,8760,8380,4380,40

ttt

Page 49: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 49

A lifetime experimentA lifetime experiment

N i.i.d components are put in a life test experiment.

1

2

3

4

N

t = 0

X 1

X 2X 3

X 4

X N

Page 50: Dependability & Maintainability Theory and Methods Part 1: Introduction and definitions

A. Bobbio Reggio Emilia, June 17-18, 2003 50

A lifetime experimentA lifetime experiment1234

N

X 1X 2

X 3X 4

X N