24
Probability and Probability and Statistics in Statistics in Engineering Engineering Philip Bedient, Ph.D. Philip Bedient, Ph.D.

1608 probability and statistics in engineering

Embed Size (px)

Citation preview

Page 1: 1608 probability and statistics in engineering

Probability and Probability and Statistics in Statistics in Engineering Engineering Philip Bedient, Ph.D.Philip Bedient, Ph.D.

Page 2: 1608 probability and statistics in engineering

Probability: Basic IdeasProbability: Basic Ideas Terminology:Terminology:

Trial:Trial: each time you repeat an each time you repeat an experimentexperiment

Outcome:Outcome: result of an experiment result of an experiment Random experiment:Random experiment: one with random one with random

outcomes (cannot be predicted exactly)outcomes (cannot be predicted exactly) Relative frequency:Relative frequency: how many times a how many times a

specific outcome occurs within the entire specific outcome occurs within the entire experiment.experiment.

Page 3: 1608 probability and statistics in engineering

Statistics: Basic IdeasStatistics: Basic Ideas Statistics is the area of science that deals with Statistics is the area of science that deals with

collection, organization, analysis, and collection, organization, analysis, and interpretation of data. interpretation of data.

It also deals with methods and techniques that It also deals with methods and techniques that can be used to draw conclusions about the can be used to draw conclusions about the characteristics of a large number of data points--characteristics of a large number of data points--commonly called a commonly called a populationpopulation-- --

By using a smaller subset of the entire data.By using a smaller subset of the entire data.

Page 4: 1608 probability and statistics in engineering

For Example…For Example… You work in a cell phone factory and are asked to You work in a cell phone factory and are asked to

remove cell phones at random off of the assembly remove cell phones at random off of the assembly line and turn it on and off. line and turn it on and off.

Each time you remove a cell phone and turn it on Each time you remove a cell phone and turn it on and off, you are conducting a and off, you are conducting a random experiment.random experiment.

Each time you pick up a phone is a Each time you pick up a phone is a trialtrial and the and the result is called an result is called an outcomeoutcome..

If you check 200 phones, and you find 5 bad If you check 200 phones, and you find 5 bad phones, thenphones, then

relative frequencyrelative frequency of failure = 5/200 = 0.025 of failure = 5/200 = 0.025

Page 5: 1608 probability and statistics in engineering

Statistics in EngineeringStatistics in Engineering Engineers apply physical Engineers apply physical

and chemical laws and and chemical laws and mathematics to design, mathematics to design, develop, test, and develop, test, and supervise various supervise various products and services. products and services.

Engineers perform tests Engineers perform tests to learn how things to learn how things behave under stress, and behave under stress, and at what point they might at what point they might fail.fail.

Page 6: 1608 probability and statistics in engineering

Statistics in EngineeringStatistics in Engineering As engineers perform experiments, they As engineers perform experiments, they

collect data that can be used to explain collect data that can be used to explain relationships better and to reveal relationships better and to reveal information about the quality of products information about the quality of products and services they provide.and services they provide.

Page 7: 1608 probability and statistics in engineering

Frequency Distribution: Frequency Distribution: Scores for an engineering class are as follows: 58, 95, 80, Scores for an engineering class are as follows: 58, 95, 80,

75, 68, 97, 60, 85, 75, 88, 90, 78, 62, 83, 73, 70, 70, 85, 75, 68, 97, 60, 85, 75, 88, 90, 78, 62, 83, 73, 70, 70, 85, 65, 75, 53, 62, 56, 72, 7965, 75, 53, 62, 56, 72, 79

To better assess the success of the class, we make a To better assess the success of the class, we make a frequency chart:frequency chart:

Page 8: 1608 probability and statistics in engineering

Now the information can be better analyzed. Now the information can be better analyzed. For example, 3 students did poorly, and 3 did For example, 3 students did poorly, and 3 did

exceptionally well. We know that 9 students exceptionally well. We know that 9 students were in the average range of 70-79. We can were in the average range of 70-79. We can also show this data in a freq. histogram (PDF).also show this data in a freq. histogram (PDF).

Divide each no. by 26

Page 9: 1608 probability and statistics in engineering

Cumulative FrequencyCumulative Frequency The data can be further organized by calculating the The data can be further organized by calculating the

cumulative frequencycumulative frequency (CDF) (CDF).. The cumulative frequency shows the cumulative number The cumulative frequency shows the cumulative number

of students with scores up to and including those in the of students with scores up to and including those in the given range. Usually we normalize the data - divide 26.given range. Usually we normalize the data - divide 26.

Page 10: 1608 probability and statistics in engineering

Measures of Central Tendency & Measures of Central Tendency & VariationVariation

Systematic errorsSystematic errors, also called , also called fixed errorsfixed errors, are , are errors associated with using an inaccurate errors associated with using an inaccurate instrument.instrument. These errors can be detected and avoided by properly These errors can be detected and avoided by properly

calibrating instrumentscalibrating instruments Random errorsRandom errors are generated by a number of are generated by a number of

unpredictable variations in a given measurement unpredictable variations in a given measurement situation.situation. Mechanical vibrations of instruments or variations in Mechanical vibrations of instruments or variations in

line voltage friction or humidity could lead to random line voltage friction or humidity could lead to random fluctuations in observations.fluctuations in observations.

Page 11: 1608 probability and statistics in engineering

When analyzing data, the mean alone cannot signal When analyzing data, the mean alone cannot signal possible mistakes. There are a number of ways to define possible mistakes. There are a number of ways to define the dispersion or spread of data. the dispersion or spread of data.

You can compute how much each number deviates from You can compute how much each number deviates from the mean, add up all the deviations, and then take their the mean, add up all the deviations, and then take their average as shown in the table below.average as shown in the table below.

Page 12: 1608 probability and statistics in engineering

As exemplified in Table 19.4, the sum of deviations from As exemplified in Table 19.4, the sum of deviations from the mean for any given sample is always zero. This can the mean for any given sample is always zero. This can be verified by considering the following:be verified by considering the following:

Where Where xxi i represents data points,represents data points, x x is the average,is the average, n n is the is the number of data points, andnumber of data points, and d, d, represents the deviation represents the deviation from the average.from the average.

x 1n

x ii1

n

di (x i x )

Page 13: 1608 probability and statistics in engineering

Therefore the average of the deviations from the Therefore the average of the deviations from the mean of the data set cannot be used to measure mean of the data set cannot be used to measure the spread of a given data set. the spread of a given data set.

Instead we calculate the average of the Instead we calculate the average of the absolute absolute valuesvalues of deviationsof deviations. (This is shown in the third . (This is shown in the third column of table 19.4 in your textbook)column of table 19.4 in your textbook)

For group A the mean deviation is 290, and Group For group A the mean deviation is 290, and Group B is 820. We can conclude that Group B is more B is 820. We can conclude that Group B is more scattered than A.scattered than A.

dii1

n

x ii1

n

x i1

n

dii1

n

nx nx 0

Page 14: 1608 probability and statistics in engineering

VarianceVariance Another way of measuring the data is by Another way of measuring the data is by

calculating the calculating the variancevariance.. Instead of taking the absolute values of Instead of taking the absolute values of

each deviation, you can just square the each deviation, you can just square the deviation and find the means. deviation and find the means.

(n-1) makes estimate unbiased(n-1) makes estimate unbiased

v i1

n

(x i x )2

n 1

Page 15: 1608 probability and statistics in engineering

Taking the square root of the variance Taking the square root of the variance which results in the which results in the standard deviation.standard deviation.

The standard deviation can also provide The standard deviation can also provide information about the relative spread of a information about the relative spread of a data set. data set.

s i1

n

(x i x )2

n 1

Page 16: 1608 probability and statistics in engineering

The mean for a grouped distribution is calculated The mean for a grouped distribution is calculated from:from:

WhereWherexx = midpoints of a given range = midpoints of a given rangef f = frequency of occurrence of data in the = frequency of occurrence of data in the

rangerangenn = = ff = total number of data points = total number of data points

x (xf )n

Page 17: 1608 probability and statistics in engineering

The standard deviation for a grouped distribution is The standard deviation for a grouped distribution is calculated from:calculated from:

s (x x )2 fn 1

Page 18: 1608 probability and statistics in engineering

Normal DistributionNormal Distribution We could use the probability distribution from the figures We could use the probability distribution from the figures

below to predict what might happen in the future. (i.e. below to predict what might happen in the future. (i.e. next year’s students’ performance)next year’s students’ performance)

Page 19: 1608 probability and statistics in engineering

Normal DistributionNormal Distribution Any probability distribution with a bell-shaped Any probability distribution with a bell-shaped

curve is called a curve is called a normal distributionnormal distribution. . The detailed shape of a normal distribution The detailed shape of a normal distribution

curve is determined by its mean and standard curve is determined by its mean and standard deviation values. deviation values.

Page 20: 1608 probability and statistics in engineering
Page 21: 1608 probability and statistics in engineering
Page 22: 1608 probability and statistics in engineering

THE NORMAL CURVETHE NORMAL CURVE

Using Table 19.11, approx. 68% of the data will Using Table 19.11, approx. 68% of the data will fall in the interval of fall in the interval of -s-s to to ss, one std deviation , one std deviation

~ 95% of the data falls between -2~ 95% of the data falls between -2ss to 2 to 2ss, and , and approx all of the data points lie between -3approx all of the data points lie between -3ss to 3 to 3ss

For a standard normal distribution, 68% of the For a standard normal distribution, 68% of the data fall in the interval of data fall in the interval of z z = -1 to = -1 to zz = 1. = 1.

zi = (xi - x) / s

Page 23: 1608 probability and statistics in engineering

AREAS UNDER THE NORMAL CURVEAREAS UNDER THE NORMAL CURVE

zz = -2 and = -2 and zz = 2 (two standard deviations below and = 2 (two standard deviations below and above the mean) each represent 0.4772 of the total area above the mean) each represent 0.4772 of the total area under the curve. under the curve.

99.7% or almost all of the data points lie between -399.7% or almost all of the data points lie between -3ss and 3and 3ss..

Page 24: 1608 probability and statistics in engineering

Analysis of Two HistogramsAnalysis of Two Histograms

Graph A is class distribution of numbers 1-10Graph A is class distribution of numbers 1-10Graph B is class distribution of semester creditsGraph B is class distribution of semester credits

Data for A = 5.64 Data for A = 5.64 +/-+/- 2.6 (much greater spread than B) 2.6 (much greater spread than B)Data for B = 15.7 Data for B = 15.7 +/-+/- 1.96 (smaller spread) 1.96 (smaller spread)Skew of A = -0.16 and Skew B = 0.146Skew of A = -0.16 and Skew B = 0.146CV of A = 0.461 and CV of B = 0.125 (CV = SD/Mean)CV of A = 0.461 and CV of B = 0.125 (CV = SD/Mean)

Frequency A

01234567

2 3 4 5 6 7 8 9 10

Frequency B

0123456789

12 13 14 15 16 17 18 19 20