1-12 Basic Statistics I[1]

Embed Size (px)

Citation preview

  • 8/2/2019 1-12 Basic Statistics I[1]

    1/24

    Basic StatisticsBasic Statistics

  • 8/2/2019 1-12 Basic Statistics I[1]

    2/24

    Copyright NN, Inc. 2004 Company Confidential

    Basic StatisticsBasic Statistics

    Serves as a means to analyze data collected in themeasurement phase.

    Allows us to numerically describe the data which

    characterizes our process KPIVs & KPOVs.

    Uses past process and performance data to make

    inferences about the future.

    Serves as a foundation for advanced statistical

    problem solving methodologies.

    Provides a language based on numerical facts and

    not intuition.

    WHY DO WE NEED BASIC STATISTICS?

  • 8/2/2019 1-12 Basic Statistics I[1]

    3/24

    Copyright NN, Inc. 2004 Company Confidential

    Basic StatisticsBasic Statistics

    The primary objective of statistical analysis is to determine certain

    characteristics of a group from a representative sample.

    To be valid, this generalization must consider certain important

    concepts.

    The Random Sample:A random sample is a subgroup of elements gathered so that the results of

    statistical analysis can be extended to the population from which it came.

    One frequently applied sampling method is to choose the elements

    at random. However, this assumes that each element of the population hasan equal chance of being selected.

    DATA CHARACTERISTICS

  • 8/2/2019 1-12 Basic Statistics I[1]

    4/24

    Copyright NN, Inc. 2004 Company Confidential

    Basic StatisticsBasic Statistics

    Variable Data:

    Variable data can be expressed in the form of numbers and measured

    with a measuring instrument. Since they supply a lot of information

    and are sensitive to small variations, variable data are widely used.

    Examples of variable data The diameter of the holes in a part. The thickness of an aluminum plate. The time taken by a machine tool to perform an operation.

    Room temperature variations. Light intensity variations.

    DATA CHARACTERISTICS

  • 8/2/2019 1-12 Basic Statistics I[1]

    5/24

    Copyright NN, Inc. 2004 Company Confidential

    Attribute Data:Attribute data are based on the number of conforming or nonconforming

    characteristics, but not on their measurement. For example, the attribute

    data indicate whether a part is good or not, or how many parts conform to

    the specifications.

    Attribute data are determined with an instrument like a go/no-go gauge(passes the test or not; yes or no; conforming or nonconforming, etc. ).

    Basic StatisticsBasic Statistics

    Examples of attribute dataNumber of parts whose dimensions do not conform to the tolerances. Number of stored parts. Assessment of the quality of a surface coating ( a lot, moderate, or

    little orange peel). Number of errors on purchase orders. Number of packing slips with the wrong address.

    The type of data, variable or attribute, will determine

    the statistical tools to use in processing the data.

    DATA CHARACTERISTICS

  • 8/2/2019 1-12 Basic Statistics I[1]

    6/24

    Copyright NN, Inc. 2004 Company Confidential

    Basic StatisticsBasic Statistics

    Accurate but Imprecise Process

    LS US

    Nominal

    Inaccurate but Precise Process

    LS US

    Nominal

    PROCESS VARIATION

  • 8/2/2019 1-12 Basic Statistics I[1]

    7/24Copyright NN, Inc. 2004 Company Confidential

    Basic StatisticsBasic Statistics

    Accurate & Precise Process

    LS US

    Nominal

    Inaccurate & Imprecise Process

    LS

    Nominal

    US

    PROCESS VARIATION

  • 8/2/2019 1-12 Basic Statistics I[1]

    8/24Copyright NN, Inc. 2004 Company Confidential

    Basic StatisticsBasic Statistics

    LS US

    LS US

    The dispersion is within the tolerance. The distribution is well centered between

    the limits. The risk of defect is minimal.

    The dispersion is within the tolerance. The distribution is off-center. Production of defects is likely.

    PROCESS CENTERING

  • 8/2/2019 1-12 Basic Statistics I[1]

    9/24Copyright NN, Inc. 2004 Company Confidential

    Basic StatisticsBasic Statistics

    LS US

    LS US

    The dispersion is less than the tolerance. The distribution is very off-center and

    exceeds the upper tolerance. Production of defects has occurred.

    The dispersion is equal to the tolerance. The distribution is centered. The risk of defects is high.

    PROCESS VARIATION

  • 8/2/2019 1-12 Basic Statistics I[1]

    10/24Copyright NN, Inc. 2004 Company Confidential

    Basic StatisticsBasic Statistics

    LS US

    LS US

    The dispersion is greater than the tolerance. The distribution is also off-center. Production of defects has occurred.

    The dispersion is greater than the tolerance. The distribution is centered. Production of defects has occurred.

    PROCESS VARIATION

  • 8/2/2019 1-12 Basic Statistics I[1]

    11/24Copyright NN, Inc. 2004 Company Confidential

    Basic StatisticsBasic Statistics

    LS US

    LS US

    The distribution is bimodal (two peaks). Both dispersions are within the tolerance. Indicates that the data comes from two

    distinct populations ( two machines, two

    shifts, two suppliers, etc.)

    The dispersion is within the tolerance. The distribution is asymmetrical (shifted

    towards one limit). There is a possibility that the data are

    distorted or inaccurate. However, certain

    processes are naturally asymmetrical.

    PROCESS VARIATION

  • 8/2/2019 1-12 Basic Statistics I[1]

    12/24Copyright NN, Inc. 2004 Company Confidential

    Basic StatisticsBasic Statistics

    Population:- The entire group of objects that

    have been made or will be made

    containing the characteristic of

    interest.- Highly unlikely we can ever obtain

    population parameters. all NN-made bearing balls. all Millionaires in the world

    Sample:-The group of objects on which one

    actually gathers data in a statistical

    study.-Usually a sample is a subset of the

    population.

    NN-made balls produced today. Millionaires at NN.

    Population Parameters Sample Parameters

    = Population Mean= Population Standard

    Deviation

    X = Sample Mean

    S = Sample StandardDeviation

    POPULATION vs. SAMPLE

  • 8/2/2019 1-12 Basic Statistics I[1]

    13/24Copyright NN, Inc. 2004 Company Confidential

    Basic StatisticsBasic Statistics

    Mean:-The arithmetic average of a set of values.

    uses the quantitative value of each data point. strongly influenced by extreme values.

    Median:- Number reflecting the 50% rank of a set of values.

    can be easily identified as the center after all of

    the values are sorted from high to low. hardly affected by extreme values.

    Mode:- Most frequently occurring value in a data set.

    n

    n

    ii

    =

    = 1

    MEASUREMENTS OF CENTRAL TENDENCY

    12

    14

    17

    18

    19

    2227

    27

    28

  • 8/2/2019 1-12 Basic Statistics I[1]

    14/24Copyright NN, Inc. 2004 Company Confidential

    Basic StatisticsBasic Statistics

    Range:- Numerical distance between the highest and thelowest values in a data set.

    very sensitive to extreme values in the data.maxmin=Range

    Variance :- Sum of the distances between individual data points and

    population or sample mean. Distances are squared toremove negative.

    very sensitive to extreme values in the data.

    );(22

    s

    1

    )(

    1

    2

    2

    =

    =

    n

    s

    n

    i

    i

    Standard Deviation :- The square root of the variance.

    most commonly used measurement to

    quantify variability.

    );( s

    1

    )(1

    2

    =

    =

    n

    s

    n

    i

    i

    Adding Variances :)(2

    2

    2

    1 +

    2

    1

    2

    2 = Variance of sample 2

    = Variance of sample 1

    2

    2

    2

    1

    2+=Then

    )(2

    2

    2

    1 +=

    MEASUREMENTS OF VARIABILITY

  • 8/2/2019 1-12 Basic Statistics I[1]

    15/24Copyright NN, Inc. 2004 Company Confidential

    Basic StatisticsBasic Statistics

    Rules:= 68.25% of the data will fall within +/- 1 s from the mean

    = 95.46% of the data will fall within +/- 2 s from the mean

    = 99.73% of the data will fall within +/- 3 s from the mean

    = 99.9937% of the data will fall within +/- 4 s from the mean

    = 99.999943% of the data will fall within +/- 5 s from the mean= 99.9999998% of the data will fall within +/- 6 s from the mean

    -4 -3 -2 -1 0 1 2 3 4

    68.3%

    95.5%

    99.73%

    NORMAL CURVE PROPERTIES

  • 8/2/2019 1-12 Basic Statistics I[1]

    16/24Copyright NN, Inc. 2004 Company Confidential

    Basic StatisticsBasic Statistics

    LSL USL

    Long-Term

    Capability

    Short-Term

    Capability

    Over time, a process tends to shift by approximately 1.5.

    THE DYNAMIC PROCESS

  • 8/2/2019 1-12 Basic Statistics I[1]

    17/24Copyright NN, Inc. 2004 Company Confidential

    Basic StatisticsBasic Statistics

    If a manufacturing process is represented by:

    The variation of Y is driven by the variation of the Xs.

    The nature of the variation of each x can differ from the others. Some Xs vary over short cycles, others over long cycles.

    Thus, a process generally exhibits different variation patterns

    over the long term than it does over the short term.

    Y = f(X X X X1 2 3 k , , ,..., )

    LONG-TERM vs. SHORT-TERM VARIATION

  • 8/2/2019 1-12 Basic Statistics I[1]

    18/24Copyright NN, Inc. 2004 Company Confidential

    Basic StatisticsBasic Statistics

    Long Term Capability Covers a relatively long period of time (e.g. weeks, months) Will include effect of long term noise variables . Generally consists of 100-200 points.

    Short Term Capability Covers a relatively short period of time (e.g. days, weeks).

    Will include effect of short term noise variables.

    Generally consists of 30 to 50 data points.

    Instantaneous Capability Covers a very short period of time (e.g. one shift) Minimal effects due to noise variables.

    LONG-TERM vs. SHORT-TERM VARIATION

  • 8/2/2019 1-12 Basic Statistics I[1]

    19/24Copyright NN, Inc. 2004 Company Confidential

    PROCESS CAPABILITY RATIO - Cp

    Basic StatisticsBasic Statistics

    LSL USLTolerance

    Process Width

    -3 +3

    (Max Allowable Range of Characteristic)=

    (Normal Variation of the Process)

    USL - LSL=

    6OR

    99.7% of values.

    pCpC

  • 8/2/2019 1-12 Basic Statistics I[1]

    20/24Copyright NN, Inc. 2004 Company Confidential

    Basic StatisticsBasic Statistics

    This index accounts for the static mean shift in the

    process (the amount that the process is off target).

    Both the Cp andCpk indexes should be considered if upper and

    lower specifications exist. Why?

    USL - X X - LSLCpk= Min or

    3 3

    PROCESS CAPABILITY RATIO - Cpk

  • 8/2/2019 1-12 Basic Statistics I[1]

    21/24Copyright NN, Inc. 2004 Company Confidential

    Basic StatisticsBasic Statistics

    Lower Spec Upper Spec

    3.5 4.5 5.5 6.5

    No Mean Shift

    3.5 6.5 7.55.54.5

    Lower Spec Upper Spec

    Mean Shift of 1.5 Sigma

    Cp = 1.02

    Cpk = 1.02

    Cp = 1.00

    Cpk = 0.52

    3 SIGMA PROCESS

  • 8/2/2019 1-12 Basic Statistics I[1]

    22/24Copyright NN, Inc. 2004 Company Confidential

    Basic StatisticsBasic Statistics

    Lower Spec Upper Spec

    4.0 5.0 6.0 7.0

    No Mean Shift

    3.02.0 8.0

    Lower Spec Upper Spec

    4.0 5.0 6.0 7.0

    Mean Shift of 1.5 Sigma

    3.02.0 8.0

    Cp = 2.04

    Cpk = 2.04

    Cp = 2.01

    Cpk = 1.52

    6 SIGMA PROCESS

  • 8/2/2019 1-12 Basic Statistics I[1]

    23/24Copyright NN, Inc. 2004 Company Confidential

    Basic StatisticsBasic Statistics

    Relation Cp, Cpk and PPM (Parts Per Million)

  • 8/2/2019 1-12 Basic Statistics I[1]

    24/24

    Basic StatisticsBasic Statistics

    Predict the percent of time that the process will fail

    to operate as required.- defects, downgrades or rework

    Set the performance baseline from which to

    measure any improvements made.

    Establish a benchmark against which we cancompare other equipment, other plants, etc.

    KNOWING OUR PROCESS CAPABILITY CAN.