Simple Data Anlysis

Embed Size (px)

Citation preview

  • 8/4/2019 Simple Data Anlysis

    1/12

    04/15/1204/15/12 PROF. B.O. BAXIPROF. B.O. BAXI 11

    ICFAI INSZODIACCFAI INSZODIACPof. B.O.BaxiPof. B.O.Baxi

  • 8/4/2019 Simple Data Anlysis

    2/12

    04/15/1204/15/12 PROF. B.O. BAXIPROF. B.O. BAXI 22

    a a na ys s wa na ys s wstastical toolstastical tools Object: To know the health consciousness ofObject: To know the health consciousness of

    Youngsters of AhmedabadYoungsters of AhmedabadAge -Group Hours

    Spend inGym

    15-25 65

    26-45 24

    46-65 10Above 65 1

    Total 100

    H o u r s S p e n d

    1 5 -

    6 5

    2 6 - 4

    2 4

    4 6 -

    1 0 %

    A b o v e

    1 %

  • 8/4/2019 Simple Data Anlysis

    3/12

    04/15/1204/15/12 PROF. B.O. BAXIPROF. B.O. BAXI 33

    Frequency DistributionFrequency Distribution

    In aIn a frequency distributionfrequency distribution,, oneone

    variable is considered at a time.variable is considered at a time.

    A frequency distribution for a variableA frequency distribution for a variable

    produces a table of frequency counts,produces a table of frequency counts,

    percentages, and cumulativepercentages, and cumulativepercentages for all the values associatedpercentages for all the values associated

    with that variable.with that variable.

  • 8/4/2019 Simple Data Anlysis

    4/12

    04/15/1204/15/12 PROF. B.O. BAXIPROF. B.O. BAXI 44

    Tabulation of dataabulation of datacollectedollected Factors determining Service Quality of a BankFactors determining Service Quality of a Bank

    Factors No Of Respondents Amount of Deposit's

    Ambience 24 100000

    Co-operation of Staff 12 240000

    Availability of ValueAdded other service

    39 350000

    Transparency ofDocuments

    12 25000

    Accessibility ofservice

    13 85000

    Total 100 80000

  • 8/4/2019 Simple Data Anlysis

    5/12

    04/15/1204/15/12 PROF. B.O. BAXIPROF. B.O. BAXI 55

    Data Analysis of Commonata Analysis of CommonStatistical Tool.tatistical Tool.HOW DO YOU FIND VARIETY OF SOAP

    Satis Level LUX LIFE BOUY SANTOOR BREEZE GODREJ.1 CINTHOL OTHERS Total

    EXCELLENT 6 6 9 0 0 0 1 22

    GOOD 3 1 3 2 0 0 2 11

    O.K. 2 3 1 0 3 2 2 13

    AVARAGE 0 0 0 0 1 0 3 4

    BAD 0 0 0 0 0 0 0 0

    TOTAL 11 10 13 2 4 2 8 50

  • 8/4/2019 Simple Data Anlysis

    6/12

    04/15/1204/15/12 PROF. B.O. BAXIPROF. B.O. BAXI 66

    Statistics Associated withtatistics Associated withFrequency Distribution Measuresrequency Distribution Measuresof Location.f Location. TheThe meanmean,, or average value, is the mostor average value, is the most

    commonly used measure of central tendency.commonly used measure of central tendency.The mean, ,is given byThe mean, ,is given by

    Where,Where,

    XXii

    = Observed values of the variable= Observed values of the variableXX

    nn = Number of observations (sample size)= Number of observations (sample size)

    TheThe modemode is the value that occurs mostis the value that occurs mostfrequently. It represents the highest peak of thefrequently. It represents the highest peak of thedistribution. The mode is a good measure ofdistribution. The mode is a good measure oflocation when the variable is inherentlylocation when the variable is inherentlycategorical or has otherwise been grouped intocategorical or has otherwise been grouped intocategories.categories.

  • 8/4/2019 Simple Data Anlysis

    7/12

    04/15/1204/15/12 PROF. B.O. BAXIPROF. B.O. BAXI 77

    MEASURES OF CENTRALEASURES OF CENTRALTENDENCYENDENCY TheThe medianmedian of a sample is the middle value when the dataof a sample is the middle value when the data

    are arranged in ascending or descending order. If the numberare arranged in ascending or descending order. If the numberof data points is even, the median is usually estimated as theof data points is even, the median is usually estimated as the

    midpoint between the two middle values by adding the twomidpoint between the two middle values by adding the two

    middle values and dividing their sum by 2. The median is themiddle values and dividing their sum by 2. The median is the

    50th percentile.50th percentile. TOTAL AMOUNT OF INCOME OF AN ENGINNER. NO OFTOTAL AMOUNT OF INCOME OF AN ENGINNER. NO OF

    PEOPLE VISITED STORES ON LAST FIVE SUNDAYS ANDPEOPLE VISITED STORES ON LAST FIVE SUNDAYS AND20 WEEK DAYS.20 WEEK DAYS.

    THOUGH IT IS SIMPLE BUT MOST POWERFULL TOOL OFTHOUGH IT IS SIMPLE BUT MOST POWERFULL TOOL OFDATA ANLYSIS.DATA ANLYSIS.

    TOTAL NO OF COMPLAINTS. REPETITIVE COMPLAINTSTOTAL NO OF COMPLAINTS. REPETITIVE COMPLAINTSOF PARTICULAR NATURE.OF PARTICULAR NATURE.

    Depending upon the circumstances it is to be used by theDepending upon the circumstances it is to be used by theresearchers.researchers.

  • 8/4/2019 Simple Data Anlysis

    8/12

    04/15/1204/15/12 PROF. B.O. BAXIPROF. B.O. BAXI 88

    Statistics Associated withStatistics Associated with

    Frequency DistributionFrequency Distribution

    Measures of VariabilityMeasures of Variability TheThe rangerange measures the spread of the data. It ismeasures the spread of the data. It is

    simply the difference between the largest and smallestsimply the difference between the largest and smallestvalues in the sample.values in the sample.

    Range =Range =XXlargestlargest SmallestSmallest..(Age of the respondents or income level of Res.)(Age of the respondents or income level of Res.)

    TheThe interquartile rangeinterquartile range is the difference between theis the difference between the75th and 25th percentile. For a set of data points75th and 25th percentile. For a set of data pointsarranged in order of magnitude, the parranged in order of magnitude, the pthth percentile ispercentile isthe value that has p% of the data points below it andthe value that has p% of the data points below it and(100 - p)% above it.(100 - p)% above it.

    (Data of Respondents visited during entire day and(Data of Respondents visited during entire day andthen divided in four parts and then find out that whichthen divided in four parts and then find out that whichduring which part maxi no of visitors are there)during which part maxi no of visitors are there)

  • 8/4/2019 Simple Data Anlysis

    9/12

    04/15/1204/15/12 PROF. B.O. BAXIPROF. B.O. BAXI 99

    Statistics Associated withStatistics Associated with

    Frequency DistributionFrequency Distribution

    Measures of VariabilityMeasures of Variability TheThe variancevariance is the mean squared deviation fromis the mean squared deviation from

    the mean. The variance can never be negative.the mean. The variance can never be negative. TheThe standard deviationstandard deviation is the square root of theis the square root of the

    variance.variance.

    TheThe coefficient of variationcoefficient of variation is the ratio of theis the ratio of the

    standard deviation to the mean expressed as astandard deviation to the mean expressed as apercentage, and is a unitless measure of relativepercentage, and is a unitless measure of relativevariability.variability.

    sx= (Xi-X)

    2

    n-1i=1n

    CV= sx/X

  • 8/4/2019 Simple Data Anlysis

    10/12

    04/15/1204/15/12 PROF. B.O. BAXIPROF. B.O. BAXI 1010

    StatisticsStatistics Associated with FrequencyAssociated with FrequencyDistribution Measures of ShapeDistribution Measures of Shape

    Skew ness.Skew ness.The tendency of the deviations from the mean toThe tendency of the deviations from the mean to

    be larger in one direction than in the other. It can be thoughtbe larger in one direction than in the other. It can be thought

    of as the tendency for one tail of the distribution to be heavierof as the tendency for one tail of the distribution to be heavier

    than the other.than the other.

    KurtosisKurtosis is a measure of the relative peaked ness or flatnessis a measure of the relative peaked ness or flatness

    of the curve defined by the frequency distribution. Theof the curve defined by the frequency distribution. The

    kurtosis of a normal distribution is zero. If the kurtosis iskurtosis of a normal distribution is zero. If the kurtosis is

    positive, then the distribution is more peaked than a normalpositive, then the distribution is more peaked than a normal

    distribution. A negative value means that the distribution isdistribution. A negative value means that the distribution is

    flatter than a normal distribution.flatter than a normal distribution.

  • 8/4/2019 Simple Data Anlysis

    11/12

    04/15/1204/15/12 PROF. B.O. BAXIPROF. B.O. BAXI 1111

    Skewness of a DistributionSkewness of a Distribution

    Skewed Distribution

    Symmetric Distribution

    Mean

    MedianMode(a)

    Mean Median Mode

    (b)

  • 8/4/2019 Simple Data Anlysis

    12/12

    04/15/1204/15/12 PROF. B.O. BAXIPROF. B.O. BAXI 1212

    Hypothesis Testing:ypothesis Testing: