PUAF 610 TA

Session 2

• Class Review- summary statistics

• STATA Introduction

• Reminder: HW this week

Review: Two types of Statistics

• Descriptive statistics summarize numerical information.

• Inferential statistics uses a sample to infer the population.

Summary statistic

• In descriptive statistics, summary statistics are used to summarize a set of observations.

• Typically, – What is the central value?– How widely are values spread from the

center?– Are there data that are very atypical?– ….

Summary statistic

• a measure of location, or central tendency

• a measure of statistical dispersion

• a measure of the shape of the distribution

Central tendency

• Central tendency relates to the way in which quantitative data tend to cluster around some value.

• A measure of central tendency is any of a number of ways of specifying the “central value”.

Basic measures of central tendency

• Mean

• Median

• Mode

• the sum of all measurements divided by the number of observations in the data set

• population mean () v. sample mean (“x-bar”)

Example

• Assume 4 people take PUAF 610, and their final exam scores are 95, 87, 93, 83. What’s the mean for exam score?

Example

• Mean= (95+87+93+83)/4=89.5

Median

• the middle observation, when data are ordered from smallest to largest

• the point of a distribution that divides the bottom 50% from the top 50% of the data. The median is the 50th percentile.

Median

• If there is an odd number of observations, the median is the middle observation

• If there is an even number of observations, the median is the average of the two middle observations

• If the dataset is arranged in increasing order the median is located at position (n+1)/2

Example

• Calculate the sample median for the following observations: 1, 5, 2, 8, 7.

• Start by sorting the values: 1, 2, 5, 7, 8.

• The median is located at position (n+1)/2=3, thus it is 5.

• An odd number of values.

Example

• Calculate the sample median for the following observations: 1, 5, 2, 8, 7, 2.

• Start by sorting the values: 1, 2, 2, 5, 7, 8.

• The median is located at position (n+1)/2=3.5, Thus, it is the average of the two middlemost terms (2 + 5)/2 = 3.5.

• An even number of values

• the most frequent value in the data set

• It is possible for a distribution to have more than one mode or not to have a mode at all.

Example

• The mode for the following data set

• (1) 1, 2, 2, 3, 4, 7, 9

• (2) 12, 26, 26, 53, 84, 71, 71, 79

• (3) 32, 46, 53, 94, 37, 29

Comparing of Mode, Median and Mean

• Pros and Cons

• For descriptive purposes we might use the measure that suits the data.

• If we would like to infer from samples to populations, the mean is a measure of choice because it can be manipulated mathematically.

Summary statistic

• a measure of statistical dispersion, or variation

Measures of Variation

• Variation is variability or spread in a variable

• Measures of variation are lengths of intervals on the measurement scale that indicate the spread of values in a distribution.

Measures of Variation

• Range

• Quartiles

• Interquartile range

• Variance

• Standard Deviation

• the length of the smallest interval which contains all the data

• (highest value – lowest value) + 1

Quartiles

• any of the three values which divide the sorted data set into four equal parts, so that each part represents one fourth of the sampled population.

Quartiles

• first quartile (Q1) = lower quartile = cuts off lowest 25% of data = 25th percentile

• second quartile (Q2) = median = cuts data set in half = 50th percentile

• third quartile (Q3) = upper quartile = cuts off highest 25% of data, or lowest 75% = 75th percentile

• * The difference between the upper and lower quartiles is called the interquartile range.

Variance

• Describes how far values lie from the mean. • Use the absolute values or to square the

deviation scores to get rid of the minus signs.• Averaging absolute values cannot be used in

more advanced analyses.– By averaging the sum of squared deviations (sum of

squares) we can get a measure that is susceptible to further algebraic manipulations that are difficult or impossible with absolute values.

Variance• Less intuitive and more difficult to interpret,

because it is measured in squared units rather than original units

• Do not use variance much

• (in population) and (in sample)

where μ is the mean and N is the number of population.

Standard deviation

•A widely used measure of the variability or dispersion.

•It shows how much variation there is from the "average“.

•Standard deviation is obtained by taking a square root of the variance, i.e.

(population) (sample)

Standard deviation

• A low standard deviation indicates that the data points tend to be very close to the mean.

• A high standard deviation indicates that the data is spread out over a large range of values.

Summary statistic

• a measure of statistical dispersion, or variation

Shape of the distribution

• Skewness

• Kurtosis

Skewness

• a measure of the asymmetry of the distribution

• The skewness value can be positive or negative, or even undefined.

Skewness

• negative skew: The left tail is longer; the mass of the distribution is concentrated on the right of the figure. It has relatively few low values.

Skewness

• positive skew: The right tail is longer; the mass of the distribution is concentrated on the left of the figure. It has relatively few high values.

Skewness

• A zero value indicates that the values are relatively evenly distributed on both sides of the mean.

Kurtosis

• a measure of the "peakedness" of the distribution

• Higher kurtosis means more of the variance is the result of infrequent extreme deviations, as opposed to frequent modestly sized deviations

That’s all for class review. So far so good?

Let’s go to STATA!

PUAF 610 TA

Documents

Physics 610 - pages.uoregon.edupages.uoregon.edu/jimbrau/ph610-2014/lectures/610-1.pdf · Physics 610 Adv Particle Physics April 2, 2014 . ... Physics 610 - introduction 12 Project

ΧΡΟΝΟΣ ΕΚΤΕΛΕΣΗΣ ΣΕΡΒΙΣ · rsv4 factory 90 230 610 230 610 790 rsv4 factory a-prc 90 230 610 230 610 790 rsv4 r 90 230 610 230 610 790 rsv4 r a-prc 90 230 610

332 SECTION BB Rev B v12 - Robert Barnes Architects...Metsec 254230 E 25eaves beamall round 610 x229 UB101 610 x229 UB101 610 x229 UB101 610 x229 UB101 610 x229 UB101 533 x210 UB109

West Grove, Pa. 19390 Star Phone: 610-869-9334, Fax: 610 ...westgroveumc.org/wp-content/uploads/2016/05/Newsletter-09-2016.pdf · Phone: 610-869-9334, Fax: 610-869-0110 E:mail:

TCEQ House Bill 610 (HB 610) Viewer User Guide · 2014-02-19 · 2013 12 17 House Bill 610 Viewer User Guide.docx 1 TCEQ House Bill 610 (HB 610) Viewer – User Guide The House Bill

日塗工近似色一覧 - Sangetsu...TA-4789 TA-4790 NEW TA-4791 TA-4792 NEW TA-4793 TA-4794 TA-4795 TA-4796 NEW TA-4797 NEW TA-4780 NEW TA-4781 NEW TA-4782 NEW TA-4783 NEW TA-4784

VERIFICATION - Advantech...Model Name IPC-610-F; IPC610BPF1401E-T; IPC-610-BTO-MAG51; IPC-610XXXXXXXXXXXXXXXX; IPC610XXXXXXXXXXXXXXXX; IPC-610-BTO-XXXXXXXXXXXXXXXX; IPC-610-BTO-MAGXXXXXXXXXXXXXXXX

· 2019-12-04 · Disney's The Lion King KIDS ku-na ma - ta-ta. #15 Hakuna Matata (Part 2) 22 ta - ta! Ha-ku - na ma - ta-ta! ku-na ma-ta-ta. Ha - ku-na ma -ta-ta. Ha - ku-na ma-ta-ta

1 PUAF 610 TA Session 2. 2 Today Class Review- summary statistics STATA Introduction Reminder: HW this week

610 Carcinogenesis

OMNI - usgboral.comTDS).pdf · 60060015 61061015 60060015 61061015 600120015 610122015 60060015 61061015 60060019 61061019 ... Low $ $ $$ $$ $$ $$$ Omni Acoustical

2018 HARRIS COUNTY VOTER PRECINCTS - hctax.net · §¨¦45 §¨¦45 §¨¦10 §¨¦10 610!(288!(225!(146!(6249 £¤90 £¤290 £¤290 1960 2100 §¨¦ 610 §¨¦ 610 §¨¦ 610 529

Examination Nov/Dec 2019 H-610 SUBJECT CODE NO:- H-610 ... TY.pdf · H-610 1 H-610 Total No. of Printed Pages:1 SUBJECT CODE NO:- H-610 FACULTY OF SCIENCE AND TECHNOLOGY T.Y.Arch

PUAF 610 TA

TA/K TA/LI TA/L

Air Filter Catalog...1219 1219 1524 1524 H W 1.1 2.3 5 3.3 7 14 .4 4.7 10 20 .6 4.7 10 20 .6 4.9 10 .3 21 .1 127 147 132 108 35 50 65 80 100 305 610 610 610 610 610 610 610 610 610

2020 UPS Canada · Worldwide Export 1 Congo, Republic of CG – 810 – 610 Cook Islands CK – 810 – 610 ... Russia RU910 –810 610 Rwanda RW – 810– 610 Saba (Netherlands

Vimek 610 SE BioCombi is the market's 610 most efficient system … · 2019. 6. 26. · Vimek 610 SE BioCombi is the market's 610 most efficient system for extracting and transporting

610 SIdata Sheets

API 610 Pumps