35
STATISTICS

STATISTICS. What is Statistics? Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

Embed Size (px)

Citation preview

Page 1: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

STATISTICS

Page 2: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

What is Statistics? Statistics consists of a body of methods

for collecting and analyzing data (Agresti & Finlay, 1997). It is a method of dealing with data. It is a tool concerned with the collection, organization, presentation, analysis and interpretation of numerical information.

Page 3: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

Two branches of statistics. Descriptive Statistics is concerned with the presentation of information in a convenient, usable and understandable form (Runyon and Haber, 1986). Other writers refer to descriptive statistics as the procedure used in describing properties of a sample, or of a population where complete population data are available.

Page 4: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

Example: If we measure the Intelligence Quotient (IQ) of all the students in the School of Graduate Studies and calculate its mean, that mean is a descriptive statistics because it describes the characteristics of a complete population.

Page 5: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

Inferential Statistics is concerned with generalizing this information more specifically, with making inferences about population which are based upon samples taken from population (Runyon & Haber, 1986). Here a sample is selected with the intent of predicting what the larger population is like.

Page 6: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

Example: If we wish to make a statement about the mean IQ of all students in the School of Graduate Studies at the Bukidnon State College computed on a sample of 100 students and estimate the error involved, we use the procedure from inferential statistics.

Page 7: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

Terms and Concepts

Variable and Constant

Variable refers to a characteristics or phenomenon which may take on different values. In addition, a variable is something that has two or more meaningful and useful divisions, categories, characteristics, or values (Grimm & Wozniak, 1990).

Page 8: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

Example: 1. Grade point average 2. Height 3. Weight 4. Tribe 5. Age

These will take on different values when different individuals are observed.

Page 9: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

Another example of variables are: shirt in different sizes (small, medium, large, extra-large). Social class with categories of upper, middle and lower class. Religion with categories of Roman Catholic, Protestant, Seventh Day Adventist, Mormons, etc.

A variable is contrasted with a constant, the value of which never changes.

Example: pi, is a constant which always takes the value of 3.1416….

Page 10: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

Population, Sample and Census

Population is a complete set of individuals, objects or measurements of interest in a study. Sometimes the population is a clearly defined set of subjects.

Page 11: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

Example: We may wish to investigate all the students’ grades after this course to find out relationship between their Grade Point Average and their scores in other foundation subjects.

Page 12: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

Sample is a subset of a population. It is a portion of the population. Oftentimes it is impossible to take all the members of the population because of cost, time and manpower constraints. A subgroup may be selected to represent the total population.

Page 13: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

Example: We may choose only 100 students from the School of Graduate Studies at the Bukidnon State College. The 100 students are then the sample.

Page 14: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

Census is the collection of data from every element in the population (Triola, 1998). In census there is what we call as complete enumeration.

Closely related to the concepts of population and sample are the concepts of parameter and statistic. The following definitions are easy to remember if we recognize the alliteration in “population parameter” and sample statistic.”

Page 15: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

Parameter and Estimates

Parameter is any characteristic of the population which is measurable. It is a numerical measurement describing some characteristic of a population. Usually, parameter or population values are unknown. We estimate them from sample values. In statistical notation, the Greek letters (e.g. . µ and σ are to represent population parameters).

Page 16: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

Example: The grade point average and standard deviation of all students in the School of Graduate Studies.

Page 17: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

Estimate or statistic calculated from a sample in order to estimate the population parameter. It is a numerical summary of the sample data. We shall employ the Roman letters (X and s) to represent estimates. Different symbols are used for parametersand statistics.

Page 18: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

Example: The mean IQ scores of a random sample of students under this class is used to estimate the IQ scores of all the students in School of Graduate Studies. Characteristic Parameter Statistic Mean 𝜇 , mu _ X Standard deviation s Variance 2 S2 Pearson Correlation Coefficient r Number of Cases N n

Page 19: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

The Nature of Data

Some data sets consist of numbers (such as heights, scores in the test, etc.) and others are nonnumerical (such as gender). The terms quantitative and qualitative data are often used to distinguish between these two types.

Page 20: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

1.Quantitative data consists of numbers representing counts or measurements.

. Quantitative data can be described by distinguishing between the discrete and continuous types.

Page 21: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

Discrete data result from either a finite number of possible values or countable number of possible values. The number of possible values is 0, or 1, or 2 and so on. Continuous data result from infinitely many possible values that can be associated with points on a continuous scale in such a way that there are no gaps or interruptions

Page 22: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

. When data represent counts, they are discrete; when they represent measurements, they are continuous.

The number of students in this class is discrete data; the amount each one has in the wallet now is a continuous data because they are measurements that can assume any value over a continuous span.

Page 23: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

Four Levels of Measurement

Another way to classify data is to use four levels of measurement:

1. nominal, 2. ordinal, 3. interval and 4. ratio.

Page 24: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

The nominal level of measurement is characterized by data that consist of names, labels, or categories only. The data cannot be arranged in an ordering scheme (such as low to high). The simplest measurement scale is termed nominal or classificatory.

The categories of nominal variables do not differ by quantity, degree, or amount, but only by kind.

Page 25: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

Example: The two categories of the nominal variable “gender” (male and female) are distinct, do not overlap, include possible sexes, and cannot be ordered or ranked. The same would be true of the nominal variable “region” which might be broken into the categories of NCR, Region I, Region II, Region III, Region IV, Region V, Region VI, Region VII, Region VIII, Region IX, Region X, Region XI, Region XII, and ARMM, etc.

Nominal scales represent the lowest level of measurement because they allow you only to count and compare the number of cases in each category.

Page 26: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

Other examples of nominal scales are given below: The numbers on baseball players’ uniforms

are nominal in nature. In Social Science research, groups in sample are commonly labeled with numbers (such as 1 = Matigsalog, 2 = Talaandig, 3 = Higaonon, 4 = Manobo). However, when these numbers have been attached to categories, averaging the numbers together is not usually advisable. On the scale above for ethnic groups, the average score of 1.87 would have no meaning.

Page 27: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

The ordinal measurement scales involves data that may be arranged in some order, but differences between data values either cannot be determined or are meaningless. The ordinal measurement scales classify people or things into types or kinds, but with one additional feature. Here the classes or categories can be ranked. Ordinal categories are distinct, mutually exclusive, and exhaustive, but they are also orderable in terms of quantity, magnitude, or some other criteria.

Page 28: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

In other words, ordinal measurement scales have the property of magnitude but not the property of equal intervals for the property of absolute 0. It allows us to rank individuals or objects but not to say anything about the meaning of the differences between the ranks.

Page 29: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

Example: For example, the three categories of the ordinal scale “social classes” (upper, middle, and lower) are distinct, do not overlap, include the entire range of social class, and can be ranked: The upper class is higher than the middle class and the middle class is higher than the lower class. No statement can be made however about the amount of difference between categories. The differences between upper and middle and between middle and lower are not calculable.

Page 30: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

Another example is ranking students GPA. If you ranked 1st in a class of 400, the rank indicates greater than or less than, but not how much higher or lower.

Page 31: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

The interval level of measurement is like the ordinal level, with the additional property that we can determine meaningful amounts of differences between data. However, there is no inherent (natural) zero starting point (where none of the quantity is present.

Page 32: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

Although the categories of nominal and ordinal scales cannot be further subdivided on a measurement scale, the values of interval permit distances and differences between values on a scale to be considered or measured. Some social researchers even distinguish between interval and ratio scales. In both cases interval scales are of equal size. Whereas with interval scales there is an arbitrary zero point, however, with ratio variables there is a true zero point where zero is equivalent to a total absence of the variable.

Page 33: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

Example: For example, time measured by calendars temperature on the Fahrenheit scale, and intelligence by IQ scores are interval variables because zero values do not mean the total absence of time, temperature, or intelligence, respectively. In contrast, age, income, and urbanization (percent of a population living in urban places) are ratio variables because zero values do indicate a total absence of those attributes.

Page 34: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

The ratio level of measurement scale is the interval level modified to include the inherent zero starting point (where zero indicates that none of the quantity is present). For values at this level, differences and rations are both meaningfully.

Page 35: STATISTICS. What is Statistics?  Statistics consists of a body of methods for collecting and analyzing data (Agresti & Finlay, 1997). It is a method

For most statistical purposes interval and ratio scales are treated as a similar type of measurement scales. Note, however, that a major difference is the fact that one cannot form ratios with values of interval scale. For example, it is incorrect to say that 60o is twice as hot as 30o; but it is correct to say that PhP 60,000.00 is twice as much as PhP 30,000.00. Because of the scarcity of interval variables, the ambiguity concerning the differences between interval and ratio scales, and their similar statistical treatment, it makes sense to treat these two types of measurement scales as one type.