Upload
hriffs-guns
View
2.571
Download
0
Tags:
Embed Size (px)
DESCRIPTION
Statistics Topics : 2013 Sabah
Citation preview
MUHAMMAD SHARIFF BIN ARIFIN960713-11-5237SABAH SCIENCE SECONDARY SCHOOL
TABLE OF CONTENT
NUM.CONTENTPAGE
1. Acknowledgement
2. Objective
3. Introduction
4. Part 1 Importance of data analysis in daily life Measure of central tendency Measure of dispersion
5. Part 2 Raw Data Ungrouped Data Analysis Mean Median Mode Standard Deviation Grouped Data Analysis Mean Mode Median Standard Deviation Interquartile Range Comparison between Ungrouped and Grouped Data
6. Part 3 Conjecture on Uniform Data Changes on Grouped Data
7. Further Exploration
8. Reflection
9. Conclusion
ACKNOWLEDGEMENT
First and foremost, praise be to Allah, God the Almighty for giving me strength and will to done this Additional Mathematics Project Work Form 5 2013.
I would like to give a lot of appreciation to the school management lead by Tuan Haji Zaini b. Zair, the Excellent Principal of Sabah Science Secondary School for giving me opportunity and support to done this task.
I would like to thank the most important person, Mr. Rasman b. Sabar, my Additional Mathematics teacher, as he gives us important guidance and commitment during this project work. He has been a very supportive figure throughout the whole project.
I also would like to thanks to all my classmates from 5 Beta for helping me and always supporting me to complete this project work. They have done a vital job, sharing fundamental information with other people including me. Without them this project would never have had its conclusion.
For their strong support, I would like to express my gratitude to my beloved parents for helping me to find the mark to complete this project. They have always been by my side and I hope they will still be there in the future.
Last but not least, I would also like to thank the entire squad of teachers and my friends for helping me collect the much needed data and statistics for this task.
Not forgotten, to all the individuals who were involved directly or indirectly towards making this project a reality.
OBJECTIVE
The aims of carrying out this project work are;
To apply and adapt a variety of problem-solving strategies to solve routine and non-routine problems.
To experience classroom environments which are challenging, interesting and meaningful and hence improve their thinking skills.
To experience classroom environments where knowledge and skills are applied in meaningful ways in solving real-life problems.
To experience classroom environments where expressing students mathematical thinking, reasoning and communication are highly encouraged and expected.
To experience classroom environments that stimulates and enhances effective learning.
To acquire effective mathematical communication through oral and writing, and to use the language of mathematics to express mathematical ideas correctly and precisely.
To enhance acquisition of mathematical knowledge and skills through problem-solving in ways that increases interest and confidence.
To prepare students for the demands of their future undertakings and in workplace.
To realise that mathematics is an important and powerful tool in solving real-life problems and hence develop positive attitude towards mathematics.
To train students not only to be independent learners but also to collaborate, to cooperate, and to share knowledge in an engaging and healthy environment.
To use technology especially the ICT appropriately and effectively.
To train students to appreciate the intrinsic values of mathematics and to become more creative and innovative.
To realise the importance and the beauty of mathematics.
INTRODUCTION
By the 18th century, the term "statistics" designated the systematic collection of demographic and economic data by states. In the early 19th century, the meaning of "statistics" broadened, then including the discipline concerned with the collection, summary, and analysis of data. Today statistics is widely employed in government, business, and all the sciences.
Electronic computers have expedited statistical computation, and have allowed statisticians to develop "computer-intensive" methods. The term "mathematical statistics" designates the mathematical theories of probability and statistical inference, which are used in statistical practice. The relation between statistics and probability theory developed rather late, however. In the 19th century, statistics increasingly used probability theory, whose initial results were found in the 17th and 18th centuries, particularly in the analysis of games of chance (gambling). By 1800, astronomy used probability models and statistical theories, particularly the method of least squares, which was invented by Legendre and Gauss.
Early probability theory and statistics was systematized and extended by Laplace; following Laplace, probability and statistics have been in continual development. In the 19th century, social scientists used statistical reasoning and probability models to advance the new sciences of experimental psychology and sociology; physical scientists used statistical reasoning and probability models to advance the new sciences of thermodynamics and statistical mechanics. The development of statistical reasoning was closely associated with the development of inductive logic and the scientific method.
Statistics is not a field of mathematics but an autonomous mathematical science, like computer science or operations research. Unlike mathematics, statistics had its origins in public administration and maintains a special concern with demography and economics. Being concerned with the scientific method and inductive logic, statistical theory has close association with the philosophy of science; with its emphasis on learning from data and making best predictions, statistics has great overlap with the decision science and microeconomics. With its concerns with data, statistics has overlap with information science and computer science
During the 20th century, the creation of precise instruments for agricultural research, public health concerns (epidemiology, biostatistics, etc.), industrial quality control, and economic and social purposes (unemployment rate, econometric, etc.) necessitated substantial advances in statistical practices. Today the used of statistic has broadened far beyond its origin. Individuals and organisations use statistics to understand data and make informed decisions throughout the natural and social sciences, medicines, business, and other area. Statistics are generally regarded not as the subfield of mathematics but rather as a distinct, allied, field. Many universities maintain separate mathematics and statistic departments. Statistic is also taught in department as diverse as psychology, education and public health.
PART 1Importance of Data Analysis in Daily LifeData analysis is a process of inspecting, cleaning, transforming, and modelling data with the goal of highlighting useful information, suggesting conclusions, and supporting decision making. It has multiple facts and approaches, encompassing diverse techniques under a variety of names, in different business, science, and social science domains. Data analysis can be done in different method according to its purpose, needs and requirement. As example, if the dean of a university wants to know whether the place of living of its student affected their academic performance. In other words, does student who live in university hostel can perform better in examination compared with students who live outside the alma mater? Or are there other factors that responsible for the variance in academic performance? For proving this theory, an analysis of data will help describe students performance and explain the relationship between the performance issues and students place of living.Analysis does not involving complex statistics. Data analysis in schools involves the collection of data and using the data to improve teaching and learning. Interestingly, principals and teachers have it simple. In most cases, the collection of data has already been done. Schools regularly collect attendance data, academic performance records, discipline referrals, and a variety of other useful data. Rather than complex statistical formulas and tests, it is generally simple counts, averages, percents, and rates that educators are interested in. There are many benefits of data analysis. For example, data analysis helps in structuring the findings from different sources of data collection like survey research. It is again very helpful in breaking a macro problem into micro parts. Data analysis acts like a filter when it comes to acquiring meaningful insights out of huge data-set. Every researcher has sort out huge pile of data that he/she has collected, before reaching to a conclusion of the research question. Mere data collection is of no use to the researcher. Data analysis proves to be crucial in this process. It provides a meaningful base to critical decisions. It helps to create a complete dissertation proposal. One of the most important uses of data analysis is that it helps in keeping human bias away from research conclusion with the help of proper statistical treatment. With the help of data analysis a researcher can filter both qualitative and quantitative data for an assignment writing projects. Thus, it can be said that data analysis is of utmost importance for both the research and the researcher. Or to put it in another words data analysis is as important to a researcher as it is important for a doctor to diagnose the problem of the patient before giving him any treatment.
Measure Of Central Tendency Central tendency gets at the typical score on the variable, while dispersion gets at how much variety there is in the scores. When describing the scores on a single variable, it is customary to report on both the central tendency and the dispersion. Not all measures of central tendency and not all measures of dispersion can be used to describe the values of cases on every variable. What choices you have depend on the variables level of measurement.
a) Mean
The mean is what in everyday conversation is called the average. It is calculated by simply adding the values of all the valid cases together and dividing by the number of valid cases.
The mean is an interval/ratio measure of central tendency. Its calculation requires that the attributes of the variable represent a numeric scale. In the physical sciences, such variability may result from random measurement errors: instrument measurements are often not perfectly precise, i.e., reproducible, and there is additional inter-rater variability in interpreting and reporting the measured results. One may assume that the quantity being measured is stable, and that the variation between measurements is due to observational error.
b) ModeThe mode is the attribute of a variable that occurs most often in the data set.
For ungroup data, we can find mode by finding the modal class and draw the modal class and two classes adjacent to the modal class. Two lines from the adjacent we crossed to find the intersection. The intersection value is known as the mode.This information is used by business company to find their opportunity to make profit. For example, a shoe manufacturer can find out the most popular model and size of shoes that consumers demand by using this calculation.c) MedianThe median is a measure of central tendency. It identifies the value of the middle case when the cases have been placed in order or in line from low to high. The middle of the line is as far from being extreme as you can get.
There are as many cases in line in front of the middle case as behind the middle case. The median is the attribute used by that middle case. When you know the value of the median, you know that at least half the cases had that value or a higher value, while at least half the cases had that value or a lower value.This calculation is used to find out the average of something. For example, we can find out the average height of 5 Beta students by simply line up the students according to their height. Then, the student in the middle of the line tells his or her height. His or her height picturing the average of the class height.
Measure Of Dispersiona) RangeThe distance between the minimum and the maximum is called the range. The larger the value of the range, the more dispersed the cases are on the variable; the smaller the value of the range, the less dispersed (the more concentrated) the cases are on the variableRange = maximum value minimum valueInterquartile range (IQR) is the distance between the 75th percentile and the 25th percentile. The IQR is essentially the range of the middle 50% of the data. Because it uses the middle 50%, the IQR is not affected by outliers or extreme values.
IQR = Q3 - Q1b) Standard DeviationThe standard deviation tells you the approximate average distance of cases from the mean. This is easier to comprehend than the squared distance of cases from the mean. The standard deviation is directly related to the variance. If you know the value of the variance, you can easily figure out the value of the standard deviation. The reverse is also true. If you know the value of the standard deviation, you can easily calculate the value of the variance. The standard deviation is the square root of the variance.
PART 2
1. Mid-Semester 1 Additional Mathematics examination scores for 34 students in 5 Beta.
StudentsMarks
1 72
2 70
3 68
4 60
5 57
6 56
7 56
8 55
9 53
10 49
11 48
12 47
13 47
14 47
15 46
16 46
17 43
18 42
19 42
20 42
21 41
22 41
23 39
24 39
25 38
26 38
27 38
28 38
29 36
30 35
31 34
32 30
33 28
34 21
2. Calculation of mean, median, mode and standard deviation.StudentsMarks (x)
1 725184
2 704900
3 684624
4 603600
5 573249
6 563136
7 563136
8 553025
9 532801
10 492401
11 482304
12 472209
13 472209
14 472209
15 462116
16 462116
17 431849
18 421764
19 421764
20 421764
21 411681
22 411681
23 391521
24 391521
25 381444
26 381444
27 381444
28 381444
29 361296
30 351225
31 341156
32 30900
33 28784
34 21441
Question 2a) Mean
N = 34
b) Median
Arrange all of the marks in ascending order:
21, 28, 30, 34, 35, 36, 38, 38, 38, 38, 39, 39, 41, 41, 42, 42, 42, 43, 46, 46, 47, 47, 47, 48, 49, 53, 55, 56, 56, 57, 60, 68, 70, 72.
17th valueMedian of the marks =
= 17th value = 42 marks
c) Mode of the marks:
21, 28, 30, 34, 35, 36, 38, 38, 38, 38, 39, 39, 41, 41, 42, 42, 42, 43, 46, 46, 47, 47, 47, 48, 49, 53, 55, 56, 56, 57, 60, 68, 70, 72.
Occur frequently
Number of the marks that occur frequently = 38 marks.
d) Standard deviation
=
= = 12.50
Question 3 Frequency distributions table:MARKSFREQUENCY
21-303
31-409
41-5013
51-606
61-702
71-801
a) I) Mean The mean mark of 34 students can be found by using the formula:
MarksMidpoint,
Frequency,
21-3025.5376.5
31-4035.59319.5
41-5045.513591.5
51-6055.56333
61-7065.52131
71-8075.5175.5
From the table,f = 34fx = 1527Therefore, mean, = 44.91
II) Mode
The modal class is 41-50, this because, majority of the students got that marks.
To find the mode weight, we draw the modal class and two classes adjacent to the modal class.
From the histogram, the mode mark is 45.5
III) Median
Method 1 By using formula Median is the value of the centre of a set of data Median weight for 34 students can be obtained by using the formula:
MarksLower boundaryUpper boundaryFrequency,Cumulative frequency
21-3020.530.533
31-4030.540.5912
41-5040.550.51325
51-6050.560.5631
61-7060.570.5233
71-8070.580.5134
From the table, Median class = 34 2 = 17th value = 41-50 classL = 70.5 fm = 13 Total frequency N = 34
F = 12 C = 50.5-40.5 = 10
Method 2 by drawing an ogive
Ogive
Ogive is a graph constructed by plotting the cumulative frequency of a set of data against the corresponding upper boundary of each class.
Not only that, ogive is also the method of calculation, the median, and the interquartile range of a set of data can also be estimated from its ogive.
From the ogive,
Median = 44.5IV) Interquartile range
Method I By using formula
class = 34 x = value= 31-40
Total frequency N= 34
= 36.61
class = 34 x = value = 51-60
Therefore, the interquartile range, : = 14Method II By using ogive
From the ogiveInterquartile range = 50.5 36.5 = 14
V) The Standard deviationMethod IMarksMidpoint,Frequency,
21-3025.5376.51950.75
31-4035.59319.511342.25
41-5045.513591.526913.25
51-6055.5633318481.5
61-7065.521318580.5
71-8075.5175.55700.25
= 44.91
= 11.37Method IIMarksMidpoint,Frequency,
21-3025.53-19.41376.751130.25
31-4035.59-9.4188.55796.95
41-5045.5130.590.354.55
51-6055.5610.59112.15672.9
61-7065.5220.59423.95847.9
71-8075.5130.59935.75935.75
b) Mean, x = 44.91 kg Median, m = 44.35 Mode = 45.5
From the above measure of central tendency, mean is the best measure of central tendency because all the values in the data are taken into account when determining the mean. It also quite evenly distributed and no extreme values exist in the data, whereas mode and median does not take all the values in the data into account which decrease the accuracy of central tendency.
b) The standard deviation gives a measure of dispersion of the data about the mean. A direct analogy would be that of the interquartile range, which gives a measure of dispersion about the median. However, the standard deviation is generally more useful than the interquartile range as it includes all data in its calculation. The interquartile range is totally dependent on just two values and ignores all the other observations in the data. This reduces the accuracy it extreme value is present in the data. Since the marks does not contain any extreme value, standard deviation give a better measures compared to interquartile range.
Question 4a) Group data gives a more accurate representation as the data that been grouped is easier to look for patterns and saves more spaces than ungrouped data. Besides, group data also make the calculation much easier than ungrouped data.b) Grouped data is when there is a large number of possible outcomes, we will usually need to group the data. For example, the ages of 200 people entering a park on a Saturday afternoon. The ages have been grouped into the classes 0-9, 10-19, 20-29, and so on. Ungrouped data is the opposite of grouped data with only one possible answer. For example, the ages of 200 people entering a park on a Saturday afternoon. The ages are: 27, 8, 10, 49 and so on.
PART 3
Question 1)
a) If the teacher adds 3 marks for each student in class for completing all their assignment, the new marks for 5 Beta students as shown below
StudentsMarks
158
241
331
442
545
650
737
849
949
1045
1171
1259
1341
1439
1558
1641
1775
1845
1944
2052
2141
2263
2350
2459
2538
2651
2773
2842
2933
3044
3146
3260
3324
3450
b) New frequency distributions table:MarksLower boundaryMidpoint,Frequency,Cumulative frequency
21-3020.525.51125.5650.25
31-4030.535.556177.56301.25
41-5040.545.51521682.531053.75
51-6050.555.5930499.527722.25
61-7060.565.513165.54290.25
71-8070.575.5334226.517100.75
a) New mean,
b) New Mode,
New Mode = 46.5
c) New MedianMedian class = 41-50
Class interval remain same Class interval = 10 c) New interquartile range Q1 class = 34 x = 8.5th value = 31-40
class = 34 x = value = 51-60
Therefore the interquartile range, Q3 Q1 = 51.33-36.61 = 14.72
d) New standard deviation,
Question 2: MarksLower boundaryMidpoint
Frequency,
Cumulative frequency
21-3020.525.51125.5650.25
31-4030.535.556177.56301.25
41-5040.545.51521682.531053.75
51-6050.555.5930499.527722.25
61-7060.565.513165.54290.25
71-8070.575.5334226.517100.75
81-9080.585.503400
91-10090.595.513595.59120.25
New mean,
New standard deviation,
FURTHER EXPLORATION
2. Aspect5 BetaMr. Mas class
Mean44.7976.79
Standard Deviation12.0010.36
Mr. Mas class has gotten better marks for Additional Mathematics subject for this Mid-Year Examination. Mean shows the average scores of students. The mean mark of 5 Beta is 47.8 while Mr. Mas class is 76.79 which are higher than 5 Beta class. This case shows that the students in Mr. Mas class scored better than 5 Beta class. The standard deviation shows that dispersion between the data sets. Standard deviation of Mr. Mas class is lower than the class of 5 Beta. Standard deviation of 5 Beta is 20.28 while standard deviation of Mr. Mas class is 10.36 which are lower than 5 Beta class. This case shows that the marks have small differences. It is proven that Mr. Mas class scored very close to the average. The standard deviation for 5 Beta is greater which means the marks dispersion is larger. This case shows that the marks have greater difference. Given the mean of Mr. Mas class is greater than the class of 5 Beta showing that the average is higher than 5 Beta, concluded that Mr. Mas class has better result than 5 Beta.
REFLECTIONWhile I conducting this project, a lot of information that I found. I have learnt how statistics appear in our daily life.
Apart from that, this project encourages the student to work together and share their knowledge. It is also encourage student to gather information from the internet, improve thinking skills and promote effective mathematical communication.
Not only that, I had learned some moral values that I practice. This project had taught me to responsible on the works that are given to me to be completed. This project also had made me felt more confidence to do works and not to give easily when we could not find the solution for the question. I also learned to be more discipline on time, which I was given about a month to complete this project and pass up to my teacher just in time. I also enjoy doing this project I spend my time with friends to complete this project and it had tighten our friendship. Last but not least, I proposed this project should be continue because it brings a lot of moral value to the student and also test the students understanding in Additional Mathematics.Additional Mathematics...From The Moment I Was Born...From The Moment I Was Able To Hold a Pencil...From The Moment I Start Learning...And..From The Moment I Started to Know Your Name...I Always Thought That You Will Be My Greatest EnemyAndRival In Excelling In My Life...But After Countless Of Hours...Countless Of Days...Countless Of Seasons...After Sacrificing My Precious Time Just For You...Sacrificing My IPhone...Sacrificing My Nikon Camera...Sacrificing My Galaxy Tab...Sacrificing My Facebook...I Realized Something Really Important In You...I Really Love You...You Are My Best Friend Forever...You Are My Superheroes...You Are My Soul Mate...
ADDITIONAL MATHEMATICS PROJECT WORK | 29STATISTICS