biostatistics

1. BIOSTATISTICS

2. Contents Introduction Terminologies Sources & Presentation of Data Measures of Central Tendency Measures of Dispersion Normal curve Sampling Tests of significance Correlation & regression Conclusion

3. Introduction Statistics Statista (Italian) Statistik (German) John Graunt (1620-1674)

4. Introduction Why Statistics?

5. Epidemiology and Statistics Introduction

6. Variable X Constant , mean, standard deviation etc Observation event +measurement Sample Parameter Summary value Mean height, birth rate Statistic Terminologies

7. Terminologies Parametric test population constants are described (mean, variances) Non parametric test- no population constants - data do not follow specific distribution

8. Sources and presentation of data Collective recording of observations either numerical or otherwise.

9. Sources and presentation of data By the investigator himself Interviews, questionnare, oral health examination Primary Data already present Records of OPD Secondar y Classification of Data

10. Sources and presentation of data Nominal Qualitative data Male / Female White / Black Socio- economic status Ordinal Arranged in rank / order Ramu is taller than Ravi and Ravi is taller than Ajay

11. Interval Placed in intervals or order - Uses a scale graded in equal increments - Height, weight, blood pressure Ratio Interval scale data is placed with meaningful ratio - Biomedically most significant - Presented in frequency distribution

12. Qualitative Quantitative

13. Sources and presentation of data Methods of presentation Tabulation Diagrams

14. Tabulation Most common way frequency distribution table Important step in statistical analysis Presents a large amount of data concisely Quantitative and qualitative data Sources and presentation of data

15. Diagrams Through graphs Histogram, frequency polygon, frequency curve, line graph, scatter or dot diagram Quantitative Through diagrams Bar diagrams, pie diagram, picture diagram, map diagram Qualitative Sources and presentation of data

16. Histogram Teeth Pocket depth Pocket depth in five teeth Sources and presentation of data 2 4 6

17. Frequency polygon 0 1 2 3 4 5 6 7 8 1 2 3 4 5 6 Pocket depth Number of teeth 0 1 2 3 4 5 6 7 8 1 2 3 4 5 6 Sources and presentation of data

18. 0 2000 4000 6000 8000 10000 12000 14000 1999 2003 2007 2011 Numberofpeople Prevalence of periodontitis in Belgaum Sources and presentation of data Line graph

19. Pie Chart Post graduate students Orthodontics Periodontics Community dentistry Prosthodontic Sources and presentation of data 32% 32 % 23 %13 %

20. Bar diagram Teeth Pocket depth Pocket depth in five teeth Sources and presentation of data 2 4 6

21. Measures of Central Tendency The single estimate of a series of data that summarizes the data is known as parameter. Objective : Condense the entire mass of data Facilitate comparison 3 types: Mean Median Mode

22. Measures of Central Tendency Mean Simplest Sum of all observation s/number of observation s Median Middle value in a distribution Mode Value of greatest frequency Number of flap surgeries done by five doctors in a week are 7,5,4,9,5 Calculation of Mean = (7+5+4+9+5)/5 Mean = 6 Number of flap surgeries done by five doctors in a week are 7,5,4,9,5 Calculation of Median 4,5,5,7,9 Median = 5 Number of flap surgeries done by five doctors in a week are 7,5,4,9,5 Calculation of Mode 4,5,5,7,9 Mode = 5

23. Measures of Dispersion Measures of central tendency single value to represent data Dispersion - degree of spread or variation of the variable about the central value. 3 types Range Standard deviation Coefficien t of variation

24. Measures of Dispersion Range Simplest method Difference between the value of the smallest item to the value of the largest item

25. Standard deviation Most important and widely used Root mean square deviation Summary measure of the differences of each observation from mean of all observations Greater the deviation greater the dispersion Lesser the deviation greater uniformity Measures of Dispersion

26. Coefficient of variation Standard deviation deviation within a series. Compare two or more series, with different units of measurement Coefficient of variation = Standard deviation Mean 100 Measures of Dispersion

27. Normal curve Properties Bell shaped Symmetrical Height is maximum at mean , Mean=Median=Mode Maximum number of observation at mean and it decreases on either side Relation between mean and standard deviation Forms basis of tests of significance Normal distribution or Gaussian curve

28. Sampling Need for sampling ?? Two types of sample selection Purposive Random

29. Sampling techniques Simple Random Systematic Random Stratified Random Cluster sampling Multiphase sampling Pathfinder survey Sampling

30. Sampling 1. Simple random sampling 2. Systematic random sampling One unit is selected at random and all other at evenly spaced intervals No periodicity of occurrence Lottery Table of Random numbers

31. 3. Stratified Random sampling When the population is not homogenous. Population is divided in homogenous groups, followed by simple random selection Merits : Representative sample from each strata is secured. Gives great accuracy Sampling

32. Disadvantage: Utmost care has to be taken while dividing the population into strata (regarding homogeneity of the strata) Sampling

33. Cluster sampling Natural clusters school, village etc. From these clusters- the entire population is surveyed Advantages: Simple Involves less time and cost Disadvantage : Higher standard error Sampling

34. Multiphase sampling Part of information is collected from whole sample and a part from sub-sample. Advantages : less expensive less laborious more purposeful Sampling

35. All patients on OPD examined (first phase) Only those suffering chronic periodontitis selected (second phase) Only those within the age group of 35-45 years selected (third phase) Sample size keeps on becoming smaller Sampling

36. Pathfinder survey A specified proportion of the population Stratified cluster sampling Subjects in specific index age groups are selected. Helps to assess 1. The variations in severity of disease in different subgroups 2. Picture of age profiles of various oral diseases. Sampling

37. Sample size Optimum size of sample based on following: 1. Approximate idea of estimate of characteristics- Obtained from previous studies or pilot studies prior to starting study. 2. Knowledge about the estimate of precision probability level for precision. Sampling precision= n / s (s=SD)

38. Sample size n = Sample size, p = Approx prevalence rate, L = Permissible error in p estimation, Z = Normal value for probability level. Sampling Z 2 * p * (1-p) L2 n =

39. If p = 10% , investigator allows an error of prevalence rate of 20%, n =900 Sampling 4* 0.1*(0.9) (0.01)2 n =

40. Tests of significance Sampling variability Tests of hypothesis

41. Tests of significance Null Hypothesis and Alternative Hypothesis Null hypothesis No real difference Difference found - accidental Alternative hypothesis Real difference present

42. Level of significance Probability level P Small P value Tests of significance Null Hypothesis rejected P-value 0.05-0.01 Statistically significant < 0.01 Highly statistically significant < 0.001 or 0.005 Very highly statistically significant

43. Degree of freedom Number of independent members in a sample Degree of freedom = (n 1)

44. Tests of significance Standard error Standard error of mean Gives the standard deviation of means of various samples from the same population Measure of chance variation Mean error or mistake Standard error of mean = Standard deviation n

45. Types of error Hypothesis Accept Reject True Right Type I error False Type II error Right Decision Tests of significance

46. Steps involved in testing of hypothesis 1. State Null and Alternative hypothesis 2. Calculate t, F, 2 3. Determine degree of freedom 4. Find probability P using appropriate data 5. Null hypothesis rejected p < 0.05 Null hypothesis accepted p > 0.05 Tests of significance

47. t-test- paired/unpaired ANOVA Test of significance b/w means Pearsons Correlation Coefficient Mann Whitney Wilcoxons signed rank test Mc nemars Kruskal Wallis Freidman Kendalls S Chi-Square Fischers exact Spearmans Rank Correlation Parametric tests Non-parametric tests Tests of significance Classification of tests

48. These are mathematical tests They assess the probability of an observed difference, occurring by chance Most commonly used tests are - Z test, t test, 2 test Tests of significance

49. Students t test Designed by W.S. Gossett Applied to find the difference between two means Criteria for applying t test 1. Random samples 2. Quantitative data 3. Sample size < 30 4. Variable normally distributed Tests of significance

50. Unpaired t test Data of independent observations made on individuals of two different groups or samples Checks sampling variability between experimental and control groups e.g. checking sampling variability between SRP+ subgingival irrigation (experimental group) and SRP alone (control group) Tests of significance

51. Paired t test Paired data of independent observations from one sample only who gives a pair of observations. E.g. sampling variability in the decrease in the microbial load before and after administration of antimicrobial therapy. Tests of significance

52. Wilcoxons signed rank test Developed by Frank Wilcoxon Alternative to the Students paired t test Tests of significance

53. Analysis of Variance (ANOVA) test Compares more than two samples drawn from corresponding normal population E.g : to check if different agents used for subgingival irrigation have an effect on the decrease in microbial load. Use 3 groups (chlorhexidine , saline, povidone iodine) Tests of significance

54. If the difference between their means is significant - different agents used do have different effect on the decrease in microbial load. To assess this difference in means- ANOVA test is important Tests of significance

55. Chi square test Developed by Karl Pearson Data measured - terms of attributes/qualities- intended to test if difference is due to sampling variation Involves calculation of a quantity 3 important applications: 1. Proportion 2. Association 3. Goodness of fit Tests of significance

56. E.g. : Two groups are present Oral hygiene Oral hygiene instructions given instructions not given To assess if there is an association between gingivitis and oral hygiene instructions. Tests of significance

57. Correlation and Regression Correlation The relationship between two quantitatively measured variables Change in the value of one variable, results in a change in the other Magnitude or degree of relationship between two variables is called correlation coefficient (r)

58. Correlation and Regression Pearsons correlation coefficient Pearsons correlation coefficient Variables are normally distributed (height and weight) Variables are not normally distributed (IQ, income) Pearsons correlation coefficient

59. Correlation and Regression Types of correlation 1. r = +1 2. r = - 1 0 < r < 1 4. -1 < r < 0 5. r = 0 1 65 43 2

60. Regression Regression coefficient measure of change in one character (dependent variable - Y) , with one unit change in the independent character (X) Denoted by b Regression line Correlation and Regression

61. Change of dependent variable in linear way Y = a+bX Y = dependent variable a = Y value b = regression coefficient X = independent variable Correlation and Regression

62. Conclusion Clinician Facts Figures Statistics

63. References B K Mahajan ; Methods in Biostatistics, 6th edition Soben Peter ; Essentials of Preventive and Community dentistry , 2nd edition K. Park ; Parks Textbook of Preventive And Social medicine , 19th edition

biostatistics

Education

BIOSTATISTICS - Wiley

Biostatistics II

Biostatistics in Prostate Cancer Research - MUSCpeople.musc.edu/~elg26/talks/Biostatistics in Prostate Cancer Resea… · Biostatistics in Prostate Cancer Research Elizabeth Garrett-Mayer,

Biostatistics Collaboration and Consulting Core (BCCC) · 23-08-2019 · PUBLIC HEALTH SCIENCES . DIVISION OF BIOSTATISTICS . Biostatistics . Collaboration and . Consulting Core

DEPARTMENT OF BIOSTATISTICS Biostatistics Resource and Training · PDF fileDr DEPARTMENT OF BIOSTATISTICS Biostatistics Resource and Training Centre Advanced Clinical Data Management

Biost 518 / Biost 515 Applied Biostatistics II ... · Biost 518 / Biost 515 Applied Biostatistics II / Biostatistics II Scott S. Emerson, M.D., Ph.D. Professor of Biostatistics University

Biostatistics iii

biostatistics basic

Applied biostatistics

Biostatistics basics - Biostatistics

Biostatistics and Experimental Design - …genome.tugraz.at/biostatistics/biostat2009.pdf · Biostatistics and Experimental Design Hubert Hackl Institute for Genomics and Bioinformatics

Hypothesis - Biostatistics

Biostatistics in biology. Why we use biostatistics in biology

Biostatistics - cbb.sjtu.edu.cn

Biostatistics - Math Alliance: The National Alliance fo€¦ · Choose Public Health Biostatistics Biostatistics Health Behavior and Health Nutritional Sciences Epidemiology Health

Biostatistics ANOVA.pptx

Biostatistics Collaboration Center Basic Biostatistics in Medical Research: Emerging Trends November 14,

Applied Biostatistics

Bayesian Biostatistics - USP · Bayesian Biostatistics Emmanuel Lesaﬀre Interuniversity Institute for Biostatistics and statistical Bioinformatics KU Leuven & University Hasselt

Biostatistics khushbu