4
SPECIAL CONTRIBUTION biostatistics Introduction to Biostatistics: Part 1, Basic Concepts Statistical methods commonly used to analyze data presented in journal articles should be understood by both medical scientists and practicing clinicians. Inappropriate data analysis methods have been reported in 42% to 78% of original publications in critical reviews of selected medi- cal journals. The only way to halt researchers' misuse of statistics and improve the clinician's knowledge of statistics is through education. This is the first of a six-part series of articles intended to provide the reader with a basic, yet fundamental knowledge of common biomedical statisti- cal methods. The series will cover basic concepts of statistical analysis, descriptive statistics, statistical inference theory, comparison of means, X 2, and correlational and regression techniques. A conceptual explanation will accompany discussion of the appropriate use of these techniques. [Gaddis ML, Gaddis GM: Introduction to biostatistics: Part I, basic con- cepts. Ann Emerg Med January 1990;19:86-89.] INTRODUCTION The practicing physician must remain abreast of new information and techniques and meet standards for continuing medical education. A major source of new information is the medical journal. Given the competition involved in achieving publication of research, as well as the review and editing process, the reader might justifiably assume that published studies are correct in their methodology, regardless of the conclusions made. Al- though this assumption seems reasonable, it is not correct. Between 1979 and 1984, 42% to 78% of original publications from selected medical jour- nals used inappropriate statistical analysis methods. 1-5 Statistical analysis involves organization and mathematical manipula- tion of data. This process can describe characteristics studied and help to infer conclusions from the data, thus guiding the acceptance or rejection of a given treatment or theory. Incorrect data analysis is a grave error in the research process, often leading to inappropriate conclusions, continued study of erroneous hypotheses, and curtailed study of viable therapies and therapeutic adjuncts. Additional dangers ensue when a physician uses non- efficacious treatment on a patient. From this, it becomes apparent that a basic knowledge of statistics can be an important tool for any clinician, whether in performing research or simply reading about it. However, statistical analysis of data is a task commonly delegated to statistical consultants. This is often justified by citing that there are those more qualified to perform this function than the principal investigator of a study. Though this seems logical, the principal investigator remains the individual ultimately responsible for the content and conclusions of a re- search project. The principal investigator cannot effectively meet this obligation fully if he does not have an adequate working knowledge of biomedical statistics. The investigator cannot plead innocence through ig- norance when serious errors are made. Thus, there is an obvious need for statistical education among clinicians, not only to provide for a better understanding when reading the biomedical literature but also to aid medical researchers in communication with con- sulting statisticians and for selection of appropriate data analysis tech- niques. Monica L Gaddis, PhD Gary M Gaddis, MD, PhD Kansas City, Missouri From the Departments of Surgery and Emergency Health Services, Truman Medical Center, University of Missouri, Kansas City. Received for publication September 1, 1989. Accepted for publication October 4, 1989. Address for reprints: Monica L Gaddis, PhD, Department of Surgery, Truman Medical Center, 2301 Holmes, Kansas City, Missouri 64108. 19:1 January 1990 Annals of Emergency Medicine 86/143

Annals of EM Gaddis Biostat

Embed Size (px)

DESCRIPTION

for stat

Citation preview

Page 1: Annals of EM Gaddis Biostat

SPECIAL CONTRIBUTION biostatistics

Introduction to Biostatistics: Part 1, Basic Concepts

Statistical methods commonly used to analyze data presented in journal articles should be understood by both medical scientists and practicing clinicians. Inappropriate data analysis methods have been reported in 42% to 78% of original publications in critical reviews of selected medi- cal journals. The only way to halt researchers' misuse of statistics and improve the clinician's knowledge of statistics is through education. This is the first of a six-part series of articles intended to provide the reader with a basic, yet fundamental knowledge of common biomedical statisti- cal methods. The series will cover basic concepts of statistical analysis, descriptive statistics, statistical inference theory, comparison of means, X 2, and correlational and regression techniques. A conceptual explanation will accompany discussion of the appropriate use of these techniques. [Gaddis ML, Gaddis GM: Introduction to biostatistics: Part I, basic con- cepts. Ann Emerg Med January 1990;19:86-89.]

I N T R O D U C T I O N The practicing physician must remain abreast of new information and

techniques and meet standards for continuing medical education. A major source of new information is the medical journal. Given the competition involved in achieving publication of research, as well as the review and editing process, the reader might justifiably assume that published studies are correct in their methodology, regardless of the conclusions made. Al- though this assumption seems reasonable, it is not correct. Between 1979 and 1984, 42% to 78% of original publications from selected medical jour- nals used inappropriate statistical analysis methods. 1-5

Statistical analysis involves organization and mathematical manipula- tion of data. This process can describe characteristics studied and help to infer conclusions from the data, thus guiding the acceptance or rejection of a given treatment or theory. Incorrect data analysis is a grave error in the research process, often leading to inappropriate conclusions, continued study of erroneous hypotheses, and curtailed study of viable therapies and therapeutic adjuncts. Additional dangers ensue when a physician uses non- efficacious treatment on a patient. From this, it becomes apparent that a basic knowledge of statistics can be an important tool for any clinician, whether in performing research or simply reading about it.

However, statistical analysis of data is a task commonly delegated to statistical consultants. This is often justified by citing that there are those more qualified to perform this function than the principal investigator of a study. Though this seems logical, the principal investigator remains the individual ultimately responsible for the content and conclusions of a re- search project. The principal investigator cannot effectively meet this obligation fully if he does not have an adequate working knowledge of biomedical statistics. The investigator cannot plead innocence through ig- norance when serious errors are made.

Thus, there is an obvious need for statistical education among clinicians, not only to provide for a better understanding when reading the biomedical literature but also to aid medical researchers in communication with con- sulting statisticians and for selection of appropriate data analysis tech- niques.

Monica L Gaddis, PhD Gary M Gaddis, MD, PhD Kansas City, Missouri

From the Departments of Surgery and Emergency Health Services, Truman Medical Center, University of Missouri, Kansas City.

Received for publication September 1, 1989. Accepted for publication October 4, 1989.

Address for reprints: Monica L Gaddis, PhD, Department of Surgery, Truman Medical Center, 2301 Holmes, Kansas City, Missouri 64108.

19:1 January 1990 Annals of Emergency Medicine 86/143

Page 2: Annals of EM Gaddis Biostat

BIOSTATISTICS Gaddis & Gaddis

FIGURE I. A bell-shaped or normal distribution curve.

FIGURE 2. Frequency distributions: A, bimodal; B, rectangular; C, posi- tively skewed; D, negatively skewed. (Hopkins KD, Glass GV: Basic Statis- tics for the Behavioral Sciences. En- glewood Cliffs, New Jersey, Prentice- Hall Inc, 1978, p 36.)

STATEMENT OF PURPOSE It is our purpose to present a six-

part series discussing the basics of biomedical statistics, with the intent to familiarize the reader with the ter- minology and appropriate use of data analysis techniques commonly used in original papers published in An- nals of Emergency Medic ine and o ther c l in ica l medica l journals . Learning will be directed toward a conceptual understanding of statisti- cal analysis methods rather than computational exercises.

Part 1, in this issue of Annals, pre- sents some basic concepts of statisti- cal analysis. Knowledge of these building blocks is necessary for a complete understanding of biomedi- cal statistics and the topics of future articles in this series.

Part 2 will address descriptive sta- tistics. These include measures of central tendency (mean, median, and mode), measures of variability (stan- dard deviation and standard error of the mean), and confidence intervals. Appropriate uses of these will be dis- cussed.

Part 3 will introduce statistical in- ference theory. An explanation of the concept of hypothesis testing, defini- tion of the probability value P, a dis- cussion of the terms alpha, beta, and power, and a discussion of clinical versus statistical significance will be included. Sensitivity, specificity, and predict ive value will also be ad- dressed.

Part 4 will present parametric and nonparametric methods used for the comparison of means. Included will be analysis of variance (ANOVA), the Student's t test, the Mann-Whitney U test, and methods of mult iple comparisons.

Part 5 will present a discussion of X 2 and Fisher's exact tests. Both sta- tistical methods should be under- stood by readers of biomedical re- search involving a study of the effi- c a c y of e x p e r i m e n t a l m e d i c a l treatments.

1

B, I I

2

o

Part 6 will give a basic discussion of correlation and regression, along with the series' concluding remarks.

SAMPLE VERSUS POPULATION

It is clearly impossible for most scientific studies to survey all indi- viduals of a group about which con- clusions are to be drawn. Cost and t ime considerat ions force the re- searcher to settle for studying a sub- set of a group in order to form con- clusions about the entire group. For example, if one wished to know the systolic, diastolic, and mean blood pressures of the entire population of the United States, this testing could be done as part of the next census, in which all members of the population would be surveyed. However, the cost of training census takers for this time-consuming task and the need to hire additional census takers because of the increased t ime required to check each individual would make sampling the entire population an impractical task. Similarly, in bio- medical research, time and financial costs preclude study of every mem- ber of the population.

However, if a representative sam- ple of an appropriate population can be obtained and studied, conclusions

about the sample can be properly ex- trapolated to the defined population. The key to obtaining a representative sample of that population lies in ran- dom selection of a study sample from the applicable population. Random- ization can be accomplished by sam- ple subject selection using a random number table, drawing numbers or names out of a hat, or the like. Ran- dom sampling implies that all indi- viduals in a population have an equal chance to be included in the sample.

It is when random allocation of study subjects to treatment groups is violated that bias is introduced. Bias can easily lead to erroneous conclu- sions because control and treatment groups may inherently differ in rele- vant characteristics before the study is initiated. Therefore, post-treat- ment differences between groups can erroneously be ascribed to an effect of the experimental treatment! Im- proper allocation of subjects to con - trol and treatment groups remains a significant problem confounding cur- rent biomedical research.

DATA SCALES Before an analysis method can be

selected, the type of data that will be generated by the research process must be defined. Data will fit a nom-

144/87 Annals of Emergency Medicine 19:1 January 1990

DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
Page 3: Annals of EM Gaddis Biostat

Statistical analysis

Population

Sample

Random sample

Nominal scale

Ordinal scale

Interval scale

Ratio scale

Distribution Normal

distribution

Parametric methods

Nonparametric methods

The organization and mathematical manipulation of data, used to describe characteristics studied and/or to help infer conclusions from the data

A large group possessing a given characteristic or set of characteristics. A population may be finite (the states of the United States) or infinite (blood pressure measurements of all infants born in New York) 6

The studied subset of members of a defined population 6

The process of selection of a sample from a population whereby each member of the population has an equal and independent chance of being chosen 6

Numbers are arbitrarily assigned to characteristics for data classification

Numbers are used to denote rank-order, without defining a magnitude of difference between numbers

Numbers denote units of equal magnitude as well as rank order on a scale without an absolute zero

Numbers denote units of equal magnitude as well as rank order on a scale with an absolute zero

The systematic organization of a collection of data

A grouping of data that is graphically symmetrical and bell shaped. Many human anatomic and physiologic characteristics are normally distributed

Used when the data studied are from a sample or population that is normally distributed. Data must be of an interval or ratio scale

Used when the data studied are from a sample or population that deviates from a normal distribution. Ordinal data are analyzed using nonparametric methods of analysis

inal, ordinal, interval, or ratio scale.

Nominal Scale N o m i n a l is the most p r imi t ive of

the data scales. Informat ion classified by an a s s igned n u m b e r or code to make the data numer ic fits a nomi- na l scale. For ins tance, gender m a y be defined as 1 for " female" and 2 for "male . " Cl inical diagnoses may also be ass igned represen ta t ive numbers , such as 1 for " rena l fa i lure ," 2 for "congest ive hear t fai lure," 3 for "dia- betes ," and so forth. It mus t be em- phas ized tha t the number s se lec ted are pure ly arbi t rary and are chosen at the discret ion of the researcher with- out regard to any order of ranking of severity.6, 7

Ordinal Scale Ordinal scale data can be ranked in

a specific order, be i t low to high or h igh to low. An example wou ld be data from a quest ionnaire in which a

response of "s t rongly agree" is scored as 5, "agree" is scored 4, "no opin- ion" is scored 3, "disagree" is scored 2, and "s t rongly disagree" is scored 1. In th i s example , the r e sponses are scored on a c o n t i n u u m , w i t h o u t a cons is ten t level of magni tude of dif- fe rences b e t w e e n ranks . However , un l ike the nomina l scale, numbers of an ordinal scale are not arbi trary be- cause the order of numbers is mean- ingful. 6 For instance, in the above ex- ample, p rogress ive ly larger number s ind ica te progress ive ly greater agree- m e n t w i t h the q u e s t i o n p resen ted . The Glasgow Coma Score s a l lo ts a progressively decreased classif icat ion n u m b e r t ha t i m p l i e s p rog re s s ive ly worsened obtundat ion. Other exam- p les of o r d i n a l sca les f a m i l i a r to m a n y emergency phys ic i ans would inc lude the Trauma Score 9 and the Injury Severi ty Score. 1°

A ve ra ge v a l u e s of o r d i n a l sca le data are often calculated but are usu-

FIGURE 3. Summary of biostatisti- cal terms.

ally mis leading because of the lack of any cons i s t en t magn i tude of differ- ence b e t w e e n un i t s of the scale, u Also, i t is not u n c o m m o n in t rauma outcome studies to see such data re- p o r t e d w i t h s t a n d a r d d e v i a t i o n values. Calcu la t ion of such s ta t is t ics f rom nonparamet r i c , and thus non- n o r m a l l y d i s t r ibu ted data, is h igh ly quest ionable because use of the stan- dard deviat ion assumes that the data are no rma l ly distr ibuted. H

Interval Scale Interval scale data are a step more

sophis t ica ted than ordinal scale data. Not only is there a predetermined or- der to the number ing of the scale, but also the re is a c o n s i s t e n t leve l of magni tude of difference described be- tween the observed data units. 6 Inter- val data also have a c lear ly def ined uni t of measure. However, " the zero po in t on the scale is arbi t rary, and does not co r r e spond to a t o t a l ab- sence of the character is t ic measured . . . . ,,6 The Farenheit scale for tem- perature is interval in nature, as the number ing of the scale is consistent , ye t the zero va lue is arbi t rary. Be- cause of the consis tent number ing of the scale and equa l m a g n i t u d e be- t w e e n m e a s u r e m e n t uni t s , average values and measures of var iabi l i ty of i n t e rva l scale da t a are mean ing fu l . An interval scale can be converted to an ordinal scale to show ranking, but an ordinal scale ordinar i ly cannot be converted to an interval scale. 6

Ratio Scale The rat io scale is s imply an inter-

val scale w i t h an abso lu te zero. 6 A prede termined order to the number- ing of the scale is present, as is a con- s i s ten t level of magn i tude be tween each uni t of measure. Ratio data can be conver ted to ord ina l data. Hear t rate, b lood pressure, d is tance, t ime, and degrees Kelvin represent exam- pies of rat io scale data.

DISTRIBUTIONS Once da ta are col lected, t hey can

be organized into a d i s t r ibu t ion , or g raph of f r e q u e n c y of o c c u r r e n c e . Th is is a v i s u a l l y de sc r ip t i ve tool that al lows the researcher to begin to define and analyze data.

Figure 1 is a theoret ical frequency d i s t r i bu t ion of res t ing hear t rate of

19:1 January 1990 Annals of Emergency Medicine 88/145

DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
Page 4: Annals of EM Gaddis Biostat

BIOSTATISTICS Gaddis & Gaddis

emergency phys ic ians . Its shape is symmetr ical and bell-shaped, and is defined as the "normal distr ibution." Many h u m a n a n a t o m i c and physi- ologic charac ter i s t ics approach the n o r m a l d i s t r ibu t ion . 6 Other te rmi- n o l o g y u s e d s y n o n y m o u s l y w i t h " n o r m a l d i s t r i b u t i o n " i n c l u d e s Gauss ian curve, curve of error, and normal probabili ty curve. An under- standing of the characteristics of the normal d i s t r ibu t ion is f undamen ta l in the deve lopment of even a basic l~nowledge of biostatistics. This topic will be discussed in greater detail in Part 2 of this series.

Other distr ibutions are depicted in Figure 2. 6 Bimodal distr ibutions have two peaks of cluster, or areas wi th a high frequency level. For example, if the weights of American adults were plotted, there would be two defini- tive points of cluster, one for female weight and one for male weight.

Data that are rectangularly distrib- uted show equal frequency of occur- rence for all levels of a characteristic. The date of bir th (month and day) of a sample of all pa t ients seen in an emergency d e p a r t m e n t approaches this dis t r ibut ion pattern.

Skewed data are those that tail off to either the high or low end of mea- surement units . The annual income of patients seen in the ED of an in- ner-city c o m m u n i t y hospital will be defined by a dis t r ibut ion that is pos- i t ively skewed, showing a high fre- quency of lower annual incomes. A negatively skewed dis tr ibut ion has a cluster of data on the high end of the u n i t scale and tails off toward the low end. 6

Given the nature of data collected in medical science research, the nor- ma l d i s t r i b u t i o n is the one w i t h which the cl inician will be most fa- miliar. Most statistical methods ap-

plied to interval or ratio data assume tha t the data are n o r m a l l y distr ib- uted. It is w h e n this a ssumpt ion is violated that significant controversy exists in the s tat is t ical c o m m u n i t y regarding the proper app l ica t ion of statistical tests.

PARAMETRIC VERSUS NONPARAMETRIC METHODS

Statistical methods are defined as being parametr ic or nonparametr ic . The type of analys is m e t h o d that should be selected is dependent on the nature of the data to be analyzed. Parametric methods use data extrap- olated from a sample of the popula- t ion studied to numerical ly describe some characteristic of a population. Paramet r ic me thods are val id on ly when that characteris t ic follows or near ly follows the normal distr ibu- t i o n in t he p o p u l a t i o n s tud ied . 12 Thus, parametric methods can be ap- plied properly to most in terval and ra t io scale data w h e n those data come from a sample of a no rma l ly d i s t r ibu ted popu la t ion . Paramet r ic methods can be used to derive mea- sures of central t endency and vari- ability, which wil l be discussed in Part 2 of the series.

N o n p a r a m e t r i c m e t h o d s are ap- p l ied to n o n - n o r m a l l y d i s t r i b u t e d data and/or data that do not meet the c r i t e r i a for the use of p a r a m e t r i c methods . Data tha t fit the ordinal scale def in i t ion should be analyzed by n o n p a r a m e t r i c methods . Exam- ples inc lude Glasgow Coma Scale, 8 Trauma Score, 9 the Injury Severity Score, lo and o ther s i m i l a r o rd ina l scales data.

SUMMARY This has been the introductory in-

s ta l lment of a series of articles out- l ining the proper use of biostatistics.

We have introduced some basic con- cepts regarding data and its classi- fication, which will be needed for un- derstanding of topics to be presented subsequent ly (Figure 3). Measures of central tendency, measures of vari- ability, confidence intervals, and the appropr ia te use of these s ta t i s t i ca l concepts will be discussed in part 2 of this series.

REFERENCES 1. Glantz SA: Biostatistics: How to detect, cor- rect, and prevent errors in the medical litera- ture. Circulation 1980;61:1-7. 2. Felson DT, Cupples LA, Meenan RF: Misuse of statistical methods in arthritis and rheuma- tism: 1982 versus 1967-68. Arthri t is Rheum 1984;27:1018-1022. 3. Thorn MD, Pulliam CC, Symons MJ, et al: Statistical and research quality of the medical and pharmacy literature. A m J Hosp Pharm 1985;42:1077-1082. 4. Avram MJ, Shanks CA, Dykes MHM, et ah Statistical methods in anesthesia articles: An evaluation of two American journals during two six-month periods. Anes th Analg 1985;64: 607-611. 5. MacArthur RD, Jackson GG: An evaluation of the use of statistical methodology in the Jour- nal of Infectious Diseases. J Infect Dis 1984; 149:349~354. 6. Hopkins KD, Glass GV: Basic Statistics for the Behavioral Sciences. Englewood Cliffs, New Jersey, Prentice-Hall Inc, 1978, 7. Elenbaas RM, Elenbaas JK, Cuddy PG: Eval- uating the medical literature. Part II: Statistical analysis. Ann Emerg Med 1983;12:610-620. 8. Teasdale G, Jennett B: Assessment of coma and impaired consciousness: A practical scale. Lancet 1974;2:81-84.

9. Champion HR, 8acco WJ, Carnazzo AJ, et al: Trauma score. Crit Care Med 1981;9:672-676. 10. Baker SP, O'Neill B, Haddon W Jr, et al: The Injury Severity Score: A method for describing patients with multiple injuries and evaluating emergency care. I Trauma 1974;14:187q96. ll. Sokal RR, Rohlf FJ: Biometry, ed 2. New York, WH Freeman and Co, 198!, pp 39-52. 12. Glantz SA: Primer of Biostatistics, ed 2. New York, McGraw-Hill Book Co, 1987.

146/89 Annals of Emergency Medicine 19:1 January 1990

DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight
DELL
Highlight