Transcript

Evaluating the Clinical Literature

Peter Flores-Quilala, RPh, MD

Faculty of Pharmacy

Pharmacy Informatics

• The Age of Information– Too much information– Evaluate critically – Abstract reading &

skimming not valid to draw conclusions (common)

• Validity– Internal

• Methodology

– External• Can the sample be a

generalization for the whole population?

• Results

Usefulness of the Primary Literature• One of the most

important sources of information on new and innovative therapies

• Not perfect• Peer-Review

– Journals published should be peer-reviewed (refereed)

• Syntax of PR– Submission of

manuscripts to journals

– Sent to experts• For comments and

opinions

– Evaluation of reviews• Still mistakes may still

arise after publication

The ultimate responsibility in interpreting the results correctly is left to the reader.

Sampling from the Population

• Subjects are selected at a random from populations

• If we choose to study the effects of two drugs in the treatment of acute otitis media then the sample is those who are actually studied, and the population refers to all patients with acute otitis media.

Sample

Group A

Group B

Random Allocation of Treatment

Apply Intervention

A

Apply Intervention

B

Measure Outcome

Measure Outcome

Compare Results

Draw Conclusions

Population

Sample

The Prospective Comparative Drug Study Process

Association vs Causation

• The ideal clinical study is designed such that a sample of patients with a particular disease state is selected at a random from a population.

• Patients are then randomly assigned to two or more treatment groups

• Assessment are conducted to determine if the groups are similar to each other at baseline

• Then an intervention is applied, and differences in the outcome (if any) are measured. If differences in the outcome are present the assumption is the reason for the difference is due to the effect of the independent variables

• Therefore there is an association between a particular variable and the outcome

• Difficult to prove that causation is the reason for the difference

Example

• Association– A retrospective cohort study to test whether Drug A

causes colon cancer. The results indicate that patients who consume Drug A were twice as likely to develop cancer as those patients not taking the drug.

However, we might not prove that Drug A causes cancer since this is a retrospective study that control of other variables would be necessary like other diet and intake of other drugs within the study

Criteria Used to Determine Causation— Sir Austin Bradford Hill

1. The strength of the association

2. The consistency of the association (reproducibility)

3. The specificity of the association

4. The temporal relationship between the presumed cause and effect

5. The presence of a biological gradient or dose-response relationship

6. The plausibility of the finding with respect to the biological knowledge of the day

7. Coherence with current scientific theory

8. Use of experimental evidence to determine if the frequency of events is related

9. Judgment by analogy, where a finding may be more readily accepted based on similar evidence

May not be met in whole

More of the criteria are met the greater the chance that an association is a causal one

Review of the Research Design

• Prospective• Retrospective• Observational• Parallel• Cross-Over

• Longitudinal• Cross-sectional• Multicenter• Case-control• Cohort

Common Study Designs in Clinical Research

• The study design– Defines what

conclusions may be drawn

– Draws Limitations• Methodology• Logistical• Financial• Ethical

• Comparative Study– One set of

independent variables• Thought to induce

changes in the outcome of the dependent variable

– One ste of dependent variables

Which among the two Non-Steroidal anti-inflammatory drugs can lower core body temperature?

Independent variable—the NSAiD

Dependent variable----- Core Body Temp

Study designs that seek to answer one question is desirable----few samples

More questions-----require larger sample size

• Prospective– For comparative

research– Planned in advance– Exerts maximum

amount of control– Minimizes biases and

confounders

– Limitations• Expensive to conduct• Complex logistical

needs• Address ethical issues

for animal or human subject use

• Retrospective study– Not as powerful as

prospective– Low cost– Ease of data collection

– Limitations• More biases and

confounders

• Observational Study– Similar to retrospective– No attempt to control

certain variables– Patients are simply

observed– No outside

intervention is applied beyond the normal treatment of the patient

– Data are collected in the prospective manner

• Parallel Study Designs– Enroll patients– Conducts the

evaluation– Collect data on the

same points in time for each group being studied

– Utilized by the comparative prospective designs

– Ensure that the environmental influence is the same between groups

• Cross-Over Study– Subjects are exposed

to more than one intervention

– Changes in the dependent variable are compared within the same patient

– Patient are allowed to be their own controls

– Minimizes inter-patient variability

– Requires a washout out period between the interventions

– Can not be used if the effects are permanent• (only transient effects)

• Longitudinal Studies– Data are collected on an

extended period of time

• Multicenter designs– Data collected from different

centers– More representative of the

population– The results also depend on

the site or environment– Use of Analysis of co variance

and Cochran Mantel-Haenszel test controls the phenomenon

• Cross-sectional– Comparison between two or

more groups are made at specific time periods

• Pharmacoepidemiologic studies– Case-control

• Compare groups of patients with a disease to control patients without the disease and look for exposures to certain factors like drugs

– Cohort study• Identifies patients exposed to

some factor and compare them to patient who have not been exposed to determine if there are any differences in a particular clinical outcome

• Pre Clinical Study– Involve the use of

animal models before human subjects

– Used to register as IND

– Undergoes 3 phases of study

• Phase 1– Assess pharmacology,

pharmacokinetics, and safety of the drug

– Conducted with small number of healthy human subjects

– Efficacy is not studied

• Phase 2– To evaluate the efficacy of

the drug– Tested in larger number of

patients– Patients with disease or to

prevent a disease– Strict inclusion criteria

• Phase 3– Efficacy and safety of the

drug– Large scale studies

(hundred to thousands)– Uses randomization,

blinding and control groups– NDA– Drugs are allowed to be

marketed, prescribed and sold

• Phase 4– Post marketing

surveillance– Safety and efficacy– Case reports– Drug utilization evaluation– Less scientific designs– Long term data– Data may include new

indications for the drug– New dosage forms– New patient populations– Long term efficacy– ADR

– Requires prolonged exposure

– Drug stability– Drug interactions– Outcomes research– Pharmacoeconomics

Should be evaluated carefully

Significant variation in their quality

Study designs may not be optimal

Statistical analyses may be inadequate or inappropriate

Presence of bias and confounders may lead to unsupported or invalid conclusions

Chance, Bias, Confounding

• The ideal study is one where the changes in the dependent variable can be attributed to the effect of the independent variables

• These are factors affecting the dependent variables

• May cause potential error

• May become difficult to interpret

• Effect of random chance or unsystematic variation– Coin flip 100x– Probability is the same– May mislead one to believe

that one therapy being investigated is better than the other

• BIAS– ? Favoritism– Systematic variation– Treatment groups under

study are treated and measured differently in a systematic consistent fashion

– Can mislead to conclude erroneously

– Most damaging is selection bias –may be intentional or non intentional

Stages Where bias can occur in a study1. In reading background information for the

study

2. In defining and choosing the study sample

3. In applying the experimental maneuvers (or exposures)

4. In measuring the study’s outcomes

5. In analyzing the data

6. In Interpreting the analysis and results

7. In publishing the findings

Common Biases Found in Research

• Biases of Rhetoric• One-sided reference bias• Positive result bias• Hot stuff bias• Diagnostic access bias• Diagnostic suspicion bias• Wrong sample size bias• Admission rate bias

• Procedure selection bias• Missing clinical data bias• Membership bias• Volunteer bias• Contamination bias• Withdrawal bias• Compliance bias• Attention bias• Post-hoc significance

bias

Confounding

• A confounding variable is one that affects the dependent variable the independent variable or both

• May be impossible to eliminate all confounding variables

• Solution: use the appropriate statistical method

Control Groups

• Used by comparative studies

• As a frame of reference to use when comparing

• Without it, one can not directly compare the results of one study with another study

• Positive control group is usually the Gold Standard therapy

• Dose and dosing should be equivalent

Blinding

• To minimize bias among the study group and the clinician

• Open label studies– No blinding– Both the clinician and

patient are aware • Single blind

– Blinding either the clinician or the patient

• Double blind– Both the clinician and

patient are unaware– Most desirable– Financially, logistically

and ethically impossible

Randomization

• To minimize bias in the study– Random selection

• All from a population have an equal chance of being chosen for the study

– Random Allocation• Assigned to a treatment

group by random chance

• Minimizes selection bias

Hypothesis Testing

• Null hypothesis (Ho)– Opposing hypothesis– Hypothesis of no

difference– No difference in the

outcomes measured– Data collected and

results are used to either to accept the Ho-no diff; or reject Ho-a difference exists

• Research hypothesis– Alternative hypothesis– States that there are

difference in between groups

Random errors

• Type I or alpha error– Ho incorrectly rejected– Thought that there is

difference but in fact there is no difference

– P value usually <0.05 level of significance

• Type II or beta error– Ho is incorrectly accepted– No difference is thought to

exist but actually a difference really does exists

– Related to sample size and statistical power defined as 1-β

– Ie 80%= there is a difference in between groups 80% of the time

– Known as delta (Δ)– Power is directly

proportional to sample size

Sample size parameters:

1. delta

2. reasonable statistical power

3. reasonable clinical difference

Example

• The smaller the difference one wishes the larger sample size is required

• Difference of 20% (Δ) in the cure rates of two antibiotics the study needs 20 patients to achieve 80% power

• To detect a difference of 10% for the same study 200 patients is needed to achieve the same power of 80%

Types of Analyses

• One tailed– Two possibilities only– Ie Drug A is equal Drug B;– Drug A is better than Drug

B and is never worse than Drug B

– Used when one of the group is a placebo group that the active drug may be similar to the placebo but highly unlike that the active drug is will be worse that the placebo

• Two tailed– Utilized in most studies– If you are not sure which

group of drugs is better – Ie Drug A is equal to Drug

B;– Drug A is better than Drug

B;– Drug Ais worse than drug B

Statistical Inference

• Samples– Inclusion criteria– Exclusion criteria

• Useful to maintain the homogeneity of the sample

• Assures patient safety and welfare

• Often excluded, allergy, pregnant, breast feeding, children

• 95% confidence interval (CI)• Blood glucose reduction of

Drug A=45mg/dl; Drug B= 35 mg/dl

• Drug A-Drug B= 10mg/dl• 95% CI = +5, +15• 5mg/dl-15mg/dl• Drug A was superior to Drug B• Values that are (+) positive,

Drug A is always more superior if (Drug A – Drug B); value does not include zero

• If the difference is zero therefore Drug A and B are equal hence they are not statistically different

4 Types of Data

• Nominal– Strictly categorical– Gender (male or female)– Survival outcome

• Ordinal Data– Similar to nominal– Data are in groups or

categories but there are order or magnitude

– Pain scale (VAS 1-10)

• Interval data– Integer values– Temperature scale– Points are arranged in

continuous integer values and the difference between each step is consistent

• Ratio– Most sophisticated type of

data– There is an absolute zero

point or absence of a factor– Blood pressure; zero BP

means no BP unlike Temp 0˚F or centigrade does not mean an absence of temperature

• Age– Ratio data– Ordinal scale

• Appropriate statistical test based on the data– Parametric– Non-parametric

• Evaluation of Variances• Mean Values• The SD square root of the

variance• If with common variances

homoscedastic but if with different variances heteroscedastic.

Common Statistical Tests Used in Clinical

Literature NON PARAMETRIC• Chi-Square Test

– Comparison of nominal data for independent groups (2x2)

• Fischer’s Exact Test– Comparison of

Nominal data for two groups when expected frequencies are <5

• McNemar’s Test– Comparison of

nominal data for two matched or paired groups

• Contingency Table Analysis (R x C)– Comparison of two

nominal data when there are >2 groups or >2 possible outcomes

NON PARAMETRIC

• Cochran Mantel-Haenszel Test– Useful for comparing

nominal data for multiple 2x2 tables (when there are more than one independent variable being considered)

• Wilcoxon Rank Sum or Mann-Whitney U Test– Comparison of

continuous data taken from 2 independent groups, when the data are nonparametric

NON PARAMETRIC

• Wilcoxon Signed Rank Test– Comparison of

continuous data taken from 2 paired groups, when the data are nonparametric

• Kruskal Wallis Test– Comparison of

continuous data taken from >3 independent groups, when the data are nonparametric

Friedman’s Test

Comparison of continuous data taken from >3 paired or matched samples, when the data are nonparametric

PARAMETRIC TEST

• Student’s t-test– Comparison of continuous

data taken from 2 independent groups

• Paired t-test– Comparison of continuous

data taken from 2 paired or matched groups

• 1 way ANOVA– Comparison of continuous

data taken from >3 independent groups

• Repeated Measures– Comparison of continuous

data taken from >3 paired or matched samples

• 2-way ANOVA– Similar to 1 way ANOVA

but comparisons can be made for >2 factors (independent variables simultaneously)

Other Tests

• Pearson Regression– Determines if there is

linear correlation between 2 groups when the data are normally distributed

• Spearman Regression– Determines if there is

linear correlation between 2 groups when the data are NOT normally distributed

• Multivariate Regression– Determines

relationship of multiple variables with a single dependent continuous variable

• Logistic Regression– Determines

relationship of multiple variables with a single dependent dichotomous variable

• Analysis of Covariance (ANCOVA)– Useful for controlling the

effects of a potentially confounding variable on the dependent variable

• Survival Analysis– Evaluates the probability of

achieving or not achieving a specified goal during a specific period of time. A common variable that is often measured with this method is survival, however many other variables with a dichotomous endpoint can be used as well. Other statistical test, such as the log-rank test, can be used to compare the outcomes between the groups

Statistical vs Clinical Significance

• When there is a statistically significant finding (p value <0.05) it means that the chance of getting a type I error is at 5%

• Drug A lowers DBP at 10mmHg while Drug B lowers DBP at 7mmHg at p <0.04

• Statistically significant• But clinically they are

the same or equal

• Drug A remission rate of 90%

• Drug B remission rate of 80%

• P= 0.34

• There is a 34% chance of getting a type I error even if clinician deem this as clinically significant

Basic Elements of an Article

• Abstract • Provides a summary of the article• The objective should be stated, a brief

description of the patients and the methods employed and the main outcomes variable measured.

• The results should be concise and the conclusion based on the findings should be stated

• Introduction • Gives background into the problem• The disease state in question• The typical therapy employed• Results from related studies• And the objective should be clearly

stated

• Methodology • The important section of the paper• Should provide in detail the methodology

employed, such as how the patients were chosen to participate, where the study was conducted, the randomization procedures, the use of blinding or control groups, and the study design.

• The methods of data collection for all outcome variables should be clearly stated.

• Laboratory measurements should be specified and described in detail if considered an outcome variable.

• Any measuring devices or analytic equipment should be described in detail, with an assessment of errors in the measurement if possible

• The statistical methods should be clearly outlined, the level of significance should be stated, and all inferential statistical test used in comparison should be listed. Sample size or statistical power calculations should be described here.

• Results • Presents the finding for the study.• There should be no evaluation or

discussion, just a presentation of the results.

• Tables, graphs and pictures may be used to clarify the findings or present material that is difficult to describe with words.

• Discussion/

Conclusion

• Discuss and interprets the findings from the study.

• May compare or contrast the results with findings from other studies.

• The limitation and strengths of the study should be discussed

• A clear conclusion should be stated based solely on the results of the study

• References • Provides a list of publications that provides background on the problem or evidence of previous investigations on related topics that may support or refute the findings of the study.

Useful Guidelines for Evaluating Statistical Reporting in Medical Articles

• 15 guidelines whew!!!!!!!• Am so exhausted na. Will just email the

test to you.• Good Nyt!!!!!!