Statistical presentation in international scientific publications 6. Reporting more complicated...

Preview:

Citation preview

Statistical presentation in international scientific publications 6. Reporting more complicated findings

Malcolm CampbellLecturer in Statistics, School of Nursing, Midwifery &

Social Work, The University of Manchester

Statistical Editor, Health & Social Care in the Community

26 March 2008 Statistical presentation - 6. Reporting more complicated findings

2

6. Report more complicated findingsContents

• 6.1 Introduction

• 6.2 Reporting factor analysis

• 6.3 Reporting analysis of variance

• 6.4 Reporting multiple regression

• 6.5 Reporting logistic regression

• 6.6 Reporting survival analysis

26 March 2008 Statistical presentation - 6. Reporting more complicated findings

3

6.1 IntroductionReporting multivariate analyses

• It’s important to be consistent and give the reader clear, concise but complete information– more of a problem in more complicated analyses!

– find a compromise between giving too little and too much information

– this compromise may depend on the readership of the journal

26 March 2008 Statistical presentation - 6. Reporting more complicated findings

4

6.2 Reporting factor analysisSuggestions – see Tabachnick & Fidell (2001, pp 647-648)

• You should report

– how variables were initially chosen and types of variables involved

– preliminary assessment of factorability

• correlations, measures of sampling adequacy

– methods used for extracting and rotating factors

• which combinations were compared

– how the number of factors was determined

• whether other factor solutions were explored

– variance explained for each factor

– a table of rotated factor loadings

– an interpretation of the rotated factors

• You could also report

– communalities of variables

26 March 2008 Statistical presentation - 6. Reporting more complicated findings

5

The Good 1An excellent methodological paper on factor analysis

• Matthews et al (2006)– An exploratory study of the conditions important in

facilitating the empowerment of midwives, Midwifery 22, 181-191

– methods used

26 March 2008 Statistical presentation - 6. Reporting more complicated findings

6

The Good 2The excellent methodological paper on factor analysis

• Matthews et al (2006)

– sample size

26 March 2008 Statistical presentation - 6. Reporting more complicated findings

7

The Good 3That excellent methodological paper on factor analysis again

• Matthews et al (2006)

– pattern matrix showing interpretation, % variance explained, factor loadings and internal reliability of factors

26 March 2008 Statistical presentation - 6. Reporting more complicated findings

8

6.3 Reporting analysis of varianceSuggestions – simplify Lang and Secic (1997, pp 127-135)

• Analysis of variance:

– usually one-way ANOVA

– rarely two- or more-way ANOVA

– analysis of covariance

– repeated measures ANOVA

• You should report

– appropriate means and standard deviations

– full F-test results

– post-hoc tests allowing for multiple comparisons if required

• You could also report

– (where applicable) an ANOVA table showing sources of variation, sums of squares, mean squares, F-statistics, degrees of freedom and p-values

26 March 2008 Statistical presentation - 6. Reporting more complicated findings

9

The Good 1A paper using repeated measures ANOVA

• Salmon et al (2006)

– An evaluation of the effectiveness of an educational programme promoting the introduction of routine antenatal enquiry for domestic violence, Midwifery 22, 6-14

– methods used

26 March 2008 Statistical presentation - 6. Reporting more complicated findings

10

The Good 2The paper using repeated measures ANOVA

• Salmon et al (2006): descriptive statistics in table

26 March 2008 Statistical presentation - 6. Reporting more complicated findings

11

The Good 3That paper using repeated measures ANOVA again

• Salmon et al (2006)

– test results in text

– the F-test results were not typeset properly!

– “F(2,23)=54.615, p0.001” or “F2,23=54.615, p0.001”

26 March 2008 Statistical presentation - 6. Reporting more complicated findings

12

6.4 Reporting multiple regressionSuggestions – simplify Lang and Secic (1997, pp 114-119)

• You should report– how variables were initially

chosen and types of variables involved

– how variables were included in the models

• simultaneously, in pre-determined order, stepwise

– whether underlying assumptions were assessed

• linearity; Normality & equality of variance for residuals; no multi-collinearity

– coefficient of multiple determination R2

• % of variation in dependent variable explained by model

– overall test of goodness-of-fit

• analysis of variance F-test results

– a table of estimated coefficients with 95% confidence intervals and p-values of t-test

• give results for every variable in the model, not just those that are significant

• You could also report

– standard errors of coefficients

– estimated regression equation

26 March 2008 Statistical presentation - 6. Reporting more complicated findings

13

The Good 1A paper using stepwise regression

• Perry and McLaren (2004)

– An exploration of nutrition and eating disabilities n relation to quality of life at 6 months post-stroke, HSCC 12(4), 288-297

– methods used

26 March 2008 Statistical presentation - 6. Reporting more complicated findings

14

The Good 2That paper using stepwise regression • Perry and McLaren (2004)

– stepwise selection and final model

26 March 2008 Statistical presentation - 6. Reporting more complicated findings

15

The Good 3That paper using stepwise regression again

• Perry and McLaren (2004)– the final

model explained

26 March 2008 Statistical presentation - 6. Reporting more complicated findings

16

The Good 4An alternative way of presenting results• Roelands et al (2005)

– Knowing the diagnosis and counselling the relatives of a person with dementia: the perspective of home nurses and home care workers in Belgium, HSCC 13(2), 112-124

– final models presented another way

26 March 2008 Statistical presentation - 6. Reporting more complicated findings

17

6.5 Reporting logistic regressionSuggestions – simplify Lang and Secic (1997, pp 122-125)

• You should report

– how variables were initially chosen and types of variables involved

– how variables were included in the models

• simultaneously, in pre-determined order, stepwise

– overall test of goodness-of-fit

• change in -2 log likelihood, Hosmer & Lemeshow test

– a table of estimated odds ratios with 95% confidence intervals and p-values of Wald or t-test

• give results for every variable in the model, not just those that are significant

• You could also report– whether and how

underlying assumptions were assessed

• no multicollinearity

– estimated coefficients and standard errors

– Nagelkerke R2

• measure of variation in dependent explained by model

26 March 2008 Statistical presentation - 6. Reporting more complicated findings

18

The Good 1A paper with a pragmatic way of selecting variables

• Peters et al (2004)

– Factors associated with variations in older people’s use of community-based continence services, HSCC 12(1), 53-62

– methods used

26 March 2008 Statistical presentation - 6. Reporting more complicated findings

19

The Good 2The paper with a pragmatic way of selecting variables

• Peters et al (2004)

– final results

26 March 2008 Statistical presentation - 6. Reporting more complicated findings

20

The Good 3Another way of presenting results

• Darton (2004) – What types of home are closing? The characteristics of

homes which closed between 1996 and 2001, HSCC 12(3), 254-264

– more detailed results

26 March 2008 Statistical presentation - 6. Reporting more complicated findings

21

6.6 Reporting survival analysis 1Suggestions – simplify Lang and Secic (1997, pp 137-146)

• Kaplan-Meier analysis

• You should report

– nature and extent of the censoring

– survival rates with confidence intervals for each group

• percentage surviving at given time

– median survival time with confidence interval for each group

– Kaplan-Meier curves for each group plotting percentage survival (y) by time (x)

– comparison of survival curves by group

• log-rank (Cox-Mantel) test or Breslow-Wilcoxon test

• You could alternatively report

– life-table analysis

• table of events recorded by time interval

26 March 2008 Statistical presentation - 6. Reporting more complicated findings

22

3.6 Reporting survival analysis 2Suggestions – simplify Lang and Secic (1997, pp 137-146)

• Cox regression

• You should report– how variables were

initially chosen and types of variables involved

– how variables were included in the models

• simultaneously, in pre-determined order, stepwise

– overall test of goodness-of-fit

• likelihood ratio test

– how underlying assumption of proportional hazards was assessed

– a table of estimated hazard or risk ratios with 95% confidence intervals and p-values of Wald test

• give results for every variable in the model, not just those that are significant

• You could also report– estimated coefficients and

standard errors

26 March 2008 Statistical presentation - 6. Reporting more complicated findings

23

The Good 1A rare survival analysis example

• Trappes-Lomax et al (2006)– Buying Time I: a prospective, controlled trial of a joint

health/social care residential rehabilitation unit for older people on discharge from hospital, HSCC 14(1), 49-62

– methods used

26 March 2008 Statistical presentation - 6. Reporting more complicated findings

24

The Good 2Survival analysis

• Trappes-Lomax et al (2006)– sample size

26 March 2008 Statistical presentation - 6. Reporting more complicated findings

25

The Good 3Survival analysis

• Trappes-Lomax et al (2006)

– Kaplan-Meier survival plot

26 March 2008 Statistical presentation - 6. Reporting more complicated findings

26

The Good 4Survival analysis

• Trappes-Lomax et al (2006)

– Cox regression results