44
R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

Embed Size (px)

Citation preview

Page 1: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

R commander an introduction

user-friendly and absolutely free

Ho KimSCHOOL OF PUBLIC HEALTH, SNU

Page 2: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

Installing R commander

• You need to first install R and then R commander.

Page 3: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

Starting the R commander> library(Rcmdr)

Page 4: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

R commanderwindows

Page 5: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

Importing datasets

Page 6: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

Select the data set by clicking on this box

Page 7: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

Checking continuous variables

• Statistics->Means• options– Single-sample t-test– Independent samples t-test– Paired t-test– One-way ANOVA– Multi-way ANOVA

Page 8: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

문제 1.

1.1 Pepers.xls 자료를 읽고 angle 변수의 평균이 0인지를 검정하시오 .

• 귀무가설과 대립가설이 무엇인지를 식으로 정확히 표현하시오 .

8

Page 9: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

single-sample t-test (Pepers.xls)

• Statistics > Means > Single-sample t-test (Enter the proposed mean (Null hypothesis: mu=))

Page 10: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU
Page 11: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

1.2 angle 변수의 평균이 2 라고 이미 알려져있다고 가정하고 이 자료를 가지고 기존의 지식이 사실이 아니라는 것을 주장하고 싶다면 어떠한 분석을 실시할 수 있는지 귀무가설과 대립가설을 써 보시오 .

* 위 검정을 R commander 를 이용해서 분석하고 결론을 내리시오 .

11

Page 12: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

single-sample t-test (Pepers.xls)

• Statistics > Summaries > Shapiro-Wilk test of normality

• This is a hypothesis tests with the null hypothesis that the data comes from a normal

• distribution.

Page 13: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

문제 2.

2.1 Pulse.xls 자료를 읽고 pre 와 post 변수를 볼 때 어떠한 분석을 실시해야 하는지 설명하시오 .

* 귀무가설과 대립가설이 무엇인지를 식으로 정확히 표현하시오 .

2.2 위의 가설을 모수적인 방법 , 비모수적인 방법으로 증명하고자 할 때 R commander 를 이용해서 분석하시오 . 그리고 통계적 결론을 내리시오 .

13

Page 14: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

paired t-test (Pepers.xls)

• Statistics > Means > Paired t-test

Page 15: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU
Page 16: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

paired t-test (Pepers.xls)

• Statistics > nonparametric tests > Paired-samples Wilcoxon test

Page 17: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU
Page 18: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

문제 3.

3.1 insul.xls 자료를 읽고 이 자료의 분석 목적에 대해서 설명하시오 .

3.2 자료의 탐색 (Statistics>Summaries) 을 R commander 를 이용해서 실시하고 결과를 해석하시오 .

3.3 5 군의 glucose 값을 비교한다면 귀무가설과 대립가설이 무엇인지를 식으로 정확히 표현하시오 .

3.4 R commander 를 이용한 ANOVA 를 실시하고 그 결과를 해석하시오 .

3.5 사후분석을 실시해서 군간의 차이를 설명하시오3.6 conc=1,2 를 한 그룹으로 conc=4,5 를 다른

그룹으로 (2 군간의 비교 ) 해서 비교를 한다면 어떠한 방법이 가능할 지 설명하고 R commander 를 이용해서 분석을 실시하시오 .

18

Page 19: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

insul.xls

• Effect of glucose concentration on Insulin– Measured the amount of insulin secretion

after administration of five different concentrations of glucose into pancreatic tissue (animal experiments)

• Characteristics for each group– Statistics > Summaries (according to the

study objective)– Graphs (according to the study objective)

• variable “conc” must be declared as a factor variable!

19

Page 20: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU
Page 21: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU
Page 22: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

Graphs->Boxplot

Conc 1,2 < 3 < conc 4,5

Page 23: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

insul.xls

• One-Way ANOVA – Statistics > Means > One-way ANOVA

– Pairwise comparisons of means – Tukey post-hoc comparison procedure (default)

Page 24: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU
Page 25: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

• t-test for (1,2) vs (4,5) comparison– Re-define variables

• Data > Manage variable in active data set > Recode variables > select conc variable

• New variable name or prefix for multiple recodes : new• Enter recode directives• 1:2=1; 3=NA; 4:5=2• conc=3 as a missing

– Equality of variance test should be carried out before the t-test• Statistics > Variances > Two variances F-test• the variances are equal

– Statistics > Means > Independent samples t-test • Mean concentration difference between two new groups

(variances are assumed to be equal)• Significant

Insul.xls

25

Page 26: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU
Page 27: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

Variance ratio test of the two groups:

Statistics > Variances > Two variances F-test

Page 28: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

Independent samples t-test (equal variances)

Page 29: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

Insul.xls

• Nonparametric way of comparing (1,2) vs (4,5) – Statistics > Nonparametric tests > Two sample

Wilcoxon test

29

Page 30: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

taillite2.sav data

vehtype='Vehicle Type‘group='Group - Light On=1 Light Off=2‘position='Light Position‘speedzn='Speed Zone‘resptime='Response Time‘follotme='Following Time in Vedio Frames‘folltmec='Following Time in Categories‘;• resptme(continounous) difference by

Vehtype(dichotomous) variable=> Analysis of variance?

• Looking at only Group=1

30

Page 31: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

문제 4.

4.1 taillite2.sav 자료를 읽고 이 자료의 분석 목적에 대해서 설명하시오 .

4.2 vehtype 에 따른 resptime 의 차이가 있는지를 ANOVA 를 이용해서 검정하시오 .

4.3 원자료의 정규성 검정을 실시하고 결론을 이야기하시오 .4.4 비모수적인 방법으로 vehtype 에 따른 resptime 의

차이가 있는지를 검정하시오 .4.5 로그변환을 실시하고 정규성 검정을 실시하시오 . 4.6 로그변환 변수를 이용해서 ANOVA 를 실시하시오 4.7 로그변환 후 비모수 검정을 실시하시오 .4.8 4.2 와 4.6 4.4 와 4.7 의 결과들을 비교 설명하시오

31

Page 32: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

• Trying ANOVA– Statistics > Means > One-way ANOVA– Response variable : resptime, Groups :

vehtype – Grouping variables should be converted as

factor variables (Data > Manage variable in active data set > Convert numeric variables to factors)

• A significant difference between Vehtypes on resptime?

taillite2.sav data

32

Page 33: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

taillite2.sav data

• Normality test– Statistics > Summaries > Shapiro-Wilk test of

normality– For normality test for Vehtype,

• by(taillite2$resptime, taillite2$vehtype, shapiro.test)

– Reject the null!! • ANOVA can not be conducted.

33

Page 34: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU
Page 35: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

• Trying nonparametric way (Kruskal-Wallis test) – Statistics > Nonparametric tests > Kruskal-Wallis

test– p=0.259• No difference between groups!

taillite2.sav data

35

Page 36: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

taillite2.sav data

• Data > Manage variable in active data set > Compute new variable– New variable name : lresp – Expression to compute : log(resptime)

• Normality test for lresp– Edit command line as

• by(taillite2$lresp, taillite2$vehtype, shapiro.test)

36

Page 37: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU
Page 38: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

taillite2.sav data

• Trying ANOVA with lresp– p=0.063

• What do you conclude?

38

Page 39: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

electric.xls 분석

housize = 'House Size'income = 'Family Income‘aircapac = 'Air Conditioning Capacity‘applindx = 'Appliance Index‘family = 'Number of Family Members‘peak = 'Peak Hour Electric Load' ;• Aim: Selecting variables that affect the

variable “peak” (Maximum amount of electricity) and finding the regression equation

• Statistics > Fit models > Linear regression– Create command line first if you want to use the

stepwise method for model selection (use step(model) function)

39

Page 40: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

문제 5.

5.1 eletric.xls 자료를 읽고 이 자료의 분석 목적에 대해서 설명하시오 .

5.2 peak 를 종속변수로 해서 단계적 선택에 의한 회귀분석을 실시하고 해석을 하시오 . (family 변수는 제외 )

Statistics -> Fit models -> Linear Regression

40

Page 41: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU
Page 42: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU
Page 43: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

3D graphics

Page 44: R commander an introduction user-friendly and absolutely free Ho Kim SCHOOL OF PUBLIC HEALTH, SNU

Rcmdr

• R commander was developed as an easy to use graphical user interface (GUI) for R

• Rcmdr is not perfect yet, but has been updated

• Expecting menu screen in Korean and Korean fonts variability