5
Module 1: Presentation of data Minitab Laboratory Work Sheet Part I Question 1 The residues (parts per million) of a drug in tissue from a random sample of 40 people, 20 days after oral administration of the drug, were as follows: 1.63 1.40 1.64 1.30 1.49 1.58 1.03 1.06 1.33 1.20 1.52 1.87 1.83 1.97 1.62 1.21 1.01 1.14 1.58 1.76 1.43 1.41 1.51 1.15 1.61 1.10 1.03 1.84 1.61 1.54 1.71 1.32 1.29 1.82 1.99 1.43 1.53 1.56 1.48 1.58 Obtain a histogram and interpret the data using a box plot. Obtain the sample mean and standard deviation, also the median and IQR. Compare the histogram and box plot. Which summary statistics are the most appropriate here? Find a suitable confidence interval for the population mean residue. (Note: See Guidelines on next page.) It is desired to estimate the mean residue in these circumstances to within ±0.05 parts per million with 95% confidence. How large a sample would you judge to be necessary in order to achieve this? Question 2 The following data represent the prepatency times in days observed in 50 people who contracted an infectious disease from the times of their known first contacts with the infection: 12 12 12 13 14 11 13 14 12 12 13 13 15 14 14 14 12 12 15 12 14 12 14 11 14 14 15 12 11 14 12 14 12 13 15 14 13 13 13 14 12 12 13 12 14 13 14 12 13 12 Obtain the sample mean and the sample standard deviation, and estimate the population median and IQR. Using a box plot and histogram, comment on whether you think the data are skew or symmetrical. Choose the most suitable sample estimators of the population location and spread. Question 3 In a nerve system the speed in metres per second at which impulses travelled in 48 axons of approximately the same diameter were as follows: 62 64 61 69 60 62 71 78 73 64 69 76 70 70 62 60 61 75 70 77 76 73 73 72 69 68 73 78 62 71 75 73 59 58 66 69 63 68 65 67 77 73 65 66 69 71 65 62

Minitab Modul

Embed Size (px)

DESCRIPTION

Modul Minitab STMIK Sumedang 2009

Citation preview

Page 1: Minitab Modul

Module 1: Presentation of data

Minitab Laboratory Work Sheet

Part I

Question 1

The residues (parts per million) of a drug in tissue from a random sample of 40 people,20 days after oral administration of the drug, were as follows:

1.63 1.40 1.64 1.30 1.49 1.58 1.03 1.06 1.33 1.201.52 1.87 1.83 1.97 1.62 1.21 1.01 1.14 1.58 1.761.43 1.41 1.51 1.15 1.61 1.10 1.03 1.84 1.61 1.541.71 1.32 1.29 1.82 1.99 1.43 1.53 1.56 1.48 1.58

Obtain a histogram and interpret the data using a box plot. Obtain the sample meanand standard deviation, also the median and IQR. Compare the histogram and box plot.Which summary statistics are the most appropriate here? Find a suitable confidenceinterval for the population mean residue. (Note: See Guidelines on next page.)

It is desired to estimate the mean residue in these circumstances to within ±0.05 partsper million with 95% confidence. How large a sample would you judge to be necessaryin order to achieve this?

Question 2

The following data represent the prepatency times in days observed in 50 people whocontracted an infectious disease from the times of their known first contacts with theinfection:

12 12 12 13 14 11 13 14 12 1213 13 15 14 14 14 12 12 15 1214 12 14 11 14 14 15 12 11 1412 14 12 13 15 14 13 13 13 1412 12 13 12 14 13 14 12 13 12

Obtain the sample mean and the sample standard deviation, and estimate the populationmedian and IQR. Using a box plot and histogram, comment on whether you thinkthe data are skew or symmetrical. Choose the most suitable sample estimators of thepopulation location and spread.

Question 3

In a nerve system the speed in metres per second at which impulses travelled in 48 axonsof approximately the same diameter were as follows:

62 64 61 69 60 62 71 78 73 64 69 7670 70 62 60 61 75 70 77 76 73 73 7269 68 73 78 62 71 75 73 59 58 66 6963 68 65 67 77 73 65 66 69 71 65 62

Page 2: Minitab Modul

Using suitable plots and sample statistics, contrast these results with the following speedsfor impulses observed in 40 axons all of a larger diameter:

68 68 73 67 70 74 68 70 66 6979 64 74 72 69 79 82 76 64 7078 73 69 65 69 68 71 71 70 6370 70 71 67 68 61 66 65 68 71

Prepare a brief statement which communicates your findings to colleagues who haven’tseen the data.

Question 4

The creatinine levels (g/24h) in people suffering from extensive skin lesions and beingtreated with an anti-inflammatory steroid compound were monitored:

1.34 1.42 1.45 1.33 1.35 1.26 1.34 1.40 1.39 1.321.24 1.32 1.29 1.35 1.19 1.34 1.41 1.31 1.44 1.311.28 1.39 1.22 1.41 1.40 1.47 1.36 1.40 1.36 1.38

Prepare a brief report describing the characteristics of the data.

Guidelines

Once you have selected the Minitab application, you can carry out analyses as required.The following guidelines for Question 1 should help you to get started.

Go to the Worksheet window and click in the box corresponding to row 1 and columnC1. Type the first data value (1.63) in this box and hit return. Type in the second datavalue (1.40) — it should appear in row 2 of column C1. Similarly, type the other valuesinto the first column of the worksheet.

Name C1 by typing ‘residue’ in the box below C1 and above the data values.

To create a histogram of ‘residue’, click Graph, then Histogram. Click on residue,then select, OK. The plot should look something like this:

A box plot is obtained from the sequence Graph, Boxplot, residue, select, OK.The graph should look like this:

The mean, standard deviation and median are obtained by clicking Stat, Basic Statis-tics, Display Descriptive Statistics, residue, select, OK.

Page 3: Minitab Modul

Descriptive Statistics

Variable N Mean Median Tr Mean StDev SE Mean

residue 40 1.4778 1.5150 1.4753 0.2657 0.0420

Variable Min Max Q1 Q3

residue 1.0100 1.9900 1.2925 1.6275

In the output above “Tr mean” is the trimmed mean. It is the mean of the middle 90%of values. It is rarely used or quoted in practice and can be ignored in this course.

Summary statistics can also be obtained using Calc, Column Statistics then click onthe statistic of interest and select residue, then click OK. Note however that you canonly select one statistic at a time.

A t-interval is calculated using Stat, Basic Statistics, 1-sample t. (You could also trya z-interval; you need to supply a value for sigma). Your output should look somethinglike this:

Confidence Intervals

Variable N Mean StDev SE Mean 95.0 % CI

residue 40 1.4778 0.2657 0.0420 ( 1.3928, 1.5627)

Page 4: Minitab Modul

Details of how to choose a sample size for a mean are given in the notes for module1 on pages 11–12. To perform the calculations, you can either use your own handheldcalculator or use Minitab.Within Minitab, the simplest approach is to enable the prompt line in the Session window(Editor, Enable Commands) and type commands.For the sample size calculation in question 1, s = 0.2657 (from the Descriptive Statistics)and a = 0.05. The following commands could be typed into Minitab at the“MTB >” prompt:

let k1 = (1.96**2) * (0.2657**2) / (0.05**2)

print k1

Note that Minitab uses K1, K2, etc to store constants.The output from the above commands appears as,

Data Display

K1 108.481

suggesting that the residue would need to be measured in 109 individuals to enableestimation of the sample mean to within ±0.05 parts per million with 95% confidence.

To print and/or save your Minitab workings and plots, see the handout ‘Reference Guideto Minitab and Computing’.

Complete the remaining questions in a similar way.

Choose File, Exit to exit Minitab.

Page 5: Minitab Modul

Module 1: Presentation of data

Minitab Laboratory Work Sheet

Part II

Question 5

See question 2 in part I.Let p be the proportion of such people in whom the prepatency time is 14 days or more.What is the sample proportion p̂? Use this to calculate a 95% confidence interval for p.If it is required to estimate p to within ±0.1, how large a sample would you recommendbe taken

(a) if 95% confidence in the accuracy is required,

(b) if 99% confidence in the accuracy is required?

Guidelines

Firstly, we need to determine the number of people with prepatency times greater thanor equal to 14 days. One way to achieve this is to sort the data (Manip, Sort andsorting by the column of times and placing the sorted data into a new column. You needto Sort by the column of times), then note the number of times satisfying the criteriaand convert that to a proportion (you should find that p = 0.38).

To calculate a confidence interval for a proportion, go to Stat, Basic Statistics,1 Proportion. Check the Summarized data box and enter 50 and 19 for the numberof trials and successes respectively.

Sample size calculations can be performed using the prompt line in the Session windowas in Part I, for example

let k1 = (1.96**2) * 0.38 * (1-0.38) / (0.1**2)

prin k1

suggesting that the prepatency times would need to be measured in 91 people to enableestimation of the sample proportion, p = 0.38, to within ±0.1 with 95% confidence.