18
Renewable Energy Research Laboratory University of Massachusetts Prediction Uncertainties in Measure-Correlate- Predict Analyses Anthony L. Rogers, Ph.D. March 1, 2006

Renewable Energy Research Laboratory University of Massachusetts Prediction Uncertainties in Measure- Correlate-Predict Analyses Anthony L. Rogers, Ph.D

Embed Size (px)

Citation preview

Page 1: Renewable Energy Research Laboratory University of Massachusetts Prediction Uncertainties in Measure- Correlate-Predict Analyses Anthony L. Rogers, Ph.D

Renewable Energy Research Laboratory

University of Massachusetts

Prediction Uncertainties in Measure-Correlate-Predict Analyses

Anthony L. Rogers, Ph.D.

March 1, 2006

Page 2: Renewable Energy Research Laboratory University of Massachusetts Prediction Uncertainties in Measure- Correlate-Predict Analyses Anthony L. Rogers, Ph.D

Renewable Energy Research Laboratory

University of Massachusetts

Measure-Correlate-Predict (MCP)• Provides estimate of mean wind speed and

wind speed and direction distributions – Uses a short-term data set and a long-term

reference site data set

• How can we estimate prediction uncertainties?– Review of measured uncertainties– Evaluation of jackknife estimate of variance– Discussion of issues

Page 3: Renewable Energy Research Laboratory University of Massachusetts Prediction Uncertainties in Measure- Correlate-Predict Analyses Anthony L. Rogers, Ph.D

Renewable Energy Research Laboratory

University of Massachusetts

Measure-Correlate-Predict (MCP)• Apply relationship between concurrent target and

reference site data to long-term reference site data. 25

20

15

10

5

0Win

d S

pe

ed

, m/s

40x103 3020100

Time

Reference Site Data

25

20

15

10

5

0Win

d S

pe

ed

, m/s

40x103 3020100

Time

Target Site Data

Mean of X = 6.5X =

Y =

Y = aX+b Predicted Mean of Y = 5.2

Page 4: Renewable Energy Research Laboratory University of Massachusetts Prediction Uncertainties in Measure- Correlate-Predict Analyses Anthony L. Rogers, Ph.D

Renewable Energy Research Laboratory

University of Massachusetts

Measure-Correlate-Predict (MCP)• Relationship may be a function of wind speed,

direction, time, temperature, …• (SpeedT, DirT)=f(SpeedR, DirR , Time, TempR)

• “Variance” Method used here– Slope = ratio of standard

deviations of x and y data– Line goes through the mean

of x and y

– Provides unbiased estimates

• Correlations done in 8 direction sectors

xyx

yx

x

yy

ˆ

Page 5: Renewable Energy Research Laboratory University of Massachusetts Prediction Uncertainties in Measure- Correlate-Predict Analyses Anthony L. Rogers, Ph.D

Renewable Energy Research Laboratory

University of Massachusetts

Determining Prediction Uncertainties• Assemble multiple pairs of long-term

concurrent data sets– e.g. US176-US127

97,357 hourly averages

• Determine MCP estimates for multiple independent concurrent subsets– e.g. 21 MCP estimates for 4000 hr segments– Estimate long-term mean, Weibull parameters

• Evaluate how estimates vary

25

20

15

10

5

0

Win

d S

pe

ed

, m/s

80x103 6040200

Data Point Index

US176 US127

Page 6: Renewable Energy Research Laboratory University of Massachusetts Prediction Uncertainties in Measure- Correlate-Predict Analyses Anthony L. Rogers, Ph.D

Renewable Energy Research Laboratory

University of Massachusetts

Data Sets Used for Analysis• Six inland pairs

– Oregon, Iowa, Indiana

• Six offshore pairs– N. Atlantic, Hawaii

• 4 to 16 years of data

Site LocationDistance

kmYears of good data

Inland

1 Kennewick - Goodnoe* Oregon 112 11.52

2 Red Oak/Cedar* Iowa 219 4.53

3 Estherville/Forest City* Iowa 100 4.23

4 Inwood/Sibley* Iowa 66 4.33

5 Radcliffe/Sutherland* Iowa 186 3.15

6 US176x1 /US127x07* Indiana 9 10.00

Offshore

7 44005/44007* New England 87 10.23

8 Buoy 44013/44008* New England 231 13.49

9 Buoy BUZM3/IOSN3* New England 178 14.33

10 Buoy MDRM1/MISM1* New England 62 16.71

11 Buoy 51001/51003* Hawaii 497 13.49

12 Buoy 51002/51004* Hawaii 566 13.75

* Reference site

Page 7: Renewable Energy Research Laboratory University of Massachusetts Prediction Uncertainties in Measure- Correlate-Predict Analyses Anthony L. Rogers, Ph.D

Renewable Energy Research Laboratory

University of Massachusetts

Measured Mean Wind SpeedUncertainties

• Normalized standard deviation of mean:– Uncertainty decreases as

concurrent data length increases

– Beyond ~8000 hrs little improvement

– Value depends on site

• Normalized standard deviation of Weibull shape factor:

– Value very site dependant

0.12

0.08

0.04

0.00

8000600040002000

Inland

0.20

0.15

0.10

0.05

0.00

8000600040002000

Offshore

0.20

0.15

0.10

0.05

0.00

8000600040002000

Inland0.20

0.15

0.10

0.05

0.00

8000600040002000

Offshore

Page 8: Renewable Energy Research Laboratory University of Massachusetts Prediction Uncertainties in Measure- Correlate-Predict Analyses Anthony L. Rogers, Ph.D

Renewable Energy Research Laboratory

University of Massachusetts

Estimating Uncertainty

• In practice– Only one set of concurrent data

– Characteristics of concurrent data may not represent long-term behavior

– Confidence interval may not fall out of the analysis

• Are there methods to determine the confidence one can have in the MCP results?– Linear regression statistics

– Jackknife estimate of variance

– Estimates from correlation coefficients

Page 9: Renewable Energy Research Laboratory University of Massachusetts Prediction Uncertainties in Measure- Correlate-Predict Analyses Anthony L. Rogers, Ph.D

Renewable Energy Research Laboratory

University of Massachusetts

Estimating Uncertainty from Linear Regression

• Linear regression estimate ≠ measured!– Linear regression assumes data are not serially correlated

– But wind data ARE serially correlated

• Linear regression estimate = measured value when data are randomly jumbled, removing serial correlation

0.8

0.6

0.4

0.2

0.0S

tand

rd D

evia

tion,

m/s

8000600040002000Length of Concurrent Data, hrs

Standard Deviation of MCP Estimatesof Long Term Wind Speed

Measured - original data Measured - randomized data Linear regression - original data Linear regression - randomized data

Page 10: Renewable Energy Research Laboratory University of Massachusetts Prediction Uncertainties in Measure- Correlate-Predict Analyses Anthony L. Rogers, Ph.D

Renewable Energy Research Laboratory

University of Massachusetts

Jackknife Estimate of Variance• Applicable to any MCP algorithm • Typically works when other methods not available

1) Find long-term predicted value, , using all of concurrent data

2) Find n long term predicted values, , using concurrent data sets that each have a different 1/nth of the data file missing

3) Number of subsets, n, fixed at value that minimizes RMS error over all data sets

4) The estimated uncertainty is:

• Jackknife subsets need to be independent

n

iiy yy

n

n 22ˆ ˆˆ

1

y

iy

Page 11: Renewable Energy Research Laboratory University of Massachusetts Prediction Uncertainties in Measure- Correlate-Predict Analyses Anthony L. Rogers, Ph.D

Renewable Energy Research Laboratory

University of Massachusetts

Jackknife Results – Mean Wind SpeedInland Offshore

Blue = measured, Red = Estimated

0.25

0.20

0.15

0.10

0.05

0.00

Std

. D

ev.,

m/s

8000600040002000Number of hours in concurrent data

RS

0.30

0.20

0.10

0.00

Std

. D

ev.,

m/s

8000600040002000Number of hours in concurrent data

EF

0.15

0.10

0.05

0.00

Std

. D

ev.,

m/s

8000600040002000Number of hours in concurrent data

IS

0.5

0.4

0.3

0.2

0.1

0.0

Std

. D

ev.,

m/s

8000600040002000Number of hours in concurrent data

ROC

1.6

1.2

0.8

0.4

0.0

Std

. D

ev.,

m/s

8000600040002000Number of hours in concurrent data

KG

0.5

0.4

0.3

0.2

0.1

0.0

Std

. D

ev.,

m/s

8000600040002000Number of hours in concurrent data

US

0.5

0.4

0.3

0.2

0.1

0.0

Std

. D

ev.,

m/s

8000600040002000Number of hours in concurrent data

570.25

0.20

0.15

0.10

0.05

0.00

Std

. D

ev.,

m/s

8000600040002000Number of hours in concurrent data

MDR

0.6

0.4

0.2

0.0

Std

. D

ev.,

m/s

8000600040002000Number of hours in concurrent data

B440.4

0.3

0.2

0.1

0.0

Std

. D

ev.,

m/s

8000600040002000Number of hours in concurrent data

BUZ

0.30

0.20

0.10

0.00

Std

. D

ev.,

m/s

8000600040002000Number of hours in concurrent data

240.6

0.4

0.2

0.0

Std

. D

ev.,

m/s

8000600040002000Number of hours in concurrent data

13

Page 12: Renewable Energy Research Laboratory University of Massachusetts Prediction Uncertainties in Measure- Correlate-Predict Analyses Anthony L. Rogers, Ph.D

Renewable Energy Research Laboratory

University of Massachusetts

Jackknife Results – Mean Wind Speed• Ratio of measured to estimated standard deviation

• Jackknife estimate of uncertainty of mean typically somewhat underestimates correct value

5

4

3

2

1

0

Rat

io,

-

8000600040002000Numbers of hours in concurrent data

Offshore

5

4

3

2

1

0

Rat

io,

-

8000600040002000Numbers of hours in concurrent data

Inland

Page 13: Renewable Energy Research Laboratory University of Massachusetts Prediction Uncertainties in Measure- Correlate-Predict Analyses Anthony L. Rogers, Ph.D

Renewable Energy Research Laboratory

University of Massachusetts

Jackknife Results – Weibull Shape FactorInland Offshore

Blue = measured, Red = Estimated

0.16

0.12

0.08

0.04

0.00

Std

. D

ev.

8000600040002000Number of hours in concurrent data

570.16

0.12

0.08

0.04

0.00

Std

. D

ev.

8000600040002000Number of hours in concurrent data

B44

0.15

0.10

0.05

0.00

Std

. D

ev.

8000600040002000Number of hours in concurrent data

BUZ

0.6

0.4

0.2

0.0

Std

. D

ev.

8000600040002000Number of hours in concurrent data

13

0.4

0.3

0.2

0.1

0.0

Std

. D

ev.

8000600040002000Number of hours in concurrent data

24

0.12

0.08

0.04

0.00

Std

. D

ev.

8000600040002000Number of hours in concurrent data

MDR

0.10

0.08

0.06

0.04

0.02

0.00

Std

. D

ev.

8000600040002000Number of hours in concurrent data

EF

0.12

0.08

0.04

0.00

Std

. D

ev.

8000600040002000Number of hours in concurrent data

IS

0.15

0.10

0.05

0.00S

td.

Dev

.

8000600040002000Number of hours in concurrent data

RS

0.15

0.10

0.05

0.00

Std

. D

ev.

8000600040002000Number of hours in concurrent data

ROC

0.12

0.08

0.04

0.00

Std

. D

ev.

8000600040002000Number of hours in concurrent data

KG

0.20

0.15

0.10

0.05

0.00

Std

. D

ev.

8000600040002000Number of hours in concurrent data

US

Page 14: Renewable Energy Research Laboratory University of Massachusetts Prediction Uncertainties in Measure- Correlate-Predict Analyses Anthony L. Rogers, Ph.D

Renewable Energy Research Laboratory

University of Massachusetts

Jackknife Results – Weibull Shape Factor

• Ratio of measured to estimated standard deviation:

• Jackknife estimate of uncertainty of Weibull shape factor provides reasonable estimates

4

3

2

1

0

Rat

io,

-

8000600040002000Numbers of hours in concurrent data

Inland

4

3

2

1

0

Rat

io,

-

8000600040002000Numbers of hours in concurrent data

Offshore

Page 15: Renewable Energy Research Laboratory University of Massachusetts Prediction Uncertainties in Measure- Correlate-Predict Analyses Anthony L. Rogers, Ph.D

Renewable Energy Research Laboratory

University of Massachusetts

Limitations of EstimatingUncertainty from Short Data Sets

• Uncertainty within concurrent data set may not be same as uncertainty at longer time intervals

7

6

5

4

3

2

1

Mea

n W

ind

Spe

ed,

m/s

12108642Time, years

MCP Predictions and Jackknife UncertantiesUS176 - US127

1000 data points 9000 data points

Uncertainty within 1000 pt segments << variability of 1000 pt MCP predictions

Uncertainty within 9000 pt segments ~ variability of 9000 pt MCP predictions

Better estimates at one year0.6

0.5

0.4

0.3

0.2

0.1

0.0

Std

. D

ev.,

m/s

8000600040002000Number of hours in concurrent data

Measured and Estimated UncertaintyUS176x14 - US127x07 Data

Measured Estimated

Page 16: Renewable Energy Research Laboratory University of Massachusetts Prediction Uncertainties in Measure- Correlate-Predict Analyses Anthony L. Rogers, Ph.D

Renewable Energy Research Laboratory

University of Massachusetts

Possible Jackknife Modifications• Inclusion of seasonal model

– e.g Monthly correlations• If no correlation for month,

use overall correlation

– Little improvement in ratios

• Empirical correction factors– e.g Scale estimate of standard deviation

of mean wind speed by 1.6– Ratios show great improvement– Does empirical factor apply to all sites?

2.5

2.0

1.5

1.0

0.5

0.0

Ra

tio

8000600040002000Concurrent Data Length, hr

Inland

Correlations using: Concurrent data length Months, where possible

2.5

2.0

1.5

1.0

0.5

0.0

Ra

tio

8000600040002000Concurrent Data Length, hr

Offshore

Correlations using: Concurrent data length Months, where possible

4

3

2

1

0R

atio

, -

8000600040002000Numbers of hours in concurrent data

Offshore

4

3

2

1

0

Rat

io,

-

8000600040002000Numbers of hours in concurrent data

Inland

4

3

2

1

0

Rat

io

8000600040002000Concurrent Data Length, hr

Inland

4

3

2

1

0

Rat

io

8000600040002000Concurrent Data Length, hr

Offshore

Page 17: Renewable Energy Research Laboratory University of Massachusetts Prediction Uncertainties in Measure- Correlate-Predict Analyses Anthony L. Rogers, Ph.D

Renewable Energy Research Laboratory

University of Massachusetts

Alternative Approaches• Correlation coefficients

– Uncertainty weakly correlated with correlation coefficients

– No improvement over jackknife at these sites

40x10-3

30

20

10

0

Nor

mal

ized

Std

. D

ev.

of M

ean

0.90.80.70.60.5Correlation coefficient between data sets

1000 pts fit_1000 pts 9000 pts fit_9000 pts

Page 18: Renewable Energy Research Laboratory University of Massachusetts Prediction Uncertainties in Measure- Correlate-Predict Analyses Anthony L. Rogers, Ph.D

Renewable Energy Research Laboratory

University of Massachusetts

Conclusions• Jackknife should correctly estimate uncertainty based

on concurrent data– Much better than using linear regression results– Better than using fit to correlation coefficients

• Empirical correction may be used to account for variability at time scales greater than concurrent data length

• Variability at time scales greater than concurrent data length still a problem

• Jackknife estimate can be used with any MCP algorithm