Improving an Ecosystem Model Using Earth Science Data

Kazumi Saito (NTT Communication Science Laboratories)
Pat Langley (CSLI Stanford University)
Trond E. Grenager (Stanford University)

2001. 4. 17.

Introduction

• Developing computational methods for discovering knowledge in communicable forms.
• Improving CASA using observed data.
• CASA: an existing computational model of aspects of the Earth ecosystem, developed by Christopher Potter and his colleagues at NASA Ames.

Portion of CASA

[Figure: dependency diagram for the portion of CASA that computes NPPc. Legend: known constant, observed variable, variable. Nodes: NPPc, E, IPAR, T1, T2, W, e_max, Topt, tempc, A, ahi, eet, PET, PET_TW_M, FPAR_FAS, SR_FAS, SRDIFF, srmin, FAS_NDVI, umd_veg, solar, sol_conv.]

Some Equations

NPPc: net primary production.

$$\mathit{NPPc} = \max(0,\ E \times \mathit{IPAR})$$

$$E = \mathit{e\_max} \times T_1 \times T_2 \times W$$

$$\mathit{IPAR} = \mathit{FPAR\_FAS} \times \mathit{Solar} \times \mathit{sol\_conv} \times 0.5$$

E: photosynthetic efficiency, i.e. the maximum possible efficiency e_max scaled by the temperature and moisture stress scalars.

IPAR: photosynthetically active radiation intercepted by the vegetation cover.
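The three equations on this slide can be sketched in a few lines of Python. This is a minimal illustration rather than the CASA implementation: the function name `nppc` and its argument names are assumptions that mirror the variable names on the slide, and every input is taken to be a plain float for a single grid cell.

```python
def nppc(e_max, t1, t2, w, fpar_fas, solar, sol_conv):
    """Net primary production for one cell (hypothetical helper, not CASA code)."""
    e = e_max * t1 * t2 * w                    # E = e_max x T1 x T2 x W
    ipar = fpar_fas * solar * sol_conv * 0.5   # IPAR = FPAR_FAS x Solar x sol_conv x 0.5
    return max(0.0, e * ipar)                  # NPPc = max(0, E x IPAR)
```

The `max` clamp only matters when one of the factors goes negative, for example a stress scalar outside its usual range.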

General Problem

• Revisions to the model must be consistent with existing knowledge of Earth science and, ideally, retain similarity to the current model.
• Our research attempts to improve the CASA model's predictive accuracy.

Outline of Approach

• Transforming the equations into a neural network
• Revising weights in that network
• Transforming the network back into equations

[Figure: the original equations, NPPc = f(…), are turned into the revised equations, NPPc = g(…).]

Some Types of Neural Networks

Standard (sigma-sigma) net:

$$\sum_j w_j f_j\Big(\sum_k w_{jk} x_k\Big)$$

Sigma-pi net (generalized polynomial):

$$\sum_j w_j \prod_k x_k^{w_{jk}} = \sum_j w_j \exp\Big(\sum_k w_{jk} \ln x_k\Big)$$

Pi-sigma net (this talk):

$$\prod_j w_j f_j\Big(\sum_k w_{jk} x_k\Big)$$
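To make the difference between the three network types concrete, here is a small Python sketch of their forward passes. The function names, the list-of-lists weight layout, and the choice of `math.tanh` as a default activation are assumptions made for illustration; none of them come from the talk.

```python
import math

def weighted_sum(row, x):
    """Inner sum over k of w_jk * x_k."""
    return sum(wjk * xk for wjk, xk in zip(row, x))

def sigma_sigma(w_out, w_in, x, f=math.tanh):
    """Standard net: sum over j of w_j * f(sum_k w_jk x_k)."""
    return sum(wj * f(weighted_sum(row, x)) for wj, row in zip(w_out, w_in))

def sigma_pi(w_out, w_in, x):
    """Generalized polynomial: sum over j of w_j * prod_k x_k ** w_jk (needs x_k > 0)."""
    logs = [math.log(xk) for xk in x]
    return sum(wj * math.exp(weighted_sum(row, logs)) for wj, row in zip(w_out, w_in))

def pi_sigma(w_out, w_in, x, fs):
    """Pi-sigma net: product over j of w_j * f_j(sum_k w_jk x_k)."""
    return math.prod(wj * fj(weighted_sum(row, x))
                     for wj, fj, row in zip(w_out, fs, w_in))
```

For example, `sigma_pi([2.0], [[2.0, 1.0]], [3.0, 4.0])` evaluates the monomial 2 × 3² × 4 through the exp/log form, which is why that form requires positive inputs.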

Transforming Equations

[Figure: the same CASA dependency diagram as on the "Portion of CASA" slide, with two regions annotated: "Stress scalars" and "Intrinsic property".]

Stress Scalars

Original equations:

$$T_1 = 0.8 + 0.02 \times \mathit{Topt} - 0.0005 \times \mathit{Topt}^2 = 1 - (-0.4472 + 0.0224 \times \mathit{Topt})^2$$

$$T_2 = 1.1814 \times \frac{1}{1 + \exp(0.2 \times (\mathit{Topt} - 10 - \mathit{tempc}))} \times \frac{1}{1 + \exp(0.3 \times (-\mathit{Topt} - 10 + \mathit{tempc}))}$$

$$W = 0.5 + 0.5 \times \frac{\mathit{eet}}{\mathit{PET}}$$

$$E = \mathit{e\_max} \times T_1 \times T_2 \times W$$
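The stress scalars can be written directly as Python functions. This sketch assumes the standard CASA forms of T1, T2 and W (the constants 0.8, 0.02, 0.0005, 1.1814, 0.2, 0.3 and 0.5 that survive on this slide); the function names are illustrative and temperatures are taken to be in degrees Celsius.

```python
import math

def t1(topt):
    """Temperature stress scalar T1 (depends on the optimal temperature only)."""
    return 0.8 + 0.02 * topt - 0.0005 * topt ** 2

def t2(topt, tempc):
    """Temperature stress scalar T2: a bump-shaped response around Topt."""
    low = 1.0 / (1.0 + math.exp(0.2 * (topt - 10.0 - tempc)))
    high = 1.0 / (1.0 + math.exp(0.3 * (-topt - 10.0 + tempc)))
    return 1.1814 * low * high

def w(eet, pet):
    """Moisture stress scalar W from estimated and potential evapotranspiration."""
    return 0.5 + 0.5 * eet / pet

def efficiency(e_max, topt, tempc, eet, pet):
    """E = e_max x T1 x T2 x W."""
    return e_max * t1(topt) * t2(topt, tempc) * w(eet, pet)
```

With these forms, T2 is close to 1 when `tempc` equals `topt` and falls off on either side, while W ranges from 0.5 (no evapotranspiration) to 1.0 (eet equal to PET).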

Transformation into Network

$$T_1 = f_1(x_1) = 1 - x_1^2 = f_1(w_{11} + w_{12} \times \mathit{Topt}) = 1 - (-0.4472 + 0.0224 \times \mathit{Topt})^2$$

$$T_2 = 1.1814 \times f_2(x_{21}) \times f_2(x_{22}) = 1.1814 \times \frac{1}{1 + \exp(-x_{21})} \times \frac{1}{1 + \exp(-x_{22})}$$

$$f_2(x_{21}) = f_2(w_{21} + w_{22} \times (\mathit{Topt} - \mathit{tempc})) = \frac{1}{1 + \exp(-2 + 0.2 \times (\mathit{Topt} - \mathit{tempc}))}$$

$$W = f_3(x_3) = x_3 = f_3\Big(w_{31} + w_{32} \frac{\mathit{eet}}{\mathit{PET}}\Big) = 0.5 + 0.5 \times \frac{\mathit{eet}}{\mathit{PET}}$$

$$E = \mathit{e\_max} \times T_1 \times T_2 \times W = w_0 \times \prod_i f_i$$
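As a sanity check on the pi-sigma form, the sketch below evaluates E once as a product of unit activations and once from the original stress-scalar equations. Several details are assumptions rather than content of the slide: the second logistic input `x22 = 3 + 0.3 * (topt - tempc)` and the output weight `w0 = 1.1814 * e_max` are derived here by rearranging the standard CASA equations, and the function names are hypothetical.

```python
import math

def f1(x):   # quadratic unit used for T1
    return 1.0 - x * x

def f2(x):   # logistic unit used for the two factors of T2
    return 1.0 / (1.0 + math.exp(-x))

def f3(x):   # linear unit used for W
    return x

def e_network(e_max, topt, tempc, eet, pet):
    """E as w0 times a product of unit activations (pi-sigma form)."""
    x1 = -0.4472 + 0.0224 * topt
    x21 = 2.0 - 0.2 * (topt - tempc)      # shown on the slide
    x22 = 3.0 + 0.3 * (topt - tempc)      # assumption: derived by analogy
    x3 = 0.5 + 0.5 * eet / pet
    w0 = 1.1814 * e_max                   # assumption: constants folded into w0
    return w0 * f1(x1) * f2(x21) * f2(x22) * f3(x3)

def e_direct(e_max, topt, tempc, eet, pet):
    """E from the original stress-scalar equations, for comparison."""
    t1 = 0.8 + 0.02 * topt - 0.0005 * topt ** 2
    t2 = 1.1814 / ((1.0 + math.exp(0.2 * (topt - 10.0 - tempc)))
                   * (1.0 + math.exp(0.3 * (-topt - 10.0 + tempc))))
    w = 0.5 + 0.5 * eet / pet
    return e_max * t1 * t2 * w
```

The two functions agree up to the rounding of the constants 0.4472 and 0.0224, which is why a comparison should use a small relative tolerance rather than exact equality.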

Intrinsic Values for Vegetation Type

$$\mathit{FPAR\_FAS} = \min\Big(\frac{\mathit{SR\_FAS} - \mathit{srmin}}{\mathit{SRDIFF}},\ 0.95\Big) \approx (\mathit{SR\_FAS} - \mathit{srmin}) \times \frac{1}{\mathit{SRDIFF}}$$

FPAR_FAS: fraction of photosynthetically active radiation absorbed by the vegetation cover.

SRDIFF: maps the ground cover type to an srmax - srmin value.
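A minimal Python version of the FPAR_FAS equation, including the 0.95 cap that the approximation on this slide drops. The function name and argument names are assumptions that mirror the slide's variables.

```python
def fpar_fas(sr_fas, srmin, srdiff):
    """Fraction of absorbed PAR: linear in SR_FAS, capped at 0.95."""
    return min((sr_fas - srmin) / srdiff, 0.95)
```

Because `srdiff` depends on the vegetation type, it is the natural place to attach one learnable intrinsic value per type.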

Transformation into Network

$$\frac{\mathit{SR\_FAS} - \mathit{srmin}}{\mathit{SRDIFF}} = \exp\big(\log(\mathit{SR\_FAS} - \mathit{srmin}) - \log(\mathit{SRDIFF})\big)$$

$$\mathit{umd\_veg\_i} = \begin{cases} 1 & \text{if } \mathit{umd\_veg} = i \\ 0 & \text{otherwise} \end{cases}$$

$$-\log(\mathit{SRDIFF}) = \sum_{i=1}^{13} v_i \times \mathit{umd\_veg\_i}$$

v_i: weight in the neural network.
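The encoding on this slide can be sketched as follows: the vegetation type becomes a one-hot vector, and the weights v_i stand in for -log(SRDIFF), so the ratio is computed in the log domain. The constant `N_TYPES`, the function names, and the SRDIFF table used in the usage note are illustrative assumptions, not CASA data.

```python
import math

N_TYPES = 13  # upper limit of the sum on the slide

def one_hot(umd_veg):
    """umd_veg_i = 1 if umd_veg == i, else 0, for i = 1..N_TYPES."""
    return [1 if umd_veg == i else 0 for i in range(1, N_TYPES + 1)]

def ratio_network(sr_fas, srmin, umd_veg, v):
    """(SR_FAS - srmin) / SRDIFF via exp(log(SR_FAS - srmin) - log(SRDIFF))."""
    neg_log_srdiff = sum(vi * ui for vi, ui in zip(v, one_hot(umd_veg)))
    return math.exp(math.log(sr_fas - srmin) + neg_log_srdiff)  # needs sr_fas > srmin
```

Initializing `v = [-math.log(s) for s in srdiff_table]` from a hypothetical per-type SRDIFF table reproduces the original ratio, after which the v_i can be revised by training.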

Revising Weights in Networks

Supervised learning methods, classified by step-length and search direction:

Step-length         1st-order search direction       2nd-order search direction
fixed (constant)    BP, etc.                         Newton method
variable            Silva-Almeida algorithm, etc.    SCG, OSS, BPQ, etc.

2nd-order learning algorithms (○ good, △ fair, × poor):

Method                       Applicability to large-scale problems    Performance with inaccurate step-length
Gauss-Newton method          ×                                        ○
quasi-Newton method          △                                        △
conjugate gradient method    ○                                        ×

BPQ Algorithm

• The search direction is calculated on the basis of a partial BFGS update.
• The step-length is calculated by using a second-order approximation.
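The two ingredients named on this slide can be illustrated with a small pure-Python optimizer. This is a sketch under stated assumptions and not the authors' BPQ code: the direction uses a one-step (memoryless) BFGS update built from the latest weight and gradient differences, and the step-length minimizes a quadratic model along that direction, with the curvature estimated by a finite difference of gradients. The names `minimize`, `grad`, and `eps` are hypothetical.

```python
def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def minimize(grad, w, iters=20, eps=1e-6, tol=1e-12):
    """Quasi-Newton descent with a second-order step-length (illustrative only)."""
    g = grad(w)
    d = [-gi for gi in g]                       # first direction: steepest descent
    for _ in range(iters):
        if dot(g, g) < tol:
            break
        # Curvature d'Hd along d, from a finite difference of gradients.
        g_eps = grad([wi + eps * di for wi, di in zip(w, d)])
        d_h_d = dot(d, [(a - b) / eps for a, b in zip(g_eps, g)])
        if d_h_d <= 0.0:
            break                               # quadratic model has no minimum along d
        lam = -dot(g, d) / d_h_d                # minimizer of the quadratic model
        w_new = [wi + lam * di for wi, di in zip(w, d)]
        g_new = grad(w_new)
        s = [a - b for a, b in zip(w_new, w)]
        y = [a - b for a, b in zip(g_new, g)]
        sy = dot(s, y)
        if sy <= 1e-18:
            d = [-gi for gi in g_new]           # fall back to steepest descent
        else:
            rho = 1.0 / sy
            sg, yg = dot(s, g_new), dot(y, g_new)
            coef_s = rho * yg - rho * sg * (1.0 + rho * dot(y, y))
            coef_y = rho * sg
            # d = -H g, where H is the identity after one BFGS update with (s, y).
            d = [-gi + coef_s * si + coef_y * yi
                 for gi, si, yi in zip(g_new, s, y)]
        w, g = w_new, g_new
    return w
```

On a quadratic error surface such as `grad = lambda w: [w[0], 10 * w[1]]`, the step-length formula is exact and the iterates reach the minimum in a handful of steps, without a learning-rate parameter to tune.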

Demonstration Problem

[Figure: a sample set (table with columns Sample, x, y; surviving values: 0.73, 0.88, 0.95, 0.98, 0.99) and a small network with an input unit x(t), a hidden unit, and an output unit z(t), connected by weights w1 and w2.]

Learning Neural Network: Result

[Figure: squared-error surface over the weights w1 and w2 (both axes from -0.5 to 1), showing the search trajectories of BPQ (2nd-order method) and BP + momentum (1st-order method).]

Experimental Result

[Figure: bar chart of three RMSE values, labeled original, apparent, and LOO CV; vertical axis from 360 to 560.]

The RMSE of the original model was reduced by 15 percent, as measured using cross validation.

$$\mathit{RMSE} = \sqrt{\frac{\sum_{\text{samples}} (\mathit{NPP}_{\text{observed}} - \mathit{NPP}_{\text{predicted}})^2}{\text{number of samples}}}$$
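The two error measures reported on this slide can be sketched in Python as below. The apparent RMSE is computed on the data used for fitting, while the leave-one-out (LOO) estimate refits the model once per sample. The `fit` callback interface and the trivial `fit_mean` model are assumptions for illustration; the talk's actual model is the revised CASA network.

```python
import math

def rmse(observed, predicted):
    """Root mean squared error over paired samples."""
    n = len(observed)
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(observed, predicted)) / n)

def loo_cv_rmse(xs, ys, fit):
    """Leave-one-out RMSE: fit on all samples but one, predict the held-out one."""
    preds = []
    for i in range(len(xs)):
        model = fit(xs[:i] + xs[i + 1:], ys[:i] + ys[i + 1:])
        preds.append(model(xs[i]))
    return rmse(ys, preds)

def fit_mean(train_x, train_y):
    """Hypothetical stand-in model: always predicts the training mean."""
    mean = sum(train_y) / len(train_y)
    return lambda x: mean
```

A call such as `loo_cv_rmse(xs, ys, fit_mean)` gives an error estimate that is not biased by fitting and testing on the same samples, which is why the slide reports the improvement under cross validation.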

Intrinsic Values

[Figure: bar chart of the initial and obtained intrinsic values for vegetation types 1 to 11; vertical axis from 0 to 6.]

The intrinsic values associated with vegetation types obtained in this way were consistently lower than the initial values.

Transforming Network

Step 1. Quantize by using a clustering method.

Step 2. Determine an adequate number of rules by using cross-validation.

Step 3. Generate nominal conditions by solving a standard classification problem.

$$\Big\{ \exp\Big(\sum_{k,l} v_{kl}\, q^{n}_{kl}\Big) : n = 1, \ldots, N \Big\}$$
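Step 1 can be illustrated with a one-dimensional k-means quantizer. The slide does not say which clustering method was used, so k-means is an assumption here, as are the function names and the deterministic initialization from evenly spaced order statistics.

```python
def kmeans_1d(values, k, iters=100):
    """Cluster scalar values into k groups and return the cluster centers."""
    vals = sorted(values)
    n = len(vals)
    # Deterministic start: evenly spaced order statistics of the data.
    centers = [vals[(2 * j + 1) * n // (2 * k)] for j in range(k)]
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for v in vals:
            nearest = min(range(k), key=lambda c: abs(v - centers[c]))
            clusters[nearest].append(v)
        updated = [sum(c) / len(c) if c else centers[j]
                   for j, c in enumerate(clusters)]
        if updated == centers:
            break
        centers = updated
    return centers

def quantize(values, centers):
    """Replace each value by its nearest cluster center."""
    return [min(centers, key=lambda c: abs(v - c)) for v in values]
```

Running `kmeans_1d` for several values of `k` and scoring each result by cross-validated error is one way to carry out Step 2.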

Clustering Analysis

[Figure: apparent RMSE and LOO CV RMSE plotted against the number of clusters (1 to 5); vertical axis from 380 to 520.]

Evaluating Experimental Result

[Figure: bar chart comparing initial RMSE, apparent RMSE, and LOO CV RMSE before clustering and after clustering; vertical axis from 380 to 540.]

Obtained Decision Tree

8 = 1 : 0 (10.0)
8 = 0 :
| 9 = 1 : 1 (58.0)
| 9 = 0 :
| | 7 = 1 : 1 (46.0)
| | 7 = 0 :
| | | 11 = 1 : 1 (11.0)
| | | 11 = 0 :
| | | | 1 = 0 : 2 (168.0/1.0)
| | | | 1 = 1 : 1 (10.0)
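The tree on this slide reads as a chain of tests on binary attributes, which translates directly into nested conditions. In this sketch the attributes numbered 8, 9, 7, 11 and 1 are assumed to be 0/1 indicators supplied in a dictionary, and the leaf labels 0, 1 and 2 are returned as-is; what the attributes and labels denote is not stated on the slide.

```python
def tree_class(attrs):
    """Follow the decision tree; attrs maps attribute number -> 0 or 1."""
    if attrs.get(8, 0) == 1:
        return 0
    if attrs.get(9, 0) == 1:
        return 1
    if attrs.get(7, 0) == 1:
        return 1
    if attrs.get(11, 0) == 1:
        return 1
    return 1 if attrs.get(1, 0) == 1 else 2
```

Only attribute 8 leads to label 0, and label 2 is the default when none of the five tested attributes is set.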

Clustered Intrinsic Values

[Figure: bar chart of the initial and obtained (clustered) intrinsic values for vegetation types 1 to 11; vertical axis from 0 to 6.]

Conclusion

• This talk described an approach to improving the predictive accuracy of an existing ecosystem model.
• In the experiments, we reduced the root mean squared error (RMSE) of the original model by 15 percent, as measured using cross validation.
• In the future, we will carry out further experiments along this direction.