Vegetation Science
Lecture 4
Non-Linear Inversion
Lewis, Disney & Saich
UCL
Introduction
• Previously considered forward models
  – Model reflectance/backscatter as a function of biophysical parameters
• Now consider model inversion
  – Infer biophysical parameters from measurements of reflectance/backscatter
Linear Model Inversion
• Dealt with last lecture
• Define RMSE
• Minimise wrt model parameters
• Solve for minimum
  – MSE is a quadratic function
  – Single (unconstrained) minimum

[Figure: RMSE as a quadratic surface over parameters P0 and P1, with a single minimum]
Issues
• Parameter transformation and bounding
• Weighting of the error function
• Using additional information
• Scaling
Parameter transformation and bounding
• Issue of variable sensitivity
  – E.g. saturation of LAI effects
  – Reduce by transformation
• Approximately linearise parameters
• Need to consider ‘average’ effects
Weighting of the error function
• Different wavelengths/angles have different sensitivity to parameters
• Previously, weighted all equally
  – Equivalent to assuming ‘noise’ equal for all observations

$$\mathrm{RMSE} = \sqrt{\frac{1}{N}\sum_{i=1}^{N}\left(\rho_i^{\mathrm{measured}} - \rho_i^{\mathrm{modelled}}\right)^2}$$

for N observations (wavelengths/angles) $\rho_i$.
Weighting of the error function
• Can ‘target’ sensitivity
  – E.g. to chlorophyll concentration
  – Use derivative weighting (Privette 1994)

$$\mathrm{RMSE} = \sqrt{\frac{1}{N}\sum_{i=1}^{N}\left(\frac{\partial \rho_i}{\partial P}\right)^{2}\left(\rho_i^{\mathrm{measured}} - \rho_i^{\mathrm{modelled}}\right)^{2}}$$
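As a concrete illustration, a minimal numpy sketch of a derivative-weighted error function; normalising by the summed weights is one plausible choice here, not necessarily the exact form of Privette (1994):

```python
import numpy as np

def weighted_rmse(measured, modelled, drho_dP):
    """RMSE with derivative weighting: observations where reflectance is
    most sensitive to the target parameter P contribute most to the error."""
    w = drho_dP ** 2                      # weight = squared sensitivity
    return np.sqrt(np.sum(w * (measured - modelled) ** 2) / np.sum(w))
```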
Using additional information
• Typically, for Veg Sci, use canopy growth model
  – See Moulin et al. (1998)
• Provides expectation of (e.g.) LAI
  – Need:
    • Planting date
    • Daily mean temperature
    • Varietal information (?)
• Use in various ways
  – Reduce parameter search space
  – Expectations of coupling between parameters
Scaling
• Many parameters scale approximately linearly
  – E.g. cover, albedo, fAPAR
• Many do not
  – E.g. LAI
• Need to (at least) understand impact of scaling
Crop Mosaic

[Figure: pixel mosaic of three crop patches with LAI 1, LAI 4 and LAI 0]

• Pixel comprises 20% LAI 0, 40% LAI 4 and 40% LAI 1
• ‘Real’ area-weighted LAI: 0.2×0 + 0.4×4 + 0.4×1 = 2.0
$$\rho(\mathrm{LAI}) = \rho_\infty\left(1 - e^{-\mathrm{LAI}/2}\right) + \rho_s\, e^{-\mathrm{LAI}/2}$$

visible: $\rho_\infty = 0.2$, $\rho_s = 0.1$; NIR: $\rho_\infty = 0.9$, $\rho_s = 0.3$
[Figure: canopy reflectance vs LAI for visible and NIR bands (reflectance 0 to 0.9), showing saturation with increasing LAI]
• Mean canopy reflectance over the pixel is 0.15 in the visible and 0.60 in the NIR
• If we assume the model above, this equates to an LAI of 1.4
• ‘Real’ answer: LAI 2.0 (see the sketch below)
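A minimal numeric sketch of this scaling effect, using the exponential reflectance model and parameter values above (variable names are illustrative):

```python
import math

def rho(lai, rho_inf, rho_s):
    """Canopy reflectance model from the slide."""
    return rho_inf * (1 - math.exp(-lai / 2)) + rho_s * math.exp(-lai / 2)

def invert_lai(r, rho_inf, rho_s):
    """Analytic inverse of the model for a single reflectance value."""
    return -2 * math.log((rho_inf - r) / (rho_inf - rho_s))

patches = [(0.2, 0.0), (0.4, 4.0), (0.4, 1.0)]   # (cover fraction, LAI)
for band, (rho_inf, rho_s) in {"visible": (0.2, 0.1), "NIR": (0.9, 0.3)}.items():
    r_pix = sum(f * rho(lai, rho_inf, rho_s) for f, lai in patches)
    print(band, round(r_pix, 2), "->", round(invert_lai(r_pix, rho_inf, rho_s), 2))
# visible 0.15 -> 1.39, NIR 0.6 -> 1.39: the inverted LAI (~1.4)
# underestimates the true area-weighted LAI of 2.0
```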
Options for Numerical Inversion
• Iterative numerical techniques
  – Quasi-Newton
  – Powell
• Knowledge-based systems (KBS)
• Artificial Neural Networks (ANNs)
• Genetic Algorithms (GAs)
• Look-up Tables (LUTs)
Local and Global Minima
• Need starting point
• How to go ‘downhill’?
• Bracketing of a minimum
• How far to go ‘downhill’?
Golden Mean Fraction: w = 0.38197

For a bracket a < b < c with new trial point x:
• w = (b-a)/(c-a)
• z = (x-b)/(c-a)
• Choose z + w = 1 - w, for symmetry
• Choose w = z/(1-w), to keep proportions the same at each stage
• Then z = w - w² = 1 - 2w, so w² - 3w + 1 = 0, giving w = (3-√5)/2 ≈ 0.38197

Slow, but sure (see the sketch below).
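A short sketch of golden-section search as described above; the bracket (a, b, c) must satisfy f(b) < f(a) and f(b) < f(c):

```python
import math

def golden_section(f, a, b, c, tol=1e-6):
    """Golden-section search for a minimum of f within bracket a < b < c."""
    w = (3 - math.sqrt(5)) / 2            # golden fraction, ~0.38197
    while abs(c - a) > tol:
        if (b - a) > (c - b):             # trial point in the larger interval
            x = b - w * (b - a)
            if f(x) < f(b):
                c, b = b, x               # new bracket (a, x, b)
            else:
                a = x                     # new bracket (x, b, c)
        else:
            x = b + w * (c - b)
            if f(x) < f(b):
                a, b = b, x
            else:
                c = x
    return b

print(golden_section(lambda p: (p - 2.0) ** 2, 0.0, 1.0, 8.0))   # ~2.0
```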
Parabolic Interpolation
• Inverse parabolic interpolation
  – More rapid
Brent’s method
• Require ‘fast’ but robust inversion
• Golden mean search
  – Slow but sure
  – Use in unfavourable areas
• Use parabolic method
  – when close to minimum
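Brent’s method is available in standard libraries; a sketch using scipy with a toy (hypothetical) error function:

```python
from scipy.optimize import minimize_scalar

def rmse(lai):                            # toy 1-D error function
    return (lai - 2.0) ** 2 + 0.1

# bracket gives three points (a, b, c) with rmse(b) below both ends;
# Brent alternates golden-section and parabolic steps automatically
res = minimize_scalar(rmse, bracket=(0.0, 1.0, 8.0), method='brent')
print(res.x)                              # ~2.0
```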
Multi-dimensional minimisation
• Use 1-D methods multiple times
  – In which directions?
• Some methods exist for N-D problems
  – Simplex (amoeba)
Downhill Simplex
• Simplex:
  – Simplest N-D body
  – N+1 vertices
• Simplex operations:
  – a reflection away from the high point
  – a reflection and expansion away from the high point
  – a contraction along one dimension from the high point
  – a contraction along all dimensions towards the low point
• Find way to minimum

[Figure: downhill simplex operations]
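The downhill simplex (amoeba) method is available as Nelder-Mead in scipy; a sketch on a toy two-parameter error surface (parameter names are illustrative):

```python
import numpy as np
from scipy.optimize import minimize

def rmse(p):                              # toy error surface over (LAI, chl)
    lai, chl = p
    return (lai - 2.0) ** 2 + 0.5 * (chl - 40.0) ** 2

# builds an N+1 vertex simplex around x0 and reflects/expands/contracts it
res = minimize(rmse, x0=np.array([1.0, 30.0]), method='Nelder-Mead')
print(res.x)                              # ~[2.0, 40.0]
```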
Direction Set (Powell's) Method
• Multiple 1-D minimisations along axes
  – Inefficient

[Figure: Powell’s method as successive 1-D minimisations along the coordinate axes]
Direction Set (Powell's) Method
• Use conjugate directions
  – Update primary & secondary directions
• Issues
  – Axis covariance

[Figure: Powell’s method with an updated (conjugate) direction set]
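Powell’s direction-set method is likewise available in scipy; the same toy error surface as before:

```python
from scipy.optimize import minimize

def rmse(p):                              # toy error surface over (LAI, chl)
    lai, chl = p
    return (lai - 2.0) ** 2 + 0.5 * (chl - 40.0) ** 2

# successive 1-D minimisations along a direction set that is updated
# towards conjugate directions; no derivatives required
res = minimize(rmse, x0=[1.0, 30.0], method='Powell')
print(res.x)                              # ~[2.0, 40.0]
```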
Simulated Annealing
• Previous methods:
  – Define start point
  – Minimise in some direction(s)
  – Test & proceed
• Issue:
  – Can get trapped in local minima
• Solution (?)
  – Need to restart from different point
Simulated Annealing
• Annealing
  – Thermodynamic phenomenon
  – ‘Slow cooling’ of metals or crystallisation of liquids
  – Atoms ‘line up’ & form ‘pure crystal’ / stronger (metals)
  – Slow cooling allows time for atoms to redistribute as they lose energy (cool)
  – Low energy state
• Quenching
  – ‘Fast cooling’
  – Polycrystalline state
Simulated Annealing
• Simulate ‘slow cooling’
• Based on Boltzmann probability distribution:

$$\Pr(E) \sim e^{-E/kT}$$

• k: constant relating energy to temperature
• System in thermal equilibrium at temperature T has distribution of energy states E
• All (E) states possible, but some more likely than others
• Even at low T, small probability that system may be in a higher energy state
Simulated Annealing
• Use analogy of energy to RMSE
• As ‘temperature’ decreases, move to generally lower energy state
• Boltzmann gives distribution of E states
  – So some probability of higher energy state, i.e. ‘going uphill’
  – Probability of ‘uphill’ decreases as T decreases
Implementation
• System changes from E1 to E2 with probability P = exp[-(E2-E1)/kT] (see the sketch below)
  – If E2 < E1, P > 1 (threshold at 1)
    • System will take this option
  – If E2 > E1, P < 1
    • Generate random number
    • System may take this option
    • Probability of doing so decreases with T
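A minimal sketch of this acceptance rule for a 1-D problem; the step size, cooling rate and iteration count are illustrative choices, and k is folded into T:

```python
import math, random

def anneal(cost, x0, step=0.5, t0=1.0, cooling=0.95, n_iter=500):
    """Simulated annealing: cost() plays the role of energy E,
    T is reduced geometrically each iteration (the annealing schedule)."""
    x, e = x0, cost(x0)
    best_x, best_e = x, e
    t = t0
    for _ in range(n_iter):
        x_new = x + random.uniform(-step, step)          # propose a move
        e_new = cost(x_new)
        # accept downhill always; uphill with probability exp[-(E2-E1)/T]
        if e_new < e or random.random() < math.exp(-(e_new - e) / t):
            x, e = x_new, e_new
            if e < best_e:
                best_x, best_e = x, e
        t *= cooling
    return best_x

# a 1-D cost with several local minima
print(anneal(lambda p: (p - 2.0) ** 2 + math.sin(5 * p), 8.0))
```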
Simulated Annealing

[Figure: acceptance test as a function of temperature T: move accepted (‘OK’) when exp[-(E2-E1)/kT] > rand(), rejected (‘X’) otherwise]
Simulated Annealing
• Rate of cooling very important
• Coupled with effects of k
  – exp[-(E2-E1)/kT], so doubling k is equivalent to halving T
• Used in a range of optimisation problems
• Not much used in remote sensing
(Artificial) Neural networks (ANN)
• Another ‘natural’ analogy
  – Biological NNs good at solving complex problems
  – Do so by ‘training’ system with ‘experience’
(Artificial) Neural networks (ANN)
• ANN architecture
(Artificial) Neural networks (ANN)
• ‘Neurons’ have one output but many inputs
  – Output is weighted sum of inputs
  – Threshold can be set
    • Gives non-linear response
(Artificial) Neural networks (ANN)
• Training (see the sketch below)
  – Initialise weights for all neurons
  – Present input layer with e.g. spectral reflectance
  – Calculate outputs
  – Compare outputs with e.g. biophysical parameters
  – Update weights to attempt a match
  – Repeat until all examples presented
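A minimal sketch of this training loop, using scikit-learn’s MLPRegressor in place of a hand-rolled network, with the two-band exponential model from earlier generating the training pairs (the architecture and sample sizes are illustrative):

```python
import numpy as np
from sklearn.neural_network import MLPRegressor

def forward(lai):
    """Two-band exponential canopy model: LAI -> (visible, NIR) reflectance."""
    vis = 0.2 - 0.1 * np.exp(-lai / 2)
    nir = 0.9 - 0.6 * np.exp(-lai / 2)
    return np.stack([vis, nir], axis=1)

rng = np.random.default_rng(0)
lai_train = rng.uniform(0, 8, 2000)
X, y = forward(lai_train), lai_train      # reflectance -> LAI examples

# train the inverse mapping; weights are updated until the examples match
ann = MLPRegressor(hidden_layer_sizes=(16, 16), max_iter=3000).fit(X, y)

print(ann.predict(forward(np.array([2.0]))))   # should be close to 2.0
```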
(Artificial) Neural networks (ANN)
• Use in this way for canopy model inversion
• Train other way around for forward model
• Also used for classification and spectral unmixing
  – Again, train with examples
• ANN has ability to generalise from input examples
• Definition of architecture and training phases critical
  – Can ‘over-train’: too specific
  – Similar to fitting a polynomial with too high an order
• Many ‘types’ of ANN, e.g. feedback/feedforward
(Artificial) Neural networks (ANN)
• In essence, a trained ANN is just a highly non-linear response function
• Training (definition of e.g. the inverse model) is performed as a separate stage to application of the inversion
  – Can use complex models for training
• Many examples in remote sensing
• Issue:
  – How to train for an arbitrary set of viewing/illumination angles? Not a solved problem
Genetic (or evolutionary) algorithms (GAs)
• Another ‘natural’ analogy
• Phrase optimisation as ‘fitness for survival’
• Description of state encoded through ‘string’ (equivalent to genetic pattern)
• Apply operations to ‘genes’
  – Cross-over, mutation, inversion
Genetic (or evolutionary) algorithms (GAs)
• E.g. of BRDF model inversion:
• Encode N-D vector representing current state of biophysical parameters as a string
• Apply operations:
  – E.g. mutation/mating with another string
  – See if mutant is ‘fitter to survive’ (lower RMSE)
  – If not, can discard (die)
Genetic (or evolutionary) algorithms (GAs)
• General operation:
  – Populate set of chromosomes (strings)
  – Repeat:
    • Determine fitness of each
    • Choose best set
    • Evolve chosen set, using crossover, mutation or inversion
  – Until a chromosome found of suitable fitness
Genetic (or evolutionary) algorithms (GAs)
• Differ from other optimisation methods
  – Work on coding of parameters, not parameters themselves
  – Search from a population set, not single members (points)
  – Use ‘payoff’ information (some objective function for selection), not derivatives or other auxiliary information
  – Use probabilistic transition rules (as with simulated annealing), not deterministic rules
Genetic (or evolutionary) algorithms (GAs)
• Example operation (see the sketch after these steps):
  1. Define genetic representation of state
  2. Create initial population, set t=0
  3. Compute average fitness of the set
     – Assign each individual a normalised fitness value
     – Assign probability based on this
  4. Using this distribution, select N parents
  5. Pair parents at random
  6. Apply genetic operations to parent sets
     – Generate offspring
     – Becomes population at t+1
  7. Repeat until termination criterion satisfied
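A minimal real-coded GA sketch along the lines of the steps above; truncation selection stands in for the normalised-fitness sampling of steps 3-4 for brevity, and population size, mutation rate etc. are illustrative:

```python
import random

def ga_minimise(cost, bounds, pop_size=30, n_gen=100, p_mut=0.1, sigma=0.2):
    """Evolve a population of parameter 'strings'; fitness = low cost (RMSE)."""
    dim = len(bounds)
    pop = [[random.uniform(lo, hi) for lo, hi in bounds] for _ in range(pop_size)]
    for _ in range(n_gen):
        parents = sorted(pop, key=cost)[:pop_size // 2]   # choose best set
        children = []
        while len(children) < pop_size:
            a, b = random.sample(parents, 2)              # pair parents at random
            cut = random.randrange(1, dim) if dim > 1 else 1
            child = a[:cut] + b[cut:]                     # cross-over
            for i, (lo, hi) in enumerate(bounds):         # mutation
                if random.random() < p_mut:
                    child[i] = min(hi, max(lo, child[i] + random.gauss(0, sigma)))
            children.append(child)
        pop = children                                    # population at t+1
    return min(pop, key=cost)

best = ga_minimise(lambda p: (p[0] - 2.0) ** 2 + (p[1] - 40.0) ** 2 / 100,
                   [(0, 8), (0, 80)])
print(best)                                               # ~[2.0, 40.0]
```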
Genetic (or evolutionary) algorithms (GAs)
– Flexible and powerful method
– Can solve problems with many small, ill-defined minima
– May take a huge number of iterations to solve
– Not applied to remote sensing model inversions
Knowledge-based systems (KBS)
• Seek to solve problem by incorporation of information external to the problem
• Only RS inversion example: Kimes et al. (1990; 1991)
  – VEG model
• Integrates spectral libraries, information from the literature, information from human experts, etc.
• Major problems:
  – Encoding and using the information
  – 1980s/90s ‘fad’?
LUT Inversion
• Sample parameter space
• Calculate RMSE for each sample point
• Define best fit as minimum RMSE parameters
  – Or function of set of points fitting to a certain tolerance
• Essentially a sampled ‘exhaustive search’ (see the sketch below)
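A minimal sketch of LUT inversion, using the two-band exponential model from earlier as the (pre-computed) forward model:

```python
import numpy as np

def forward(lai):
    """Two-band exponential canopy model: LAI -> (visible, NIR) reflectance."""
    return np.stack([0.2 - 0.1 * np.exp(-lai / 2),
                     0.9 - 0.6 * np.exp(-lai / 2)], axis=-1)

# offline: sample the parameter space and pre-calculate model outputs
lai_grid = np.linspace(0, 8, 801)
lut = forward(lai_grid)                          # shape (801, 2)

# inversion: find the LUT entry with minimum RMSE for a measurement
measured = np.array([0.15, 0.60])
rmse = np.sqrt(np.mean((lut - measured) ** 2, axis=1))
print(lai_grid[np.argmin(rmse)])                 # ~1.39 for this measurement
```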
LUT Inversion
• Issues:
  – May require a large sample set
  – Not so if function is well-behaved
    • True for many optical EO inversions
    • In some cases, may assume function is locally linear over a large (linearised) parameter range
    • Use linear interpolation
    • Being developed at UCL/Swansea for CHRIS-PROBA
  – May limit search space based on some expectation
    • E.g. some loose VI relationship, canopy growth model or land cover map
    • Approach used for operational MODIS LAI/fAPAR algorithm (Myneni et al.)
LUT Inversion
• Issues:
  – As operating on a stored LUT, can pre-calculate model outputs
    • Don’t need to calculate model ‘on the fly’ as in e.g. simplex methods
    • Can use complex models to populate LUT
      – E.g. Lewis, Saich & Disney using 3D scattering models (optical and microwave) of forest and crop
  – Error in inversion may be slightly higher with a (non-interpolated) sparse LUT
    • But may still lie within desirable limits
  – Method is simple to code and easy to understand
    • Essentially a sort operation on a table
Summary
• Range of options for non-linear inversion
• ‘Traditional’ NL methods: Powell, AMOEBA
  – Complex to code, though library functions available
  – Can easily converge to local minima
    • Need to start at several points
  – Calculate canopy reflectance ‘on the fly’
    • Need fast models, involving simplifications
  – Not felt to be suitable for operationalisation
Summary
• Simulated Annealing
  – Can deal with local minima
  – Slow
  – Need to define annealing schedule
• ANNs
  – Train ANN from model (or measurements)
  – ANN generalises as non-linear model
  – Issues of variable input conditions (e.g. VZA)
  – Can train with complex models
  – Applied to a variety of EO problems
Summary
• GAs
  – Novel approach, suitable for highly complex inversion problems
  – Can be very slow
  – Not suitable for operationalisation
• KBS
  – Use range of information in inversion
  – Kimes VEG model
  – Maximises use of data
  – Need to decide how to encode and use information
Summary
• LUT
  – Simple method
    • Sort
  – Used more and more widely for optical model inversion
    • Suitable for ‘well-behaved’ non-linear problems
  – Can operationalise
  – Can use arbitrarily complex models to populate LUT
  – Issue of LUT size
    • Can use additional information to limit search space
    • Can use interpolation for sparse LUT for ‘high information content’ inversion