How can we improve the infrared atmospheric correction algorithm?

SST Science Team Meeting Coconut Grove November 2011

Peter J Minnett Meteorology & Physical Oceanography

Sareewan DendamrongvitMiroslav Kubat

Department of Electrical & Computer Engineering

University of Miami

The NLSST has been used for over a decade, is very robust, and has been hard to improve upon.

Where next?Use advanced computational techniques: • Genetic Algorithm (GA)-based equation discovery to derive alternative forms of the correction algorithm

• Regression tree to identify geographic regions with related characteristics

• Support Vector Machines (SVM) to minimize error using state-of-the-art non-linear regression

Equation Discovery using Genetic Algorithms

• Darwinian principles are applied to algorithms that “mutate” between successive generations

• The algorithms are applied to large data bases of related physical variables to find robust relationships between them. Only the “fittest” algorithms survive to influence the next generation of algorithms.

• Here we apply the technique to the MODIS matchup-data bases.• The survival criterion is the size of the RMSE of the SST

retrievals when compared to buoy data.

Genetic Mutation of Equations• The initial population of formulae is created by a generator of random

algebraic expressions from a predefined set of variables and operators. For example, the following operators can be used: {+, -, /, ×, √, exp, cos, sin, log}. To the random formulae thus obtained, we can include “seeds” based on published formulae, such as those already in use.

• In the recombination step, the system randomly selects two parent formulae, chooses a random subtree in each of them, and swaps these subtrees.

• The mutation of variables introduces the opportunity to introduce different variables into the formula. In the tree that defines a formula, the variable in a randomly selected leaf is replaced with another variable.

Successive generations of algorithms

The formulae are represented by tree structures; the “recombination” operator exchanges random subtrees in the parents. Here the parent formulae (yx+z)/log(z) and (x+sin(y))/zy give rise to children formulae (sin(y)+z)/log(z) and (x+yx)/zy. The affected subtrees are indicated by dashed lines.

Subsets of the data set can be defined in any of the available parameter spaces.

(From Wickramaratna, K., M. Kubat, and P. Minnett, 2008: Discovering numeric laws, a case study: CO2 fugacity in the ocean. Intelligent Data Analysis, 12, 379-391.)

GA-based equation discovery

And the “fittest” is….The “fittest” algorithm takes the form:

where:Ti is the brightness temperature at λ= i µm

θs is the satellite zenith angle

θa is the angle on the mirror (a feature of the MODIS paddle-wheel mirror design)

Which looks similar to the NLSST:

Regression tree• Regions identified by the regression tree algorithm• The tree is constructed using

– input variables: latitude and longitude– output variable: Error in retrieved SST

• Algorithm recursively splits regions to minimize variance within them

• The obtained tree is pruned to the smallest tree within one standard error of the minimum-cost subtree, provided a declared minimum number of points is exceeded in each region

• Linear regression is applied separately to each resulting region (different coefficients result)

Regions Mk 2

Aqua MODIS SST (11, 12 µm). Daytime & night-time. Mean difference wrt buoys. Jan-Feb-Mar, 2007.

Regions Mk 2

Replicate data longitudinally in an attempt to avoid region boundaries at ±180o

Regions Mk 2

Aqua MODIS SST (11, 12 µm). Daytime & night-time. St. dev about the mean difference wrt buoys. Jan-Feb-Mar, 2007.

Genetic Algorithms & Regression Tree SST algorithms. Global uncertainties.

Aqua MODIS SST - Day & Night SST Day SST night SST4 night

Population* Mean [K] Sdev [K] Mean [K] Sdev [K] Mean [K] Sdev [K] Mean [K] Sdev [K]

Q1 0.50% 0.001 0.486 -0.002 0.510 0.000 0.450 0.003 0.384

Q2 0.50% 0.001 0.492 0.000 0.519 0.002 0.493 -0.001 0.376

Q3 0.50% 0.001 0.486 -0.003 0.521 0.001 0.424 0.003 0.348

Q4 0.50% 0.001 0.434 -0.001 0.452 0.000 0.406 0.000 0.342

Q1 2.00% -0.001 0.496 -0.002 0.519 -0.001 0.461 0.000 0.392

Q2 2.00% 0.001 0.522 0.000 0.536 0.002 0.509 0.001 0.378

Q3 2.00% 0.000 0.509 -0.003 0.545 0.003 0.430 0.002 0.356

Q4 2.00% 0.000 0.443 -0.001 0.465 0.000 0.410 0.001 0.347

*Minimum population as fraction of training set. 0.5% is ~100 for day or night; ~200 for day & night.

Results• The new algorithms with regions give smaller errors

than NLSST or SST4

• Tsfc term no longer required

• Night-time 4µm SSTs give smallest errors• Aqua SSTs are more accurate than Terra SSTs• Regression-tree induced in one year can be applied to

other years without major increase in uncertainties

Next steps• Can some regions be merged without unacceptable

increase in uncertainties?• Iterate back to GA for regions – different formulations

may be more appropriate in different regions.• Allow scan-angle term to vary with different channel

sets.• Introduce “regions” that are not simply geographical.• Suggestions?

Variants of the new algorithms

Note: No Tsfc

Coefficients are different for each equation

MODIS scan mirror effects

Mirror effects: two-sided paddle wheel has a multi-layer coating that renders the reflectivity in the infrared a function of wavelength, angle of incidence and mirror side.

Regression tree (cont.)

• Example of a regression tree

How can we improve the infrared atmospheric correction algorithm?

Documents

Seminar, May 16 th 2007, NASA GSFC Atmospheric Correction of MODIS Data in the Visible to Shortwave Infrared: Method, Error Estimates and Validation Eric

Atmospheric Correction of Satellite Ocean-Color Imagery

Atmospheric Correction of Master Imagery

A COMPARISON OF ATMOSPHERIC CORRECTION METHODS ON …

Refinement of atmospheric correction of absorbing aerosols

Atmospheric correction of thermal-infrared imagery of the ... · forms (Jacob et al., 2003; Lagouarde et al., 2004; Lagouarde and Irvine, 2008), the need for atmospheric correction

ATCOR3: Atmospheric Correction for Rugged Terrain

ENVI Atmospheric Correction Module User's Guide

Evaluation of Radiometric and Atmospheric Correction

Radiometric correction Noise removal Atmospheric correction Seasonal compensation

Fourth IRAM Millimeter Interferometry School 2004: Atmospheric phase correction 1 Atmospheric phase correction Jan Martin Winters IRAM, Grenoble

APPLYING TAFKAA FOR ATMOSPHERIC CORRECTION OF AVIRIS …

Microwave Atmospheric Refraction Correction Radiometer

Improved Atmospheric Correction Techniques for MASTER … · Improved Atmospheric Correction Techniques for MASTER and HyspIRI Thermal Infrared (TIR) Data Glynn Hulley, Topher Hughes,

Radiometric calibration and atmospheric correction

ENVI Atmospheric Correction Module User’s Guide · 2009. 10. 27. · 6 Chapter 1: Introduction to QUAC and FLAASH About Atmospheric Correction Atmospheric Correction Module User’s

Atmospheric / Topographic Correction for Airborne Imagery ... › pdf › atcor4_manual.pdf · Atmospheric / Topographic Correction for Airborne Imagery (ATCOR-4 User Guide, Version

AIRS (Atmospheric Infrared Sounder) Level 1B data

AN ATMOSPHERIC CORRECTION ALGORITHM FOR HYPERSPECTRAL IMAGERY

Geomatica Focus Atmospheric Correction · Geomatica 2013’s new Atmospheric Correction Wizard makes using DEMs, for the purpose of 3D atmospheric correction, easier than ever before