Adaptive Sampling with Topological Scores

Dan Maljovec1, Bei Wang1, Ana Kupresanin2, Gardar Johannesson2, Valerio Pascucci1, and Peer-Timo

Bremer2

1SCI Institute, University of Utah2Lawrence Livermore National Laboratory

Select training data & run simulation

Fit a Predicting Model (PM)

Select candidates & predict response values from PM

Score candidates & select candidate to add to training data

Framework:

Choosing the “Right” Points4 Models fit by using 10 training points:

True Response Model:

Space-filling SamplingNo prior knowledge of the dataset

Where should we sample the model?

Space-filling SamplingFill the domain space as evenly as possible

Space-filling SamplingObtain responses and fit model

Space-filling Sampling1 round:

Space-filling Sampling2 rounds:

Space-filling SamplingRefit after 5 rounds of “filling voids”:

Space-filling SamplingWhat have we learned?

Initial fit Refit after adding 5 points

Topologically-Inspired Adaptive Sampling

Starting over

1 round:

2 rounds:

3 rounds:

4 rounds:

5 rounds:

Refit after 5 rounds of choosing the most topologically significant point:

What have we learned?

Initial fit Refit after adding 5 points

Space-filling Method Adaptive Method

Comparison

Initial Fit

Space-filling Method Adaptive Method

Comparison

True Response Model

Measuring Topological SignificanceThese points were selected in topologically significant regions:

How do we measure topological impact?

Morse-Smale ComplexA partition of the domain into monotonic regions

Stable Manifolds Unstable Manifolds Morse-Smale Complex

Computing the Morse-Smale Complex

Traditionally:1. Compute steepest ascent/descent gradient from each point in

dataset2. Color points twice once by following ascending gradient to

maximum & another by descending gradient to minimum3. Intersect partitions to form Morse-Smale Complex

Computing the Morse-Smale ComplexTraditionally:

1.a. Compute steepest ascent gradient from each point in dataset2.a. Color points based on maximum where flow halts

Computing the Morse-Smale ComplexTraditionally:

1.b. Compute steepest descent gradient from each point in dataset2.b. Color points based on minimum where flow halts

Traditionally:3. Intersect partitions to form Morse-Smale Complex

Traditionally:1. Compute steepest ascent/descent gradient from each point in

dataset Assumes points have a neighborhood2. Color points twice once by following ascending gradient to

maximum & another by descending gradient to minimum3. Intersect partitions to form Morse-Smale Complex

Computing the Approximate Morse-Smale Complex

Modified Algorithm:1. Compute approximate KNN for a point2. Compute steepest ascent/descent gradient from each point’s

neighborhood in dataset3. Color points by gradient flows4. Intersect partitions to form Morse-Smale Complex

0 1 2 30

Tracking Persistence

Count the number of connected components belonging to the sub-level sets of the function

0 1 2 30

Every connected component has a birth

0 1 2 30

Every connected component has a birth

0 1 2 30

And every connected component has a death

0 1 2 30

Births and deaths of connected components are tied to critical points of the function

0 1 2 30

Births and deaths of connected components are tied to critical points of the function

0 1 2 30

A persistence diagram plots birth-death pairings

The function value difference between the critical point pairings is what we define as the persistence of a feature

In surfaces (and higher dimension corollaries), we have saddle points

Persistence Simplification in 2D

Persistence Simplification in 2DWe can smooth away noise features by thresholding

critical point pairs above a certain persistence level

Persistence Simplification in 2D

0 1 2 30

Bottleneck Distance

Comparing two similar functions persistence diagrams

0 1 2 30

Bottleneck Distance

Optimization: minimize the distance between pairs of points in the two plots including the line y=x in both pairs

Bottleneck distance – the maximum distance resulting from this pairing

Select training data & run simulation

Fit a Predicting Model (PM)

Select candidates & predict response values from PM

Score candidates & select candidate to add to training data

Framework:

Selecting Initial Training DataTests use Latin Hypercube Sampling

Fitting a Statistical ModelExperimented with:• Gaussian Process Model– Sparse Online Gaussian Process (C++)

• 2 Implementations of MARS with bootstrapping– Earth (CRAN)– MDA (CRAN)

• Neural Network with bootstrapping– NNet (CRAN)

Classical ScoringActive-Learning McKay (ALM)– Sample high-frequency or low-confidence regions

Delta– Distribute samples in the range space or areas of steep

gradientExpected Improvement (EI)– Select points with high uncertainty or large discrepancy

with existing dataDistance (*DP)– Scaling factor applied to above, creating 3 new scoring

functions (ALMDP, DeltaDP, EIDP)

Topological ScoringTOPOP– Average change in persistence of each point computed

by comparing the Morse-Smale before and after inserting a candidate

TOPOB– Bottleneck distance between persistence diagram of

Morse-Smale before and after inserting a candidate point

TOPOHP– Highest persistence critical point in Morse-Smale

complex constructed from combined training and predicted candidate data

Topological ScoringAverage Change in Persistence (TOPOP)– Average change in persistence of each point

computed by comparing the Morse-Smale before and after inserting a candidate

Topological ScoringAverage Change in Persistence (TOPOP)– Average change in persistence of each point

computed by comparing the Morse-Smale before and after inserting a candidate

Topological ScoringBottleneck Distance in Persistence (TOPOB)– Bottleneck distance between persistence diagram of

Topological ScoringHighest Persistence (TOPOHP)– Find highest persistence critical point in Morse-

Smale complex constructed from combined training and predicted candidate data

Initial Test Functions

Initial Test FunctionsDiagnosis:Little hope of modeling the complexity of some of these functions, especially in higher dimensions.

Test FunctionsAckley Diagonal 2 Max Diagonal 4 Max

2-Component GMM

5-Component GMM

Mis-Scaled Generalized

Rastrigin (MGR)

Salomon Generalized Rosenbrock

Comparing Model ResultsGPM Earth

NNet MDA

GPM Results

Moving ForwardLimitations

– Computation time of approximate K-nearest neighborhood is cost-prohibitive to build for every candidate point (TOPOB & TOPOP)• Morse-Smale computation is limited by number of extrema which need processed

– Topology is based on current prediction modelUnknowns

– Good parameter selection (k for KNN, parameters of prior)Future Work

– Integrate visualizations to better understand how points are being selected and gain intuition as to how to improve results

– Better neighborhood estimates– Investigate other means of measuring topological impact– Fitting local models based on topological partitioning– Using better models– Hybrid scoring or hybrid sampling techniques

Adaptive Sampling with Topological Scores

Documents

ADAPTIVE SAMPLING WITH TOPOLOGICAL SCORES · functions each with their own advantages and disadvantages. In this paper we present an extensive comparative study of adaptive sampling

TOPOLOGICAL MANIFOLDS - maths.ed.ac.ukv1ranick/haupt/siebicm.pdf · TOPOLOGICAL MANIFOLDS * by L. C. SIEBENMANN 0. Introduction. Homeomorphisms - topological isomorphisms - have repeatedly

Topological Insulators - Basics of Topological Insulators

Extending Topological Properties to Fuzzy Topological Spaces Adarbeh.pdf · Extending Topological Properties to Fuzzy Topological Spaces By ... Extending Topological Properties to

TOPOLOGICAL ORDER AND EMERGENCE - Site Disabledfaculty.poly.edu/~jbain/papers/17Top_Order_Emergence.pdf · Hall effect, topological insulators, and topological superconductors. These

Topological Superconductivity in a Planar Josephson Junctionyacoby.physics.harvard.edu/Publications/Topological... · topological superconductors as edge states in one dimen-sion

TOWARDS TOPOLOGICAL PATTERN DETECTION FLUID AND …kurlin.org/projects/pattern_detection_climate.pdf · 2018-09-30 · TOWARDS TOPOLOGICAL. . . TOWARDS TOPOLOGICAL PATTERN DETECTION

Majorana Fermions in Topological Insulators Fermions in Topological Insulators I. Introduction - Topological band theory and topological insulators II. Two Dimensions : Quantum Spin

Sampling real algebraic varieties for topological …Sampling real algebraic varieties for topological data analysis Emilie Dufresne Parker B. Edwardsy Heather A. Harringtonz Jonathan

Topological color code and symmetry-protected topological phases · 2015-07-06 · PHYSICAL REVIEW B 91, 245131 (2015) Topological color code and symmetry-protected topological phases

Continuum structural topological optimization with … structural topological optimization with dynamic stress . response ... to structure optimization, ... topological optimization

Topological Phases and Topological Insulators Manuel Asoreycud.unizar.es/sites/default/files/personal/pers_svilariño/Asorey.pdf · TOPOLOGICAL INSULATORS • 2D topological insulators

Topological Insulators versus Topological Dirac Semimetals

Topological Cr y stalline Insulators and Topological S

Topological insulators, topological …Topological...Topological insulators, topological superconductors and Weyl fermion semimetals: discoveries, perspectives and outlooks M Zahid

Section 319...the SHI, SMI and SFI scores had each improved to scores of 3, 3 and 2, respectively, for an average score of 2.7. Macroinvertebrate sampling showed an increase in diversity

Topological Descriptors of Histology Imagesnsingh/publications/nsingh2014topology_breast... · Topological Descriptors of Histology Images Nikhil Singh, Heather D. Couture, ... Topological

Topological invariants for interacting topological ... · PHYSICAL REVIEW B 93, 195164 (2016) Topological invariants for interacting topological insulators. II. Breakdown of single-particle

Topological Sort

Sampling conditions and topological guarantees for shape reconstruction algorithms