
Subsolutions of a Hamilton-Jacobi-Bellman Equation and the Design of Rare Event Simulation Methods

Paul Dupuis

Division of Applied Mathematics, Brown University

IMA

(A. Budhiraja, A. Buijsrogge, T. Dean, D. Johnson, K. Leder, D. Sezer, M. Snarski, K. Spiliopoulos, H. Wang)

May 2018

Outline

Examples

Monte Carlo and rare event considerations

Two methods: Importance sampling and splitting

Importance Functions

Some heuristics

Why Importance Functions should be subsolutions

Performance for schemes based on subsolutions

Remarks (construction of subsolutions, proofs, ...)

A situation where importance sampling and splitting differ greatly

Detailed exposition in Chapters 14-17 of

Representations and Weak Convergence Methods for the Analysis and Approximation of Rare Events, A. Budhiraja and D., Springer-Verlag, 2018.

Examples: General framework

Monte Carlo estimation of probabilities and expected values largely determined by rare events.

Stochastic processes with light-tailed random variables.

Typical examples: exit problems, risk-sensitive functionals, functionals of invariant distributions with simple structure.

Exploit a law of large numbers (LLN) scaling, continuous-time limit.

Example 1: Weighted serve-the-longer policy (wireless)

$p_n = P\{Q_i \text{ exceeds } c_i n \text{ for some } i = 1,2 \text{ before } Q = (0,0) \mid Q(0) = (1,0)\}.$

Standard large deviation scaling:

$X^n(t) = \frac{1}{n} Q(nt),$

$p_n = P\{X^n_i \text{ exceeds } c_i \text{ for some } i = 1,2 \text{ before } X^n = (0,0) \mid X^n(0) = (1/n, 0)\}.$

Example 2: Chemical reaction network and metastability

Rates for molecule types to react:

$A \xrightarrow{r_1} B, \qquad B \xrightarrow{r_2} A, \qquad 2B + A \xrightarrow{r_3} 3B.$

Suppose n molecules. If $(C^n_A(t), C^n_B(t))$ = (fraction type A, fraction type B) at time t, then $C^n_B(t) = 1 - C^n_A(t)$,

$C^n_A(t) \to C^n_A(t) - \frac{1}{n} \quad\text{at rate}\quad n\left[r_1 C^n_A(t) + r_3 C^n_A(t)\, C^n_B(t)^2\right],$

$C^n_A(t) \to C^n_A(t) + \frac{1}{n} \quad\text{at rate}\quad n\left[r_2 C^n_B(t)\right].$

Example 2: Chemical reaction network and metastability

LLN limit for $C^n_A$ as $n \to \infty$:

$\dot\gamma = -r_1\gamma + r_2(1-\gamma) - r_3\gamma(1-\gamma)^2,$

with two stable equilibria when $r_3 > 3(r_1 + r_2)$:

$p_n = P\{C^n_A(T) \text{ near stable point } a \mid C^n_A(0) \text{ near stable point } b\}$

and reverse rate.
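The scaled chain above is straightforward to simulate directly. A minimal Gillespie-style sketch in Python; the specific rate values are an illustrative assumption, chosen so that $r_3 > 3(r_1 + r_2)$ and the LLN limit is bistable:

import numpy as np

def simulate_CA(n, r1, r2, r3, T, c0, rng):
    # One path of the scaled chain C^n_A on [0, T], with the jump rates above:
    # C_A -> C_A - 1/n at rate n*(r1*C_A + r3*C_A*C_B**2),  C_B = 1 - C_A,
    # C_A -> C_A + 1/n at rate n*(r2*C_B).
    t, c = 0.0, c0
    while t < T:
        cb = 1.0 - c
        down = n * (r1 * c + r3 * c * cb ** 2)   # reactions consuming A
        up = n * (r2 * cb)                       # reactions producing A
        total = down + up
        if total <= 0.0:
            break
        t += rng.exponential(1.0 / total)        # exponential holding time
        c += (1.0 if rng.random() < up / total else -1.0) / n
    return c

rng = np.random.default_rng(0)
# Illustrative rates (an assumption) with r3 > 3*(r1 + r2):
print(simulate_CA(n=200, r1=0.1, r2=0.1, r3=1.0, T=10.0, c0=0.9, rng=rng))

Runs started near one stable point of the ODE rarely cross to the other for moderate n, which is exactly the metastable transition whose probability $p_n$ is targeted here.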

Example 3: Not-so-rare but high cost per sample (SPDE)

Spread of pollutant, with concentration $u^n(x,t)$, $x \in D \subset \mathbb{R}^d$, $t \in [0,T]$,

$u^n_t(x,t) = c\,u^n_{xx}(x,t) + \langle v(x), u^n_x(x,t)\rangle - \alpha u^n(x,t) + \frac{1}{n}\,\dot N(dx,dt)$

with boundary and initial conditions, and $N(dx,dt)$ spatial-temporal Poisson noise. Quantity of interest:

$p_n = P\{u^n(x_0,T) \ge u^*\}.$

However, the issue with sampling is the high cost per sample, and $p_n$ is small but not exceedingly so.

Here a moderate deviation approximation may be more useful. Similar issues arise with other systems with high sampling cost (e.g., mean field models).

Canonical model and quantities to be estimated

As a general discrete time Markov model, consider iid random vector fields $\{v_i(x), x \in \mathbb{R}^d\}$ with

$P\{v_i(x) \in A\} = \theta(A|x)$

and the process

$X^n_{i+1} = X^n_i + \frac{1}{n} v_i(X^n_i), \qquad X^n_0 = x.$

Continuous time interpolation:

$X^n(i/n) = X^n_i$; piecewise linear interpolation for $t \ne i/n$.
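A minimal sketch of the canonical model in Python, with Gaussian noise $v_i(x) \sim N(b(x), \sigma(x)^2)$ in one dimension; the Gaussian choice and the specific drift are assumptions made to match the diffusion example on the next slide:

import numpy as np

def sample_path(n, x0, b, sigma, steps, rng):
    # X^n_{i+1} = X^n_i + v_i(X^n_i)/n with v_i(x) ~ N(b(x), sigma(x)^2),
    # i.e. an Euler step of dX = b(X)dt + sqrt(1/n)*sigma(X)dW with dt = 1/n.
    x = np.empty(steps + 1)
    x[0] = x0
    for i in range(steps):
        v = b(x[i]) + sigma(x[i]) * rng.standard_normal()  # increment v_i(X^n_i)
        x[i + 1] = x[i] + v / n
    return x

rng = np.random.default_rng(1)
# Illustrative choice (an assumption): drift toward the origin, unit noise.
path = sample_path(n=100, x0=1.0, b=lambda x: -x, sigma=lambda x: 1.0, steps=500, rng=rng)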

Canonical model and quantities to be estimated

Continuous time models:

diffusion processes such as

$dX^\varepsilon = b(X^\varepsilon)\,dt + \sqrt{\varepsilon}\,\sigma(X^\varepsilon)\,dW$

correspond to the canonical model with

$\theta(\cdot|x) = N(b(x), \sigma(x)\sigma^T(x))$

(i.e., Euler approximation) with $\varepsilon = 1/n$.

continuous time pure jump processes (e.g., queueing models) do not need time discretization; the development is entirely analogous to the discrete time theory.

Canonical model and quantity to be estimated

Hitting probability: assume LLN trajectories are attracted to a point in A, and estimate

$p_n(x) = P\{X^n \text{ hits } B \text{ before } A \mid X^n(0) = x\}$

for $x \in (A \cup B)^c$. Essentially a finite time problem.

Monte Carlo and rare event considerations

Define

$H(y,\alpha) = \log E\, e^{\langle \alpha, v_i(y)\rangle}, \qquad L(y,\beta) = \sup_{\alpha \in \mathbb{R}^d}\left[\langle \alpha, \beta\rangle - H(y,\alpha)\right];$

assume $H(y,\alpha) < \infty$ for all $\alpha \in \mathbb{R}^d$.

Under conditions, $\{X^n(\cdot)\}$ satisfies a Large Deviation Principle with rate function

$I_T(\phi) = \int_0^T L(\phi, \dot\phi)\,dt$

if $\phi$ is AC and $\phi(0) = x$, and $I_T(\phi) = \infty$ otherwise.

Heuristically, for $T < \infty$, given $\phi$, small $\delta > 0$ and large n,

$P\left\{\sup_{0 \le t \le T} \|X^n(t) - \phi(t)\| \le \delta\right\} \approx e^{-n I_T(\phi)}.$
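For intuition, in the Gaussian case $\theta(\cdot|y) = N(b(y), a(y))$ with $a(y) = \sigma(y)\sigma^T(y)$ (the diffusion example above), both transforms are explicit by a standard computation:

$H(y,\alpha) = \langle \alpha, b(y)\rangle + \tfrac{1}{2}\langle \alpha, a(y)\alpha\rangle, \qquad L(y,\beta) = \tfrac{1}{2}\langle \beta - b(y),\, a(y)^{-1}(\beta - b(y))\rangle,$

so $I_T(\phi)$ penalizes the squared deviation of $\dot\phi$ from the drift b, as in Freidlin-Wentzell theory.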

Monte Carlo and rare event considerations

Hitting probability:

$-\frac{1}{n}\log p_n(x) \to \inf\{I_T(\phi) : \phi(0) = x,\ \phi \text{ enters } B \text{ prior to } A \text{ before } T,\ T < \infty\}.$

Let

$\mathcal{T}_{B,A} = \{\text{trajectories that hit } B \text{ prior to } A\},$

$r(x) = \inf\{I_T(\phi) : \phi(0) = x,\ \phi \text{ enters } B \text{ prior to } A \text{ before } T,\ T < \infty\}.$

Monte Carlo and rare event considerations

For standard Monte Carlo we average iid copies of $1_{\{X^n \in \mathcal{T}_{B,A}\}}$. One needs $K \approx e^{n r(x)}$ samples for bounded relative error [std dev/$p_n(x)$].

Alternative approach: construct iid random variables $s^n_1, \ldots, s^n_K$ with $E s^n_1 = p_n(x)$ and use the unbiased estimator

$\hat q_{n,K}(x) := \frac{s^n_1 + \cdots + s^n_K}{K}.$

Performance is determined by the variance of $s^n_1$ and, since the estimator is unbiased, by $E(s^n_1)^2$. By Jensen's inequality,

$-\frac{1}{n}\log E(s^n_1)^2 \le -\frac{2}{n}\log E s^n_1 = -\frac{2}{n}\log p_n(x) \to 2r(x).$

An estimator is called asymptotically efficient if

$\liminf_{n \to \infty} -\frac{1}{n}\log E(s^n_1)^2 \ge 2r(x).$
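The sample-size count for crude Monte Carlo is visible in a few lines of Python; the Bernoulli stand-in for the rare indicator is an assumption for illustration only:

import numpy as np

def crude_mc(sample_indicator, K, rng):
    # Average K iid copies of the indicator 1_{X^n in T_{B,A}}.
    # Relative error ~ 1/sqrt(K * p_n), so K must grow like exp(n r(x)).
    return sum(sample_indicator(rng) for _ in range(K)) / K

rng = np.random.default_rng(2)
# Stand-in for the rare indicator with p_n = 1e-3; K = 1000/p_n gives
# roughly 3% relative error, and the needed K scales like 1/p_n.
p_hat = crude_mc(lambda r: float(r.random() < 1e-3), K=1_000_000, rng=rng)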

Two methods: Importance sampling and splitting

Two main methods (to date) for construction of random variables $s^n_i$ with $E s^n_i = p_n(x)$: importance sampling (IS) and splitting.

Idea of importance sampling. Simulate under a different distribution for which the event is not rare; correct using the likelihood ratio to make the estimator unbiased.

Idea of splitting. Many variations. Simplest is to trigger splits in such a way that the rare event is encouraged. Divide the unit mass of the original particle among descendants to maintain unbiasedness.

Importance sampling

Tilted distributions. Recall

$X^n_{i+1} = X^n_i + \frac{1}{n} v_i(X^n_i), \qquad X^n_0 = x$

and $P\{v_i(y) \in dz\} = \theta(dz|y)$. Consider the exponential tilt with tilt parameter $\alpha$ and

$\theta^\alpha(dz|y) = e^{\langle \alpha, z\rangle - H(y,\alpha)}\,\theta(dz|y).$

Construct $\bar X^n_i$ recursively by setting

$\bar X^n_{i+1} = \bar X^n_i + \frac{1}{n}\bar Z^n_i, \qquad \bar X^n_0 = x,$

where $P\{\bar Z^n_i \in dz \mid \text{data till time } i\} = \theta^{\bar\alpha^n_i}(dz|\bar X^n_i)$, $\bar\alpha^n_i = F^n_i(\bar X^n_0, \ldots, \bar X^n_i)$.

Likelihood ratio up to time n, new distribution with respect to old:

$\prod_{i=0}^{n-1} e^{\langle \bar\alpha^n_i, \bar Z^n_i\rangle - H(\bar X^n_i, \bar\alpha^n_i)}.$
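In the Gaussian case the tilt is explicit: if $\theta(\cdot|y) = N(b(y), \sigma(y)^2)$ in one dimension, then $\theta^\alpha(\cdot|y) = N(b(y) + \sigma(y)^2\alpha,\ \sigma(y)^2)$, a standard computation. A small sketch:

import numpy as np

def H_gauss(y, alpha, b, sigma):
    # H(y, alpha) = alpha*b(y) + 0.5*sigma(y)^2*alpha^2 for theta(.|y) = N(b(y), sigma(y)^2).
    return alpha * b(y) + 0.5 * (sigma(y) * alpha) ** 2

def sample_tilted(y, alpha, b, sigma, rng):
    # theta^alpha(dz|y) = exp(alpha*z - H(y, alpha)) theta(dz|y)
    #                   = N(b(y) + sigma(y)^2 * alpha, sigma(y)^2).
    return b(y) + sigma(y) ** 2 * alpha + sigma(y) * rng.standard_normal()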

Importance sampling

Let $\bar N^n = \min\{i : \bar X^n_i \in A \cup B\}$ and set

$s^n = 1_{\{\bar X^n \in \mathcal{T}_{B,A}\}} \prod_{i=0}^{\bar N^n - 1} e^{-\langle \bar\alpha^n_i, \bar Z^n_i\rangle + H(\bar X^n_i, \bar\alpha^n_i)}.$

Recall that performance is measured by

$E(s^n)^2 = E_x\left[1_{\{\bar X^n \in \mathcal{T}_{B,A}\}} \prod_{i=0}^{\bar N^n - 1} e^{-2\langle \bar\alpha^n_i, \bar Z^n_i\rangle + 2H(\bar X^n_i, \bar\alpha^n_i)}\right],$

which in terms of the original random variables, with $\alpha^n_i = F^n_i(X^n_0, \ldots, X^n_{i-1})$, is

$E(s^n)^2 = E_x\left[1_{\{X^n \in \mathcal{T}_{B,A}\}} \prod_{i=0}^{N^n - 1} e^{-\langle \alpha^n_i, v_i(X^n_i)\rangle + H(X^n_i, \alpha^n_i)}\right].$

Potential problem: the exponential in the last display is dangerous.
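Putting the pieces together for the one-dimensional Gaussian model, a hedged sketch of one IS sample; alpha_fn (the state-feedback tilt), the sets A and B, and the path truncation are illustrative assumptions:

import numpy as np

def is_sample(n, x0, b, sigma, alpha_fn, in_A, in_B, rng, max_steps=10 ** 6):
    # One sample s^n = 1_{hit B before A} * prod_i exp(-alpha_i*Z_i + H(X_i, alpha_i)),
    # simulating X under the state-dependent Gaussian tilt alpha_i = alpha_fn(X_i).
    x, log_lr = x0, 0.0
    for _ in range(max_steps):
        if in_A(x):
            return 0.0                    # hit A first: the indicator vanishes
        if in_B(x):
            return float(np.exp(log_lr))  # indicator times inverse likelihood ratio
        a = alpha_fn(x)
        s2 = sigma(x) ** 2
        z = b(x) + s2 * a + sigma(x) * rng.standard_normal()  # Z ~ theta^a(.|x)
        log_lr += -a * z + a * b(x) + 0.5 * s2 * a * a        # -<a,Z> + H(x,a)
        x += z / n
    return 0.0  # truncate very long paths (a slight bias in this sketch)

Averaging K such samples gives the unbiased estimator from earlier; the accumulated exponent in log_lr is exactly the dangerous exponential in the last display.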

Splitting

Start a particle at the initial condition of interest, then branch in a way that encourages the outcome of the rare event. Key issues:

When to branch?

How much to branch?

How to form an unbiased estimate?

Splitting

A certain number [proportional to n] of splitting thresholds $C^n_j$ are defined which enhance migration.

A single particle is started at x that follows the same law as $X^n$, but branches into a number of independent copies each time a new level is reached.

Splitting

For simplicity assume the number of new particles M is deterministic. At each branching a multiplicative weight $1/M$ is assigned to each descendant.

Splitting

Evolution continues until every particle has reached either A or B. Let

$R^n_x$ = total number of particles generated,
$X^n_j(t)$ = trajectory of the jth particle,
$W^n$ = product of weights assigned to particle j along its path.

Then

$s^n = \sum_{j=1}^{R^n_x} 1_{\{X^n_j \in \mathcal{T}_{B,A}\}} W^n.$

If $K^n$ thresholds, then

$s^n = W^n \times (\#\text{ of particles reaching } B \text{ before } A) = \frac{1}{M^{K^n}} \sum_{j=1}^{R^n_x} 1_{\{X^n_j \in \mathcal{T}_{B,A}\}}.$

How to choose the thresholds $C^n_r$, M, and the weights?
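A hedged sketch of the threshold-splitting estimator in Python; step, level, and the per-particle lifetime cap are illustrative assumptions: step(x, rng) advances one step of $X^n$ under its original law, and level(x) in {0, ..., K_n} counts thresholds crossed, with level K_n identified with reaching B:

def splitting_sample(x0, step, level, in_A, M, K_n, rng, max_steps=10 ** 5):
    # One splitting sample: each particle follows the original law; on first
    # crossing of a new threshold it branches into M copies (weight 1/M each),
    # so every particle reaching B carries weight M**(-K_n).
    stack = [(x0, level(x0))]       # (state, highest threshold already credited)
    hits = 0
    while stack:
        x, j = stack.pop()
        for _ in range(max_steps):  # cap per-particle lifetime (sketch only)
            x = step(x, rng)
            if in_A(x):
                break               # particle absorbed in A, its weight is lost
            while j < level(x):     # may cross several thresholds in one step
                j += 1
                if j == K_n:
                    hits += 1       # reached B
                    break
                stack.extend([(x, j)] * (M - 1))  # M-1 fresh copies; this one continues
            if j == K_n:
                break
    return hits / M ** K_n

If the levels come from a subsolution V as on the slides below, the expected number of offspring reaching each new threshold stays borderline (about 1), which is the subexponential-population condition appearing in the splitting theorem later.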

Remarks

First use of IS in a rare event context was Siegmund (1976).

Fixed rate splitting schemes begin with Kahn and Harris (1951), further developed in Booth and Hendricks (1984). Schemes have different names in different communities (e.g., forward flux in chemistry).

Some obvious inefficiencies lead to the RESTART variant of Villén-Altamirano and Villén-Altamirano (1994), which allows particles of little likely use to be terminated without introducing bias. Harder to analyze, but it leads to the same theoretical bounds and is preferable for several reasons.

Importance Functions

Natural conceptual framework for design is through importance functions. Let $V : \mathbb{R}^d \to \mathbb{R}$ be continuous and satisfy

$V(x) \le 0 \text{ for } x \in B.$

Design of scheme:

for importance sampling, and assuming V continuously differentiable, the change of measure when the simulated trajectory is at $\bar X^n_i$ is

$\theta^{-DV(\bar X^n_i)}(dz|y) = e^{\langle -DV(\bar X^n_i), z\rangle - H(y, -DV(\bar X^n_i))}\,\theta(dz|y);$

for splitting, assuming V is continuous, thresholds are defined by

$C^n_i = \{y : V(y) \le i (\log M)/n\},$

if there are M descendants at each splitting time.

*The first examples showing that state feedback was essential for even good (better than naive MC) performance are in Glasserman and Wang (1997).
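The two design rules translate directly into code. A sketch in Python; grad_V and V are user-supplied, and everything here is an assumption apart from the two formulas on the slide:

import numpy as np

def is_tilt(grad_V):
    # Importance sampling: at state x use tilt parameter alpha(x) = -DV(x).
    return lambda x: -grad_V(x)

def in_threshold(V, n, M):
    # Splitting: membership test for C^n_i = {y : V(y) <= i * log(M) / n}.
    return lambda y, i: V(y) <= i * np.log(M) / n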

Some heuristics

Recall

Recall

$r(x) = \lim_{n\to\infty} -\frac{1}{n}\log p_n(x) = \inf\left\{\int_0^T L(\phi(s), \dot\phi(s))\,ds : \phi(0) = x,\ \phi \text{ hits } B \text{ before } A,\ T < \infty\right\}.$

Generalize to an arbitrary starting position y. Then $r(y)$ satisfies a dynamic programming relation: for $\delta > 0$,

$r(y) = \inf\left\{\int_0^\delta L(\phi(s), \dot\phi(s))\,ds + r(\phi(\delta)) : \phi(0) = y\right\}.$

Some heuristics

This implies $r(y)$ is a weak sense (viscosity) solution to the Hamilton-Jacobi-Bellman equation

$\mathbb{H}(y, Dr(y)) = 0 \text{ for } y \in (A \cup B)^c, \qquad r(y) = 0 \text{ for } y \in B,$

where $\mathbb{H}(y, \alpha) = -H(y, -\alpha)$.

Some heuristics

If we could use r as the importance function, the problems with splitting and importance sampling would be essentially solved!

Splitting: With thresholds defined by r,

$P\{\text{particle born at threshold } j \text{ reaches } j-1\} \approx e^{-n \inf I(\phi)} \approx e^{-n[r(C^n_j) - r(C^n_{j-1})]} = e^{-n[\log M / n]} = \frac{1}{M},$

where the inf is over paths connecting any point in $C^n_j$ to $C^n_{j-1}$. Critical (borderline) growth, and weights $M^{-K^n} = M^{-n[r(x)/\log M]} = e^{-n r(x)}$.

Some heuristics

Importance sampling: With tilts defined by

$\theta^{-Dr(\bar X^n_i)}(dz|y) = e^{\langle -Dr(\bar X^n_i), z\rangle - H(y, -Dr(\bar X^n_i))}\,\theta(dz|y),$

the likelihood ratio along any trajectory satisfies

$1_{\{\bar X^n_{\bar N^n} \in B\}} \prod_{i=0}^{\bar N^n - 1} e^{\langle Dr(\bar X^n_i), \bar Z^n_i\rangle + H(\bar X^n_i, -Dr(\bar X^n_i))}$

$= 1_{\{\bar X^n_{\bar N^n} \in B\}} \prod_{i=0}^{\bar N^n - 1} e^{n\langle Dr(\bar X^n_i), [\bar X^n_{i+1} - \bar X^n_i]\rangle - \mathbb{H}(\bar X^n_i, Dr(\bar X^n_i))}$

$= 1_{\{\bar X^n_{\bar N^n} \in B\}}\, e^{n \sum_{i=0}^{\bar N^n - 1} \langle Dr(\bar X^n_i), [\bar X^n_{i+1} - \bar X^n_i]\rangle}$

$\approx 1_{\{\bar X^n_{\bar N^n} \in B\}}\, e^{n r(\bar X^n_{\bar N^n}) - n r(x)} = 1_{\{\bar X^n_{\bar N^n} \in B\}}\, e^{-n r(x)},$

using $\bar Z^n_i = n[\bar X^n_{i+1} - \bar X^n_i]$, $\mathbb{H}(y, Dr(y)) = 0$, and $r = 0$ on B.

Some heuristics

While possible in special cases, r is typically not available. However, it turns out that the critical properties of an importance function V are:

it should be a subsolution to the PDE,

its value at the starting point,

it should be $C^1$ for importance sampling and a viscosity subsolution for splitting.

Why Importance Functions should be subsolutions

Let

$\mathbb{H}(y, \alpha) = -H(y, -\alpha).$

A classical sense subsolution V for the escape probability is smooth ($C^1$) and satisfies

$\mathbb{H}(y, DV(y)) \ge 0 \text{ for } y \in (A \cup B)^c, \qquad V(y) \le 0 \text{ for } y \in B.$

A viscosity sense subsolution V satisfies the boundary inequalities and $\mathbb{H}(y, p) \ge 0$ for any superdifferential p of V at $y \in (A \cup B)^c$ (V need not be smooth). Analogous definitions for other problems (e.g., finite time probabilities, functionals).

Why Importance Functions should be subsolutions

A constraint on importance functions. If V is used as an importance function and V is not a subsolution, bad (exponentially bad) things happen.

For importance sampling we have an expression for the second moment when V is used:

$E(s^n)^2 = E_x\left[1_{\{X^n \in \mathcal{T}_{B,A}\}} \prod_{i=0}^{N^n - 1} e^{\langle DV(X^n_i), v_i(X^n_i)\rangle + H(X^n_i, -DV(X^n_i))}\right].$

In regions where $H(x, -DV(x)) > 0$, large deviation analysis shows a decay rate strictly smaller than twice that of $p_n(x)$. Asymptotic efficiency is impossible.

For splitting, in regions where $H(x, -DV(x)) > 0$ the mean number of particles reaching the next threshold is strictly larger than 1: exponentially many particles, and a "work-normalized" asymptotic efficiency is impossible.

Performance for schemes based on subsolutions

Theorem

Let $s^n$ be the splitting estimate for the escape probability $p_n(x)$ based on V, using either standard splitting or RESTART. Then the number of particles generated grows subexponentially in n if and only if V is a viscosity subsolution, in which case we also have

$\liminf_{n\to\infty} -\frac{1}{n}\log E(s^n)^2 \ge V(x) + r(x).$

Under additional regularity, lim inf and $\ge$ become lim and $=$.

With a strict subsolution, one can bound the mean number of particles uniformly in n.

Performance for schemes based on subsolutions

Theorem

Let V be a classical subsolution and $s^n$ the importance sampling estimate for the escape probability $p_n(x)$ based on V. Then

$\liminf_{n\to\infty} -\frac{1}{n}\log E(s^n)^2 \ge V(x) + r(x).$

Under additional regularity, lim inf becomes lim, with a RHS bounded below by $V(x) + r(x)$.

Proofs of these and other results are found in the Springer book.

Performance for schemes based on subsolutions

Requirements for asymptotic optimality. Asymptotic optimality means $V(x) = r(x)$ (note $V(x) \le r(x)$ is automatic). The equation holds with equality along the conditional most likely path starting at x.

Analogous results hold for finite time problems, expected values, and other (but not all) problem formulations.

Construction of subsolutions

Various methods depending on structure of problem (see the references):

For systems with $H(y, \alpha)$ of piecewise constant structure (e.g., queueing networks): a direct construction based on the pointwise minimum of affine functions using critical roots of $H(y, \alpha) = 0$.

Construction as a pointwise minimum of solutions to boundary/terminal conditions admitting explicit solutions (useful for occupancy and related combinatorial problems).

Construction in terms of the solution to a linear quadratic regulator (in particular useful for moderate deviations approximations).

For finite time problems one can be constructed in terms of the Freidlin-Wentzell quasipotential. Available in explicit form for reversible (or asymptotically reversible) dynamics. Useful when the time interval is large for splitting, and for importance sampling if there is no "rest point".

Remarks on proofs

Importance sampling. Estimate

$1_{\{\bar X^n_{\bar N^n} \in B\}} \prod_{i=0}^{\bar N^n - 1} e^{\langle DV(\bar X^n_i), \bar Z^n_i\rangle + H(\bar X^n_i, -DV(\bar X^n_i))}.$

Express the second moment of the estimator as an exponential integral with respect to the original distributions.

Use the same method as in the LD analysis to write a stochastic control representation for the log of the second moment.

Use a classical verification argument to bound the representation (hence the decay of the second moment).

Remarks on proofs

Splitting. Estimate based on splitting rate M and $K^n$ thresholds:

$s^n = \frac{\#\text{ particles reaching } B \text{ before } A}{M^{K^n}}, \qquad E(s^n)^2 = E\left(\sum_{j=1}^{R^n_x} 1_{\{X^n_j \in \mathcal{T}_{B,A}\}} M^{-K^n}\right)^2.$

Partition the expectation into contributions from pairs of particles with last common ancestor at threshold $\ell$.

Bound each such contribution using large deviation bounds (one particle gets to $C^n_\ell$, two independent particles go from $C^n_\ell$ to B), the decay of $M^{-K^n}$, and dynamic programming inequalities on r and V, to get an upper bound of the form

$e^{-n[r(x) + V(x)]}.$

Summary: distinctions between importance sampling and splitting

Subsolutions allow one to characterize sufficient (and also necessary) conditions for performance measures for importance sampling and splitting, but there are important differences.

The differences are largely related to the regularity required of the subsolution [classical for importance sampling, viscosity for splitting], and how it is used to define the scheme [$D\bar V$ versus $\bar V$].

Subsolutions are often naturally constructed as the pointwise min of smooth subsolutions. It turns out that if $V = \min_{k=1,\ldots,K} V_k$ satisfies the boundary condition $V(x) \le 0$ for $x \in B$ and each $V_k$ is a subsolution to the PDE, $\mathbb{H}(y, DV_k(y)) \ge 0$, then one can use exponential mollification to produce a classical subsolution: for $\delta > 0$,

$V^\delta(x) = -\delta \log\left(\sum_{k=1}^K e^{-\frac{1}{\delta} V_k(x)}\right),$

with only a small change in value at the starting point.
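The mollification is one log-sum-exp. A small Python sketch, using the standard bounds $V - \delta\log K \le V^\delta \le V$, so the loss at the starting point is at most $\delta \log K$:

import numpy as np

def mollify(Vs, delta):
    # V^delta(x) = -delta * log( sum_k exp(-V_k(x)/delta) ), computed as a
    # numerically stable log-sum-exp around the minimum V(x) = min_k V_k(x).
    def V_delta(x):
        vals = np.array([V(x) for V in Vs])
        m = vals.min()
        return m - delta * np.log(np.sum(np.exp(-(vals - m) / delta)))
    return V_delta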

Summary: distinctions between importance sampling and splitting

When one goes to the trouble to make importance sampling work, it generally (but not always) performs somewhat better than splitting.

For Markov modulated noise (not covered in this talk) and other multiscale problems, IS requires the solution of an eigenvalue problem for each tilt parameter used. Although splitting does not explicitly, the Hamiltonian often must be evaluated. Possible advantage to splitting.

Under the condition $\sup_x E e^{\delta \|v_i(x)\|^2} < \infty$ for some $\delta > 0$, one can prove non-asymptotic bounds for importance sampling. The analogue is not known for splitting.

A situation where the two differ greatly

Although often comparable, recent results show there can be truly significant differences. In particular, when a metastable point is in the domain of simulation:

$P_x\{X^n \text{ hits } B \text{ before } A \text{ by } T\}$ vs. $P_x\{X^n \text{ hits } B \text{ by } T\}$

A situation where the two differ greatly

Treatment in neighborhoods of metastable points is difficult.

When T is large (e.g., transition rate calculations) the Freidlin-Wentzell quasipotential can be used to define a subsolution with nearly optimal value at the starting point (optimal in the limit $T \to \infty$). With

$Q(x) = \inf\left\{\int_0^T L(\phi(s), \dot\phi(s))\,ds : \phi(0) = 0,\ \phi(T) = x,\ T < \infty\right\},$

$Q + c$ is a viscosity subsolution for any constant c.

A situation where the two di�er greatly

For problems such as $P\{X^n \text{ hits } B \text{ by } T(n) \mid X^n(0) = x\}$, with $T(n) \to \infty$ as $n \to \infty$, there are qualitative differences such as

$\lim_{n\to\infty} -\frac{1}{n}\log E\left(s^n_{\text{splitting}}\right)^2 = 2V(0)$

versus

$\lim_{n\to\infty} -\frac{1}{n}\log E\left(s^n_{\text{IS}}\right)^2 = -\infty.$

The source of variance in the latter is paths under the change of measure which stay near 0 for long times.

References

Papers introducing methods

Importance sampling in the Monte Carlo study of sequential tests (D. Siegmund), Ann. Statist., 4, (1976), 673-684.

Estimation of particle transmission by random sampling (H. Kahn and T.E. Harris), National Bureau of Standards Applied Mathematics Series, 12, (1951), 27-30.

Importance estimation in forward Monte Carlo calculations (T. Booth and J. Hendricks), Nucl. Tech./Fusion, 6, (1984), 90-100.

Paper introducing RESTART

RESTART: A method for accelerating rare event simulations (M. Villén-Altamirano and J. Villén-Altamirano), Proc. of the 13th International Teletraffic Congress, Queueing, Performance and Control in ATM, (1991), 71-76.

Paper showing state feedback essential for importance sampling

Counter examples in importance sampling for large deviations probabilities (P. Glasserman and Y. Wang), Ann. Appl. Prob., 7, (1997), 731-746.

References

Papers developing subsolutions for importance sampling

Importance sampling, large deviations and differential games (D. and H. Wang), Stochastics and Stochastics Reports, 76, (2004), 481-508.

Subsolutions of an Isaacs equation and efficient schemes for importance sampling (D. and H. Wang), Math. of OR, 32, (2007), 1-35.

Importance sampling for Jackson networks (D. and H. Wang), Queueing Systems, 62, (2009), 113-157.

Large deviations and importance sampling for a tandem network with slow-down (D., K. Leder and H. Wang), QUESTA, 57, (2007), 71-83.

Papers developing subsolutions for splitting

Splitting for rare event simulation: A large deviations approach to design and analysis (T. Dean and D.), SPA, 119, (2009), 562-587.

The design and analysis of a generalized RESTART/DPR algorithm for rare event simulation (T. Dean and D.), Annals of OR, 189, (2011), 63-102.

References

Papers that analyse neighborhoods of metastable points

Escaping from an attractor: Importance sampling and rest points I (D., K. Spiliopoulos and X. Zhou), Annals of Applied Probability, 25, (2015), 2909-2958.

Splitting algorithms for rare event simulation over long time intervals (A. Buijsrogge, D. and M. Snarski), preprint.
