6
Path Constraints in Tychastic and Unscented Optimal Control: Theory, Application and Experimental Results I. Michael Ross, Mark Karpenko, and Ronald J. Proulx Abstract— In recent papers, we have shown that a Lebesgue- Stieltjes optimal control theory forms the foundations for un- scented optimal control. In this paper, we further our results by incorporating uncertain mixed state-control constraints in the problem formulation. We show that the integrated Hamiltonian minimization condition resembles a semi-infinite type math- ematical programming problem. The resulting computational difficulties are mitigated through the use of the unscented transform; however, the price of this approximation is a solution to a chance-constrained optimal control problem whose risk level is determined a posteriori. Experimental results conducted at Honeywell are presented to demonstrate the success of the theory. An order of magnitude reduction in the failure rate in obtained through the use of an unscented optimal control that steers a spacecraft testbed driven by control-moment gyros. I. I NTRODUCTION In [1], we introduced the concept of unscented optimal control. Thereafter, we provided a theoretical framework for the technique in terms of a Lebesgue-Stieltjes optimal control theory [2]–[5]. In simple terms, unscented optimal control combines the concept of the unscented transform of Julier et al [6]–[9] with standard deterministic optimal control [10], [11] to produce an unscented approach for controlling a con- ditionally deterministic dynamical system. Such a dynamical system can be parameterized by a differential equation, ˙ x = f (x, u,t; p) (1) where, p supp(p) R N p is an N p -dimensional uncer- tain parameter defined over a support supp(p) such that f : (x, u, t, p) 7R Nx is deterministic if p is known. Throughout this paper we assume all data functions are differentiable. Given any deterministic control trajectory t 7u, the evolution of a state trajectory t 7x governed by (1) is deterministically conditioned on the knowledge of p. Because p is unknown, we treat (1) as a deterministic selection of a controlled differential inclusion, ˙ x ∈F (x, u,t) := {f (x, u,t; p): p supp(p)} (2) Hence, for a given control trajectory, (1) or (2) emit set- valued states from a known initial state as illustrated in Fig. 1. Such dynamical systems have found widespread applications in aerospace engineering [1]–[3], [13], search theory [14]–[16], and quantum control [17], [18] to name I. M. Ross is a Professor and Program Director of Control and Optimiza- tion at the Naval Postgraduate School, Monterey, CA 93943 USA. M. Karpenko is a Research Associate Professor of Control and Opti- mization at the Naval Postgraduate School, Monterey, CA 93943 USA. [email protected] R. J. Proulx is a Research Professor of Control and Optimization at the Naval Postgraduate School, Monterey, CA 93943 USA. t initial state x evolutions Fig. 1. The evolution of a state trajectory is set-valued for (1) or (2) for any given control trajectory; figure adapted from Aubin et al [12]. a few. It has also been widely studied in the Russian literature [19] and has led to a minimax or robust optimal control theory as put forth separately by Vinter [20] and Boltyanski and Poznyak [21]. Because (1) is not a stochastic differential equation, that is, an Itˆ o differential equation with a diffusion term, it is called a tychastic differential equation, after Tyche, the ancient Greek goddess of chance and fortune [12], [22]. By using the unscented transform [6] to map the mean and covariance of p to a collection of sigma points, a tychastic optimal control problem maps to an unscented optimal control problem [2]. The unscented optimal control problem is deterministic but possibly large scale; hence, the spectral efficiencies of pseudospectral op- timal control techniques [23] are a good match to solve these problems. In both the simulations and the experimental implementations presented later in this paper, we used the MATLAB r toolkit DIDO c 1 which implements the spectral algorithm [24], [25] for solving optimal control problems. In general, an unscented optimal control solves the original tychastic problem approximately [3]. Hence, it can be viewed as solving an associated chance-constrained optimal control problem exactly. In this case, the risk level is determined a posteriori by a Monte Carlo simulation [3], [5]. These ideas have found new applications in aerospace engineering such as in extending the life of the Hubble space telescope [1] and providing safety margins for spacecraft proximity operations over distant asteroids [2]. In this paper, we further these emerging concepts by incorporating path constraints in the problem formulation. That is, in nearly all of the prior work, the state space was an open set. In many engineering applications it is critical to incorporate collision avoidance constraints, keep-out zones, and other physical and operational constraints. General deter- ministic path constraints impose restrictions on ( x(·), u(·) ) 1 See http://www.mathworks.com/products/connections/product detail/pro- duct 61633.html and http://www.ElissarGlobal.com

Path Constraints in Tychastic and Unscented Optimal ...the technique in terms of a Lebesgue-Stieltjes optimal control theory [2]–[5]. In simple terms, unscented optimal control combines

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Path Constraints in Tychastic and Unscented Optimal ...the technique in terms of a Lebesgue-Stieltjes optimal control theory [2]–[5]. In simple terms, unscented optimal control combines

Path Constraints in Tychastic and Unscented Optimal Control: Theory,Application and Experimental Results

I. Michael Ross, Mark Karpenko, and Ronald J. Proulx

Abstract— In recent papers, we have shown that a Lebesgue-Stieltjes optimal control theory forms the foundations for un-scented optimal control. In this paper, we further our results byincorporating uncertain mixed state-control constraints in theproblem formulation. We show that the integrated Hamiltonianminimization condition resembles a semi-infinite type math-ematical programming problem. The resulting computationaldifficulties are mitigated through the use of the unscentedtransform; however, the price of this approximation is a solutionto a chance-constrained optimal control problem whose risklevel is determined a posteriori. Experimental results conductedat Honeywell are presented to demonstrate the success of thetheory. An order of magnitude reduction in the failure rate inobtained through the use of an unscented optimal control thatsteers a spacecraft testbed driven by control-moment gyros.

I. INTRODUCTION

In [1], we introduced the concept of unscented optimalcontrol. Thereafter, we provided a theoretical framework forthe technique in terms of a Lebesgue-Stieltjes optimal controltheory [2]–[5]. In simple terms, unscented optimal controlcombines the concept of the unscented transform of Julier etal [6]–[9] with standard deterministic optimal control [10],[11] to produce an unscented approach for controlling a con-ditionally deterministic dynamical system. Such a dynamicalsystem can be parameterized by a differential equation,

x = f(x,u, t;p) (1)

where, p ∈ supp(p) ⊆ RNp is an Np-dimensional uncer-tain parameter defined over a support supp(p) such thatf : (x,u, t,p) 7→ RNx is deterministic if p is known.Throughout this paper we assume all data functions aredifferentiable. Given any deterministic control trajectory t 7→u, the evolution of a state trajectory t 7→ x governed by(1) is deterministically conditioned on the knowledge ofp. Because p is unknown, we treat (1) as a deterministicselection of a controlled differential inclusion,

x ∈ F(x,u, t) := {f(x,u, t;p) : p ∈ supp(p)} (2)

Hence, for a given control trajectory, (1) or (2) emit set-valued states from a known initial state as illustrated inFig. 1. Such dynamical systems have found widespreadapplications in aerospace engineering [1]–[3], [13], searchtheory [14]–[16], and quantum control [17], [18] to name

I. M. Ross is a Professor and Program Director of Control and Optimiza-tion at the Naval Postgraduate School, Monterey, CA 93943 USA.

M. Karpenko is a Research Associate Professor of Control and Opti-mization at the Naval Postgraduate School, Monterey, CA 93943 [email protected]

R. J. Proulx is a Research Professor of Control and Optimization at theNaval Postgraduate School, Monterey, CA 93943 USA.

t

initial

state

x

evolutions

Fig. 1. The evolution of a state trajectory is set-valued for (1) or (2) forany given control trajectory; figure adapted from Aubin et al [12].

a few. It has also been widely studied in the Russianliterature [19] and has led to a minimax or robust optimalcontrol theory as put forth separately by Vinter [20] andBoltyanski and Poznyak [21]. Because (1) is not a stochasticdifferential equation, that is, an Ito differential equationwith a diffusion term, it is called a tychastic differentialequation, after Tyche, the ancient Greek goddess of chanceand fortune [12], [22]. By using the unscented transform[6] to map the mean and covariance of p to a collectionof sigma points, a tychastic optimal control problem mapsto an unscented optimal control problem [2]. The unscentedoptimal control problem is deterministic but possibly largescale; hence, the spectral efficiencies of pseudospectral op-timal control techniques [23] are a good match to solvethese problems. In both the simulations and the experimentalimplementations presented later in this paper, we used theMATLABr toolkit DIDO c⃝1 which implements the spectralalgorithm [24], [25] for solving optimal control problems.In general, an unscented optimal control solves the originaltychastic problem approximately [3]. Hence, it can be viewedas solving an associated chance-constrained optimal controlproblem exactly. In this case, the risk level is determined aposteriori by a Monte Carlo simulation [3], [5]. These ideashave found new applications in aerospace engineering suchas in extending the life of the Hubble space telescope [1] andproviding safety margins for spacecraft proximity operationsover distant asteroids [2].

In this paper, we further these emerging concepts byincorporating path constraints in the problem formulation.That is, in nearly all of the prior work, the state space wasan open set. In many engineering applications it is critical toincorporate collision avoidance constraints, keep-out zones,and other physical and operational constraints. General deter-ministic path constraints impose restrictions on

(x(·),u(·)

)1See http://www.mathworks.com/products/connections/product detail/pro-

duct 61633.html and http://www.ElissarGlobal.com

SafeR
Typewritten Text
2016 American Control Conference July 6-8, 2016, Boston Marriott Copley Place, Boston, MA
Page 2: Path Constraints in Tychastic and Unscented Optimal ...the technique in terms of a Lebesgue-Stieltjes optimal control theory [2]–[5]. In simple terms, unscented optimal control combines

by means of functional constraints, h(x(t),u(t)) ≤ 0 ∀ t ∈[t0, tf ] where, h : RNx × RNu → RNh is a given function.In this paper, we incorporate a tychastic version of such pathconstraints and extend a minimum principle. In addition,we incorporate uncertainties in the initial state vector, x0.Uncertainty in the initial value of the state vector is par-ticularly important in interplanetary space operations wherethe initial conditions may be derived from an impreciselytargeted condition of a preceding phase in a mission plan.For instance, during the landing phase of a distant celestialbody (e.g., asteroid) the initial conditions are derived fromthe targeting conditions of a prior phase of injection.

II. DEVELOPMENT OF A PROBLEM FORMULATION

In this paper, we consider uncertainties in the initialstate. Uncertainties in the dynamical system are implicitlyincorporated in such a formulation through the addition of anintegrator [4]. For similar reasons we consider “autonomous”dynamical systems.

In many practical systems, the components of a statevector are not independent. For example, the variables of aquaternion that parameterizes the orientation of a spacecraftmust lie on S3. Hence, we need to consider condition-ally dependent uncertain variables. These requirements frompractical considerations can be articulated as

x0 ≡ p ∈ supp(p) and e0(x0) ≤ 0 (3)

where, e0 : RNx → RNe0 is a given function. Hence, weconsider uncertain initial states that satisfy the condition,

x0 ∈ E0 :={p ∈ RNx : e0(p) ≤ 0 ∀ p ∈ supp(p)

}(4)

Given a deterministic control trajectory t 7→ u ∈ U ⊆RNu and a deterministic dynamical system,

x = f(x,u) (5)

the evolution of x is uncertain with uncertain initial condi-tions; see Fig. 2. At any given time t, the cross-section of

t

uncert

ain

initia

l sta

tes

x

evolutions

Fig. 2. Set-valued evolutions of a state trajectory for uncertain initialconditions. Compare with Fig. 1.

the evolution is given by the set {x(t,p) : p ∈ supp(p)}where, t 7→ x(t,p) is a parameterized absolutely-continuoussolution to (5) (for a given control trajectory t 7→ u).

Suppose we are given a target set,{xf ∈ RNx : ef (xf ) ≤ 0

}(6)

where, ef : RNx → RNef is a given function. In atychastic optimal control problem, the objective is to find

a deterministic control trajectory that drives the initial set{x(t0,p) : p ∈ supp(p)} to the target set while minimizinga given cost functional,

J :(x(·, ·),u(·)

)7→ R (7)

In minimax or robust optimal control [20], [21], the costfunctional is given by

J [x(·, ·),u(·)] := maxp∈supp(p)

E(x(tf ,p),p

)(8)

where, E : RNx × RNx → R is a given function. Thecaveats in (8) are that supp(p) be compact and the initialcondition be deterministic. As indicated in [1]–[5], [11], inmany engineering applications there is a need to consideralternative cost criteria that include the important case ofminimum time problems. To this end, we consider Lebesgue-Stieltjes cost functionals given by

J [x(·, ·),u(·)] :=∫supp(p)

E(x(tf ,p),p

)dm(p) (9)

where, m : p 7→ R+ is a measure function given by thecumulative distribution function (CDF) of p. Furthermore,we allow the initial set and final set to be jointly dependentand hence consider general boundary conditions of the form,

e(x(tf ,p),p) ≤ 0 ∀ p ∈ supp(p) (10)

where e : RNx × RNx → RNe is a given data function.More importantly, we also consider mixed state-control pathconstraints given by

h(x(t,p),u(t)) ≤ 0 ∀ (t,p) ∈ [t0, tf ]× supp(p) (11)

where, h : RNx×RNu → RNh is a given function. Collectingall the relevant equations, we formulate a transcendentaltychastic optimal control problem as:

x ∈ RNx , u ∈ U ⊆ RNu , x(t0,p) ≡ p ∈ supp(p)

(LS∞)

Minimize J [x(·, ·),u(·)] :=∫supp(p)

E(x(tf ,p),p

)dm(p)

Subject to x(t,p) = f(x(t,p),u(t))

e(x(tf ,p),p) ≤ 0

h(x(t,p),u(t)) ≤ 0

where, barring the usual technicalities of zero measures, allconstraints are required to be satisfied for all t ∈ [t0, tf ] andall p ∈ supp(p). Note also that we allow abstract constraintson the control variable (u ∈ U in the preamble) in addition tothe path constraints as proposed by Dubovitskii and Milyutin[11], [26]. It will be apparent in Sections V and VI that allthese generalities are indeed essential for practical problemsolving.

The existence of a solution to Problem LS∞ is the mainproblem in tychastic optimal control theory [2], [11]. Froma practical perspective, it can be addressed to some extentthrough the use of unscented techniques as discussed in thenext section.

Page 3: Path Constraints in Tychastic and Unscented Optimal ...the technique in terms of a Lebesgue-Stieltjes optimal control theory [2]–[5]. In simple terms, unscented optimal control combines

III. UNSCENTED OPTIMAL CONTROL

In this section we briefly review unscented optimal con-trol [1]–[3] with the addition of a path constraint. Later, inSection IV, we will connect it to Problem LS∞ as its low-order semi-discrete version.

Let µx0 and Σx0 be the mean and covariance ofx(t0,p) ≡ p. Then it is quite straightforward [7], [9] todetermine a set of sigma points χ0

1,χ02, . . . ,χ

0Nσ

at t = t0.The dynamics of each sigma point, χi, i = 1, . . . , Nσ isgiven by χi = f(χi,u, t). Let X be an NσNx-dimensionalvector given by,

X :=(χ1,χ2, . . . ,χNσ

)∈ RNσNx

The dynamics of X are simply Nσ copies of f given by

X =(f(χ1,u, t), . . . ,f(χNσ

,u, t)):= F (X,u, t)

In unscented optimal control, we consider the controlleddynamical system X = F (X,u, t), whose initial state isgiven by the sigma points

X(t0) = X0 :=(χ0

1,χ02, . . .χ

0Nσ

)The objective is to find a control trajectory t 7→ u thatthat drives X0 to a target state while minimizing a costfunctional and satisfying the path constraints. Thus, anunscented optimal control problem can be framed as thefollowing deterministic optimal control problem:

X ∈ XNσ = RNσNx , u ∈ U ⊆ RNu

(U)

Minimize J [X(·),u(·)] := E(X(tf ),X(t0))

Subject to X(t) = F(X(t),u(t))e(X(tf ),X(t0) ≤ 0h (X(t),u(t)) ≤ 0

where, for the purposes of economy of notation, we havereused the symbols E, e, and h to mean the same conceptsas before but with different sets of inputs and outputs: E :XNσ × XNσ → R, e : XNσ × XNσ → RNe , and h : XNσ ×U → RNh .

Through the use of engineering principles, it is possibleto frame a sequence of unscented optimal control problemsthat numerically approach a solution to Problem LS∞ [1],[2]; however, the most that can be guaranteed from a math-ematical perspective is the satisfaction of the transcendentalconstraints in the mean and covariances (and possibly a fewmore higher-order moments depending upon the choice ofthe sigma points). In any case, once an unscented controlis obtained, a Monte Carlo simulation may be performed toestimate the risk levels,

re := 1− Pr {e(x(tf ,p),p) ≤ 0} (12)

rh := maxt∈[t0,tf ]

(1− Pr {h(x(t,p),u(t)) ≤ 0}

)(13)

where Pr {A} denotes the probability of event A. Thus,the unscented control solves the chance-constrained optimal

control problem,

x ∈ RNx , u ∈ RNu , x(t0,p) ≡ p ∈ supp(p)

(C)

Minimize J [x(·, ·),u(·)] :=∫supp(p)

E(x(tf ,p),p

)dm(p)

Subject to x(t,p) = f(x(t,p),u(t))

Pr {e(x(tf ,p),p) ≤ 0} ≥ 1− re

maxt∈[t0,tf ]

Pr {h(x(t,p),u(t)) ≤ 0}

≥ 1− rh

where re and rh are the risk levels associated with thetargeted condition and the path-constraint satisfaction respec-tively.

In the next section, we provide another perspective forProblem U as a low-order semi-discretization of the tran-scendental tychastic optimal control problem.

IV. THEORETICAL FOUNDATIONS

A minimum principle for the transcendental tychasticoptimal control problem can be obtained by an applicationof the generalized covector mapping principle [4], [11]. Inthis approach, the necessary conditions are derived by takingthe limits of a semi-discretization. To this end, considerany (convergent) cubature scheme that approximates theLebesgue-Stieltjes cost functional given in Problem LS∞.That is, let (pi, wi) i = 1, . . . , n be a collection of nodesand nonnegative weights such that∫

supp(p)

E(x(tf ,p),p

)dm(p) =

limn→∞

n∑i=1

wiE(x(tf ,pi),pi

)Then, for any finite n, we can semi-discretize Problem LS∞

as:

xi ∈ RNx , u ∈ U ⊆ RNu , xi(t0) ≡ pi ∈ supp(p)

i = 1, . . . , n

(LSn)

Minimize J [x1(·), . . .xn(·),u(·)] :=n∑

i=1

wiE(xi(tf ),pi

)Subject to x1(t) = f(x1(t),u(t))

...xn(t) = f(xn(t),u(t))

e(x1(tf ),p1) ≤ 0

...e(xn(tf ),pn) ≤ 0

h(x1(t),u(t)) ≤ 0

...h(xn(t),u(t)) ≤ 0

Page 4: Path Constraints in Tychastic and Unscented Optimal ...the technique in terms of a Lebesgue-Stieltjes optimal control theory [2]–[5]. In simple terms, unscented optimal control combines

The consistency of a version of Problem LSn with respectto Problem LS∞ is proved in [16]. Motivated by the theo-retical components of our prior work [2]–[5], we write theHamiltonian for Problem LSn as

I(λ1, . . . , λn,x1, . . . ,xn,u) :=n∑

i=1

wi H

(λi

wi,xi,u

)(14)

where, λi ∈ RNx , i = 1, . . . , n are the adjoint covectors, andH is the Pontryagin Hamiltonian function

H(λ,x,u) := λTf(x,u) (15)

Taking the limit of (14) we define the integrated Hamiltonianfunctional as

I(λ(t, ·),x(t, ·),u) :=∫supp(p)

H(λ(t,p),x(t,p),u) dm(p) (16)

where, t 7→ λ(t,p) ∈ RNx is a covector function for each pthat satisfies the adjoint differential equation

−λ(t,p) = ∂xH(µ(t,p),λ(t,p),x,u(t)

)∣∣∣x=x(t,p)

(17)

H is the Lagrangian of the Hamiltonian [11] defined as

H(µ,λ,x,u) := H(λ,x,u) + µTh(x,u) (18)

and t 7→ µ(t,p) ∈ RNh is the path covector function associ-ated with (11) that satisfies the complementarity condition,

0 ≤ µ(t,p) ⊥ h(x(t,p),u(t)) ≤ 0 ∀ p ∈ supp(p) (19)

Thus, the optimal control satisfies the the integrated Hamil-tonian minimization condition (iHMC)

(iHMC)

Minu(t)

I[λ(t, ·),x(t, ·),u(t)] :=∫supp(p)

H(λ(t,p),x(t,p),u(t)) dm(p)

s. t. h(x(t,p),u(t)) ≤ 0 ∀ p ∈ supp(p)

u(t) ∈ U

Although it resembles one, Problem iHMC is not exactlya classic semi-infinite programming problem; however, it isindeed an instantaneous transcendental (static) optimizationproblem considered in [27]. Consequently, we may use therecent results of unscented programming [27] to analyzeProblem iHMC.

Following the same procedure, it follows that the transver-sality condition is given by

λ(tf ,p) =∂E(ν,xf ,p)

∂xf

∣∣∣∣(ν(p),x(tf ,p),p)

(20)

where, E : RNe × RNx × RNx → R is the endpointLagrangian

E(ν,xf ,p) := E(xf ,p) + νTe(xf ,p)

and ν : p 7→ RNe in (20) is an endpoint covector functionthat is complementary to e in an analogous manner as (19).

When path constraints are added to even a deterministicoptimal control problem, it requires an entirely new set ofmathematical machinery to accommodate for atomic mea-sures that naturally enter in the construction of the multiplierrule [10], [11]. It is therefore not surprising that these issuescarry over to tychastic optimal control problems as well. Forthe purposes of brevity, we omit these details while notingthat the adjoint covector function t 7→ λ(t,p) may exhibitjumps. These jump conditions can be directly connected tothe properties of the path covector function t 7→ µ(t,p) asdiscussed in [11].

V. A SPACE APPLICATION PROBLEM

The use of satellite imagery is ubiquitous: from an every-day map application in a smart phone to providing timelyagricultural soil information to a farmer. In order to meetthe increasing demands, a satellite is required to point andre-point rapidly and accurately [28]. The dynamics of anagile spacecraft are given by [29],

q = 12Q(ω)q

ω = I−1(− ω × I · ω − ω × hc(δ)−A(δ)u

)δ = u

where, q ∈ R4 is a quaternion that parameterizes its attitudein inertial space, ω ∈ R3 is the body rate, δ is an Nc-vectorof gimbal angles associated with the onboard control momentgyros (CMGs), I is the inertia matrix of the spacecraft, hc(δ)is the angular momentum of the CMG configuration, andA(δ) is a 3×Nc effector matrix associated with the controlvector u ∈ U ⊂ RNc of gimbal rates.

Any time a spacecraft is not collecting imagery is con-sidered wasted time and lost revenue; hence, our primaryobjective is to minimize the time to maneuver between twocollects. This commercial objective can be mathematicallytranslated to a canonical optimal control problem as:

Jcom[x(·),u(·), tf ] := tf

q(0) = q0, q(tf ) = qf

ω(0) = ω0, ω(tf ) = ωf

δ(0) = δ0, δ(tf ) = δf

It is well known [29] that when this problem is solved, itmay generate gimbal trajectories t 7→ δ that render A(δ(t))singular at some t ∈ [t0, tf ]. This essentially makes thespacecraft uncontrollable; hence, to avoid this singularity weimpose a path constraint,

t 7→ S(δ) :=√det[A(δ)AT (δ)

]≥ Smargin∀ t ∈ [t0, tf ]

where Smargin > 0 is an engineering decision. For a boxconfiguration of four CMGs [30], [31], A(δ) is given by

A(δ) :=

0 sin(δ2) 0 − sin(δ4)cos(δ1) cos(δ2) cos(δ3) cos(δ4)sin(δ1) 0 − sin(δ3) 0

Page 5: Path Constraints in Tychastic and Unscented Optimal ...the technique in terms of a Lebesgue-Stieltjes optimal control theory [2]–[5]. In simple terms, unscented optimal control combines

A plot of the path constraint function t 7→ S(δ(t)) for thisdeterministic optimal control problem is shown in Fig. 3.For the purposes of brevity, only key plots are presented.

0 5 10 15 200

0.5

1

1.5

time (sec)

S

Smargin

Fig. 3. A minimum-time path-constrained control solution to the spacecraftattitude steering problem indicates the satisfaction of the safety constraint.

All results were generated using DIDO c⃝ [11]. The DIDOresults were verified and validated using standard proceduresdiscussed in detail in [11].

In a practical implementation, q and ω at the boundarypoints are dictated from the precise requirements for satelliteimagery. Although there are no targeting requirements on δ,its initial value is determined by the final value of the gimbalposition from the previously imaged condition. This quantityis not known precisely because it is implemented as an innerloop in a feedback control system. By setting δ0 = p asan uncertain parameter, we can numerically test the optimalslew profile for robustness in its singularity avoidance. Weassume δ0 to be uncorrelated Gaussian with standard devi-ation σ = 10o. A Monte Carlo simulation shows that (seeFig. 4) the path constraints S(δ(t)) ≥ Smargin is violated

0 5 10 15 200

0.5

1

1.5

2

time (sec)

S(δ

)

Smargin

Fig. 4. Monte Carlo simulation results for deterministic control indicatesa violation of the path constraint that cross the safety margin that includesthe failure zone of S(δ(t)) = 0.

quite often. Furthermore, this violation is quite severe inmany cases as evident by the many samples in Fig. 4 whereS(δ(t)) is zero. In the absence of an experimental test, wedo not know how many of our assumptions are valid; hence,we now present these results.

VI. EXPERIMENTAL RESULTS

To support the increasing business of agile spacecraft,Honeywell has set up a state-of-the-art test facility inPhoenix, AZ; see Fig. 5. Through a partnership program with

MMCS

flight

computer

batteries

air

bearing

IMU

Fig. 5. Honeywell’s ground test facility where several experiments onunscented optimal control were conducted in an operationally-relevantenvironment.

the U.S. Air Force Research Laboratory we conducted a largenumber of tests on various control schemes and architecturesover an operationally relevant scenario. We use the resultsfrom one of these tests to illustrate the theory.

An experimental implementation of the deterministic con-trol shown in Fig. 3 resulted in a failure. This is not entirelysurprising from the perspective of the pre-test Monte Carlosimulations of Fig. 4.

We now develop a path-constrained tychastic optimalproblem by imposing the transcendental constraint,

S(δ(t, δ0)

)≥ Smargin ∀ (t, δ0) ∈ [t0, tf ]× supp(δ0)

while continuing to demand minimum-time performance,

J [x(·, ·),u(·), tf ] :=∫supp(δ0)

tf dm(δ0) = tf (21)

Equation (21) shows that standard cost functionals are nat-urally allowed in the tychastic formulation. The unscentedimplementation (transcendental form) is given by

S(δ(t, δ01)

)≥ Smargin

... ∀ t ∈ [t0, tf ]

S(δ(t, δ0Nσ

))≥ Smargin

where δ01, . . . δ0Nσ

are the sigma points of δ0. A Monte Carlosimulation (see Fig. 6) shows that the performance of theunscented optimal control is drastically different from that ofthe deterministic optimal control. That is, the path constraintsare not violated for the majority of the samples and evenwhen they cross the safety margin, the values of S(δ(t))remain above the failure zone (zero).

An experimental implementation of the unscented controlwas successful over all trials conducted. These results areshown in Fig. 7. Note in particular that one of the exper-imental trials is even outside of the Monte Carlo tube ofFig. 6.

VII. CONCLUSIONS

When uncertainty is managed in a practical implementa-tion through the use of feedback control, standard optimal

Page 6: Path Constraints in Tychastic and Unscented Optimal ...the technique in terms of a Lebesgue-Stieltjes optimal control theory [2]–[5]. In simple terms, unscented optimal control combines

0 5 10 15 200

0.5

1

1.5

2

time (sec)

S(δ

)

Smargin

Fig. 6. Monte Carlo results for the unscented control shows a significantimprovement in safety. Compare to Fig. 4.

0 5 10 15 200

0.5

1

1.5

2

time (sec)

S(δ

)

Smargin

Fig. 7. Experimental results of the unscented optimal control conductedat Honeywell showed that all trials were successful.

controls may not be desirable because they might put thesystem in uncontrollable regions. By explicitly incorporatingsystem uncertainties in an optimal control framework wearrive at a tychastic problem formulation. The computationalchallenges in solving a tychastic optimal control problemare immense; however, the unscented transform offers asimple way out through its use of sigma points. The pricefor this simplicity is a solution to a chance-constrainedoptimal control problem whose risk is determined a posteriorithrough the use of a Monte Carlo simulation. Successfulexperimental results validate the need for an entirely newarea of research where a large number of open problems intheory and computation need to be addressed.

ACKNOWLEDGMENTS

We thank the U.S. Navy’s Judge Advocate General’soffice for securing US8,880,246 B1 and their strong supportin filing our applications for US patents 14/081,921 and61/985,917.

REFERENCES

[1] Ross, I. M., Proulx, R. J., and Karpenko, M., “Unscented OptimalControl For Space Flight,” 24th International Symp. on Space FlightDynamics (ISSFD), Laurel, MD, May 5-9, 2014.

[2] Ross, I. M., Proulx, R. J., and Karpenko, M., “Unscented OptimalControl for Orbital and Proximity Operations in an Uncertain Envi-ronment: A New Zermelo Problem,” AIAA Space and Astro. Forumand Expo.: AIAA/AAS Astrodynamics Specialist Conf., San Diego, CA4-7 August 2014.

[3] Ross, I. M., Proulx, R. J., and Karpenko, M. “Unscented Guidance,”Proceedings of the ACC, Chicago, IL, July 1–3, 2015.

[4] Ross, I. M., Proulx, R. J., Karpenko M. and Gong, Q., “Riemann-Stieltjes Optimal Control Problems for Uncertain Dynamical Systems,”J. Guid., Control and Dynamics, Vol. 38, No. 7, 2015, pp. 1251–1263.

[5] Ross, I. M., Karpenko, M. and Proulx, R. J., “A Lebesgue-StieltjesFramework For Optimal Control and Allocation,” Proceedings of theACC, Chicago, IL, July 1–3, 2015.

[6] Julier, S. J. and Uhlmann, J. K., “Unscented Filtering and NonlinearEstimation,” Proc. of the IEEE , Vol. 92, No. 3, 2004, pp. 401–422.

[7] Julier, S. J. “The Spherical Simplex Unscented Transformation,” Proc.of the ACC, Denver, CO, June 4-6, 2003, Vol. 3, pp. 2430–2434.

[8] Julier, S. J. and Uhlmann, J. K., “Reduced Sigma Point Filtersfor the Propagation of Means and Covriances through NonlinearTransformations,” Proceedings of the ACC, Vol. 2, 2002, pp. 887–892.

[9] Julier, S. J., Uhlmann, J. K., and Durrant-Whyte, H. F., “A NewApproach for Filtering Nonlinear Systems,” Proceedings of the ACC,Seattle, WA, 1995, pp. 1628–1632.

[10] Vinter, R. B., Optimal Control, Birkhauser, Boston, MA, 2000.[11] Ross, I. M. A Primer on Pontryagin’s Principle in Optimal Control,

Second Edition, Collegiate Publishers, San Francisco, CA, 2015.[12] Aubin, J.-P., Bayen, A. M. and Saint-Pierre, P., Viability Theory: New

Directions, Second Edition, Springer-Verlag Berlin Heidelberg, 2011.[13] Shaffer, R., Karpenko, M. and Gong, Q., “Unscented Guidance for

Waypoint Navigation of a Fixed-Wing UAV,” Proceedings of theAmerican Control Conference, Boston, MA, July 6–8, 2016.

[14] Koopman, B. O., “The Theory of Search, I: Target Detection,” Oper-ations Research, Vol. 4, No. 5, 1956, pp. 503–531.

[15] Stone, L. D. and Richardson, H. R., “Search for Targets with Condi-tionally Deterministic Motion,” SIAM Journal of Applied Mathematics,Vol. 27, No. 2, September 1974, pp. 239–255.

[16] Phelps, C., Royset, J. O., and Gong, Q., “Sample Average Approx-imations in Optimal Control of Uncertain Systems,” Proceedings ofthe 52nd IEEE CDC, Florence, Italy, 2013.

[17] Li, J.-S. and Khaneja, N., “Control of Inhomogeneous QuantumEnsembles,” Physical Review A, Vol. 73, No. 030302(R), 2006,pp. 030302-1–030303-4.

[18] Li, Jr-S., Ruths, J., Yu, T-Y, Arthanari H., and Wagner, G., “OptimalPulse Design in Quantum Control: A Unified Computational Method,”Proceedings of the NAS, Vol. 108, No. 5, Feb 2011, pp. 1879-1884.

[19] Kurzhanski, A. B. (Ed), Advances in Nonlinear Dynamics and Control:A Report from Russia, Birkhauser, Boston, 1993.

[20] Vinter, R. B., “Minimax Optimal Control,” SIAM Journal of Controland Optimization, Vol. 44, No. 3, 2005, pp. 939-968.

[21] Boltyanski, V. G. and Poznyak, A. S., The Robust Maximum Principle:Theory and Applications, Birkhauser, New York, N.Y., 2012.

[22] Aubin, J.-P., Chen, L., Dordan, O., Faleh, A., Lezan, G. and Planchet,F., “Stochastic and Tychastic Approaches to Guaranteed ALM Prob-lem,” Bulletin Francais d’Actuariat, Vol. 12, No. 23, 2012, pp. 59-95.

[23] Ross, I. M. and Karpenko, M. A Review of Pseudospectral OptimalControl: From Theory to Flight, Annual Reviews in Control, Vol. 36,No. 2, pp. 182–197, 2012.

[24] Ross, I. M. and Fahroo, F., “Pseudospectral Knotting Methods forSolving Optimal Control Problems,” Journal of Guidance, Control andDynamics, Vol. 27, No. 3, pp.397-405, 2004.

[25] Gong, Q., Fahroo, F., and Ross, I. M. “A Spectral Algorithm forPseudospectral Methods in Optimal Control,” Journal of Guidance,Control and Dynamics, Vol. 31, No. 3, pp. 460–471, 2008.

[26] Dmitruk, A. V., “On the Development of Pontryagin’s MaximumPrinciple in the Works of A. Ya. Dubovitskii and A. A. Milyutin,”Control and Cybernetics, Vol. 38, No. 4A, 2009, pp. 923–957.

[27] Ross, I. M., Proulx, R. J. and Karpenko, M., “Unscented Opti-mization,” AAS/AIAA Astrodynamics Specialist Conference, Vail, CO,August 9–13, 2015. Paper AAS 015-607.

[28] Ross, I. M., Proulx, R. J., Greenslade, J. M. and Karpenko, M.,“Dynamic Optimization for Satellite Image Collection,” AAS/AIAASpace Flight Mechanics Conference, Napa, CA, February 14–18, 2016.Paper AAS 16-260.

[29] Wie, B., Space Vehicle Dynamics and Control, AIAA, Reston, VA,1998.

[30] McMickell, B., Buchele, P., Gisler, G., and Andrus, J., “ControlMoment Gyroscope based Momentum Control Systems in SmallSatellites,” U.S. Patent US8312782 B2, issued Nov. 20, 2012.

[31] Wise, K. et al., “Guidance, Navigation and Control Applications in theAerospace Industry: Current Problems and Modern Solutions,” 51stIEEE Conference on Decision and Control, Maui, HI, Dec. 2012.