APPENDIX • PROOFS - Springer978-3-540-46196-8/1.pdf · Chapter II 219 hence L+ and L_ have full row rank. As L_ has full row rank, R (t) consists of the rows of R of order t, and

A P P E N D I X • P R O O F S

C H A P T E R II

P r o o f o f t h e o r e m 3-5

It easily follows that f](AR)cB. Now let BeB, then according to proposition 3-3

there is a Zl_>0 such that {weB} ,~ {Wl{t,t+A]el3[[O,Zl ] for all te][}. Let (Rq) A+I

be equipped with the Euclidean inner product and let L be a matrix the rows of

which span (B[io,,a]) -L. Then {weB}-e~{wl[t,t+zaleker(L) for all te2}. This shows

6eB(AR). •

P r o o f o f p r o p o s i t i o n 8-6

In the proof we make use of two facts about polynomial matrices.

First, every submodule of Nl×q[s,s-i] is finitely generated, i.e., if

McRl×q[s,s -1] is linear and sM=M, then there exist gEN and rieRl×q[s,s-1], i=l,...,g, such that with R:=col(rl,...,rg) there holds M=M(R). This can easily

be derived e.g. from Northcott [56, proposition 1.9 and theorem 1.10].

Second, every ReRg×q[s,s -1] can be decomposed as R=UDV with UeRg×g[s,s -1] [z~ o] with A=diag(dl,...,dn) , and veRq×q[s,s -1] both unimodular and D= o o

dieR[s,s-1], i=l,...,n, where n=rank(R) and d i divides di+l, i=l , . . . ,n-1. D is

called the Smith form of R. We refer to e.g. Kailath [33, section 6.3.3].

(i) First, if BeB, then B -l- is a module, Bl-c•l×q[s,s-1], hence it is finitely

generated, so B-I-ell -1-.

Second, if B=B(R), then B-I-=M(R). This is seen as follows. If B=B(R), then M(R)cl3 "l- is evident. On the other hand, let feb J-, then we have to prove

that reM(R). Let R=UDV with U and V unimodular and D the Smittl form of R,

D=[~:], A=diag(dl,...,dn). As f e b -k, there holds {weker(R(a,a-1))}~ k v v ) N - - I N N {=eker(r(a,a-1))}. Let ~:=V~ and r:=rV =(rl,...,rq), ~ieR[s,s-1], i=l,...,q.

Then by using the fact that U and V are unimodular it follows that

{~eker(D(a,a-1))} ~ {~eker(F(a,a-1))}. For p,q~R[s,s -1] there holds

218 Appendix

{ker(p(a,a-1))cker(q(a,a-1))}.~{3cceR[s,s -1] such that q=c~p}. It hence follows

that ~i=0 for i=n+l,...,q and that there exist c~ie~[s,s -1] such that ~i=cqdi, i=l,...,n. Hence r=(cq,...,c%,0,...,0)U -l-R, and as U is unimodular reM(R).

Third, if for BeB there holds l~±=N(R), then B=G(R), which is seen as

follows. As BoB, according to theorem 3-5 there exists a polynomial matrix R*

such that 13=B(R*). It has just been shown that then B3-=M(R*), hence

M(R)=M(R*). This implies {R*(a,a -1) w=O} ~ {R(a,a -1) w=O}. Hence B(R*)=I3(R). Finally we show that f:B * B3-:B-" B j- is a bijection onto ~3_. Let .~le~3_, so

3I is finitely generated, say M=N(R), then with B:=B(R)eS there holds 133-=I~1(R), so f is surjective. Injectivity follows from the fact that if B1,B2~ , then

(ii) If dim(B-l-)=p, then there exist p elements rl,...,r p in Rl×q[s,s-1] such

that with R:=col(rl,...,rv) there holds B±=M(R). Moreover, R has full row

rank p over the polynomials. According t6 (i) B=B(R). Now suppose R also has p

rows and B=B(R). As dim(B3-)=p, £) has full row rank. According to (i)

B3_=M(R)=M(R). This implies that there exist F, ['~Rv×P[s,s -1] such that R=FR and I~=IYR. Hence (I-FYF) R=O=(I-YFF)R, and as R and ,~ have full row rank F~'=FF=I. So /~=FR with F unimodular. •


D D 3_ 3- _L l Let R~A, •:=B(R). Define Vt:=vt(Lt)=[vt(13t_l+sBt_l) ] n[vt(Bz)] , t_>0. Then

clearly {vD; t_>0} forms a set of complementary spaces for ~. Let {v~0;

i=l,...,nt} be an arbi t rary basis of V D, t>_0, and d:=max{t;nt#O }. Define ( t) -1 . (~). r i =v t (v i ), i=l,...,n,, t=O,...,d, and let R be a matrix with rows r~ t),

i=l,...,nt, t=0,...,d. According to proposition 3-10 R is a tightest equation

representation of B. Moreover, in R laws of different order are evidently

orthogonM. Hence R is in (CDF) and R~R. •

Proof of proposit ion 3-13

First suppose that R is in (CDF). Clearly L+ is the leading coefficient matrix D 3_ 3_ 3_

of R, and as vt(Lt)cvt(13t)o[vt(s~t_l) ] it follows that L_ is the trailing

coefficient matrix of R. Let B:=B(R), then R is a tightest equation

representation of B. According to proposition 3-8 R is bilaterally row proper,

C h a p t e r II 2 1 9

hence L+ and L_ have full row rank. As L_ has full row rank , R (t) consis ts o f

the rows of R of o r d e r t, and as R is in (CDF) the rows of R (t) a re con ta ined D D D ±

in L t and those o f N t in Vd(LDt). As Lt_l+sLt_icI3t i t fol lows b y induct ion t ha t D 3_ ±

the rows of 17" t a r e con ta ined in Vd(S%). As va(Lt)_l_va(St_x+s6t_l) we conclude

tha t Nt±col(Vt_l,sP't_,) fo r al l t=l,...,d.

Next suppose t ha t L+ and L_ have ful l row r ank and t ha t Nt.J_col(P't_l,

sVt_l) , t=l,...,d. Then R is b i l a t e r a l l y row p rope r , hence it is a t i gh te s t

equa t ion r e p r e s e n t a t i o n of S. I t remains to show tha t laws o f o r d e r t a r e

con ta ined in L D. As L_ has full row rank , R (t) consis ts of the rows of R of

o r d e r t, and accord ing to p ropos i t i on 3-10 the number n t of rows o f R (t)

equals the dimension of vd(LDt). We now show by induct ion fo r t=O,...,d t ha t

the rows of 17" t span va(B~t) and tha t the rows of N t a r e con ta ined i n Vd(LDt).

Then the rows of R (t) a r e con ta ined in L D, as desired. ..k D

Now Vo=No consis ts of n o independent e lements in Vd(]3 0 )=Vd(LO) , hence the ±

rows span Vd(LDo). Next suppose t ha t the rows of l?t_ 1 span va(/3t_x) , then the ± .L

rows of col(Vt_l,S~Zt_l) span Vd(Bt_l+SBt_l). As Nt±col(~'t_,,s~"t_i) , the .1. ± ±

rows of Nt, which a re con ta ined in Vd(St), a re o r t h o g o n a l to va(St_l+sSt_l) ,

so t hey a r e con ta ined in va(LD). Fur ther , as L_ and L+ have full row rank N t • D

conta ins nt i ndependen t rows, and nt=dlm(Vd(Lt) ). l lence the rows of N t span ± A_ _L D

Vd(LDtt). AS 13t=13t_l+Sl3t_l+L t it fol lows t ha t ~'t:=col(P't_l, SVt_l, Nt) spans ±

Vd(S t ), which concludes the induct ion p a r t o f the p roof . •

P r o o f o f p r o p o s i t i o n 3 -14

Let B=I3(R) with R in (CDF) and the rows o rde r e d with increas ing degree . If R'

is in (CDF) with B(R')=B, then le t / / be such tha t in R"=IIR' t i le rows a re

o r d e r e d with increas ing degree . For t=O,...,d le t R (t) and R ''(t) deno te the

mat r ices cons is t ing of the rows of o rde r t in R and R" r e spec t ive ly , i.e., o f , w t - 1 . ~ t the rows t.~r=lnr)+t,...,Lr=lnr. Then vd(R (t)) and Vd(R"(t)) bo th consis t of n t

D independent e lements in va(Lt) , which has dimension nt, see p ropos i t i on 3-10.

ntxn t t tence t h e r e exis ts a nons ingula r ma t r i x Attar such t ha t vd(R"(t))=

Attva(R(t) ) and hence R"(t)=AttR (t). So R'=IIAR, A:=diag(Aoo,...,Add ).

On the o t h e r hand, if R is in (CDF) with rows o r d e r e d with increas ing

degree , then c l ea r ly AR also is in (CDF) with A=diag(Aoo,...,Aaa), Att

nonsingular , t=0 , . . . ,d , as the rows o f o r d e r t s t i l l span L D. Also lIAR is in

2 2 0 A p p e n d i x

(CI)F). As HA is invertible B(IIAR)=B(R). •

P r o o f o f p r o p o s i t i o n 3 - 1 6

P P t ± ± 2- Let ReA, S:=S(R), and define Vt:=vt(Lt)=[vt(Ft_is )+vt(Bt_1) ] n[vt(/~t) ]. We

claim that {VtP; t_>0} gives a set of complementary spaces for S. AssiLming this

to hold true, choose arbi t rary bases {v~t); i=l,...,nt} of V P, T~t):='/);l(v~t)), and let ~' have rows {r~t); i=l,...,nt, t=O,...,d}, where d:=max{t; nt¢O}. Then

according to proposition 3-10 R is a tightest equation representat ion of B,

which evidently has its rows in L B. Hence R is in (CPF) and R~/~, as desired.

To prove that {V~; t_>0} is a set of complementary spaces we have to prove P ± ± Vt+vdS t-l+SB t-D=vt( B t ). that (i) Vtnvt(Bt_l+SBt_l)={O } and (ii) P ± ± .L

P 2- 2- Concerning (i), let veVtnv t ( s t-l+sl3 t-1), say v=[ro,...,rt] , tiering,

t k 2_ .L i=O,...,t. Let r:=Zk=orkS. As ?~E]3t_l+£13t_ 1 it follows that Tt6Ft_l, and as ±

vt(r)eV P it follows that rt±Ft_,, hence rt=O. Then reSt_l , but vt(r)±vt(S¢_l) ,

hence r=O, so v=O. P .1. 2_ 2_

Concerning (ii), note that Vt+vt(13t_i+sBt_l))CVt(13t) is trivial. Now let

reBUt, then we have to show that there exist r('),r(2)~B~t_l and v ~ V P such that t k vt(r)=vt(r(1)+sr(2))+v. Let r=Zk=orkS , rt=~'t+rt, rtlFt-1, ~teFt-1. Let

2_ , t ' , k r '~t3t_l be such that r =Zk=orkS , rt,--rt,' --= where t'<t-1._ Define e(2):=

8t-t'-l.r ', then r(2)e/~t_l . Next d e f i n e ~'":=r-s'r(2)-~'t St, then d(r")<_t-1. Now , 2_

let "vt_l(r")=Vt_l(r(1))+v' where r(1)~B~_l and v±vt_,(Bt_l). Let v:=[v',Pt].

Then vtl(v)=r"-r(1)+~'tst=r-sr(2)-r(1)etg~t, as this is a linear space. t .1_ 2_ _L P

SO v ~ [ v t ( F t - 1 s )+vt(B t-l)] nvt(13 t )=Vt. Moreover, vt(r)=vt(r"+sr(2)+~tst)= vt(r(O+sr(2))+vevt(131t_l+ 2_ p sSt_~)+Vt, as desired. •


First assume that R is in (CPF). Again L+ clearly is the leading coefficient P ± t ±

matrix of R, and as vt(Lt)evt(Bt)n[vt(Ft_zs )] it follows that L_ is the

trailing coefficient matrix of R. Let B:=B(R). Then R is a tightest equation

representation of /~, hence L+ and L_ have full row rank. Moreover, R (t)

consists of the rows of R of order t. As R is in (CPF), it follows that p(t ) . ,~(8) vt(R(t))±vt(Ft_lst), hence especially , - t - K s for s<t, t=0,...,d. Moreover,

it easily follows by induction that the rows of 17" t are contained i n Vd(•lt). AS

Chap te r II 221

P 2_ the rows of N t are contained in Vd(Lt).LVd(~t_I) we conclude that Nt.l.Vt_l, t=l,...,d.

Next suppose that conditions (i), (ii) and (iii) of proposit ion 3-17 are

satisfied. We have to prove that the corresponding R is in (CPF). Condition

(i) implies that R is a t ightest equation representat ion of B, see proposit ion

3-8. Hence the number n t of rows of R (t) equals the dimension of va(LP), see

proposit ion 3-10. We now show by induction for t=O,...,d that the rows of 12 t

span Vd(13&t) and that the rows of Nt are contained in vd(Let). Then the rows of

R (t) are contained in Let, as desired, and R is in (CPF). .l_ P Vo=No consists of no independent elements in Vd(So)=Vd(Lo) , hence the

P .L rows span Vd(Lo). Next suppose that the rows of Ft-I span Vd(13t_l). Condition

.L (ii) implies that Nt±vd(Ft_xSt), condition (iii) that Nt.l_Vd(Bt_t). As the rows

of Nt evidently are contained in Vd(8+) if follows that they are contained in

va(Let). Moreover, the rows of Nt are independent, due to (i), and their number i i i P

equals dim(va(LP)), hence they span vd(LPt). As 13t=13t_l+Sl3t_lTL t it follows _L

tha t Yt:=col(~'t_x,S~'t_x, Nt) spans va(Bt), which concludes the induction par t

of the proof. •


In the proof we make use of a result from abst rac t realization theory, given

in the next lemma. For a proof we refer to Willems [74, sections 2.2.4, 2.4.3,

4.7.4 and 4.7.5]. For ze(Ra) z we use the notat ion z--:=z](_oo,_l], z-:=zI(_oo, o ], ++ Z Z+:=Z[[o,¢o)~ Z : = [1,00).

Lemma 3-22 Let BeB have realization Bs=I3s(A,B,C,D ) :={(v,x,w)e (Rmx Rnx Rq)Z; [ a : ] =[ADBI [:] }. Then the dimension n of the s tate space

is minimal among all realizations of B if and only if the next three

conditions are satisfied:

(1) the s tate is trim, i.e., for all x0eR n there exist (v,x,w)eB~ with

x(0)=x0;

(2) the state is past induced, i.e., {(v,x,w)e13s, w--=0} ~ {x(0)=0};

(3) the s ta te is future induced, i.e., {(v,x,w)~Bs, w+=0} ~ {x(0)=0}.

(i) Let Bd3 be given, and let Bs(A,B,C,D ) be a realization of 8 with n and m

222 Appendix

the number of state variables and driving variables respectively. Let n ' and m*

denote the smallest achievable n and m respectively. We have to show that

these minima can be achieved simultaneously.

First we derive a lower bound for ra*. For T>_0 def ine 13T(O):={a~(~)T+I;

~W~B such that w--=O, Wl[O,Tl=a } and dT:=dim(13T(O)). NOW linearity and time

invariance of B imply that dT>do(T+l). On the other hand, a realization

13s(A,B,C,D ) of /3 implies dT<_n+m(T+l ). Hence m*>d o.

Next let I3~(A,B,C,D) be a realization with n=n*. If suffices to prove

that we can reduce ra to do, as then re=m*. If [~] has column rank m'<m, then t . - - J

clearly there exists of realization of B with n=n* and m=m'. Hence we may

M J

seen as follows. Let v0eker(D), (v,x,w)EBs(A,B,C,D), w--=O, v(O)--vo, then

lemma 3-22(2) implies that w(O)=O and that O=x(O)=x(1)=Bv(O), hence vo~kcr(B). f "x

So ker(D)cker(B), which due to injectivity of I~1 implies that D is injective. % - j

By taking (v,x,w)--=O it follows from lemma 3-22(2) that im(D)=Bo(O), hence

rn=d o. So both m and n are minimal.

(ii) First suppose that (A,B,C,D) is perfectly observable, that (A B) is

surjective and that D is injective. We have to show that then n and m

both are minimal.

Concerning the minimality of n, according to the lemma it suffices to

check the conditions (1), (2) and (3) stated there. Perfect observability

inwoediately implies (3). Condition (1) is implied by surjectivity of (A B).

Indeed, let xoeR ~, then as (A B) is surjective there exists (v,x)- such that

x(t+l)=Ax(t)+Bv(t), t<_-l, with x(O)=x o. Take v ++ arbi t rary and x(t+l)=

Ax(t)+Bv(t) for t_>0. Let w:=Cx+Dv. Then (v,x,w)eB 8 and x(O)=Xo. Finally we

check condition (2). Let (v,x,w)el3~ and w--=O. Perfect observability implies

x](_0~,_n]=0. For k=O,...,n-1 one easily proves inductively that, due to

injectivity of D, there holds v(-n+k)=O, x(-n+k+l)=O. Hence x(O)=O, which

verifies (2).

Now the minimality of m follows from the proof of (i), where it was shown

that re=m* in case n=n* and IDB~ is injective, so especially when n=n* and D is M J

injective.

Next we consider the implication {n=n*,m=m*}=> {(A,B,C,D) is perfectly

observable, (A B) is surjective, D injective}.

Suppose (A B) is not surjective. Then there exists a nonsingular SeR '*×n

Chapter II 223

such that S(A B)=[A'B0']. Let Sx:=rX']LX~J be an according partition, then in

Bs(A,B,C,D ) we have x2=0. This contradic ts condit ion (1) of the lemma.

Further, re=m* and hence [B] is injective. In the p roof of (i) we have shown

tha t fo r n=n* the injectivity L~'JOf [BD] implies injectivity o f D.

Finally we have to show tha t {(v,x,w)~/3~, w[[0,n_l]=0} ~ {x(0)=0}. Now

let (v,x,w)eB s with wl[0,,_l]=0. As D is injective there exists a GcR n×q such

that B=GD. Then for O<t<n-1 we have O=Cx(t)+Dv(t) and x(t+l)=Ax(t)+Bv(t)= (A-GC)x(t)=(A-GC)txo. So C(A-GC)txo~im(D) for O<t<n-1. This implies tha t

C(A-GC)txoeim(D) fo r all t>0. Hence by choosing v I In,=) appropr ia te ly we can

const ruct (~,2,£z)eB s with (~,2,£o)[(_oo,n_ll=(V,X,W)l(_co,n_l] and w+=O. Condition (3) o f the lemma implies x(O)=O. •

P r o o f of proposition 3-25

In the p roo f we use a result on uniqueness o f the s ta te for minimal

realizations. For this result we refer to Willems [74, sections 2.4.3, 3.2.5

and 4.7.5].

Lemma 3 -25 Let /3~, i=1,2, be two minimal real izat ions of a given system

/3~B, and let n be the dimension of the s ta te space. Then there exists a

nonsingular SeR n×n such tha t fo r any we~3, if (Vi,Xi,W)~/3~, i=1,2, then

X 2 ~ - - S X 1 .

Now let Bs:=Bs(A,B,C,D ) be a minimal real izat ion o f BE[B, i.e., B={w; 3(v,x)

such tha t (v,x,w)~Ys}. First we show tha t /3s(A,B,C,D) also is a minimal rea l iza t ion of /3 if

(A,B,C,D)=(S(A+BF) S -1, SBR, (C+DF) S -I, DR) for Se~ n~n, R~R 'n×m both

nonsingular and for any F~R rn×n. I t suffices to show tha t it is a real izat ion.

This is easily seen, as on the one hand fo r web there is (v,x,w)~/3s, and with

(~.):=(R-l(v-Fx), Sx) there holds (~,w)eBs(A~B~C,D), while on the o ther hand

for (~,~,~)~Bs(A,B,C,D) there holds (v,x,w)e/3 s with (v,x):=(R~+FS-I~, S-I~),

hence weB.

Next let 13s(A,B,C,D ) be an a rb i t r a ry minimal rea l iza t ion o f /3. We then

have to const ruct S, R and F such tha t (A,B,C,D)=(S(A+BF)S -1, SBR,

(C+DF)S -1, DR). We will do this by considering i/s/o real izat ions.

224 Appendix

Let H be a permutation matrix such that in I=l :=Hw u plays the role of an LyJ Cc'~ (~1 input and y that of an output, see corollary 3-23. Let IIC:=t lj and HI):= 12 ,

where the partitions are according to ( : ] . T h e n 13s(A,B ,[cc: l:ci~:]) is a minimal

realization of IIB. Now D 1 is invertible, which is seen as follows. Let m u

denote the number of components in the (free) variable u. Let 13o(O):={aeRq; 3[~IEIIB with [~1--=0, [~l(0)=a}, then according to the proof of part ( i ) o f

proposition 3-22 there holds dim(13o(O)):=do=m*, while clearly do>_mu. The fact

that 6eB has i/s/o realizations implies m*<mu, hence do=m ~. Moreover, due to

minimality the state is past induced, from which it follows that g0(0)=im[D:].

As u is free it follows that DleR m*Xm* is surjective, hence invertible.

Defining {A,B,C,D):=(A-BD-11C1, BD-11, C2-D2D~IC1, DzD-11) we get 1113={ [~] ; 3(v,x)

such that [a~]=[AIB1]LC2 D2J [:]}={[~); 3X such that [ a ~ ] = I ~ l [:1}.

For (A,B,C,D)we analogously get //13={[~1; ~£ such that [ a ~ = [ ~ - ] [~}

where (~4,B,~f)):=(~4-Bt)-1'CI, / ~ 1 , ~2_~2/~1~,, ~2~[1). As these are two

realizations of the same system, according to the lemma there is an S such

that £=Sx. Hence (x ,u ,y ) s a t i s f i e s [~1 = [~ ~1 [ : ) i f and only if

[ ~ ] = [ s ~ s-l~] [:]. As the state is trim and past induced we can generate

anyxoeRnandtakeu(O):O, fromwhichweconclude[S-cTtss]:[~].Bytakingxo=O

and u arbitrary we conclude I s - l ~ [ ' ] A direct calculation now shows that t B J = LDJ (A,B,C,D)=(S(A+BF) S -~, SBR, (C+DF) S -1, DR), where R:=D~I/~I is invertible

and F:=D-11(C1S-C1). •

Proof of proposi t ion 3 - 3 2

In the proof we use a result on representation of linear, time invariant,

complete systems for time axis T=N. A system B¢(Rq) • is called linear if it is

a linear subspace of (Rq)t~, time invariant if B li2,00)cB , and complete if

{weB} *~ {wl[1,tleB][1,t] for all teN}. By B~ we denote the class of linear,

time invariant, complete systems in (Rq) ~. By B~(AR) we denote AR-systems in

(Rq) ~, i.e., any set for which there exist geN and ReRg×q[s] such that

B=B~(R):={we(Rq)~; [R(a)w](t)=O for all teN}. The following result is analogous

Chapter II 225

to theorem 3-5 and follows from Willems [73, theorem 5].

Lemma 3-32 B~=B~(AR). Moreover, for any BeB® there exists a row proper R

such that B=I3®(R).

Now let BoB T and define Be:=[we(Rq)~; W I[t,t+T_I]E]~ for all teN}. Shift

invariance of B implies 13=Bel[1,T]. Moreover, BeeB~. Let ReRg×q[s] be row

proper such tha t Be=lJoo(R). We will show that for any row proper R there holds

Bt(R)=B~(R)[[1,t], for all teN. Then especially 13=Boo(R)I[1,T]=I3T(R), which

shows BeBT(AR ). • , ~ a i ( i ) t~ (ii)~=0, i e [ 1 , g ] . Let RcRgXq[s] be row proper with rows ri(s)=2~lc=or k s , r d

Let L+:=col(ra(ii); ic[1,g])eR g×q. Then L+ is surjective as R is row proper.

Now first let wel3~(R)[[1,t], say w=Col[1,t] with ~eB~(R). Then

[ri(a)~o](T)=O, r~N, and especially for rows of R with degree di<t-1 there

holds [ri(~r)w](7")=O , ~-e[1, t-di] , which by definition means that wcl3t(R ).

Conversely, let weBt(R ). Then define ~e(Rq) t~ recursively as follows. Let

~[[1,tl:=w. If ~ is defined on [1,t*], then define ~(t*+l) as an arbitrary

solution of r{dl)~o(t*+l)+r(j.)_l~o(t*)+...+r(oi)~(t*-di+l)=O for all i with di<_t*. Existence of a solution is guaranteed as L+ is surjective. Clearly ~eB~(R), as

all laws [R(a)~z](t)=O, teN, are satisfied. Hence wel3~(R)[[1,t I. •

P r o o f o f p r opos i t i on &-33

(i) First let BoB T. It follows from the proof of proposition 3-32 that there

exists a row proper R with d(R}_<T-1 such that 13=I3T(R ).

Conversely, let d(R)<T-1 and R row proper. Let R have rows ri(s)= Z~t i ( i )k r(1)~0 ' ic[1,g], and let L+ee g~q have rows r (i) ie[1,g]. To show k=OT"k 3 , di ,

that BT(R)eBT it suffices to consider shift invariance, i.e.,

aBT(R)CBT(R)[D,T_I]. Now this condition is equivalent to existence of a

solution ae~ q of the set of equations r(a:)a+r(cl:)_,w(T)+...+r(ol)w(T-di+l)=O for

all ie[1,9], for fixed weBT(R ). Because R is row proper, L+ is surjective and

existence of a solution is guaranteed.

(ii) If R is zero order bilaterally row proper, d(r)_<T-1, then one easily

shows by a similar reasoning as in (i) that BT(R)eBr.

Conversely, let BeB T. Define ~ee:={'//}E(Rq)Z; W[[t,t+T_l]~13 for all teT}. As

226 Appendix

S is t ranslat ion invariant it follows that B=Bee[[1,T]. Moreover, 19eeeB.

According to theorem 3-5 and proposition 3-8 there exists a bilaterally row

proper R such that Bee=B(R)e8. I t follows from proposit ion 3-10 that R can be

chosen to be zero order bilaterally row proper by multiplying the rows by

appropr ia te f ac to r s a hi, nieT/. It remains to show tha t ~(R)_<T-1 and that

13(R)][,,T]=BT(R ). This follows by a reasoning completely analogous to the one

given in the proof of proposition 3-32, using the fact that R is bilaterally

row proper. •

P r o o f o f l emma 3 -34

In the proof we use a result for systems in Be0 which is analogous to

proposit ion 3-6(i). The proof is completely analogous to that of proposition

3-6. We use the following notation. For BeBm let B±:={reRl×q[s]; [r(a)w](t)=O, for all web and all teN}. If R~Rg×q[s] has rows rieRl×q[s], then let

M~(R):={re~l×q[s]; 3pieR[s], ie[1,g], such that r=zg=lpivi) denote the

submodule of Rl×q[s], generated by the rows of R.

Lemma 3-34-1 Let BeS~, Then {6=/3~(R)} ~=> {B±=Moo(R)}.

Now let BT(Ri)eBr, i=1,2. Define B~:={we(Rq)~; W][t,t+T_l]el3T(Ri) for all teN},

i=1,2. Shift invariance of BT(Ri) implies that B~I[1,T]=I3T(RI) , i=1,2, and ¢ ¢

that {BT(R1)CBT(R2) } <~. {BlcB2}.

Define Boo(Ri):={we(Rq)~; [Ri(a)w](t)=O for all teN}, /=1,2. Because [l(Ri)< e

T-1 there holds Bi=B~(Ri), i=1,2, which is seen as follows. Let R i have rows

r~.i),je[1,gi]. If W~Boo(Ri) , then [r~.O(a)w](t)=O for all je[1,gi] for all

teN, especially for all te[v, ~'+T-d(r}O)-l], for &ll teN, so w[[~,r+r_lle

BT(Ri) for all yeN, hence weB~. Conversely, if weB, e. then with wt:=w[[t,t+T_l] , teN, wteBT(Ri) , so [r}i)(a)wt](r)=O for all je[1,gi], for all re[1,T- d(r}0)]¢O, especially [r}O(a)wt](1)=[r}i)(a)w](t)=O for all teN, hence

weB~(Ri).

So to prove lemma 3-34 it remains to prove that {19~(RI)CI3~(R~) } ~=~ {there

exists an F such that R2=FRI}. Now (~) is obvious, while (=>) follows from

lemma 3-34-i. •

Chapter II 227

Proof of lemma 3-35 and proposi t ion 3-36

In the proof we make use of the following result, which was shown in the proof

of proposition 3-32.

Lemma 3-35-1 If R is row proper, then for all TeN ]3eo(R) I[1,TI=I3T(R ).

First we consider the results for LCLM and addition.

Let Ri be row proper, i=1,2. Define 13co(Ri):={w~(Rq)N; [Ri((7)w](t)=O for

all teN} and B:=Yoo(R1)+Boo(R2). It follows from lemma 3-32 in the proof of

proposition 3-32 that there exists a row proper R 0 such that B=13¢0(R0). We will

show that RocLCLM(RI,R2). Row properness implies that for all TeN 13T(Ri)=13(z(Ri) ]D,T], i=1,2, and

that 13T(Ro)---- Boo(Ro)[[1,T]. As [Boo(R1)+/~co(R2)][[1,T]= B~(R1)][1,T]+Bco(R2)][1,T] this implies that for all TeN also BT(R1)+BT(R2)=BT(Ro).

Taking T>_max{d(R0), d.(R1) , d(R2)}+l, l~T(Ri)CI3T(Ro) by lemma 3-34 implies

that there exist F~ such that Ro=FiR i. Moreover, if for /~ there exist /~i with

R=FiR,, i=1,2, then let U unimodular be such that UR=I~ ] with k r o w proper,

cf. e.g. Wolovich [77, theorem 2.5.11]. Then Yoo(R)=Sco(R)DSoo(R1)+13®(R2)= , u j

B®(Ro). Due to row properness it follows that for all TeN BT(R)DBT(Ro). Taking

T>max{~l(Ro) , d(R)}+l it follows from lemma 3-34 that there exists an F such

that [~=FRo, hence R=FR o where F:=U-lfFo]. This proves that RoeLCLM(R,,R2)and x. j

(i) of lemma 3-35 for LCLM.

Next let ReLCLM(R1,R2) be row proper. Then there exists an F 0 such that

R=FoR o and an F such that Ro=FR. In the notation of lemma 3-34-1 hence

Mo~(R)=Moo(Ro) , so Bco(R)=B~o(Ro)=Bco(RI)+Bo~(R2). Due to row properness BT(R)= I~T(R1)+BT(R2) , which proves (~) of proposition 3-36. Moreover, R=FoFR and

Ro=FFoR o. Because R 0 and R are row proper they have full row rank over R[s],

so that FoF=FFo=I, which proves (ii) of lemma 3-35 for LCLM.

Now second we consider the results for GCRD and intersection.

Let R~ and R~ be row proper. From lemma 3-32 in the proof of proposition

3-32 it follows that Boo is closed under intersection, as Boo(AR) clearly has

this property. Moreover, according to this lemma there exists a row proper R 0

such that B®(Ro)=B~(R1)nt3o~(R2). That RocGCRD(RI,R2) is proved in a way

228 Appendix

analogous to the result for LCLM, and one easily gets lemma 3-35(i) and (ii)

for GCRD.

To prove proposition 3-36(ii), let RcGCRD(R1,R2) be row proper and let

d.(Ri)___T-1 , i=1,2. By lemma 3-35(ii) there exists a unimodular Y such that

R=URo, so Boo(R)=B®(Ro) and BT(R)=BT(RO). So it suffices to prove that

BT(R0)=E{~T(R); ]3T( R, )CI3T( R1)NI3T( R2) }. Now Boo(Ro)ct3o~(Ri) and row properness implies 13T(Ro)CST(Ri) , i=1,2; hence

BT(Ro)Cl3T(R1)nBT(R2). So it suffices to prove that for R row proper

{13T(R)Cl3T(R1)~I3T(R2)) =¢, {13T(R)Cl3T(RO) }. Let t2 consist of the rows of R of

degree at most T - l , then d(Ri)<T-1, /=1,2, and lemma 3-34 implies that there

exist Fi such that Ri=FiR, i=1,2; hence B®(l~)cB®(Ro) and BT(~))=BT(/2)C

GT(R0). •

C H A P T E R III


Let P be bilaterally monotone on 13~ T. By taking t=2 in the definition of

bilateral monotonicity we have that generically in we6 {Gr_l(R)e

PT_I(WI[2,T]), ]3'ePTW}=~ {aB'eBT_I(R)} =~. {B'CBT(aR)} , which proves the shift

invariance condition for t=T.

To prove this condition for general te[2,T], let Be_I(R)ePe_I(wI[=,t] )

and B'ePt(wl[l,t]). Now Be~r , so by proposition II.3-30(ii) there exists ~eB

with "W[[T_t+l,Tl=Wl[1,t] and ]3t_I(R)ePt_I('W[[T_t+2,T]), B'EPt('W[[T_t+I,TI ).

NOW bilateral monotonicity implies that generically in CoeB al3'cSt_l(R), hence

B'eBt(aR ). We have to prove that this holds generically in weB. It is

sufficient to construct a linear bijection w-~ ~. This can be done as follows.

Let Bee:={wee~(Rq)Z; wee][r,r+T_lleB for all re][}. Because B is e e

t ranslation invariant B I[1,TI=B. I t can be shown that there exists a linear

injection L:8 -> B~e:w -> w ee w i t h ~#ee][1,Tl~-W such that for all f e z Lr:B -> G:

w -> Wee[[~+l,r+T] iS a bijection. Then for ~zeB take ~:=Lt_TW.

The idea to construct L is as follows. Let 6i/s/o be a minimal (forward)

input /s ta te /output realization of 8 ee (see corollary II.3-23) with state x and

with / : / = / / w a corresponding input/output decomposition of w. I t can be shown , . - .%

that there exists a linear map f such that x(r)=f(w[[r_r,r_d) for all reP. e e e e

Now take in 13i/8/o u periodic on Z+ with u I[kT+I,(I,.+I)T]:=UI[1,T] ~ k~.N.

Together with x(T+l) this uniquely defines "weel[o,~]. We define wee](_®,_1] in

an analogous way, using a backward realization 8i/s/o of Bee. This defines a

linear injection L:B--> B ee To see that Lr is a bijection, suppose that

wee][r+l,r+T] is given for some feZ. From this we can reconstruct x(r+T+l) and

u, as u is periodic. From Bi/s/o and 8i/s/o we then can reconstruct W ee o n ~'~

W e e hence especially :=w [1,T]. •

230 Appendix

P r o o f o f lemma 3-11

(~) Let r(n) be linearly independent from r(1),. . . ,r(n-1) and r(n+l) linearly }2

dependent on r(1),...,r(n), say r(n+l)=$i=lair(~) (defined for the columns

1,...,T-n of HT(W)). Define w(r), r>T, recursively by w(T):=En=laiw(T-n-l+i)

and define a Hankel extension M of liT(w ) by mij:=w(i+j-1). Using the ttankel

structure one gets rank(M)=n, hence rank(HT(W ))<_n. To prove that

rank(HT(W))>n , let M" be an arbitrary extension of IIT(W) and let d:=rank(M').

If d<n this would imply that among the rows 1,...,n of M' at least one, say

row n', is either zero (in which case r(n') is linearly dependent on

r(1),...,r(n-1) by definition) or it is linearly dependent on the foregoing

ones. This implies that r(n') is linearly dependent on r(1) , . . . , r (n ' - l ) , and

because of the Hankel structure of HT(W) and the fact n'<n this means that

r(n) would be linearly dependent on r(1),...,r(n-1). So rank(M')>n and hence

rank(HT(W))_>n.

(=,) Let rank(HT(W))=n. Then r(n) cannot be linearly dependent on

r(1),. . . ,r(n-1), as the construction above would give rank(HT(W))<_n-1.

Moreover, r (n+l) cannot be linearly independent from r(1),...,r(n), as this

would finply that any extension of liT(w ) would have rank at least n+l. •


A minimal rank extension which is Hankel was constructed in the proof of (~)

of lemma 3-11. •


Let rank(HT(W))=d. Assume w~B~B T. Shift invariance of B implies that there

exists weE(R) •, We l[1,Tl=W, We[tr,r+r_lleB for all v~N. Define an extension /~I

of liT(w ) by mij:=we(i+j-1), i,j=l,...,T, mij:=O elsewhere. Then rank(M)>d and

hence we l[r,r+T_l], T~[1,T], span a space of dimension at least d in B, hence

c(t3)>__d. Further, as rank HT(W)=d there exists a=(al,..,aa)eR d such that d . d d i-1 r(d+l)=~i=lair(z). Define Ra:=a -Zi=laia Then clearly weI3T(Ra) and

c(13T(Ra))=d. Using the definition of PT g this proves (i) and 3 in (ii). To

prove c in (ii), let BT(R)ePKTTW, so C(BT(R))=d , which implies that R has degree

Chapter III 231

d ~ d i-1 d . d, say K=a -2.~=laia . Then in HT(W ) r(d+l)=Y~i=lair(z ). •

P r o o f o f proposition 3-16

Let R~0 have degree [I(R)=d. First assume d<ENT(T/2), so we have to show tha t

generical ly in welt(R) rank(HT(W) )=d. For wel3T(R ) row d+l of HT{W ) is l inearly dependent on the foregoing ones,

so rank(HT(W))<_d. To prove tha t gen. rank(HT(W))=d, it suffices according to

lemma 3-11 to show tha t gen. row d is l inearly independent f rom the foregoing

ones. Sufficient fo r this is tha t gen. in WeBT(R ) rank(Hd, d(w))=d , where

w(1)

Hd,d(~):= ~ ( 2 )

v(d)

~ ( 2 ) ... w(d )

w ( 3 ) ... w ( d + i )

w ( d + l ) ... w ( 2 d - 1 )

Note tha t d<_ENT(T/2), so 2d-l<_T and Hd, d(i/J) is well-defined.

In BT(R), w(r), Te[1,d], can be chosen arbi t rar i ly while w(r), r e [d+ l ,T ]

can be expressed as linear functions of w(r), ve[1,d]. So for WeBT(R ) det(Hd,~(w)) can be considered as a polynomial in (w(1),...,w(d))e~ d. It

suffices to show tha t det(Hd,d(W)) is not the zero polynomial , because then

{w;rank(Hd,d(w))<d}={w;det(Hd, d(w))=O} is a proper algebraic va r i e ty and

hence rank(Ha, d(W))=d gen. in W~6T(R ).

That det(Hd, d(w))$O is seen as follows. We claim tha t det(Hd, d(W)) contains {w(d)} d as a term with coeff icient _+1. Indeed, det(Hd, d(W))=

. d p~p{Slgn(p).lli=la~p(i)} , where Hd,d(W)=(aij), P denotes the set o f all

permutat ions o f {1,...,d}, and s ign(p)e{-1 ,+l} . In order to get {w(d)} d, f r o m

every row and column i n Hd,d(W ) one has to choose an element which involves

w(d). In the f irst row this is only the element (1,d), so p(1)=d. In the

second row only the elements (2 ,d- l ) and possibly (2,d) contain w(d), so

necessari ly p(2)=d-1. Going on in this way one gets f o r {W(d)} d the unique

permuta t ion p:={d,d-1,...,2,1}. This proves our claim and hence det(Ild, d(W))~tO. Next assume R=O or d>ENT(T/2). By a shnilar reasoning as before one can

show tha t gen. in weBt(R ) HENT(T/2),ENT(T/2)(w ) has rank ENT(T/2) and hence

row ENT(T/2)+I o f HT(W } then is l inearly dependent on the fo regoing ones (its

length is ENT(T/2) if T is even, ENT(T/2) - I if T is odd). So then

rank(HT(W))<ENT(T/2 ) and hence it equals ENT(T/2), as row ENT(T/2) is l inearly

232 Appendix

independent from the foregoing ones. •


(i) Obvious.

(ii) Not monotone. Let 13=R T and te[3,T] odd. Then according to proposition

3-16 gen. in web there holds that for Bt_ld~t_1(w[[1,t] ) and Bt~P~t(w[[1,t]) dim(Bt_l)=(t-1)/2 and dim(6t)=(t+l)/2. For t>3 (t+l)/2<t-1 and dim(Bt)=(t+l)/2 implies that dim(Bt[[1,t_ll)=(t+l)/2 so gen.

Bt[[1,t_l]¢Bt_ 1 and pK is not monotone. We have used the fact that {GeEt,

dim(/~)=d} =~ {dim(13[[1,r])=d for all ve[d,t]} which follows from {6eEt,

dim(B)=d} ~, {there exists R of degree d such that 13=13t(R)}. Not shift invariant. Let B=R r and take t=3, so B l[2,t]=R 2 and B[[1,t]=R 3.

Let weB, wi[1,3]=(a,b,c ) with a¢O, b~O, ac-b2~O. Then P~2(w[[2,3])=132(¢-(c/b)) and Ba(a2-(c/a))eP~'3(w[[1,3] ). Shift invariance would require that gen.

I33(a2-(c/a))cBa(a(a-(c/b)))=B3(a2-(c/b)a), which clearly does not hold true.

Not linear. Take T=3, 131:=B3(a2-1),B2:=B3(~r+2). Then Bl+B2=R 3 so

generically in (wl,w2)eBlX/~2, if BeP~(wl+w2) then dim(B)=2. Also generically

BleP~3wl and 62=P~3w 2. Linearity would require that Ra=BI+B2cB which is false.

(iii) Take for example T=3, B0=R 3. Then gen. in weBo, if 13ePgw then dim(B)=2,

so B0¢B.

(iv) We will determine im(P~), ~KT, B~TK.

That im(P/~T)=B:r is seen as follows. If B=~ T then take weff~ T defined by

w(t)=0, t~[1,T-1], w(T)=l, so rank(HT(W))=T and by proposition 3-i3(i) RT=p~TW. If RT~BEBT, then according to proposition IL3-33(i) there exists R with

d=[l(R)<T-1 such that B=BT(R). Choose wel3T(R ) with w(r)=0, T6[1,d-l], w(d)=l and w(r) for re[d+l,T] computed by means of R. Then rank(HT(W))=d and by

proposition 3-13(i) BT(R)ePZT w. Next we prove B~={BeET; c(B)<_ENT(T/2)}. From example 1 we know already

' that R T ~ . Now let RT~BeET and let R¢0 with c(B)=d=~(R)<T-1 be such that

B=BT(R ). If d>ENT(T/2) then from proposition 3-16, gen. in weI3T(R), if 6"ePTw then dim(B)=ENT(T/2), hence I3T(R)~t@TW , So BT(R)~B~. If d<ENT(T/2) then gen.

in WeBT(R ) rank(HT(W))=d , so BT.(R)eP~w and BT(R)eEU~K. Finally we consider B ~ . First let T be even. Let BeE T with d:=c(B)<

ENT(T/2). Then gem in web the first d rows of IIT(W } are linearly independent

C h a p t e r III 233

and row d+l has T-d>_T/2>_d elements and this row is linearly dependent on the

foregoing ones. Using proposition 3-13(ii) and the linear independence of rows

1,...,d, this implies that gen. in web P~TW={B), i.e., the assigned model is

unique. So B ~ B ~ , hence equality holds.

Next let T be odd. If c(B)<_ENT(T/2)-I then by a reasoning as before we

get B~B~. If d:=c(B)=ENT(T/2) tllen in HT(W ) row d+l consists of T-d= (T-1)/2<d=(T+l)/2 elements and gen. pKTW is not unique; hence B~B~TK.

(v) As can be seen from the reasoning in (ii), (iii) and (iv), lacking the

properties of monotonicity, linearity, truthfulness and prudence has not

to do with possible nonunique assignment of models by pK. We now show that

shift invariance also cannot be obtained by choice of a selection rule S.

To get shift invariance, taking the example in (ii) with a~O,b~O,c#O, K ac-b2~O, this would require that for BeP3(a,b,c ) BcB3(a~-(c/b)a) while

dim(B)=2, so this requires Sa(a,b,c)=Ba(a2-(c/b)a). Moreover it is required

that (gem) S4(d,a,b,c)cB4(aa-(c/b)a2). Now gen. if BePK44(d,a,b,c) then

dim(B)=2. Let B4(a2+c~ct+fl)~(d,a,b,c). In order that B4(a2+c~a+fl)cB4(a a-

(c/b)a 2) according to lermna II.3-34 there has to exist a T such that

(a2+(xa+fl)(a+T)=a3-(c/b)a 2, which implies that (c~,fl)--(0,0) or (c~,fl)=(-c/b,O). But B4(a2)~t~4(d,a,b,c), as it requires b=c=0, and B4(a2-(c/b)a)~l~4(d,a,b,c), as it requires ac-b2=O. So it follows that it is impossible to construct a

shift invariant selection rule for pK. •

P r o o f o f t h e o r e m 3 - 2 0

The proof of this theorem is quite lengthy and will be split in a number of

steps. The result is proved by using a number of lemmas, some of which play a

role in the proof of proposition 3-25.

Nota t ion. First we introduce some notation. T is assumed to be fixed T - I k throughout. For R=Zk=oaks eR[s] let I(R):={ke[O,T-1]; ak=0}, so #I(R)=T-c(R).

Let B*(d), B*(I) and B*(d,I) as subsets of B T be defined as follows.

B*(d):={BT(R)eB~; ~(R)=d}, B*(I):={BT(R)eBT; I(R)~Z}, B*(d,I):=B*(d)nB*(I). Moreover define W(d), IV(I) and W(d,I) as subsets of R T by W(d):=U{BT(R); BT(R)~B*(d)}={wE~T; 3BT(R)~B*(d ) with w~BT(R)} , W(I):=U{BT(R); BT(R)~B*(I)} , and W(d,I):=U{BT(R); BT(R)~3*(d,I)}. Without loss of generality we restrict

2 3 '1 A p p e n d i x

attention to (d,I) with d+(T-#(I))<_T, i.e., d<_#(I). []

Let B0e/~ T be fixed, web o and HT(W ) its incomplete Hankel array. We now first

give an outline of the proof of theorem 3-20 by means of four lemmas and then

will give the proof of these lemmas.

Lemma 3 - 2 0 - 1 For every (d,I) either (i) w~W(d,l) generically in weBo, or

(ii) BoclV(d,I).

Lemma 3 - 2 0 - 2

19T(R(d,I) ) }.

{BocW(d,I)} ~ { 3BT(R(d,I))~B*(d,I ) such that B0c

This lemma states that if for every weBo there exists a model 13w(R)cB*(d,I )

such that weBw(R), then there exists such a model independent from weBo.

For Bo define d0e[0,T ] as follows. If EoCW(d,I } for all I and de[0,T-1]

then do:=T , else do:=min{de[O,T-1]; 3I such that BocW(d,I)}.

If d0=T , then according to lemma 3-20-1 (and using the fact that the

number of indexsets I is finite) generically in web o w¢W(d) for all de[0,T-1].

This means that gen. row d+l of H T is not linearly dependent on less than T-d

foregoing rows of IfTI so P~w=R T gen. on B0, and theorem 3-20 follows as

obviously in this case there is no BT(R)eB ~ with BoCBT(R ) and C(BT(R))<T.

For doe[0,T-1] let J0 be defined by Jo:={I;BocW(dD,I)}. Because by lemma

3-20-1 gen. on B 0 w¢IV(d) for d<d o and by lemma 3-20-2 BoCBT(R(do,I)) for

t~Jo, for some R(do,I), it follows from the definition of P~ that gen. on B0 0 * {BT(R(do,I)); Iedo}CPTWcB (do). Indeed, on B0 gem no remarkable laws of

degree d<do are satisfied while remarkable laws of degree d o always exist.

Because of lemma 3-20-1 we even have that gen. on B 0 0 * { BT( R( do,I) )'~ [~.Io}CPTWcU{B (d0,I); IeJo}.

0 * Lemma 3 - 2 0 - 3 For IeJo gen. on Bo PTWnB (d0,I) is a singleton, i.e.,

BT(R(do,I) ).

The generic way in which P~ assigns models on the basis of data from B 0 is

described in lemma 3-20-4, which is a direct consequence of lemma 3-20-3 and

the preceding discussion.

Chapter III 235

Lemma 3 - 2 0 - 4 Generically for wel3 o P~-w={l~T(R(do,I) ); IeJo}.

Now from lemm& 3-20-2 and lemma 3-20-4 it follows that gen. on B 0 if BePTW

then B0c• and c(B)=d o. To conclude the proof of theorem 3-20 note that by

definition of do if BeB r with c(B)<do, then B0¢B.. On the other hand, if 0 BeBT, c(B)=do, BocB , then gen. B~PTW. This proves theorem 3-20.

Finally we prove the foregoing lemmas.

P r o o f o f l emma 3-20-1 and lemma 3 - 2 0 - 2

Let (d,I) be given, c:=T-#(I), and let c+d<T. Suppose that wf~W(d,I) is not gem

true on B0. We will show that then there exists RcR[s] with 13T(R)c•*(d,I ) and

BOCBT(R), which proves the desired results. The proof has the following

structure. First we introduce some notation. Next in step (i) we prove the

result under two conditions, (C1) and (C2) , and using an auxiliary lemma, (L).

In step (ii) we prove (L) under conditions (C1) and (C2). Finally in step

(iii) we consider the case where condition (C1) or (C2) is not satisfied.

N o t a t i o n . Let [O,T-1]\I={il,ie,...,ic_l,d } c+d<_T. For web o define Hz(w)cR cx(T-d) by

with 0<i1<i2<...<ic_1<d and

HI(w):=

w(i 1+1) w(i 1+2) ... w(il+T-d )

w(i 2+1) w(i2+2) ... w(i2+T-d)

w ( i c _ l + l ) w(ic_x+2) ... w(ic_[+T-d )

w(d+l) w(d+2) ... w(T)

Let BO=BT(Ro) with Ra=ro+rls+...+rn_lSn-l+s n, where n:=dim(/30). Note that

wcW(d,I) if and only if the last row of Hz(w ) is l inearly dependent on the

foregoing ones. As it is given that this is not gen. false on B0 it follows

that Bo¢R T and hence n<_T-1. Let el:=(1,0,...,0)eR 1×" and define AeR n×" by the

so-cal led companion matrix of R0, i.e.,

236 Appendix

A:= 1 ° ° ' 1°1 0 1 0 . . . 0

0 0 . . . 1 0 = 0 0 . . . 0

L-To - r l - r 2 . . . - r n - 2 - r n - 1

0 In-1 ] .

I-to (-rl...-r._~)j

For we13 o le t x:=col(w(1) , . . . ,w(n))eR n. Note tha t {we13o}~=~{3xeR n such tha t

w(t)=elAt- lx fo r all te[1,T]}, so (A,el) is a "minimal rea l iza t ion" of 130, cf.

def ini t ion I I .3-20 and propos i t ion II.3-22(~i). Finally fo r xeR n define

x(t):=elAtX, t~yz+, and define H(x ,k )eR c×k by

H(x,k):=

x(il) x(G)

X ( i l + l ) . . . x ( i l + k - 1 ) x ( i 2 + l ) . . . x ( i 2 + k - 1 )

x(ic_x) x ( i c _ l + l ) . . . x ( i c _ l + k - 1 x(d) x ( d + l ) . . . x ( d + k - 1 )

Let Hi(x ) :=H(x ,T -d ) and II~(x):=H(x,2n-1). Note tha t f I i (x)=lI~(w ) for

w(t):=elAtqx, te[1,T]. []

(i) As a f i rs t s tep we p rove the result under condit ions (C1) and (Ca) and

using the aux i l i a ry lemma (L).

(C1) B0=BT(R0) , Ro=ro+rlS+...+rn_lsn-l+s n, where r0#0;

(C2) gen. on Y 0 the ma t r ix consist ing of the f i rs t c-1 rows of Hi(w) has

full row r ank c - l ;

(L) if w¢IV(d,I) is not gen. t rue on 13o, then under condit ions (C~) and

(C2) there holds tha t fo r generic xeR" the last row of H~(x) is

l inear ly dependent on the fo rego ing ones.

So suppose (L) holds true. Note tha t fo r Xo:--(O,...,O,1) T the re holds

det([xoAxo.. .An-lxo])#O, hence fo r gener ic xeN n de t ( [xAx . . .Anqx])¢O. This

implies tha t there exists xcR n such tha t this condi t ion is sat isf ied and

m o r e o v e r the last row of tt~(x) is l inear ly dependent on the foregoing r~C-1 jik+t

ones, i.e., there exist vckeR , ke[1,c-1] , such t ha t elAd+tx = 2,k=lc%elA X

fo r all t e [0, 2 n - 2]. Then col{ el, e lA, . . . , e lA "-1 )Ae[ x / i x . . . An-ix ] = _ n - I v-,c-I . i k , r . . A n - I x col(ex, e l A , . . . , e l n )(~k=lc~kn Jt X Ax. ], and as bo th col(%exA,

n-1 ,d r~c-1 Ai k ...,elA } and [xAx . . .Anqx] are inver t ib le it follows tha t zl =~k=lC~k/l . n d ~ c - 1 i k Define K:=s-2,k=lcYkS , then c lear ly BT(R)eB*(d, I ) and B0cBT(R), as fo r w~13 o

Chapter III 237

_ t - l , _ d ~ c - 1 - - i k . ^ and x:=col(w(1),...,w(n)) there holds [R(a)w](t)= elA (A-:.k=ltxk.q )X----O,

te[1,T-d]. This proves the desired results.

(ii) As a second step we prove (L) under conditions (C~) and (C2). If w¢W(d,I)

is not gen. true on Bo, then rank(Hl(X))=c is not gen. true on R n, and as

this is a polynomial condition in x we conclude that rank(Hl(X))<_c-1 for all

xeR n. We state that gen. the first c-1 columns of HI(x) are linearly

independent. Suppose this would not hold true. Then those first c-1 columns

are always linearly dependent and there is a column k<c-1 which gen. is

linearly dependent on the foregoing ones, say for xeV~{R'~\p-l(o)} for some n ~ ~ T - d - k

polynomial p:R ->R with p#0. Let pi:=pA i, ie[0,T-d-k] , and p:=tli=o Pl, then ~#0

as A is invertible under condition (CI), and V:={x; AixeV for all

ie[O,T-d-k]}~Rn\~-l(O) is generic in •n. For xeV there holds Ax~V and from the

structure of Iti(x ) it follows that for such x column k+l of Hi(x) is linearly

dependent on columns 2,...,k, hence also on the first k-1 columns. Analogously

it follows by induction that for xeV all columns k+i of Hi(x ) are linearly

dependent on the first k-1 columns, i~[O,T-g-k], and hence gen.

rank(Hl(X))<k-l<c-2. This contradicts condition (C2).

So gen. the first c-1 columns of Hz(x ) are linearly independent and as

rank(Hz(x))<c-1 for all xeR n it follows that gen. column c of H~(x) is

linearly dependent on the foregoing ones, say on the generic set V'cR n. Then

V':={x; AIx~V ' for all O<_i<_max{2n-l-c, T-d-c}} is also generic, due to (Cl) ,

and for xeV' both rank(Hz(x))_<c-1 and rank(H~(x))<_c-1. If T-d>_2n-1, then (C2)

implies that gen. on A n the last row of Hi(w)=Hi(x ) is linearly dependent on

the foregoing ones, and hence the same holds true for tl}(x) as it contains

less columns than Hi(x ). If on the other hand T-d<2n-1, then (C2) implies that

gen. the first c-1 rows of H~(x) have rank c-l, and hence again gen. the last

row of Hei(x) is linearly dependent on the foregoing ones. This proves (L).

(iii) Finally we consider the case where condition (C1) or (C2) is not

satisfied. This step is split in four parts.

(iii-1) First suppose that condition (C2) is not satisfied. Then there exists

a k_<c-1 such that the matrix consisting of the first c-1 rows of HI(w)

always on B o has rank at most k and such that there is a WoeB 0 with

rank(H1(wo))=k, say for Wo the rows in J':={i~,...,ik}c{il,...,ic_1}=:J are

238 Appendix

linearly independent, where O<_i~<...<i~<d. Then gen. on B0 the rows in J ' are

linearly independent and the rows in J\J' are linearly dependent on those in

J ' . As it is supposed that the last row of Hl(W ) is not gen. linearly

independent from the rows in J it also is not gen. linearly independent from

the rows in J ' . Now condition (C2) is satisfied for c ' :=k+l and

I':=[ O,T-1]\ ( J'u{ d} ).

I f condition (C1) is satisfied then the results of (i) and (ii) imply

that there is an R' such that 13oCl3r(R')cB*(d,I')d3*(d,I ) as 1%1, and the

desired results follow.

Finally let condition (C2) be satisfied for c' and I ' as defined before

and suppose that condition (C1) is not satisfied. Let n':=min{k; rk¢0 }. We will

consider three cases, i.e., in (iii-2) that n'>d, in (iii-3) that n'<i[, and

in (iii-4) that i~<n'<_d.

(iii-2) The case n'>d cannot arise. It would imply that on B0 the values of

w(t), t~[1,d+l], could be chosen freely. Now for je[1,k] and

l~[1,k-j+l] there holds i~+l<i)+k-j+2<_i'j+l+k-j+l<...<i~+2<d+l , and hence there

is a w0E/~ 0 with wo(ij+l):=O, j~[1,k], l~[1,k-j+l], and wo(d+l)=wo(i~+2)= wo(i~_1+3 ) . . . . . Wo(i't+k+l):=l. From the structure of Hi(wo) it follows that

rank(Hi(wo))=c' and hence that gen. on 130 rank(Hi(w))=c'. This contradicts

that the last row in Ht'(w) is not gen. linearly independent from the

foregoing ones.

(iii-3) Next suppose that n'<_i~. Then consider (R'd,I",T") defined by

~n - n ~ . o , i - I r - F ~ . r i r R0:=s Ro, T":=T-n', and [ .=[0,T -1 ] \{h -n , z2 -n , . . . , zk -n ,d-n }. Note

that R~ satisfies (C1) and that Hl"(w ) satisfies (C2) on Br"(R~) , as Hz'(w)

satisfies this condition on B 0. As moreover d(Rg)+c(Rg)=

d-n'+(k+l)<_d-n'+c<_T-n'=T" it follows from steps (i) and (ii) that there is an

R" such that BT'(R'~)CBT"(R")EB~'(d-n',I" ) and hence Eo=I3T(RO)CI3T(a~'R')~

B (d , I ' ) c2 (d,I) , as c(a R")+d(a R')<_k+l+d<d+c<_T. Hence the desired results

follow.

(i~i-4) Finally suppose that i~<n'<_d. Let /~[2,k+1] be such that i~_l<n'<i[,

where i~+fi=d. Let H(w)~R (k÷l)x(k÷l) consist of the first k+l columns

of ttz'(w) and let H(w)~R (k-z÷~)×(k-:+:) be defined by

Chapter III 239

~(v):=

w(i~+l) ... w ( i i+k+l )

w(i~+lTl) ... w ( i i + l + k + l )

w(i~+l) ... w(i~+k+l)

w(d+ l ) ... w(d+k+l )

Note that the assumption that w~IV(d,I') is not gen. satisfied on B 0 implies

that det(H(w})-=0 on B 0. Moreover on B 0 there holds that, for given values of

w(t) for te[n'+l,T], the values of w(t) for te[1,n'] can be chosen

arbitrarily. Let Wl[n,+l,T]el3Ol[n,+l,T ] be arbitrary and fixed, choose w(t):=0

for te[1,n']\{i~_j+j;je[1,l-1]}~, and consider det(H(w)) as a polynomial in

the variables {w(i~_j+j); je[1,l-1]}. Note that indeed for je[1,/-1] i'z_i+y"

<i~_l+l<_n'. It is a simple matter of explicitly writing out H(w) to prove that 1-1 ., .

det(H(w}) contains the term //jffilW(h_j+3) with coefficient det(H(w)). As

det(H(w))--0 it follows that also det(H(w))=0 on B 0.

Now for w e b o define ~:=w I[i'+~,d+k+lleR ~, where T:=d+k-l-i~+2. Further let

~0 := /30 [ , = /30[ Ro:=s-n'Ro=~O+~lS+...+~~ sn-l+s ~ with ~0#0 [il+l,d+k+l] [n '+ l ,n '+~] ~ n - 1 and g:=n-n'. Let ~':=k-/+2 and ('fl,'f2,...,'~.~_l,d):=(O,i~+l-i],...,i~-i~,d-i~) and

let _~:=[0,T-1]\{~'l,...,~'~._l,d }. Define .4 and £ in terms of /~0 and @ analogous

to the definition of A and x in terms of R0 and w respectively as indicated in

the notation in the beginning of the proof.

Note that Lt(w)=nT(@ ) for /30, that hence det(H~.(@))-0 on B0, that ~'+d=T,

and that /~u satisfies condition (C1). As det(HT(@))----0 on Bo it follows that

there is a row in HT(5 ) which gen. is linearly dependent on the foregoing

ones. This shows the existence of a d.'e{~'l,...,~'~%,d } such that for gen. w~/~0

@clV(d',I)cR ~. Using arguments similar to those of steps (iii-1) and (i) it

follows that there exist {~'~,...,~'~, d.'}c{~'~,...,~'~._~,d} and 51eR, jel l ,k] ,

~ =BoCBT(R)~B~ such that ,a =~j=lC~js ~. Let m=s -~j=~c~js ~, t h e n ]flOl [n,+l,n,+T ] T

as c(fi))+d(R)<~'+d=T. From this and shift invariance of B 0 it follows that

~ 0 C H T ( $ n R). Let R:=s'tR, then I3ocI3T(R), as i;>_n', and [O,T-1]\I(R)c

{ ~'1+i~,..., ~'~._1+i~, d+i~}={i~,i~+~,...,i~,d). It was supposed that condition

(C2) is satisfied for c' and I ' of step (iii-1). Hence necessarily d'=d and

d(r)=d. Hence BoCBT(R)eB (d , I )cB (d,I), which proves the desired results.

240 Appendix

This concludes the proof of lemma 3-20-1 and lemma 3-20-2. •

To prove lemma 3-20-3 we make use of a result stated in lemma 3-20-5 which

also plays a role in the proof of proposition 3-25. Let 0 ~ i 1 < $ 2 < . . . < i c _ 1

<d~T-c and

~(w):=

w(i l+l ) w(il+2) ... w(i l+T-d )

w(i2+ 1 ) w(i2+ 2 ) ... w( i2+T-d )

W(ic_l+l) W(ic_1+2) ... w( ic_ l+T-d )

L e m m a 3-20-5 If rank (M(w))<c-2 everywhere on G0, then there exists

R~O, [l(R)<_ie_l~ I(R)~[O,T-1]\{i1,...,ic_l} , such that BOCBT(R)E~3 T.

P r o o f o f lemma 3 -20-5

There is at least one row of 3/ which is not gen. linearly independent from the

foregoing ones, say row k0. Exactly analogous to the proof of lemma 3-20-1 and

lemma 3-20-2 it follows that this row then always is linearly dependent on the

foregoing ones and that there exist a k such that on B o ~o(i%+t)= kO-I ~k=l akw(zk+t), tE[1,T-d]. By shift invariance of B0 this then also holds true

n ik 0 ~k0-1 i k * on [1,T-i%]. Define K:=a --:'k=l aka , then 13OCI3T(R)EE3T, while [l(R)=i%<ic_l

and I(R)~[O,T-1]\{il,...,ie_l}. •

P r o o f o f l e m m a 3 - 2 0 - 3

Let [~Jo, and define M as before, with {il,...,ic_l,do}:=[O,T-1]\I.

We state that gen. on /30 rank(M(w))=c-1. For suppose this is not true,

then det(MMV)¢0 is not generic, so det(MMT)=o on B 0 and rank(M(w))<c-2 on 130.

By lemma 3-20-5 this would imply that there exists R¢0 such that

BoCBT(R)~q*(d',I'), where d'<<_ic_l<d o and I'~[O,T-1]\{i,...,icq }. Hence 60c

BT(R)cIV(d',I') and d'<do, which contradicts the definition of d 0. 0 * d o c - 1 j i k

Now suppose that BT(Rj)~PTWnB (d0,I), j=l ,2 . Let Rj(a)=a +~k=laka , J i i a :=(al,...,ac_l) ' j=1,2. Using the notation of lemma 3-20-1, this means that

(al,1)lI1(w) =(a2,1)Hz(w)=O, so (al-32)31(w)=0. As gen. on B o rank(M(w))=c-1, we

C h a p t e r III 241

0 * get gen. on B0 al=a2; hence RI=R2, i.e., gen. on B o PrwnB (d0,I) contains at

most one model. From lemma 3-20-2 and the discussion following that lemma we 0 * 0 *

know that gen. on B o ~T(R(do,I))EPTWNB (d0,1). So gen. on B 0 PTW(3B (d0,I)

consists of a singleton, i.e., BT(R(do,I)). •

This concludes the proof of theorem 3-20. •

P r o o f o f c o r o l l a r y 3-21

From theorem 3-20 we immediately conclude that if B0eB r then gen. on B 0 0 ~oePT w. To prove the corollary, due to theorem 3-20 it suffices to show that

{13ocBeBr, C(B)=C(Bo) } ~ {B=Bo}. This easily follows from lemma II.3-34 and the

fact that for R~=0 with d(R)<T-1 there holds C(BT(R))=[l(R ). •


(i) Obvious from the definition of po.

(ii) Obvious from theorem 3-20. • w . 0 * * 8 (iii) Obviously Bpoc~poClm((PT)C~T,. , so it suffices to show that IRTC~p~ ,. This

is immediate from corollary 3-21.

(iv) Consider p0 with the action of p0 restricted to models in the set B t.

For monotonicity and shift invariance, consider the inclusion conditions

on t which involve modelling w[[1,t_ q or w[[2,t], and w[[t,t]. Assume

E=ET(R)e~] r with c(r)+[l(R)<t-1.

For the monotonicity condition, observe that w[[1,t_q~Bt_l(R)eB~_l and

w[D,t]eBt(R)cB* t. From corollary 3-21 it follows that gen. in web

P°~(w [D,t_I])=Bt_I(R) and P°(w ] D,t])=Et(R), and the condition

Bt(R)[[x,t_x]cBt_l(R ) is trivial. Note that in fact corol lary 3-21 only gives 0

that for example P,_l(W[[1,t_l])=Et_l(R) gen. in w[[1,t_l]eBt_l(R). That this

also holds true gen. in web can be derived from the fact that d(R)_<t-1 which

implies tha t there is a linear bijection w[D,t_fl-->w from Bt_I(R ) to B.

~ ( R ) k For the shift invariance condition, consider two cases for R=.~k=o aka.

If a0~0 , then according to propositions II.3-30(ii) and II.3-33(ii) w[[2,t]e

B[[2,t]=B[[1,t_l], while if a0=0 , then w[[2,t]~B[[2,t]=l~t_l(o-lR). Generically 0 0 - 1

in web Pt_l(W][2,t])=Bt_l(R) in the first case, P,_l(w][2,tl)=13t_t(a R) in

242 Appendix

the second case. The shift invariance condition is trivally satisfied in both

cases.

Concerning linearity, let BieB~, i=1,2, then gen. in (wl,w2)et3axl3 z 0 0 P,wi=Bi, i=1,2, while due to theorem 3-20 gen. for 13ePt(Wl+W2) /~1+/~2c/~, which

0 proves linearity. Note that theorem 3-20 in fact gives Bl+132cBeP,w gen. in

weSx+S2, but one easily proves that this then also gen. holds true in

(wl,w2)~13,xB2 with w:=wl+w 2. Finally we give an example which shows that p0 is not monotone, not shift

invariant, and not linear.

Example. Let T=20, R:=al°+ag+2aS+aT+os+2aS+a4+a3+2a2+a+l, so c(R)+[l(R)= 21,

and consider B:=132o(R ). Further define Rl:=(a-1)R=a1'+a9-aS+a6-aS+a3-a2-1 with

c(R,)+~t(R1)=19, and R "-to" l~R--o"11+lal°+aag+lo'7+30"6+lo'4+30"3+lo- 1 with 2.-~ -~ / - ~ ~ ~ ~ ~ ~ ~ -~ 0 0 c(R2)+ct(R2)=20. Then gen. in w=/3 P19(wl[1,19])=P19(wli2,2o])=Blg(R1) while

0 P2oW={I32o(RI), ~2o(R2)}. So pO is not monotone as 132o(Rz)][L19]C.B19(R1), and

pO is not shift invariant as B20(R2)¢B20(O-.R1). pO also is not linear, as for 0 0 0 BI=B2:=B and for generic (wx,w2)el3xl3 there holds P2owa=P2ow2=P2o(Wx+W2)=

{B2o(RI),B2o(R2)}, while e.g. 132o(Rt)+B2o(R1)=B2o(R1)gt132o(R2) as would be

required for linearity. []



For BoeB T we have to prove that gen. in w~B o the following holds:

{ReL(w):={R'#O; d(R')<T-I and W6~T(R')e~3~'}} ~ {~DC~T(R)}.

Notat ion. We will use some of the lemmas and the notation introduced in the

proof of theorem 3-20. Further we define Ko:={(d,I); BocW(d,I)) and

L(w;d,I):={ReL(w); I(R)~I,a(R)=d}. Again we can restrict attention to the case

d+(T-#(I))<T, i.e., d<_#(I). []

Remark. According to lemma 3-20-1 gen. on S 0 L(w;d,I)=o if (d,I)~Ii o. So gen.

on /~0 L(w)=[.J{L(w;d,I); (d,I)eKo}. []

Chapter III 243

The following lemma is crucial in the proof of proposition 3-25.

Lemma 3-25 Let (d,I)~K o be fixed. Then there exist n>_0 and R(J)~R[s],

je[0,n], such that

(i) d=[l(R(°))>~l(R(1))>...>~l(R(")),I(R(J))~I,je[O,n];

(ii) BoCBT(R(J))forallje[O,n];

(iii) gen. in wEB o L(w;d,I)=spano{R (i), j~[O,n]}:={R; 3oLj~R, j~[0 ,n] ,

c~0#0 , such that R=Z~=occjR(J)}.

We first give an interpretation of this lemma, then give the proof of

proposition 3-25 using lemma 3-25, and finally prove lemma 3-25.

I n t e r p r e t a t i o n . Lemma 3-25 has the following interpretation. Let (d,I)eKo,

then by definition of K 0 for every w~B o there is a remarkable law R w with

W~Br(Rw)cB*(d,I ). Now the lemma states that gen. in B 0 the class of

unfalsified remarkable laws in 8*(d,I) is independent of web o and that this

class is spanned by a finite number of "basic laws" R (j) which moreover are

true laws for g0. []

Proposition 3-25 can be proved by using lemma 3-25 in the following way.

Generically on B o L(w)=U{L(w;d,I); (d,I)~Ko} , and as K0 is a finite set it

suffices to prove that for (d,I)eK o gen. in web o {R~L(w;d,I)} ~. {BoCBT(R)}.

According to lemma 3-25(ii), (iii), ReL(w; d,I) gen. is of the form

R=Z~.=oc~jR (j), (x0~0 , with BOCBT(R(J)), je[O,n]. Using lemma 3-25(i) this

implies that for WeBo [R(i)(a)w](t)=O, te[1,T-a(R(J))]~[1,T-d]. This implies

that [R(a)w](t)=O, t~[1,T-d], and hence weBT(R ) as cl(R)=d for c¢0~0. So

BoCl3T(R), which proves proposition 3-25.

Now finally we prove lemma 3-25.

Remark. First note that if do:=min{d; there exists I such that (d,I)eKo} then

it follows from lemma 3-20-3 that for (do,Io)dio L(w;do,Io) gen. is a

singleton BT(R(do,Io)), and according to lcmma 3-20-2 Yoclg,l.(R(do,Io)), which

proves lemma 3-25 for d 0. However, in general for (d,I)~K o L(w; d,I) need not

gen. be a singleton. As an example, let T=5, Bo:=Bs(a-1), then (d,I):=

244 Appendix

(2,{3,4})eK 0 and for all WEB o L(w; d,I)~{Rc,,c¢eR} where Rc,:=(a-c~)(a-1)= a2-(~+l)o+~. []


We give the proof by construction. First we define R (j) and then we show that

these have the desired properties.

Part (i) and (ii). Let

...<ic_l<d (c+d<_T), and define

w ( i l + l )

w( i 2+ 1) III(w):=

(d,I)EKo, [O,T-1]\I={il,i2,...,ic_l,d), 0<ii<i2<

w( i t +2 ) ... w( i l+T-d ) w(i2+2) ... w(ie+T-d)

w ( i c - t + l ) w ( i e - l + 2 ) ... w ( i c_ l+T-d ) w(d+l ) w(d+2 ) ... w(T)

and let Mk(W ) consist of the first k rows of tlI(w ).

As BocW(d,I), lemma 3-20-2 implies that there exists R (°) such that

13OCBT( R( °))cI3* ( d,I ). n ~ c - I i k d

Now note that for K=~k=laka +aca , ac¢:O , a:=(al,...,ac) , there holds

{RcL(w; d,1)}~{aHi(w)=O }. If gen. on 13 o rank(Mc_l(w))=c-1, then gen. a is

unique up to a constant factor and hence gen. L(w; d,I)={c~R(°); 0#c~R} and

lemma 3-25 is shown with n=0.

So suppose that not gen. rank(Mc_x(w ))=c-l; hence not gem

det(Mc_l(w)Mc_l(w)T)#o, so det(Mc_l(w)Mc_l(w)T)--O and rank(Mc_l(W))<_c-2 on

~0-

Lemma 3-20-5 implies that there exists R' with 13ocBT(R' ) and [l(R')<ic_l< d=d(R(°)), I(R')~Iw{d). Let R (1) be such a law for which d(R (1)) is maximal.

Let d(R(1))=idl and I i : = [ O , T - 1 ] \ { i l , . . . , id l ) .

Now either gen. on /30 rank(Mavl(w))=dl-1 , in which case gen.

L(w; ial,I1)={c~R(1); O#c~R) and we stop, or rank(Mdx_l(w))<_dl-2 on B0. In the

latter case, using lemma 3-20-5~ we find a law R (z) of maximal degree in the

class of laws with I(R")~I1U{ial } such that 13oC13T(R" ). So BoCBT(R(2)). Let

~l(R(2))=id2<idl_l<idl=Cl(R(1)) and I2:=[O~T-1]\{il~...,id2 }. Going on in this way we find a number n_<c-1 such that for j~[0,n] there

exists R (j) with 13oc13T(R(J)), I(R(J))~I, d(R(J))<cl(R(J-1)), while for ian:=

Chapter III 245

d(R(n)), In:--[O,T-1]\{il,...,ian}, gen. on 13o rank(Mdn-l(W))=dn-1, so gen. on

/30 L(w; idn,In)={ccR(n); O:pecE~}.

In this way we have defined n, R (j), j~[0,n] , with [l=~(R(°))>[l(R(1))>...> d(R(n)), I(R(J))~I, and /3oc/3r(R(J)). This proves (i) and (ii).

Part (iii). If R=Z~.=o~jR (j), c¢0¢0 , then cl(R)=d, I(R)3I, and /30C/3T(R)eET, as

c(R)+[I(R)=T-#(I(R))+[I(R)<_T, so ReL(w;d,I). We now have to show tha t gen. on B 0

if R~L(w; d,I) then there exist (~k , kc[0,n], with c~0¢0 , such tha t

R=~=o~jR (j). Without loss of general i ty we assume R and R (j) to be monic,

je [0 ,n] .

So let R~L(w;d,I) be given. If R=R (°) then we are done, else define

Rl:={R-R(°)}.flo , with flo such tha t R 1 is monic. We state tha t gen. on B 0 there

exists je[1,n] such tha t [l(Rx)=[l(R(J)). For suppose this does not hold true,

then there exists ikeI\{[l(R(J)), j e [0 ,n]} such tha t in Hz(w ) row k is not gen.

l inearly independent of the foregoing ones; hence by lemma 3-20-1 it is always

l inearly dependent on them and lemma 3 -20 -2 implies tha t there exists Rik , ~l(Rik)=ik, I(Rik)~[O,T-1]\il,...,ik) , with BoCBT(R i ). Now i~.<~l(R (n)) is

impossible by definit ion of n, so [l(R(n))<ik<[l(R(°~). Let j be such tha t

~l(R(J))<ik<~l(R(J-1))=:dj_a, then this contradic ts the const ruct ion of R (j) as

being of maximal degree in the class of laws g such tha t I([~)~[O,T-1]\{ix,i2, ...,idj_l_l} and ~0C/3T(R).

So indeed gen. on B 0 there exists je[1,n] , say Jl, such tha t d(R1)=

d(R(Jl)). I f RI=R (jr), then stop, else define R2:={R1-R(Jt)}.fll , where fll is

such tha t R 2 is monic. Going on in this way we gen. reduce R to laws of lower

degree in the set {~(R(J)), je[1,n]}. The process gen. will end either if we

find a ke[1,n] such tha t Rk=R (jk), or if we get R k with a(Rk)=d(R(~)). As

rank(Mdn_l(W))=dn-1 gen. on /30, also gen. Rk=R (~) in the la t ter case.

In this way we conclude tha t gen. on /30 if ReL(w; d,I) then R=R(°)+/3~IR~= R(°)+flolR(Jl)+fl~lflxlR z and going on in this way we find c~j, j c [0 ,n] , with

C~o=l , such tha t R=E~=occjR (1).

This concludes the p roo f o f lemma 3-25 and hence o f proposi t ion 3-25. •

2 4 6 A p p e n d i x

P r o o f o f c o r o l l a r y 3 -26

According to proposition 3-25 gen. in we/30 Goc13T(R(w)) , so gen. W~GT(R(w))

and hence gen. P~.W=GT(R(w)), R(w):=GCD{R; ReL(w)). So gen. on B 0 if ReL(w) then

13oc13T(R) and P~,zo=13T(R(w) )c13T(R ). • P r o o f o f c o r o l l a r y 3-27

Let R:=GCD{R#0; d,(R)_<T-1 and BoC13r(R,)e~) , R(w):=GCD{R; /~eL(w)} From

corol lary 3-26 gen. on G 0 if R.eL(w) then 13oC13T(R)¢BT~ hence by using

proposit ion II.3-36(ii) gen. on 13o 13T(R)CBT(R(w))=P*rw. On the other hand, if

R~0, d(R)_<T-1 and GoCBT(R)eB~, , then on B0 R~L(w), so BT(R(w))CBT(R), which

implies GT(R(w))C13T(R), so gen. on 130 P~WCBT(R ). •

P r o o f o f theore m 3-28

(i) Obvious from the definition of P*.

(ii) For B~IR t let R(13):=GCD{R#O; d(R)<t-1 and 13c13t(R)cBt}.

Monotonicity. Let 130e~lT, we130, P*t_l(wltLt_ll)=13t_l, Pt(wl[1,tl)=13t, then

we have to prove that gen. 13t[tl,t_llc13t_l. If /3t_l=R t-1 then this is trivial,

so assume 13t_l~R t ' l . Then from corollary 3-27 we can conclude that, gen. on

13o, 13t-l=13t-l(Rt-1) and Gt=Bt(Rt) , where Rt_l:=R(Gol[Lt_l] ) and Rt:= R(13o 103])" Now if R is such that B o [[1,t_llC13t_x(R)eBt_l, then

Bol[2,tlcB ol[Lt_llcBt_l(R ) and hence 1301tl,t]c13t(R)~Bt as c(R)+d(g)<_t-l<_t, hence for these R Gt(Rt)c13t(R), cf. proposition II.3-36(ii). This implies

Gt(Rt)cl3t(Rt_l) , hence also gen. on 130 Btl[1,t_1]=13t(Rt)l[1,t_11 c

13t( Rt-1 ) I [1,t-1]=13t-l( Rt-1)=Gt-1 • Shift invariance. Let 13oeBT, WeGo, Pt_l(w[t2,tl)=Gt_l(R), 13'=Pt(wltLtl),

then we have to prove that gen. B'cGt(a.R }. From corol lary 3-27 it follows

that gen. on 13 o R=R(13ol[2,t]) and gen. 13"=Bt(R' ) where R'=R(BoI[1,t]). Now if

is such that 1301 ~ * ~ * [2,tlc13t_l(R)eBt_l, then Bol as [1,t]cGt(aR)d3t c(crR )+t~(a]~)=

c(P,)+~(R)+l<_t-l+l=t. So gen. 13'=13t(R')c13t(P. ) where R:=GCD{a/~#0; d(/~)___t-1 and G N . o [ [2,t]c13t-l( R )~Bt-1} =erR.

Lineaxity. P*dcxw)=P**(w) for ~#0 follows from L(cxw)=L(w), ¢x#O. Now let

(131,132)~B~, Bi=13T(Ri) , /=1,2, ( t 0 1 , W 2 ) e B 1 X 1 3 2 , * ' * " PTWl=13T(R ), PTWz=13T(R ),

PT(wl+wz)=13T(R), then we have to prove that gen. 13T(R')+BT(R")C13T(R).

C h a p t e r I I I 247

According to proposition II.3-36(i) BI+B2=BT(K) with K:=LCM(R,,R2). According

to corollary 3-27 gen. in wl+w2, hence also gen. in (Wl,W2)eBl×B2, there holds

R=R(BT(K)). Moreover, gen. R'=R(Bx) and R"=R(B2). Because BicBT(K), i=1,2, if

BT(K)CBT(R. ) then also BiCBT(R), i=1,2, so BT(R')CBT(R), BT(R")CBT(R), and

hence also BT(R')+BT(R")CBT(R). (iii) This is immediate from corollary 3-27. Note that for /~:=~T p~w=~T gen. in

we~ T.

(iv) Let V:={BT(R); R=GCD{RA, AeA} for some {RA, AeA} with d(Rx)<T-1 and

BT(R~,)eB~}. For every procedure B~TCB~Tcim(PT) , so it suffices to show

that VcB~ and that im(PT)cV. If BeV, then corollary 3-27 implies that gen. on

B PTW=B, hence BeB~ . If Be im(P~), then either s=eTev or 3weR T such that

B=BT(R(w)) , R(w)=GCD{R; Ret(w)}. Because for ReL(w) d(R)<T-1 and BT(R)e8 ~ it follows that BeV. •


Let BocBr. As L(w)cL(w) we conclude from corollary 3-26 that gen. on B o

{ReL(W)} =~ {BoCBT(R)}. So gen. on Bo BoCBT(R(w)) and hence gen. ['~W=BT(R{w)) where R(w):=GCD{R;ReL(w) }.

Now define R:=GCD{/~'~0; d(R)<T-1 and BOCBT(R)eB~}. For every R~0,

d(R)<T-1 with BoCBT(R)e~ ~ there holds that on B 0 ReL(w), hence BT(R(w ))cBT(R ) and gen. PTWCBT(R). On the other hand gen. on B 0 for ReL(w) ~ ~* BocBr( R)~ST, hence

gen. BT(R)CBT(~(W)) and gen. BT(R)G['~w. •


It is trivial that P* is exact. That it is truthful immediately follows from

proposition 3-32. Strong prudence is shown by means of proposition 3-32 in a

way exactly analogous to the proof of theorem 3-28(iv), along with the

characterization of im(,hT)=Bp~ =8~..prS Linearity is shown by using proposition

3-32 and the arguments in the proof of theorem 3-28(ii).

Hence it remains to show that P* is bilaterally monotone. Monotonicity

follows as in the proof of theorem 3-28(ii). Finally let B0eBT, then we have

to prove that gen. in weB 0 there holds aBT_t+2CBT_t+l, where BT_t+i:= N$ PT_t+i(Wl[t_i+l,T])~ i=1,2. According to proposition 3-32 gen. on B 0 BT_t+i=

248 Appendix

Br_t+i(ei) , with Ri:=GCD{R#O; d(R)<T-t+i-1 and Bo [[t-i+l,T]C~T_t+i(R)eB;_t+i} ,

i=1,2. Now if R is such that [l(R)<T-t and BOI[t,TICBT_t+~(~)e~_t+X, then

translation invariance of B o implies (see proposition II.3-30(ii)) that also

~o[[t-I,T-1]=B0[[t,TICBT-t+I(R'), hence 1301[t_l,TlCBT_t+2(R)eg~._t+2. From this

it follows that BT_t+2(R2)CBT_t+2(R1)~ hence gen. on /~o l~T-t+2CI3T-t+2(Rl),

especially aBT_t+2caBr_t+2(Rt)=BT_t+x(R~)=Br_t+ 1 gen. on /~0. •


For B0=~ T the result in (i) follows from the truthfulness of PT and PT. So let T - 1 k B0C:R T. For O~R=~,k=oak a ~R[a] let l(R):=min{k;ak~O }. It easily follows that

{I3T(R)e~T} ~. {I(R)=0} for R~0, d(R)<T-1.

Let Bo=BT(Ro)eB T. From corollary 3-27 and proposition 3-32 it follows

that gen. on G 0 P~w=I3T(R) and PTW=BT(R), where R:=GCD{R'~e0; d(R')_<T-1 and

13ocl3T(R')~B~} and R:=GCD{R'¢0; d(R')<_T-1 and 13oct3r(R )eBT}.

(i) Let B0e~T, so l(Ro)=O, and let R'#0, d(R )<_T-l, such that ~oCBT(R )~T. Then according to lemma II.3-34 there exists F' such that R'=F'R o. If

l:=l(R')=l(F')~O then define F' by F':=a-IF ' and R':=~"R o. So /~ocG(R')eB~ as

/(R')=0 and c(R')+[I(R')=c(R')+[I(R')-I<T-I<_T. Now R'=o~R ' and it follows that

R=GCD{R'~O, [I(R')_T-1 and BoCI3T(R')e~;}=GCD{R'¢O; d(R')<T-1 and 60cBT(R')e

B~}=R and hence gen. PTw=Prw.

(ii) Let 13oc~3T, B0¢~7- , so l(R0)>l. If d(R')_<T-1 and /3ocBT(R )~3T then there

exists F' such that R'=F'R o and hence I(R')>I, so t3T(R')~B ~. This implies

that gen. on B o I,(w)=O, as according to corollary 3-26 gen. on B 0 ~ * ~ * T {R~L(w)cL(w)} ~ {~oCI3T(R)~ST}. So gen. on 130 R(w)=0 and PTW=~ . •

P r o o f o f lemma 3 - 3 5

First suppose t h a t gER T+I has a realization of dimension n, i.e., there exist

g~R z+ and (A,B,C)~Rn×'~xRn×IxR l×n such that gC[[o,rl=g and S(ge)=S(A,B,C,go).

Consider the input u defined by u(0):=l and u(t):=0 for t~e0. The corresponding

output y in S(g e) is given by y(t)=0 for t<0 and y(t)=g t for t>0. As

(u,y)eS(A,B,C,go) it is easily seen that gt=CAt-lB, te[1,T]. Define ReR[s] as

the characteristic polynomial of A. Then [l(R)=n, R(A)=O, and hence ~ICBT(R ).

Next suppose that ~BT(R ) with d(R)=n. Let R=k~=orkSk , then without loss

Chapter III 249

of generality we may assume that rn=l. Define (A,B,C)c•n*n×Rnxlx• l×n by

B:=(gl,...,On) T, C:=(1,O,...,O) and A:=[°zn-1], where OeR n-l, I,,_, is the

identity matrix in R (n-1)×(n-1) , and L - )p:=(_r0,_rl,...,_rn_l). A direct

calculation gives that gt=CAt-lB, te[1,T]. Define 9eeRZ+ by 9eo:=g o and

g~:=CAt-IB for teN. It is easily seen that S(ge)=S(A,B,C,9o). tlence 9 has a

realization of dimension n. •

CHAPTER IV


In this proof and the next one we use the following result.

Lemma 3-2 If H=Bnl¢2, BeB, and B has a minimal realization Bs(A,B,C,D):=

m+n+q (v,x,w)~B,(A,B,C,O)nlz }.

P r o o f o f l emma 3 - 2

m + n + q If (v,x,w)eB~(A,B,C,D)AI 2 , then wcl3olq=H. On the othcr hand, if well and

~ i r n + n (v,x,w)cBs(A,B,C,D), then it suffices to show that ~v,x)etz . According to

proposition II.3-22 (A,B,C,D) is perfectly observable. From this and the

linearity and shift invariance of B it follows that there exists a linear map

L:(Rq)n->R n such that for (v,x,w)eB~ there holds x(t)=L(w][t,t+n_l]), t~Z.

Hence xel~ i f wel q. Moreover, Dv=w-Cx and D is injective, see proposition

II.3-22, hence vel 2. •

Next we prove the proposition. The result for Bs 2 is contained in the 2 lenm~a. To show that He~ 2 has a Bi/s/o realization, let H=Bnl~, where Be~3

has a minimal input/s tate/output realization Bq~/o , i.e., l lB={(u,y)e

(RmxRq-m)z; 3x~(Rn) z such that [~:1= I~ ~1 I:] }' cf. corollary II.3-23. Then

L'ir221slo:=Vilslont 2 r 2 .m+n+(q-m) ~'" an 12-input/state/output realization of H. Indeed, .m+n+(q-m) [:]

if (u,x,y)~13i/s/on t 2 , then // ~13nl~=tI, while for w~H there holds,

m+n+(q-m) B s and Bi/~/o follow~'J by (u,x,y)~13i/s/onl 2 . The results for R 2 R 2

considering RH, where R denotes the time reverse operator. •

Chapter IV 251

P r o o f o f p r opos i t i on 3-3

First we state two results which will be useful in the sequel. Then we use

these results to prove the proposition. Finally we prove the lemmas.

~'IXn Lemma 3 -3 -1 If A~R , then {x~l'~; ax=Ax)={O}.

Lemma 3 - 3 - 2 Let HeB2 and let B* be the closure of H in (Rq) z with

respect to the topology of pointwise convergence. If H has a realization

/3](A,B,C,D) with (A,B) controllable, then Bs(A,B,C,D) is a realization of

/3".

To prove the proposition we now first show that in a realization

B~(A,B,C,D) of HoB 2 for which n is minimal (for fixed m) the pair (A,B) is

controllable. Let R:=im[B AB ... An-IB]cR n and &=dim(R). As ARcR and im(B)cR it

follows that there is a choice of basis in R n such that in this basis

A: I A'A3] and B=rB1]. So in a corresponding partition [x:] of x in this basis L 0 A2J L 0 J ~

there holds for (v,x,w)~B s that ax2=A2x2. From lena 3-3-i we conclude x2=O,

hence 13~(A1,B1,C,D ) also is a realization of It. As n is minimal it follows

that dim(R)=d=n.

Hence for any m there exist controllable realizations of H with n

minimal, for given number m of driving variables. According to lemma 3-3-2

these induce realizations of B*. According to proposition II.3-22 B* has a

minimal realization, which induces a realization of H, see lenuna 3-2 in the

proof of proposition 3-2. So a minimal realization of H exists, and

B~(A,B,C,D) is a minimal realization of H if and only if Bs(A,B,C,D ) is a

minimal realization of B*.

Now if a realization /32(A,B,C,D) is minimal for H, then (A,B) is

controllable, and Bs(A,B,C,D ) is a minimal realization of /3", hence (A,B,C,D) is perfectly observable and D ~njective. On the other hand, if in a

realization B~(A,B,C,D) (A,B,C,D) is perfectly observable, D injective and

(A,B) controllable, then it easily follows that (A B) is surjective, hence

Bs(A,B,C,D ) is a minimal realization of /3*. This implies that /3](A,B,C,D) is a

minimal realization of H.

252 Appendix

Finally we prove lemma 3-3-1 and lemma 3-3-2.

P r o o f o f lemma 3 - 3 - 1

For A--0 the result is trivial. Hence assume A~0. Let p(s):=det(sI-A) be the

characteristic polynomial of A. As p(A)=O it follows that for solutions of

ax=Ax there holds p(a)xi=O , i=l,...,n. To prove the lemma it suffices to show

that with V:={weCZ; p(a)w=O) there holds {weV, t~y_lw(t)12< co}=~ {w=0}. M m i M

Let p(s)=s k. Hi=l(S-Ai) , IAtl>_...>_lA, l>0, ke:Y+, Ai~A j for i#j, Ei=xmi= n-k. Then dim(V)=n-k. Moreover one easily shows by induction on j that

the trajectories wij(t):=tJA~, j=O,...,mi-1 , /=I,...,M, give n-k independent

solutions in V. Hence weV if and only if there exist ~ijEC: j=l,...,mi, ! m i " t such that w(g)=~=ISj=lOQjtJ-1Ai . NOW consider such i=l,...,M, a with

t~zlw(t)12< ~ . If w#0 then let i+:=max{i; 3j such that c¢o-#0 } and i_:=min{i; 3j

such that cqj#0}. Taking t -> +co it can be shown that IAi_l<l and taking t -> -oo

it can be shown that [AI+I-~<I, hence I Ai-l<[Ai+l while i+>_i_. This

contradicts the ordering of the A's. Hence w=O. •

P r o o f o f lemma 3 - 3 - 2

If in Bs:=Bs(A,B,C,D ) the pair (A,B) is controllable, then for any to,tlJ 2 2

with t0<t I there holds Bs [[to,tl]=B2s[[to,tl], where Bs:=Bs(A,B,C,D ). Indeed, let (v,x,w)eBs and define (v',x',w') as follows. Let

(v',x',w')(t):=O for teT/\[to-n,tl+n], choose v' on [to-n,to-I ] such that

[B...An-IB] • col(v'(to-1),...,v'(to-n))=x(to) and on [tl+l,tl+n ] such

that [B...An-IB] • col(v'(tl+n),...v'(tl+l))=-AnX(tl+l), while v'l[to,tl]:=

v[[to,tl]. Compute (x',w')]to_n, tl+n ] according to = D "

follows that (v',x',w')eBs, and as it has compact support it is in B2s, while

(V ' ,X ' , 'W' ) l [ t O , t l ] = ( V , X , W ) [[ to , t l ].

Now let H have a realization B~ with (A,B) controllable, then we have to

show that B*=B':={w; 3(v,x) such that (v,x,w)eB~}. First let weB', (v,x,w)eBs. We conclude from the foregoing that for all

-co<t0Xtl<+cc so especially wl[to, tlleH][to, tllC ~*l[to, tl]. Completeness of B* implies that weB*, so B'cB*. That B*cB' is seen

as follows. According to theorem II.3-21 B'eB. Moreover, as B~cBs, it follows

Chapter IV 253

that ~3'oH. As /3" is the closure of H with respect to the topology of pointwise

convergence it follows from proposition II .3-3 that B'c/3 ' . •

This concludes the proof of proposition 3-3. •

P r o o f o f c o r o l l a r y 3 - 4

(i) Let Hc~32 have a minimal realization B2s(A,B,C,D), so according to

proposit ion 3-3 the pair (A,B) is controllable. Then Bs(A,B,C,D ) is a

minimal realization of B* as defined in lemma 3 -3 -2 in the proof of

proposit ion 3-3. That B* is controllable follows along the lines of the first

par t of the proof of lemma 3-3-2. Finally H=13*nlqz~ as for H=Bnlq2, B~B, there

holds HoB*c~3, so HcB*nlqgc/3nl~=H.

(ii) This follows along the lines of the first par t of the proof of lemma

3-3-2 , using the fact that for minimal realizations Bs(A,B,C,D) of BeB c

there holds that (A,B) is controllable, see Willems [74, section 4.8.2].

(iii) In the proof of proposition 3-3 we derived that B2s(A,B,C,D) is a minimal

realization of H if and only if Bs(A,B,C,D) is a minimal realization of

B* as defined in lemma 3-3-2. Hence the result follows from proposition

II.3-25. •

P r o o f o f p r o p o s i t i o n 3 - 5

Let ,.,ilslo:=ltu,x,y)e~2 ; y = 5 } be a minimal input /s ta te /

output realization of H~B2 such that //H={ ; 3x~l 2 such that (u,x,y)~Bi/s/o}.

It follows from proposition 3-3 that then (A,B) is controllable and that (A,C)

is observable, i.e., co1(C,~:,4,...,C.4 n-l) is injective. The last s tatement

follows from proposition 3-3 and the definition of perfect observabil i ty in

section II.3.3, which implies that {(u,Y) l[0,n_ll=0} ~ {x(0)=0}. 2

In Bi/s/o there exists a linear map L:u-~ x, as for u=O it follows from

lemma 3-3-1 in the proof of proposition 3-3 that x=O. As y=Cx+Du there exists

a linear map F:u--> y in H. So the remaining questions are in which case ud2

can be chosen arbi t rar i ly and in which cases F is causal or anticausal.

First we prove the implication ( ~ ) for (i), (ii) and (iii). Note that

2 5 4 A p p e n d i x

A+ o o ]

there is a choice of basis for ~n such that in this basis ~,=|0 A_ 0 |w i th L o o ~oJ

o-(A+)cC÷, a(A_)cC_, a(Ao)cCo, e .g , for .4 in real Jordan form. Let x= x_ , k ~ o J

B = B_ and C=(C+ C_ Co) be corresponding partitions. 0 If a(A)n%=O, then for uel~ let x÷,x_ be defined by x+(t):=

co k - I ¢o -1 1¢+1 t I:k= 1 A+ B+u(t-k) and x_(t):=-Sk=o(A_ ) B_u( +k), and let y:=C+x++C_x_+Du. It . ° [x÷l

e a s i l y follows that (x+,x_)el 2 and that : , _ / = / 0 A_ "- so "+]L,_j

13ilsl o. Hence u~l 2 is free. Moreover, if a(A)cC+ then the map L:u-> x

where x=x+ is causal, hence F:u-> y is causal. Similarly, if a(A)c£_ then

L:u --> x is anticausal, hence F:u -~ y also.

Next we prove (4) (~) . Suppose ,~eo{A) with IAl=l, say .Tlv=)~v where Ilvlt=l.

I t suffices to show that there is an uel'2 ~ such that there exists no xEl~ with

ax=~4x+Bu. Then there is no yel2 q-m with (u,y)dIH, as due to observabil i ty

{(u,y)e:~+(q-~)}~ {xe~. We then conclude that u is not free in l~ and hence

that there is no F:l~ --, l q-m with IIH=gr(F).

To construct u such that {x~l~; crx=.7tx+Bu}=O~ let al,a2e(Rm) n be such that

[An-lB...B]al=Re(v) and [A'~-IB...B]%=Im(v). This is possible as (A,B) is

controllable. Define ui~lr~ by uil[_n,_fl:=a i and ui(t):=O for tct[-n,-1], i=1,2.

Define Xi:={xe(Rn)z; ax=~4x+Bui} , i=1,2, and Xo:={x~(~'~)z; ax=Ax}. Then

Xi=xi+Xo, where x i is defined as that element xieX i for which xil(_oo,_nl=O,

i=1,2. It is easily seen that xl(t)=Re(Atv) and x2(t)=Im(Atv) for t>0. As

IAI=I it follows that at least one of the series YY I[xl(t)ll 2 and ~ [Ix2(t)l] 2 t > 0 t > 0

diverges, say t~ollXl(t)ll2=~. We now prove that then Xlnl'~=o. Let xeX1, i.e.,

x=Yc+x 1 with a~:=,4~. In an appropr ia te basis .7t=diag(A+o,A+l,A_,Ao) with

o(A+o)=(0}, a(A+l)cC+\(O}, ~r(A_)cC_, and a(A0)cC 0. Let ~=co1(~+o,$+1,~_,~0) be

a corresponding part i t ion of ~. I t easily follows that Yc+0=0. Further, in

order that E IIx(t)ll2<c¢ it is clearly necessary that ~:_=0, while E Ilx(t)ll2<~ t > 0 t_<O

implies that ~+a=0 and ~0=0 (note that x~(t)=0 for t<_-n), tlence if xJ~ , then

~=0. However, xx¢l'~, which proves the desired result

Finally we prove (ii) (~ ) and (iii) (~) . Suppose a(A}nC_#O, then we show

that there is (u ,y)e/ /H with ul(_~o,01 =0 but y](_=,o]#0. Note that

controllabil i ty implies that (A_,B_) is controllable and tha t perfect

observabil i ty implies that C_¢0. Let aeRa and actker(C_), and let b be such n - I n+ l m

that [A_ B .... B_]b=A_ a. Define uel 2 by ul[~,nl=b and u(t)=0 for t~[1,n].

Chapter IV" 255

-1 k+l Let y(t):=Du(t)-C_~=o(A_ ) B_u(t+k). Then one easily verifies that (u,y)~IIH and y(0)#0. This proves (ii) (~-). The proof of (iii) (=~) is completely

analogous. •


We only prove (iii)+, as (iii)_ follows in an analogous way and (i)±, (ii)± are trivial. For (iii)+ we need to prove that @oH can be obtained as /z-limit

of t rajectories in H with left compact support, i.e., for any ~>0 there should

be a kc:~ and ~c(a*)kH+ with [[@_~[[2<~.

Let B~/8/o (A,B,C,D) be a minimal input /s ta te /output realization of H and

let :=//~, ('U,,X,y)~i/s/o. Define a linear map L:Rn* (R") n as follows. Let

{el,...,en} be a basis of N~ and let aie(R") n be such that [An-lB...B]ai=ei, i=l,...,n. Such a i exist as (A,B) is controllable. Define Lei:=ai and for xeR n,

X=~?=lXiei, let Lx:=~?=lXia i. Let 3I:=[]LT(I+FTF)LI[ 2 with FER n(q-m)xnm defined by

F : = n-2

CA B " C B

Let T~Z be such that tZ<T[[@(t)[[2<~e and [[Yc(T)I[2<e/(4M). Define (g,~,~)e 2

~ i / s / o b y (g,X,y)[(_m,T_n_l]:=0, g[[T_n,T_aI:=LYc(T), and U[[T,00):=~.[[T,00). Then

clearly ~ ( T ) = ~ ( T ) a n d hence (~,~,~)[[T,~)=(~,~,~)[[T,®). Define ~ : = H I ~ , then

We(a*)T-nn+ and liW--~ll2=t~<Tl[~(t)-~J(t)[12<_ 2(t~<TI]~z(t)l[2+t~<Tl[~)(t)[[2)<l~+ 2([[LYc(T)[12+[[FLYc(T)[[2)<_~+2M[[Yc(T)][2<e, as desired. •


m q n I°:l lI:l 2 Consider H :={(v,w)el2xl2; 3xel 2 such that = }. Then B s is a minimal

input / s ta te /output realization of H'. If a(A)nCo=O then according to

proposition 3-5(i) there exists L:v-->w such that H'=gr(L), hence H=im(L), which proves (i). The results in (ii) follow from proposition 3-5(ii) and

(iii). Finally, (iii) follows from corol lary 3-4(iii) and the fact that (A,B) is controllable, see proposition 3-3. Indeed, it is a well-known result from

linear control theory that in this case det(sI-(A+BF)) may be any monic (real)

polynomial of degree n, by appropr ia te choice of F. See e.g. Kailath [33,

256 Appendix

section 7.1]. •


In the proof of (ii) we use some results on the solutions of the discrete time

Lyapunov equation. For similar and more general results for the continuous

time case we refer to Glover [17, theorem 3.3], and Kailath [33, section

2.6.2]. We will first s ta te and prove a lemma which subsequently is used to

prove the proposition.

Lerama 4 - 6 Let (A,C)eRn×nxR q×n be observable, i.e., col(C,...,CA n-l) is

injective. Consider the discrete time Lyapunov equation ATKA+CTC=K. (i) I f there exists a nonsingular solution K, then a(A)nCo=O. (if) If a(A)cC+, then there exists a unique solution K and moreover K>0;

if a(A)cC_, then there exists a unique solution K and moreover K<0.


(i) I f ATKA+CTC=K, then by repeatedly applying this identity we conclude that

K=(AT)nKAn+~-~(AT)t CrCA t, so K-(AT)nKAn>O as (A,C) is Observable. Now

suppose a(A)nC0%¢~, Ax=Ax with [A[=I and x=xl+ix2¢O , Xl,X2~R n. Then

x*(K-(AT)nKAn)x= (1-1A[2n)x*Kx=O, hence (K-(AT)nKAn)xi=O, i=1,2, which is in

contradict ion with K_(AT)nKAn>O. (fi) Let a(A)cC+ and suppose ATKiA+cTc=Ki, i=1,2. Then with Ko:=K1-K 2 it

follows that Ko=ATKoA=(AT)T~KoAn --> 0 for n -> oo, hence K0=0. Moreover, K:= Z~=o(AT)tcTcAt>O clearly is a solution. Analogously, if a(A)cC_ then K:=

-St=o[(A-1)T] t÷l cTc(A-1)t+I<O clearly is a solution, and it is the only one,

as for ATKiA+cTc=Ki, i=1,2, it follows that for Ko:=K1-K 2 Ko=(A-1)TIfoA-I=[(A-1)T]nKo(A-1)n-> 0 for n -> c¢, hence K0=0. •

(i) We now first prove proposition 4-6(i). As in the proof of proposition

II .3-22 we use for ze(Rd) z the notat ion z--:=zl(_~_l] , z+:=z][0,®). To

prove (~ ) , let be ( °~ m ' 2 0Iq

(v,x,w)~B2s there holds [laxl[K+l[w[[ =I[X[II~+I[v[[, hence IIw[12=l[vl[ 2. To prove (=~),

let Xo~R n and 13~(Xo):={(v,x,w)~B~; x(0)=x0}. Controllability implies that there

Chapter IV 257

is (v°,x°,w°)eB](xo) with (v°,x°,w°)l[n,~)=O. Analogous to part of the proof

of proposition 4-1 it is easily seen that we can take v°l[o,n_ll=LXo for a

linear map L:R n -~ (Rm) n, hence w°l[0,n_xl=Lx0 for a linear map L:R n -~ (Rq) n.

Define K:=Lr[,-LTL. Because Ilw°ll2=llv°lS 2 we conclude that ]](v°)--H2-H(w°)--I]2= n - 1 0 Et=o{llw (t)llu-IIv°(t)ll2}=xTKxo. As xocR n is arbitrary due to controllability,

if follows that K=K T is uniquely defined by By. Moreover, for any 2 (v,x,w)~B~(xo) there exists (v°,x°,w°)eB] with (v,x,w)--=(v°,x°,w°) -- and

(v°,x°,w °) I[n,00=O, due to controllability. Hence IIv--112-11w--ll2=xToKxo . 2 K is nonsingular, as for Kx0=O it follows that for (v,x,w)eBs(xo) with

v+=O there holds IIv--ll=Hw--II, hence w+=O and x0=O due to perfect

observability. We finally show K0 /,'0 ) Pontryagin

isometry. Let (a,b)~NnxN m and Ic ] : - - . D u e to controllability there 2 exists a (v,x,w)~B~ with (x(O),v(O),x(1),w(O))=(a,b,c,d). As (av,~rx,aw)eB~(c)

we conclude from the foregoing that Ilcll~=ll(~v)--112-11(a~ol--112=llv--112-11w--112+ IIbII2-11dlI2=llall~+llbll2-11dll ~, hence Ilcll~+lldll2=llall~+llbll 2 which is the desired

result.

(ii-1) Suppose (i) is satisfied, then K satisfies ATKA+CTC=K. Indeed, let x o 2 ~ x ( i ) ] = be arbitrary, then take (v,x,w)eBs(Xo) with v(0)=0, so (.w(o)j

[A B| [:0], and [A~] being an isometry implies IIAxoll~+llCxoll2=llXo]l~,., i.e.,

xTo(ATKA+cTc-K)xo=O. Now perfect observability of (A,B,C,D) implies

observability of (A,C), which is seen by taking (v,x,w)et3~ with

(v,w)l[0,n_l]=0. As K is nonsingular, it follows from lemma 4-6(i) that

a(A)nCo=¢) and from proposition 3-5(i) that L(A,B,C,D) exists. That it is an

isometry follows from (i). This proves (ii-1).

(ii-2) To prove (ii-2), note that in(K) does not depend upon a choice of A+ 0 coordinates in g~,. Choose these in such a way that A=r 1 with

r -, Lo a_J a(A+)cC+ and a(A_)cC_. Let C=(C+,C_) and K=| K+ K+_| be correspoding

1 T T K,-TJ T

partitions of C and K. As A KA+C C=K it follows that A+K+A++C+C+=K+ and

ATK_A_+CT_C_=K_. Moreover, (A+,C+) and (A_,C_) are both observable. From lemma

4-6(ii) we conclude that K+>0 and K_<0. From this we easily get (ii-2).

(ii-3) This result is an immediate consequence of (ii-2) and proposition

3-5(ii) and (iii).•

25S Append ix

P r o o f of p ropos i t ion 4 - 7

M'= [A+BF Be] ~R(,+q)×(,+m) SeR,×, invertible and h:=hjTeR "×" invertible. It Let " I C+DF DRJ . s o s-~n . rio Ro

is easily shown that I IM I l ,s a ( I l ' { l ) Pontryagin isometry k 0 IqA k 0 Ira) t. 0 Imj t. 0 Iq9 f __ "~ f L _ %

• . . . if i and only I f M is a ( L J U q J ° I m ' o ) Pontryagm lsometry, where K : = S r k S . IIence

it suffices to prove the proposition for the case S=I. f %

Now hi is a ( ~ ,,,~oz ' ~ ~°zq

(a,b)eNnx, m and / : / :=MIb/ there holds ~ J ~ J

0 = . ( c l T ~ K o ~alTCXIIXI2 l kb3 kx2, x22J ' LbJ ko

where a direct calculation shows that Xn=(A+BF)TK(A+BF)+(C+DF)T(C+DF)~ X22=RT(BTKB+DTD)R, and Xel=xT2=RT[BTK(A+BF)+ DT(C+DF)]. Hence M is such an

isometry if and only if Xll---K, X22=Im and X12=0. Now X22=I,n is equivalent with

(R) as R is invertible, and then X12=0 is equivalent with (F) as R is

invertible and BTKB+DTD also, according to (R). Given (F) and (R), X11=K is

equivalent with (ARE) by using the expression (F) for F. •


According to proposition 3-3 (A,B,C,D) is perfectly observable and (A,B)

controllable, hence stabilizable, see Kailath [33, section 7.1]. The result

follows immediatcly from Payne and Silverman [57, theorem 2.1 and lemma

, t .5} . •

P r o o f o f theorem 4-9

According to propositions 4-6 and 4-7 L:=L(A+,B+,C+,D+) is a causal, time

invariant isometry with im(L)=H. According to corollary 4-2 it suffices to

show that H+=LI2(Z+,~m), as L+ is unique (up to isomorphisms of ~"). As L is m 2 causal LI2(~_+,R )oH+. It remains to show that for (v,x,w)eB~(A+,B+,C+,D+) with

w--=O there holds v--=O. From corollary 3-4(iii) and proposition 3-3 it

follows that (A+,B+,C+,D+) is perfectly observable and that D+ is injective.

[cA+B+]I; ] it follws by induction that (v,x)(t)=O for t=-n+l , . . . , -1 , hence + D+J

Chapter IV 259

v = 0 . •

P r o o f o f p ropos i t i on 4-11

Let [.+ be the forward scattering representation of H and let L:=I~L+~. Then L

clearly is isometric and time invariant and im(L)=~im(L+)=~t=H. According to

corollary 4-2 it now suffices to show that LI2(7/_,Rm)=H_ in order to conclude

that L=L_. Define 12(7]++,Rm):={ve12(71,Rm); v(t)=O for t<O}=a*t2(7/ +,R"), then L12(7/_,Rm)=RL+I2(7]++, R m) = RL+a*I2(7/+, R m) = Rcr*L+I2(Z+, R m) = ~a*H+

=aR~I+= a{weH; w(t)=O for t>O)={weH; w(t)=O for t>O}=II_. •

P r o o f o f c o r o l l a r y 5 - 3

It suffices to prove that P=L+L+=L_L*. We will prove P=L+L+, as the other

result follows analogously. P:/q-> I q is uniquely determined by the conditions

(i) P*=P; (ii) p2=p; (iii) im(P)=H, see e.g. Akhiezer and Glazman [2, section

31]. Now L+L: clearly satisfies (i), while (ii) follows from the fact that L+

is isometric, so L~L+=I. As im(L+)=H it only remains to show that Hcim(L+L*+). Let w~H, then w=L+L+w which is seen as follows. Let v be such that L+v=w, then

for any v' <w,L+v'>=<L+v,L+v'>=<v,v'> as L+ is isometric, so by definition

L+w=v and L+L+w=L+v=w. Hence Hclm(L+L+). •

P r o o f o f t heo rem 5-4

In this proof we use some standard results on the discrete Fourier transform

and its inverse, the Z-transform. We will state these results in a lemma, give

an outline of the proof of the lemma and finally prove the theorem by means of

this lemma.

Notat ion. Let C0:={zeC; ]z l=l } and for f:Co --> C d let llf]12:=

(27r)-lf[f(eie)]*f(eie)dO, where • denotes complex conjugate transpose. We - T r

define Ilfll%0} and on

With this inner product L d is a Hilbert space. Let

ld(c):={xe(Cd)Y-;t~z[x(t)]*x(t)<c¢ }. We define the discrete Fouriertransform

260 Appendix

F:L~ -->/~(C) by (Ff)(k):=f(k):= (2~r) -I j'~ e~kef(e~e)dO and the Z - t r a n s f o r m iO oo - i k O - Tr

Z:ld(C)->L~ by (Zx)(e):=~k=-co e x(k) These t ransformat ions are

wel l -def ined in a l imi t - in -mean sense. []

Lemma 5 - 4 (i) F is un i ta ry f rom L d to I~(C), i.e., it is isometric and

surjective; Z is the inverse of F.

(ii) Let Hal3 2 have minimal real izat ion B~(A,B,C,D) with a(A)nCo=O and

let L:=L(A,B,C,D) be the corresponding driving opera tor , then Z(Lv)= G.Z(v) with G(z):=D+C(zI-A)-IB, z~C0; if L is isometric then Z(v)=

G*" Z(Lv) where G*(z):=[G(z)]*.

(iii) For L as in (ii) IILIh:=sup{llLvll; Ilvll--1}--llGllco.

P r o o f o f l emma 5 - 4 (outline)

The result in (i) is the well-known Fourier-Plancherel theorem, cf. Kato [40,

section V.2.2].

To prove (ii), let a basis in R n be chosen such t ha t A= IA+ °I with L° J A~

< , a(A+)cC+, a(A_)cC_, and let B= B_ Then for vel'~ LV=LlV+L2v+Dv whereV 2 (Llv)(t):=C+S~=lAk+_lB+v(t_k) and (L2v)(t):=

m -1 k + l -C_Sk=o(A_ ) B v(t+k), cf. the p roof o f proposi t ion 3-5. By the definition , co - k k - 1 co k - 1 k + l

of Z - t r a n s f o r m we obta in Z(Lv)=[C+(~k=xz A+ )B+-C_(Ek=oZ (A_ ) )B_+D]Z(v) and as a(A+) and a(A -1) are contained in C+ there holds for zeC0 that S c o - k , k - 1 ~ c o k . _ - l , k + l - - - 1 , T - - - 1 , - I k=xz .% = z-I(I-z-IA+)-I=(zI-A+) -1 and Z~k=oZ (A_ ) =a_ (l--Z:a_ ) =

-(zI-A_) -1. From this we obta in G. If L is in addit ion isometric, then for

vel 2 with Lv=w there holds for a rb i t r a ry v' with Lv'=:w' tha t <G Zw-Zv,Zv >=

<Zw,GZv'>-<Zv,Zv'>=<Zw,Zw'>-<Zv,Zv'>=<w,w'>-<v,v'>=<v,L*Lv'>-<v,v'>=O as

L*L=I. tlence G*Zw= Zv as desired.

To prove (iii) note tha t for [Iv[I=1 ]lLvll--llGZ(v)ll<_llGl[~, hence it suffices

to p rove tha t fo r every e>0 there is a vcl~ with Ilvil=l and [ILvH>llGll~-e. As IIG(z)ll is continuous on C O which is compact it follows that there is a

80c[-rc,rr ] such that [[G(ei°°)]l=sup{][G(z)[[; zeCo}=:lIG[]oo. Let aeC m with IIall=l be

such tha t ]]G(eW°)all=llGI]co. Continuity of G(z) implies tha t there exists a 6>0

such that fo r I8-001<6 IIG(ei°)alI>lIG}lco-e. Then for 18+0a1<6 there holds

I[G(eie)all=llG(e-ie)all>llG[Ico-e. Now define uel'~ as follows. If 80=0 , then take

aeR m (this is possible), and let u(eiO):=(rr/a)l/2"a'I(_a,a)(O), where Iv(O)=I

Chapter IV 261

for OeV and 0 for O¢V. If 00~0, then take 6<1001 and let u(ci°):=(lr/26) v2.

{a.I(oo_8,Oo+a)(O)+~.I(_oo_6,_eo+~)(O)}. Let v:=Fu. Then []vl[=JluIl=l and vcl~,

i.e., v is real, as -~)=(2rc)-lf're-ikOu-((eiO)do=(2rc)-lf're-ikOu(e-iO)dO=v(k), - ~ - f t

keZ. Now IlLvlh=llaull which is easily seen to be larger than Ilall~-s. •

Theorem 5-4 is easily proved by means of this lemma. Let L:=L+L*+-L+L~* and

G:=G+G+-G+G+ , then for w~l q there holds Z(Lw)=Z(L+L~w)-Z(L~L~*w)= G+Z(L+w)-G+Z(L+ w)=G+G+Zw-G+G+ Zw=GZw, and hence ]]Lil2=sup{i]GZwil; ]]w]i=l}=liGi] ~ which follows from the proof of part (iii) of the lemma. The

result for G_ follows in an analogous way. •

P r o o f o f p ropos i t i on 5 - 5

We first state a preliminary result. Let L1,L2cR n be linear subspaces with

dim(L1)>dim(L2) , then LlnL~2~e{0}, which easily follows from dim(Ll)+dim(L~)=

dim( L1)+n-dim( L2 )>n. Now let (ra,n,m',n'):=(ra(H),n(H),m(H'),n(H')) and suppose m>ra'. It suffices

to prove that Hn(H')-L~{0}, as from O~weHn(H') ± we conclude g(H,H')=

HP-P']]>_ j~wlj]lPw-P'wl]=l, while g(H,H')<I always holds true.

According to corollary 3-4 and proposition 2-2 there exist controllable

systems B,B°e~ such that H=Bnl~ and H'=B'nl q, where (m(B),n(B),m(B'),n(B'))= (m,n,m',n'). For TeT]+ let B(T):={w~B; w(t)=O for t<0 and for t>_T}cH. We will

show that there is a T such that there exists O~wEB(T) with

W[[o,T_ll_I_I3'I[o,T_I]. As B'[[o,T_I]=II'I[o,T_I] this implies that O¢wcHn(tt') "l-, as desired.

Now take T such that (T-2n)m>n'+Tm',i.e., T>(n'+2nm)/(m-m'). As 6 is

controllable we conclude that in B(T) the m inputs can be chosen freely on

[n,T-n-1], hence dim(B(T)][o,r_q)>(T-2n)m. From a minimal input/state/output

realization of B' it is evident that dim(B'l[O,T_l])<_n'+Tm'. From the result

stated in the first lines of this proof it follows that there exists

O~:X~[B(T) I[o,T_q] n [13'I[o,T_l]] ±, Define w~13(r) by Wl[o,T_z]:=x and w(t):=O for t~[0,T-1]. Then O~weHn(H') ±. •

2 1 3 2 A p p e n d i x


From proposi t ions 4 - 6 and 4 -7 charac te r iz ing isometries we conclude tha t 2 2 2 2 in IIx+(t+l)llK++llw(t)ll =I}x+(t)IIK++IIv+(t)I I . As x+e z this implies Ilx+(O)ll~+=

iiv~-112_llw--112= + 2 + 2 Ilw II-IIv+ll • From these proposi t ions and the fac t tha t /~B;= 2 B~(A_,B_,C_,D_) we conclude tha t IIx_(t-1)[l~++llw(t)ll2=llx_(t)ll~.++l[v_(~)ll =, so

IIx_(O)ll~+=llv+_+ ll2-1lw++ ll2. Next we p rove the express ions fo r Ilxoll++ and Ilxoll--- We use the facts

co k - 1 0 co k - I t ha t in B+s x+(O)=Sk=lA + B+v+(-k) and in B~ x_( )=Sk=IA_ B_v_(-k), t ha t L+

and L are isometries, t ha t ax+=x_ and all=H, to ob ta in that

Ilxoll~+ = inf{ Itv;-ll2; (v+,x+,~o)~B~ +, x+(0)=Xo} = inf{ IIv+ll2; (v+,x+,zo)et3+~,

x+(0)=xo} = inf{ Ilwll2; (v+,x+,zo)ez~ +, x+(O)=xo} = inf{ II~oll~; (v_,x_,v))~t~;, x_(-1)=xo} = inf{ 11~112; (v_,x_,w)et3;, x_(0)=xo} = inf{ IIv_l12; (v_,x_,w)ct~, x_(0)=xo} = inf{ IIv+_+lle; (v_,x_,~o)Et3 L x_(O)=xo}=llxoll2__ = inf{ Ilxoll_2+ll~o++llZ; ( v _ , x _ , ~ o ) ~ , x _ ( O ) = x o } = Ilxoll_ 2 + i n f { IIw÷l12; (v_,x_,w)~t~-~, x _ ( - 1 ) = X o } =

IIx0112_+inf{llvJ+ll2; (v+,x+,w)e/3+~, x÷(0)=xo} 2 2 . + 2 = Ilxoll-+llxoll++mf{ IIv+ll ; (v+,x+,~o)elS~, 2 2 2 r -1 x+(O)=xo}=l[Xol[++HXo[l_. I t remains to p rove tha t [IXoll++=xoQ+ x o.

7n This is a wel l -known result f rom l inear quadra t ic control . For v¢I 2 define

- - ~ k - 1 - - Tv :=~=~A+ B+v(-k), then for (v+,x+,w)~B+~ there holds Tv+ =x+(O). Let T * R n 2 . - - 2 : --> 12(N,R m) denote the adjoint o p e r a t o r of T, then IIx011++--lnf{llv + II;

Tv-+-=xo}=llT.(TT.)-lxoil2= T • -~ xo(TT ) xu, and a direct ca lcula t ion shows that

TT*=Q+. •

P r o o f o f l e m m a 5-8

Let B2s(A,B,C,D ) and RBs2(,4,B,C,D ) be minimal rea l iza t ions of HaB 2. A change of

cor rd ina tes x-~ x':=Sx leads to a t r ans fo rma t ion of p a r a m e t e r s (A,B,C,D)-~

(SAS-I,sB,CS-1,D) and (A,B,C,D)--> (S~tS-1,SB,CS-1,D). From (ARE) it follows ~ 1 / 2 ~ - 1 , , 1 / 2 r A2 , T

t ha t (K+,K+) -> ((S-I)TIi+S-I,(S-1)Tfi+S-1). Now let n + n + l~+=un ~ with U

o r thogona l and A=diag(A1,...,A,) , AI>_...>_An>0. A direct calcula t ion shows

t h a t S:=A-I/~uTK1/+ 2 gives the desired result . •


We will show tha t a(,4)cC+. According to p ropos t ion 3-5( i ) there then exists a

C h a p t e r I V 2 6 3

2 ^ ^ ^ ^ ^ map v -~ ~ with domain l~ for Bs(A,B,C,D). As D=D+ is injective we can conclude

that m(~I)=coo(ft)=m. As B](A,B,C,D) is a realization of H we conclude that

To show that a{A)cC+ we use the fact that with Db:=D + (Ab,Bb,Cb, Db) are

the parameters of the forward scattering representation of H with the

corresponding solution Ii b of (APE) given by A=diag(A1,...,An). Define

A:=diag(A1,...,A~) and A2:=diag(A~+1,...,An). From propositions 4-6 and 4-7 we

conclude that I=BTABb+DTbDb from (Rb) , that o=BTAAb+D~Cb from (Fb) , and hence

from (AREb) that A=A~AAb+CTcb.

Take partitions as in step 3 of the algorithm for balancing. Let c~a(A)

atld Ax=c~x for some 0¢:x~C . Let x:= I q ~C • A direct computation of x*(AREb)X • T ^ T ^ L ° J 2 • ^

shows that then ~ [A2~A2A2~+C C]~=(1-1al )~ A~. As A>0 it follows that Ic<l_<l.

It suffices to show that Ic~l~l. As A2>0, Ic~l=l would imply that A21~=0 and

hence that AbX=~X , which contradicts the fact that a(Ab)CC+, cf. proposition

4-6. •

CHAPTER V


The minimax p rope r ty is wel l -known and shown e.g. in Stewart [67, theorem 6.5

o f sect ion 6.6]. As the p rope r ty is o f crucial importance in the sequel we

give its p r o o f explicitly.

Let A have (SVD) A=UL-'V T. For xeR n2 and LcR n2 let y:=VTx and L':=VTL. Then

min{ max IIAxU. • . n2 2 2 vz n 2 2 z ~ i = l y i = l ~ dlm(L)>n2-k}=mm { max {Si_ 1 aiyi} ; dim(L')_>n z-

O~xeL IlXll ~ -- O;ayeL' - k}=:c¢ k. Taking L':=span{ek+x,...,en2 } shows tha t (xk<ak+ 1. On the o ther hand, if

dim(L')>_n2-k , then V:=L'nspan{e~,...ek+1}¢:{O}, hence V contains a y ' of norm 1.

This implies tha t (xk>ak+ 1.

Finally no te tha t c~ k is achieved for L':=span{%+l,..,en2}~ i.e., for

L =VL =span{vk+l,...,Vn2}=Ln2_ k. •


If nl=n2, A=diag(dl,...,dn2 ) with dl>...>dn2>O a n d L=• n2, then it easily

follows tha t eA(R n2) is wel l -def ined and equal to (dl,...,d,~2). To prove the

general case, let LcR n2 with dim (L)=d and let the columns of BcR '~2×a form an

o r thonormal basis of L. Further let BTArAB have (SVD) UZU T with Z=

diag(al,...,ad) , al>...>_aa>O. Then it follows tha t fo r O~xeL, say x=BUy, there

holds uAxll_ IIZV2YlI--. From this and the fac t tha t uTBTBU=Id it is easily seen II x I] II y II d

tha t eA(L)][1,d]=el/2(R ) which was a l ready seen to be well-defined• Hence v2 2/2 0 n~ ¢A(L) is also wel l -def ined and is equal to (al~...,~d~ , .... ,. •


If k<nz-r then eA(Lk)=0 and the results easily follow. We hence will assume

tha t k>n2-r. (i) As dHn(Ln(span{xl,...,Xj_l} ) )>_k-3+1 it follows f rom proposi t ion 2-8 that

fo r j=l , . . . ,d im(L) ei(L)>_an2_k+ j. For L~ it follows by induction tha t for

C h a p t e r V ' ) 6 5

j= l , . . . ,k ej(L~.)=a,2_k+ 2 (ii) First suppose that nl=n 2 and that A=diag(dl,...,dn2 ) with dl>_...>dn2_k>

dn2_k+l>_...>_dr>dr+ 1 . . . . . dn2=O. We then have to prove that eA(L)=(dn2_k+l, ...,dna,0,...,0 ) implies that L=L~, where L~=span{en2_k+l,...,e%}.

Let 0<61<...<6 ~ denote the distinct values in {dn2_k+l,...,dr} and let 6j have multiplicity mj, E~=lmj=max{O,r-n2+k ). Define Mj:=n2-r+ZJ=lmi, j e l l , s ] ,

and Mo:=na-r. We will show by induction that for j=O,...,s span{e~2_Mj+l,...,en2 } c L. For j=s this gives the desired result, as

n2-Ms+l=n2-k+l. First consider j=O. Proposition 2-8 implies that for

dim(L')>k+l el(L')>d . -k , hence dim(L)=k. As ei(L)=O for ie[r-n2+k+l,k ] there - - 2

is a subspace VocL with dim(V0) _> n2-r and Ax=O for xeV 0. lIence

Vo=span{er+l,...,en2 } which proves the result for j=O. Next suppose that

span{en2_Mj_l +i,...,cn 2}CL for some je[1,s]. As

eA(L ) s =(5:, ..,Sj, Sj_l,...,~j_l,...,~l,. ,~1,0, ,0), wimre ] [ ( E i = j + l m i ) + l , k ] -/ . . . . . .

0 appears nz-r and 8i appears m i times, i=l,. . . , j , it easily follows that

there is a subspace gjcL with dim(Vj)=mj, Vjxspan{e%_Mj_l+l,...,en2 } and I IAz l l .

0,~v~max ~llXll =o,.j Hence Vj=span{ena_Mj+l,...,en2_Mj_l } which concludes the

inductive proof.

We finally prove the result for general A. Let AeR "1×"2 have (SVI)) A=UEV T

and define D:=diag(cq,...,an2 ) where al>_...>_an2_k>an2_k+l>_...>_Crr>ar+ 1 . . . . . an2= 0. Suppose ¢A(L)=(crn2_k+l,...,a,,2,0,...,O), and let L':=vTL. With y:=VTx there holds IIAx II IIDyll IlXll -- I - ~ "° Using the or thogonal i ty of V we conclude that eA(L)=SD(L'). We have shown that hence L'=span{en2_k+l,...,e,=}, so L=VL'=L~. (iii) The proof of (~ ) is direct. To prove (=,), let ¢A(L)=(an2_k+l,...,

a t ,0 , . . . , 0 ) . Then dim(L)=k. Let the singular values of A satisfy

6rl>_...>_6rCl>O'Cl+l=...=tTn2~k~-~n2_k+l=...=~c2>Oc2+l~...>Un2~O. From the proof of

(ii) it easily follows that L':=span{v%+l,...,vn2}cL. Further there is a

subspace L"cL with L"±L', dim(L")=k-dim(L'), such that max ,,IIAzll O~ x e L IIXU ~" O'n2-k"

Hence L"c span{vq+l,...,v%}. •


It follows from definition 2-2 that eD(d,a) = ItSa[I and from definition 2-4

that tD(d,M)=es(M±), which is well-defined according to lemma 2-9. •

2 6 6 A p p e n d i x


The ordering of the misfits is lexicographical, cf. definition 2-5, and

according to proposition 2-11 eD(d,M)=es(M ±) with S the empirical covariance

matr ix corresponding to d. This enables us to use proposition 2-10.

(i) This is trivial from the definition of P~cto ~. (ii) Let L :=span{xl,...,xw}. As rank(S)=r dlm(L )=r<_cto b and eD(d,L*)=

es((L*)Z)=0, hence minimal. To show optimality of L* it remains to prove

that {dim(M)<r, eD(d,M)=O) ~ {M=L*}. If ¢D(d,M)=es(M-t')=O then ]]Sal]=O for all

aeM ±, so M&cker(S) and hence M~im(S)=L*. If in addition dim(M)<r=dim(L*) then

M=L*. * ± g~ , . •

(iii) First note that (MCtot) =Ln_ctot as defined m section 2.1.2. Let /lle~

with cD(M)<Ctol, ± " " " _ then dim(M )>n-cto I. According to proposition 2-10(i) D 2_ * D * • •

then e (d,31)=es(31)>es(Ln_ctot)=e (d,Mctot), while according to proposition

2-10(ii) aCtol>aetol+l lmphes that es(M )=¢s(Ln_ctol ) if and only if

1V±I =Z*n_Cto l ~ i.e., M=M:tol. (iv) As for cD(M)<_cta there holds dim(M±)>n-ctoz and ~rctol>__~rr>O we conclude

from proposition 2-10(i) and (iii) that M has minimal misfit if and only

if MX=L'+L '', where L':=span{ufi aj<aCtol } with dim(L')=n-Cl-dim(M(Crctol)) and

L"cM ( ff etot ) with dim( L")=n-Ctol-dim( L')=crCtol-dim( M ( actol) ). Then M=

(L')±n(L")±=M*~I+L, where L:=M(acto~)n(L")± has dimension dim(M(aCtot))- dim( L")=Ctol-C 1. []


D to l (i) This is evident as el(d,{O})=crx< Q .

(ii) Let L :=span{xl,...,XN} , then el(a,L )=0<e 1 . If cD(M)<r then dim(M±)_>

cl(d,bl)>_ar>e 1 . So the n-r+l and according to proposition 2-10(i) D tot

minimal achievable complexity is r. To show optimality of L* it remains to

prove that {c°(Pl)=r, eD(d,M)=O} =~ {M=L*}, for which we refer to the last part

of the proof of proposition 2-12(ii). "M *'-1- L* D • (iii) First note that ( k ) = n-k as defined in section 2.1.2, hence e (d,Mk)=

• D * to l ea(Ln_k)=(O'k+l,..,O'r,O,...,O), especially ei(d,31k)<<_ak+l<el for all

i~[1,n]• If cD(M)<k then dim(Ml)>n-k+l and according to proposition 2-10(i) D to l el(d,M)>ak> Q . Hence the minimal achievable complexity is k. As ak>ak+l we

C h a p t e r V 267

conclude from proposit ion 2-10(i) and (ii) that among models with cD(31)=k eD(d,M) is uniquely minimized by taking N:=31~. •


Let the data be generic, so Sxx a n d Syy are invertible. Let MeN and N~:=

{a2eRn2; 3aleR nl such that (ax,a2)eM-L}. Suppose that (al,az)~M 1- and

(a{,a2)~M3- , then (ax-a[,O)eM 1- and as the projection in M on the first n 1

coordinates is surjective it follows that al=a ~. Hence there exists a linear

map A: Rn2-> R 'q such that ML={(at,a2); a2eMl2, al=Aa2}. For a=(al,a2)eM3- let

V2 . P T Sxx Sxy IIQV2~II cc=S. .a2 , then it follows that e (d,a)={a [ ] .1/2.¢ T.~ 1/:~ at /ta2~yya2.~ = where _ . . . . r . . . . . . , / , . L S y x %) I, {2: = y y ( A b'xxA+ b'yy--}-A ,.~'xy+b'yxA)byy. A S Ot /O¢ l I a n d only i f a2_l_(2)a ~ i t f o l l o w s

p 1/2 £ from definition 2-17 that for generic data e (d,3I)=eO1/2(Syy312) which is

well-defined according to lemma 2-9. It

P r o o f o f l emma 2 - 1 9

Let L2cR n2 be given and MeN(L2). In the proof of proposit ion 2-18 it was shown • n 2 n 1 .L that there exasts an A:R ->R such that M ={(ax,az); a2eL2, ax=Aa2} and

~1/2 1 P 11(2 C¢tl • 1/2 -/2 T that for generic data e (d,a)= ~ .. with c~:=S a2 and Q:=S (A S~A+S +

t t ]l w¢ II Y y Y y YY T - / 2 . -"/2 -1 " 1/2 . . A Sxy+SyxA)Syy. Define A:=Sxx(A+SxxSxy)~yy , then a direct calculation shows

that Q=I-vETzvT+ATA. So eP(d,M) clearly is minimal on N(L2) if and only if , J / 2 . , . ~ 1 / 2 ,

Ac~=0 for all ~ e ~ y ~ 2 , i.e., (Z_~yy)[L2=0. As Sxx>O it follows that

AlL2=(-S;lxSxy)lL2which corresponds to 31=M*(L2). From the last line of the P * //2

proof of proposit ion 2-18 it follows that e (d,31 (L2))=E(I_V.~TsvT)1/2(SyyL2)=

P r o o f o f c o r o l l a r y 2 - 2 0

T 1/2 3- Let MeN with cP(M)<_r~2-k,, hence dim(M~)>k, and let L2:=V SyyM 2. It follows from

P P * 3- lemma 2-19 and proposit ion 2-10(i) that for generic data e (d,M)>e (d,M (312))=

~(I_sT~)I/2(L2)>_S(I_zT~)I/2(L~.), where L*k=span{cl,...,% } a s ([-ETE) 1/2 has _ , . 2 ,1/2

eigenvalues Al>_...>__),n2>__O with ai=~x-an2_i+l) , ie[1,n], and as %2_i+ 1 is an

eigenvector corresponding to ),~. v:TeV~ ~t-L_r * Hence minimal misfit is achieved by the model M*tM2) w i t h A f f ~ y ylv~l[ 2 - - ~ k o

2 6 8 Appendix

± (i} direct calculation then shows that M2=span{a 2 ; ie[1,k]} and that

, ~ -1 ,~ ( i ) ( i ) 31*'31±'=-x,Y'eR'q×R"2; ~ 2 ) { ~ ) -* <-SxxSxya2~x>+<a2,y>=O for all - - D x x ~ x y a 2 = - f f i a , ~ SO ± n 1 n 2 • . . * a2~M2}={(x,y)eR ×R ; <a~'),y>=a~.<a~*),x>, ~[1,k]}----mk. Moreover lemma

P * * 21/2 2 1 / 2 2-19 implies that e (d,mk)=e(x_~Tz)l/2(Lk)=((1-ak) , . . . , (I-a,) , 0,...,0). •

P r o o f of p ropos i t ion 2-21

According to lemma 2-19 it suffices to determine those subspaces M~ecR n2 for T I/~ ± which.e(i_xTs)V 2 (L2) is minimal, where La:=V SyyM2~ and to accept the models

P . . 31"(312). The requirement e (3l)<cto ! is eqmvalent to dim(L2)= P n2-c (M)>n2-cto~=:k. Let AI>..._>An2>_O denote the singular values of (I-ETE) l/2,

A "1 2 .1/2 i.e., i=(-0"n2_i+1) ~ so especially for r<n 2 A 1 . . . . . An2_r=l and for r*>O

Anz_r*+l . . . . . An2=O. In the notation of section 2.1.2 L~=span{ex,...ek} , and * - 1 / 2 / . . *

in the proof of corollary 2-20 it was shown that 3I (Syyl Lk)=M~, P • 2 1 / 2 el(d,Mk)=(1-aD •

(i) If ctol<n2-r then k>r and hence An2_k=An2_~+,=l. According to

proposition 2-10(i) and (iii) the optimal models are obtained by taking

L2=L'+L" with L'=L~ and L"cspan{ei; ie[r+l,n2] } with dim(L")=n2-Ctol-r , i.e., * . ±

L2DL r and dim(L2)=n2-cto 1. This is equivalent to dim(M 2)=n2-eta and * -1/2 * * McM (Sy y VL=)=Mr.

P * * P * 2 1/2 (ii) Clearly c (Mr*)=n2-r <cta and e (d,Mr.)=(1-ar*) =0, hence eP(d,31**)=O. P * * So it suffices to prove that {c (M)<n2-r , ~P(d,M)=O}~ {M=Mr* }. If

r -1/2

eP(d,M)=O, then there holds M-Lcker(~SxxSxY])=tsyx syyj [SXoV Sy °1/2y vJ ] " ker( Izx T 7 ) ) =

x~ v _5/2 ] . span{ _~ ; ~[1 , r ]}=span{(al,-a(20); ,~[1,r ]}, where e~ and Sy y v.j

e~' denote the i - th unit vectors in R nl and R n2 respectively. If in addition

ce(M)<_n2-r*, then dim(M'±)=dim(lll¢)>_r *, hence M±=span{(a,(i), _a~,));" i~[1,r*]}

and M=M**.

(iii) I f an2_Ctol>O'n2_etot+, t h e n An2_k>An2_k+ 1 and according to proposition

2-10(ii) we get L2=L ~ with corresponding model bl~ 31" 2 ctot

( i v ) I f O ' n 2 _ C t o l = O ' n 2 _ e t o l + l then An2_k=An2_k+l~ SO according to proposition

2-10(i) and (iii) the optimal models are obtained by taking L2=L'+L"

where L'=span{e,,...,eel } and L"cspan{eq+l,...,ec2 } with dim(L")=k-dim(L')= ne-Ctoz-q. The corresponding models are M*(L2)=M*ac~L where L±cM(a,2_%l) ± with dim(L-L)=dim(L"), so M(an2_Ctot)CL and cP(L)=n2-dim(L")=Ctoz+C,. •

C h a p t e r V 269


P . . . . * , ,~ 2 ,1/2 to l (i) Clearly el(a,:ln2)=tl-an2 ) <Q and cP(M~2)=0, hence it suffices to show

that {cP(M)=0, eP(d,M)<_eP(d,3I*2)}..~. {M=M~*2}. This follows from lemma

2-19 as for cP(M)=0 bl~=N "2 and 31"(Rn2)=74"2.

(ii) If cP(M)<n2 then dim(M~)_>l and according to corollary 2-20 e~(d,M)_> (~ 2 , ~/2 to~ ±-al)->¢1 • Hence cP(M)=n2, so M~={O} and hence M'I-={O}, i.e.,

M : ~ n l + n 2 .

• ~ 2 , 1 / 2 t o l , .~ 2 1/2 (iii) Let k:=r and note that (l-ak) <~1 <I=(l-ak+l) • The result then follows

from the proof of (iv). P 2 1/2 t o l (iv) If cP(M)<n2-k then according to corollary 2-20 el(d,M)>_(1-a~.+l ) >Q .

P • . ~ , * , ,~ 2 , V 2 t o l As cP(M*k)=ne-k and el(a,:lk)=(l-ak) <el it follows that the minimal

achievable complexity is n2-k. As ak>ak+~ it follows from proposition

2-21(iii) with cto/:=n2-k that the optimal model is M~.. •

P r o o f o f t h e o r e m 4 -4

Let /~ be as defined in step 5 of the algorithm. Due to assumption 4-3(ii) B is ± ±

uniquely defined. It follows from steps 3.0 and 3.2 that vt(Lt).l_vt(Bt_l+Sl3t_l) and it inductively follows from assumption 4-3(iii) that B~ as defined in

steps 2.2 and 3.2 indeed exactly consists of the t - th order laws claimed by /3

as defined in section II.3.2.4. Then (iv) follows from the definition of L D in

section II.3.2.5 and (ii) is implied by proposition II.3-10.

We prove (i) and (iii) by induction. For t=0 the restriction c(8)<Cto / t o t implies that at least e 0 zero order laws should be accepted, cf. definition

4-1. As for such laws eD(~,r)=[]S1/2(~v,O)vo(r)Tll/llvo(r)]] it follows from

assumption 4-3(ii) and p~oposition 2-10(ii) that the unique optimal solution

is given by V 0 as defined in step 2.2, with corresponding misfit (aq_e~oI+l , ...,aq, O,...,O) where a k is the k- th singular value of Slh(~,0). This shows

(iii) for t=O. Note that due to assumption 4-3(ii) and the lexicographie

ordering of misfits, cf. definitions 3-4 and 3-9, it is suboptimal to accept

more than e t°z laws.

Next suppose that optimality of step 3.2 is shown for steps v with v<t-1

for some t>l. According to definition 4-1 it follows from the requirement

c(B)<cw~ and the fact that err 0/ equations of order v have been accepted,

270 Appendix

T ~ Z t - - 1 , tha t a t least e~ °~ equations of order t have to be accepted. Moreover,

according to definition 3-3 we have to minimize the misfit of the newly 3_ 3- .1_

accepted laws in [ v t ( B t _ l T S l 3 t _ l ) ] . NOW for R~N~×q[s] there holds {vt(r)3. 2- .L v t (Bt_ l+SBt_l )}** {3aeR qt such that vt(r)=aTpt}. Hence for such laws t we get

D ,T J /e , e (~,r)=l{rfol[/lIrll=(vt(r)S(ff),t)vt(r J ) /I]vt(r)ll=l}SVt~aH/lla]}, where St:=PtS(~,t)P T. Here we used the fact that Ptpr=Iqt . Assumption 4-3(ii) and proposition

2-10(ii) imply tha t the unique optimal solution for step t is given by Vt of

step 3.2. The corresponding misfit is (aqt_e~Ot+l,...,aqt,O,..,O) where a k is ¢v2 the k - th singular value of o r , which shows (iii) for step t. This concludes

the inductive proof of (i) and (iii). •

P r o o f o f t h e o r e m 4 - 6

Due to assumption 4-5(ii) B in step 5 of the algorithm is uniquely defined.

Now (ii) and (iv) follow from assumption 4-5(iii) as in the proof of theorem

4-4(i i ) and (iv). We prove (i) and (iii) by induction. For t=O it follows from the ordering

of definition 3-12 that c o should be minimized~ i .e , the number of zero order

laws should be maximized. As for such laws eD(~,T)=[[S1/2(~,O)Vo(r)T[[/[[VO(r)[[, it

follows from proposit ion 2-10(i) and from step 2.1 of the algorithm tha t the D . . . . . tol

requirement e0,1~w,o)<e0 implies that at most e 0 laws can be accepted. Given

the ordering of definition 3-12 the misfit of these e 0 laws should he

minimized. Proposition 2-10(ii) implies that the unique optimal solution is

given by V 0 as defined in step 2.2, with corresponding misfit (aq_eo+l,...,aq,

0,...,0) where a k is the k - th singular value of Sv2(~,0). This shows (iii) for

t=0.

Next suppose that optimality of step 3.2 is shown for steps ~- with T_<t-1

for some t_>l. According to definition 3-3 we have to minimize the misfit of -1 _L 3_ .L

newly accepted laws rev t {[vt(Bt_x+sBt_l) ] }. Now r is in this set if and only

if there exists an aeR qt such that vt(r)=aTpt and then eD(ff~,r)=lls1/:all/llall with

St:=P tS( ffJ,t )P~'.

First assume tha t e~'_<e~. The ordering of definition 3-12 implies that a

maximal number of t - t h order laws should be accepted. Due to the requirement D . . . . . . tol

et , l (W,b)<Q it follows from proposition 2-10(i) and step 3.1 of the

algorithm that a t most e~ t - t h order laws can be accepted. The ordering of

Chapter V 271

definition 3-12 and proposition 2-10(ii) then imply that the unique optimal

solution is given by V t as defined in step 3.2, with corresponding misfit ¢1/2

(aqt_e,t,+l,...~aqt,O,...,O), where a k is the k - t h singular value of ,~t, which

shows (iii) for step t.

Finally assume that e~'>e~. Assumption 4-5(iii) implies that q-e~

independent laws or orders T<t--1 have been accepted. It then follows that at

most e~ independent laws of order t may be accepted. Indeed, otherwise the

resulting set of laws cannot be bilaterally row proper and from proposition

II .3-8 it follows that then there would exist an equivalent set of laws with

more than q-e~ laws of order at most t-1. According to steps 3.1 of the

algorithm for T<t--1 this would necessarily lead to unacceptable misfits.

Assumption 4-5(ii) , the ordering of definition 3-12 and proposition 2-10(ii)

imply that tim unique optimal solution in this case is given by Vt as defined

in step 3.2 with misfit as given in (iii).

This concludes the inductive proof of (i) and ( i i i ) . •


Assumption 4-7(ii) implies that step 3.1 is well-defined, assumption 4-7(iii)

that step 3.2 is well-defined and that /~ in step 5 is uniquely defined. It t ,1.

follows from steps 3.0 and 3.2 that vt(Lt}3-vt(Ft_ls }+vt(Bt_l) and it

inductively follows from assumption 4-7(iv) that B~ as defined in steps 2.2

and 3.2 indeed exact ly consists of the t - t h order laws claimed by g as defined

in section II.3.2.4. Then (iv) follows from the definition of LPt in section

II.3.2.6 and (ii) is implied by proposition II.3-10.

We prove (i) by induction. For t=O the optimality of step 2.2 follows

from theorem 4-4, cf. definition 3-7. Next suppose that optimality of step 3.2

is shown for steps r with v<t-1 for some t>l. According to definition 4-1 it

follows from the requirement c(B)<cto I and the fact that er t°l equations of

order v have been accepted, T<t-1, that at least ett °t equations of order t

have to be accepted. Moreover, according to definition 3-7 we have to minimize t -1. -1.

t h e misfit of the newly accepted laws in [vt(Ft_ls)+vt(Bt_l)] . Now for t .l. reRlt×q[s] there holds {vt(r)-l-[vt(Ft-lS )+vt(B t -1)]} ~=~ ( 3a-eRqt, a+ eRe-hi-1

T r a'- [~ala_ ~t):=Pt.col(~(to+i),...,~(to+i+t)), such that vt(r)=(a_,a+)Pt}. Let . - + ,

i e [0 ,q- t0- t ] , then for such laws r we get eP(~,r)= II r~l[ /II r*~l l t =

272 Appendix

T ~ T ~/2 T ~ P ~(t) (a P~S(w,t)Pta) ]tla+P2twl]t=e (z ,a), where the last, static, predictive

misfit is defined as in definition 2-15 with n~:=q~, n2:=q-ntq ,

x~:=Plt" c°l(w(to+i),"',~v(to+i+t-1)) and Yi:=P2t "W(to+i+t), i~[O,tl-to-t]. Due to the requirement vt(r)lvt(13~t_l) at least ett 0/ independent a+ functionals

need to be accepted. Defining "gtoz:=q-nt_rett 0/, cf. section 2.2.1, we obtain

the optimal relations by applying proposition 2-21(iii), due to assumption

4-7(iii). Note that assumption 4-7(ii) implies that the genericity conditions

in proposition 2-21 are satisfied, cf. section 2.2.3. So proposition 2-21(iii) implies the optimality of step 3.2 in step t of the recursion, cf. the

definition of 3l*[a in section 2.2.3. This concludes the inductive proof

of (i).

Finally the expression for the misfit in (iii) follows from (i), (iv),

and corollary 2-20. •


Assumption 4-9(ii) implies that step 3.1 is well-defined, assumption 4-9(iii) tha t step 3.2 is well-defined and that t3 in step 5 is uniquely defined. Now

(ii) and (iv) follow fl'om assumption 4-9(iv) as in the proof of theorem

4-8(ii) and (iv). We prove (i) by induction. For t=0 the optimality of step 2.2 follows

from theorem 4-6, cf. definition 3-7. Next suppose that optimality of step 3.2

is shown for steps r with ~-<t-1 for some t_l. According to definitions 3-7 and

3-12 we have to minimize the misfit of a maximal number of newty accepted laws -1 t 3_ 3_ r~v t {[vdF~qs )+vdBt_l)] }. For such laws eP(~,r)=eP(z'4t),a) with ~t) and

a as definied in the proof of theorem 4-8(i). First suppose that in step 3.1 P . . . . . tol

e~'<e~. Due to the requirement ct,flw,o)<¢t if follows from proposition

2-22(iv) and the ordering of definition 3-12 that the unique optimal solution

is given by step 3.2, cf. the proof of theorem 4 - 8 ( i ) . Note that assumption

4-9(//) implies that the genericity conditions in proposition 2-22 are

satisfied, cf. section 2.2.3. Next suppose that in step 3.1 ¢~'>e[. Assumption

4-9(iv) implies that at step t at most e[ independent laws of order t may be

accepted for the reason given in tim proof of theorem 4-6. Assumption

4-9(iii), the ordering of definition 3-12 and proposition 2-22(iv) imply that

the unique optimal solution in this case is given by step 3.2.

Chapter V 273

Finally the expression for the misfit in (iii) follows from (i), (iv)~

and corollary 2-20. •

P r o o f o f p ro po s i t i on 5 - 2

As a simple example take B=(Rq) z. For any web and any :7- of the finite length

there exist B'~B such that wIT~B]T and dim(8')<_q.#(:7"), e.g., B':={w~(Rq)Y-; (a#(:7")-l)w=O}. Hence B~P"(wlS,- ) for any wlv-e(Rq) :T. •

P r o o f o f p rop os i t i on 5 - 3

We give the proof for P~ctol, pD and ~D etot P~tot' as similar arguments hold true

for the other procedures.

First suppose that cto t is given. Let eto~:=e(ctot) be the equation

structure corresponding to Ctot, cf. definition 4-1. If etol=O, then it

follows from definition 3-9 that pD is not consistent for the same reasons c tot

as given for pU in the proof of proposition 5-2. If there is t~:~+ with ctt°t>_l, then /~eB with e~(13)=O cannot be exactly identified, hence pD is not

c to t

consistent. D t o l

Next suppose that Q~ is given and consider P~tot" If et,l_<0 for some

tJ /+ , then for every 8eB (exact) identification is impossible for :7.

sufficiently large, cf. the interpretation following assmnption 4-5. If on the

other hand tot ^ pD ~t,l>u for all te77+, then especially e0,1>ut°t ~ so ~tot will accept

laws of order zero for w]2-e(Rq)T of sufficiently small norm. Not having this

sufficiently small norm clearly is not a generic property for any BeB, as B is

Especially eo(~)---~O , --/ '3

linear, if BeB with then ~tol cannot 8 exactly identify

for generic time series~ h~nce pD is not consistent. v to t

t o l Finally for ~tot note that Q,x<u for some tJ /+ implies that for

every BeB identification is' hnpossible for 3" sufficiently large, that ett°,tl>O for some feZ+ implies that, e.g., (Rq) z cannot be exactly identified for

tot ~ for all t~F+ implies that ~D is not generic time series, and that ¢t,l=U etot consistent for the reasons given in the proof of proposition 5-2. •

274 Appendix


For given BoB c we have to prove that generically in w~B there holds

P~(~)=P~e(~)={B} for ~v=w[gr with #(: / ' ) sufficiently large.

In the proof we use the following lemma. Let [l(Y):=(#(:Y)-q)/(q+l) and let

S(~,t) denote the empirical covariance matrix of order t as defined in

sections 3.3.1 and 4.2.1. Further for Vc(R:×q) t+1 let VT:={ve(g(q)t+:; vTev}.

Lemma 5 - 4 For t<d(9 v) generically in wEB ker(S(w[T,t))--[vt(B~)]T.

P r o o f o f l emma 5 -4

[vt(B-7)]Tcker(S(w[sv,t))''. and hence it suffices to show that gen. in Evidently

we13 dim(ker(S(wlT,t)))<dim(vt(B~)), i.e., rank(S(wlT,t))>_q(t+l)-dim(vt(B~))=

q(t+l)-Zk=o(t+l-k)ek, where {ei; teZ+} is the tightest equation structure of

/3, cf. proposit ion IV.2-2(i) and the remark following definition II.3-9. Let

T:=#(9") and relabel the time instants such that :T=[1,T]. Then S(w]5~,t)= 1 iH(w)H(w)T, where H(w)eR q(t+:)×(T-t) is defined by

T -

w(1) w(2) ... w(T-t) w(2) w(3) ... w(T-t+l)

H(w):= w ( t + l ) w( t+2) ... w(T)

It hence suffices to prove that there exists a web with rank(H(w))_> t * q(t+l)-Zk=o(t+l-k)ck=:rw, as this then also holds true generically on B.

Nota t ion . We use the following notat ion and concepts.

Let /3 have a minimal input /s ta te /output realization BiHo(A,B,C,D), i.e., there exists a permutat ion matrix // such that Ill3={(u,y)~(Rm)Z×(RP)Z;

I;; lI.l Bx~(Rn) z such that a = }, where n is the number of states, m the

number of inputs, and p:=q-m the number of outputs. We refer to definition ~ N N

II.3-24. As BeB c it follows that (A,B,C) is a minimal triple, by which

we mean that (A,B) is controllable, cf. section IV.3.2, and that (A,C) is

observable, cf. the proof of proposition IV.3-5, i.e.,

rank([B AB...A B])=rank(col(C,CA,...,C~4n-1))=n. For given FeR m×p let (A,C):=(A+BFC, C+DFC) and for (u,x,y)eYil, lo(~i,B,C,l) )

Chapter V 275

let v:=u-FCx. Then IIB={(u,y);3(v,x) such that = ~ }. Hcncc

@e/3][1,r] if and only if there exist x0e~ n and v[[1,T]e(Rm) T such that for all

~'e[1,T] l l~(T)=r~(r) l with ~t('r)=FCx(T)+vO- ) and ~(7)=Cx(T)+Dv(r), where Ly(")J

r - I ~ - 1 k - 1 ~ x(v):=A x0+Ek=IA By(r-k), Te[1,T].

We call A~R n×n cyclic if there exists an xo~R n such that

det([x0 Axo...A~-Xxo])#O. For ze(Ra) z let H(z)cR a(t+l)x(T-t) be defined by

) ... z ( T - t ) ] z(3)

L z ( i + l ) z(t+2) ,.. ziT) ]

Finally, for Mi~R nl×n2, i=1,2, let M1~312 denote that 311 can be transformed

into 312 by means of elementary row operations, i.e., {M~~312} : ~ {3SeR ~*'4

nonsingular such that S3I~=Mz}. []

We now show that there exists a web with rank(H(w))>_rw, by choosing

appropr ia te F, x0, and v l[1,T]. The proof is split in the following three

parts.

(i) 3FeR m×p such that A:=~4+BFC is invertible and cyclic;

(ii) for F as in (i) and for w~B with //w= and [ u] = ]FO there holds

L rJ L c r n IH/ /)--rankl whet0 ....

(i~i) there exist x0eR" and V[[I,TIE(Rm) T such that for x(1):=x o and

x(T+l):=Ax(v)+Bv(T), T~[1,T-t-1], there holds rank([H~vl])>rw. l_ .l

The desired result then follows from (ii) and (iii).

(i) We have to prove that there is an FeR "*×P such that A:=A+BFC is (1) cyclic

and (2) invertible. As these are algebraic conditions in F, cf Kailath

[33, 1emma 7.1-2], it suffices to prove that (1) and (2) can be satisfied

individually, as this even implies that conditions (1) and (2) are

simultaneously satisfied for generic F.

For (1) we refer to Kailath [33, lemma 7.1-2 and section 7.2.3].

Next we consider (2). As (A,B,C) is a minimal triple it follows from

276 Appendix

Kailath [33, lemma 7.1-1] that there exist tieR" and yeR p such that for b:=Bfl

and C:=TTc also (~4,b,c) is a minimal triple. Let a basis in R n be chosen such

that {A,b) is in control canonical form, cf. Kalman, Falb and Arbib [39, 11 1 1 - 1 J:

section 2.4], i.e., if det(sI-A)=s +Ek=Oaka and a:=-(ao,ai,...,a~_1) , then in

this basis ~=[0 In-l~a j '~ b=(o,...,O,1) T, and c=(q,...,Cn) for some cieR,

iE[1,n]. If a0¢0 then A is invertible, hence F=O satisfies (2). If ao=O then

observability of (A,c) implies that c1~0 , hence for F:=fl7 T there holds A= (o In_i) , j with a':=(c,,c2-a,,...,cn-an_,) , and as ci~0 A is invertible.

I°i] Ii ]I:l E:I (ii) Let F be as in (i), (A,C):=(A+BFC,C+DFC), = ~ and w:=ll . Then

by means of elementary row operations it follows that H(.w)~]u(~)]= L"(ylJ

~ H(u) ] _ [ll(FOx)+H(v)]- [ H(v)] H(Cx) +H(bu)J L n(~x) J k**(~.)j"

Noting that for I<r,<_T2<_T there holds X(T2)=ArZ-rlX(71)+ ~ r 2 - r 1 , k - l ~ , , . , -k=l A tsv(rz-~), it follows from the Hankel structure of H(v) and H(Cx)

[ "(~)] _ that LH(~x) J ~---[H(')] ~ where M:=col(C,CA,...,CAt).[x(1)x(2)...x(T-t)].~~ For

example, x(v+l)=Ax(r)+Bv(r), r~[1,T-t], hence by subtracting CB times the

matrix consisting of the first m rows of H(v) from the matrix consisting of

rows p+l,p+2,...,2p of H(Cx) this latter matrix is transformed into

CA[x(1) x(2)...x(T-t)], and similarly for the other rows.

follows that H(w)-H(v) As col(C,6"A,...,CAt)-col(C,C.Ti,...,C~i t) it [M ] with f - __ . "1

~'/:=eol(C,C,4 .... ,C.4').[x(1) x(2)...x(T-t)], hence rank(H(w))=rank(l '~=)/ ) L*~ 3

(iii) Finally we prove that there exist XoeR n and v I[1,T]e(R') T such

that for x{1):=x o and x(r+l):=Ax{r)+Bv(r), re[1,T-t-1], there holds

rank( [H!v)] )>_r~. M ~ t *

Let rx:=p(t+l)-Sk=o(t+l-k)%=rw-m{t+l ). Denoting the i - th component of v by vi, ic[1,m], we define v l[1,r]~(R') T

by vm_~(t+rx+k(t+l)+l):=l , k~[0,m-1], and 0 elsewhere. Note that

t+rx+(m-1)(t+l)+l=rw<_q(t+l)<T-t , as t<[l(:T)=(T-q)/(q+l). It is a simple matter

of explicitly writing out H(v) to conclude that rank(H(v))=m(t+l) and that in

H(v) the first r x columns are zero. As v t[t,r~]=O there holds that T - 1 x(r)=A x0, re[1,rx]. Now suppose that there is an XoeR n such that

rank(col(C,C~i,...,C~4t).[xo Axo...ArX-*Xo])=rx. Then the matrix consisting of

C h a p t e r V 277

the first r~ columns of /~ has rank rx, and it easily follows that the first

columns of |n~v)] are linearly independent, hence rx+m( t+l )=rw rank([n(v)l)>_rw, as desired.

L ~ ]

I t remains to prove that there is an x0eR ~ such

ttlat ~lo:=col(C,C.74,...,C~4t).[xo Axo...Arx-lXo] has rank r x. It follows from

Willems [73, table 2 and theorem 6(vi)] that rx<n and that . . . . . t t k * t * rank(col(C, CA,...,CA ))=Sk=o(P--Sj=oej)=p(t+l)--~k=o(t+l--k)ek=r x. Now suppose

that rank(~/o)<rx for all x0eR n. Then there would exist a k<r x such that for

generic xocR" the (k+ l ) - th column of ~I0 is linearly dependent on the foregoing n - 1 - k .

ones, say for xo~VcR n where V is generic• Then V:-- zNo.= A V : = { x ~ ; A~x~V for

all i~[O,n-l-k]} is also generic, as A is invertible. For every

xocV rank(col((~,C~i,...,~{t).[xo Axo...A"-lXo] )<_k<rx. However, as A is

cyclic, for generic xo det([x 0 Axo...An-tXo])#O and hence generically on

R n rank (col (C, CA, ...,c,4t).[Xo Axo...An-lxo])=rank(col(C,C~t,...,C'~tt))=rx. This

contradiction shows that there is an x0eR '~ with rank(M0)=rx, as desired.

This concludes the proof of lemma 5-4. •

D D Using this lemma we now first prove the theorem for POe. Let Vt:=vt(Lt) be tile

descriptive complementary spaces of B as defined in section II.3.2.5 and let

nt:=dun(Vt)=et(B), see proposition II.3-10. Moreover let t :=max{t;n#=O} and

let {t; nt#O}={tx,...,tc} with O<t<t2<...<tc_l<tc:=t*. We now show that gen. in

web /~e(wl:?-)={B } for every T with [I(T)>t*, i.e., #(T)>t*(q+l)+q.

Let :7- be such that ~(:T)>t*. Now P~ only accepts laws with descriptive n l x ~ ' r , misfit zero. Note that for t_<~[(T) and r ~ t is] there holds {eO(~,r)=0}¢=~

{vt(r)~ker(S(~,t)) }. I t hence follows from lemma 5-4 that gem on 13 pD

accepts no law of order t<tl, as then vt(Blt)={0}. For step t=t 1 gen. on 13

ker(S(w[:T,t))=vD 1 and hence gen. /~e exact ly identifies these laws on step t t.

Now suppose that for some k~[2,c] PO e gen. on steps tf identifies V D t i ,

i~[1,k-1], and gen. no laws are accepted on steps t_<tk_ I with t 6 { t l , . . . , t k _ l } .

- ± ~± ~± LD={0), and it For steps t~[tk_l+l,tk-1 ] there holds that l~ t =~,_l+sDt_l, as

follows by induction from lemma 5-4 that gem on /3 the rows of Pt as defined

in step 3.0 of the algorithm of section ,1.2.2 span [vt(B~)] ± and that gem on

13 ker(PtS(w }T,t)FT)={O}. So gen. no laws are accepted for steps ± ± . £

t~[tk_t+l,t~-l], and gen. the rows of Ptk span [Vtk(Btk_l+sBtk_1) ] . Lemma 5-4

2 7 8 A p p e n d i x

and the algorithm of section 4.2.2 imply that hence gen. on /3 for step t k P~e _1_ .L .L _L V D exactly identifies Vtk(/3tQn[Vtk(Btk_l+S/3tk_l) ] = tk. It follows by induction

that on step d(Y') P~e gen. has identified exactly vDi, ti<_t*, i.e.,

/~ee(W[:T)={/3 }. Note that the number of steps is finite and that a finite

intersection of generic sets is generic.

This proves consistency of P~e. The consistency of P~e follows from this

result. Note that for order zero the procedures coincide, while for steps t_>l

{eP(~,r)=O}.~{eD(~o,r)=O), provided that [[r*~][t~0 where r* is the leading

coefficient vector of r, cf. definition 3-5. As always on step 0 the set of P D zero order laws Vo=V o is identified it follows that gen. on t3 in all steps

* * P l_<t_<d(T) I[r*~]lt~0, as it is required that r ±Ft_l, hence r ±V 0. The

consistency of P~e then in]plies the consistency of P~e. •

P roo f of theorem 5-5

Lemma 5-4 and the proof of theorem 5-4 show the following. We recall that for

given interval of observation :7" the procedures P~e and P~e only consider the

model class ~(:T) for identification, cf. definitions 3-8, 3-13 and 3-14.

Lemma 5-5 Generically on 8eBc, P~(w[5~)=P~(wl:T)=/3(Y-):={w'c(Rq)z; r(a)w'=O for all reB~ where t:=d(:T)}.

_l_ .L pD and /~e evidently are exact. Moreover, as B tc/3t+l it follows that

B([to,tl+l])=B([to-l,tl])cB([to,tl] ) and lemma 5-5 implies that pD and P~ are

bilaterally monotone, i.e., the identified model becomes more strict if more

observations are available. To prove linearity, let B1,/3eeBc, then /~lq-/32e~]c~ [ 1 ± ,t3i) t ~(/31+B2) t , i=1,2, hence (/31+B2)(T)~Ba(Y')+I3a(:T) and lemma 5-5 implies that

P~e and P~e axe linear. That these procedures axe truthful is evident from lemma

5-5. Finally, for given 7 the procedures P~ and P~ only identify models in the

class •(:T), cf. definition 3-8, and according to lemma 5-5 models in B(:T) are

strongly corroborable by P~e and P~e, which proves that these procedures are

strongly prudential. •

C h a p t e r V 279


We first s ta te and prove two lemmas which will be used to prove the theorem.

Nota t ion . For A=ATeR n×n let a(A):=(al,...,an) , with al>...>a,, denote the

ordered set of eigenvalues of A. Let A=U•U T, ~=diag(al,...,an) , uuT----uTu~-Im

and for k<l with ak_l¢ak, al~=al+l, let /4[k,1]cR" denote the space spanned by

the columns k,k+l, . . , l of U. Consider the collection of finite dimensional

linear spaces £:={L; 3hen with LeR n, and L linear}. A sequence (Lk; keN) is

defined to converge in the Grassmannian topology if there is an LeL:, Lc~ n,

such that for k sufficiently large LkcR', dim(Lk)=dim(L), and such that there

exist choices of bases in L k which converge in Euclidean sense to a basis of

L. We denote this by Lk (a~ L. The sequence is defined to converge in the gap

topology if there is an LeE, LcR", such that LkCR n for k sufficiently large

and such that g(Lk~L)-~O for k-~c% where g denotes the gap between L and Lk, cf.

definition IV.5-1 and Kato [40, section IV.2.1]~ or Stewart [66~ section

2.1]. []

Lemma 5-10-1 The Grassmannian and gap topologies are equivalent.

Lemma 5-10-2 (i) The mapping a:A-~a(A) is continuous; (ii) if in a(Ao)

for some k<l ak_l~a k and az~al+l, then the mapping A-~ ' [k j l is

continuous in A0, in the gap topology; (iii) if Ao=ATo>O then the mapping

A-~ A -1/2 is continuous in A 0.

P r o o f o f l emma 5 - 1 0 - 1

Let {Lk; keN} be given. Note that convergence in the Grassmannian or gap

topology both imply that there is an n~N such that LcR n and LkCR n with

dim(Lk)=dim(L)=:d for k sufficiently large, cf. Stewart [66, section 2.1].

First suppose that Lk (a~L. Then there are matrices B and B k of full

column rank with Lk=im(Bk) , L=im(B), and ]lBk-BI]->0 if k-~o0. The orthogonal

projection operators P and Pk on L and L k respectively are given by T -1 T P=B(BTB)-IB T and Pk=Bk(BkBk) Bk, hence IIPk-P[]->o, and Stewart [66, theorem

2.2] implies that g(Lk,L)-->O for k-)oo, cf. lemma IV.5-2.

280 Appendix

Next suppose tha t g(L~,L),O. Let L=im(B) where B has full column r ank d.

For all e>0 there is a K E such tha t fo r k>K e the re exist {b~(k,e),...,

bd(k,e)}cL ~ such tha t fo r B(k,s):=(b~(k,e),...,bd(k,e)) I[B-B(k,e)ll<e, cf.

def ini t ion IV.5-1. For e suff ic ient ly small B(k,e) has full column rank and

hence fo r k suff ic ient ly l a rge the columns of B(k~e) fo rm a basis of Lk, i.e.,

L~(a~L. •

P r o o f o f l e m m a 5 - 1 0 - 2

(i) We r e f e r to S tewar t [67, co ro l l a ry 6.5.11].

(ii) Let A=U~U T with uuT=uTu---In, ~=diag(al,...,an) , al>_...>an, and let k<l be

such tha t ak_l~a k and at~aZ+l. Let UI~R n×(l-~+l) consist o f columns

k,...,l of U and U2eR n×(~-l+k-1) of the remaining columns of U, so

//[k,tl=im(U1). Let ~:l:=diag{ ak,. . . ,al ), Z2:=diag(al,...,ak_1,at+l,...,an), so

A=(U 1 U2)I~ ' ° l ( U l U2) T. Then the so -ca l l ed sepa ra t ion of Z 1 and £2 is ku 23

min{lajl-aj2[; jle[k,l], j2~[1,k-1]u[l+l,n]}>O. Now suppose t ha t Ai=AT->A. ( i ) _ ( i ) ( i ) ( i )

Let A i have e igenvalues a~ >...>__an, then according to (i) a j >a k for

j<k and @i)<ffli) fo r j>l if i is suff ic ient ly large. Let Ai=

~(20 (U~ i) U(20) r be a decomposi t ion of Ai ana logous to the . /Tl(i)~ 71(i)

one of A, so especial ly - , ~ t J=~[k,ti. I t follows f rom Stewart [66, section

4.7 and coro l l a ry 2.6], t ha t hn(U~iI).~im(U1) in the gap t opo logy if i-~ce, i.e.,

Ul~!zi->//[k,z] if i->ee. We also r e fe r to Davis and g a h a n [10, sect ion 2 (the

sin20 theorem) and theorem 8.2].

(iii) For A0>0 the mapping A->A -1 is cont inuous in A0, so it suffices to show

tha t A->A ~/~ is cont inuous in A a. I f A=U~UT>O with UUT=UrU=ln and

E=diag(a~, . . . , an) then A~/2=UN~/2U T. According to (i) the mapping A - ~ v is

cont inuous, and by apply ing (ii) and lemma 5-10-1 fo r the e igenspaces of A 0

cor responding to distinct e igenvalues it follows tha t the re exists a choice

fo r U such tha t A.->U is cont inuous in Ao. Hence A , A ~/~ is cont inuous in A 0. •

The two fo rego ing lemmas a re used to p rove the theorem.

N o t a t i o n . Let ~=w~[:T fo r a rea l iza t ion w r of a s tochast ic process w in tile

class G cctot or O(etot,-a) and let APtol(W)=Bc~]~ and APtot(W)=13~e~]~ be as defined C e / .E ~ C

in defini t ion 5-7, cf. p ropos i t ion 5-9. Let Vt:=vt(Lt) and I t:=vt(Lt), where L t

C h a p t e r V 2 8 1

and L t are the predictive spaces in (CPF) corresponding to B c and B~

respectively, a s defined in section II.3.2.6. Let e(Bc) and e(Be) denote the

tightest equation structures of B c and B~ respectively. We will show that for

sufficiently large, a . s . P~ctol(~)::Bc(Y') and P~(~tol,~)(VJ)=:Be(Y') are T:=#(T) singletons. We denote the corresponding predictive spaces by V~(T) and V~(T) and the corresponding tightest equation structures by ec(T) and ee(Y ") respectively. Further we use the notat ion of the algorithms in sections 4.3.1

and 4.3.2 and we use a • to indicate symbols corresponding to these algorithms

when applied to the process w, cf. proposition 5-9. Finally the condition that

# ( T ) should be sufficiently large is denoted by T->oo. []

We now prove the theorem by induction and first consider l °P c lol"

Let weC~ccta. We will prove by induction that a.s. co(T) ~ e(Bc) and

Vt(T ) ~ Vet in the Grassmannian topology. By choosing bases 31(t)(T) in Vet(T) which converge to bases M (t) of V~ and defining R(T):=col(v-tl(31(t)(T)); tcZ+) and R:=col(vtl(M(t)); t~Z+) we get Be(T)=8(R(T)), /~e=B(R), while R(U) -> R in

Euclidean sense where R is bilaterally row proper, cf. proposition II.3-8,

definition I1.3-15 and assumption 4-7(iv). Hence a.s. Be(T)--> Be, cf. section

5.3.2, which shows consistency of P~cto z. I t remains to show that a.s. ee(T)--> e(Bc) and Vct(T)(a~VCL. Consider the

algorithm of section 4.3.1. Note that Ctoa is sensible for T-~ o0, i.e.,

assumption 4-7(i) is then satisfied. As weG~ctoacG it follows from assumption

5-6 that a.s. S(~,t)--> * S(w,t)=:S (t) for all teZ+ if T-~c¢ . So assume

henceforth that w r satisfies assumption 5-6(ii), then it remains to show that

{S(~, t ) -> 5"*(t); teZ+}=~{eC(T)-> e(Bc) and V~(Y)(e~V~ for all t~Z+ if T-> 00}.

First consider step 0 of the algorithm of section 4.3.1. As w~G c it tol

... .(0 .(0 follows from definition 5-8 and assumption 4-7(zzz) that ~r._etOZ>a._etOl+1.

~[ 0 ~ 0

As S(~,0) -> S*(0) it follows from lemma 5-10-2(i) that assumption 4-7(/ii) for

t=O is satisfied for T-> cv. I t then follows from step 2.2 and lemma 5-10-2(ii)

that e~(T)--> eo(Bc) and g(Vo(T),Vo)--> 0, and from lemma 5-10-1 that hence

V~o(Y)(C~V~. It follows from Stewart [66, theorem 2.2] (cf. lemma IV.5-2), that

tlle projection opera tors PI(T) and P~ of step 3.0 for t=l satisfy

IfPI(T)-P~]I -> O. Note that the dimensions of PI(T) are equal to those of P1 if

eo(Y)=eo(8c) , i.e., for T -> c¢.

Next suppose that for some t<max{T; eT(Bc)¢O}=:t* it is proved that for

2 8 2 A p p e n d i x

all k~[O,t-1] ~(T) * ek(B~l, V~(Y)~"~V~, and Pk÷,(7) "~ P~+,. As

S(~,t) S*(t) there holds Pt(T)S('~,t)Pt(T) T • . ,T ->PtS (t)Pt • I t follows from

definition 5-8 and assumption 4-7(ii) for w that S (*) and S (0 as defined in

step 3.1 for ~ have full rank if T-> oo. Let Ct(Y):=(S (t) "1/2S(0 S (t) -~/~ ) _+( + ) and let

C~ be defined analogously in terms of S*. It follows from lemma 5-10-2(iii)

that Ct (7 ) -~ C~. By considering Ct(:T)cT(u) it follows from lemma 5-10-2(i) and

assumption 4-7(ii i) for w that this assumption also is satisfied for ~ if

T-> oo, hence e~(iT)-> et(13c) for T-> oo. Applying lenmaa 5-10-2(i) and (ii) to

Ct(7)cT(7) and cT(y)Ct(7) it follows that in step 3.2 g(VCt(7),V~)-> 0 and

hence Vt(:T)(a~vct if T-> oo. Moreover assmnption 4-7(iv) for ~ is implied by

that for w if T-> oo and hence the laws identified at step t for ~ then are

real ly t - t h order laws, i.e., V~(:T) then remains unchanged for k<t-1. As now

g(V~(:7"),V~.)-> 0 for all k<_t if T-> 0o it follows that in step 3.0 for t+l

[IPt+l(f')-Pt+l]l--> O, cf. lemma IV.5-2. This concludes the inductive part and G c

shows that for t<t* eel(iT) --> el(Be) and Vt(:T) ( ~Y t if T -> 0o.

Finally consider orders t>t*. As et(13c)=O for t>t*, the fact that

et(T)=et(13c) for t<t* and T-> 0¢ implies that for @ it is, for the given eta ,

allowable not to accept any law of order t>t*, for T -> c¢. Moreover, for t<d(7")

A:={Coe(Rq):T;det(S(ffl,t))=O} is a proper algebraic var ie ty and hence has

Lebesgue measure zero, cf. Federer [12, section 2.6.5]. As w ~ G C the

continuity of w implies that S(@,t)>0 a.s. Then also PtS(Co,t)pTt>O a.s., and

definition 3-9 implies that hence et(:T)=O a.s. for all te[t*+l,d(iT)]. This

shows tha t Vt(:T)(G)Vt, tear+, if T -> c¢, as desired.

The consistency of P~(etov~) on G(.tov~ ) is proved in a completely

analogous way by using definition 5-8, the first remark in section 5.3.5, • (0 ) ,-to~,2 assumption 4-9, and the algorithm of section 4.3.2. Note that aq_%>t~ 0 ) ,

(0 ) , - t o / . 2 which implies that for ~ also aq_%>teo ) a.s. for T-~ c¢, and in this case

%(Y-)=%(B~). Similar arguments hold true for steps t<_t*. That Vt(Y-)(a~Vt a.s.

for t<_t* follows in a way analogous to the proof of Vt(:T)(GIVt . Further note

that et(Y')=et(B~)=O for all t>~. Finally we consider te[t*+l,d]. Definitions . , * ( t ) , 2 ,-to~,2

3-12 and 5-8 and the fact that et(Bs)=O imply that l-(a~ ) >tel ) , and it . , ( t ) . 2 . - t o t . 2

follows by induction that hence l - ta~ ~ >tet J and e~(Y-)=O a.s. for all

te[t*+l,d] if T-> 00. It follows that V~(T)(a~v~ a.s. for all re71+ if T-> co, as

desired. Note that now continuity of w is not required.

C h a p t e r V 2 8 3



Let ~o~Y2~to I and Bo:=P~%ol(~0) , t*:=max{r; e,(B0)¢0 }. Let Ok-* wa if k-~ co,

hence S(Ok,t)-> S(~o,t) for all t<d(Y') if k -> co. In the proof of theorem 5-10

let ~0,~k, S(~o,t) and S(~k,t) play the role of w, ~=wr[y-, S*(t) and S(5,t) respectively. As for w0 assumption 4-7(ii), (iii) and (iv) are satisfied it

follows directly from the proof of theorem 5-10 that these assumptions also

are satisfied for ~k if k-~ co, for all t<t*. Definition 3-9 implies that * ~ * T PtS(wo,t)Pt >0 for all te[t*+l,d(Y')]. From this it follows by induction

that PtS(~k,t)pT>o if k -~ co. In this case et(P~ctol(Cok))=et(13Ü) for all t<d(Y')

and ~ke~CCto I if k -> co. This shows that ~cto~ is open. The proof of theorem

5-10 for t<t* moreover implies that P~cto~(~k).->l~¢to~(ff~o) if k-> co, which

proves the continuity of P~%oz on ~ctol. That J~toz is open and that on this set I~eto I and ~to~ are continuous

also follows directly along the lines of the proof of theorem 5-10 by taking

d:=~(Y'). Finally note that the assumptions on ~etol imply that none of the _tol y2P • restrictions s t is critical. Hence P~%o=~to~ on ~to?


ff~oe~ctot , then P~ctoZ(50) is a singleton, see theorem 4-4(i) . Let 5k -> 50 Let

if k -> co, hence S(~ok,t ) --> S(Coo,t ) for all t<_gi(T) if k -> co. From the definition

of Y2 D it follows along the lines of the proof of theorems 5-10 and 5-11 c to l

that S(Cok,t ) also satisfies assumption 4-3(ii) and (iii) for all t<~(Y') if

k-> co, and hence that ~C,ot is open. That P~ctol(ff~k)--> P~c,ol(~o) if k-> co also

follows directly along the lines of the proof of theorem 5-10. D D _=D

The results for J-2%oi, P~toz' and t~to z follow in a completely similar

_tol ~D it follows that way. As none of the restrictions ¢t is critical on %ol N D

~ • . ~ . . F - ' t o l ( f f J / = t " = t o z ( ~ for we£2%o F •

REFERENCES

[1] Akaike, H., Canonical correlation analysis of time series and the use of an information criterion, in Mehra, R.K., and D.G. Lainiotis (eds.), System identification: advances and case studies, Academic Press, New York, 1976, pp. 27-96.

[2] Akhiezer, N.I., and I.M. Glazman, Theory of linear operators in Hilbert space, volume I, Frederick Ungar, New York, 1961.

[3] Anderson, B.D.O., and J.B. Moore, Optimal filtering, Prentice-Hall, Englewood Cliffs, New Jersey, 1979.

[4] Anderson, T.W., The statistical analysis of time series, Wiley, New York, 1971.

[5] Box, G.E.P., and G.M. Jenkins, Time series analysis, forecasting and control, ttolden-Day, San Francisco, 1970.

[6] Brillinger, D.R., Time series analysis, data analysis and theory, ltolt, Rinehart and Winston, New York, 1975.

[7] Caines, P.E., On the scientific method and the foundations of system identification, in Byrnes, C.I., and A. Lindquist (eds.), Modelling, identification and robust control, North-Holland, Amsterdam, 1986, pp. 563-580.

[8] Chen, C.T., Linear syste~n theory and design, Holt, Rinehart and Winston, New York, 1984.

[9] Corr~a, G.O., and K. Glover, Pseudo-canonical forms, identifiable parametrizations and simple parameter estimation for linear multivariable systems, Automatica 20 (4), 1984, pp. 429-452.

[10] Davis, C., and W.M. Kahan, The rotation of eigenvectors by a perturbation III, SIAM journal on numerical analysis 7 (1), 1970, pp. 1-46.

[11] Davis, M.H.A., and R.B. Vinter, Stochastic modelling and control, Chapman and Hall, London, 1985.

[12] Federer, 1t., Geometric measure theory, Springer, Berlin, 1969. [13] Finesso, L., and G. Picci, Linear statistical models and stochastic

realization theory, in Lecture notes in control and information sciences 62 part 1, Springer, Berlin, 1984, pp. 445-470.

[14] Fomby, T.B., R.C. Hill and S.R. Johnson, Advanced econo~netric methods, Springer, New York, 1984.

[15] Gantmacher, F.R., The theory of matrices, volume I, Chelsea, New York, 1959.

[16] Gevers, M., and V. Wertz, Uniquely identifiable state-space and ARMA parametrizations for multivariable linear systems, AutoTnatica 20 (3), 1984, pp. 333-347.

[17] Glover, g., All optimal llankel norm approximations of linear multivariable systems and their L ~ error bounds, International journal of control 39 (6), 1984, pp. 1115-1193.

[18] Glover, K., and J.C. Willems, Parmnetrizations of linear dynamical systems, canonical forms and idcntifiability, IEEE transactions on automatic control AC-19 (6), 1974, pp. 640-6.t6.

286 References

[19] Golub, G.H., and C.F. Van Loan, An analysis of the total least squares problem, SIAM journal on numerical analysis 17 (6), 1980, pp. 883-893.

[20] Golub, G.H., and C.F. Van Loan, Matrix computations, Johns Hopkins University Press, Baltimore, 1983.

[21] Guidorzi, R., Invariants and canonical forms for systems structural and parametric identification, Automatica 17 (1), 1981, pp. 117-133.

[22] Hannan, E.J., Multiple time series, Wiley, New York, 1970. [23] Hannan, E.J., and M. Deistler, The statistical theory of linear systems,

Wiley, New York, 1988. [24] Hannan, E.J., and L. Kavalieris, Multivariate linear time series models,

Advances in applied probability 16 (3), 1984, pp. 492-561. [25] Hazewinkel, M., and R.E. Kalman, On invariants, canonical forms and

moduli for linear, constant, finite dimensional, dynamical systems, in Lecture notes in economics and mathematical systems 131, Springer, Berlin, 1976, pp. 48-60.

[26] Heij, C., Exact modelling of a finite data sequence, Proceedings of 25th IEEE conference on decision and control, Athens, 1986, pp. 1743-1744.

[27] Heij, C., Approximate modelling of deterministic systems, in Curtain, R.F. (ed.), Modelling, robustness and sensitivity reduction in control systems, NATO ASI series, Springer, Berlin, 1987, pp. 271-283.

[28] Heij, C., Exact modelling of a finite time series, SIA3I journal on control and optimization 26 (1), 1988, pp. 83-111.

[29] Heij, C., and J.C. Willems, Consistency analysis of approximate modelling procedures, in Byrnes, C.I., C.F. Martin and R.E. Saeks (cds), Linear circuits, systems and signal processing : theory and application, North-Holland, Amsterdam, 1988, pp. 445-456.

[30] Heij, C., and J.C. Willems, A deterministic approach to approximate modelling, in Willems, J.C. (ed.), From data to model, Springer, Heidelberg. To appear.

[31] Hinrichsen, D., and J.C. Willems (eds.), Special issue on parametrization problems, IMA journal of mathematical control and information 3 (2&3), 1986, pp. 59-254.

[32] Jayant, N.S., and P. Noll, Digital coding of wavefo~ns, Prentice-Hall, Englewood Cliffs, New Jersey, 1984.

[33] Kailath, T., Linear systems, Prentice-Hall, Englewood Cliffs, New Jersey, 1980.

[34] Kalman, R.E., On minimal partial realizations of a linear input/output map, in Kalman, R.E., and N. De Claris (eds.), Aspects of network and system theory, Holt, Rinehart and Winston, New York, 1971, pp. 385-407.

[35] Kalman, R.E., Identification from real data, in Hazewiukel, M., and A.H.G. Rinnooy Kan (eds.), Current developments in the interface: economics, econometrics, mathematics, Reidel, Dordrecht, 1982, pp. 161-196.

[36] Kalman, R.E., System identification from noisy data, in Bednarek, A.R., and L. Cesari (eds.), Dynamical systems II, Academic Press, New York, 1982, pp. 135-164.

[37] Kaiman, R.E. , Identifiability and modelling in econometrics, in Krishnaiah, P.R. (ed.), Developments in statistics, volume 4, Academic Press, New York, 1983, pp. 97-136.

[38] Kalman, R.E., and R.S. Bucy, New results in linear filtering and prediction theory, Transactions ASME, Journal of basic engineering, series 83D, 1961, pp. 95-108.

References 287

[39] Kalman, R.E., P.L. Falb and M.A. Arbib, Topics in mathematical system theory, McGraw-Hill, New York, 1969.

[40] Kato, T., Perturbation theory for linear operators, Springer, Berlin, 1966.

[41] Kendall, M.G., and A. Stuart, The advanced theory of statistics, 3 volumes, Griffin, London, 1958, 1961, 1966.

[42] Koopmans,T.C. (ed.), Statistical inference in dynamic economic models, Cowles commission for research in economics, monograph no. 10, Wiley, New York, 1950.

[43] Kullback, S., Information theory and statistics, Wiley, New York, 1959. [44] Kumar, P.R., and P. Varaiya, Stochastic systems: estimation,

identification and adaptive control, Prentice-Hall, Englewood Cliffs, New Jersey, 1986.

[45] Lax, P.O., and R.S. Phillips, Scattering theory, Academic Press, New York, 1967.

[46] Lindquist, A., and M. Pavon, On the structure of state-space models for discrete-time stochastic vector processes, IEEE transactions on automatic control AC-29 (5), 1984, pp. 418-432.

[47] Lindquist, A., and C. Picci, Realization theory for multivariate stationary gaussian processes, SIAbl journal on control and optimization 23 (6), 1985, pp. 809-857.

[48] Ljung, L., Convergence analysis of parametric identification methods, IEEE transactions on automatic control AC-23 (5), 1978, pp. 770-783.

[49] Ljung, L., A non-probabilistic framework for signal spectra, Proceedings of 24th IEEE conference on decision and control, Fort Lauderdale, 1985, pp. 1056-1060.

[50] Ljung, L., System identification: theory for the user, Prentice-Hall, Englewood Cliffs, New Jersey, 1987.

[51] Ljung, L., and P.E. Caines, Asymptotic normality of prediction error estimators for approximate system models, Stochastics 3 (1), 1979, pp. 29-46.

[52] Ljung, L., and T. S6derstr6m, Theory and practice of recursive identification, MIT Press, Cambridge, Mass., 1983.

[53] Malinvaud, E., Statistical methods of econometrics, North-Holland, Amsterdam, 1970.

[54] Moore, B.C. , Principal component analysis in linear systems: controllability, observability, and model reduction, IEEE transactions on automatic control AC-26 (1), 1981, pp. 17-32.

[55] Nieuwenhuis, J.W., and J.C. Willems, Continuity of dynamical systems: a system theoretic approach, Mathematics of control, signals, and systems 1(2), 1988, pp. 147-165.

[56] Northcott, D.G., Lessons on rings, modules and multiplicities, Cambridge University Press, London, 1968.

[57] Payne, H.J., and L.M. Silverman, On the discrete time algebraic Riccati equation, IEEE transactions on automatic control AC-18 (3), 1973, pp. 226-234.

[58] Pernebo, L., and L.M. Silverman, Model reduction via balanced state space representations, IEEE transactions on automatic control AC-27 (2), 1982, pp. 382-387.

[59] Rissanen, J., Modeling by shortest data description, Automatica 14 (5), 1978, pp. 465-471.

[60] Rissanen, J., Stochastic complexity and modeling, The annals of

288 References

statistics 14 (3), 1986, pp. 1080-1100. [61] Rosenbrock, H.H., State-space and multivariable theory, Wiley, New York,

1970. [62] Shibata, R., Asymptotically efficient selection of the order of the

model for estimating parameters of a linear process, The annals of statistics 8 (1), 1980, pp. 147-164.

[63] Silverman, L.M., Realization of linear dynamical systems, IEEE transactions on automatic control AC-16 (6), 1971, pp. 554-567.

[64] Slepian, D. (ed.), Key papers in the development of information theory, IEEE press selected reprint series, IEEE, New York, 1974.

[65] Sorensen, H.W., Kalman filtering: theory and application, IEEE press selected reprint series, IEEE, New York, 1985.

[66] Stewart, G.W., Error and perturbation bounds for subspaces associated with certain eigenvalue problems, SIA~I review 15 (4), 1973, pp. 727-764.

[67] Stewart, G.W., Introduction to matrix computations, Academic Press, New York, 1973.

[68] Tether, A.J., Construction of minimal linear state-variable models from finite input-output data, IEEE transactions on automatic control AC-15 (4), 1970, pp. 427-436.

[69] Theil, H., Principles of econometrics, Wiley, New York, 1971. [70] Willems, J.C., Least squares stationary optimal control and the

algebraic Riccati equation, IEEE transactions on automatic control AC-16 (6), 1971, pp. 621-634.

[71] Willems, J.C., System theoretic models for the analysis of physical systems, Ricerche di automatica 10 (2), 1979, pp. 71-106.

[72] Willems, J.C., Input-output and state-space representations of finite-dimensional linear time-invariant systems, Linear algebra and its applications 50 (1), 1983, pp. 581-608.

[73] Willems, J.C., From time series to linear system, part I: Finite dimensional linear time invariant systems; part II: Exact modelling; part III: Approximate modelling, Automatica 22 (5), 1986, pp. 561-580; 22 (6), 1986, pp. 675-694; 23 (1), 1987, pp. 87-115.

[74] Willems, J.C., Models for dynamics, in Kirchgraber, U., and H.O. Walther (eds.), Dynamics reported, volume 2, Wiley and Teubner, 1989, pp. 171-269.

[75] Willems, J.C., and C. Heij~ /2-systems and their scattering representation, in Bart, H., I. Gohberg and M.A. Kaashoek (eds.), Operator theory and systems, Operator theory: advances and applications, volume 197 Birkh~.user, Basel, 1986, pp. 443-448.

[76] WiUems, J.C., and C. Ileij, Scattering theory and approximation of linear systems, in Byrnes, C.I., and A. Lindquist (eds.) Modelling, identification and robust control, North-Holland, Amsterdam, 1986, pp. 397-411.

[77] Wolovich, W.A., Linear multivariable systems, Springer, New York, 1974.

S y m b o l index

This index contains symbols which are used more than local ly in this

monograph. The other symbols are explained in the nota t ion paragraphs in the

text . The numbers indicate the pages where the symbols are defined.

s e t s

N 34

Z+ 58

Z 32

~+ 15

Co~C+~C_ 97

(Rq) z 32

l~ 94

p o l y n o m i a l s

Rg×q[s,s -1] 34

Rlt×q[s,s -1] 40

d(r) , d(R) 39

~(r), ~(R) 51

c(R) 76

e(R) 39

mode l s mode l o b j e c t i v e s

B 31 c 22

B t 4 0 C D 127

B(R) 35 c P 135

BT(R ) 52 ¢ 22

B ± 3 6 e D 128, 146

B f 40 eP 136, 150

B s 46 e D 128, 144

RB s 49 e P 135, 148

Bs(A,B,C,D) 46 u 23

B](A,B,C,D) 96 Cto z 23

c(B) 91 eto z 27

ct(B ) 91 UCtot 23, 152

e*(B) 91 Ueto~ 27, 155

re(B) 91 * 15,1 U V to l

n( B) 91 e( Ctol) 159

data

w 31

143

q 32

2" 142

D 22

mode l c l a s se s

N 22

~3 174

[] 33

[]T, BT 51

S T 76 N , []T 84

[]2 95

Be 96

• ( T ) 151

[]~T' s 64 []PT

290 Symbol index

p r o c e d u r e s

P 22

Pu 23

Pctol 24

Ctol 132, 153

PP 140, 153 c tol

P e tol 28

P e tot 28

I~e to z 132, 156

P~tol 140, 156

~D 133, 157 ¢ to/

~etol 142, 157

P~ 57

p K /~K 70, 73

pO 76

P*, P* 81, 84

PT 63

im(PT) 64

misce l lanea

G

0 "-1, 0"*

a(A) #(v) #(7) A~±A2

col

[q,td Ch. I I I

HT(W) L(w) R(w)

ENT, ent

Ch. IV + ++

Z- - , Z- , Z , Z

L+, L_

(A+,B+,C+,D+)

(A_,B_,C_,D_)

K+

O_+ R

V

X

gr(F)

Ch. V

V t

Lt, Vt

L

Ft ?q j-)

S(~,d) A

32

34, 98

97

57

174

43

36

5O

71

8O

8O

72

94

99

103

105

102

104

112

49

46

46

97

4O

41

42

44

44

151

151

172

Subject index

A

algorithm for approxhnation by balancing for L+ for L_

for 1~cta

for

for I~cta for I~to l for ~tol

anticausality approximation

model optimal

autoregressive system

B

balancing behaviour

C

canonical correlation canonical form

descriptive minimal predictive

causality compatibility complementary space completeness complexity

descriptive of dynamical systems map predictive

consistency controllability corroboration

principle strong, weak

D

degree deterministic

113 104 105

162

164

166

166

169

170

97

89 174 35

111 31

139 37 42 37 44 97 78 40 33

127 91 22

135 175 96

22 64

42 4

driving operator driving variable

E

empirical covariance matrix equation error equation structure

corresponding to a complexity

equivalent parametrizations exact ness external variable

G

gap genericity

A-genericity

I

identification impulse response

stable incomplete tlankel array index input

L

lag structure law lexicographic ordering linear

procedure system

M

minimax property minimum description length misfit

descriptive of a law of a model

map predictive

of a law of a model

misspecification modelling

approximate deterministic

100 46

131 128 39

159 36 66 46

109 63

172

1 58

196 71

101 48

39 35 93

68 32

130 29

144 146 22

147 148 172

126 4

292 Subject index

exact 56 static 127 stochastic 4 under complexity constraint 23 under misfit constraint 27

monotonicity 66 bilateral 66

most powerful unfalsified model 60

O

order 39 output 48

P

parametrization 37 perfect observabili ty 47 Pontryagin

isometry 101 space 101

predicted functional 44 prediction polynomial 44 predictive law 44 procedure 22

corresponding to a utility 23 for deterministic time series analysis 143 identification 1 partial realization 70

prudence 64

R

row proper bilaterally zero order bilaterally

saJnple ra te scaling scattering representation

backward forward

scattering theorem selection rule sensibility shift shift invariant

procedure system

signal to noise rat io simplicity principle simultaneous equation model singular value

decomposition specification state variable stochastic system

AR backward state space controllable deterministic dynamical finite time generating 12 state space

realization backward 49 T input /s ta te /output 48 12 96 minimal 46 forward 46

realization problem minimal 57 partial 59

relative mean prediction error 147 remarkabili ty 76 representation U

autoregressive 35 shortest lag 39 tightest equation 39

Riccati equation 102 robustness 187

time invariance time reverse operator time series analysis total least squares translation invariance truly t - th order laws truthfulness

undominated unfalsified model unimodular matrix utility

52 39 52

197 203

99 99 98 73

159 32

67 50

193 22 25

130 130

2 46

4

35 49 96 31 31 50

174 95 46

32 49

143 133 50 41 69

57 36 23

Lecture Notes in Control and Information Sciences Edited by M. Thoma and A, Wyner

Vol. 81: Stochastic Optimization Proceedings of the International Conference, Kiew, 1984 Edited by I. Arkin, A. Shiraev, R. Wets X, 754 pages, 1986.

Vol. 82: Analysis and Algorithms of Optimization Problems Edited by K. Malanowski, K. Mizukami VIII, 240 pages, 1986.

Vol. 83: Analysis and Optimization of Systems Proceedings of the Seventh International Conference of Analysis and Optimization of Systems Antiba, June 26-27, 1986 Edited by A. Bensoussan, J. L. Lions XVl, 901 pages, 1986.

Vol. 84: System Modelling and Optimization Proceedings of the 12th IFIP Conference Budapest, Hungary, September 2-6, 1985 Edited by A. Prekopa, J. Szelezs&n, B. Strazicky Xll, 1046 pages, 1986.

Vol. 85: Stochastic Processes in Underwater Acoustics Edited by Charles R. Baker V, 205 pages, 1986.

Vol. 86: Time Series and Linear Systems Edited by Sergio Bittanti XVII, 243 pages, 1986.

Vol. 87: Recent Advances in System Modelling and Optimization Proceedings of the IFIP-WG 7/! Working Conference Santiago, Chile, August 27-31, 1984 Edited by L. Contesse, R. Correa, A. Weintraub IV, 199 pages, 1987.

Vol. 88: Bruce A. Francis A Course in Hoo Control Theory Xl, 156 pages, 1987.

Vol. 88: Bruce A. Francis A Course in Hoo Control Theory X, !50 pages, 1987.

Corrected - 1st printing 1987

Vol. 89: G. K. H. Pang/A. G. J. McFarlane An Expert System Approach to Computer-Aided Design of Multivariable Systems XII, 223 pages, 1987.

Vol. 90: Singular Perturbations and Asymptotic Analysis in Control Systems Edited by P. Kokotovic, A. Bensoussan, G. Blankenship Vl, 419 pages, 1987.

Vol. 91 Stochastic Modelling and Filtering Proceedings of the IFIP-WG 7/1 Working Conference Rome, Italy, Decembre 10-14, 1984 Edited by A. Germani IV, 209 pages, 1987.

Vol. 92: L.T. Gruji6, A. A. Martynyuk, M. Ribbens-Pavella Large-Scale Systems Stability Under Structural and Singular Perturbations XV, 366 pages, 1987.

Vol. 93: K. Malanowski Stability of Solutions to Convex Problems of Optimization IX, 137 pages, 1987.

Vol. 94: H. Krishna Computational Complexity of Bilinear Forms Algebraic Coding Theory and Applications to Digital Communication Systems XVlII, 166 pages, 1987.

Vol. 95: Optimal Control Proceedings of the Conference on Optimal Control and Variational Calculus Oberwolfach, West-Germany, June 15-21, 1986 Edited by R. Bulirsch, A. Miele, J. Stoer and K. H. Well XlI, 321 pages, 1987.

Vol. 96: H. J. Engelbert/W. Schmidt Stochastic Differential Systems Proceedings of the IFIP-WG 7/1 Working Conference Eisenach, GDR, April 6-13, 1986 XII, 381 pages, 1987.

Lecture Notes in Control and Information Sciences Edited by M.Thoma and A.Wyner

Vol. 97: I. Lasiecka/R. Triggiani (Eds.) Control Problems for Systems Described by Partial Differential Equations and Applications Proceedings of the IFIP-WG 7.2 Working Conference Gainesville, Florida, February 3-6, 1986 VIII, 400 pages, 1987.

Vol. 98: A. Aloneftis Stochastic Adaptive Control Results and Simulation Xll, 120 pages, 1987.

Vol. 99: S. P. Bhattacharyya Robust Stabilization Against Structured Perturbations IX, 172 pages, 1987.

Vol. 100: J. P. Zolesio (Editor) Boundary Control and Boundary Variations Proceedings of the IFIP WG 7.2 Conference Nice, France, June 10-13, lg87 IV, 398 pages, 1988.

Vot. 101: E E.Crouch, A. J. van der Schaft Variational and Hamiltonian Control Systems IV, 121 pages, 1987.

Vol. 102: E Kappel, K. Kunisch, W. Schappacher (Eds.) Distributed Parameter Systems Proceedings of the 3rd International Conference Vorau, Styria, July 6-12, 1986 VII, 343 pages, 1987.

Vol. 103: E Varaiya, A. B. Kurzhanski (Eds.) Discrete Event Systems: Models and Applications IIASA Conference Sopron, Hungary, August 3-7, 1987 IX, 282 pages, lg88.

Vol. 104: J. S. Freudenberg/D. P. Looze Frequency Domain Properties of Scalar and Multivariable Feedback Systems VIII, 281 pages, 1988.

Vol. 105: Ch. I. Byrnes/A. Kurzhanski (Eds.) Modelling and Adaptive Control Proceedings of the IIASA Conference Sopron, Hungary, July 1986 V, 379 pages, lg88.

Vol. 106: R. R. Mohler (Editor) Nonlinear Time Series and Signal Processing V, 143 pages. 1988.

Vol. 107: Y. T. Tsay, L.-S. Shieh, St. Barnett Structural Analysis and Design of Multivariable Systems An Algebraic Approach VIII, 208 pages, 1988.

Vol. 108: K. J. Reinschke Multivariable Control A Graph-theoretic Approach 274 pages, 1988.

Vol. 109: M. Vukobratovic/R. Stojic Modern Aircraft Flight Control Vl, 288 pages, 1988.

Vol. 110: In preparation

Vol. 111: A. Bensoussan, J. L. Lions (Eds.) Analysis and Optimization of Systems XlV, 1175 pages, 1988.

Vol. 112: Vojislav Kecman State-Space Models of Lumped and Distributed Systems IX, 280 pages, 1988

Vol. 113: M. Iri, K. Yajima (Eds.) System Modelling and Optimization Proceedings of the 13th IFIP Conference Tokyo, Japan, Aug. 31 - Sept. 4, 1987 IX, 787 pages, 1988.

Vol. 114: A BermL]dez (Editor) Control of Partial Differential Equations Proceedings of the IFIP WG 7.2 Working Conference Santiago de Compostela, Spain, July 6-9, 1987 IX, 318 pages, 1989

Vol. 115: H.J. Zwart Geometric Theory for Infinite Dimensional Systems VIII, 156 pages, 1989.

Vol. 116: M.D. Mesarovic, Y~ Takahara Abstract Systems Theory VIII, 439 pages, 1989

Documents

APPENDIX • PROOFS - Springer978-3-540-46196-8/1.pdf · Chapter II 219 hence L+ and L_ have full row rank. As L_ has full row rank, R (t) consists of the rows of R of order t, and