7
209 THE DIFFERENTIAL OF A PRIMARY MATRIX FUNCTION by R. F. Rinehart (*) (Monterey, U. S. A.) w 1. THE HAUSDORFF DIFFERENTIAL If ~ is a finite dimensional linear associative algebra with identity over the real or complex field, with basis el ..... era, and f a function with domain and range in _~, then f is said to be Hausdorff differentiable [1] if (a) the component functions fi of f= E fiei are differentiable functior, s of the xi in x = F, xiei, and if (b) df ~ Z m dfi d xjei is expressible in the form i.j=l Oxi dr= y ki=iAidxBi, where dx = Y,j~ldxiei and the Ai, Bi are elements of _c4 depending only on xl .... , xm, and not on dxt, ..., dx,,. Condition (b) leads to a system of linear equations over the real or complex field for the determination of the Ai and Bi (which are not unique) [6], and the Ai and Bi, if they exist, involve the partial derivatives of the component funcfions fi(xj, ..., x,,). Explicit formulations of the Ai, Bi have not been developed. It is well known [6] that if ~ is normal simple, condition (a) is sufficient for HausdorII differentiability, i.e. the equations implied in condition (b) are always solvable for the Ai and Bi. If ~ is c/~., the algebra of n X n complex matrices, and f is a primary function [4], i.e. a function arising from the extension of a function f(z) of a complex variable z to c~, then f(X), Xs is known to be HausdorIf differentiable in any open set of 9/~ whose matrices have eigenvalues which (*) Supported by Army Research Office (Durham). 14 - Rend, Circ. Matem. Palermo - Serie 11 - Tomo XV - Anno 1986

The differential of a primary matrix function

Embed Size (px)

Citation preview

209

THE DIFFERENTIAL OF A PRIMARY MATRIX FUNCTION

by R. F. Rinehart (*) (Monterey, U. S. A.)

w 1. THE HAUSDORFF DIFFERENTIAL

If ~ is a finite dimensional linear associative algebra with identity over

the real or complex field, with basis el . . . . . era, and f a function with domain

and range in _~, then f is said to be Hausdorff differentiable [1] if (a) the

component functions f i of f = E f ie i are differentiable functior, s of the xi in

x = F, xiei, and if (b) d f ~ Z m d f i d xjei is expressible in the form i.j=l Oxi

d r = y ki=iAidxBi, where dx = Y,j~ldxiei and the Ai, Bi are elements of _c4

depending only on xl . . . . , xm, and not on dx t , . . . , dx , , .

Condition (b) leads to a system of linear equations over the real or complex

field for the determination of the Ai and Bi (which are not unique) [6], and the

Ai and Bi, if they exist, involve the partial derivatives of the component

funcfions f i ( x j , . . . , x,,). Explicit formulations of the Ai, Bi have not been

developed. It is well known [6] that if ~ is normal simple, condition (a) is

sufficient for HausdorII differentiability, i.e. the equations implied in condition

(b) are always solvable for the Ai and Bi.

If ~ is c/~., the algebra of n X n complex matrices, and f is a primary

function [4], i.e. a function arising from the extension of a function f ( z ) of a

complex variable z to c ~ , then f ( X ) , Xs is known to be HausdorIf

differentiable in any open set of 9/~ whose matrices have eigenvalues which

(*) Supported by Army Research Office (Durham). 14 - R e n d , C i rc . M a t e m . P a l e r m o - S e r i e 11 - T o m o X V - A n n o 1 9 8 6

2 1 0 R . F . RINEHART

are points of analyticity of f ( z ) [2]. For this special algebra and this special

but important class of functions, it might be expected that more explicit

descriptions of the A~, B~ of condition (b) might be attainable. This indeed

turns out to be the case if the domain of f in c ~ is restricted to be in the

set CKc of complex matrices with distinct eigenvalues.

w 2. PRIMARY FUNCTIONS ON Q('C

The algebra ~ endowed witl~ the topology induced by any convenient

matrix norm is a metric space. The set q~c C 9E~. of matrices X with distinct

eigenvalues is an open set of the space since the eigenvalues are continuous

functions of the elements of X. It has been shown in [5] that if P - ' X P = = d g [ X i , . . . , Xn]=D, then D is a Hausdorff differentiable function of X

throughout q(r and that for any Xo~CXc there exists an open set 9~, XoE~C~Xc, such that in ~ , P can be taken to be Hausdorff differentiable. (These assertions

are not valid at X ' s with repeated eigenvalues).

Theorem 2.1. Let f ( z ) be a function of the complex variable z and let ~ c be

the open set of matrices of ~ . ~ with distinct eigenvalues which lie in the region

of analyticity of f(z) . Let XoE.L?c and let N o D X o be an open set on which

the matrix P in P - ' X P = dg[),i, . . . , ~,,] is a (Hausdorff) differentiable function

of X. Let f ( X ) be the extension to c ~ . of f(z) . Then on Q = No A .~c the

differential of f (X ) is expressible as

d f (X) ~- f" (X) dX -[- (d P) P- ~ f (X) -- f (X) (d P) P- ' - - f" (X) [(d P) P - ' X - - X (d P) P-'].

Proof. Let X ~ Q . Then X = Pdg[).l, . . . , X n ] p - i = D, and P and D are

(Hausdorff) differentiable functions of X [5]. Hence by the customary rules for

differentiating matrix functions of scalar variables,

I d X = (dP)dg[X 1 .... , ),n] P - ' ~ Pdg[d~.~ .... , d ~] P - ' - - P d g [),~ ..... X~] P-' (dP)P- '

(I) = ( d P ) P - ' X - - X ( d P ) P - ' -[-Pdg{dX~,...,dX~]P-'.

Now as is well known f ( X ) = P d g [ f ( ) . ~ ) , . . . , f ( ; ~ ) ] P - ' [3] and hence

d f (X) = (dP)dg[f(X~), ..., f(~.n)]P-' + P d g [ f ' ( ) , j ) d ~ . . . . , f'(Xn) d),n]P -~

(2) - - P dg [f(),~) . . . . , f(),,)] P - ' (d P) P - '

= (dP)P- ' f (X) - - f (X) (dP) P-'-l-Pdg [ff(),,) ..... f'(),~)]dg[dX, .... , d),n] P- ' .

T H E DIFFERENTIAL OF A PRIMARY MATI~IX FUNCTION 2 1 1

Noting that Pdg[f'(X~) . . . . , f'(~.,)] P - ' = f ' ( X ) where f ' (X ) is the c~ -ex t ens ion

of if(z), and substituting from (1) for Pdg[dXt, . . . . dXn]P -1, (2) yields,

(3) df(X) = (dP)P-~f(X)-- f (X) (dP)P -~ + f ' ( X ) [ d X - - (dP)P- tX+ X(dP)P-~],

the formula of the theorem.

It may be noted that since dg[f'Ou)d),i , . . . . f'().n)d).n] could also be

factored in the order dg[d), i , . . . , d),,], dg [f ' () ' t ) . . . . , f ' 0",)], another equivalent

expression can be obtained in which f ' ( X ) occurs as a right factor.

Since P is differentiable in Q, dP is representable in the form d P = XHidXKi. Equation (3) therefore provides a Hausdorflian expression for df(X) of the form

required in condition (b), since P depends only on X. The expression (3)

exhibits the dependency of the Ai and Bz on the function f in explicit form;

the other contributions to the coefficients depend only on X, are independent

of the function f. Further, the only inexplicitness occurs in the recurring

factor (dP)P -~ .

w 3. REAL ARGUMENT MATRICES

It has been oullined in [5] how the concepts of Hausdorff differentiability

and Hausdorff derivative can be adapted to the case of a function whose domain tz n

is in c~R, the algebra of real mafrices and whose range is in c??~c. It was

shown that in this case the matrices P such that P - ' X P ~ - d g [ ) . t , . . . , ~] could be taken to be globally defined and differentiable on the entire open set

~R of real matrices with distinct eigenvalues, by choosing the columns of P

as eigenvectors of unit Hermitian length.

If now f (z) is a function such that f ( z ) = f ( z ) and is analytic at the

eigenvalues of a real matrix X, then the primary function f (X) is real matrix

valued [4]. For such a function f(z) , the corresponding statement of theorem 2.1

for real matrices is also valid for the global domain .~R, where Z?n is the open

set of matrices of 9]~.~ whose eigenvalues are distinct and are points of analyticity

of / (z ) .

w 4. THE MATRIX ( d P ) P -1

Since, ab initio, p - i involves the eigenvalues ;~i and dP involves the X t and

their differentials, the elements of dP and p-X, expressed as functions of the

elements of X and their differentials, will involve the elements of X irrationally.

212 R.r. mr~ruARr

However, for a large class of domains the matrix P can be so chosen that

the product ( d P ) P -~ is, rather surprisingly, rational in the elements of X, as

will be shown.

Let ~ be either the real field c)~ or complex field C, let eSe denote the

field F ( x i i , x ~ , . . . , x ,u) of all rational functicns of xi~, x~2 . . . . , x,,, over

r and let c~e-----de(~.~, . . . , ~.n) denote the splitting field of det (X- - ;~ I ) .

L e m m a 4.1. Each differential d~.i, i-----1, . . . . n, is a linear homogeneous

function of the differentials d x r s , r, s = 1, . . . , n, with coefficients belonging

to ~e. P r o o f . Each of the power sums Z" m .. i=~.~, m-----1, . , n is a rational integral

function of the coefficients of ~. in dot ( X - - ) ~ I ) ~ 0 and hence is a rational

integral function of the xrs, p,,,(x~s). Taking differentials of each of these

identities yields

(4.1) ~=1 )J~-'d~ ~ dp, , , (xr~)/m, m ~ I, . . . , n.

' T h e right members of these equations are linear homcgeneous polynomials in

the dx~ , , whose coefficients belong to c~e. The matrix of coefficients of the

d~.~ in 4.1 is the Vandermonde matrix associated with ~.~, X~, . . . , ~ . It is

nonsingular in the domain eke of matrices over ~ with distinct eigcnvalues,

and its elements belong to 3e . Hence in CKF, 4.1 is uniquely solvable for the

d ~.,, and each d),,. is a linear homogeneous function of the dx~, with coefficients

belonging to ~e.

Let @h denote the set of matrices X of eke for which column h of each

of the matrices ( X - - ~ i I ) A, i = 1 . . . . , n is not zero. Since the elements of

each of the matrices ( X - - X i I ) A are continuous functicns of the elements of X,

@, is an open subset of eke. Each nonzero column of ( X - ;~l)a is an

eigenvector of X corresponding to the eigenvalue )'i. Hence the matrix P whose

i-th column P~, i = 1 . . . . , n, is column h of ( X - - ) ~ , I ) a is a nonsingular

matrix such that P - ~ X P - - - - d g ( Z ~ , . . . , ~,) and is a differentiable function

of x [5]. Let ~ be the Galois group of the equation det ( X - ) ~ I ) ~ 0, whose

coefficient field is eSe. Let toe s be any permutation of the roots ~.~, . . . , )., of

the equation. The associated set of eigenvectors P~, . . . , P,, will undergo the

same permutation. If Z represents the permutation matrix obtained by applying

the permutation r to the columns of the identity matrix, then the matrix P

T I I E D I F F E R E N T I A L OF A P R I M A R Y M A T R I X F U N C T I O N 213

above becomes Po, = P Z , d P becomes dP, , = ( d P ) Z, and P- ' becomes -i _~p-i. , ). - 1 P,o = Z Hence, under the permutation to of the roots Xt, ... ,,(dPo,)Po, - -

( d P ) Z Z - ~ P - I ~ ( d P ) P -I and ( d P ) P -~ is invariant under any to of Q. This

also follows in exactly the same way if, in the real case, the columns of P

are normalized to Hermitian length one.

The elements of P, being cofactors from the matrices X - - ~ . z l , belong to

3F. Hence the elements of P - ' also belong to c~p. The elements of d P are

differentials of elements of c~r and therefore are linear homogeneous functions

of the differentials dx,s and dX~, with coefficients in c~r. Therefore, by Lemma

4.1. the elements of d R are expressible as linear homogeneous polynomials

in the dx,s with coefficients in c~. This, together with the invariance under -Q

implies that the coefficients of the dx,s in ( d P ) P - ' belong to dr=, i.e. are

rational functions of the x,s. If in the case c3:-~-c~ each column P~ of P is normalized by dividing by

its hermitian length, the conclusions of the preceding paragraph remain valid.

For let P: denote the corresponding normalized vector, and let P' denote the

matrix of these column vectors. Then the matrix P' of normalized vectors is

P'--= P d g ( l ~ '/', . . . , I~-'/~), where lj denotes the square of the hermitian length

of Pi. Then p.-1 dg(l'J' 1/o -1 , . . . , 1~ '1 , )_ = , . . . . l,,-)P , and d P ' = ( d P ) d g ( l ~ '/'

. . . . . l-~/, d Pdg(___l_l_~/ ,d! 1 ,, / , , )and 2 -t - - 1 . . . . , 2

( 1 -t l l : l ) d g ( d l , ' , d l , ) p - l . (4.2) ( d P ' ) P ' - ' = ( d P ) p - t - - P d g - - - 2 - 1 i . . . . . 2 "'"

As already shown, the elements of ( d P ) P -~ are linear homogeneous polynomials

in the differentials dx~, with coefficients belonging to cSF. In the second term

( 1 1 1:') and of in the above relation, the elements of P, of d g - - ~ l - ~ 1, . . . , 2

p-1 are all in ~,~., since ~zEc3p implies ~ E dp. Since 11 . . . . . I,, are in c~e, the

differentials dlI, . . . , dl~ are linear homogeneous polynomials in the dx~, with

coefficients in ~F by Lemma 4.1.

The invariance of ( d P ) P -~ and ( d t Y ) P '-I under Q has already been shown.

Hence the invariance under f~ of the second term of the right member of

4.2 follows. Hence the coefficients of the dx~, in 4.2 are in cSF. Thus,

Theorem 4.1. Let "7)h be the open set of matrices of C'K~ for which column

h of each of the matrices (X - - ),,. I)A , i = 1 . . . . , n is nonzero. Then in @hP

2 1 4 R . F . RINEHART

can be so chosen that the elements of (dP)P-' lie in the module of linear

homogeneous polynomials in the differentials dxrs with coefficients which are

rational functions of the elements of X. If ~" is the real field, the columns of

P may further be taken to be vectors of unit Hermitian length.

An explicit formula for (dP)P -1 in terms of X and dX would be desirable.

For second order matrices, a direct calculation of P, P - ' and dP, rather lengthy

for inclusion here, has yielded

( o o ) (: Hi - - S , K 1 =

- - 2X~2X~t - - X i 2 ( X 2 2 - - Xii )

( o o) (: :) H~ = S , K s =

- - x ~ (x~ . - - x , , ) 2 x',,

x2t (xii - - X2~) 2 x~2 X21

(o o ) H 4 = S

where

S = {x~ [(x~ - - x2~) ~" = 4x~x2~]l-'.

The quantity (x~ l - x~)~+ 4x,~x~L is equal to ( ;~2-) . t )~, the discriminant of

the characteristic equation of X, and is hence nonzero throughout CKc, the open

set of matrices X with distinct eigenvalues. The above expression for (dP)P -2 is

therefore valid throughout the region of CKc in which x~e :~ 0.

Unfortunately neither the method of calculation nor the form of the results

for c ~ suggest a mode of extension to matrices of arbitrary order.

Monterey (California), July 1965.

THE DIFFERENTIAL OF A PRIMARY MATRIX FUNCTION 2 1 5

REFERENCES

[I] Hausdorf/ F., Zur Theorie tier Sysleme comFlexer Zahlen, Leipziger Berichte, v. 52 (IC00),

pp. 43-61.

[2] Portmann W. O., Hausdorff-analytic functions of matrices, Proc. Amer. Math. Soc., v. 11

(1960), pp. 97-101.

[3] Rinehart R. F., The equivalence of definiticns of a matrix function, Amer. Math. Monthly, v. 62, no. 6 (1955), pp. 395-414.

[4] Rinehart R. F., Elements of a tt.eory of intrinsic functicns on algebras, Duke Maih. Jour., v. 27, (1960), pp. 1-20.

[5] Rinehart R. F., P and D in P - I X P = d g [ k ~ . . . . . xn] = D as matrix function of X,

Submitted to Can. Jour. of Mathematics. [6] Ringleb F., Beltrdge zur Fanktionentheorie in hyperkomplexen Systemen, Rend. d. Circ.

Matem. di Palermo, v. 57 (1933), pp. 311-240.

[7] ZurmUhl R., Matrizen, 4th Ed., Springer-Verlag, (1964).