The differential of a primary matrix function

209

THE DIFFERENTIAL OF A PRIMARY MATRIX FUNCTION

by R. F. Rinehart (*) (Monterey, U. S. A.)

w 1. THE HAUSDORFF DIFFERENTIAL

If ~ is a finite dimensional linear associative algebra with identity over

the real or complex field, with basis el . . . . . era, and f a function with domain

and range in _~, then f is said to be Hausdorff differentiable [1] if (a) the

component functions f i of f = E f ie i are differentiable functior, s of the xi in

x = F, xiei, and if (b) d f ~ Z m d f i d xjei is expressible in the form i.j=l Oxi

d r = y ki=iAidxBi, where dx = Y,j~ldxiei and the Ai, Bi are elements of _c4

depending only on xl . . . . , xm, and not on dx t , . . . , dx , , .

Condition (b) leads to a system of linear equations over the real or complex

field for the determination of the Ai and Bi (which are not unique) [6], and the

Ai and Bi, if they exist, involve the partial derivatives of the component

funcfions f i ( x j , . . . , x,,). Explicit formulations of the Ai, Bi have not been

developed. It is well known [6] that if ~ is normal simple, condition (a) is

sufficient for HausdorII differentiability, i.e. the equations implied in condition

(b) are always solvable for the Ai and Bi.

If ~ is c/~., the algebra of n X n complex matrices, and f is a primary

function [4], i.e. a function arising from the extension of a function f ( z ) of a

complex variable z to c ~ , then f ( X ) , Xs is known to be HausdorIf

differentiable in any open set of 9/~ whose matrices have eigenvalues which

(*) Supported by Army Research Office (Durham). 14 - R e n d , C i rc . M a t e m . P a l e r m o - S e r i e 11 - T o m o X V - A n n o 1 9 8 6

2 1 0 R . F . RINEHART

are points of analyticity of f ( z ) [2]. For this special algebra and this special

but important class of functions, it might be expected that more explicit

descriptions of the A~, B~ of condition (b) might be attainable. This indeed

turns out to be the case if the domain of f in c ~ is restricted to be in the

set CKc of complex matrices with distinct eigenvalues.

w 2. PRIMARY FUNCTIONS ON Q('C

The algebra ~ endowed witl~ the topology induced by any convenient

matrix norm is a metric space. The set q~c C 9E~. of matrices X with distinct

eigenvalues is an open set of the space since the eigenvalues are continuous

functions of the elements of X. It has been shown in [5] that if P - ' X P = = d g [ X i , . . . , Xn]=D, then D is a Hausdorff differentiable function of X

throughout q(r and that for any Xo~CXc there exists an open set 9~, XoE~C~Xc, such that in ~ , P can be taken to be Hausdorff differentiable. (These assertions

are not valid at X ' s with repeated eigenvalues).

Theorem 2.1. Let f ( z ) be a function of the complex variable z and let ~ c be

the open set of matrices of ~ . ~ with distinct eigenvalues which lie in the region

of analyticity of f(z) . Let XoE.L?c and let N o D X o be an open set on which

the matrix P in P - ' X P = dg[),i, . . . , ~,,] is a (Hausdorff) differentiable function

of X. Let f ( X ) be the extension to c ~ . of f(z) . Then on Q = No A .~c the

differential of f (X ) is expressible as

d f (X) ~- f" (X) dX -[- (d P) P- ~ f (X) -- f (X) (d P) P- ' - - f" (X) [(d P) P - ' X - - X (d P) P-'].

Proof. Let X ~ Q . Then X = Pdg[).l, . . . , X n ] p - i = D, and P and D are

(Hausdorff) differentiable functions of X [5]. Hence by the customary rules for

differentiating matrix functions of scalar variables,

I d X = (dP)dg[X 1 .... , ),n] P - ' ~ Pdg[d~.~ .... , d ~] P - ' - - P d g [),~ ..... X~] P-' (dP)P- '

(I) = ( d P ) P - ' X - - X ( d P ) P - ' -[-Pdg{dX~,...,dX~]P-'.

Now as is well known f ( X ) = P d g [ f ( ) . ~ ) , . . . , f ( ; ~ ) ] P - ' [3] and hence

d f (X) = (dP)dg[f(X~), ..., f(~.n)]P-' + P d g [ f ' ( ) , j ) d ~ . . . . , f'(Xn) d),n]P -~

(2) - - P dg [f(),~) . . . . , f(),,)] P - ' (d P) P - '

= (dP)P- ' f (X) - - f (X) (dP) P-'-l-Pdg [ff(),,) ..... f'(),~)]dg[dX, .... , d),n] P- ' .

T H E DIFFERENTIAL OF A PRIMARY MATI~IX FUNCTION 2 1 1

Noting that Pdg[f'(X~) . . . . , f'(~.,)] P - ' = f ' ( X ) where f ' (X ) is the c~ -ex t ens ion

of if(z), and substituting from (1) for Pdg[dXt, . . . . dXn]P -1, (2) yields,

(3) df(X) = (dP)P-~f(X)-- f (X) (dP)P -~ + f ' ( X ) [ d X - - (dP)P- tX+ X(dP)P-~],

the formula of the theorem.

It may be noted that since dg[f'Ou)d),i , . . . . f'().n)d).n] could also be

factored in the order dg[d), i , . . . , d),,], dg [f ' () ' t ) . . . . , f ' 0",)], another equivalent

expression can be obtained in which f ' ( X ) occurs as a right factor.

Since P is differentiable in Q, dP is representable in the form d P = XHidXKi. Equation (3) therefore provides a Hausdorflian expression for df(X) of the form

required in condition (b), since P depends only on X. The expression (3)

exhibits the dependency of the Ai and Bz on the function f in explicit form;

the other contributions to the coefficients depend only on X, are independent

of the function f. Further, the only inexplicitness occurs in the recurring

factor (dP)P -~ .

w 3. REAL ARGUMENT MATRICES

It has been oullined in [5] how the concepts of Hausdorff differentiability

and Hausdorff derivative can be adapted to the case of a function whose domain tz n

is in c~R, the algebra of real mafrices and whose range is in c??~c. It was

shown that in this case the matrices P such that P - ' X P ~ - d g [ ) . t , . . . , ~] could be taken to be globally defined and differentiable on the entire open set

~R of real matrices with distinct eigenvalues, by choosing the columns of P

as eigenvectors of unit Hermitian length.

If now f (z) is a function such that f ( z ) = f ( z ) and is analytic at the

eigenvalues of a real matrix X, then the primary function f (X) is real matrix

valued [4]. For such a function f(z) , the corresponding statement of theorem 2.1

for real matrices is also valid for the global domain .~R, where Z?n is the open

set of matrices of 9]~.~ whose eigenvalues are distinct and are points of analyticity

of / (z ) .

w 4. THE MATRIX ( d P ) P -1

Since, ab initio, p - i involves the eigenvalues ;~i and dP involves the X t and

their differentials, the elements of dP and p-X, expressed as functions of the

elements of X and their differentials, will involve the elements of X irrationally.

212 R.r. mr~ruARr

However, for a large class of domains the matrix P can be so chosen that

the product ( d P ) P -~ is, rather surprisingly, rational in the elements of X, as

will be shown.

Let ~ be either the real field c)~ or complex field C, let eSe denote the

field F ( x i i , x ~ , . . . , x ,u) of all rational functicns of xi~, x~2 . . . . , x,,, over

r and let c~e-----de(~.~, . . . , ~.n) denote the splitting field of det (X- - ;~ I ) .

L e m m a 4.1. Each differential d~.i, i-----1, . . . . n, is a linear homogeneous

function of the differentials d x r s , r, s = 1, . . . , n, with coefficients belonging

to ~e. P r o o f . Each of the power sums Z" m .. i=~.~, m-----1, . , n is a rational integral

function of the coefficients of ~. in dot ( X - - ) ~ I ) ~ 0 and hence is a rational

integral function of the xrs, p,,,(x~s). Taking differentials of each of these

identities yields

(4.1) ~=1 )J~-'d~ ~ dp, , , (xr~)/m, m ~ I, . . . , n.

' T h e right members of these equations are linear homcgeneous polynomials in

the dx~ , , whose coefficients belong to c~e. The matrix of coefficients of the

d~.~ in 4.1 is the Vandermonde matrix associated with ~.~, X~, . . . , ~ . It is

nonsingular in the domain eke of matrices over ~ with distinct eigcnvalues,

and its elements belong to 3e . Hence in CKF, 4.1 is uniquely solvable for the

d ~.,, and each d),,. is a linear homogeneous function of the dx~, with coefficients

belonging to ~e.

Let @h denote the set of matrices X of eke for which column h of each

of the matrices ( X - - ~ i I ) A, i = 1 . . . . , n is not zero. Since the elements of

each of the matrices ( X - - X i I ) A are continuous functicns of the elements of X,

@, is an open subset of eke. Each nonzero column of ( X - ;~l)a is an

eigenvector of X corresponding to the eigenvalue )'i. Hence the matrix P whose

i-th column P~, i = 1 . . . . , n, is column h of ( X - - ) ~ , I ) a is a nonsingular

matrix such that P - ~ X P - - - - d g ( Z ~ , . . . , ~,) and is a differentiable function

of x [5]. Let ~ be the Galois group of the equation det ( X - ) ~ I ) ~ 0, whose

coefficient field is eSe. Let toe s be any permutation of the roots ~.~, . . . , )., of

the equation. The associated set of eigenvectors P~, . . . , P,, will undergo the

same permutation. If Z represents the permutation matrix obtained by applying

the permutation r to the columns of the identity matrix, then the matrix P

T I I E D I F F E R E N T I A L OF A P R I M A R Y M A T R I X F U N C T I O N 213

above becomes Po, = P Z , d P becomes dP, , = ( d P ) Z, and P- ' becomes -i _~p-i. , ). - 1 P,o = Z Hence, under the permutation to of the roots Xt, ... ,,(dPo,)Po, - -

( d P ) Z Z - ~ P - I ~ ( d P ) P -I and ( d P ) P -~ is invariant under any to of Q. This

also follows in exactly the same way if, in the real case, the columns of P

are normalized to Hermitian length one.

The elements of P, being cofactors from the matrices X - - ~ . z l , belong to

3F. Hence the elements of P - ' also belong to c~p. The elements of d P are

differentials of elements of c~r and therefore are linear homogeneous functions

of the differentials dx,s and dX~, with coefficients in c~r. Therefore, by Lemma

4.1. the elements of d R are expressible as linear homogeneous polynomials

in the dx,s with coefficients in c~. This, together with the invariance under -Q

implies that the coefficients of the dx,s in ( d P ) P - ' belong to dr=, i.e. are

rational functions of the x,s. If in the case c3:-~-c~ each column P~ of P is normalized by dividing by

its hermitian length, the conclusions of the preceding paragraph remain valid.

For let P: denote the corresponding normalized vector, and let P' denote the

matrix of these column vectors. Then the matrix P' of normalized vectors is

P'--= P d g ( l ~ '/', . . . , I~-'/~), where lj denotes the square of the hermitian length

of Pi. Then p.-1 dg(l'J' 1/o -1 , . . . , 1~ '1 , )_ = , . . . . l,,-)P , and d P ' = ( d P ) d g ( l ~ '/'

. . . . . l-~/, d Pdg(___l_l_~/ ,d! 1 ,, / , , )and 2 -t - - 1 . . . . , 2

( 1 -t l l : l ) d g ( d l , ' , d l , ) p - l . (4.2) ( d P ' ) P ' - ' = ( d P ) p - t - - P d g - - - 2 - 1 i . . . . . 2 "'"

As already shown, the elements of ( d P ) P -~ are linear homogeneous polynomials

in the differentials dx~, with coefficients belonging to cSF. In the second term

( 1 1 1:') and of in the above relation, the elements of P, of d g - - ~ l - ~ 1, . . . , 2

p-1 are all in ~,~., since ~zEc3p implies ~ E dp. Since 11 . . . . . I,, are in c~e, the

differentials dlI, . . . , dl~ are linear homogeneous polynomials in the dx~, with

coefficients in ~F by Lemma 4.1.

The invariance of ( d P ) P -~ and ( d t Y ) P '-I under Q has already been shown.

Hence the invariance under f~ of the second term of the right member of

4.2 follows. Hence the coefficients of the dx~, in 4.2 are in cSF. Thus,

Theorem 4.1. Let "7)h be the open set of matrices of C'K~ for which column

h of each of the matrices (X - - ),,. I)A , i = 1 . . . . , n is nonzero. Then in @hP

2 1 4 R . F . RINEHART

can be so chosen that the elements of (dP)P-' lie in the module of linear

homogeneous polynomials in the differentials dxrs with coefficients which are

rational functions of the elements of X. If ~" is the real field, the columns of

P may further be taken to be vectors of unit Hermitian length.

An explicit formula for (dP)P -1 in terms of X and dX would be desirable.

For second order matrices, a direct calculation of P, P - ' and dP, rather lengthy

for inclusion here, has yielded

( o o ) (: Hi - - S , K 1 =

- - 2X~2X~t - - X i 2 ( X 2 2 - - Xii )

( o o) (: :) H~ = S , K s =

- - x ~ (x~ . - - x , , ) 2 x',,

x2t (xii - - X2~) 2 x~2 X21

(o o ) H 4 = S

where

S = {x~ [(x~ - - x2~) ~" = 4x~x2~]l-'.

The quantity (x~ l - x~)~+ 4x,~x~L is equal to ( ;~2-) . t )~, the discriminant of

the characteristic equation of X, and is hence nonzero throughout CKc, the open

set of matrices X with distinct eigenvalues. The above expression for (dP)P -2 is

therefore valid throughout the region of CKc in which x~e :~ 0.

Unfortunately neither the method of calculation nor the form of the results

for c ~ suggest a mode of extension to matrices of arbitrary order.

Monterey (California), July 1965.

THE DIFFERENTIAL OF A PRIMARY MATRIX FUNCTION 2 1 5

REFERENCES

[I] Hausdorf/ F., Zur Theorie tier Sysleme comFlexer Zahlen, Leipziger Berichte, v. 52 (IC00),

pp. 43-61.

[2] Portmann W. O., Hausdorff-analytic functions of matrices, Proc. Amer. Math. Soc., v. 11

(1960), pp. 97-101.

[3] Rinehart R. F., The equivalence of definiticns of a matrix function, Amer. Math. Monthly, v. 62, no. 6 (1955), pp. 395-414.

[4] Rinehart R. F., Elements of a tt.eory of intrinsic functicns on algebras, Duke Maih. Jour., v. 27, (1960), pp. 1-20.

[5] Rinehart R. F., P and D in P - I X P = d g [ k ~ . . . . . xn] = D as matrix function of X,

Submitted to Can. Jour. of Mathematics. [6] Ringleb F., Beltrdge zur Fanktionentheorie in hyperkomplexen Systemen, Rend. d. Circ.

Matem. di Palermo, v. 57 (1933), pp. 311-240.

[7] ZurmUhl R., Matrizen, 4th Ed., Springer-Verlag, (1964).

Documents

The differential of a primary matrix function