Understanding the QR algorithm, Part X · 2. Fundamentals of Matrix Computations, Wiley, 1991 3....

Understanding the QR algorithm,Part X

David S. Watkinswatkins@math.wsu.edu

Department of Mathematics

Washington State University

Glasgow 2009 – p. 1

1. Understanding the QR algorithm, SIAM Rev., 1982

2. Fundamentals of Matrix Computations, Wiley, 1991

3. Some perspectives on the eigenvalue problem, 1993

4. QR-like algorithms—an overview of convergencetheory and practice, AMS proceedings, 1996

5. QR-like algorithms for eigenvalue problems, JCAM,2000

7. The Matrix Eigenvalue Problem: GR and KrylovSubspace Methods, SIAM, 2007.

8. The QR algorithm revisited, SIAM Rev., 2008.

Some names associated withthe QR algorithm

Some names associated withthe QR algorithm (short list)

Rutishauser

Kublanovskaya

Rutishauser

Kublanovskaya

Francis

Rutishauser

Kublanovskaya

Francis

Implicitly Shifted QR algorithm

Rutishauser

Kublanovskaya

Francis

Implicitly Shifted QR algorithmHow should we understand it?

Rutishauser

Kublanovskaya

Francis

Implicitly Shifted QR algorithmHow should we understand it? . . . view it?

Rutishauser

Kublanovskaya

Francis

Implicitly Shifted QR algorithmHow should we understand it? . . . view it?. . . teach it to our students?

The Standard Approach . . .

The Standard Approach . . .. . . dating from the work of Francis

Start with the basic algorithm . . .

A = QR

A = QR RQ = A

A = QR RQ = A repeat!

This is simple,

This is simple, appealing,

This is simple, appealing, does not require muchpreparation,

This is simple, appealing, does not require muchpreparation, but . . .

. . . it is far removed from versions of theQRalgorithm that are actually used.

Refinements

Refinementsshifts of origin

reduction to Hessenberg form

implicit shift technique (Francis)

double shiftQR

multiple shiftQR

double shiftQR

multiple shiftQR

implicit-Q theorem

double shiftQR

multiple shiftQR

implicit-Q theoremvs.Krylov subspaces

double shiftQR

multiple shiftQR

Introducing Krylov subspaces improvesunderstanding,

double shiftQR

multiple shiftQR

Introducing Krylov subspaces improvesunderstanding, allows more general results,

double shiftQR

multiple shiftQR

Introducing Krylov subspaces improvesunderstanding, allows more general results, andprepares students for Krylov subspace methods.

The Implicitly Shifted QR Iteration

The Implicitly Shifted QR Iterationmatrix is in upper Hessenberg form

pick some shiftsρ1, . . . ,ρm

pick some shiftsρ1, . . . ,ρm (m = 1, 2, 4, 6)

p(A) = (A − ρ1I) · · · (A − ρmI)

p(A) = (A − ρ1I) · · · (A − ρmI) expensive!

computep(A)e1

computep(A)e1 cheap!

Build unitaryQ0 with q1 = αp(A)e1.

Perform similarity transformA → Q∗0AQ0.

Hessenberg form is disturbed.

An Upper Hessenberg Matrix@

After the Transformation ( Q∗0AQ0)

Now return the matrix to Hessenberg form.

Chasing the Bulge@

The implicitQR step is complete!

Summary of Implicit QR Iteration

Summary of Implicit QR IterationPick some shifts.

Computep(A)e1. (p determined by shifts)

Build Q0 with first columnq1 = αp(A)e1.

Make a bulge. (A → Q∗0AQ0)

Chase the bulge. (return to Hessenberg form)

A = Q∗AQ

Question

QuestionThis differs a lot from the basicQR step.

A = QR RQ = A

Can we carve a reasonable pedagogical path thatleads directly to the implicitly-shiftedQR algorithm,

A = QR RQ = A

Can we carve a reasonable pedagogical path thatleads directly to the implicitly-shiftedQR algorithm,bypassing the basicQR algorithm entirely?

A = QR RQ = A

Can we carve a reasonable pedagogical path thatleads directly to the implicitly-shiftedQR algorithm,bypassing the basicQR algorithm entirely?

That’s what we are going to do today.

Ingredients

Ingredientssubspace iteration (power method)

Krylov subspaces

Krylov subspaces and subspace iteration

(unitary) similarity transformation(change of coordinate system)

Hessenberg form and Krylov subspaces(instead of implicit-Q theorem)

No Magic Shortcut!

Power Method, Subspace Iteration

Power Method, Subspace Iterationv, Av, A2v, A3v, . . .

convergence rate|λ2/λ1 |

S, AS, A2S, A3S, . . .

subspaces of dimensionj

S, AS, A2S, A3S, . . .

subspaces of dimensionj (|λj+1/λj |)

S, AS, A2S, A3S, . . .

Substitutep(A) for A

S, AS, A2S, A3S, . . .

Substitutep(A) for A (shifts, multiple steps)

S, AS, A2S, A3S, . . .

S, p(A)S, p(A)2S, p(A)3S, . . .

S, AS, A2S, A3S, . . .

S, p(A)S, p(A)2S, p(A)3S, . . .

convergence rate|p(λj+1)/p(λj) |

Krylov Subspaces . . .

Krylov Subspaces . . .. . . and Subspace Iteration

Krylov Subspaces . . .. . . and Subspace IterationDef: Kj(A, q) = span

q, Aq,A2q, . . . , Aj−1q}

j = 1, 2, 3, . . . (nested subspaces)

q, Aq,A2q, . . . , Aj−1q}

Kj(A, q) are “determined byq”.

q, Aq,A2q, . . . , Aj−1q}

p(A)Kj(A, q) = Kj(A, p(A)q)

q, Aq,A2q, . . . , Aj−1q}

. . . becausep(A)A = Ap(A)

q, Aq,A2q, . . . , Aj−1q}

. . . becausep(A)A = Ap(A)

Conclusion: Power method induces nested subspaceiterations on Krylov subspaces.

power method: p(A)kq

nested subspace iterations:

p(A)kKj(A, q) = Kj(A, p(A)kq) j = 1, 2, 3, . . .

nested subspace iterations:

p(A)kKj(A, q) = Kj(A, p(A)kq) j = 1, 2, 3, . . .

convergence rates:

|p(λj+1)/p(λj) |, j = 1, 2, 3, . . .

(Unitary) Similarity Transforms

(Unitary) Similarity TransformsA → Q∗AQ preserves eigenvalues

transforms eigenvectors in a simple way(w → Q∗w)

is a change of coordinate system (v → Q∗v)

triangular form (eigenvalues)

relationship of invariant subspaces to triangular form

Subspace Iterationwith change of coordinate system

takeS = span{e1, . . . , ej}