Command Palette

Search for a command to run...

Linear Methods of AI

Spectral Theorem for Real Matrices

Equivalent Conditions of the Theorem

For a real matrix ARn×nA \in \mathbb{R}^{n \times n}, the following two conditions are equivalent to each other.

  1. There exists an orthonormal basis of Rn\mathbb{R}^n consisting of eigenvectors of matrix AA

  2. Matrix AA is normal and the characteristic polynomial χA(t)\chi_A(t) of AA factors into linear factors in R\mathbb{R}

The main difference from the case of complex matrices lies in the additional requirement about the factorization of the characteristic polynomial. In real numbers, all roots of the characteristic polynomial must be real numbers.

Based on this theorem, several types of matrices can be diagonalized using an orthonormal basis of eigenvectors. Complex Hermitian matrices, complex unitary matrices, and real symmetric matrices always satisfy this condition. Orthogonal matrices can be orthogonally diagonalized only if their characteristic polynomial factors into linear factors in real numbers.

Normal Matrix without Orthonormal Basis

Consider a rotation matrix for angle α\alpha that is not a multiple of π\pi.

A=(cos(α)sin(α)sin(α)cos(α))A = \begin{pmatrix} \cos(\alpha) & -\sin(\alpha) \\ \sin(\alpha) & \cos(\alpha) \end{pmatrix}

This matrix is orthogonal and therefore automatically normal. However, its characteristic polynomial

χA(t)=(cos(α)t)2+sin2(α)\chi_A(t) = (\cos(\alpha) - t)^2 + \sin^2(\alpha)
=t2+2cos(α)t+1= -t^2 + 2\cos(\alpha) \cdot t + 1

does not factor into linear factors in real numbers for most values of α\alpha. Imagine searching for solutions to a quadratic equation that has no real roots. As a result, even though the matrix is normal, there is no orthonormal basis of real eigenvectors.

Non-Normal Matrix that is Diagonalizable

The following matrix has eigenvalues λ1=1\lambda_1 = 1 and λ2=2\lambda_2 = 2 with algebraic multiplicity 1 each.

A=(1302)A = \begin{pmatrix} 1 & 3 \\ 0 & 2 \end{pmatrix}

Since the eigenvalues are different, the matrix can be diagonalized. However, this matrix is not normal because

AAT=(10664)A \cdot A^T = \begin{pmatrix} 10 & 6 \\ 6 & 4 \end{pmatrix}
ATA=(13313)A^T \cdot A = \begin{pmatrix} 1 & 3 \\ 3 & 13 \end{pmatrix}

These two matrices are not equal, so AATATAA \cdot A^T \neq A^T \cdot A.

The eigenspaces of this matrix are

EigA(1)=Kern(AI)=Kern(0301)=span(10)\text{Eig}_A(1) = \text{Kern}(A - I) = \text{Kern}\begin{pmatrix} 0 & 3 \\ 0 & 1 \end{pmatrix} = \text{span}\begin{pmatrix} 1 \\ 0 \end{pmatrix}
EigA(2)=Kern(A2I)=Kern(1300)=span(31)\text{Eig}_A(2) = \text{Kern}(A - 2I) = \text{Kern}\begin{pmatrix} -1 & 3 \\ 0 & 0 \end{pmatrix} = \text{span}\begin{pmatrix} 3 \\ 1 \end{pmatrix}

Here the kernel (null space) is the set of all vectors that are mapped to the zero vector by the matrix, while the span is the set of all linear combinations of that vector. Although the matrix can be diagonalized, there is no orthonormal basis of eigenvectors because the eigenvectors are not orthogonal to each other.