Diagonalizable matrix

In linear algebra, a square matrix <math>A</math> is called diagonalizable or non-defective if it is similar to a diagonal matrix. That is, if there exists an invertible matrix <math>P</math> and a diagonal matrix <math>D</math> such that . This is equivalent to (Such <math>D</math> are not unique.) This property exists for any linear map: for a finite-dimensional vector space a linear map <math>T:V\to V</math> is called diagonalizable if there exists an ordered basis of <math>V</math> consisting of eigenvectors of <math>T</math>. These definitions are equivalent: if <math>T</math> has a matrix representation <math>A = PDP^{-1}</math> as above, then the column vectors of <math>P</math> form a basis consisting of eigenvectors of and the diagonal entries of <math>D</math> are the corresponding eigenvalues of with respect to this eigenvector basis, <math>T</math> is represented by

Diagonalization is the process of finding the above <math>P</math> and and makes many subsequent computations easier. One can raise a diagonal matrix <math>D</math> to a power by simply raising the diagonal entries to that power. The determinant of a diagonal matrix is simply the product of all diagonal entries. Such computations generalize easily to

The geometric transformation represented by a diagonalizable matrix is an inhomogeneous dilation (or anisotropic scaling). That is, it can scale the space by a different amount in different directions. The direction of each eigenvector is scaled by a factor given by the corresponding eigenvalue.

A square matrix that is not diagonalizable is called defective. It can happen that a matrix <math>A</math> with real entries is defective over the real numbers, meaning that <math>A = PDP^{-1}</math> is impossible for any invertible <math>P</math> and diagonal <math>D</math> with real entries, but it is possible with complex entries, so that <math>A</math> is diagonalizable over the complex numbers. For example, this is the case for a generic rotation matrix.

Many results for diagonalizable matrices hold only over an algebraically closed field (such as the complex numbers). In this case, diagonalizable matrices are dense in the space of all matrices, which means any defective matrix can be deformed into a diagonalizable matrix by a small perturbation; and the Jordan–Chevalley decomposition states that any matrix is uniquely the sum of a diagonalizable matrix and a nilpotent matrix. Over an algebraically closed field, diagonalizable matrices are equivalent to semi-simple matrices.

Definition

A square <math>n \times n</math> matrix <math>A</math> with entries in a field <math>F</math> is called diagonalizable or nondefective if there exists an <math>n \times n</math> invertible matrix (i.e. an element of the general linear group <math>\operatorname{GL}(n,\mathbb{F})</math>), <math>P</math>, such that <math>P^{-1}AP</math> is a diagonal matrix.

Characterization

The fundamental fact about diagonalizable maps and matrices is expressed by the following:

An <math>n \times n</math> matrix <math>A</math> over a field <math>F</math> is diagonalizable if and only if the sum of the dimensions of its eigenspaces is equal to <math>n</math>, which is the case if and only if there exists a basis of <math>F^n</math> consisting of eigenvectors of <math>A</math>. If such a basis has been found, one can form the matrix <math>P</math> having these basis vectors as columns, and <math>P^{-1}AP</math> will be a diagonal matrix whose diagonal entries are the eigenvalues of <math>A</math>. The matrix <math>P</math> is known as a modal matrix for <math>A</math>.
A linear map <math>T : V \to V</math> is diagonalizable if and only if the sum of the dimensions of its eigenspaces is equal to which is the case if and only if there exists a basis of <math>V</math> consisting of eigenvectors of <math>T</math>. With respect to such a basis, <math>T</math> will be represented by a diagonal matrix. The diagonal entries of this matrix are the eigenvalues of

The following sufficient (but not necessary) condition is often useful.

An <math>n \times n</math> matrix <math>A</math> is diagonalizable over the field <math>F</math> if it has <math>n</math> distinct eigenvalues in i.e. if its characteristic polynomial has <math>n</math> distinct roots in however, the converse may be false. Consider <math display="block">\begin{bmatrix}

-1 & 3 & -1 \\

-3 & 5 & -1 \\

-3 & 3 & 1

\end{bmatrix},</math> which has eigenvalues 1, 2, 2 (not all distinct) and is diagonalizable with diagonal form (similar to <math display="block">\begin{bmatrix}

1 & 0 & 0 \\

0 & 2 & 0 \\

0 & 0 & 2

\end{bmatrix}</math> and change of basis matrix <math>P</math>: <math display="block">\begin{bmatrix}

1 & 1 & -1 \\

1 & 1 & 0 \\

1 & 0 & 3

\end{bmatrix}.</math> The converse fails when <math>A</math> has an eigenspace of dimension higher than 1. In this example, the eigenspace of <math>A</math> associated with the eigenvalue 2 has dimension 2.

A linear map <math>T : V \to V</math> with <math>n = \dim(V)</math> is diagonalizable if it has <math>n</math> distinct eigenvalues, i.e. if its characteristic polynomial has <math>n</math> distinct roots in <math>F</math>.

Let <math>A</math> be a matrix over If <math>A</math> is diagonalizable, then so is any power of it. Conversely, if <math>A</math> is invertible, <math>F</math> is algebraically closed, and <math>A^n</math> is diagonalizable for some <math>n</math> that is not an integer multiple of the characteristic of then <math>A</math> is diagonalizable. Proof: If <math>A^n</math> is diagonalizable, then <math>A</math> is annihilated by some polynomial which has no multiple root (since and is divided by the minimal polynomial of

Over the complex numbers <math>\Complex</math>, almost every matrix is diagonalizable. More precisely: the set of complex <math>n \times n</math> matrices that are not diagonalizable over considered as a subset of has Lebesgue measure zero. One can also say that the diagonalizable matrices form a dense subset with respect to the Zariski topology: the non-diagonalizable matrices lie inside the vanishing set of the discriminant of the characteristic polynomial, which is a hypersurface. From that follows also density in the usual (strong) topology given by a norm. The same is not true over

The Jordan–Chevalley decomposition expresses an operator as the sum of its semisimple (i.e., diagonalizable) part and its nilpotent part. Hence, a matrix is diagonalizable if and only if its nilpotent part is zero. Put in another way, a matrix is diagonalizable if each block in its Jordan form has no nilpotent part; i.e., each "block" is a one-by-one matrix.

Diagonalization

Consider the two following arbitrary bases <math>E = \