Sylvester's law of inertia is a theorem in matrix algebra about certain properties of the coefficient matrix of a real quadratic form that remain invariant under a change of basis. Namely, if <math>S</math> is a symmetric matrix, then for any invertible matrix <math>P</math>, the numbers of positive, negative, and zero eigenvalues of <math>S' = PSP^\mathsf{T}</math> are constant (i.e., the inertia of <math>S'</math> is constant). This result is particularly useful when <math>S'</math> is diagonal, as the inertia of a diagonal matrix can easily be obtained by looking at the signs of its diagonal elements.
This property is named after James Joseph Sylvester, who published its proof in 1852.
Statement <span class="anchor" id="Statement of the theorem"></span>
Let <math>S</math> be a symmetric square matrix of order <math>n</math> with real entries. Any non-singular square matrix <math>P</math> of the same order is said to transform <math>S</math> into another symmetric matrix , also of order , where <math>P^\mathsf{T}</math> is the transpose of . It is also said that matrices <math>S</math> and <math>S'</math> are congruent. If <math>S</math> is the coefficient matrix of some quadratic form on , then <math>S'</math> is the coefficient matrix of the same form after the change of basis defined by .
A symmetric matrix <math>S</math> can always be transformed in this way into a diagonal matrix <math>D</math> which has entries only , , along the diagonal. Sylvester's law of inertia states that the number of diagonal entries of each kind is an invariant of , i.e., does not depend on the matrix <math>P</math> used.
The number of 's, denoted , is called the positive index of inertia of , and the number of 's, denoted , is called the negative index of inertia of . The number of 's, denoted , is the dimension of the null space of , known as the nullity of . These numbers satisfy the obvious relation
: .
The difference <math> \mathrm{sgn}(S) = n_+ - n_- </math> is usually called the signature of . However, some authors use that term for the triple <math>(n_0,n_+,n_-)</math> consisting of the nullity, the positive, and the negative indices of inertia of . For a non-degenerate form of a given dimension, these are equivalent data; but in general, the triple yields more data.
If the matrix <math>S</math> has the property that every principal upper left <math>k \times k</math> minor <math>\Delta_k</math> is non-zero, then the negative index of inertia is equal to the number of sign changes in the sequence
: .
Statement in terms of eigenvalues
The law can also be stated as follows: two symmetric square matrices <math>S</math> and <math>S'</math> of the same size have the same numbers of positive, negative, and zero eigenvalues if and only if they are congruent (i.e., for some non-singular square matrix ).
The positive and negative indices of a symmetric matrix <math>S</math> are also the numbers of positive and negative eigenvalues of . Any symmetric real matrix <math>S</math> has an eigendecomposition of the form , where <math>E</math> is a diagonal matrix containing the eigenvalues of , and <math>Q</math> is an orthogonal square matrix containing the eigenvectors. The matrix <math>E</math> can be written , where <math>D</math> is diagonal with entries , and <math>W</math> is diagonal with each . The matrix <math>P = QW</math> transforms <math>D</math> to , since .
Statement in terms of quadratic forms
A real quadratic form <math>q</math> in <math>n</math> variables (or on an -dimensional real vector space) can, by a suitable change of basis (by non-singular linear transformation from <math>x</math> to ), be brought to the diagonal form
: , with each .
Sylvester's law of inertia states that the number of coefficients of a given sign is an invariant of , i.e., does not depend on a particular choice of diagonalizing basis.
Expressed geometrically, the law of inertia says that all maximal subspaces on which the restriction of the quadratic form is positive definite (respectively, negative definite) have the same dimension. These dimensions are the positive and negative indices of inertia of .
Generalizations
Two complex matrices (i.e., with complex entries) <math>A</math> and <math>B</math> are said to be -congruent if (and only if) there exists a non-singular complex matrix <math>P</math> such that , where <math>P^*</math> denotes the conjugate transpose of .
The definition of inertia is still valid for a Hermitian matrix, as its eigenvalues are always real numbers.
Sylvester's law of inertia can be generalized to complex matrices:<br>Two Hermitian matrices <math>A</math> and <math>B</math> are -congruent if and only if they have the same inertia.
Ostrowski proved a quantitative generalization of Sylvester's law of inertia:<br>If two complex matrices <math>A</math> and <math>B</math> are -congruent, with where <math>P</math> is a non-singular complex matrix, then their eigenvalues <math>\lambda_i</math> are related by
: ,
where , and each <math>\theta_i</math> is such that
: .<br><br>
A theorem due to Ikramov generalizes the law of inertia to normal matrices:<br>Two normal matrices <math>A</math> and <math>B</math> are congruent if and only if they have the same number of eigenvalues on each open ray from the origin in the complex plane.
See also
- Metric signature
- Morse theory
- Cholesky decomposition
- Haynsworth inertia additivity formula
References
External links
- Sylvester's law of inertia and *-congruence
