ConceptComplete

Positive Definite Matrices

Positive definite matrices are symmetric matrices with positive eigenvalues. They arise naturally in optimization, statistics, and geometry as measures of "positive curvature."

DefinitionPositive Definite Matrix

A symmetric matrix AA is:

  • Positive definite (PD) if xTAx>0\mathbf{x}^TA\mathbf{x} > 0 for all x0\mathbf{x} \neq \mathbf{0}
  • Positive semi-definite (PSD) if xTAx0\mathbf{x}^TA\mathbf{x} \geq 0 for all x\mathbf{x}
  • Negative definite if xTAx<0\mathbf{x}^TA\mathbf{x} < 0 for all x0\mathbf{x} \neq \mathbf{0}
  • Indefinite if xTAx\mathbf{x}^TA\mathbf{x} takes both positive and negative values

The quadratic form Q(x)=xTAxQ(\mathbf{x}) = \mathbf{x}^TA\mathbf{x} generalizes the notion of squared length.

TheoremCharacterizations of Positive Definiteness

For symmetric matrix AA, the following are equivalent:

  1. AA is positive definite
  2. All eigenvalues of AA are positive
  3. All leading principal minors are positive (Sylvester's criterion)
  4. There exists invertible BB such that A=BTBA = B^TB (Cholesky decomposition)
  5. A=QΛQTA = Q\Lambda Q^T where all diagonal entries of Λ\Lambda are positive
ExampleTesting Positive Definiteness

Consider A=[2112]A = \begin{bmatrix} 2 & 1 \\ 1 & 2 \end{bmatrix}.

Method 1 (Eigenvalues): Characteristic polynomial λ24λ+3=0\lambda^2 - 4\lambda + 3 = 0 gives λ1=3,λ2=1\lambda_1 = 3, \lambda_2 = 1. Both positive, so AA is PD.

Method 2 (Sylvester): Leading principal minors are a11=2>0a_{11} = 2 > 0 and det(A)=3>0\det(A) = 3 > 0, confirming PD.

Method 3 (Quadratic form): xTAx=2x12+2x1x2+2x22=(x1+x2)2+x12+x22>0\mathbf{x}^TA\mathbf{x} = 2x_1^2 + 2x_1x_2 + 2x_2^2 = (x_1+x_2)^2 + x_1^2 + x_2^2 > 0 for x0\mathbf{x} \neq \mathbf{0}.

DefinitionMatrix Square Root

If AA is positive definite with spectral decomposition A=QΛQTA = Q\Lambda Q^T, then the matrix square root is: A1/2=QΛ1/2QTA^{1/2} = Q\Lambda^{1/2}Q^T

where Λ1/2\Lambda^{1/2} has diagonal entries λi\sqrt{\lambda_i}. This satisfies (A1/2)2=A(A^{1/2})^2 = A.

Similarly, A1/2=QΛ1/2QTA^{-1/2} = Q\Lambda^{-1/2}Q^T is well-defined.

ExampleApplications of Positive Definite Matrices
  1. Covariance matrices in statistics are always PSD; if variables are linearly independent, the matrix is PD
  2. Hessian matrices in optimization: PD Hessian implies local minimum
  3. Inner products: x,yA=xTAy\langle \mathbf{x}, \mathbf{y} \rangle_A = \mathbf{x}^TA\mathbf{y} defines an inner product iff AA is PD
  4. Ellipsoids: The set {x:xTAx1}\{\mathbf{x} : \mathbf{x}^TA\mathbf{x} \leq 1\} is an ellipsoid when AA is PD
Remark

Positive definiteness is a geometric property: it means the associated quadratic form is a "bowl" opening upward with a unique minimum at the origin. In machine learning, kernel matrices must be PSD for valid similarity measures. In numerical analysis, PD systems can be solved efficiently via Cholesky decomposition, which is faster and more stable than LU decomposition.