Probability: The Analysis of Data, Volume 1
Linear Algebra: Positive Semidefinite Matrices
$
\def\P{\mathsf{\sf P}}
\def\E{\mathsf{\sf E}}
\def\Var{\mathsf{\sf Var}}
\def\Cov{\mathsf{\sf Cov}}
\def\std{\mathsf{\sf std}}
\def\Cor{\mathsf{\sf Cor}}
\def\R{\mathbb{R}}
\def\c{\,|\,}
\def\bb{\boldsymbol}
\def\diag{\mathsf{\sf diag}}
\def\col{\mathsf{\sf col}}
\def\row{\mathsf{\sf row}}
\def\rank{\mathsf{\sf rank}}
\def\defeq{\stackrel{\tiny\text{def}}{=}}
$
C.4. Positive Semidefinite Matrices
Definition C.4.1.
A square symmetric matrix $H\in\R^{n\times n}$ is positive semi-definite (psd) if
\[ {\bb v}^{\top}H{\bb v}\geq 0, \qquad \forall \bb v \in\R^{n}\] and positive definite (pd) if the inequality holds with equality only for vectors $\bb v=\bb 0$. A square symmetric matrix $H\in\R^{n\times n}$ is negative semi-definite (nsd) if
\[ {\bb v}^{\top}H{\bb v}\leq 0, \qquad \forall \bb v \in\R^{n}\] and negative definite (nd) if the inequality holds with equality only for vectors $\bb v=\bb 0$.
We make the following observations.
- The matrix $A$ is psd if and only if $-A$ is nsd, and similarly a matrix $A$ is pd if and only if $-A$ is nd.
- The psd and pd concepts are denoted by $0\preceq A$ and $0\prec A$, respectively. The nsd and nd concepts are denoted by $A\preceq 0$ and $A\prec 0$, respectively.
- The notations above can be extended to denote a partial order on matrices: $A\preceq B$ if and only if $A-B\preceq 0$, and $A\prec B$ if and only if $A-B\prec 0$. Note that $A\prec B$ does not imply that all entries of $A$ are smaller than the corresponding entries of $B$; a numerical illustration follows.
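To make the last observation concrete, the following NumPy sketch (an illustration added here, not part of the original text; the matrices are hypothetical) exhibits $A\prec B$ even though an entry of $A$ exceeds the corresponding entry of $B$.

```python
import numpy as np

# Hypothetical example: A < B in the Loewner order, yet A[0,1] > B[0,1].
A = np.array([[1.0, 0.9],
              [0.9, 1.0]])
B = np.array([[2.0, 0.0],
              [0.0, 2.0]])

# B - A has eigenvalues 0.1 and 1.9, all positive, so A < B (Loewner order).
print(np.linalg.eigvalsh(B - A))  # [0.1, 1.9]
print(A[0, 1] > B[0, 1])          # True: entrywise comparison still fails
```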
Proposition C.4.1.
A symmetric matrix is psd if and only if all eigenvalues are non-negative. It is nsd if and only if all eigenvalues are non-positive. It is pd if and only if all eigenvalues are positive. It is nd if and only if all eigenvalues are negative.
Proof.
Let $\bb v$ be an arbitrary vector. Using the spectral decomposition, we have
\[\bb v^{\top} A\bb v=(\bb v^{\top} U)\diag(\bb\lambda)(U^{\top}\bb v)=\sum_{i=1}^n \lambda_i ([U^{\top}\bb v]_i)^2,\]
where $U$ is a matrix whose columns contain the $n$ orthonormal eigenvectors of $A$. The expression above is non-negative for all $\bb v$ if and only if $\lambda_i\geq 0$ for all $i=1,\ldots,n$ (taking $\bb v$ to be the $i$-th eigenvector shows that $\lambda_i\geq 0$ is necessary). The remaining statements are proved similarly.
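The eigenvalue criterion of Proposition C.4.1 is easy to apply numerically. Below is a minimal sketch (added here for illustration; the helper name classify and the tolerance tol are hypothetical choices) that classifies a symmetric matrix using NumPy's eigvalsh, which returns the eigenvalues of a symmetric matrix in ascending order.

```python
import numpy as np

def classify(H, tol=1e-10):
    """Classify a symmetric matrix by the signs of its eigenvalues."""
    lam = np.linalg.eigvalsh(H)   # ascending eigenvalues of symmetric H
    if lam[0] > tol:
        return "pd"
    if lam[0] >= -tol:
        return "psd"
    if lam[-1] < -tol:
        return "nd"
    if lam[-1] <= tol:
        return "nsd"
    return "indefinite"

print(classify(np.array([[2.0, 1.0], [1.0, 2.0]])))  # pd  (eigenvalues 1, 3)
print(classify(np.array([[1.0, 1.0], [1.0, 1.0]])))  # psd (eigenvalues 0, 2)
print(classify(np.array([[0.0, 1.0], [1.0, 0.0]])))  # indefinite (-1, 1)
```

The tolerance guards against floating-point rounding; exact zero tests are unreliable for computed eigenvalues.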
Proposition C.4.2.
Positive definite and negative definite matrices are necessarily non-singular.
Proof.
Since the eigenvalues of the matrices in question are all positive or all negative, their product, and therefore the determinant, is non-zero. A matrix whose determinant is non-zero is non-singular.
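As a numerical sanity check (added here; the matrix is a hypothetical example), the determinant of a pd matrix equals the product of its positive eigenvalues and is therefore non-zero:

```python
import numpy as np

A = np.array([[2.0, 1.0],
              [1.0, 2.0]])     # pd: eigenvalues 1 and 3
lam = np.linalg.eigvalsh(A)
print(np.prod(lam))            # 3.0: product of eigenvalues
print(np.linalg.det(A))        # 3.0: the determinant agrees (up to rounding)
print(np.allclose(np.linalg.inv(A) @ A, np.eye(2)))  # True: A is invertible
```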
Proposition C.4.3.
A symmetric matrix $A$ of rank $r$ is psd if and only if there exists a square matrix $R$ of rank $r$ such that $A=R^{\top}R$. If $A$ is pd then $R$ is non-singular.
Proof.
Let $A$ be a psd matrix of rank $r$. Then it has $r$ non-zero (and hence positive) eigenvalues, and ordering them so that $\lambda_1,\ldots,\lambda_r>0$ and $\lambda_{r+1}=\cdots=\lambda_n=0$, we can write the spectral decomposition as
\begin{align*}
A = U\diag(\lambda_1,\ldots,\lambda_n) U^{\top}=
U\diag(\sqrt{\lambda_1},\ldots,\sqrt{\lambda_n}) \diag(\sqrt{\lambda_1},\ldots,\sqrt{\lambda_n}) U^{\top}=R^{\top}R,
\end{align*}
where $U$ is the matrix whose columns contain the orthonormal eigenvectors of $A$ and $R=\diag(\sqrt{\lambda_1},\ldots,\sqrt{\lambda_n})U^{\top}$ is a square matrix of rank $r$. Conversely, if $A=R^{\top}R$ then $\rank(A)=\rank(R)=r$ (see Proposition C.2.5) and for all $\bb v$, writing $\bb w=R\bb v$,
\[{\bb v}^{\top}A{\bb v}=({\bb v}^{\top}R^{\top})(R\bb v) = {\bb w}^{\top}\bb w \geq 0.\]
Finally, if $A=R^{\top}R$ is pd then it is non-singular and therefore of full rank. It follows that $R$ is also non-singular and of full rank (see Proposition C.2.5).
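Both directions of Proposition C.4.3 can be checked numerically. The sketch below (an added illustration; the random Gram-matrix construction is a hypothetical choice) builds a psd matrix as $R^{\top}R$ and then recovers a square factor from the spectral decomposition, as in the proof.

```python
import numpy as np

rng = np.random.default_rng(0)

# Converse direction: any Gram matrix R^T R is psd.
R = rng.standard_normal((3, 3))
A = R.T @ R
print(np.all(np.linalg.eigvalsh(A) >= -1e-12))   # True: all eigenvalues >= 0

# Forward direction: R = diag(sqrt(lambda)) U^T satisfies A = R^T R.
lam, U = np.linalg.eigh(A)                        # A = U diag(lam) U^T
R2 = np.diag(np.sqrt(np.clip(lam, 0.0, None))) @ U.T
print(np.allclose(R2.T @ R2, A))                  # True
```

The clip call zeroes out tiny negative eigenvalues produced by rounding before the square roots are taken.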
Proposition C.4.4.
If $A$ is pd then so is $A^{-1}$.
Proof.
Let $A$ be a pd matrix. Using the previous proposition, we can write $A=R^{\top}R$ where $R$ is a non-singular square matrix. Then
\[A^{-1} = (R^{\top}R)^{-1} = R^{-1}(R^{\top})^{-1} = S^{\top}S, \qquad S\defeq (R^{\top})^{-1},\]
and since $S$ is non-singular (of full rank), the previous proposition (the converse part) implies that $A^{-1}$ is pd.
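A quick numerical check of Proposition C.4.4 (added here; the shifted Gram matrix is one hypothetical way to generate a pd matrix):

```python
import numpy as np

rng = np.random.default_rng(1)
R = rng.standard_normal((4, 4))
A = R.T @ R + np.eye(4)            # pd: eigenvalues of R^T R shifted up by 1

A_inv = np.linalg.inv(A)
print(np.all(np.linalg.eigvalsh(A_inv) > 0))   # True: A^{-1} is pd
# The eigenvalues of A^{-1} are the reciprocals of the eigenvalues of A.
print(np.allclose(np.sort(1.0 / np.linalg.eigvalsh(A)),
                  np.linalg.eigvalsh(A_inv)))  # True
```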
Proposition C.4.5.
The diagonal elements of a pd matrix are all positive.
Proof.
Using the standard basis ${\bb e}^{(i)}$ (see Example C.1.4), we have \[({\bb e}^{(i)})^{\top} A {\bb e}^{(i)} =A_{ii}, \qquad i=1,\ldots,n.\]
It follows that if $A$ is pd, $A_{ii} > 0, i=1,\ldots,n$.
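The computation in the proof can be replayed directly (a small added illustration with a hypothetical pd matrix):

```python
import numpy as np

A = np.array([[2.0, 1.0],
              [1.0, 2.0]])    # pd

n = A.shape[0]
for i in range(n):
    e = np.zeros(n)           # standard basis vector e^(i)
    e[i] = 1.0
    print(e @ A @ e)          # equals A[i, i]; prints 2.0 twice, both positive
```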
Proposition C.4.6.
If $A$ is positive definite there exists a square root matrix $A^{1/2}$ for which $A^{1/2}A^{1/2}=A$.
Proof.
Let $A$ be a pd matrix with positive eigenvalues. Using the spectral decomposition $A=U\diag(\lambda_1,\ldots,\lambda_n)U^{\top}$ and the orthogonality relation $U^{\top}U=I$, we have
\begin{align*}
A&= U \diag(\lambda_1,\ldots,\lambda_n) U^{\top} \\ &=
(U \diag(\sqrt{\lambda_1},\ldots,\sqrt{\lambda_n})
U^{\top})(U
\diag(\sqrt{\lambda_1},\ldots,\sqrt{\lambda_n})U^{\top}) \\&= A^{1/2} A^{1/2},
\end{align*}
where $A^{1/2}\defeq U\diag(\sqrt{\lambda_1},\ldots,\sqrt{\lambda_n})U^{\top}$.
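The construction in the proof translates directly into code. The sketch below (an added illustration with a hypothetical pd matrix) forms $A^{1/2}$ from the spectral decomposition and verifies that it squares back to $A$.

```python
import numpy as np

A = np.array([[2.0, 1.0],
              [1.0, 2.0]])                 # pd: eigenvalues 1 and 3

lam, U = np.linalg.eigh(A)                 # A = U diag(lam) U^T
A_half = U @ np.diag(np.sqrt(lam)) @ U.T   # A^{1/2} = U diag(sqrt(lam)) U^T

print(np.allclose(A_half @ A_half, A))     # True: A^{1/2} A^{1/2} = A
print(np.allclose(A_half, A_half.T))       # True: the square root is symmetric
```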