Something more about Schur Complement

Sunday, January 12, 2025

mathlinear algebra

Schur Complement is a widely used way to find an inverse of a matrix in many fields. It takes place to find the decent representation of a matrix, particularly when a matrix is represented in several block sub-matrices. Assuming that an invertible matrix $M$ is $R^{n \times n}$ . A matrix $M$ can be written as four block matrices: $A$ , $B$ , $C$ , $D$ , such that $A$ and $D$ are $p \times p$ and $(n - p) \times (n - p)$ matrices respectively.

M = [A C B D], A \in R^{p \times p}; D \in R^{(n - p) \times (n - p)}

There are many resources in the internet showing the derivation of Schur Complement. The one that I found it to be the most understandable is by Jean Gallier. He mimicked Guass elimination on block matrices to find the inverse of $M$ . That is

M^{- 1} = [(A - B D^{- 1} C)^{- 1} - D^{- 1} C (A - B D^{- 1} C)^{- 1} - (A - B D^{- 1} C)^{- 1} B D^{- 1} D^{- 1} + D^{- 1} C (A - B D^{- 1} C)^{- 1} B D^{- 1}] = [A^{- 1} + A^{- 1} B (D - C A^{- 1} B)^{- 1} C A^{- 1} - (D - C A^{- 1} B)^{- 1} C A^{- 1} - A^{- 1} B (D - C A^{- 1} B)^{- 1} (D - C A^{- 1} B)^{- 1}]

There are two ways to represent an inverse of $M$ . The first way, which already shows above, takes the $D^{- 1}$ to form the Schur complement. It is called Schur Complement of $D$ . The latter one takes the $A^{- 1}$ instead. It is worth to notice that only $A$ and $D$ are square matrices, thus constructing those two valid representations.

At this point, an inverse of a matrix M is simply constructed, but yet not the most concise one. Some books like to mention the form that further factorizes it down into three matrices.

M^{- 1} = [I_{p} - D^{- 1} C 0 I_{n - p}] [(A - B D^{- 1} C)^{- 1} 0 0 D^{- 1}] [I_{p} 0 B D^{- 1} I_{n - p}]

To come up with the representation above, we can make use of LU decomposition to find the lower triangular matrix. We first perform linear transformation by adding $R_{2}$ with $D^{- 1} C R_{1}$

[(A - B D^{- 1} C)^{- 1} - D^{- 1} C (A - B D^{- 1} C)^{- 1} - (A - B D^{- 1} C)^{- 1} B D^{- 1} D^{- 1} + D^{- 1} C (A - B D^{- 1} C)^{- 1} B D^{- 1}] R_{2} - (- D^{- 1} C) R_{1} [(A - B D^{- 1} C)^{- 1} 0 - (A - B D^{- 1} C)^{- 1} B D^{- 1} D^{- 1}]

From which, we can deduce that $K_{1}$ , $K_{2}$ and $K_{3}$ are just $I_{p}$ , $I_{n - p}$ and $- D^{- 1} C$ respectively.

[K_{1} K_{2} 0 K_{3}] [(A - B D^{- 1} C)^{- 1} 0 - (A - B D^{- 1} C)^{- 1} B D^{- 1} D^{- 1}] = M^{- 1} [K_{1} K_{2} 0 K_{3}] = [I_{p} - D^{- 1} C 0 I_{n - p}]

We then undergo the similar step on the matrix to find its upper triangular matrix by performing linear transaction one more time. This time, we add $R_{1}$ by $(A - B D^{- 1} C)^{- 1} R_{2}$ .

[(A - B D^{- 1} C)^{- 1} 0 - (A - B D^{- 1} C)^{- 1} B D^{- 1} D^{- 1}] R_{1} + (A - B D^{- 1} C)^{- 1} B R_{2} [(A - B D^{- 1} C)^{- 1} 0 0 D^{- 1}]

Again, we can deduce that $U_{1}$ , $U_{2}$ and $U_{3}$ are $I_{p}$ , $- B D^{- 1}$ and $I_{n - p}$ .

[(A - B D^{- 1} C)^{- 1} 0 0 D^{- 1}] [U_{1} 0 U_{2} U_{3}] = [(A - B D^{- 1} C)^{- 1} 0 - (A - B D^{- 1} C)^{- 1} B D^{- 1} D^{- 1}] [U_{1} 0 U_{2} U_{3}] = [I_{p} 0 - B D^{- 1} I_{n - p}]

After all, we factorize the Schur complement of an inverse $M$ into a more concise form.

M^{- 1} = [I_{p} - D^{- 1} C 0 I_{n - p}] [(A - B D^{- 1} C)^{- 1} 0 0 D^{- 1}] [I_{p} 0 - B D^{- 1} I_{n - p}]

References: