Schur Complement is a widely used way to find an inverse of a matrix in many fields. It takes place to find the decent representation of a matrix, particularly when a matrix is represented in several block sub-matrices. Assuming that an invertible matrix is . A matrix can be written as four block matrices: , , , , such that and are and matrices respectively.
There are many resources in the internet showing the derivation of Schur Complement. The one that I found it to be the most understandable is by Jean Gallier. He mimicked Guass elimination on block matrices to find the inverse of . That is
There are two ways to represent an inverse of . The first way, which already shows above, takes the to form the Schur complement. It is called Schur Complement of . The latter one takes the instead. It is worth to notice that only and are square matrices, thus constructing those two valid representations.
At this point, an inverse of a matrix M is simply constructed, but yet not the most concise one. Some books like to mention the form that further factorizes it down into three matrices.
To come up with the representation above, we can make use of LU decomposition to find the lower triangular matrix. We first perform linear transformation by adding with
From which, we can deduce that , and are just , and respectively.
We then undergo the similar step on the matrix to find its upper triangular matrix by performing linear transaction one more time. This time, we add by .
Again, we can deduce that , and are , and .
After all, we factorize the Schur complement of an inverse into a more concise form.
References: