So with the help and assistance of \(B\) we have been able to determine a solution to the system represented by \(A\vect{x}=\vect{b}\) through judicious use of matrix multiplication. We know by Theorem NMUS that since the coefficient matrix in this example is nonsingular, there would be a unique solution, no matter what the choice of \(\vect{b}\text{.}\) The derivation above amplifies this result, since we were forced to conclude that \(\vect{x}=B\vect{b}\) and the solution could not be anything else. You should notice that this argument would hold for any particular choice of \(\vect{b}\text{.}\)
The matrix \(B\) of the previous example is called the inverse of \(A\text{.}\) When \(A\) and \(B\) are combined via matrix multiplication, the result is the identity matrix, which can be inserted “in front” of \(\vect{x}\) as the first step in finding the solution. This is entirely analogous to how we might solve a single linear equation like \(3x=12\text{.}\)
Here we have obtained a solution by employing the “multiplicative inverse” of \(3\text{,}\)\(3^{-1}=\frac{1}{3}\text{.}\) This works fine for any scalar multiple of \(x\text{,}\) except for zero, since zero does not have a multiplicative inverse. Consider separately the two linear equations
The first has no solutions, while the second has infinitely many solutions. For matrices, it is all just a little more complicated. Some matrices have inverses, some do not. And when a matrix does have an inverse, just how would we compute it? In other words, just where did that matrix \(B\) in the last example come from? Are there other matrices that might have worked just as well?
SubsectionIMInverse of a Matrix
DefinitionMI.Matrix Inverse.
Suppose \(A\) and \(B\) are square matrices of size \(n\) such that \(AB=I_n\) and \(BA=I_n\text{.}\) Then \(A\) is invertible and \(B\) is the inverse of \(A\text{.}\) In this situation, we write \(B=\inverse{A}\text{.}\)
Notice that if \(B\) is the inverse of \(A\text{,}\) then we can just as easily say \(A\) is the inverse of \(B\text{,}\) or \(A\) and \(B\) are inverses of each other.
Not every square matrix has an inverse. In Example SABMI the matrix \(B\) is the inverse of the coefficient matrix of Archetype B. To see this it only remains to check that \(AB=I_3\text{.}\) What about Archetype A? It is an example of a square matrix without an inverse.
ExampleMWIAA.A matrix without an inverse, Archetype A.
We will show that \(A\) is a matrix with no inverse, with a proof by contradiction. To this end, suppose that \(A\) is invertible, and call its inverse the matrix \(B\text{.}\) Choose the vector of constants
and consider the system of equations \(\linearsystem{A}{\vect{b}}\text{.}\) We could now proceed exactly as we did in Example SABMI, and employ the matrix \(B\) to determine a unique solution to this vector equation. Namely, the solution would be \(\vect{x}=B\vect{b}\text{.}\) In other words, the system is consistent.
However, we will now show the system \(\linearsystem{A}{\vect{b}}\) has no solutions. In other words, this system is inconsistent. Form the augmented matrix \(\augmented{A}{\vect{b}}\) and row-reduce to
which allows us to recognize the inconsistency by Theorem RCLS.
So the assumption of \(A\)’s inverse leads to a logical inconsistency as the system cannot be both consistent and inconsistent. So our assumption of an inverse is false, and \(A\) is a matrix with no inverse (provably). So we say \(A\) is not invertible.
It is possible this example is less than satisfying. Just where did that particular choice of the vector \(\vect{b}\) come from anyway? Stay tuned for an application of the future Theorem CSCS in Example CSAA.
Let us look at one more matrix inverse before we embark on a more systematic study.
so by Definition MI, we can say that \(A\) is invertible and write \(B=\inverse{A}\text{.}\)
We will now concern ourselves less with whether or not an inverse of a matrix exists, but instead with how you can find one when it does exist. In Section MINM we will have some theorems that allow us to more quickly and easily determine just when a matrix is invertible.
SubsectionCIMComputing the Inverse of a Matrix
We have just seen inverses of matrices in Example SABMI and Example MI, but these inverse matrices have just dropped from the sky. How would we compute an inverse? And just when is a matrix invertible, and when is it not? Writing a putative inverse with \(n^2\) unknowns and solving the resulting \(n^2\) equations is one approach. Applying this approach to \(2\times 2\) matrices can get us somewhere, so just for fun, let us do it.
Assume that \(ad-bc\neq 0\text{.}\) We will use the definition of the inverse of a matrix to establish that \(A\) has an inverse (Definition MI). Note that if \(ad-bc\neq 0\) then the displayed formula for \(\inverse{A}\) is legitimate since we are not dividing by zero). Using this proposed formula for the inverse of \(A\text{,}\) we simply compute
By Definition MI this is sufficient to establish that \(A\) is invertible, and that the expression for \(\inverse{A}\) is correct.
(⇒)
Assume that \(A\) is invertible, and proceed with a proof by contradiction (Proof Technique CD), by assuming also that \(ad-bc=0\text{.}\) This translates to \(ad=bc\text{.}\) Let
The matrix on the right now has two rows that are identical, and therefore the same must be true of the matrix on the left. Identical rows for the matrix on the left implies that \(a=0\) and \(c=0\text{.}\)
So \(bg=dh=1\) and thus \(b,g,d,h\) are all nonzero. But then \(bh\) and \(dg\) (the “other corners”) must also be nonzero, so this is (finally) a contradiction. So our assumption was false and we see that \(ad-bc\neq 0\) whenever \(A\) has an inverse.
There are several ways one could try to prove this theorem, but there is a continual temptation to divide by one of the eight entries involved (\(a\) through \(f\)), but we can never be sure if these numbers are zero or not. This could lead to an analysis by cases, which is messy, messy, messy. Note how the above proof never divides, but always multiplies, and how zero/nonzero considerations are handled. Pay attention to the expression \(ad-bc\text{,}\) as we will see it again in a while (Chapter D).
This theorem is cute, and it is nice to have a formula for the inverse, and a condition that tells us when we can use it. However, this approach becomes impractical for larger matrices, even though it is possible to demonstrate that, in theory, there is a general formula. (Think for a minute about extending this result to just \(3\times 3\) matrices. For starters, we need 18 letters!) Instead, we will work column-by-column. Let us first work an example that will motivate the main theorem and remove some of the previous mystery.
For its inverse, we desire a matrix \(B\) so that \(AB=I_5\text{.}\) Emphasizing the structure of the columns and employing the definition of matrix multiplication (Definition MM), we have
Since the matrix \(B\) is what we are trying to compute, we can view each column, \(\vect{B}_i\text{,}\) as a column vector of unknowns in a linear system of equations. Then we have five systems of equations to solve, each with 5 equations in 5 variables. Notice that all 5 of these systems have the same coefficient matrix. We will now solve each system in turn.
\begin{align*}
\end{align*}
Row-reduce the augmented matrix of the linear system \(\linearsystem{A}{\vect{e}_1}\text{,}\)
By this method, we know that \(AB=I_5\text{.}\) Check that \(BA=I_5\text{,}\) and then we will know that we have the inverse of \(A\text{.}\)
Notice how the five systems of equations in the preceding example were all solved by exactly the same sequence of row operations. Would it not be nice to avoid this obvious duplication of effort? Our main theorem for this section follows, and it mimics this previous example, while also avoiding all the overhead.
TheoremCINM.Computing the Inverse of a Nonsingular Matrix.
Suppose \(A\) is a nonsingular square matrix of size \(n\text{.}\) Create the \(n\times 2n\) matrix \(M\) by placing the \(n\times n\) identity matrix \(I_n\) to the right of the matrix \(A\text{.}\) Let \(N\) be a matrix that is row-equivalent to \(M\) and in reduced row-echelon form. Finally, let \(J\) be the matrix formed from the final \(n\) columns of \(N\text{.}\) Then \(AJ=I_n\text{.}\)
Proof.
\(A\) is nonsingular, so by Theorem NMRRI there is a sequence of row operations that will convert \(A\) into \(I_n\text{.}\) It is this same sequence of row operations that will convert \(M\) into \(N\text{,}\) since having the identity matrix in the first \(n\) columns of \(N\) is sufficient to guarantee that \(N\) is in reduced row-echelon form.
If we consider the systems of linear equations, \(\linearsystem{A}{\vect{e}_i}\text{,}\)\(1\leq i\leq n\text{,}\) we see that the aforementioned sequence of row operations will also bring the augmented matrix of each of these systems into reduced row-echelon form. Furthermore, the unique solution to \(\linearsystem{A}{\vect{e}_i}\) appears in column \(n+1\) of the row-reduced augmented matrix of the system and is identical to column \(n+i\) of \(N\text{.}\) Let \(\vectorlist{N}{2n}\) denote the columns of \(N\text{.}\) So we find,
We have to be just a bit careful here about both what this theorem says and what it does not say. If \(A\) is a nonsingular matrix, then we are guaranteed a matrix \(B\) such that \(AB=I_n\text{,}\) and the proof gives us a process for constructing \(B\text{.}\) However, the definition of the inverse of a matrix (Definition MI) requires that \(BA=I_n\) also. So at this juncture we must compute the matrix product in the “opposite” order before we claim \(B\) as the inverse of \(A\text{.}\) However, we will soon see that this is always the case, in Theorem OSIS, so the title of this theorem is not inaccurate.
What if \(A\) is singular? At this point we only know that Theorem CINM cannot be applied. The question of \(A\)’s inverse is still open. (But see Theorem NI in the next section.)
We will finish by computing the inverse for the coefficient matrix of Archetype B, the one we just pulled from a hat in Example SABMI. There are more examples in the Archetypes (Appendix A) to practice with, though notice that it is silly to ask for the inverse of a rectangular matrix (the sizes are not right) and not every square matrix has an inverse (remember Example MWIAA?).
ExampleCMIAB.Computing a matrix inverse, Archetype B.
once we check that \(\inverse{B}B=I_3\) (the product in the opposite order is a consequence of the theorem).
SageMISLE.Matrix Inverse, Systems of Equations.
We can use the computational method described in this section in hopes of finding a matrix inverse, as Theorem CINM gets us halfway there. We will continue with the matrix from Example MI. First we check that the matrix is nonsingular so we can apply the theorem, then we get “half” an inverse, and verify that it also behaves as a “full” inverse by meeting the full definition of a matrix inverse (Definition MI).
Note that the matrix J is constructed by taking the last 5 columns of N (numbered 5 through 9) and using them in the matrix_from_columns() matrix method. What happens if you apply the procedure above to a singular matrix? That would be an instructive experiment to conduct.
With an inverse of a coefficient matrix in hand, we can easily solve systems of equations, in the style of Example SABMI. We will recycle the matrices A and its inverse, J, from above, so be sure to run those compute cells first if you are playing along. We consider a system with A as a coefficient matrix and solve a linear system twice, once the old way and once the new way. Recall that with a nonsingular coefficient matrix, the solution will be unique for any choice of const, so you can experiment by changing the vector of constants and re-executing the code.
SubsectionPMIProperties of Matrix Inverses
The inverse of a matrix enjoys some nice properties. We collect a few here. First, a matrix can have but one inverse.
TheoremMIU.Matrix Inverse is Unique.
Suppose the square matrix \(A\) has an inverse. Then \(\inverse{A}\) is unique.
Proof.
As described in Proof Technique U, we will assume that \(A\) has two inverses. The hypothesis tells there is at least one. Suppose then that \(B\) and \(C\) are both inverses for \(A\text{,}\) so we know by Definition MI that \(AB=BA=I_n\) and \(AC=CA=I_n\text{.}\) Then we have
So we conclude that \(B\) and \(C\) are the same, and cannot be different. So any matrix that acts like an inverse, must be the inverse.
When most of us dress in the morning, we put on our socks first, followed by our shoes. In the evening we must then first remove our shoes, followed by our socks. Try to connect the conclusion of the following theorem with this everyday example.
TheoremSS.Socks and Shoes.
Suppose \(A\) and \(B\) are invertible matrices of size \(n\text{.}\) Then \(AB\) is an invertible matrix and \(\inverse{(AB)}=\inverse{B}\inverse{A}\text{.}\)
Proof.
At the risk of carrying our everyday analogies too far, the proof of this theorem is quite easy when we compare it to the workings of a dating service. We have a statement about the inverse of the matrix \(AB\text{,}\) which for all we know right now might not even exist. Suppose \(AB\) was to sign up for a dating service with two requirements for a compatible date. Upon multiplication on the left, and on the right, the result should be the identity matrix. In other words, \(AB\)’s ideal date would be its inverse.
Now along comes the matrix \(\inverse{B}\inverse{A}\) (which we know exists because our hypothesis says both \(A\) and \(B\) are invertible and we can form the product of these two matrices), also looking for a date. Let us see if \(\inverse{B}\inverse{A}\) is a good match for \(AB\text{.}\) First they meet at a noncommittal neutral location, say a coffee shop, for quiet conversation,
So the matrix \(\inverse{B}\inverse{A}\) has met all of the requirements to be \(AB\)’s inverse (date) and with the ensuing marriage proposal we can announce that \(\inverse{(AB)}=\inverse{B}\inverse{A}\text{.}\)
TheoremMIMI.Matrix Inverse of a Matrix Inverse.
Suppose \(A\) is an invertible matrix. Then \(\inverse{A}\) is invertible and \(\inverse{(\inverse{A})}=A\text{.}\)
Proof.
As with the proof of Theorem SS, we examine if \(A\) is a suitable inverse for \(\inverse{A}\) (by definition, the opposite is true).
The matrix \(A\) has met all the requirements to be the inverse of \(\inverse{A}\text{,}\) and so is invertible and we can write \(A=\inverse{(\inverse{A})}\text{.}\)
TheoremMIT.Matrix Inverse of a Transpose.
Suppose \(A\) is an invertible matrix. Then \(\transpose{A}\) is invertible and \(\inverse{(\transpose{A})}=\transpose{(\inverse{A})}\text{.}\)
Proof.
As with the proof of Theorem SS, we see if \(\transpose{(\inverse{A})}\) is a suitable inverse for \(\transpose{A}\text{.}\) Apply Theorem MMT to see that
The matrix \(\transpose{(\inverse{A})}\) has met all the requirements to be the inverse of \(\transpose{A}\text{,}\) and so is invertible and we can write \(\inverse{(\transpose{A})}=\transpose{(\inverse{A})}\text{.}\)
TheoremMISM.Matrix Inverse of a Scalar Multiple.
Suppose \(A\) is an invertible matrix and \(\alpha\) is a nonzero scalar. Then \(\inverse{\left(\alpha A\right)}=\frac{1}{\alpha}\inverse{A}\) and \(\alpha A\) is invertible.
Proof.
As with the proof of Theorem SS, we see if \(\frac{1}{\alpha}\inverse{A}\) is a suitable inverse for \(\alpha A\text{.}\)
The matrix \(\frac{1}{\alpha}\inverse{A}\) has met all the requirements to be the inverse of \(\alpha A\text{,}\) so we can write \(\inverse{\left(\alpha A\right)}=\frac{1}{\alpha}\inverse{A}\text{.}\)
Notice that there are some likely theorems that are missing here. For example, it would be tempting to think that \(\inverse{(A+B)}=\inverse{A}+\inverse{B}\text{,}\) but this is false. Can you find a counterexample? (See Exercise MISLE.T10.)
If it exists, find the inverse of \(A = \begin{bmatrix} 2 & -1 & 1\\ 1 & 2 & 1\\3 & 1 & 2\end{bmatrix}\text{,}\) and check your answer.
Solution.
The procedure we have for finding a matrix inverse fails for this matrix \(A\) since \(A\) does not row-reduce to \(I_3\text{.}\) We suspect in this case that \(A\) is not invertible, although we do not yet know that concretely. (Stay tuned for upcoming revelations in Section MINM!)
C18.
If it exists, find the inverse of \(A = \begin{bmatrix} 1 & 3 & 1 \\ 1 & 2 & 1 \\ 2 & 2 & 1\end{bmatrix}\text{,}\) and check your answer.
Employ the matrix \(B\) to solve the two linear systems \(\linearsystem{A}{\vect{c}}\) and \(\linearsystem{A}{\vect{d}}\text{.}\)
Solution.
Represent each of the two systems by a vector equality, \(A\vect{x}=\vect{c}\) and \(A\vect{y}=\vect{d}\text{.}\) Then in the spirit of Example SABMI, solutions are given by
Notice how we could solve many more systems having \(A\) as the coefficient matrix, and how each such system has a unique solution. You might check your work by substituting the solutions back into the systems of equations, or forming the linear combinations of the columns of \(A\) suggested by Theorem SLSLC.
C23.
If it exists, find the inverse of the \(2\times 2\) matrix \(A\) and check your answer. (See Theorem TTMI.)
Compute the inverse of \(D\text{,}\)\(\inverse{D}\text{,}\) by forming the \(5\times 10\) matrix \(\augmented{D}{I_5}\) and row-reducing (Theorem CINM). Then use a calculator to compute \(\inverse{D}\) directly.
Compute the inverse of \(E\text{,}\)\(\inverse{E}\text{,}\) by forming the \(5\times 10\) matrix \(\augmented{E}{I_5}\) and row-reducing (Theorem CINM). Then use a calculator to compute \(\inverse{E}\) directly.
Solution.
The matrix \(E\) has no inverse, though we do not yet have a theorem that allows us to reach this conclusion. However, when row-reducing the matrix \(\augmented{E}{I_5}\text{,}\) the first 5 columns will not row-reduce to the \(5\times 5\) identity matrix, so we are at a loss on how we might compute the inverse. When requesting that your calculator compute \(\inverse{E}\text{,}\) it should give some indication that \(E\) does not have an inverse.
Compute the inverse of \(C\text{,}\)\(\inverse{C}\text{,}\) by forming the \(4\times 8\) matrix \(\augmented{C}{I_4}\) and row-reducing (Theorem CINM). Then use a calculator to compute \(\inverse{C}\) directly.
View this system as \(\linearsystem{C}{\vect{b}}\text{,}\) where \(C\) is the \(4\times 4\) matrix from Exercise MISLE.C28 and \(\vect{b}=\colvector{-4\\4\\-20\\9}\text{.}\) Since \(C\) was seen to be nonsingular in Exercise MISLE.C28Theorem SNCM says the solution, which is unique by Theorem NMUS, is given by
and the vector of constants is \(\vect{b}=\colvector{-3\\-4\\2}\text{.}\) So by Theorem SLEMM we can convert the system to the form \(A\vect{x}=\vect{b}\text{.}\) Row-reducing this matrix yields the identity matrix so by Theorem NMRRI we know \(A\) is nonsingular. This allows us to apply Theorem SNCM to find the unique solution as
We can reformulate the linear system as a vector equality with a matrix-vector product via Theorem SLEMM. The system is then represented by \(A\vect{x}=\vect{b}\) where
According to Theorem SNCM, if \(A\) is nonsingular then the (unique) solution will be given by \(\inverse{A}\vect{b}\text{.}\) We attempt the computation of \(\inverse{A}\) through Theorem CINM, or with our favorite computational device and obtain,
Construct an example to demonstrate that \(\inverse{(A+B)}=\inverse{A}+\inverse{B}\) is not true for all square matrices \(A\) and \(B\) of the same size.
Solution.
For a large collection of small examples, let \(D\) be any \(2\times 2\) matrix that has an inverse (Theorem TTMI can help you construct such a matrix, \(I_2\) is a simple choice). Set \(A=D\) and \(B=(-1)D\text{.}\) While \(\inverse{A}\) and \(\inverse{B}\) both exist, what is \(\inverse{\left(A+B\right)}\text{?}\)
For a large collection of examples of any size, consider \(A=B=I_n\text{.}\) Can the proposed statement be salvaged to become a theorem?