FCLA Matrix Inverses and Systems of Linear Equations

through judicious use of matrix multiplication. We know by Theorem NMUS that since the coefficient matrix in this example is nonsingular, there would be a unique solution, no matter what the choice of

b .

The derivation above amplifies this result, since we were forced to conclude that

x = B b

and the solution could not be anything else. You should notice that this argument would hold for any particular choice of

b .

The matrix

B

of the previous example is called the inverse of

A .

When

A

and

B

are combined via matrix multiplication, the result is the identity matrix, which can be inserted “in front” of

x

as the first step in finding the solution. This is entirely analogous to how we might solve a single linear equation like

3 x = 12 .

x = 1 x = (\frac{1}{3} (3)) x = \frac{1}{3} (3 x) = \frac{1}{3} (12) = 4

Here we have obtained a solution by employing the “multiplicative inverse” of

3,

3^{- 1} = \frac{1}{3} .

This works fine for any scalar multiple of

x,

except for zero, since zero does not have a multiplicative inverse. Consider separately the two linear equations

\begin{aligned} 0 x & = 12 & 0 x & = 0 . \end{aligned}

The first has no solutions, while the second has infinitely many solutions. For matrices, it is all just a little more complicated. Some matrices have inverses, some do not. And when a matrix does have an inverse, just how would we compute it? In other words, just where did that matrix

B

in the last example come from? Are there other matrices that might have worked just as well?

Subsection IM Inverse of a Matrix

Definition MI. Matrix Inverse.

Suppose

A

and

B

are square matrices of size

n

such that

A B = I_{n}

and

B A = I_{n} .

Then

A

is invertible and

B

is the inverse of

A .

In this situation, we write

B = A^{- 1} .
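
Definition MI asks for two products to equal the identity matrix, and that check is easy to mechanize. The following is a minimal sketch, not part of the original text: the helper names `mat_mul`, `identity`, and `is_inverse` are hypothetical, matrices are assumed to be stored as lists of rows, and the small 2 × 2 pair is an illustrative choice.

```python
def mat_mul(A, B):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)]
            for row in A]


def identity(n):
    """Return the n x n identity matrix as a list of rows."""
    return [[1 if i == j else 0 for j in range(n)] for i in range(n)]


def is_inverse(A, B):
    """Check the two conditions of Definition MI: AB = I_n and BA = I_n."""
    n = len(A)
    # Both matrices must be square of the same size n.
    square = (len(B) == n
              and all(len(row) == n for row in A)
              and all(len(row) == n for row in B))
    if not square:
        return False
    return mat_mul(A, B) == identity(n) and mat_mul(B, A) == identity(n)


# Illustrative pair of 2 x 2 matrices (an assumption, not from the text).
A = [[2, 1], [1, 1]]
B = [[1, -1], [-1, 2]]
print(is_inverse(A, B))
```

Both products are tested because the definition requires both; the sketch makes no attempt to exploit results that would allow one of the checks to be skipped.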

Notice that if

B

is the inverse of

A,

then we can just as easily say

A

is the inverse of

B,

or that

A

and

B

are inverses of each other.

Not every square matrix has an inverse. In Example SABMI the matrix

B

is the inverse of the coefficient matrix of Archetype B. To see this it only remains to check that

A B = I_{3} .

What about Archetype A? It is an example of a square matrix without an inverse.

Example MWIAA. A matrix without an inverse, Archetype A.

Consider the coefficient matrix from Archetype A.

A = [\begin{matrix} 1 & - 1 & 2 \\ 2 & 1 & 1 \\ 1 & 1 & 0 \end{matrix}]

We will show that

A

is a matrix with no inverse, with a proof by contradiction. To this end, suppose that

A

is invertible, and call its inverse the matrix

B .

Choose the vector of constants

b = [\begin{matrix} 1 \\ 3 \\ 2 \end{matrix}]

and consider the system of equations

LS (A, b) .

We could now proceed exactly as we did in Example SABMI, and employ the matrix

B

to determine a unique solution to this vector equation. Namely, the solution would be

x = B b .

In other words, the system is consistent.

However, we will now show the system

LS (A, b)

has no solutions. In other words, this system is inconsistent. Form the augmented matrix

[A | b]

and row-reduce to

[\begin{matrix} 1 & 0 & 1 & 0 \\ 0 & 1 & - 1 & 0 \\ 0 & 0 & 0 & 1 \end{matrix}]

which allows us to recognize the inconsistency by Theorem RCLS.

So the assumption that

A

has an inverse leads to a logical inconsistency, as the system cannot be both consistent and inconsistent. So our assumption of an inverse is false, and

A

is a matrix with no inverse (provably). So we say

A

is not invertible.
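
The row-reduction step in Example MWIAA can be reproduced with exact arithmetic. What follows is a sketch under stated assumptions: the function `rref` is a hypothetical helper written for this illustration (a plain Gauss-Jordan elimination over Python's `fractions.Fraction`), and the inconsistency test simply looks for a row that is zero in every coefficient column but nonzero in the final column.

```python
from fractions import Fraction


def rref(M):
    """Return the reduced row-echelon form of M, using exact fractions."""
    R = [[Fraction(x) for x in row] for row in M]
    rows, cols = len(R), len(R[0])
    pivot_row = 0
    for col in range(cols):
        if pivot_row == rows:
            break
        # Find a row at or below pivot_row with a nonzero entry in this column.
        src = next((r for r in range(pivot_row, rows) if R[r][col] != 0), None)
        if src is None:
            continue
        R[pivot_row], R[src] = R[src], R[pivot_row]
        lead = R[pivot_row][col]
        R[pivot_row] = [x / lead for x in R[pivot_row]]
        # Clear every other entry in the pivot column.
        for r in range(rows):
            if r != pivot_row and R[r][col] != 0:
                factor = R[r][col]
                R[r] = [x - factor * y for x, y in zip(R[r], R[pivot_row])]
        pivot_row += 1
    return R


# Coefficient matrix of Archetype A and the vector b from the example.
A = [[1, -1, 2], [2, 1, 1], [1, 1, 0]]
b = [1, 3, 2]
augmented = [row + [rhs] for row, rhs in zip(A, b)]
reduced = rref(augmented)

# A row of the form [0 ... 0 | nonzero] signals an inconsistent system.
inconsistent = any(all(x == 0 for x in row[:-1]) and row[-1] != 0
                   for row in reduced)
print(reduced)
print(inconsistent)
```

Using `Fraction` rather than floating-point numbers keeps the zero tests exact, which matters when the conclusion depends on an entry being exactly zero.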

It is possible this example is less than satisfying. Just where did that particular choice of the vector

b

come from anyway? Stay tuned for an application of the future Theorem CSCS in Example CSAA.

Let us look at one more matrix inverse before we embark on a more systematic study.

Example MI. Matrix inverse.

Consider the matrices

\begin{aligned} A & = [\begin{matrix} 1 & 2 & 1 & 2 & 1 \\ - 2 & - 3 & 0 & - 5 & - 1 \\ 1 & 1 & 0 & 2 & 1 \\ - 2 & - 3 & - 1 & - 3 & - 2 \\ - 1 & - 3 & - 1 & - 3 & 1 \end{matrix}] & B & = [\begin{matrix} - 3 & 3 & 6 & - 1 & - 2 \\ 0 & - 2 & - 5 & - 1 & 1 \\ 1 & 2 & 4 & 1 & - 1 \\ 1 & 0 & 1 & 1 & 0 \\ 1 & - 1 & - 2 & 0 & 1 \end{matrix}] . \end{aligned}

Then

\begin{aligned} A B & = [\begin{matrix} 1 & 2 & 1 & 2 & 1 \\ - 2 & - 3 & 0 & - 5 & - 1 \\ 1 & 1 & 0 & 2 & 1 \\ - 2 & - 3 & - 1 & - 3 & - 2 \\ - 1 & - 3 & - 1 & - 3 & 1 \end{matrix}] [\begin{matrix} - 3 & 3 & 6 & - 1 & - 2 \\ 0 & - 2 & - 5 & - 1 & 1 \\ 1 & 2 & 4 & 1 & - 1 \\ 1 & 0 & 1 & 1 & 0 \\ 1 & - 1 & - 2 & 0 & 1 \end{matrix}] = [\begin{matrix} 1 & 0 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 0 & 1 \end{matrix}] \end{aligned}

and

\begin{aligned} B A & = [\begin{matrix} - 3 & 3 & 6 & - 1 & - 2 \\ 0 & - 2 & - 5 & - 1 & 1 \\ 1 & 2 & 4 & 1 & - 1 \\ 1 & 0 & 1 & 1 & 0 \\ 1 & - 1 & - 2 & 0 & 1 \end{matrix}] [\begin{matrix} 1 & 2 & 1 & 2 & 1 \\ - 2 & - 3 & 0 & - 5 & - 1 \\ 1 & 1 & 0 & 2 & 1 \\ - 2 & - 3 & - 1 & - 3 & - 2 \\ - 1 & - 3 & - 1 & - 3 & 1 \end{matrix}] = [\begin{matrix} 1 & 0 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 0 & 1 \end{matrix}] \end{aligned}

so by Definition MI, we can say that

A

is invertible and write

B = A^{- 1} .
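
The two products in Example MI involve fifty dot products, so a mechanical check may be reassuring. This is a small sketch rather than anything from the original text: `mat_mul` and `identity` are hypothetical helper names, and the matrices are entered as lists of rows copied from the example.

```python
def mat_mul(A, B):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)]
            for row in A]


def identity(n):
    """Return the n x n identity matrix as a list of rows."""
    return [[1 if i == j else 0 for j in range(n)] for i in range(n)]


# Matrices A and B of Example MI.
A = [
    [1, 2, 1, 2, 1],
    [-2, -3, 0, -5, -1],
    [1, 1, 0, 2, 1],
    [-2, -3, -1, -3, -2],
    [-1, -3, -1, -3, 1],
]
B = [
    [-3, 3, 6, -1, -2],
    [0, -2, -5, -1, 1],
    [1, 2, 4, 1, -1],
    [1, 0, 1, 1, 0],
    [1, -1, -2, 0, 1],
]

# Definition MI requires both products to equal the identity matrix.
print(mat_mul(A, B) == identity(5))
print(mat_mul(B, A) == identity(5))
```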

We will now concern ourselves less with whether an inverse of a matrix exists, and more with how to find one when it does exist. In Section MINM we will have some theorems that allow us to more quickly and easily determine just when a matrix is invertible.

Subsection CIM Computing the Inverse of a Matrix

We have just seen inverses of matrices in Example SABMI and Example MI, but these inverse matrices have just dropped from the sky. How would we compute an inverse? And just when is a matrix invertible, and when is it not? Writing a putative inverse with

n^{2}

unknowns and solving the resulting

n^{2}

equations is one approach. Applying this approach to

2 \times 2

matrices can get us somewhere, so just for fun, let us do it.

Theorem TTMI. Two-by-Two Matrix Inverse.

Suppose

A = [\begin{matrix} a & b \\ c & d \end{matrix}] .

Then

A

is invertible if and only if

a d - b c \neq 0 .

When

A

is invertible, then

A^{- 1} = \frac{1}{a d - b c} [\begin{matrix} d & - b \\ - c & a \end{matrix}] .
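
The formula of Theorem TTMI translates directly into a few lines of code. The sketch below is an illustration only: the function name `inverse_2x2` is hypothetical, the matrix is assumed to be a list of two rows with integer or fractional entries, and returning `None` for a matrix that is not invertible is a design choice made here, not something the theorem prescribes.

```python
from fractions import Fraction


def inverse_2x2(A):
    """Apply the two-by-two inverse formula; return None when ad - bc = 0."""
    (a, b), (c, d) = A
    det = a * d - b * c
    if det == 0:
        # By the theorem, A is not invertible in this case.
        return None
    scale = Fraction(1) / Fraction(det)
    return [[scale * d, -scale * b],
            [-scale * c, scale * a]]


# Illustrative inputs (assumptions, not taken from the text).
print(inverse_2x2([[1, 2], [3, 4]]))   # here ad - bc = -2
print(inverse_2x2([[1, 2], [2, 4]]))   # here ad - bc = 0
```

Exact fractions are used so that the test `det == 0` means precisely the condition in the theorem; with floating-point entries that comparison would need more care.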

Proof.

(⇐)

Assume that

a d - b c \neq 0 .

We will use the definition of the inverse of a matrix to establish that

A

has an inverse (Definition MI). Note that if

a d - b c \neq 0

then the displayed formula for

A^{- 1}

is legitimate, since we are not dividing by zero. Using this proposed formula for the inverse of

A,

we simply compute

\begin{aligned} A A^{- 1} & = [\begin{matrix} a & b \\ c & d \end{matrix}] (\frac{1}{a d - b c} [\begin{matrix} d & - b \\ - c & a \end{matrix}]) = \frac{1}{a d - b c} [\begin{matrix} a d - b c & 0 \\ 0 & a d - b c \end{matrix}] = [\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix}] \end{aligned}

and

\begin{aligned} A^{- 1} A & = \frac{1}{a d - b c} [\begin{matrix} d & - b \\ - c & a \end{matrix}] [\begin{matrix} a & b \\ c & d \end{matrix}] = \frac{1}{a d - b c} [\begin{matrix} a d - b c & 0 \\ 0 & a d - b c \end{matrix}] = [\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix}] . \end{aligned}

By Definition MI this is sufficient to establish that

A

is invertible, and that the expression for

A^{- 1}

is correct.

(⇒)

Assume that

A

is invertible, and proceed with a proof by contradiction (Proof Technique CD), by assuming also that

a d - b c = 0 .

This translates to

a d = b c .

Let

B = [\begin{matrix} e & f \\ g & h \end{matrix}]

be a putative inverse of

A .

This means that

I_{2} = A B = [\begin{matrix} a & b \\ c & d \end{matrix}] [\begin{matrix} e & f \\ g & h \end{matrix}] = [\begin{matrix} a e + b g & a f + b h \\ c e + d g & c f + d h \end{matrix}]

Working on the matrices on the two ends of this equation, we will multiply the top row by

c

and the bottom row by

a .

[\begin{matrix} c & 0 \\ 0 & a \end{matrix}] = [\begin{matrix} a c e + b c g & a c f + b c h \\ a c e + a d g & a c f + a d h \end{matrix}]

We are assuming that

a d = b c,

so we can replace two occurrences of

a d

by

b c

in the bottom row of the right matrix.

[\begin{matrix} c & 0 \\ 0 & a \end{matrix}] = [\begin{matrix} a c e + b c g & a c f + b c h \\ a c e + b c g & a c f + b c h \end{matrix}]

The matrix on the right now has two rows that are identical, and therefore the same must be true of the matrix on the left. Identical rows for the matrix on the left implies that

a = 0

and

c = 0 .

With this information, the product

A B

becomes

[\begin{matrix} 1 & 0 \\ 0 & 1 \end{matrix}] = I_{2} = A B = [\begin{matrix} a e + b g & a f + b h \\ c e + d g & c f + d h \end{matrix}] = [\begin{matrix} b g & b h \\ d g & d h \end{matrix}]

Comparing the diagonal entries of the matrices at the two ends of this equation, we see that

b g = d h = 1

and thus

b, g, d, h

are all nonzero. But then

b h

and

d g

(the “other corners”) must also be nonzero, while the identity matrix has zeros in those positions, so this is (finally) a contradiction. So our assumption was false and we see that

a d - b c \neq 0

whenever

A

has an inverse.

There are several ways one could try to prove this theorem, but there is a continual temptation to divide by one of the eight entries involved (

a

through

h

), but we can never be sure if these numbers are zero or not. This could lead to an analysis by cases, which is messy, messy, messy. Note how the above proof never divides, but always multiplies, and how zero/nonzero considerations are handled. Pay attention to the expression

a d - b c,

as we will see it again in a while (Chapter D).

This theorem is cute, and it is nice to have a formula for the inverse, and a condition that tells us when we can use it. However, this approach becomes impractical for larger matrices, even though it is possible to demonstrate that, in theory, there is a general formula. (Think for a minute about extending this result to just

3 \times 3

matrices. For starters, we need 18 letters!) Instead, we will work column-by-column. Let us first work an example that will motivate the main theorem and remove some of the previous mystery.

Example CMI. Computing a matrix inverse.

Consider the matrix defined in Example MI.

A = [\begin{matrix} 1 & 2 & 1 & 2 & 1 \\ - 2 & - 3 & 0 & - 5 & - 1 \\ 1 & 1 & 0 & 2 & 1 \\ - 2 & - 3 & - 1 & - 3 & - 2 \\ - 1 & - 3 & - 1 & - 3 & 1 \end{matrix}]

For its inverse, we desire a matrix

B

so that

A B = I_{5} .

Emphasizing the structure of the columns and employing the definition of matrix multiplication (Definition MM), we have

\begin{aligned} A B & = I_{5} \\ A [B_{1} | B_{2} | B_{3} | B_{4} | B_{5}] & = [e_{1} | e_{2} | e_{3} | e_{4} | e_{5}] \\ [A B_{1} | A B_{2} | A B_{3} | A B_{4} | A B_{5}] & = [e_{1} | e_{2} | e_{3} | e_{4} | e_{5}] . \end{aligned}

Equating the matrices column-by-column we have

\begin{aligned} A B_{1} & = e_{1} & A B_{2} & = e_{2} & A B_{3} & = e_{3} & A B_{4} & = e_{4} & A B_{5} & = e_{5} . \end{aligned}

Since the matrix

B

is what we are trying to compute, we can view each column,

B_{i},

as a column vector of unknowns in a linear system of equations. Then we have five systems of equations to solve, each with 5 equations in 5 variables. Notice that all 5 of these systems have the same coefficient matrix. We will now solve each system in turn.
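
The column-by-column plan just described can be sketched in code. This is an illustration under stated assumptions, not the procedure of the text: `solve` and `inverse_by_columns` are hypothetical names, each system is solved by a plain Gauss-Jordan elimination over exact fractions, and the coefficient matrix is assumed to be square and nonsingular (otherwise `solve` raises an error).

```python
from fractions import Fraction


def solve(A, b):
    """Solve A x = b by Gauss-Jordan elimination (A square, nonsingular)."""
    n = len(A)
    M = [[Fraction(x) for x in row] + [Fraction(rhs)]
         for row, rhs in zip(A, b)]
    for col in range(n):
        # Find a usable pivot at or below the diagonal position.
        src = next((r for r in range(col, n) if M[r][col] != 0), None)
        if src is None:
            raise ValueError("coefficient matrix is singular")
        M[col], M[src] = M[src], M[col]
        lead = M[col][col]
        M[col] = [x / lead for x in M[col]]
        for r in range(n):
            if r != col and M[r][col] != 0:
                factor = M[r][col]
                M[r] = [x - factor * y for x, y in zip(M[r], M[col])]
    return [row[-1] for row in M]


def inverse_by_columns(A):
    """Build B one column at a time by solving A B_i = e_i for each i."""
    n = len(A)
    columns = []
    for i in range(n):
        e_i = [1 if j == i else 0 for j in range(n)]
        columns.append(solve(A, e_i))
    # The solutions are the columns of B, so transpose to get rows.
    return [list(row) for row in zip(*columns)]


# The matrix A of Example MI, and the matrix B given there for comparison.
A = [
    [1, 2, 1, 2, 1],
    [-2, -3, 0, -5, -1],
    [1, 1, 0, 2, 1],
    [-2, -3, -1, -3, -2],
    [-1, -3, -1, -3, 1],
]
B = [
    [-3, 3, 6, -1, -2],
    [0, -2, -5, -1, 1],
    [1, 2, 4, 1, -1],
    [1, 0, 1, 1, 0],
    [1, -1, -2, 0, 1],
]
computed = inverse_by_columns(A)
print(computed == B)
```

Solving the five systems separately repeats the same elimination on the same coefficient matrix five times; the sketch keeps that redundancy on purpose, since it mirrors the observation that all five systems share one coefficient matrix.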
