Subsection Stamps
Investigate!
You need to mail a package, but don’t yet know how much postage you will need. You have a large supply of 8-cent stamps and 5-cent stamps. Which amounts of postage can you make exactly using these stamps? Which amounts are impossible to make?
Perhaps in investigating the problem above you picked some amounts of postage, and then figured out whether you could make that amount using just 8-cent and 5-cent stamps. Perhaps you did this in order: can you make 1 cent of postage? Can you make 2 cents? 3 cents? And so on. If this is what you did, you were actually answering a
sequence of questions. We have methods for dealing with sequences. Let’s see if that helps.
Actually, we will not make a sequence of questions, but rather a sequence of statements. Let
be the statement “you can make
cents of postage using just 8-cent and 5-cent stamps.” Since for each value of
is a statement, it is either true or false. So if we form the sequence of statements
the sequence will consist of
’s (for true) and
’s (for false). In our particular case the sequence starts
because
are all false (you cannot make 1, 2, 3, or 4 cents of postage) but
is true (use one 5-cent stamp), and so on.
Let’s think a bit about how we could find the value of
for some specific
(the “value” will be either
or
). How did we find the value of the
th term of a sequence of numbers? How did we find
There were two ways we could do this: either there was a closed formula for
so we could plug in
into the formula and get our output value, or we had a recursive definition for the sequence, so we could use the previous terms of the sequence to compute the
th term. When dealing with sequences of statements, we could use either of these techniques as well. Maybe there is a way to use
itself to determine whether we can make
cents of postage. That would be something like a closed formula. Or instead we could use the previous terms in the sequence (of statements) to determine whether we can make
cents of postage. That is, if we know the value of
can we get from that to the value of
That would be something like a recursive definition for the sequence. Remember, finding recursive definitions for sequences was often easier than finding closed formulas. The same is true here.
Suppose I told you that
was true (it is). Can you determine from this fact the value of
(whether it true or false)? Yes you can. Even if we don’t know how exactly we made 43 cents out of the 5-cent and 8-cent stamps, we do know that there was some way to do it. What if that way used at least three 5-cent stamps (making 15 cents)? We could replace those three 5-cent stamps with two 8-cent stamps (making 16 cents). The total postage has gone up by 1, so we have a way to make 44 cents, so
is true. Of course, we assumed that we had at least three 5-cent stamps. What if we didn’t? Then we must have at least three 8-cent stamps (making 24 cents). If we replace those three 8-cent stamps with five 5-cent stamps (making 25 cents) then again we have bumped up our total by 1 cent so we can make 44 cents, so
is true.
Notice that we have not said how to make 44 cents, just that we can, on the basis that we can make 43 cents. How do we know we can make 43 cents? Perhaps because we know we can make
cents, which we know we can do because we know we can make 41 cents, and so on. It’s a recursion! As with a recursive definition of a numerical sequence, we must specify our initial value. In this case, the initial value is “
is false.” That’s not good, since our recurrence relation just says that
is true
if is also true. We need to start the process with a true
So instead, we might want to use “
is true” as the initial condition.
Putting this all together we arrive at the following fact: it is possible to (exactly) make any amount of postage greater than 27 cents using just 5-cent and 8-cent stamps. In other words,
is true for any
To prove this, we could do the following:
Demonstrate that is true.
Prove that if is true, then is true (for any ).
Suppose we have done this. Then we know that the 28th term of the sequence above is a
(using step 1, the initial condition or
base case), and that every term after the 28th is
also (using step 2, the recursive part or
inductive case). Here is what the proof would actually look like.
Proof.
Let
be the statement “it is possible to make exactly
cents of postage using 5-cent and 8-cent stamps.” We will show
is true for all
First, we show that
is true:
so we can make
cents using four 5-cent stamps and one 8-cent stamp.
Now suppose
is true for some arbitrary
Then it is possible to make
cents using 5-cent and 8-cent stamps. Note that since
it cannot be that we use fewer than three 5-cent stamps
and fewer than three 8-cent stamps: using two of each would give only 26 cents. Now if we have made
cents using at least three 5-cent stamps, replace three 5-cent stamps by two 8-cent stamps. This replaces 15 cents of postage with 16 cents, moving from a total of
cents to
cents. Thus
is true. On the other hand, if we have made
cents using at least three 8-cent stamps, then we can replace three 8-cent stamps with five 5-cent stamps, moving from 24 cents to 25 cents, giving a total of
cents of postage. So in this case as well
is true.
Therefore, by the principle of mathematical induction,
is true for all
Subsection Examples
Here are some examples of proof by mathematical induction.
Example 2.5.1.
Prove for each natural number
that
Solution.
First, let’s think inductively about this equation. In fact, we know this is true for other reasons (reverse and add comes to mind). But why might induction be applicable? The left-hand side adds up the numbers from 1 to If we know how to do that, adding just one more term () would not be that hard. For example, if suppose we know that the sum of the first 100 numbers is (so which is true). Now to find the sum of the first 101 numbers, it makes more sense to just add 101 to 5050, instead of computing the entire sum again. We would have In fact, it would always be easy to add just one more term. This is why we should use induction.
Now the formal proof:
Proof.
Let be the statement We will show that is true for all natural numbers
Base case: is the statement which is clearly true.
Inductive case: Let be a natural number. Assume (for induction) that is true. That means We will prove that is true as well. That is, we must prove that To prove this equation, start by adding to both sides of the inductive hypothesis:
Now, simplifying the right side we get:
Thus is true, so by the principle of mathematical induction is true for all natural numbers
Note that in the part of the proof in which we proved
from
we used the equation
This was the inductive hypothesis. Seeing how to use the inductive hypotheses is usually straight forward when proving a fact about a sum like this. In other proofs, it can be less obvious where it fits in.
Example 2.5.2.
Prove that for all
is a multiple of 5.
Solution.
Again, start by understanding the dynamics of the problem. What does increasing do? Let’s try with a few examples. If then yes, is a multiple of 5. What does incrementing to 2 look like? We get which again is a multiple of 5. Next, but instead of just finding what did the increase in do? We will still subtract 1, but now we are multiplying by another 6 first. Viewed another way, we are multiplying a number which is one more than a multiple of 5 by 6 (because is a multiple of 5, so is one more than a multiple of 5). What do numbers which are one more than a multiple of 5 look like? They must have last digit 1 or 6. What happens when you multiply such a number by 6? Depends on the number, but in any case, the last digit of the new number must be a 6. And then if you subtract 1, you get last digit 5, so a multiple of 5.
The point is, every time we multiply by just one more six, we still get a number with last digit 6, so subtracting 1 gives us a multiple of 5. Now the formal proof:
Proof.
Let be the statement, “ is a multiple of 5.” We will prove that is true for all
Base case: is true: which is a multiple of 5.
Inductive case: Let be an arbitrary natural number. Assume, for induction, that is true. That is, is a multiple of Then for some integer This means that Multiply both sides by
But we want to know about so subtract 1 from both sides:
Of course so is a multiple of 5.
Therefore is a multiple of 5, or in other words, is true. Thus, by the principle of mathematical induction is true for all
We had to be a little bit clever (i.e., use some algebra) to locate the
inside of
before we could apply the inductive hypothesis. This is what can make inductive proofs challenging.
In the two examples above, we started with
or
We can start later if we need to.
Example 2.5.3.
Prove that
for all integers
Solution.
First, the idea of the argument. What happens when we increase by 1? On the left-hand side, we increase the base of the square and go to the next square number. On the right-hand side, we increase the power of 2. This means we double the number. So the question is, how does doubling a number relate to increasing to the next square? Think about what the difference of two consecutive squares looks like. We have This factors:
But doubling the right-hand side increases it by since When is large enough,
What we are saying here is that each time increases, the left-hand side grows by less than the right-hand side. So if the left-hand side starts smaller (as it does when ), it will never catch up. Now the formal proof:
Proof.
Let be the statement We will prove is true for all integers
Base case: is the statement Since and we see that is indeed true.
Inductive case: Let be an arbitrary integer. Assume, for induction, that is true. That is, assume We will prove that is true, i.e., To prove such an inequality, start with the left-hand side and work towards the right-hand side:
Following the equalities and inequalities through, we get in other words, Therefore by the principle of mathematical induction, is true for all
The previous example might remind you of the
racetrack principle from calculus, which says that if
and
for
then
for
Same idea: the larger function is increasing at a faster rate than the smaller function, so the larger function will stay larger. In discrete math, we don’t have derivatives, so we look at differences. Thus induction is the way to go.
Warning:.
With great power, comes great responsibility. Induction isn’t magic. It seems very powerful to be able to assume
is true. After all, we are trying to prove
is true and the only difference is in the variable:
vs.
Are we assuming that what we want to prove is true? Not really. We assume
is true only for the sake of proving that
is true.
Still you might start to believe that you can prove anything with induction. Consider this incorrect “proof” that every Canadian has the same eye color: Let
be the statement that any
Canadians have the same eye color.
is true, since everyone has the same eye color as themselves. Now assume
is true. That is, assume that in any group of
Canadians, everyone has the same eye color. Now consider an arbitrary group of
Canadians. The first
of these must all have the same eye color, since
is true. Also, the last
of these must have the same eye color, since
is true. So in fact, everyone the group must have the same eye color. Thus
is true. So by the principle of mathematical induction,
is true for all
Clearly something went wrong. The problem is that the proof that
implies
assumes that
We have only shown
is true. In fact,
is false.
Subsection Strong Induction
Investigate!
Start with a square piece of paper. You want to cut this square into smaller squares, leaving no waste (every piece of paper you end up with must be a square). Obviously it is possible to cut the square into 4 squares. You can also cut it into 9 squares. It turns out you can cut the square into 7 squares (although not all the same size). What other numbers of squares could you end up with?
Sometimes, to prove that
is true, it would be helpful to know that
and and are all true. Consider the following puzzle:
You have a rectangular chocolate bar, made up of
identical squares of chocolate. You can take such a bar and break it along any row or column. How many times will you have to break the bar to reduce it to
single chocolate squares?
At first, this question might seem impossible. Perhaps I meant to ask for the
smallest number of breaks needed? Let’s investigate.
Start with some small cases. If
you must have a
rectangle, which can be reduced to single pieces in one break. With
we must have a
bar, which requires two breaks: the first break creates a single square and a
bar, which we know takes one (more) break.
What about
Now we could have a
bar, or a
bar. In the first case, break the bar into two
bars, each which require one more break (that’s a total of three breaks required). If we started with a
bar, we have choices for our first break. We could break the bar in half, creating two
bars, or we could break off a single square, leaving a
bar. But either way, we still need two more breaks, giving a total of three.
It is starting to look like no matter how we break the bar (and no matter how the
squares are arranged into a rectangle), we will always have the same number of breaks required. It also looks like that number is one less than
Conjecture 2.5.4.
Given a
-square rectangular chocolate bar, it always takes
breaks to reduce the bar to single squares.
It makes sense to prove this by induction because after breaking the bar once, you are left with
smaller chocolate bars. Reducing to smaller cases is what induction is all about. We can inductively assume we already know how to deal with these smaller bars. The problem is, if we are trying to prove the inductive case about a
-square bar, we don’t know that after the first break the remaining bar will have
squares. So we really need to assume that our conjecture is true for all cases less than
Is it valid to make this stronger assumption? Remember, in induction we are attempting to prove that
is true for all
What if that were not the case? Then there would be some first
for which
was false. Since
is the
first counterexample, we know that
is true for all
Now we proceed to prove that
is actually true, based on the assumption that
is true for all smaller
This is quite an advantage: we now have a stronger inductive hypothesis. We can assume that
…
is true, just to show that
is true. Previously, we just assumed
for this purpose.
It is slightly easier if we change our variables for strong induction. Here is what the formal proof would look like:
Strong Induction Proof Structure.
Again, start by saying what you want to prove: “Let
be the statement…” Then establish two facts:
Base case: Prove that is true.
Inductive case: Assume is true for all Prove that is true.
Conclude, “therefore, by strong induction,
is true for all
”
Of course, it is acceptable to replace 0 with a larger base case if needed.
Let’s prove our conjecture about the chocolate bar puzzle:
Proof.
Let
be the statement, “it takes
breaks to reduce a
-square chocolate bar to single squares.”
Base case: Consider
The squares must be arranged into a
rectangle, and we require
breaks to reduce this to single squares.
Inductive case: Fix an arbitrary
and assume
is true for all
Consider a
-square rectangular chocolate bar. Break the bar once along any row or column. This results in two chocolate bars, say of sizes
and
That is, we have an
-square rectangular chocolate bar, a
-square rectangular chocolate bar, and
We also know that
and
so by our inductive hypothesis,
and
are true. To reduce the
-square bar to single squares takes
breaks; to reduce the
-square bar to single squares takes
breaks. Doing this results in our original bar being reduced to single squares. All together it took the initial break, plus the
and
breaks, for a total of
breaks. Thus
is true.
Therefore, by strong induction,
is true for all
Here is a more mathematically relevant example:
Example 2.5.5.
Prove that any natural number greater than 1 is either prime or can be written as the product of primes.
Solution.
First, the idea: if we take some number maybe it is prime. If so, we are done. If not, then it is composite, so it is the product of two smaller numbers. Each of these factors is smaller than (but at least 2), so we can repeat the argument with these numbers. We have reduced to a smaller case.
Now the formal proof:
Proof.
Let be the statement, “ is either prime or can be written as the product of primes.” We will prove is true for all
Base case: is true because is indeed prime.
Inductive case: assume is true for all We want to show that is true. That is, we want to show that is either prime or is the product of primes. If is prime, we are done. If not, then has more than 2 divisors, so we can write with and less than (and greater than 1). By the inductive hypothesis, and are each either prime or can be written as the product of primes. In either case, we have that is written as the product of primes.
Thus by the strong induction, is true for all
Whether you use regular induction or strong induction depends on the statement you want to prove. If you wanted to be safe, you could always use strong induction. It really is
stronger, so can accomplish everything “weak” induction can. That said, using regular induction is often easier since there is only one place you can use the induction hypothesis. There is also something to be said for
elegance in proofs. If you can prove a statement using simpler tools, it is nice to do so.
As a final contrast between the two forms of induction, consider once more the stamp problem. Regular induction worked by showing how to increase postage by one cent (either replacing three 5-cent stamps with two 8-cent stamps, or three 8-cent stamps with five 5-cent stamps). We could give a slightly different proof using strong induction. First, we could show
five base cases: it is possible to make 28, 29, 30, 31, and 32 cents (we would actually say how each of these is made). Now assume that it is possible to make
cents of postage for all
as long as
As long as
this means in particular we can make
cents. Now add a 5-cent stamp to get make
cents.