Section 3.1 Orthogonal sets of vectors
You may recall from elementary linear algebra, or a calculus class, that vectors in or are considered to be quantities with both magnitude and direction. Interestingly enough, neither of these properties is inherent to a general vector space. The vector space axioms specify only algebra; they say nothing about geometry. (What, for example, should be the βangleβ between two polynomials?)
Because vector algebra is often introduced as a consequence of geometry (like the βtip-to-tailβ rule), you may not have thought all that carefully about what, exactly, is responsible for making the connection between algebra and geometry. It turns out that the missing link is the humble dot product.
You probably encountered the following result, perhaps as a consequence of the law of cosines: for any two vectors
where is the angle between and Here we see both magnitude and direction (encoded by the angle) defined in terms of the dot product.
While it is possible to generalize the idea of the dot product to something called an inner product, we will first focus on the basic dot product in Once we have a good understanding of things in that setting, we can move on to consider the abstract counterpart.
Subsection 3.1.1 Basic definitions and properties
For most of this chapter (primarily for typographical reasons) we will denote elements of as ordered -tuples rather than as column vectors.
Note that both the dot product and the norm produce scalars. Through the Pythagorean Theorem, we recognize the norm as the length of The dot product can still be thought of as measuring the angle between vectors, although the simple geometric proof used in two dimensions is not that easily translated to dimensions. At the very least, the dot product lets us extend the notion of right angles to higher dimensions.
Definition 3.1.2.
It should be no surprise that all the familiar properties of the dot product work just as well in any dimension. The folowing properties can be confirmed by direct computation, so the proof is left as an exercise.
Theorem 3.1.3.
Remark 3.1.4.
The above properties, when properly abstracted, become the defining properties of a (real) inner product. (A complex inner product also involves complex conjugates.) For a general inner product, the requirement is referred to as being positive-definite, and the property that only the zero vector produces zero when dotted with itself is called nondegenerate. Note that we have the following connection between norm and dot product:
For a general inner product, this can be used as a definition of the norm associated to an inner product.
Exercise 3.1.5.
Hint.
Use properties of the dot product to expand and simplify.
Exercise 3.1.6.
Hint.
Donβt forget to prove both directions! Note that the hypothesis allows you to write as a linear combination of the
There are two important inequalities associated to the dot product and norm. We state them both in the following theorem, without proof.
Theorem 3.1.7.
The first of the above inequalities is called the Cauchy-Schwarz inequality, which be viewed as a manifestation of the formula
The usual proof involves some algebraic trickery; the interested reader is invited to search online for the Cauchy-Schwarz inequality, where they will find no shortage of websites offering proofs.
The second result, called the triangle inequality, follows immediately from the Cauchy-Scwarz inequality and Exercise 3.1.5:
The triangle inequality gets its name from the βtip-to-tailβ picture for vector addition. Essentially, it tells us that the length of any side of a triangle must be less than the sum of the lengths of the other two sides. The importance of the triangle inequality is that it tells us that the norm can be used to define distance.
Definition 3.1.8.
Remark 3.1.9.
Using properties of the norm, we can show that this distance function meets the criteria of whatβs called a metric. A metric is any function that takes a pair of vectors (or points) as input, and returns a number as output, with the following properties:
for any and if and only if for any
We leave it as an exercise to confirm that the distance function defined above is a metric.
In more advanced courses (e.g. topology or analysis) you might go into detailed study of these structures. There are three interrelated structures: inner products, norms, and metrics. You might consider questions like: does every norm come from an inner product? Does every metric come from a norm? (No.) Things get even more interesting for infinite-dimensional spaces. Of special interest are spaces such as Hilbert spaces (a special type of infinite-dimensional inner product space) and Banach spaces (a special type of infinite-dimensional normed space).
Exercise 3.1.10.
- Yes!
- You should find that the dot product is
not so these vectors are not orthogonal. - You might be tempted to say that the zero vector is orthogonal to everything, but we canβt compare vectors from different vector spaces!
- Yes! We have to be careful of signs here:
Select all vectors that are orthogonal to the vector
Exercise 3.1.11.
True.
- Consider
and False.
- Consider
and
Subsection 3.1.2 Orthogonal sets of vectors
In Chapter 1, we learned that linear independence and span are important concepts associated to a set of vectors. In this chapter, we learn what it means for a set of vectors to be orthogonal, and try to understand why this concept is just as important as independence and span.
Definition 3.1.12.
Exercise 3.1.13.
Hint.
The dot product of the fourth vector with each vector above must be zero. Can you turn this requirement into a system of equations?
Exercise 3.1.14.
True.
- Try to construct an example. The vector
has to be orthogonal to but is there any reason it has to be orthogonal to or False.
- Try to construct an example. The vector
has to be orthogonal to but is there any reason it has to be orthogonal to or
The requirement that the vectors in an orthogonal set be nonzero is partly because the alternative would be boring, and partly because it lets us state the following theorem.
Theorem 3.1.15.
Any orthogonal set of vectors is linearly independent.
Strategy.
Any proof of linear independence should start by defining our set of vectors, and assuming that a linear combination of these vectors is equal to the zero vector, with the goal of showing that the scalars have to be zero.
Set up the equation (say, ), with the assumption that your set of vectors is orthogonal. What happens if you take the dot product of both sides with one of these vectors?
Proof.
Suppose is orthogonal, and suppose
for scalars Taking the dot product of both sides of the above equation with gives
Since we must have We similarly find that all the remaining scalars are zero by taking the dot product with
Another useful consequence of orthogonality: in two dimensions, we have the Pythagorean Theorem for right-angled triangles. If the βlegsβ of the triangle are identified with vectors and and the hypotenuse with then since
In dimensions, we have the following, which follows from the fact that all βcross termsβ (dot products of different vectors) will vanish.
Theorem 3.1.16. Pythagorean Theorem.
Strategy.
Remember that
and use the distributive property of the dot product, along with the fact that each pair of different vectors is orthogonal.
Our final initial result about orthogonal sets of vectors relates to span. In general, we know that if then it is possible to solve for scalars such that The trouble is that finding these scalars generally involves setting up, and then solving, a system of linear equations. The great thing about orthogonal sets of vectors is that we can provide explicit formulas for the scalars.
Theorem 3.1.17. Fourier expansion theorem.
Strategy.
Take the same approach you used in the proof of Theorem 3.1.15, but this time, with a nonzero vector on the right-hand side.
Proof.
Let Taking the dot product of both sides of this equation with gives
since the dot product of with for is zero.
One use of Theorem 3.1.17 is determining whether or not a given vector is in the span of an orthogonal set. If it is in the span, then its coefficients must satisfy the Fourier expansion formula. Therefore, if we compute the right hand side of the above formula and do not get our original vector, then that vector must not be in the span.
Exercise 3.1.18.
The Fourier expansion is especially simple if our basis vectors have norm one, since the denominators in each coefficient disappear. Recall that a unit vector in is any vector with For any nonzero vector a unit vector (that is, a vector of norm one) in the direction of is given by
We often say that the vector is normalized. (The convention of using a βhatβ for unit vectors is common but not universal.)
Exercise 3.1.19.
Definition 3.1.20.
A basis of is called an orthonormal basis if is orthogonal, and all the vectors in are unit vectors.
Example 3.1.21.
In Exercise 3.1.13 we saw that the set
is orthogonal. Since itβs orthogonal, it must be independent, and since itβs a set of four independent vectors in it must be a basis. To get an orthonormal basis, we normalize each vector:
The process of creating unit vectors does typically introduce square root coefficients in our vectors. This can seem undesirable, but there remains value in having an orthonormal basis. For example, suppose we wanted to write the vector in terms of our basis. We can quickly compute
and so
Thereβs still work to be done, but it is comparatively simpler than solving the corresponding system of equations.
Exercises 3.1.3 Exercises
1.
2.
3.
Hint.
Use properties of the dot product to expand and simplify.
4.
5.
Find two linearly independent vectors perpendicular to the vector
You have attempted 1 of 14 activities on this page.