Skip to main content

Section 1.1 Definition and examples

Let’s recall what we know about vectors in R2. Writing v=x,y for the vector pointing from (0,0) to (x,y), we define:
  1. Addition: x1,y1+x2,y2=x1+y1,x2+y2
  2. Scalar multiplication: cx,y=cx,cy, where c is a real number, or scalar.
We can then observe a number of properties enjoyed by these operations. In your first course, you may have observed some of these properties geometrically, using the “tip-to-tail” rule for vector addition, as shown in Figure 1.1.1
Two vectors in the plane, with the tail of the second vector placed at the tip of the first. The vector drawn from the tail of the first vector to the tip of the second is the sum of the two vectors.
Figure 1.1.1.
  1. Vector addition is commutative. That is, for any vectors v=a,b and w=c,d, we have v+w=w+v.
    This is true because addition is commutative for the real numbers:
    v+w=a+c,b+d=c+a,d+b=w+v.
  2. Vector addition is associative. That is, for any vectors u=a,b,v=c,d and w=p,q, we have
    u+(v+w)=(u+v)+w.
    This tells us that placement of parentheses doesn’t matter, which is essential for extending addition (which is defined as an operation on two vectors) to sums of three or more vectors.
    Again, this property is true because it is true for real numbers:
    u+(v+w)=a,b+c+p,d+q=a+(c+p),b+(d+q)=(a+c)+p,(b+d)+q=a+c,b+d+p,q=(u+v)+w.
  3. Vector addition has an identity element. This is a vector that has no effect when added to another vector, or in other words, the zero vector. Again, it inherits its property from the behaviour of the real number 0.
    For any v=a,b, the vector 0=0,0 satisfies v+0=0+v=v:
    a+0,b+0=0+a,0+b=a,b.
  4. Every vector has an inverse with respect to addition, or, in other words, a negative. Given a vector v=a,b, the vector v=a,b satisfies
    v+(v)=v+v=0.
    (We will leave this one for you to check.)
  5. Scalar multiplication is compatible with addition in two different ways. First, it is distributive over vector addition: for any scalar k and vectors v=a,b,w=c,d, we have k(v+w)=kv+kw.
    Unsurprisingly, this property is inherited from the distributive property of the real numbers:
    k(v+w)=ka+c,b+d=k(a+c),k(b+d)=ka+kc,kb+kd=ka,kb+kc,kdka,b+kc,d=kv+kw.
  6. Second, scalar multiplication is also distributive with respect to scalar addition: for any scalars c and d and vector v, we have (c+d)v=cv+dv.
    Again, this is because real number addition is distributive:
    (c+d)a,b=(c+d)a,(c+d)b=ca+da,cb+db=ca,cb+da,dbca,b+da,b.
  7. Scalar multiplication is also associative. Given scalars c,d and a vector v=a,b, we have c(dv)=(cd)v.
    This is inherited from the associativity of real number multiplication:
    c(dv)=cda,db=c(da),c(db)=(cd)a,(cd)b=(cd)a,b.
  8. Finally, there is a “normalization” result for scalar multiplication. For any vector v, we have 1v=v. That is, the real number 1 acts as an identity element with respect to scalar multiplication. (You can check this one yourself.)
You might be wondering why we bother to list the last property above. It’s true, but why do we need it? One reason comes from basic algebra, and solving equations. Suppose we have the equation cv=w, where c is some nonzero scalar, and we want to solve for v. Very early in our algebra careers, we learn that to solve, we “divide by c”.
Division doesn’t quite make sense in this context, but it certainly does make sense to multiply both sides by 1/c, the multiplicative inverse of c. We then have (1c)(cv)=(1c)w, and since scalar multiplication is associative, (1cc)v=(1c)w. We know that 1cc=1, so this boils down to 1v=(1/c)w. It appears that we’ve solved the equation, but only if we know that 1v=v.
For an example where this fails, take our vectors as above, but redefine the scalar multiplication as ca,b=ca,0. The distributive and associative properties for scalar multiplication will still hold, but the normalization property fails. Algebra becomes very strange with this version of scalar multiplication. In particular, we can no longer conclude that if 2v=2w, then v=w!

Exercise 1.1.2.

Given an example of vectors v and w such that 2v=2w, but vw, if scalar multiplication is defined as above.
In a first course in linear algebra, these algebraic properties of vector addition and scalar multiplication are presented as a theorem. (After all, we have just demonstrated the truth of these results.) A second course in linear algebra (and in particular, abstract linear algebra), begins by taking that theorem and turning it into a definition. We will then do some exploration, to see if we can come up with some other examples that fit the definition; the significance of this is that we can expect the algebra in these examples to behave in essentially the same way as the vectors we’re familiar with.

Definition 1.1.3.

A real vector space (or vector space over R) is a nonempty set V, whose objects are called vectors, equipped with two operations:
  1. Addition, which is a map from V×V to V that associates each ordered pair of vectors (v,w) to a vector v+w, called the sum of v and w.
  2. Scalar multiplication, which is a map from R×V to V that associates each real number c and vector v to a vector cv.
The operations of addition and scalar multiplication are required to satisfy the following axioms:
A1.
If u,vV, then u+vV. (Closure under addition)
A2.
For all u,vV, u+v=v+u. (Commutativity of addition)
A3.
For all u,v,wV, u+(v+w)=(u+v)+w. (Associativity of addition)
A4.
There exists an element 0V such that v+0=v for each vV. (Existence of a zero vector)
A5.
For each vV, there exists a vector vV such that v+(v)=0. (Existence of negatives)
S1.
If vV, then cvV for all cR. (Closure under scalar multiplication)
S2.
For all cR and v,wV, c(v+w)=cv+cw. (Distribution over vector addition)
S3.
For all a,bR and vV, (a+b)v=av+bv. (Distribution over scalar addition)
S4.
For all a,bR and vV, a(bv)=(ab)v. (Associativity of scalar multiplication)
S5.
For all vV, 1v=v. (Normalization of scalar multiplication)
Note that a zero vector must exist in every vector space. This simple observation is a key component of many proofs and counterexamples in linear algebra. In general, we may define a vector space whose scalars belong to a field F. A field is a set of objects whose algebraic properties are modelled after those of the real numbers.
The axioms for a field are not all that different than those for a vector space. The main difference is that in a field, multiplication is defined between elements of the field (and produces another element of the field), while scalar multiplication combines elements of two different sets.

Definition 1.1.4.

A field is a set F, equipped with two binary operations F×FF:
(a,b)a+b(a,b)ab,
such that the following axioms are satisfied:
  1. A1: for all a,bF,a+b=b+a.
  2. A2: for all a,b,cF,a+(b+c)=(a+b)+c
  3. A3: there exists an element 0F such that 0+a=a for all aF.
  4. A4: for each aF, there exists an element aF such that a+a=0.
  5. M1: for all a,bF, ab=ba.
  6. M2: for all a,b,cF, a(bc)=(ab)c.
  7. M3: there exists an element 1F such that 1a=a for all aF.
  8. M4: for each aF with a0, there exists an element 1/aF such that 1/aa=1.
  9. D: for all a,b,cF, a(b+c)=ab+ac.
Note how the axioms for multiplication in a field mirror the addition axioms much more closely than in a vector space. The only difference is the fact that there is one element without a multiplicative inverse; namely, the zero element.
While it is possible to study linear algebra over finite fields (like the integers modulo a prime number) we will only consider two fields: the real numbers R, and the complex numbers C.

Exercise 1.1.5.

Before we move on, let’s look at one example involving finite fields. Let Zn={0,1,2,,n1}, with addition and multiplication defined modulo n. (For example, 3+5=1 in Z7, since 81(mod7).)

(a)

Show that Z5 is a field.
Hint.
You will need to recall properties of congruence from your introduction to proofs course.

(b)

Show that Z6 is not a field.

(c)

Why does this work for n=5 but not for n=6? For which n do you think Zn will be a field?
A vector space whose scalars are complex numbers will be called a complex vector space. While many students are initially intimidated by the complex numbers, most results in linear algebra work exactly the same over C as they do over R. And where the results differ, things are usually easier with complex numbers, owing in part to the fact that all complex polynomials can be completely factored.
To help us gain familiarity with the abstract nature of Definition 1.1.3, let us consider some basic examples.

Example 1.1.6.

The following are examples of vector spaces. We leave verification of axioms as an exercise. (Verification will follow a process very similar to the discussion at the beginning of this section.)
  1. The set Rn of n-tuples (x1,x2,,xn) of real numbers, where we define
    (x1,x2,,xn)+(y1,y2,,yn)=(x1+y1,x2+y2,,xn+yn)c(x1,x2,,xn)=(cx1,cx2,,cxn).
    We will also often use Rn to refer to the vector space of 1×n column matrices [x1x2xn], where addition and scalar multiplication are defined as for matrices (and the same as the above, with the only difference being the way in which we choose to write our vectors). If the distinction between n-tuples and column matrices is ever important, it will be made clear.
  2. The set R of all sequences of real numbers
    (xn)=(x0,x1,x2,).
    Addition and scalar multiplication are defined in the same way as Rn; the only difference is that elements of R contain infinitely many entries.
  3. The set Mmn(R) of m×n matrices, equipped with the usual matrix addition and scalar multiplication.
  4. The set Pn(R) of all polynomials
    p(x)=a0+a1x++anxn
    of degree less than or equal to n, where, for
    p(x)=a0+a1x++anxnq(x)=b0+b1x++bnxn
    we define
    p(x)+q(x)=(a0+b0)+(a1+b1)x++(an+bn)xn
    and
    cp(x)=ca0+(ca1)x++(can)xn.
    The zero vector is the polynomial 0=0+0x++0xn.
    This is the same as the addition and scalar multiplication we get for functions in general, using the “pointwise evaluation” definition: for polynomials p and q and a scalar c, we have (p+q)(x)=p(x)+q(x) and (cp)(x)=cp(x).
    Notice that although this feels like a very different example, the vector space Pn(R) is in fact very similar to Rn (or rather, Rn+1, to be precise). If we associate the polynomial a0+a1x++anxn with the vector a0,a1,,an, the addition and scalar multiplication for either space behaves in exactly the same way. We will make this observation precise in Section 2.3.
  5. The set P(R) of all polynomials of any degree. The algebra works the same as it does in Pn(R), but there is an important difference: in both Pn(R) and Rn, every element in the set can be generated by setting values for a finite collection of coefficients. (In Pn(R), every polynomial a0+a1x+=anxn can be obtained by choosing values for the n+1 coefficients a0,a1,an.) But if we remove the restriction on the degree of our polynomials, there is then no limit on the number of coefficients we might need. (Even if any individual polynomial has a finite number of coefficients!)
  6. The set F[a,b] of all functions f:[a,b]R, where we define (f+g)(x)=f(x)+g(x) and (cf)(x)=c(f(x)). The zero function is the function satisfying 0(x)=0 for all x[a,b], and the negative of a function f is given by (f)(x)=f(x) for all x[a,b].
    Note that while the vector space P(R) has an infinite nature that Pn(R) does not, the vector space F[a,b] is somehow more infinite! Using the language of Section 1.7, we can say that Pn(R) is finite dimensional, while P(R) and F[a,b] are infinite dimensional. In a more advanced course, one might make a further distinction: the dimension of P(R) is countably infinite, while the dimension of F[a,b] is uncountable.
Other common examples of vector spaces can be found online; for example, on Wikipedia 1 . It is also interesting to try to think of less common examples.

Exercises Exercises

1.

Can you think of a way to define a vector space structure on the set V=(0,) of all positive real numbers?
(a)
How should we define addition and scalar multiplication? Since the usual addition and scalar multiplication wont work, let’s denote addition by xy, for x,yV, and scalar multiplication by cx, for cR and xV.
Note: you can format any math in your answers using LaTeX, by putting a $ before and after the math. For example, xy is $x\oplus y$, and xy is $x\odot y$.
Hint.
Note that the function f(x)=ex has domain (,) and range (0,). What does it do to a sum? To a product?
(b)
Show that the addition you defined satisfies the commutative and associative properties.
Hint.
You can assume that these properties are true for real number multiplication.
(c)
    Which of the following is the identity element in V?
  • 0
  • Remember that the identity needs to be an element of the set!
  • 1
  • Correct! Since nothing happens when we multiply by 1, we get 1x=x for any xV.
Hint.
Remember that an identity element e must satisfy ex for any xV.
(d)
What is the inverse of an element xV?
Hint.
Remember that an inverse x must satisfy x(x)=e, where e is the identity element. What is e, and how is “addition” defined?
(e)
Show that, for any cR and x,yV,
c(xy)=cxcy.
(f)
Show that for any c,dR and xV,
(c+d)x=cxdx.
(g)
Show that for any c,dR and xV, c(dx)=(cd)x.
(h)
Show that 1x=x for any xV.

2.

    True or false: the set of all polynomials with real number coefficients and degree less than or equal to three is a vector space, using the usual polynomial addition and scalar multiplication.
  • True.

  • This is the vector space P3(R) from Example 1.1.6.
  • False.

  • This is the vector space P3(R) from Example 1.1.6.

3.

    True or false: the set of all polynomials with real number coefficients and degree greater than or equal to three, together with the zero polynomial, is a vector space, using the usual polynomial addition and scalar multiplication.
  • True.

  • The set is not closed under addition. What happens if you add the polynomials x3+x and x3+x?
  • False.

  • The set is not closed under addition. What happens if you add the polynomials x3+x and x3+x?
Hint.
Remember that a vector space must be closed under the operations of addition and scalar multiplication.

4.

    True or false: the set of all vectors v=a,b of unit length (that is, such that a2+b2=1) is a vector space with respect to the usual addition and scalar multiplication in R2.
  • True.

  • The zero vector does not have unit length. Also, the sum of two unit vectors will usually not be a unit vector.
  • False.

  • The zero vector does not have unit length. Also, the sum of two unit vectors will usually not be a unit vector.

5.

Let V=R. For u,vV and aR define vector addition by uv:=u+v+3 and scalar multiplication by au:=au+3a3. It can be shown that (V,,) is a vector space over the scalar field R. Find the following:
(a) The sum 56
(b) The scalar multiple 85
(c) The zero vector, 0V
(d) The additive inverse of x, x

6.

Let V=(5,). For u,vV and aR define vector addition by uv:=uv+5(u+v)+20 and scalar multiplication by au:=(u+5)a5. It can be shown that (V,,) is a vector space over the scalar field R. Find the following:
(a) The sum 11
(b) The scalar multiple 21
(c) The additive inverse of 1, 1
(d) The zero vector, 0V
(e) The additive inverse of x, x
You have attempted 1 of 18 activities on this page.