The Only Linear Algebra You Must Know
Linear algebra is often taught backwards.
A standard course starts with row reduction, determinants, matrix multiplication, eigenvalues, diagonalization, etc. These are all useful, but taught in that order, the subject can feel like a list of unrelated tricks.
The more useful framing is:
A matrix is not a grid of numbers. A matrix is a coordinate description of a linear transformation.
That sentence is most of linear algebra.
Everything else is bookkeeping.
The few facts worth remembering are stated explicitly below. The proofs are short because the point is not to make the subject feel more formal. The point is to make the conceptual dependencies visible.
Vectors are not columns of numbers
A vector is an element of a vector space.
That definition is intentionally plain. A vector space is just a place where linear combinations make sense.
The important operation is:
\[x_1v_1 + \cdots + x_nv_n\]Once objects can be added and scaled, linear algebra can begin.
The column of numbers is what appears after choosing a basis.
For example, the notation
\[v = \begin{bmatrix} 3 \\ 5 \end{bmatrix}\]usually means
\[v = 3e_1 + 5e_2\]where
\[e_1 = \begin{bmatrix} 1 \\ 0 \end{bmatrix}, \quad e_2 = \begin{bmatrix} 0 \\ 1 \end{bmatrix}\]But the important thing is that the standard basis has been chosen.
The vector is not inherently the pair of numbers (3, 5). The pair of numbers is the vector’s address in a particular coordinate system.
This sounds pedantic until the source of confusion becomes clear: many linear algebra mistakes come from forgetting this distinction.
Bases are coordinate systems
A basis is just a coordinate system for a vector space.
If (B = {e_1, \dots, e_n}) is a basis for a vector space (V), then every vector (v \in V) can be written uniquely as
\[v = x_1 e_1 + \cdots + x_n e_n\]So choosing (B) gives a coordinate map:
\[\Phi_B : V \to \mathbb{F}^n\]where
\[\Phi_B(v) = \begin{bmatrix} x_1 \\ \vdots \\ x_n \end{bmatrix}\]This map is doing something conceptually important. It takes an abstract vector and gives coordinates for it.
The vector space did not come with coordinates. The coordinates were added by choosing a basis.
Linear transformations are the actual objects
The real object in linear algebra is a linear transformation.
A function
\[T : V \to W\]is linear if
\[T(v + w) = T(v) + T(w)\]and
\[T(\lambda v) = \lambda T(v)\]That is all linearity means: the function respects the vector space structure.
The reason bases matter is that a linear transformation is completely determined by what it does to basis vectors.
This is the structural reason matrices exist.
What is a matrix?
Suppose:
- (V) has basis (B = {e_1, \dots, e_n})
- (W) has basis (C = {f_1, \dots, f_m})
- (T : V \to W) is linear
To describe (T), look at where it sends each basis vector:
\[T(e_1), \dots, T(e_n)\]Each of these lives in (W), so each one can be expressed using the basis (C).
For each (j):
\[T(e_j) = a_{1j}f_1 + a_{2j}f_2 + \cdots + a_{mj}f_m\]The coefficients become the (j)-th column of the matrix:
\[A = \begin{bmatrix} a_{11} & \cdots & a_{1n} \\ \vdots & & \vdots \\ a_{m1} & \cdots & a_{mn} \end{bmatrix}\]So the matrix is not the transformation itself.
It is the coordinate representation of the transformation, after choosing:
- coordinates for the input space
- coordinates for the output space
Equivalently, writing the coordinate maps explicitly, the matrix (A) satisfies:
\[\Phi_C(T(v)) = A \Phi_B(v)\]or as a commuting diagram:
\[\begin{array}{ccc} V & \xrightarrow{T} & W \\ \downarrow \Phi_B & & \downarrow \Phi_C \\ \mathbb{F}^n & \xrightarrow{A} & \mathbb{F}^m \end{array}\]This is the whole picture.
The top row is the real transformation.
The bottom row is what that transformation looks like in coordinates.
Matrix multiplication is function composition
Matrix multiplication also becomes less arbitrary from this view.
Suppose:
\[T : U \to V\]and
\[S : V \to W\]They can be composed:
\[S \circ T : U \to W\]If (T) has matrix ([T]), and (S) has matrix ([S]), then the composition has matrix:
\[[S \circ T] = [S][T]\]This is not an arbitrary multiplication rule.
It is just the coordinate version of doing one function and then the next.
That is why the order can seem backwards at first:
\[[S][T]\]means:
- do (T)
- then do (S)
Functions compose right-to-left, so matrices multiply right-to-left.
Why change of basis is annoying
Change of basis is another topic that becomes simpler once vectors are no longer treated as columns.
The same vector can have different coordinate columns under different bases.
The vector did not change.
The coordinate system changed.
So a change-of-basis matrix is not doing something to the vector itself. It is translating between descriptions of the same vector.
Likewise, two different matrices can represent the same linear transformation if they are written using different bases.
This is why statements like “this matrix has eigenvalues…” are a little imprecise. The main object is the linear operator, and the matrix is one coordinate representation of it.
Why diagonalization matters
Diagonalization is often taught as another algorithm:
- find eigenvalues
- find eigenvectors
- build (PDP^{-1})
But conceptually, diagonalization means:
Find a basis where the transformation acts independently on each coordinate direction.
If an operator is diagonal in some basis, then in that basis it just scales each basis vector:
\[T(e_i) = \lambda_i e_i\]That is why diagonal matrices are useful.
They are not useful because zeros are aesthetically pleasing. They are useful because the transformation has been expressed in coordinates where its behavior is decoupled.
The sentence to remember
Linear algebra is the study of linear transformations between vector spaces.
Matrices are coordinate descriptions of those transformations.
With that distinction in place, many standard facts become instances of the same idea:
- matrix multiplication is composition
- change of basis is changing coordinates
- invertible matrices are isomorphisms in coordinates
- the identity matrix is the identity transformation in coordinates
- diagonalization is finding coordinates where an operator is simple
The useful diagnostic question is:
What transformation does this matrix describe, and which bases are involved?
That question usually identifies the relevant object and the relevant coordinates.