Linear Algebra
A lot of content summarized from Mark Schmidt’s notes on Linear Algebra
# Notation
Vectors are treated as columns by default (column-major convention)
- Scalar (1,1): $\alpha$
- Column Vector (m, 1): $\begin{bmatrix}a_1 \\ a_2 \end{bmatrix}$
- Row Vector (1, n): $\begin{bmatrix}a_1 & a_2\end{bmatrix}$
- Matrix (m, n): $\begin{bmatrix}a_{1,1} & a_{1,2} \\ a_{2,1} & a_{2,2}\end{bmatrix}$
# Operations
# Transpose
$(A^T)_{ij} = A_{ji}$
A matrix is symmetric if $A = A^T$
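A quick sanity check (a minimal NumPy sketch; NumPy is an assumption here, not something the notes prescribe):

```python
import numpy as np

A = np.array([[1.0, 2.0],
              [3.0, 4.0]])

# Transpose swaps rows and columns: (A^T)[i, j] == A[j, i]
print(A.T)

# A + A^T is always symmetric, so it equals its own transpose
S = A + A.T
print(np.allclose(S, S.T))   # True
```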
# Vector Addition
Associative (brackets don’t matter) and commutative (order independent)
$$a + b = \begin{bmatrix}a_1 \\ a_2 \end{bmatrix} + \begin{bmatrix}b_1 \\ b_2 \end{bmatrix} = \begin{bmatrix}a_1 + b_1 \\ a_2 + b_2 \end{bmatrix}$$
# Scalar Multiplication
Associative (brackets don’t matter) and commutative (order independent)
$$\alpha b = \alpha \begin{bmatrix}b_1 \\ b_2\end{bmatrix} = \begin{bmatrix}\alpha b_1 \\ \alpha b_2\end{bmatrix}$$
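Both operations act elementwise, which is easy to see numerically (another small NumPy sketch):

```python
import numpy as np

a = np.array([1.0, 2.0])
b = np.array([3.0, 4.0])
alpha = 2.5

# Vector addition is elementwise
print(a + b)        # [4. 6.]

# Scalar multiplication scales every component
print(alpha * b)    # [ 7.5 10. ]
```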
# Inner Product
Between two vectors of the same length: multiply corresponding elements and sum them, giving a scalar result
$$a^Tb = \sum_{i = 1}^{n}a_ib_i = \gamma$$
A specific case of this is the dot product, which can be written as $a \cdot b = a^Tb$
- Commutative: $a^Tb = b^Ta$
- Distributive across addition: $a^T(b+c) = a^Tb + a^Tc$
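A small NumPy sketch of the inner product and the two properties above:

```python
import numpy as np

a = np.array([1.0, 2.0, 3.0])
b = np.array([4.0, 5.0, 6.0])
c = np.array([1.0, 1.0, 1.0])

# Inner (dot) product: sum of elementwise products, a scalar
print(a @ b)                 # 32.0
print(np.sum(a * b))         # same value

# Commutative and distributive over addition
print(np.isclose(a @ b, b @ a))                  # True
print(np.isclose(a @ (b + c), a @ b + a @ c))    # True
```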
# Outer Product
Between two vectors (not necessarily of the same length), create a matrix from every pairwise product of their elements.
Given two vectors $u = \begin{bmatrix}u_1 \\ u_2 \\ \vdots \\ u_m \end{bmatrix}$ and $v = \begin{bmatrix}v_1 \\ v_2 \\ \vdots \\ v_n \end{bmatrix}$,
$$u \otimes v = A = \begin{bmatrix}u_1v_1 & u_1v_2 & \dots & u_1v_n\\ u_2v_1 & u_2v_2 & \dots & u_2v_n \\ \vdots & \vdots & \ddots & \vdots \\ u_mv_1 & u_mv_2 & \dots & u_mv_n \end{bmatrix}$$
The resulting matrix $A$ always has rank 1 (as long as both vectors are nonzero).
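For example (a NumPy sketch; note the two vectors can have different lengths):

```python
import numpy as np

u = np.array([1.0, 2.0, 3.0])    # length m = 3
v = np.array([4.0, 5.0])         # length n = 2

A = np.outer(u, v)               # (3, 2) matrix with entries u_i * v_j
print(A)
print(np.linalg.matrix_rank(A))  # 1, since every column is a multiple of u
```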
# Multiplication
In general, we can multiply matrices A and B when the number of columns in A matches the number of rows in B
If A is (m, n) and B is (n, p), then AB is (m, p)
- Associative: $A(BC) = (AB)C$
- Distributive across addition: $A(B + C) = AB + AC$
- Generally not commutative: $AB \neq BA$
- Transposing reverses order: $(AB)^T = B^TA^T$
- Powers don't reorder factors: $(AB)^2 = ABAB$, which is generally not $A^2B^2$
- $x^TAy$ is a scalar, so it equals its own transpose: $x^TAy = x^T(Ay) = (Ay)^Tx = y^TA^Tx$
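These properties can be spot-checked numerically (a NumPy sketch with random square matrices):

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((3, 3))
B = rng.standard_normal((3, 3))
C = rng.standard_normal((3, 3))

print(np.allclose(A @ (B @ C), (A @ B) @ C))    # associative: True
print(np.allclose(A @ (B + C), A @ B + A @ C))  # distributive: True
print(np.allclose(A @ B, B @ A))                # generally False: not commutative
print(np.allclose((A @ B).T, B.T @ A.T))        # transpose reverses order: True

# x^T A y is a scalar, so it equals its own transpose y^T A^T x
x = rng.standard_normal(3)
y = rng.standard_normal(3)
print(np.isclose(x @ A @ y, y @ A.T @ x))       # True
```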
# Properties
# Vector Norm
A scalar measure of a vector’s length
- $\Vert x \Vert \geq 0$
- $\Vert x \Vert_2^2 = x^Tx$
- Euclidean Norm (L2-Norm): $\Vert x \Vert_2 = \sqrt{\sum_{i=1}^n x_i^2}$
    - Also note that $\lVert x \rVert^2 = \lVert x \rVert_2^2 = x^Tx = \langle x,x \rangle$
- Manhattan Distance (L1-Norm): $\Vert x \Vert_1 = \sum_{i=1}^n |x_i|$
    - How many ‘blocks’ you need to traverse
- L$\infty$-Norm: $\Vert x \Vert_\infty = \max_i |x_i|$
    - The most blocks you have to walk in any single dimension
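All three norms are available through `np.linalg.norm` (a small sketch):

```python
import numpy as np

x = np.array([3.0, -4.0])

print(np.linalg.norm(x, 2))        # Euclidean / L2 norm: 5.0
print(np.linalg.norm(x, 1))        # Manhattan / L1 norm: 7.0
print(np.linalg.norm(x, np.inf))   # L-infinity norm: 4.0

# ||x||_2^2 == x^T x
print(np.isclose(np.linalg.norm(x, 2) ** 2, x @ x))   # True
```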
# Rank
- The dimension of the vector space generated (or spanned) by its columns.
- This corresponds to the number of linearly independent columns of A.
- A minimal set of vectors that spans a space is called a basis
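For example, a matrix whose third column is the sum of the first two has only two independent columns (NumPy sketch):

```python
import numpy as np

A = np.array([[1.0, 0.0, 1.0],
              [0.0, 1.0, 1.0],
              [1.0, 1.0, 2.0]])   # col3 = col1 + col2

print(np.linalg.matrix_rank(A))   # 2
```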
# Orthogonal
If for a set of vectors $\{q_i\}$:
- $q_i^Tq_j = 0$ for $i \neq j$, we call $q_i$ and $q_j$ orthogonal
- additionally $q_i^Tq_i = 1$ for each $i$ (unit length), we call the set orthonormal
A square matrix $Q$ with orthonormal columns is called an orthogonal matrix and satisfies $Q^TQ = I = QQ^T$
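A rotation matrix is a familiar example of an orthogonal matrix (NumPy sketch):

```python
import numpy as np

theta = np.pi / 3
Q = np.array([[np.cos(theta), -np.sin(theta)],
              [np.sin(theta),  np.cos(theta)]])   # columns are orthonormal

print(np.allclose(Q.T @ Q, np.eye(2)))   # True
print(np.allclose(Q @ Q.T, np.eye(2)))   # True
```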
# Linear Dependence
A vector is linearly dependent on a set of vectors if it can be written as a linear combination of them
$$c = \alpha_1 b_1 + \alpha_2 b_2 + \dots + \alpha_n b_n$$
A set of vectors is linearly dependent if and only if the zero vector can be written as a non-trivial linear combination of the vectors (not all coefficients zero).
A matrix with fully independent columns has full column rank. If this is the case, $Ax = 0$ implies that $x = 0$
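A quick check of full column rank (NumPy sketch): if the rank equals the number of columns, the only solution of $Ax = 0$ is $x = 0$.

```python
import numpy as np

A = np.array([[1.0, 2.0],
              [0.0, 1.0],
              [1.0, 0.0]])

# Full column rank: rank equals the number of columns
print(np.linalg.matrix_rank(A) == A.shape[1])   # True
```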
# Special Matrices
# Identity Matrix
1’s on the diagonal and 0’s otherwise. $I_n$ denotes an (n,n) identity matrix.
Multiplication by the identity matrix yields the original matrix. Columns of the identity matrix are called elementary vectors.
# Diagonal Matrix
$$D = \begin{bmatrix}d_1 & 0 & 0 \\ 0 & d_2 & 0 \\ 0 & 0 & d_3 \end{bmatrix}$$
# Spaces
# Range (Column-space)
Subspace spanned by the columns of a matrix.
A system $Ax=b$ is solvable if and only if $b$ is in $A$'s column-space
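One way to test this numerically is to compare ranks: appending $b$ to $A$ does not increase the rank exactly when $b$ already lies in the column-space. The sketch below uses a hypothetical helper `consistent` for illustration:

```python
import numpy as np

A = np.array([[1.0, 2.0],
              [2.0, 4.0]])        # rank 1: the columns are dependent
b_in = np.array([3.0, 6.0])       # lies in the column-space
b_out = np.array([3.0, 7.0])      # does not

def consistent(A, b):
    # Ax = b is solvable iff [A | b] has the same rank as A
    augmented = np.column_stack([A, b])
    return np.linalg.matrix_rank(augmented) == np.linalg.matrix_rank(A)

print(consistent(A, b_in))    # True
print(consistent(A, b_out))   # False
```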
# Subspace
A non-empty subset of a vector space that is closed under addition and scalar multiplication
Possible subspaces of $\mathbb{R}^3$:
- The zero vector $\{0\}$
- Any line or plane through the origin
- All of $\mathbb{R}^3$
We say that the vectors generate or span the subspace when you can reach any point in the subspace through a linear combination of the vectors.
# Matrices as transformation
Viewing matrix-vector multiplication as a linear transformation: $T(x) = Ax$
A linear transformation can’t move the origin. But, if there are linearly dependent columns, there are non-zero vectors that can be transformed to zero. The set of vectors that can be transformed to 0 is called the null-space.
Null space: $\mathcal N (A)$ is all $x$ such that $Ax = 0$
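One way to compute a basis for the null-space is via the SVD: right singular vectors whose singular values are (numerically) zero span $\mathcal N(A)$. A NumPy sketch:

```python
import numpy as np

A = np.array([[1.0, 2.0],
              [2.0, 4.0]])    # second column is 2x the first

_, s, Vt = np.linalg.svd(A)
null_basis = Vt[s < 1e-10].T  # columns span the null-space

print(null_basis)
print(np.allclose(A @ null_basis, 0))   # True: these vectors map to zero
```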
# Fundamental Theorem of Linear Algebra
- The rank $r$ is the dimension of the column-space, which equals the dimension of the row-space
- The null-space is orthogonal to the row-space
# Inverses
The inverse exists if and only if $A$ is square and its null-space contains only the zero vector (otherwise the transformation either loses information to the null-space or can't reach all vectors)
If the inverse exists, it is a unique matrix such that $A^{-1}A = I = AA^{-1}$
Some identities
- $(A^{-1})^T = (A^T)^{-1}$
- For a nonzero scalar $\gamma$: $(\gamma A)^{-1} = \gamma^{-1} A^{-1}$
- Assuming both $A^{-1}$ and $B^{-1}$ exist, $(AB)^{-1} = B^{-1}A^{-1}$
Special inverses of diagonal matrices
$$D=\begin{bmatrix}d_1 & 0 & 0 \\ 0 & d_2 & 0 \\ 0 & 0 & d_3 \end{bmatrix}$$
$$D^{-1}=\begin{bmatrix}1/d_1 & 0 & 0 \\ 0 & 1/d_2 & 0 \\ 0 & 0 & 1/d_3 \end{bmatrix}$$
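A NumPy sketch of these identities, using `np.linalg.inv`:

```python
import numpy as np

D = np.diag([2.0, 4.0, 5.0])
print(np.linalg.inv(D))                           # diag(1/2, 1/4, 1/5)

A = np.array([[2.0, -1.0],
              [1.0,  1.0]])
A_inv = np.linalg.inv(A)
print(np.allclose(A @ A_inv, np.eye(2)))          # A A^{-1} = I
print(np.allclose(np.linalg.inv(A.T), A_inv.T))   # (A^T)^{-1} = (A^{-1})^T
```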
# Solving Linear Equations
Given A and b, we want to solve for x in $Ax = b$
Say, $\begin{bmatrix}2 & -1 \\ 1 & 1\end{bmatrix}\begin{bmatrix}x \\ y\end{bmatrix} = \begin{bmatrix}1 \\ 5\end{bmatrix}$.
We can interpret this multiple ways:
- By rows: the solution is the intersection of the hyperplanes (lines) $2x-y=1$ and $x+y=5$
- By columns: the solution gives the combination of $A$'s columns that produces the right-hand side: $x\begin{bmatrix}2 \\ 1\end{bmatrix} + y \begin{bmatrix}-1 \\ 1\end{bmatrix} = \begin{bmatrix}1 \\ 5 \end{bmatrix}$
- Transformation
$Ax=b$ has a solution exactly when $b$ is in the column-space of $A$. The solution is unique if, in addition, the columns of $A$ are linearly independent.
If $Ax=b$ has a solution, we say the system is consistent.
When $A$ is invertible, the solution is $x = A^{-1}b$
In practice, we solve such systems using Gaussian elimination rather than by explicitly forming $A^{-1}$.
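For the 2x2 example above, a NumPy sketch (`np.linalg.solve` factorizes $A$ rather than explicitly inverting it):

```python
import numpy as np

A = np.array([[2.0, -1.0],
              [1.0,  1.0]])
b = np.array([1.0, 5.0])

x = np.linalg.solve(A, b)     # LU factorization (Gaussian elimination with pivoting)
print(x)                      # [2. 3.]
print(np.allclose(A @ x, b))  # True
```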