Notes

Notes, etc. of Samuel Flint

Data Structures & Algorithms

Lecture 1: Algorithms and Analysis

Pseudocode

  • Selection Sort

  • Input: A collection $A = \{a_1, \ldots, a_n\}$

  • Output: $A$, Sorted

  • For ($i = 1\ldots n-1$)

    • min $\gets i$

    • For ($j = i+1 \ldots n$)

      • if ($a_{min} > a_j$)

        • min $\gets j$

    • swap $a_{min}$, $a_i$

  • output $A$
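
  • A minimal runnable sketch of the above in Java (the method name selectionSort is illustrative); tracking the index of the minimum, rather than its value, makes the swap work in place:

      static void selectionSort(int[] a) {
          // for each position, select the smallest remaining element
          for (int i = 0; i < a.length - 1; i++) {
              int min = i;  // index of the smallest element seen so far
              for (int j = i + 1; j < a.length; j++) {
                  if (a[j] < a[min]) {
                      min = j;
                  }
              }
              int tmp = a[i]; a[i] = a[min]; a[min] = tmp;  // swap into place
          }
      }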

Analysis

  • Identify the input

  • Identify the input size

    • collection size

    • tree/graph number of nodes/edges

  • Identify the elementary operation(s)

    • most common or most expensive

    • loop-control comparisons, assignments, and increments are ignored

  • Analyze elementary op executed with regard to the input size

  • Provide an asymptotic characterization of the count from the previous step

Example

  • Collection A

  • Size or cardinality of $A$, $n$

  • Comparison on line 4

  • $\sum_{i = 1}^{n-1} (n - i) = \sum_{i = 1}^{n - 1} n - \sum_{i = 1}^{n - 1} i = n(n - 1) - \frac{n(n - 1)}{2} = \frac{n(n-1)}{2}$

  • $O(n^2)$, $\Theta(n^2)$ (Prefer $\Theta$)

Brute Force

  • Given: a set of $n$ points, $P = \{(x_1, y_1), (x_2, y_2), \ldots, (x_n, y_n)\}$

  • Find: the two points $p$ and $q$ that are closest to each other with regard to $L_2$ norm ($\sqrt{(x_1-x_2)^2 + (y_1 - y_2)^2}$)

Looping Method

  • $minDist \gets \infty$

  • For ($i = 1 \ldots n-1$)

    • For($j = i+1 \ldots n$)

      • If (distance($P_i$, $P_j$) $< minDist$)

        • $p \gets P_i$

        • $q \gets P_j$

        • $minDist \gets distance(P_i, P_j)$

  • output p, q

Simpler

  • $minDist \gets \infty$

  • For Each Pair of Points $p = (x, y)$, $q = (x^\prime, y^\prime)$

    • $d \gets$ distance($p$, $q$)

    • if ($d < minDist$)

      • $minDist \gets d$

  • Finish

Analysis
  • $\binom{n}{2} = \frac{n!}{(n - 2)!2!} = \frac{n(n-1)}{2} = \Theta(n^2)$
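
  • A brute-force sketch in Java (the closestPair name and the array-of-rows representation are assumptions, not from lecture):

      static double closestPair(double[][] p) {
          double minDist = Double.POSITIVE_INFINITY;
          // examine all n(n-1)/2 pairs
          for (int i = 0; i < p.length - 1; i++) {
              for (int j = i + 1; j < p.length; j++) {
                  double dx = p[i][0] - p[j][0];
                  double dy = p[i][1] - p[j][1];
                  double d = Math.sqrt(dx * dx + dy * dy);
                  if (d < minDist) {
                      minDist = d;
                  }
              }
          }
          return minDist;
      }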

Common Big Os

  • Constant

  • Logarithmic

  • Linear

  • Iterated Linear

  • Quadratic

  • Polynomial

  • $2^n$

  • Factorial

Lecture 2: Brute Force Approach

  • Examining all possibilities

  • optimization problems: find the best solution

  • decision problems: find a solution, or determine none exists

  • generally infeasible or inefficient

  • may involve:

    • generating combinatorial objects

    • backtracking

  • Ex: What percentage of the $52!$ possible 52-card shuffles has ever been generated by humans?

  • Ex: Compute $a^n \mod m$ for a very large $n$: Dumbly

      // naive modular exponentiation: n multiplications, one per iteration
      int a, n, m;        // inputs, assumed already read
      int result = 1;

      for (int i = 1; i <= n; i++) {
          result = (result * a) % m;
      }
  • Pairs: 2 for loops

  • triples: 3 for loops

  • for each subset of k elements: write k for loops where k is a variable

  • Given a set of n elements $A = \{a_1, a_2, \ldots, a_n\}$ and an integer $k$, $1 \leq k \leq n$, output all subsets of $A$ with cardinality $k$.

    • $\binom{n}{k} \in O(n^k)$

  • It suffices to consider combinations of integers $1, 2, \ldots, n$.

  • Algorithm:

    • Start with $a_1, a_2, \ldots, a_k$

    • Assume we have $a_1, a_2, \ldots, a_k$, we want to generate the next combination

    • Locate the last element $a_i$ in the current combination such that $a_i \neq n - k + i$

    • Replace $a_i$ with $a_i + 1$, $a_j$ with $a_i + j - i$ for $j = i+1, i+2, \ldots k$

  • Ex: $k = 3$ on $\{1, 2, 3, 4, 5\}$

    • $\{1, 4, 5\}$

    • $\{2, 3, 4\}$

    • $\{2, 3, 5\}$

    • $\{2, 4, 5\}$
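
  • A Java sketch of the successor rule above; a[] holds a $k$-combination of $1 \ldots n$ in increasing order, and false means no successor exists:

      static boolean nextCombination(int[] a, int n) {
          int k = a.length;
          int i = k - 1;
          // locate the last position not yet at its maximum value n - k + i
          while (i >= 0 && a[i] == n - k + i + 1) i--;
          if (i < 0) return false;       // a[] was the last combination
          a[i]++;
          for (int j = i + 1; j < k; j++) a[j] = a[i] + j - i;
          return true;
      }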

  • Goal: generate all permutations of a set of size $n$, $A = \{a_1, a_2, \ldots, a_n\}$

    • Steinhaus-Johnson-Trotter Algorithm

  • Generate all $n!$ permutations in lexicographic order

    • Start with $1, 2, 3, \ldots, n$

    • Given $a_1, a_2, \ldots, a_n$

    • Find the rightmost pair $a_i, a_{i+1}$ such that $a_i < a_{i+1}$

    • Find the smallest element to the right of this pair that is larger than $a_i$, $a^{\prime}$

    • Swap $a_i$ and $a^{\prime}$, then sort all elements to the right of position $i$

  • ex: $n = 6$

    • $\{1, 6, 3, 5, 4, 2\}$

    • $\{1, 6, 4, 2, 3, 5\}$

    • $\{1, 6, 4, 2, 5, 3\}$

    • $\{1, 6, 4, 3, 2, 5\}$
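
  • A Java sketch of the successor step; since the suffix after the swap position is non-increasing, reversing it is equivalent to sorting it:

      static boolean nextPermutation(int[] a) {
          int i = a.length - 2;
          while (i >= 0 && a[i] >= a[i + 1]) i--;   // rightmost ascent a_i < a_{i+1}
          if (i < 0) return false;                  // already the last permutation
          int j = a.length - 1;
          while (a[j] <= a[i]) j--;                 // smallest element right of i larger than a_i
          int t = a[i]; a[i] = a[j]; a[j] = t;      // swap
          for (int l = i + 1, r = a.length - 1; l < r; l++, r--) {
              t = a[l]; a[l] = a[r]; a[r] = t;      // reverse (sort) the suffix
          }
          return true;
      }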

  • Ordered combinations ($n$ distinct elements), and a length $k$: generate all permutations of length $k$, $P(n, k)$

    • Generate all subsets of size k, for each, generate all permutations of it

  • Permutations with repetition: given $n$ distinct elements, generate all ordered sequences of length $k$, where each element can appear multiple times. $n^k$

    • Count from 0 to $n^k - 1$, converting to base $n$, map to the set.

Satisfiability (SAT)

  • Given a predicate $P(x_1, x_2, \ldots x_n)$ on $n$ boolean variables $x_1, x_2, \ldots x_n$

  • Determine if there exists an assignment of $x_1, x_2, \ldots, x_n$ that $P$ evaluates to true

  • (x1 or x2 or x3) and (x1 or not x2) and (x2 and not x3) and (x3 and not x1) and (not x1 or not x2 or not x3)

    • by brute force: No. All $2^n$ possibilities were checked

  • Possible solutions:

    • First:

      • Input: A predicate $P(x_1,\ldots, x_n)$

      • Output: True if satisfiable, false otherwise

      • For each Truth assignment T

        • if $\mathrm{apply}(P, T)$

          • output True

      • Output False

      • $O(2^n)$

    • Second: Backtracking, pruning, think trees

Lecture 3: Knapsack, Hamiltonian Path and Backtracking

  • Backtracking requires recursion

  • SAT traversal, w/ backtracking: SAT(P, $\overrightarrow{t}$)

    • Input: A Predicate $P(x_1, \ldots, x_n)$, a partial truth assignment $\overrightarrow{t} = t_1, \ldots, t_k$

    • Output: True if P is satisfiable, false otherwise

    • If $k = n$

      • Return $P(\overrightarrow{t})$

    • Else

      • $t_{k + 1} \gets 0$

      • If SAT(P, $\overrightarrow{t}$)

        • return True

      • Else

        • $t_{k + 1} \gets 1$

        • return SAT(P, $\overrightarrow{t}$)
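
  • A Java sketch of this traversal, with the predicate passed as a Predicate<boolean[]> (an assumption for illustration); like the pseudocode, it does not prune:

      import java.util.function.Predicate;

      // t[0..k-1] is the partial assignment; try both values for t[k]
      static boolean sat(Predicate<boolean[]> p, boolean[] t, int k) {
          if (k == t.length) {
              return p.test(t);       // full assignment: evaluate P
          }
          t[k] = false;
          if (sat(p, t, k + 1)) {
              return true;
          }
          t[k] = true;
          return sat(p, t, k + 1);
      }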

Hamiltonian Path/Cycle

  • given an undirected graph $G = (V, E)$

  • output true if G has a Hamiltonian path, false otherwise

  • A Hamiltonian Path is a path in $G$ that visits every vertex exactly once.

  • A Hamiltonian Cycle is a path in $G$ that visits every vertex exactly once, save for the first and last, which must be the same.

  • Given a a graph $G = (V, E)$, and a vertex $v \in V$, the neighborhood of $v$ is $N(v) = \{x | (v, x) \in E\}$

  • Naive approach: Go by permutations

  • Smart approach: Go by neighborhood

    • Given a Graph and a partial path:

    • Return true if length of path $p = n - 1$, false otherwise

  • Algorithm (HamWalk(G, $p = v_1, \ldots, v_k$)):

    • Input $G = (V, E)$ and partial path $p$

    • Return true if $G$ contains a hamiltonian path, false otherwise

    • If $k = n$, return True

    • for each $v \in N(v_k)$

      • if ($v \not\in p$)

        • if HamWalk(G, $p + v$)

          • return True

    • return false

0-1 Knapsack Problem

  • Given a collection of items $A = \{a_1, \ldots, a_n\}$ a Weight Function $wt: A \to \mathbb{R}^+$ $w(a_i)$, a value function $val: A \to \mathbb{R}^+$ $v(a_i)$ and a capacity $W$

  • Output $S \subseteq A$ such that $\sum_{s \in S} w(s) \leq W$ but $\sum_{s \in S} v(s)$ is maximized

  • Ex:

    item  value  weight
     1      15      1
     2      10      5
     3       9      3
     4       5      4
    • W = 8

    • Solutions

      • 1, 2, 3

      • 1, 3, 4

  • For each subset of $A$

    • if the subset's total weight is at most $W$, and its total value is greater than the best value so far

      • update solution

      • update best value

  • Optimally:

    • Use backtracking: prune a partial solution whose weight exceeds $W$, skipping all of its children

  • Algorithm

    • Input: Knapsack $A = \{a_1, \ldots, a_n\}$, weight function $w(a_i)$, value function $v(a_i)$, max weight $W$, and a partial solution $S \subseteq A$ containing only elements with index at most $j$

    • Output: A solution $S^{\prime} \subseteq A$ at least as good as $S$.

    • If $j = n$:

      • Return $S$

    • $S_{best} \gets S$

    • For $k = j+1 \ldots n$

      • $S^{\prime} \gets S \cup \{a_k\}$

      • If $w(S^{\prime}) \leq W$:

        • $T \gets$ Knapsack($A$, $w$, $v$, $W$, $S^{\prime}$, $k$)

        • If $v(T) > v(S_{best})$

          • $S_{best} \gets T$

    • Return $S_{best}$

Lecture 4: Divide and Conquer: repeated squaring, Karatsuba multiplication

Traveling Salesman Problem

  • Given: A collection of cities $C = \{c_1, \ldots, c_n\}$

  • Output an ordering of the cities $\pi(C) = (c_1^{\prime}, \ldots, c_n^{\prime})$

    • such that the total distance travelled $(\sum_{i = 1}^{n - 1} (\delta(c_i^{\prime}, c_{i+1}^{\prime}))) + \delta(c_1^{\prime}, c_n^{\prime})$ is minimized

    • This is over permutations of the cities on a complete graph with weighted edges; every permutation is therefore a Hamiltonian cycle

  • an optimization problem: backtracking is applicable but not the most appropriate here; doing better requires being smart

  • Question: Ham Path or TSP: Which is harder or more complex? Which requires more resources?

    • They're equivalent as they're both NP-complete

Convex Hull

  • Given a field of points, find a convex polygon with the fewest sides that encloses all of the points

  • Given a set of $n$ points, compute a convex hull that contains them

    • brute force is $O(n^3)$

    • the smart way is based on presorting, $O(n \log n)$ (dominated by the sort)

Assignment Problem

  • Given a collection of tasks $T$ and persons $P$ and an $n \times n$ cost matrix $C$ such that $C_{i, j}$ is the cost of assigning person $p_i$ to task $t_j$.

  • Output an assignment (a bijection) $f: P \to T$ such that the $\sum_{p_i \in P} c(i, f(p_i))$ is minimized.

  • Naive approach is $O(n!)$

  • Better Solutions

    • Hungarian Algorithm – Polynomial Time

    • Solving a Linear Program – Much faster, simpler

Integer Mod

  • Given integer $a$, $n$, $m$

  • Compute $a^n \mod{m}$

  • Naively: Perform $n$ multiplications (input size $k = \log n$, elementary op multiplication, performed $n = 2^k$ times, so $O(2^k)$: exponential in the input size)

  • Ex: $a^{77} = a^{64} \cdot a^{8} \cdot a^{4} \cdot a$

    • Repeated Squaring, Binary Exponentiation

  • Algo: $a, m, n = (b_k \ldots b_0)_2$

    • $term \gets a$

    • $prod \gets 1$

    • if $b_0 = 1$

      • $prod \gets a$

    • for $i = 1 \ldots k$

      • $term \gets term^2 \mod{m}$

      • if $b_i = 1$

        • $prod \gets (prod \cdot term) \mod{m}$

    • output $prod$

    • $O(\log{n})$ multiplications, which is linear in the number of bits: $O(k)$
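
  • A Java sketch of repeated squaring, scanning the bits of $n$ right to left (assumes $m$ is small enough that $(m - 1)^2$ fits in a long):

      static long powMod(long a, long n, long m) {
          long term = a % m;
          long prod = 1;
          while (n > 0) {
              if ((n & 1) == 1) prod = (prod * term) % m;  // bit b_i is 1
              term = (term * term) % m;                    // square for the next bit
              n >>= 1;
          }
          return prod;
      }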

Karatsuba Multiplication

  • ad + bc = (a + b)(c + d) - ac - bd

  • Given 2 $n$ bit numbers, $a, b$

    • split into 4 numbers

    • $a = a_1 \cdot 10^{\frac{n}{2}} + a_0$

    • $b = b_1 \cdot 10^{\frac{n}{2}} + b_0$

    • $ab = a_1 b_1 10^n + (a_1 b_0 + a_0 b_1)10^{\frac{n}{2}} + a_0 b_0 = a_1 b_1 10^n + \left[(a_1 + a_0)(b_1 + b_0) - a_1 b_1 - a_0 b_0\right] 10^{\frac{n}{2}} + a_0 b_0$

  • Do this recursively

  • Is $O(n^{\log_2 3})$
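
  • A recursive sketch on java.math.BigInteger, splitting in base 2 rather than base 10 (the recurrence is the same; the 32-bit cutoff is an arbitrary choice, and inputs are assumed non-negative):

      import java.math.BigInteger;

      static BigInteger karatsuba(BigInteger x, BigInteger y) {
          int n = Math.max(x.bitLength(), y.bitLength());
          if (n <= 32) return x.multiply(y);              // small base case
          int half = n / 2;
          BigInteger x1 = x.shiftRight(half);             // high half
          BigInteger x0 = x.subtract(x1.shiftLeft(half)); // low half
          BigInteger y1 = y.shiftRight(half);
          BigInteger y0 = y.subtract(y1.shiftLeft(half));
          BigInteger p1 = karatsuba(x1, y1);
          BigInteger p0 = karatsuba(x0, y0);
          // (x1 + x0)(y1 + y0) - p1 - p0 recovers the cross terms
          BigInteger mid = karatsuba(x1.add(x0), y1.add(y0)).subtract(p1).subtract(p0);
          return p1.shiftLeft(2 * half).add(mid.shiftLeft(half)).add(p0);
      }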

Lecture 5: Karatsuba and Strassen Matrix Multiplication

  • Karatsuba alternatives

    • Toom-Cook $O(n^{1.465})$

    • Fast Fourier Transform (Schönhage-Strassen) $O(n\log n \log \log n)$

    • Fürer $O(n \log n \cdot 2^{O(\log^* n)})$

  • $\log^* 64 = 4$

Matrix Multiplication

  • $A \cdot B = C$

    • $c_{ij} = \sum_{k = 1}^{n} a_{ik}\cdot b_{kj}$

    • Traditional Method is $O(n^3)$ for multiplications, $O(n^3)$ for additions

    • Input: Matrix, size $n^2 = N$, then $O(N^{1.5})$

Strassen's Matrix Multiplication Algorithm

  • For $2 \times 2$:

    • $C_{00} = M_1 + M_4 - M_5 + M_7$, $C_{01} = M_3 + M_5$, $C_{10} = M_2 + M_4$, $C_{11} = M_1 - M_2 + M_3 + M_6$

    • $M_1 = (A_{00} + A_{11})(B_{00} + B_{11})$

    • $M_2 = (A_{10} + A_{11})B_{00}$

    • $M_3 = A_{00}(B_{01} - B_{11})$

    • $M_4 = A_{11}(B_{10} - B_{00})$

    • $M_5 = (A_{00} + A_{01})B_{11}$

    • $M_6 = (A_{10} - A_{00})(B_{00} + B_{01})$

    • $M_7 = (A_{01} - A_{11})(B_{10} + B_{11})$

    • 7 multiplications, 18 additions/subtractions for Strassen's

    • 8 mults, 4 adds for traditional

  • For an $n \times n$ matrix, split into quadrants $A_{00}, A_{01}, A_{10}, A_{11}$, each $\frac{n}{2} \times \frac{n}{2}$, and the same for $B$

    • Use the previous identities recursively

  • $M(n)$ is the number of scalar multiplications by strassen on 2 matrices of $n \times n$

    • $= 7M(\frac{n}{2})$

    • $M(2) = 7$

  • By Master Theorem:

    • $a = 7, b = 2, d = 0$

    • $7 > 2^0$

    • $\Theta(n^{\log_2 7})$

  • Alternatives:

    • Coppersmith-Winograd: $O(n^{2.375477})$

    • Williams: $O(n^{2.3727})$ (2012)

  • Algorithm Details

    • design and implement splitting or your functions should handle arbitrary indices

    • matrix add and subtract

    • matrix combine

    • $M_1, \ldots, M_7$ functions

    • pad-with-zeros function, for when $n$ is not a power of 2 and unpad

Lecture 6: Closest Pair, Convex Hull and Presorting & Gaussian Elimination and Applications

Closest Pairs, Revisited

  • Divide and Conquer

    • Divide the points into 2 sets along the $x$ axis

    • Conquer by recursively finding the two closest points in each partition

    • Combine by considering combinations that cross the split

    • $D(n)$ the number of distance calculations for divide and conquer

      • $2D\left(\frac{n}{2}\right) + \frac{k}{2}n$

      • By Master Theorem $a = 2$, $b = 2$, $d = 1$

      • $a = b^d$, thus $\mathcal{O}(n\log n)$

  • Use sorting to help reduce work

Gaussian Elimination

  • $3x + 2y = 35$

  • $x + 3y = 27$

  • $a_{11} x_1 + a_{12} x_2 + \cdots + a_{1n} x_n = b_1$

  • $a_{n1} x_1 + a_{n2} x_2 + \cdots a_{nn} x_n = b_n$

  • Provides an $n \times n$ matrix, and with $\overrightarrow{b}$ added on the rhs, you get an $n \times (n + 1)$ augmented matrix

  • Goal: transform the augmented matrix into an upper triangular matrix

  • Tools: we can perform any elementary row operation (any op that does not change the solution set)

    • row multiply by any non-zero constant

    • reorder rows

    • add any constant multiple of one row to another

  • This can be done with Gauss-Jordan elimination

  • Gaussian Elimination Algorithm

    • Input $n \times n$ matrix $A$, $n \times 1$ solution vector $\overrightarrow{b}$

    • $A^{\prime} \gets \left[ A | \overrightarrow{b} \right]$

    • For $i = 1 \ldots n - 1$

      • $\mathrm{pivotrow} \gets i$

      • for $j = i + 1 \ldots n$

        • if $|A^{\prime}_{ji}| > |A^{\prime}_{\mathrm{pivotrow},i}|$

          • $\mathrm{pivotrow} \gets j$

      • for $k = i \ldots n + 1$

        • swap $A^{\prime}_{ik}$, $A^{\prime}_{\mathrm{pivotrow},k}$

      • for $j = i + 1 \ldots n$

        • $t \gets \frac{A^{\prime}_{ji}}{A^{\prime}_{i,i}}$

        • for $k = i \ldots n + 1$

          • $A^{\prime}_{jk} \gets A^{\prime}_{jk} - tA^{\prime}_{ik}$

    • return $A^{\prime}$

  • Most common op: the multiplication in the for $i$, for $j$, for $k$ loops

    • $\sum_{i = 1}^{n - 1} \sum_{j = i + 1}^{n} \sum_{k = i}^{n + 1} 1 = \sum_{i = 1}^{n - 1} (n - i)(n - i + 2) = \frac{n(n-1)(2n + 5)}{6}$

    • Therefore $\mathcal{O}(n^3)$

  • Back Solving

    • Input $n \times (n + 1)$ augmented upper triangular matrix $A$

    • Output a solution vector $\overrightarrow{x}$

    • For $i = n \ldots 1$

      • $t \gets A_{i n+1}$

      • For $j = i + 1 \ldots n$

        • $t \gets t - x_j A_{ij}$

      • $x_i \gets \frac{t}{A_{ii}}$

    • return $\overrightarrow{x}$
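
  • A Java sketch combining both phases above, partial pivoting included (assumes a unique solution exists; a is the $n \times (n + 1)$ augmented matrix):

      static double[] solve(double[][] a) {
          int n = a.length;
          for (int i = 0; i < n - 1; i++) {
              int pivot = i;                       // partial pivoting on column i
              for (int j = i + 1; j < n; j++)
                  if (Math.abs(a[j][i]) > Math.abs(a[pivot][i])) pivot = j;
              double[] tmp = a[i]; a[i] = a[pivot]; a[pivot] = tmp;  // row swap
              for (int j = i + 1; j < n; j++) {    // eliminate below the pivot
                  double t = a[j][i] / a[i][i];
                  for (int k = i; k <= n; k++) a[j][k] -= t * a[i][k];
              }
          }
          double[] x = new double[n];              // back substitution
          for (int i = n - 1; i >= 0; i--) {
              double t = a[i][n];
              for (int j = i + 1; j < n; j++) t -= x[j] * a[i][j];
              x[i] = t / a[i][i];
          }
          return x;
      }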

  • Gotchas:

    • divide by bigger numbers or use a language with good numerics support

    • Some systems have infinitely many solutions: an indeterminate system

    • Some have no solution: a degenerate (inconsistent) system

  • After Gaussian Elimination:

    • if last row is all zeros, infinite solutions

    • if last row is all zeros but the augment, no solutions

Lecture 7: Gaussian Elimination, LU Decomposition, Linear Programming and Coding Theory

LU Decomposition

  • Uses Gaussian elimination to factor $A$ into two matrices

    • L is lower triangular, 1's on the diagonal

    • U is upper Triangular

    • Works even when $\overrightarrow{b}$ changes or contains unknowns

Matrix Inverses

  • A Matrix $A^{-1}$ is a matrix such that $A \cdot A^{-1} = I$

  • A Matrix is invertible (non-singular) iff its determinant is non-zero.

    • $[A | I]$ and row ops to $[I | A^{-1}]$ (Gauss-Jordan)

  • The Determinant is $\mathrm{det}(A) = \sum_{\sigma \in S_n} \mathrm{sign}(\sigma)\prod_{i = 1}^{n} A_{i,\sigma(i)}$

    • The Determinant of an Upper Triangular Matrix is equal to the product of its diagonal, $\mathrm{det}(A) = \prod_{i = 1}^{n} a_{ii}$

    • The determinant is invariant with regard to adding/subtracting a constant multiple of a row to another

    • The sign flips whenever rows are exchanged.

    • any constant multiplication of a row modifies the determinant as follows

      • if a row is multiplied by $c$, the determinant is also multiplied by $c$ (divide by $c$ to recover the original)

Linear Programming

  • Not the modern sense of programming

  • A mathematical model for linear optimization

  • there's an objective function: cost(minimize)/benefit(maximize)

  • constraints: linear inequalities or equations that cannot be violated

  • $n$ variables, $x_1, \ldots, x_n$

  • Objective: output a solution vector $\overrightarrow{x}$ that is feasible and optimizes the objective function

  • Standard Form:

    • maximize $\overrightarrow{c}^{T} \cdot \overrightarrow{x}$

    • Subject To $A\overrightarrow{x} \leq \overrightarrow{b}$, $x_i \geq 0$

  • Ex:

    • Maximize $4x + 3y$

    • $x \geq 0$, $y \geq 0$, $x \leq 9$, $2y + x \leq 12$, $2y - x \leq 8$

  • For an LP in standard form, the optimal solution always lies at an extreme point (a vertex bounding the feasible region)

    • the simplex method exploits this, walking from vertex to vertex (a discrete cousin of "gradient descent")

Use of Linear Programs

  • Knapsack to ILP (Integer Linear Program)

    • Maximize the sum of all values of items taken

    • Subject to the total weight constraint, $W$, and an Integer Constraint: each item $a_i$ can be taken, or not (0 or 1) \[ x_i = \left\{ \begin{matrix}0 & \mathrm{not\ taken}\\1 & \mathrm{taken}\end{matrix} \right. \]

    • Maximize $\sum_{i = 1}^{n} x_iv_i$, subject to $\sum_{i = 1}^{n}x_iw_i \leq W$ and $x_i \in \{0, 1\} \forall x_i$

    • Can do a relaxation, converting an ILP to an LP, such that $0 \leq x_i \leq 1$, but the relaxed optimum may be fractional and thus invalid for the ILP

  • In General, ILP are NP-Complete

  • Some Problems, however, can be relaxed and preserve all optimal solutions (totally unimodular)

  • In General, Simplex Method may be inefficient

    • though good in practice

    • there are, however, provably polynomial time algorithms

      • Khachiyan (1979)

      • Interior Point Method (1984)

Coding Theory

  • Claude Shannon

  • Compressing Data for efficient storage and/or transmission

    • lossless: tarballs (encodes all information)

    • lossy: jpegs, mpegs, mp3s, oggs (loses information)

  • Adding redundancies to make it more reliable

    • sending $n + \log n$ bits, where $n$ is the number of data bits and $\log n$ is the redundancy

    • Reed-Solomon codes

Huffman Coding

  • Lossless compression scheme for text and text-like data

  • Let $\Sigma$ be a fixed alphabet of characters appearing in a file, $|\Sigma| = n$, (n distinct characters)

  • A block encoding scheme encodes every character with a binary codeword of equal length, ex: ASCII (256 chars, in codewords of 8 bits)

    • in general, $n$ characters require codewords of length $\lceil\log_2 n\rceil$ in a block encoding scheme

  • But some characters may have a higher frequency, and others a lower frequency, thus variable length codewords

    • high-freq gets short codewords

    • low-freq gets long codewords

    • $\{0, 01, 101, 010\}$ is ambiguous: $0$ is a prefix of $01$ and $010$

    • vlcw require a prefix-free code (no codeword appears as the prefix of another)

    • $\{10, 010, 110, 0110\}$ is prefix-free

  • Goal: Encode a file using prefix-free variable-length encoding such that common characters get short codewords, less common get longer codewords

Lecture 8: Huffman Coding and AVL Trees

Huffman Coding, Continued

  • Create $n$ binary tree nodes with the characters.

  • Combine 2 trees at a time with the smallest frequencies

  • Do this until a single Huffman tree remains

  • Characters only appear as leaves, thus the path from the root to any leaf is unique

  • high frequency leaves are shallow, low frequency leaves are deeper

  • Minimize average codeword length

  • Make sure to generate from root!

  • compression ratio is (original codeword length minus average codeword length) over original codeword length

  • in general english text files get 25-40% compression ratios

  • pack is the most common

  • Algorithm:

    • Input: An alphabet of $n$ symbols, $\Sigma$, and a frequency mapping $\mathrm{freq}: \Sigma \to [0, 1]$

    • Output: A Huffman Tree

    • $H \gets \mathrm{minHeap}$

    • For each $x \in \Sigma$

      • $T_x \gets \mathrm{single\ node\ tree}$

      • $wt(T_x) \gets \mathrm{freq}(x)$

      • $\mathrm{insert}(H, T_x)$

    • While $\mathrm{size}(H) > 1$

      • $T_r \gets \mathrm{new\ tree\ node}$

      • $T_a \gets \mathrm{get}(H)$

      • $T_b \gets \mathrm{get}(H)$

      • $\mathrm{left}(T_r) \gets T_a$

      • $\mathrm{right}(T_r) \gets T_b$

      • $\mathrm{wt}(T_r) \gets \mathrm{wt}(T_a) + \mathrm{wt}(T_b)$

      • $\mathrm{insert}(H, T_r)$

    • Return $\mathrm{get}(H)$
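
  • A Java sketch of the loop above using java.util.PriorityQueue as the min-heap (the Node class is an illustrative assumption):

      import java.util.PriorityQueue;

      static class Node implements Comparable<Node> {
          char symbol; double weight; Node left, right;
          Node(char s, double w) { symbol = s; weight = w; }          // leaf
          Node(Node l, Node r) { left = l; right = r; weight = l.weight + r.weight; }
          public int compareTo(Node o) { return Double.compare(weight, o.weight); }
      }

      static Node huffman(char[] symbols, double[] freq) {
          PriorityQueue<Node> heap = new PriorityQueue<>();
          for (int i = 0; i < symbols.length; i++)
              heap.add(new Node(symbols[i], freq[i]));
          while (heap.size() > 1)
              heap.add(new Node(heap.poll(), heap.poll()));  // merge two lightest
          return heap.poll();
      }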

  • Analysis

    • While Loop executes $n - 1$ times

    • Input alphabet and frequency function, input size $n = |\Sigma|$

    • heap insert/remove operations are performed $2n - 1$ times, i.e. $\mathcal{O}(n)$ operations at worst

    • each is truly $\mathcal{O}(\log n)$ when heap comparison operations are considered, for $\mathcal{O}(n \log n)$ total

    • Or $\widetilde{\mathcal{O}}(n)$

BSTs

  • Every node has a unique key, $k$

  • Every node in a node $u$'s left subtree has a key value less than $u$'s key

  • Every node in a node $u$'s right subtree has a key value greater than $u$'s key

  • In general, the depth of a BST with $n$ elements has an $\Omega(\log n)$ lower bound and an $\mathcal{O}(n)$ upper bound

  • Goal: Have a BST that is balanced: $d \in \mathcal{O}(\log n)$

AVL Trees

  • Accomplishes the goal

  • The height of a node is the length of the longest path from that node down to a leaf

  • The balance factor of the node $u$ is as follows $\mathrm{bf}(u) = \mathrm{h}(u_l) - \mathrm{h}(u_r)$

  • The height of an empty tree is always $-1$, the height of a single node tree is $0$

  • balance factors of $-1$, $0$, $1$ indicate a balanced tree

  • positive balance factors indicate left skew

  • negative balance factors indicate right skew

  • left rotation pulls the tree to the left

  • right rotation pulls the tree to the right

  • Can have left and right rotations within subtrees, such as RL or LR rotations

  • Suppose that $r$ has $T_1$ on its left and $c$ on its right, where $c$ has $T_2$ on its left and $T_3$ on its right, and $\mathrm{bf}(r) = -2$, $\mathrm{bf}(c) = -1$

    • perform a generalized left rotation, with $c$ as the new root: $r$ becomes $c$'s left child, $T_2$ moves to the right of $r$, and $T_3$ stays on the right of $c$

  • Suppose that $r$ has $c$ on left, $T_3$ on right. $c$ has $T_1$ and $T_2$ on left and right respectively

    • Perform generalized right rotation, $c$ has $T_1$ on left, $r$ on right. $r$ has $T_2$ and $T_3$ on left and right respectively

  • (r (c T_1 (g T_2 T_3)) T_4)

    • (r (g (c T_1 T_2) T_3) T_4) -> (g (c T_1 T_2) (r T_3 T_4))

  • Swinging over lets you cut one off and put it on the other side of the tree

Lecture 9: AVL Trees, 2-3 Trees and Balancing

AVL Trees

  • Insert 8, 4, 7, 3, 2, 5, 1, 10, 6

    • (8)

    • (8 4)

    • (8 (4 nil 7))

    • (8 (4 3 7)) -> (8 (7 4 3)) -> (7 (4 3) 8)

    • (7 (4 (3 2)) 8) -> (7 (3 2 4) 8)

    • (7 (3 2 (4 nil 5)) 8) -> (7 (4 (3 2) 5) 8) -> (4 (3 2) (7 5 8))

    • (4 (3 (2 1)) (7 5 8)) -> (4 (2 1 3) (7 5 8))

    • (4 (2 1 3) (7 5 (8 nil 10)))

    • (4 (2 1 3) (7 (5 nil 6) (8 nil 10)))

  • Observations:

    1. Search is $\mathcal{O}(d)$

      • $d \in \mathcal{O}(\log n)$

    2. Insertion is $\mathcal{O}(d)$

      • $d \in \mathcal{O}(\log n)$

    3. A single rotation is $\mathcal{O}(1)$

    4. Deletion is $\mathcal{O}(d)$

      • $\mathcal{O}(d)$ to find replacement

      • May need to rotate all the way up to the root. ($\mathcal{O}(\log n)$)

    5. In the worst case $\Theta(\log n)$, and guaranteed to be $\mathcal{O}(\log n)$ in all cases

B-Trees

  • Each node has at most $m$ children

  • Every node except the root has at least $\left\lceil\frac{m}{2}\right\rceil$ children

  • The root has at least 2 children unless it is a leaf

  • Non-leaf nodes with $k$ children have $k - 1$ keys

  • All leaves are at the same level and non-empty

  • commonly used in databases

  • Ex:

    • Each node has at most 3 children

    • Each node has at least 2 children

2-3 trees

  • This is a specialization of the B-tree where $m = 3$

  • 2 kinds of nodes:

    • 2-nodes: A single key $k$ and 2 children, $\ell, r$

    • 3-nodes: 2 keys $k_1, k_2$ such that $k_1 < k_2$ and 3 children $\ell, c, r$

  • Ex: (10 (5 (7 4) 7) ((15 20) 12 17 (21 25)))

  • Search is simple, but varies based on node size

  • Insertion is always done at a leaf

    • sometimes requires changing node size and ordering

    • sometimes requires reordering of node keys

  • Ex: 8 4 7 3 2 5 1 10 6

    • (8)

    • ((4 8))

    • ((4 7 8)) -> (7 4 8)

    • (7 (3 4) 8)

    • (7 (2 3 4) 8) -> ((3 7) (2 4) 8) -> ((3 7) 2 4 8)

    • ((3 7) 2 (4 5) 8)

    • ((3 7) (1 2) (4 5) 8)

    • ((3 7) (1 2) (4 5) (8 10))

    • ((3 5 7) (1 2) (4 6) (8 10)) -> (5 (3 (1 2) 4) (7 6 (8 10)))

  • Analysis:

    • Min key count for depth $d$

      • all nodes are 2-nodes (1 key per node)

      • all nodes are present at all levels (always 2 children)

      • $2^{d + 1} - 1$ nodes

      • $2^{d + 1} - 1$ keys

    • Max key count for depth $d$

      • every node is 3-node (2 keys per node, 3 children per node)

      • $\frac{3^{d + 1} - 1}{2}$ nodes, $3^{d + 1} - 1$ keys

    • $d \leq \log_2 (n + 1) - 1$

    • $\log_3(n + 1) - 1 \leq d$

    • Therefore $d \in \Theta(\log n)$

  • Don't even think about deletion. It's a pain.

Other Balanced BSTs

  • Splay Trees

  • Tries

  • Red-Black Trees (the default TreeSet implementation in Java)

  • A Trees

  • Scapegoat Trees

  • Treap

Heaps

  • A (min) heap is a binary tree data structure with the following properties

    • all nodes are present at all levels except possibly the last level

    • at the last level, it is full to the left

    • the key of any node is less than both of its children's keys

    • The minimum is therefore always at the root

    • The max is always in a leaf, towards the bottom of the tree

  • 2 efficient operations

    • remove min

    • insert

Lecture 10: More Heaps

  • Insertion. Woo.

  • Most operations are $\mathcal{O}(\log n)$

  • Can implement using array.

    • left is $2i$

    • right is $2i + 1$

    • parent is $\left\lfloor \frac{i}{2} \right\rfloor$

    • last element is at index $n$

    • Will have to occasionally resize the array

  • Suppose you have $n$ elements, inserting each one into a min heap, then take each out in turn. You get a sorted list.
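
  • That procedure is heapsort; a minimal sketch with java.util.PriorityQueue:

      import java.util.PriorityQueue;

      // n inserts + n removals, O(n log n) total
      static int[] heapSort(int[] a) {
          PriorityQueue<Integer> h = new PriorityQueue<>();
          for (int x : a) h.add(x);                  // sift-up, O(log n) each
          int[] out = new int[a.length];
          for (int i = 0; i < out.length; i++) out[i] = h.poll();  // remove-min
          return out;
      }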

Lecture 11: Review

AVL Tree Rotations

  • rotate around any node whose balance factor is $2$ or $-2$ (or greater in magnitude)

  • a positive sign means the tree skews left; a negative sign, right

  • number is magnitude

    First  Second  Rotation
    left   left    R
    left   right   LR
    right  left    RL
    right  right   L

  • Use this table to determine what rotation to use for an AVL tree

Collection, Find Median

  • Input $A = \{a_1, \ldots, a_n\}$

  • Output median of $A$

  • Sort A

  • Output $a_{\lceil n/2 \rceil}$

  • Input $A$, Size $n$

  • Elementary operation, comparison in sorting algorithm, about $n\log n$

  • $\mathcal{O}(n \log n)$

Graph cycles of length $k$

  • Input $G = (V, E)$, $3 \leq k \leq n$

  • Output: True if G contains a cycle of length $k$, False otherwise

  • For each $k$-combination of vertices $v_1, v_2, \ldots v_k$:

    • For each permutation of $v_1, \ldots, v_k$, $\pi = v_1^{\prime}, \ldots, v_k^{\prime}$

      • $\mathrm{isCycle} \gets true$

      • For $i = 1 \ldots k - 1$

        • if $(v_i^{\prime}, v_{i + 1}^{\prime}) \not\in E$

          • $\mathrm{isCycle} \gets false$

        • if $(v_k^{\prime}, v_i^{\prime}) \not\in E$

          • $\mathrm{isCycle} \gets false$

        • if $\mathrm{isCycle}$

          • return True

  • return False

  • is $\mathcal{O}(n!)$, as outer loop is $\binom{n}{k}$ and inner is $k!$, which becomes $\frac{n!}{(n - k)!}$

Develop an algorithm that finds the length of the longest cycle in a graph

  • Input: $G = (V, E)$

  • Output: $\ell$, the length of the longest cycle

  • For $k = n \ldots 3$

    • if $A(G, k)$

      • return $k$

  • return "no cycle"

Develop an algorithm to solve the hamiltonian cycle problem

  • Input $G = (V, E)$

  • Output True if $G$ contains a Hamiltonian Cycle, False otherwise

  • Return $A(G, n)$

  • Called a Turing Reduction

Grade School Vs Karatsuba

  • $A, B$ are 10000 digits long

  • 1 million FLOPS ($10^6$)

  • Grade School: $\frac{10000^2}{10^6} = 100$ seconds

  • Karatsuba: $\frac{10000^{\log_2 3}}{10^6} \approx 2$ seconds

  • Karatsuba is faster

Gaussian Elimination and Time

  • 100 Million variables ($10^8$)

  • 1 PetaFLOP ($10^{15}$)

  • Gaussian Elimination is $\mathcal{O}(n^3)$

  • $\frac{\left(10^8\right)^3}{10^{15}} = 10^9$ seconds $\approx 32$ years

Matrix Multiplications: 2 recursive on a constant time

  • $M(n) = 2M(n/2)$

  • By Master Theorem ($a = 2$, $b = 2$, $d = 0$, $a > b^d$), $\Theta(n)$

  • Trivial Lower Bound, however, is $\Omega(n^2)$

  • Therefore, this is not feasible

AVL Tree

  • Label each node with balance factor

  • Insert a sequence and re-balance as needed

2-3 Trees

  • Be careful of insertions!

2-3 Tree, calc minimum depth

  • 365 keys

  • max key count for depth $d$: $n = 3^{d + 1} - 1$

  • $d = \left\lceil\log_3 \left(n + 1\right)\right\rceil - 1 = 5$

Insert into a min-heap

  • Remember to heapify on each insertion

  • count comparisons and swaps

Lecture 12: Graph Algorithms

Review

  • A Graph $G = (V, E)$ is a set of vertices $V = \{v_1, \ldots v_n\}$ and a set of edges $E = \{e_1, \ldots e_m\}$

  • $n$ is the number of vertices

  • $m = |E| \in \mathcal{O}(n^2)$ is the number of edges

  • Variations:

    • $e = (u, v) = (v, u)$ is an undirected pair

    • $e = (u, v) \neq (v, u)$ is an ordered pair

    • Weighted:

      • $wt : V \to \mathbb{R}$

      • $wt : E \to \mathbb{R}$

    • Labelled: Vertices and/or edges may or may not have a label

    • Pseudo-graphs allow self-loops

  • Implementations

    • Adjacency Lists

    • Adjacency Matrices

    • Library

      • JGraphT for java

      • Boost, LEMON for C++

      • NetworkX, PyGraph for Python

      • Graph.JS for JavaScript

      • graphy for Ruby

Depth First Search

  • Explore a graph as deep as possible before backtracking to explore other areas

  • Process each node at most once

    • $\mathcal{O}(n)$ or more appropriately, $\mathcal{O}(n + m)$, max could then be $\mathcal{O}(n^2)$

  • Is quite simple, must keep track of seen nodes

  • Stacks help to keep track of location

  • color vertices to show that they've been touched

    • white: unvisited (all vertices initially)

    • grey: discovered and (possibly) processed, but not yet off the stack

    • black: discovered, processed, off the stack and finished

  • Also keep track of discovery and finish "times"

    • start an integer counter at 1

    • the counter is incremented each time a vertex is discovered or finished (colored black)

    • when a node is discovered, color it grey and stamp with discovery time

    • when a node is finished, color it black and stamp with finish time

  • Algorithm (Non Recursive)

    • Given $G = (V, E)$

    • For each vertex $v \in V$

      • color $v$ white

    • $\mathrm{count} \gets 1$

    • $S \gets \mathrm{stack}$

    • $u \gets $ start vertex

    • color $u$ grey

    • Push $u$ on to $S$

    • stamp discovery time $u$ with $\mathrm{count}$

    • While $S$ is not empty:

      • $\mathrm{count} \gets \mathrm{count} + 1$

      • $x \gets \mathrm{peek}(S)$

      • $y \gets $ next white vertex in $N(x)$

      • Comment: If there is a grey vertex in $N(y)$, not equal to $x$, there's a cycle

      • If $y = \phi$:

        • $x \gets \mathrm{pop}(S)$

        • process $x$

        • color $x$ black

        • Stamp finish time $x$ with $\mathrm{count}$

      • Else:

        • $\mathrm{push}(S, y)$

        • color $y$ grey

        • stamp $y$ with discovery time $\mathrm{count}$

  • If there are any white vertices, they are part of a disconnected graph

  • The backtracking pattern is called a DFS tree, with the edges traversed in it called tree-edges. Other edges are back/forward edges or cross edges

  • back/forward edges connect children to far ancestors (no distinction on an undirected graph, show cycles in undirected graph, back edges show cycles in directed graphs)

  • cross edges connect sibling/cousin nodes (only possible on a directed graph)

  • Recursive Algorithms:

    • Main:

      • For each $v \in V$, color $v$ white

      • $\mathrm{count} \gets 0$

      • For each $v \in V$:

        • if $v$ is white:

          • $\mathrm{DFS}(G, v)$

    • Recursive Function:

      • $\mathrm{count} \gets \mathrm{count} + 1$

      • mark $v$ with discovery time

      • color $v$ grey

      • For each $w \in N(v)$:

        • if $w$ is white:

          • $\mathrm{DFS}(G, w)$

      • $\mathrm{count} \gets \mathrm{count} + 1$

      • mark $v$ with finish time

      • process $v$

      • color $v$ black
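
  • A Java sketch of the recursive version over an adjacency list (0 = white, 1 = grey, 2 = black; the static count field stands in for the shared counter):

      static int count = 0;

      static void dfs(java.util.List<java.util.List<Integer>> adj,
                      int v, int[] color, int[] disc, int[] fin) {
          color[v] = 1;          // grey: discovered
          disc[v] = ++count;     // stamp discovery time
          for (int w : adj.get(v)) {
              if (color[w] == 0) {
                  dfs(adj, w, color, disc, fin);
              }
          }
          color[v] = 2;          // black: finished
          fin[v] = ++count;      // stamp finish time
      }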

Lecture 13: Graph Algorithms, Continued

DFS Analysis

  • Input is $G = (V, E)$

  • Size $n = |V|$, $m = |E| \in \mathcal{O}(n^2)$

  • Elementary Operation:

    • Traversing to a node $\in \mathcal{O}(n)$

    • Examining edges $\in \mathcal{O}(m)$

    • Processing nodes $\in \mathcal{O}(n)$

    • May have a total of $\mathcal{O}(m + n)$

    • Dense graphs $m \in \Omega(n^2)$, thus $\mathcal{O}(n^2)$

    • Sparse graphs $m \in \mathcal{O}(n)$

      • ex Trees

BFS: Breadth First Search

  • Explore closer vertices before distant vertices

  • This is done with a queue

  • Algorithm (Main)

    • For each $v \in V$:

      • color $v$ white

    • $\mathrm{count} \gets 0$

    • for each $v \in V$

      • if $v$ is white:

        • $\mathrm{BFS}(G, v)$

  • Algorithm Subroutine $\mathrm{BFS}$

    • Outputs a BFS traversal starting at $v$

    • $\mathrm{count} \gets \mathrm{count} + 1$

    • $Q \gets $ empty queue

    • Mark $v$ discover

    • Color $v$ grey

    • $\mathrm{enqueue}(Q, v)$

    • While $Q$ is not empty

      • $x \gets \mathrm{peek}(Q)$

      • for each $y \in N(x)$

        • if $y$ is white:

          • $\mathrm{count} \gets \mathrm{count} + 1$

          • mark $y$ with discover time

          • color $y$ grey

          • $\mathrm{enqueue}(Q, y)$

      • $z \gets \mathrm{dequeue}(Q)$

      • $\mathrm{count} \gets \mathrm{count} + 1$

      • mark $z$ with finish time

      • color $z$ black

      • process $z$

  • Discover and finishing timestamps are less useful, as both are sorted

  • BFS trees will have tree edges and may have cross edges denoting cycles; an undirected graph will never have back/forward edges, but a directed graph may have back edges

    • Provides the shortest path from the initial vertex to all other connected vertices (but only in an unweighted graph, and only relative to the start vertex)

  • Input size $n = |V|$, $m = |E| \in \mathcal{O}(n^2)$

  • Always $\mathcal{O}(n + m)$

Applications of DFS/BFS

  • Find Connectivity (directed or undirected)

    • Given $G = (V, E)$, $s, t \in V$

    • Output a path $p$ from $s \to t$ if one exists, $\phi$ if no such path exists (Functional Problem)

    • Output yes if there exists a path (undirected or directed) from $s \to t$, no otherwise (Decision Problem, which reduces to the functional problem)

  • Suppose a Functional Path Builder that takes a graph $G$ and points $s, t$ and returns either a path $p$ or $\phi$ if none exists

    • If $\phi$ return no, otherwise, return yes

    • The Opposite can be done, but it's a pain and involves an oracle and feedback, and is an adaptive reduction

  • Cycle detection involves forward or back edges in DFS, cross edges in BFS

  • Bipartite Testing:

    • Given a Graph $G$

    • Output yes if $G$ is bipartite, no if not

    • Theorem: $G$ is bipartite iff $G$ contains no odd-length cycles.

    • Add blue and red colors, start with red, alternate to blue, keep alternating. If at any point, you find a color in your neighborhood that is the same as yours, there's a cycle of odd length

  • Ex:

    • in a directed graph, a sink has no paths going from it, sources have only paths going out from themselves

    • In a DFS of a directed acyclic graph, listing vertices in descending finish-time order gives a topological order (sinks finish first)

  • In a directed graph $G$, a strongly connected component is a subset of vertices $C \subseteq V$ such that any pair of vertices $x, y \in C$ is connected by paths in both directions

  • To generate a Condensation Graph

    • Run DFS

    • Compute $G^{T}$

    • Perform another DFS on $G^{T}$ in descending order of the original finish times

      • Each Restart defines a new strongly connected component

  • If the condensation graph is one node, the original graph is a single strongly connected component

Lecture 14: Graph Algorithms, Continued

Condensation Graphs

  • Reduce a graph but preserve its connectivity

  • Given a graph $G$ reduce "redundant" connectivity in an optimal way

  • many different criteria

    • min/max flow

    • graph decomposition

    • minimum spanning tree

Minimum Spanning Tree

  • Let $G = (V, E)$ be an undirected, weighted graph.

  • A Spanning Tree $T = (V, E^{\prime}), E^{\prime} \subseteq E$ is a set of edges that span the vertex set. $T$ is minimal if $\sum_{e \in E^{\prime}} \mathrm{wt}(e)$ is no more than that of any other spanning tree.

Kruskal's Algorithm

  • Greedy algorithm, start taking edges of minimal weight as long as they do not induce a cycle

  • For each edge, ordered by minimum weight

    • Take the edge if it does not induce a cycle

    • Stop if all vertices are touched by the same tree and edge count equals n-1

  • Algorithm, defined:

    • Input: An undirected, weighted graph $G = (V, E)$

    • Output: An MST of $G$

    • Sort $E$ by weight, in non-decreasing order ($\mathcal{O}(m \log m)$)

    • $E_T \gets \emptyset$

    • $k \gets 1$

    • While $|E_T| < n - 1$: ($\mathcal{O}(m)$)

      • If $(V, E_T \cup \{e_k\})$ is acyclic ($\mathcal{O}(n)$, since $(V, E_T)$ is a forest)

        • $E_T \gets E_T \cup \{e_k\}$

      • $k \gets k + 1$

    • Return $(V, E_T)$

  • Is $\mathcal{O}(m(n + \log n)) \in \mathcal{O}(mn) \in \mathcal{O}(n^3)$, unless a disjoint set is used

  • Works well with a disjoint set datastructure
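
  • A Java sketch using a disjoint set (union-find with path compression) for the acyclicity test; edges are assumed given as {u, v, weight} triples:

      import java.util.Arrays;

      static int[] parent;
      static int find(int x) {             // find root, compressing the path
          return parent[x] == x ? x : (parent[x] = find(parent[x]));
      }

      static long kruskal(int n, int[][] edges) {
          Arrays.sort(edges, (e, f) -> Integer.compare(e[2], f[2]));  // by weight
          parent = new int[n];
          for (int i = 0; i < n; i++) parent[i] = i;
          long total = 0; int taken = 0;
          for (int[] e : edges) {
              int ru = find(e[0]), rv = find(e[1]);
              if (ru != rv) {              // acyclic: endpoints in different trees
                  parent[ru] = rv;
                  total += e[2];
                  if (++taken == n - 1) break;
              }
          }
          return total;
      }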

Lecture 15: Graph Algorithms, Continued

Prim's Algorithm

  • greedy, using locally optimal decisions to produce a globally optimal solution

  • start at an arbitrary vertex, iteratively build a tree by adding the least weighted edge out from the current tree

  • Has three categories of vertices:

    • tree vertices: in the tree

    • fringe vertices: connected to the tree vertices but not in the tree; choose the least weighted fringe edge next

    • undiscovered vertices: not yet in the tree or on the fringe

  • Pseudocode:

    • Input: Weighted Undirected Graph $G = (V, E)$

    • Output: A minimum Spanning Tree $T$ of $G$

    • $E_{T} \gets \emptyset$

    • $V_T \gets \{v_1\}$

    • $E^{*} \gets N(v_1)$

    • For $i = 1 \ldots n - 1$

      • $e = (u, v) \gets$ minimum weighted edge in $E^*$, with $u$ being the endpoint in $V_T$ and $v$ being the endpoint not in $V_T$

      • $V_T \gets V_T \cup \{v\}$

      • $E_T \gets E_T \cup \{e\}$

      • $E^* \gets (E^* \cup N(v)) \setminus \{e = (x, y) | x \in V_T \land y \in V_T\}$

    • Output $V_T$, $E_T$

  • Done $n - 1$ times; if a heap is used for the fringe edges, $\mathcal{O}(m \log n)$ overall

Dijkstra's Algorithm

  • Given a weighted graph (directed or not), $G = (V, E)$ and a source vertex $s \in V$.

  • Output the shortest distance paths from $s$ to all other vertices in $V$.

  • Caveat: requires non-negative edge weights (in particular, no negatively weighted cycles)

  • Start at $s$

    • Add the least weighted vertex connected to your tree

    • incorporate the new vertex's neighborhood, updating weights and predecessors

  • Pseudocode:

    • Input: a weighted graph $G = (V, E)$ and $s \in V$, a source

    • Output: for each $v \in V$, the least weight $d_v$ of a path $s \to v$ and the predecessor $p_v$ on such a path

    • $Q \gets \mathrm{min\ priority\ queue}$

    • For each $v \in V \setminus \{s\}$

      • $d_v \gets \infty$

      • $p_v \gets \phi$

      • $enqueue(Q, v, d_v)$

    • $d_s \gets 0$

    • $p_s \gets \phi$

    • enqueue($Q, s, d_s$)

    • $V_T \gets \emptyset$

    • for $i = 1 \ldots n$

      • $u \gets dequeue(Q)$

      • $V_T \gets V_T \cup \{u\}$

      • Foreach $v \in N(u) \setminus V_T$

        • if $d_u + wt(u, v) < d_v$

          • $d_v \gets d_u + wt(u, v)$

          • $p_v \gets u$

          • $decreasePriority(Q, v, d_v)$ (is $\mathcal{O}(\log n)$)

    • Output $d, p$

  • Total will be $\mathcal{O}(m \log n)$
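
  • A Java sketch with java.util.PriorityQueue; since it lacks decreasePriority, stale entries are skipped on removal instead (lazy deletion), which keeps the same $\mathcal{O}(m \log n)$ bound:

      static double[] dijkstra(java.util.List<java.util.List<int[]>> adj, int s) {
          int n = adj.size();
          double[] d = new double[n];
          java.util.Arrays.fill(d, Double.POSITIVE_INFINITY);
          d[s] = 0;
          java.util.PriorityQueue<double[]> q =
              new java.util.PriorityQueue<>((x, y) -> Double.compare(x[1], y[1]));
          q.add(new double[]{s, 0});
          while (!q.isEmpty()) {
              double[] top = q.poll();
              int u = (int) top[0];
              if (top[1] > d[u]) continue;       // stale entry, skip
              for (int[] e : adj.get(u)) {       // e = {v, wt(u, v)}
                  int v = e[0];
                  if (d[u] + e[1] < d[v]) {
                      d[v] = d[u] + e[1];        // relax edge (u, v)
                      q.add(new double[]{v, d[v]});
                  }
              }
          }
          return d;
      }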

Floyd-Warshall's

  • Given a weighted graph (directed or not), $G = (V, E)$

  • Output the shortest distance and path for every pair of nodes

  • Computes all of these at once: the shortest distance and path for every pair $v_i \to v_j$

  • Works with matrices

  • Algo:

    • dist = $n \times n$ matrix, defaulting to $\infty$

    • for each $v \in V$

      • dist[v,v] = 0

    • for each edge $(u, v) \in E$

      • dist[u, v] = wt(u, v)

    • for k from 1 to n

      • for i from 1 to n

        • for j from 1 to n

          • if dist[i,j] > dist[i,k] + dist[k,j]

            • dist[i,j] = dist[i,k] + dist[k,j]
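
  • A Java sketch of the triple loop (dist is assumed pre-filled as above: 0 on the diagonal, edge weights, $\infty$ elsewhere):

      static void floydWarshall(double[][] dist) {
          int n = dist.length;
          for (int k = 0; k < n; k++)           // allowed midpoints so far
              for (int i = 0; i < n; i++)
                  for (int j = 0; j < n; j++)
                      if (dist[i][k] + dist[k][j] < dist[i][j])
                          dist[i][j] = dist[i][k] + dist[k][j];  // route through k
      }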

Lecture 16: More Graph Algorithms and the start of Dynamic Programming

Floyd-Warshall's Algorithm, Continued

  • Checks if there's a midpoint that's less than direct

  • Algo

    • Input: A Weighted Graph $G = (V, E)$

    • Output: Shortest Distance Matrix and successor matrix

    • $D \gets (n \times n)$ matrix

    • $S \gets (n \times n)$ matrix

    • For $i \leq n, j \leq n$:

      • $D_{ij} \gets wt(v_i, v_j)$ ($\infty$ if there isn't an edge)

      • $S_{ij} \gets j$ for all edges $(v_i, v_j)$, $\phi$ if no such edge.

    • For $k = 1 \ldots n$, $i = 1 \ldots n$, $j = 1 \ldots n$:

      • if $D_{ik} + D_{kj} < D_{ij}$

        • $D_{ij} \gets D_{ik} + D_{kj}$

        • $S_{ij} \gets S_{ik}$

    • output $D$ and $S$

  • If only one triangle of the final matrix has finite entries (generally the lower), the graph is a DAG

Dynamic Programming

  • Motivation: Binomial coefficients $(x + y)^n = \binom{n}{0}x^{n} + \ldots + \binom{n}{n}y^n$

    • $\binom{n}{k} = \frac{n!}{k!(n - k)!}$

    • About $(n - k - 2) + (k - 2)$ multiplications and a division

    • $n - 4$ multiplications, thus $\mathcal{O}(n)$ multiplications

    • Pascal's Identity: $\binom{n}{k} = \binom{n - 1}{k - 1} + \binom{n - 1}{k}$

      • with the base condition $\binom{n}{0} = \binom{n}{n} = 1$

    • Algo(RecursiveBinomial):

      • Takes $n, k$

      • if $k = 0 \lor k = n$

        • return 1

      • Else

        • return RecursiveBinomial($n - 1$, $k - 1$) + RecursiveBinomial($n - 1$, $k$)

  • Enter Memoization!

    • If you compute a value, cache it!

    • Use a map

    • Or use a tableau, this is dynamic programming

  • I.E.

    • Set up a recurrence that expresses an optimal solution in terms of subsolutions

    • Fill a tableau of values, using previously computed values for future values

    • Work forwards, not backwards

    • Know what can be ignored

  • Algo:

    • for $i \in 0 \ldots n$

      • for $j \in 0 \ldots min(i, k)$

        • if $j = 0 \lor j = i$

          • $C_{i,j} \gets 1$

        • else

          • $C_{i,j} \gets C_{i-1,j-1} + C_{i-1,j}$

    • Return $C_{n,k}$

  • Requires $\mathcal{O}(nk) \subseteq \mathcal{O}(n^2)$ additions
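
  • A Java sketch of the tableau; row $i$ only ever reads row $i - 1$:

      static long binomial(int n, int k) {
          long[][] c = new long[n + 1][k + 1];
          for (int i = 0; i <= n; i++) {
              for (int j = 0; j <= Math.min(i, k); j++) {
                  if (j == 0 || j == i) c[i][j] = 1;            // base cases
                  else c[i][j] = c[i - 1][j - 1] + c[i - 1][j]; // Pascal's identity
              }
          }
          return c[n][k];
      }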

Optimal Binary Search Tree

  • Suppose you have $n$ keys, $k_1, \ldots, k_n$ and $k_1 < k_2 < \cdots < k_n$

  • Suppose you have a probability distribution on the searches of keys, $p(k_i)$ is the probability that someone will search for $k_i$

  • Optimize the BST for locality and probabilities

  • Definition: The Catalan Numbers: $C_n = \frac{1}{n + 1} \binom{2n}{n}$ corresponds to the number of binary tree configurations with $n$ nodes

  • Find the best part to drape over

Lecture 17: Dynamic Programming

Optimal Binary Search Trees

  • Given a set of keys $k_1 < k_2 < \cdots < k_n$

  • with probabilities $p(k_i)$

  • Let $C_{ij}$ be the number of key comparisons in the obst for keys from $k_i$ to $k_j$

  • We want $C_{1n}$ to be minimized

  • Specific cases:

    • $C_{i,i-1} = 0$

    • $C_{ii} = p(k_i)$

  • A tree with root $k_\ell$, left subtree on keys $k_i \ldots k_{\ell - 1}$, and right subtree on keys $k_{\ell + 1} \ldots k_j$

    • costs $C_{i,\ell-1} + C_{\ell+1,j} + \sum_{s = i}^{j}p(k_s)$

  • $C_{i,j} = \min_{i\leq \ell \leq j} \{C_{i,\ell-1} + C_{\ell+1,j}\} + \sum_{s=i}^{j}p(k_s)$

  • Algo:

    • Input: A set of ordered keys $k_1$ to $k_n$ with search probabilities

    • Output: A root tableau and a value tableau

    • For $i = 1 \ldots n$

      • $C_{i, i-1} \gets 0$

      • $C_{i,i} \gets p(k_i)$

      • $R_{i,i} \gets i$

    • $C_{n+1,n} \gets 0$

    • For $d = 1 \ldots (n - 1)$

      • For $i = 1 \ldots (n - d)$

        • $j \gets i + d$

        • $min \gets \infty$

        • for $\ell = i \ldots j$

          • $q \gets C_{i,\ell - 1} + C_{\ell+1,j}$

          • if $q < min$

            • $min \gets q$

            • $R_{i,j} \gets \ell$

        • $C_{i,j} \gets min + \sum_{s=i}^{j}p(k_s)$

    • Output $C_{1,n}$ and $R$

0-1 Knapsack Problem

  • Given a set of $n$ elements $A$, each with a weight $w(a_i)$ and a value $v(a_i)$, and a total capacity $W^{\prime}$

  • Choose a subset whose total weight does not exceed $W^{\prime}$, maximizing total value

  • $V_{ij}$ is the value of the optimal knapsack involving elements from 1 to $i$ and having a weight no more than $j$

  • $V_{nW^{\prime}}$ is the optimal solution

  • An $n \times W^{\prime}$ tableau

  • Special Cases

    • $V_{0j} = 0$

    • $V_{i0} = 0$

  • based on max of a few sums

  • Is top to bottom left to right

  • Algorithm:

    • Takes $A, w, v, W$

    • Output: completed tableau

    • For $i = 0 \ldots n$

      • for $j = 0 \ldots W$

        • If $i = 0 \lor j = 0$

          • $V_{ij} \gets 0$

        • Else If $(j - w_i) \geq 0$

          • $V_{i,j} \gets \max(V_{i-1,j}, V_{i-1,j-w_i} + v_i)$

        • Else

          • $V_{i,j} \gets V_{i-1,j}$

    • Return $V$

  • This is order $\mathcal{O}(nW)$ (pseudo-polynomial)
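
  • A Java sketch of the tableau (w and v are assumed 1-indexed with a dummy entry at index 0):

      static int knapsack(int[] w, int[] v, int W) {
          int n = w.length - 1;
          int[][] V = new int[n + 1][W + 1];   // row 0 and column 0 stay 0
          for (int i = 1; i <= n; i++) {
              for (int j = 0; j <= W; j++) {
                  if (j - w[i] >= 0) {         // item i fits: take the better option
                      V[i][j] = Math.max(V[i - 1][j], V[i - 1][j - w[i]] + v[i]);
                  } else {                     // item i does not fit
                      V[i][j] = V[i - 1][j];
                  }
              }
          }
          return V[n][W];
      }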

Lecture 18: Hash Tables, Hash Functions and Computability

Hash Tables

  • store things in arrays

  • indices are determined by a key and a hash

  • Amortized constant time for look-up

Hash Functions

  • map a large domain onto a small codomain

  • involves cutting up a large domain to fit into a small one

  • Often $h: \mathbb{Z} \to \mathbb{Z}_m$

  • Not 1 to 1; can have collisions, i.e. $h(a) = h(b)$ for $a \neq b$

  • $\mathbb{Z}_m = \{0, 1, \ldots, m - 1\}$ (a ring)

    • ex: $h(x) = x \mod 7$

  • Should be easy to compute

  • uniformly distribute elements

  • Other applications:

    • password storage (secure)

      • RC4, MD4, MD5, SHA suite, PBKDF2/scrypt/bcrypt

      • salt makes storage more secure, helps to defeat rainbow table attacks

    • Crypto currencies

      • blockchains

    • integrity checking/unique identifiers

    • derandomization

Implementation stuff

  • objects to be stored in the array

  • integer representation of the object is hashed and used for the index in the array

  • object is stored

  • Ex $h(k) = (7k + 13) \mod 8$

    • $\mathbb{Z}_8 = \{0, 1, 2, 3, 4, 5, 6, 7\}$

    • Only 8 elements can be stored

  • Collision resolution (open addressing):

    • linear probing: traverse the array to the next available spot

      • will eventually fill up

      • lookup/insertion will be worse ($\mathcal{O}(n)$)

    • linear function probing: $h(k, i) = (h(k) + c_1 i) \mod m$

    • quadratic probing: $h(k, i) = (h(k) + c_1 i + c_2 i^2) \mod m$

    • secondary hash probing: $h(k, i) = (h(k) + i \cdot s(k)) \mod m$

    • chaining: each index is a reference to a linked list or other general storage data structure
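
  • A linear-probing insert sketch in Java, reusing the example hash $h(k) = (7k + 13) \mod m$ from above (assumes the table is not yet full):

      static void insert(Integer[] table, int key) {
          int m = table.length;
          int i = Math.floorMod(7 * key + 13, m);    // h(k) = (7k + 13) mod m
          while (table[i] != null) i = (i + 1) % m;  // probe the next slot on collision
          table[i] = key;
      }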

  • Strategies to remedy

    • larger hash tables reduce collisions, but waste memory

    • faster means more memory

    • slower means less memory

  • Load factor $d = \frac{n}{m}$, where $n$ is the number of elements and $m$ is the size of the array

    • high load factors: less memory, slow performance

    • low load factor: more memory, fast performance

  • when load factor reaches a threshold, often 0.75, rehash with a new hash function and a larger array (larger m)

    • increases table size

    • decreases load factor

    • all elements however must be rehashed ($\mathcal{O}(n)$)

  • uses

    • caches

    • sets

Computability

  • Can computers solve any problem?

    • No!

      • some problems are too hard

      • too abstract

      • do not halt

  • Can algorithms solve any problem?

    • Can Turing Machines decide any language?

    • Algorithm $\iff$ Turing Machine

    • Solve $\iff$ Decide

    • Problem $\iff$ Language

  • Language:

    • An alphabet $\Sigma$, the characters, ex $\Sigma = \{0, 1\}$

    • A string or word is a finite sequence of of characters, or $\lambda$, the empty string

    • The set of all finite strings, $\{\lambda, 0, 1, 00, 01, \ldots\}$, or $\Sigma^*$, the Kleene Star

    • A language is therefore some subset of $L \subseteq \Sigma^*$

      • has a constraint of some form in most situations

      • $L$ is the set of all binary strings that encode a graph that is acyclic

        • $\langle G \rangle$ is the encoding

        • Huffman coding reduces any language to binary

    • Simple Model:

      • the Finite State Machine: Can be used to determine if in a language

        • Starts at start, and based on the input, traverse an edge

          • if you are at a reject state at the end, reject the string

          • if at an accept state, accept the string

Lecture 19: Computability, Continued

Turing Machines

  • finite state machines

    • set of states

      • some accepting

      • some rejecting

    • transitions (read a bit and go to a state)

    • input is read once

    • no extra memory

    • accept the string if end at an accept state

    • define regular languages

      • can be expressed with a regular expression

  • There exist non-regular languages

    • context-free languages

      • require a push-down automaton, an FSM plus an unbounded stack

    • decidable languages

      • requires a Turing Machine

  • And the Turing Machine is:

    • an FSM with an input tape and a work tape

    • can read/write the work tape and go backwards and forwards on both

    • $Q$ is a non-empty set of states

      • has accept and reject states, the machine halts on either accept or reject

      • and an initial state

    • $\Sigma$ is the input alphabet

    • $\Gamma$ is the work tape alphabet, can equal $\Sigma$

    • Has a transition function $\delta : Q \times \Sigma \times \Gamma \to Q \times \Gamma \times \{L,R,\mathbf{-}\}^2$

  • Church-Turing Thesis

    • Any algorithm is equivalent to a Turing Machine or the $\lambda$ Calculus

  • Can a Turing Machine Decide any language?

    • No! Turing machines are finite objects

    • you can encode not just problems, but machines, $T \to \langle T \rangle \in \Sigma^*$

      • This set is countably infinite

    • The set of all languages ($L \subset \Sigma^*$), is uncountable

    • There are, therefore, languages for which there exists no Turing Machine

Lecture 20: Computability, Continued

Church-Turing Thesis

  • "Algorithm Solves Problems" $\equiv$ "Turing machines decide languages"

  • Turing Machines: finite state control plus a work tape

  • Language: subset of $\Sigma^{*}$

  • Decide: A TM decides language $L$ iff for all inputs it:

    • halts

    • ends in $q_{\mathrm{acc}} \leftrightarrow x \in L$

The Halting Problem

  • Does there exist an algorithm that:

    • Given a TM $M$ and input $x$, outputs true if $M(x)$ halts, false if $M(x)$ inf-loops

    • This is the Halting Problem

  • Theorem: No such algorithm to solve the halting problem exists

  • Proof (By way of contradiction):

    • suppose $\exists$ a TM $H(\langle M,x \rangle)$ that decides (solves) the halting problem $$H(\langle M,x \rangle) = \left\{\begin{matrix}1&M(x) \mathrm{halts}\\0&M(x) \mathrm{no halt}\end{matrix}\right.$$

    • Consider Running $M$ on itself, $M(M)$

    • Consider the Encoding $\langle M,M \rangle$, and construct $$Q(\langle M \rangle) = \left\{\begin{matrix}\mathrm{halt} & H(\langle M,M \rangle) = 0\\\mathrm{loop\ forever}&H(\langle M,M \rangle) = 1\end{matrix}\right.$$

    • Run $Q$ on itself: $$Q(\langle Q \rangle) = \left\{\begin{matrix}\mathrm{halt} & H(\langle Q,Q \rangle) = 0\\\mathrm{loop\ forever}&H(\langle Q,Q \rangle) = 1\end{matrix}\right.$$

    • This halts iff $H(\langle Q,Q \rangle) = 0$ iff $Q(\langle Q \rangle)$ does not halt.

    • Contradiction, $H$ does not exist

  • Restricting to solvable problems

    • $M(X)$ on an input may use some resources:

      • time = each transition is 1 unit of time, $T(|x|) = T(n)$

      • memory = number of cells written to on the work tape, $S(|x|) = S(n)$

      • $x$ is input, input size is $|x|$, or $n$

    • Suppose a language $L$ is decided by a TM $M$ such that on an input of length $n$, $M$ uses at most $p(n)$ transitions, where $p$ is some polynomial in $n$

      • then $L\in P$

      • $L$ is a language

      • $P$ is a complexity class

    • $P$ is deterministic polynomial time, the set of all languages $L$ such that there exists a polynomial time TM that decides $L$

      • matrix multiplication $\mathcal{O}(n^3)$

      • connectivity $\mathcal{O}(n^3)$ (Floyd-Warshall)

      • Searching

      • Not in:

        • Hamiltonian Path / Cycle

        • 0-1 Knapsack

        • Satisfiability

Non-determinism

  • theoretical model of computation

  • 2 phases, guessing and verification

    • Ex: Ham Path

      • guess a permutation of vertices, $v_1, v_2, \ldots, v_n$

      • check if path is valid, if it is, accept, otherwise reject

  • A nondeterministic TM accepts if there is any guess that ends up accepting

  • like looking at a tree and being $\mathcal{O}(n)$ without having to look at the individual leaves

  • $\exists x_1, x_2, \ldots, x_n\, P(x_1, \ldots, x_n)$ or $\forall x_1, \ldots, x_n\, P(x_1, \ldots, x_n)$

  • NP, or non-deterministic polynomial time

    • the set of all languages $L$ for which there exists a non-deterministic polynomial time Turing Machine that decides $L$

  • $P \subseteq NP$

  • Is $P \stackrel{?}{=} NP$

  • And $NP \subseteq EXP$ (deterministic exponential time), and $P \neq EXP$

  • Goal: Characterize the hard problems that we've seen

    • NPC or NP-Complete is the set of all problems (languages $L$) such that the language $L \in NP$ and $L$ is at least as hard as all other languages in $NP$

  • Show 0-1 Knapsack is in $NP$

    • Map to a decision problem (Does there exist a possible knapsack?)

      • $(A, \overrightarrow{v}, \overrightarrow{w}, W, k)$, $0 \leq k \leq \sum_{a\in A} v(a)$

        • output yes if the optimal solution is at least $k$, no otherwise

    • Show that the decision version is in $NP$

      • Algorithm

        • ND guess a subset $S \subseteq A$ ($\mathcal{O}(n)$)

        • Verify

          • for each $s \in S$ ($\mathcal{O}(n)$)

            • $\mathcal{V} \gets \mathcal{V} + v(s)$

            • $\mathcal{W} \gets \mathcal{W} + w(s)$

          • if $\mathcal{W} > W$

            • reject

          • else if $\mathcal{V} \geq k$

            • accept

          • else, reject

        • Is in $NP$

Reductions

  • a decision problem $P_1$ is polynomial-time reducible to a decision problem $P_2$ if:

    • there exists a function $f$ such that $f$ maps all yes instances of $P_1$ to yes instances of $P_2$ and no to no

    • $f$ is computable by a polynomial time algorithm

  • Notation: $P_1 \leq_{P} P_2$

  • $P_2$ is at least as hard as $P_1$

  • $P_1$ is no harder than $P_2$

  • Example:

    • Linear System Solving, reduced to Gaussian Elimination, $LSS \leq_{P} GE$

    • Network Problems: Find fastest route, reduced to shortest graph distance

    • Compute the median, reduce to sort

  • Want to solve problem $X$

    • we know how to solve problem $Y$ (with algorithm $A$)

    • we know that $X \leq_{P} Y$

    • we have $x$ as an instance of $X$

      • $x \to f_{\leq_{P}}(x) \to y \to A(y)$

      • This is $A^{\prime}$

  • Goal: show that a problem $\mathcal{P}$ is NP-complete

    • it suffices to take a known NP-complete problem, $X$ and reduce to $\mathcal{P}$, $X \leq_{P} \mathcal{P}$

  • Need at least 1 NP-complete problem to start any reduction

    • In 1971, Stephen Cook showed the first NP complete problem: satisfiability is NP-complete

Lecture 21: Computablity, Continued

Reductions: Continued

  • Algorithmic analysis only studies algorithms and solutions

  • complexity studies the problems

  • We use reductions.

  • if we show $P_1 \leq_{P} P_2$

    • $P_2$ is at least as hard as $P_1$

    • $P_1$ is no harder than $P_2$

  • if we show also that $P_2 \leq_{P} P_1$

    • $P_1$ is equivalent to $P_2$

  • $L$ is NP-complete if

    • $L \in NP$

    • All other languages in NP reduce to $L$: $\forall L_2 \in NP \left[L_2 \leq_{P} L\right]$

    • All NP complete problems reduce to each other and have the same complexity

  • A Trivial problem: $L_{NP} = \left\{\langle M, x \rangle \biggr\vert \begin{matrix}\textrm{M is an NP Turing Machine}\\\textrm{that accepts X in polynomial time.}\end{matrix}\right\}$

    • This is shown as $L_{NP} \leq_{P} SAT$

  • 3SAT is a restricted variation of SAT

    • Given a boolean expression on $n$ variables, $x_1, \ldots, x_n$ in 3CNF

      3CNF: Conjunctive Normal Form, a conjunction of disjunctions (called clauses), each with 3 variables or their negations

    • Ex: (and (or x1 x2 (not x4)) (or (not x1) x2 x3) (or x1 (not x3) (not x4)))

  • Clique Problem

    • Given an undirected graph $G = (V, E)$ and an integer $k$

    • Output true if $G$ contains a clique of size $k$ or greater

    • A Clique is a complete subgraph

    • No efficient solution, we will show that it is NP-complete

    • Show that $\mathrm{3SAT} \leq_{P} \mathrm{Clique}$

Clique and NP Completeness

  1. Clique is in NP

    • Given: $\langle G, k \rangle$

    • Guess: a subset of vertices $C \subseteq V$ of size $k$ (non-deterministic, $\mathcal{O}(k) \leq \mathcal{O}(n)$)

    • Verify: Is the guessed subset $C$ a clique?

      • For each pair $(u, v) \in C$

        • if $(u, v) \not\in E$

          • halt and reject

      • halt and accept

  2. Choose a known NP-complete problem to reduce from to Clique

    • 3SAT

  3. Define a mapping from instances of $P_1$ to $P_2$ that preserve solutions

    • $\phi \to G, k$

    • Mapping as follows:

    • Let $\phi = C_1 \land C_2 \land \cdots \land C_k$ be a 3CNF formula of $k$ clauses

    • For each $C_i= x_1^i \lor x_2^i \lor x_3^i$

      • create 3 vertices $v_1^i, v_2^i, v_3^i$

    • Edges connect vertices $e = (v_a^i, v_b^j)$ iff the following both hold

      • the variables are from different clauses, $i \neq j$

      • and the variables are consistent, $v_a^i \neq \lnot v_b^j$

    • Example: (and (or x1 (not x2) x3) (or x1 (not x2) (not x3)) (or x1 x2 x3))

  4. Prove that solutions are preserved

    • That $\phi$ is satisfiable iff $G$ has a clique of size $k$ or greater

    • Outline

      • Let $\phi$ be satisfiable

        • Each clause has a variable (or a negation) that evaluates to true.

        • Each true variable corresponds to some vertex in $G$, $\therefore k$ vertices

        • Observation:

          • they must be in different clauses, thus must be in different vertex groups

          • none of them are negations of each other

        • therefore each of the $k$ vertices are connected.

        • Then the $k$ vertices form a clique

      • Let $G$ be a graph with a clique of size $k$

        • show that $\phi$ is satisfiable

  5. Show that the reduction can be computed in deterministic polynomial time.

    • 3SAT $\leq_{P}$ Clique

    • $3k$ to generate vertices, at most $\mathcal{O}(n)$

    • $\binom{3k}{2}$ to generate edges, $\mathcal{O}(n^2)$

  6. Thus, Clique is NP-Complete

Reduction Tree

  • $L_{NP} \to SAT \to 3SAT \to Clique$ are all NP-complete

  • By composing mappings, it can be shown that polynomial time reductions in $NP$ are transitive, reflexive and symmetric.

  • Thus an equivalence class for NP-complete problems

Vertex Cover Reduction

  1. Definitions

    • Given an unweighted graph $G = (V, E)$ and an integer $k$, output yes if $G$ has a vertex cover of size at most $k$

    • A vertex cover is $V^{\prime} \subseteq V$ such that every edge has at least one of its points in $V^{\prime}$

  2. Show that the problem is in NP

    • Guess a subset of size at most $k$

    • for each $e = (u, v) \in E$

      • if ($u \not\in V^{\prime} \land v \not\in V^{\prime}$)

        • halt and reject

    • halt, accept

  3. Choose NP complete problem to reduce from

    • Clique $\leq_{P}$ Vertex Cover

  4. Define mapping

    • $\langle G, k \rangle$ is a Clique instance

    • $\langle G, k \rangle \to \langle \overline{G}, n - k \rangle$

  5. Prove that solutions are preserved

    • Fact: $G$ has a clique of size $k$ iff $\overline{G}$ has a vertex cover of size $n - k$

    • Let $G$ have a clique $C \subseteq V$ of size $k$

      • Let $e = (u, v) \in \overline{E}$

      • $\forall e \in \overline{E} \to e \not\in E \to u \not\in C \lor v \not\in C \to u \in V \setminus C \lor v \in V \setminus C$, thus $(u, v) \in \overline{E}$ is covered by $V \setminus C$, and therefore $V \setminus C$ is a vertex cover of size $n - k$

    • Let $C \subseteq V$ be a vertex cover in $\overline{G}$ of size $n - k$

      • $\forall u, v \in V \left[ (u, v) \in \overline{E} \to u \in C \lor v \in C \right] \equiv \forall u, v \in V \left[ (u \not\in C \land v \not\in C) \to (u, v) \in E \right]$, therefore every pair of vertices in $V \setminus C$ is connected, thus $V \setminus C$ is a clique of size $k$.

  6. Show the reduction can be computed in deterministic polynomial time

    • $\mathcal{O}(n^2)$ to flip adjacency matrix

    • $\mathcal{O}(1)$ to calculate $n - k$

    • Thus, $\mathcal{O}(n^2)$

Overview of Complexity classes

  • $P \subseteq NP$; $NPC \subseteq NP$; co-$NP$ overlaps $NP$ and contains all of $P$; co-$NPC$ is part of co-$NP$; $R$ (Recursive: all languages that have a halting Turing Machine) contains all of these; $RE$ contains $R$; and co-$RE$ overlaps $RE$

Lecture 22: Final Review

Graph Algorithms

  • DFS

  • BFS

  • Dijkstra's

  • MST

Cut Edge

  • Let $G = (V, E)$ be a connected, undirected graph

  • A cut edge $e \in E$ is an edge such that if removed from $G$ would disconnect it

  • Design an algorithm given $G$ that determines if it contains a cut edge

    • Test each edge

    • use DFS to check if disconnected

      • if yes, stop and output true

    • output false

Hashing

  • repeat quiz 4 for $h(k) = (2k + 7) \mod 13$ for 4, 42, 26, 5, 8, 18, 14 with linear probing

  • Use chaining to resolve collisions

OBST

  • Suppose $T$ is an OBST with $n$ keys. What is max depth? – $n - 1$

  • Suppose we have $n$ keys each with uniform probability $\frac{1}{n}$, what would the depth of the OBST be? – $\lceil\log_2 n\rceil$

  • Given a root tableau and keys, build the OBST

P and NP

  • Let $P_1, P_2, P_3$ be problems

    • suppose $P_1 \leq_{P} P_2$

      • $P_2$ is NP complete

      • can you conclude $P_1$ is NP complete

      • No. Anything in NP reduces to an NP complete problem

    • $P_1 \leq_{P} P_2$

      • $P_2 \in NP$

      • Is $P_2$ NP Complete

        • No; that would additionally require $P_1$ to be NP-complete, which we don't know.

    • $P_1 \in P$ and $P_2 \in NPC$

      • Can we conclude that $P_1 \leq_{P} P_2$?

        • Yes. Anything in NP reduces to NP complete

    • $P_1 \leq_{P} P_2$ and $P_2 \in NPC$

      • Can we say that $P_1 \in NP$?

        • Not necessarily. Not enough information is given.

        • Although given this class, yes, as anything that reduces from NPC is in NP (for this class)