9615-06

CIS 9615. Analysis of Algorithms

Dynamic Programming and Memoization

1. The idea

Issue: overlapping subproblems in divide-and-conquer.

Direct recursive algorithm:

    Fibonacci-R(n)
    1  if n < 2
    2      then return n
    3  return Fibonacci-R(n-1) + Fibonacci-R(n-2)

Complexity: Θ(φⁿ), φ ∼ 1.618 ("golden ratio").

Repeated work: to calculate F(4), the algorithm calculates F(3) and F(2) separately though in calculating F(3), F(2) is already calculated.

Dynamic programming: to turn a "top-down" recursion into a "bottom-up" construction.

    Fibonacci-DP(n)
    1  A[0] <- 0        // assume 0 is a valid index
    2  A[1] <- 1
    3  for j <- 2 to n
    4      A[j] <- A[j-1] + A[j-2]
    5  return A[n]

Complexity: Θ(n).

Memoization: a variation of dynamic programming that keeps the efficiency, while maintaining a top-down strategy.

    Fibonacci-M(n)
    1  A[0] <- 0        // assume 0 is a valid index
    2  A[1] <- 1
    3  for j <- 2 to n
    4      A[j] <- -1
    5  Fibonacci-M2(A, n)

    Fibonacci-M2(A, n)
    1  if A[n] < 0
    2      then A[n] <- Fibonacci-M2(A, n-1) + Fibonacci-M2(A, n-2)
    3  return A[n]

Complexity: also Θ(n).

Dynamic programming used in optimization: the optimal solution corresponds to a special way to divide the problem into sub-problems. When different ways of division are compared, it is quite common that some sub-problems are involved in multiple times. The complexity of such a solution comes from the need of keeping intermediate results, as well as remembering the structure of the optimal solution, i.e., how the problem is divided into sub-problems, step by step.

2. Example: matrix-chain multiplication

Matrix multiplication: The product of a p-by-q matrix and a q-by-r matrix is a p-by-r matrix, which contains p*r elements, each calculated from q scalar multiplications, so the overall cost is proportional to p*q*r.

To calculate the product of a matrix-chain A₁A₂...A_n, n-1 matrix multiplications are needed, though different orders have different costs.

For matrix-chain A₁A₂A₃, if the three have sizes 10-by-2, 2-by-20, and 20-by-5, respectively, then the cost of (A₁A₂)A₃ is 10*2*20 + 10*20*5 = 1400, while the cost of A₁(A₂A₃) is 2*20*5 + 10*2*5 = 300.

For matrix-chain A_i...A_j where each A_k has dimensions P_k-1-by-P_k, the minimum cost of the product m[i,j] corresponds to the best way to cut it into A_i...A_k and A_k+1...A_j:

    m[i, j] = 0                                         if i = j
              min{ m[i,k] + m[k+1,j] + P_i-1P_kP_j }        if i < j

Use dynamic programming to solve this problem: calculating m for sub-chains with increasing length, and using another matrix s to keep the cutting point k for each m[i,j].

Example: Given the following matrix dimensions:
A₁ is 30-by-35
A₂ is 35-by-15
A₃ is 15-by-5
A₄ is 5-by-10
A₅ is 10-by-20
A₆ is 20-by-25

then the output of the program is

which means that A₁A₂A₃A₄A₅A₆ should be calculated as (A₁(A₂A₃))((A₄A₅)A₆), and the number of scalar multiplication is 15,125.

Time cost: Θ(n³), space cost: Θ(n²).

Please note that this algorithm determines the best order to carry out the matrix multiplications, without doing the multiplications themselves.

3. More examples

(1) Longest Common Subsequence (LCS)

For sequence X and Y, their LCS can be recursively constructed:

c[i,j] = 0                         if i == 0 or j == 0
         c[i-1,j-1]+1              if i,j>0 and Xi == Yj
         max(c[i,j-1],c[i-1,j])    if i,j>0 and Xi != Yj

Time cost: Θ(mn). Space cost: Θ(mn). Demo applet.

(2) Optimal Binary Search Tree

If the keys in a BST have different probabilities to be searched, then a balanced tree may not give the lowest average cost (though it gives the lowest worst cost). Using dynamic programming, BST with optimal average search time can be built, in a way similar to matrix-chain multiplication.

Time cost: Θ(n³). Space cost: Θ(n²). Demo applet.