9615-03

CIS 9615. Analysis of Algorithms

Probabilistic Analysis and Randomized Algorithms

1. Average cost

For a given algorithm and an instance size, if instance i has cost c_i, i = 1..m, then

best-case cost: Minimum(c_i, i = 1..m)
worst-case cost: Maximum(c_i, i = 1..m)
average-case cost: ∑_{[i = 1..m]} p_ic_i, where p_i is the probability for instance i to appear to the algorithm.

Average cost matters if the algorithm will be used for many times, the overall efficiency is more important, and exceptions are acceptable. How about real-time systems?

The actual probability distribution is usually domain-specific, and not a property of the algorithm.

When all instances have the same probability (i.e., 1/m), the average cost is ∑_{[i = 1..m]} (1/m)c_i = (∑_{[i = 1..m]} c_i) / m.

For linear search, if the target value has the same probability to appear in any positions in an array, then what is the average number of comparisons? Consider the following three cases:

The target is in the array.
The probability for the target to be not in the array is the same as for it to be in any position of the array.
The probability for the target to be not in the array is the same as for it to be in the array.

Relation between average cost and the best/worst cost.

Cost as exact function, approximate function, and order of growth: when to use which?

How about binary search?

2. Example: the hiring problem

To hire a new office assistant if the new candidate is the best so far (page 92):
03-04 (19K)

The quantity to be analyzed is the cost of the procedure, reflected by line 3 and 6, not the running time of the algorithm. However, the analysis is similar: we want to know the number of times for line 3 and 6 to be executed, respectively.

Assuming the cost of interviewing and hiring are c_i and c_h, respectively, the cost of the above algorithm is n c_i + m c_h, where m is the number of hiring. Since the interview cost n c_i remains unchanged, our analysis will focus on the hiring cost m c_h, therefore, m.

What are the best case and the worst case?

Candidate i is hired exactly when he/she is the best in 1 through i. If the candidates come in random order, then the probability for that event is 1/i. Therefore the expected number of hires is the sum of a harmonic series ∑_{[i = 1..n]} 1/i = ln n + O(1) (A.7, page 1060, 1066). As a result, the average hiring cost of the algorithm is O(lg n).

What if hiring can only happen once? The hiring cost will be minimized, though it can no longer guarantee to hire the candidate with the highest quality.

One solution: first select a positive integer k < n, interview but reject the first k candidates, then hire the first candidate thereafter who has a higher score than all the preceding candidates. If no such one can be found, hire the last one.
03-05 (15K)

What is the best choice of k that gives the highest probability for the best candidate to be hired?

Call that probability Pr{S} and divide it into the events where the hiring happens when the best candidate is at position i: Pr{S} = ∑_{[i = 1..n]} Pr{S_i}. Since the first k candidates are all rejected, it is actually ∑_{[i = k+1..n]} Pr{S_i}.

The event S_i happens if and only if (1) the best candidate is at position i, and (2) nobody before it is better than the best of the first k candidates. The first value is 1/n, and the second one is k/(i-1). In summary,
03-06 (9K)

Pr{S} is maximized when k = n/e, with the value 1/e (page 116-117).

3. Randomized algorithm

To guarantee a random order among inputs, and therefore a sure average cost, an algorithm can be randomized, so that there is no fixed best-case or worst-case instance (though the best-case or worst-case cost remains!).

Many randomized algorithm randomize the input by permuting the given input array A. One common method for doing so is to assign each element in the array A[i] a random priority P[i], and then sort the elements of A according to their priorities:
03-02 (11K)
The range [1, n³] makes it likely that all the random numbers produced are unique.

Another method for generating a random permutation is to permute the given array in place. In iteration i, the element A[i] is chosen randomly from subarray A[i..n], then remain unchanged.
03-03 (9K)

It can be proven that both algorithms generate random permutation as desired. Which is more efficient?

4. Example: quicksort

Problem: the average cost of quicksort if pivot values are randomly selected.

The cost comes from the number of comparisons, which is the sum of probability of every pairwise comparisons. Let z_i and z_j be the ith and jth smallest values, and Z_ij be the set containing them and the values in between, then
03-01 (72K)