 # SEARCH IN STATE SPACE REPRESENTATIONS[References]

A STATE SPACE is a set of descriptions, or STATES.

A SEARCH PROBLEM consists of:

• A set of INITIAL STATES S. Usually S consists of a single state called the initial state.
• A set of ACTIONS A, where each action a in A, given a state, returns nil or a new state.
• A state space N is a finite or infinite set of STATES. It can be generated starting with the initial states and applying the operators repeatedly.
• A set of GOAL STATES or SOLUTIONS G, a subset of N.

Usually in a search problem the goal states, or the paths from initial to goal states, are given a score so that we can say that a solution is or not better than another. An OPTIMIZATION Search Problem is one where we look for a best possible solution. A SATISFICING Search Problem is one where we look for a solution that has a good score and can be found with only a limited effort (H.Simon). We talk of NEAR OPTIMIZATION when we aim for a solution whose score is within a specified delta of the score of an optimal solution. We talk of APPROXIMATE OPTIMIZATION when there is a high probability that we will find a near optimal solution.

A search problem can be represented by a STATE-SPACE GRAPH with as nodes the states, with as arcs the actions, and specially marking the initial and goal states. The state-space graph usually is generated starting from the initial state adding gradually nodes and arcs.

A SEARCH STRATEGY is a prescription on how to make choices during a search. A Search Algorithm is the implementation of a search strategy.

As a Search Strategy progresses in examining a State-Space graph we say that a node is

• EXPANDED if we have generated all its successors
• EXPLORED if we have generated some of its successors
• GENERATED if we have reached that node.

A Search Strategy/Algorithm is:

• COMPLETE if it terminates with a solution when one exists
• ADMISSIBLE if it terminates with an optimal solution when one exists
• INFORMED(/UNINFORMED) if the next move considered may depend on information about nodes not yet generated.
• IRREVOCABLE if at each node we never follow up more than one alternative at each node. Otherwise it is TENTATIVE. Examples of irrevocable strategies are Hill-Climbing and Steepest-Ascent. Clearly irrevocable algorithms do not us Backtracking.
• DOMINATES a search strategy/algorithm A2 when the set of nodes expanded by the former is a subset of the set of nodes expanded by the latter.
• OPTIMAL OVER A CLASS of strategies/ algorithms if it dominates each element in the class.

We distinguish search strategies on the basis of their knowledge of nodes they have not yet explored. If they have no knowledge of such nodes we say that the search is Uninformed or Blind. If they have such knowledge we say they are Informed or Heuristic. Examples of Blind Search Algorithsm are:

• Uniform-Cost
• Depth-First
• Depth-Limited
• Iterative-Deepening
• Bidirectional (it requires operators that are inverses of the normal operators)

An HEURISTIC EVALUATION FUNCTION is a function f that, applied to a node n, returns a non-negative finite number f(n). It is assumed that the smaller the number, the more likely it is that the node is on the optimal path from the start to a goal node. The value f(n) may be different at different times during a search, but the value can only decrement, and decrement by some non-infinitesimal amount.

Most search algorithms examine states gradually, in terms of their depth or their heuristic cost. Such search algorithms are called Chronological Search algorithms. Other search algorithms use information about the internal structure of each state to determine what state to consider next. Such algorithms are called Non-Chronological. They are used for instance in scheduling and use information about failed schedules to come up with new candidate schedules. Of course, non-chronological algorithms are more complex than their chronological counterparts. A related concept to non-chronological search algorithms is the concept of MEANS-END ANALYSIS, whereby by examine operators on the basis of their ability to bring us closer to the goal. Means-end analysis usually takes place when we have knowledge of the internal structure of states and of the effect of operators on such structure.

Most search algorithms use the GENERATE-AND-TEST paradigm, that is, we generate new states until either a goal state is found or we run out of states.

Most state space search algorithms use two lists, OPEN and CLOSED. Initially OPEN is a set consisting of the initial state, and CLOSED is {}. OPEN consists of generated nodes that we intend to consider during the search. We say a node is OPENED after it is taken out of OPEN.

After a node is opened, it is possibly expanded and placed into CLOSED. The search algorithm terminates when a goal state is extracted from OPEN (success), or when we try to extract a node from OPEN and OPEN is empty (failure), or, in some cases, when we generate a goal state (success).

## BEST-FIRST SEARCH [BF] (Pearl, J.)

[For state space search problems with heuristic evaluation function]

1. Put the start node s on OPEN; set CLOSED to {}.
2. If OPEN is empty exit with failure.
3. Remove the node n from OPEN for which the Heuristic Evaluation function f has minimum value (break ties arbitrarily) and put it on CLOSED.
4. Expand n generating all its successors.
5. If any of n's successors is a goal node, exit successfully with the solution obtained by tracing the path along the pointers from the goal back to s.
6. For every successor n' of n:

1. Calculate f(n')
2. If n' was neither on OPEN or CLOSED, insert it in OPEN. Attach a pointer from n' back to n. Assign f(n') to n'.
3. If n' already occurs in OPEN or CLOSED, compare its new f(n') with the value previously assigned to n'. If the old value is lower or equal, discard the new node. If the old value is greater, then substitute this value for the old; set n' to point to n; If n' was in CLOSED, move it back to OPEN.
7. Go to Step 2

BF is in a hurry to find a goal state. In this hurry it may miss an optimal goal state. For example, if n in step 5. has two successors, both goal states, one better than the other, there is no guaranty that BF will return with as solution the best successor.

The BF* algorithm is obtained from BF modified to delay termination as follows:

• Remove step 5.
• Modify step 3 to read:
Remove the node n from OPEN for which the Heuristic Evaluation function f has minimum value (break ties arbitrarily).
• If n is a goal state, exit successfully with the solution obtained by tracing the path along the pointers from n back to s.
• Otherwise put n into CLOSED.
The notion of "Goal" assumed by BF* is tied exclusively with the state being considered, independent of the path followed. For example, if we defined a goal state as one that is reached with a path where two arcs are of the same length, then BF* could miss goal states because it keeps only one backward pointer at each node. Thus it may "forget" the alternative path that would identify the goal state.

In what follows we consider BF* and some of its variations.

FACT: If the State Space Graph is finite then BF* will terminate.

Proof: Since the graph is finite, the only possible danger is in step 6.3. when n' is in CLOSED. The danger is that the node may move from CLOSED to OPEN and back again an infinite number of times. But this is not possible since each time the node moves from CLOSED to OPEN its value is decremented by some non infinitesimal amount thus its value would be become eventually negative.

FACT: If the State Space Graph has a solution (a finite path from s to a goal node t), then all nodes opened by BF* will have a value smaller or equal than some fixed value F.

Proof: Since there is a finite path from s to t, F = max{f(n), where n is a node on that path} is well defined and finite. All the nodes n' considered by BF* will all have f(n') <= F.

FACT: Even if the State Space Graph has a solution (a finite path from s to a goal node t), BF* may not terminate.

Proof: We have already seen that all nodes n' expanded by BF* will all have f(n') <= F for some finite number F. Unfortunately the set of such nodes can be infinite when the graph is infinite (by example).

The following definitions apply whenever we define the heuristic evaluation function f in terms of the cost of paths (Path Based Evaluation Function):

c(ni, nj):
assuming that there is an action a such that (ni, nj) is in a, c(ni, nj) is the cost of applying the action a to ni. This cost is greater than 0 and not infinitesimal.
K(ni, nj):
The minimal cost for going from node ni to node nj. It is defined as the minimal cost among all the paths from ni to nj. The cost of a path is defined as the sum of the costs of the arcs in the path.
h*(n):
min {K(n, x) where x is a goal state}. For each goal state x, h*(x) = 0.
g*(n):
K(s, n). Clearly g*(s) = 0.
f*(n):
g*(n) + h*(n)
g(n):
an estimate of g*(n). We assume that g(n) is the smallest cost among all the paths from s to n that have been considered up to now. Thus g(s)=0. Clearly g(n) >= g*(n). For a given node n, g(n) may have different values at different times during the execution of BF*. These values are decreasing.
h(n):
an estimate of h*(n) [heuristic function]. We select h so that h(x) is 0 for each goal state x. For a given node n, h(n) will always have the same value.
f(n):
an estimate of f*(n) = g(n) + h(n)
A Greedy Algorithm is one where we take g(n) to be always 0 and we are driven exclusively by h(n).

The A Search Algorithm is the BF* algorithm when using a path-based evaluation function.

FACT: If the State Space Graph has a solution (a finite path from s to a goal node t), then A will terminate (i.e. A is Complete), but the solution may not be optimal (i.e. A is not Admissible).

Proof: As in the case of the BF* algorithm, we recognise that there is a number F such that all nodes removed from OPEN by the algorithm will have value at most F. Thus all such nodes will be at a finite distance from s (since the cost of infinite paths would be greater than F). The problem is that F may be larger than the cost of the optimal path, thus from OPEN may be extracted a non optimal goal state.

An heuristic function h is ADMISSIBLE iff for all n, h(n) <= h*(n).

The A* Search algorithm is the A algorithm when using an admissible heuristic function.

FACT: In A*, for any node n, n is extracted from OPEN iff f(n) <= f*(s).

Proof: => (If part)

Consider any node m in the optimal path in the state search graph: f(m) = g(m)+h(m) = g*(m)+h(m) <= g*(m)+h*(m) = f*(m) = f*(s). Since some element of the optimal path will always be in OPEN, the result follows.

<= (Only if part)

Consider a node m such f(m)<=f*(s). Since f is computed for a node only if m has been or is going to be in OPEN, m is at some point in OPEN and will be extracted since it has a better value than f*(s).

Proof: Since by the previous result for each opened m we have f(m) <= f*(s), it follows that if we open a goal state then it is an optimal goal state.

FACT: In A* a node may be moved from CLOSED back to OPEN.

Proof: By example. Consider the graph with the nodes A, B, C, D, where A is the start node and D is the goal node. It has the arcs (A, B), (A, C), (B, C), and (C, D) of length respectively 2, 5, 2, and 5. Assuming that h(B)=7 and h(C)=3 (the value of h for A and D is immaterial), then the node C will be opened twice.

The following table shows the content of the OPEN and CLOSED list during the algorithm. Next to each node we keep its score.

```    ==============================================================
OPEN   | {A}  |   {C(8),B(9)}  |  {B(9),D(10)} |  {C(7),D(10)}
-------------------------------------------------------------
CLOSED | {}   |   {A}          |  {A,C}        |  {A,B}
==============================================================
```

An heuristic function h1 is MORE INFORMED than heuristic function h2 iff for all n, h1(n) >= h2(n).

FACT: If A1 and A2 are two versions of the A* algorithm using respectively heuristic functions h1 and h2, where h1 is more informed than h2, then A1 dominates A2 (i.e. every node opened by A1 is also opened by A2).

Proof: Consider a node n opened by A1. We have f*(s) >= f1(n) = g(n)+h1(n) >= g(n)+h2(n) = f2(n). We have already seen that A* opens all nodes where f(n) < f*(s). Thus n will be opened by A2.

An heuristic function h is MONOTONE (or satisfies the TRIANGULAR INEQUALITY) if for any arc (n, n') we have h(n) <= h(n')+c(n, n')

An heuristic function h is CONSISTENT if for any two node n and n', we have: h(n) <= k(n, n')+h(n')

FACT: h is monotone iff h is consistent

Proof: Clearly consistency implies monotonicity, thus we only need to worry about monotonicity implying consistency. We need to consider only pairs n and n' where there is a path from n to n' (for other pairs the consistency inequality is trivially true). Let n1 n2 n3 .. nm , with n=n1, n'=nm, be the optimal path from n to n'. We have,

h(n) = h(n1) <= c(n1,n2)+h(n2) <= c(n1,n2)+c(n2,n3)+h(n3) <= <= c(n1,n2)+c(n2,n3)+..+c(nm-1,nm)+h(nm) = K(n,n')+h(nm).

FACT: Any consistent heuristic h is admissible.

Proof: We need to show that for all n, h(n) <= h*(n). Consider the goal optimal state t reacheable from n. Then, because of consistency, h(n) <= K(n,t)+h(t) = K(n,t) = h*(n).

FACT: Not every admissible heuristic h is consistent.

Proof: By example, using the same example introduced earlier to show that a node may move back from CLOSED to OPEN.

A*-M is A* when using a monotone heuristic.

FACT: For any node n that is opened by A*-M we have g(n) = g*(n).

Proof: We proceed by contradiction. Assume that for some opened node n, g(n) > g*(n), and n is the first opened node where this condition holds true. Thus there is an optimal path some of whose nodes have not been opened (otherwise g(n) would have been computed along that path). Say n' is one such node. Then f(n') = g(n')+h(n') = g*(n')+h(n') <= g*(n')+K(n',n)+h(n) Since we are considering an optimal path to n, g*(n')+K(n',n) is g*(n). Thus f(n') <= g*(n')+K(n',n)+h(n) = g*(n)+h(n) < g(n)+h(n) = f(n). But then n' is not open and with lower f than n, thus n' would be chosen before n. Contradiction.

FACT: In the sequence of nodes n1 n2 .. opened by the A*-M algorithm, we have f(n1) <= f(n2) <= ...

Proof: Suppose for all the nodes expanded up to i-1 we have f(s) = f(n1) <= f(n2) <= f(n3) <= .. <= f(n(i-1)) The next node opened, ni, is the successor of one of those nodes, say nj. Thus f(ni) = g(ni)+h(ni) = g(nj)+c(nj,ni)+h(ni) >= g(nj)+h(nj) = f(nj).

FACT: A*-M will never move a node from CLOSED to OPEN.

Proof: After the first time, any successive visit to a node n will have the same h value and an increasing g value. Thus the value of f at n will not decrease, thus n will not move from CLOSED back to OPEN.

```
BF
^
| Delayed Termination
|
BF*
^
| f*(n)=g*(n)+h*(n) and g*(n) is the cost of best path from s to n
|
| Path-Based Evaluation Function
A
^
| h(n) <= h*(n)        		Admissible Heuristic
|
A*
^
| h(n') <= h(n)+c(n,n')		Monotone Heuristic
|
A*-M

```

### References

```    Nilsson,N.J.: Principles of Artificial Intelligence
Tioga Publishing Co. 1980	Pages 72-84
Pearl,J.: Heuristics: Intelligent search strategies for computer
problem solving