Skim Pages 29-61: Problems, Problem Spaces, and Search
- There are some who believe the simple statement, "AI is
search"
- State space is sometimes explicit, like a matrix of board
positions, and sometimes implicit, like a set of rules or
productions
- The analytical approach to problem solving is generally the
same in every domain. In AI the usual procedure is
- define a state space
- identify initial states
- identify goal states
- specify operators that change states
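- a minimal Python sketch of this four-step procedure, using the
classic water-jug problem (4- and 3-gallon jugs, get exactly 2
gallons into the big jug); the function names are mine:

    INITIAL = (0, 0)                 # state = (big jug, small jug)

    def is_goal(state):
        return state[0] == 2

    def successors(state):
        """Operators: fill a jug, empty a jug, or pour one into the other."""
        big, small = state
        moves = [(4, small), (big, 3), (0, small), (big, 0)]
        pour = min(big, 3 - small)   # pour big into small
        moves.append((big - pour, small + pour))
        pour = min(small, 4 - big)   # pour small into big
        moves.append((big + pour, small - pour))
        return [m for m in moves if m != state]

    print(is_goal(INITIAL))          # False
    print(successors(INITIAL))       # [(4, 0), (0, 3)]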
- Note, this is roughly the same procedure as any requirements
analysis for any software project.
- Note, define a state space means sub-problem decomposition,
also something done in every software project. See How to Solve
It by G. Polya
- It's stated like this in AI because so often the problem
area is akin to a board game
- Control strategies break down into three types of searches
(a DFS/BFS sketch follows this list)
- depth-first search: top to bottom, left to right, the most
easily implemented (recursive) algorithm
- breadth-first search: left to right, top to bottom, visiting
all the children before visiting a grandchild
- heuristic search, sometimes called "best first": where there's
an evaluation function you can use to choose the next path
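- the promised sketch: one search skeleton in which DFS treats
the frontier as a stack and BFS treats it as a queue (the toy
graph is mine; it also runs against the water-jug functions
above):

    from collections import deque

    def search(start, is_goal, successors, depth_first=True):
        frontier, seen = deque([[start]]), {start}
        while frontier:
            path = frontier.pop() if depth_first else frontier.popleft()
            if is_goal(path[-1]):
                return path
            for nxt in successors(path[-1]):
                if nxt not in seen:
                    seen.add(nxt)
                    frontier.append(path + [nxt])
        return None

    GRAPH = {"A": ["B", "C"], "B": ["D"], "C": ["D"], "D": []}
    print(search("A", lambda s: s == "D", GRAPH.get))         # DFS path
    print(search("A", lambda s: s == "D", GRAPH.get, False))  # BFS path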
- Key quote from the chapter (pg. 53). "These two problems,
chess and newspaper story understanding, illustrate the
difference between problems for which a lot of knowledge is
important only to constrain the search for a solution and those
for which a lot of knowledge is required even to be able to
recognize a solution".
Skim Pages 63-98: Heuristic Search Techniques
- The general message(s) are these:
- very hard problems will tend
to have very large search spaces.
- heuristics (general rules that USUALLY apply), can be used
to limit search
- some sort of evaluation function is always necessary
- key vocabulary: heuristics allow you to "prune the search tree"
- generate-and-test: a brute-force depth-first search in its
simplest form
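- a sketch of generate-and-test at its most brute-force; the
"combination lock" example is mine:

    import itertools

    def generate_and_test(test, digits=3):
        for candidate in itertools.product(range(10), repeat=digits):
            if test(candidate):          # test each generated candidate
                return candidate
        return None

    print(generate_and_test(lambda c: c == (4, 2, 7)))  # (4, 2, 7)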
- hill-climbing: a variation on generate-and-test that
incorporates feedback from the evaluation function; usually
visualized as a landscape (see the usual diagram)
key vocabulary: the problem of local minima / maxima
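- a hill-climbing sketch; it halts at the first local maximum,
which is exactly the vocabulary problem above (the example
function is mine):

    def hill_climb(state, neighbors, score):
        while True:
            best = max(neighbors(state), key=score)
            if score(best) <= score(state):
                return state     # a local, not necessarily global, maximum
            state = best

    f = lambda x: -(x - 3) ** 2  # a single peak, at x = 3
    print(hill_climb(0, lambda x: [x - 1, x + 1], f))  # 3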
- backtracking is a simple universal strategy that requires
the algorithm to maintain state information.
- simulated annealing: a variation on hill-climbing where
random guesses are introduced (sometimes called stochastic
search)
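- an annealing sketch: identical to hill-climbing except that a
worse move is accepted with probability exp(delta / T), which
lets the search escape local maxima (the schedule and numbers
are mine):

    import math, random

    def anneal(state, neighbor, score, t=10.0, cooling=0.95, steps=500):
        for _ in range(steps):
            nxt = neighbor(state)
            delta = score(nxt) - score(state)
            if delta > 0 or random.random() < math.exp(delta / t):
                state = nxt      # always accept uphill; sometimes downhill
            t *= cooling         # lower temperature, fewer bad moves
        return state

    f = lambda x: -(x - 3) ** 2
    print(anneal(0, lambda x: x + random.choice([-1, 1]), f))  # usually 3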
- best-first search: much the same as above, but where the
evaluation function is more reliable.
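- a best-first sketch: the same skeleton as before, but the
frontier is a priority queue ordered by the evaluation function
(the toy numeric example is mine):

    import heapq

    def best_first(start, is_goal, successors, evaluate):
        frontier, seen = [(evaluate(start), start)], {start}
        while frontier:
            _, state = heapq.heappop(frontier)   # most promising first
            if is_goal(state):
                return state
            for nxt in successors(state):
                if nxt not in seen:
                    seen.add(nxt)
                    heapq.heappush(frontier, (evaluate(nxt), nxt))
        return None

    print(best_first(0, lambda s: s == 7,
                     lambda s: [s - 1, s + 1],
                     lambda s: abs(7 - s)))  # 7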
- agenda-driven search: perhaps the most interesting topic in
this chapter, as it produces answers for evaluation by
re-ordering tasks. It is something of an oddity in this chapter
- problem reduction: decomposing a problem into sub-problems via
AND-OR graphs (not just another term for "pruning")
- constraint satisfaction: be aware AI uses a strange sense
of "constraint" - the classic example is the seating chart
problem.
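- a toy seating-chart sketch: assign guests to a row of seats so
that no feuding pair is adjacent (the guests and feuds are mine;
a real constraint solver prunes rather than enumerating):

    import itertools

    GUESTS = ["Ann", "Bob", "Cid", "Dee"]
    FEUDS = {("Ann", "Bob"), ("Cid", "Dee")}

    def ok(seating):                 # constraint: no adjacent feuds
        return all((a, b) not in FEUDS and (b, a) not in FEUDS
                   for a, b in zip(seating, seating[1:]))

    print(next(p for p in itertools.permutations(GUESTS) if ok(p)))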
- means-ends analysis: not usually described in a chapter on
heuristic search. Based on human behavior (as described in Polya
and elsewhere)
key vocabulary: sub-problem decomposition
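- a means-ends sketch: repeatedly apply whichever operator most
reduces the difference between the current state and the goal
(the numeric example is mine):

    def means_ends(state, goal, operators, difference):
        while difference(state, goal) > 0:
            nxt = min((op(state) for op in operators),
                      key=lambda s: difference(s, goal))
            if difference(nxt, goal) >= difference(state, goal):
                return None          # stuck: no operator reduces the gap
            state = nxt
        return state

    print(means_ends(0, 10, [lambda x: x + 3, lambda x: x + 1],
                     lambda s, g: abs(g - s)))  # 10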
Read: 105-129: Knowledge Representation Issues
- the problem-solving power of search techniques is limited in
part because of their generality
- it is generally understood (in symbolic AI) that solving
complex problems depends on knowledge and mechanisms to
manipulate it
- the challenge is known as the (knowledge) representation
problem
- the discussion on knowledge level and symbol level, and
"representation mappings", is all about the relation between
symbols (syntax) and meaning
- one basic problem is translating informal natural language
statements into a formal notation
- dog(Spot) => Spot is a dog
- All x: dog(x) -> hastail(x) =>
All dogs have tails OR Every dog has a tail
- note: this one fact and one inference rule are enough to
produce a NEW fact => hastail(Spot)
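- the same derivation in a few lines of Python (the
representation choices are mine):

    facts = {("dog", "Spot")}
    rules = [("dog", "hastail")]     # All x: dog(x) -> hastail(x)

    for pred, concl in rules:        # one forward-chaining pass
        facts |= {(concl, arg) for p, arg in facts if p == pred}

    print(("hastail", "Spot") in facts)  # True: a NEW, derived fact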
- the authors note this is akin to generalized computer
programming: finding concrete implementation of abstract
concepts
- the authors do not note that the obverse of representation
is interpretation - representing facts and relations is for the
sole purpose of supporting inference.
i.e. dog(Spot) is just ASCII symbols without an inference mechanism
to provide meaning
- the typical AI representation is composed of two types of
thing: concepts (usually nouns) and relations
- relations are sometimes represented as a slot-and-filler
structure (also commonly, slots-with-roles-and-fillers), which
are also called attribute-value-pairs
- vocabulary: a frame system is a set of structures linked by
semantic relations; a semantic network is a set of concepts
linked by semantic relations. The latter is an older
specialization of the former, mostly used in early
(associational) memory modeling systems
- sadly, there is no generally agreed upon set of relations
- the other key idea in knowledge representation is
abstraction/inheritance (and the special relation: ISA,
sometimes written AKO, and its inverse: instance-of)
- by combining these in straight-forward ways, we can infer
that Spot is warm-blooded without explicitly representing that
fact.
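- a sketch of that inference: climb the instance-of and ISA
links until some class carries the property (the tiny taxonomy
is mine):

    ISA = {"dog": "mammal", "mammal": "animal"}
    INSTANCE_OF = {"Spot": "dog"}
    PROPS = {"mammal": {"warm-blooded"}}

    def has_property(individual, prop):
        cls = INSTANCE_OF.get(individual)
        while cls is not None:
            if prop in PROPS.get(cls, set()):
                return True
            cls = ISA.get(cls)       # climb the ISA chain
        return False

    print(has_property("Spot", "warm-blooded"))  # True, never stored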
- procedural knowledge refers to programming that effects
actions (like robot arms) or encodes if-then-else decision
making. This is an older term, not very useful any more.
- Note: many of the issues in knowledge representation are
similar to data structure issues
- Note: representing time is difficult (hence, there is an entire
branch of logic devoted to it)
- Vocabulary: granularity, "what level of detail?" "what are
the primitives?"
Skim: 131-169: Logic
- one fundamental issue with predicate logic is everything is
"truth valued"; which causes a difficult representational "fit"
for a large class of problems
- another issue is that theorem proving is both "generative"
and undecidable, where
- generative (also called forward reasoning) means starting
with axioms and theorems
(i.e. starting from first principles), and trying to generate a
new proposition that matches the goal
- undecidable means if the goal is a non-theorem, there's no
guarantee the procedure will halt
- note, however, that while the idea is conceptually
generative, the algorithms usually generate proofs by chaining
backward from the theorem to be proved to the axioms
- resolution theorem proving is conceptually the same, but
takes the approach of "contradicting the negation"
- note that unification is one of the steps in resolution
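- a minimal unification sketch (no occurs check; terms are
tuples and variables are "?"-prefixed strings, conventions that
are mine):

    def is_var(t):
        return isinstance(t, str) and t.startswith("?")

    def walk(t, s):                  # chase a variable through s
        while is_var(t) and t in s:
            t = s[t]
        return t

    def unify(x, y, s):
        x, y = walk(x, s), walk(y, s)
        if x == y:
            return s
        if is_var(x):
            return {**s, x: y}
        if is_var(y):
            return {**s, y: x}
        if isinstance(x, tuple) and isinstance(y, tuple) and len(x) == len(y):
            for a, b in zip(x, y):
                s = unify(a, b, s)
                if s is None:
                    return None
            return s
        return None

    print(unify(("dog", "Spot"), ("dog", "?x"), {}))  # {'?x': 'Spot'}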
- one of the main reasons for the popularity of resolution,
unification, and PROLOG, is that the first two are relatively
easy to implement in the third
- note, as the authors say, "people do not think in
resolution".
Skim: 171-193: Rules
- rule-based systems are often called "expert systems"
- these are typically applied to diagnostic domains
(eg. medicine), although the most commercially successful,
R1 (by DEC), configured computer systems.
- PROLOG is often used to implement rule-based systems
- one essential control method is the order in which the rules
are stored in the rule base
- the conceptual algorithm is the same as with logic-based
systems: begin with a goal statement (to be "proved") and look
for (chains of) assertions that prove it
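- the conceptual algorithm in miniature (a propositional
backward chainer; the diagnostic rule base is mine):

    RULES = {"cold": [["fever", "runny_nose"]],   # goal: [premise lists]
             "fever": [["high_temp"]]}
    FACTS = {"high_temp", "runny_nose"}

    def prove(goal):
        if goal in FACTS:
            return True
        return any(all(prove(p) for p in premises)
                   for premises in RULES.get(goal, []))

    print(prove("cold"))  # True, via the fever sub-goal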
- PROLOG provides a built-in search engine, but search control
is fixed (depth-first with backtracking), and it is very
difficult to apply domain knowledge to constrain search
- a pure PROLOG system (strictly Horn clauses, no function
symbols) is decidable, and implements "negation as failure"
- negation as failure implies a "closed world assumption"
(that every useful fact is stored in the rule-base)
- this assumption causes a difficult representational "fit"
for a large class of problems
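- negation as failure in one line on top of a prover like the
one above: "not G" succeeds exactly when G cannot be proved, so
anything absent from the rule base is assumed false:

    FACTS = {"fever"}

    def prove(goal):                 # stand-in for a full backward chainer
        return goal in FACTS

    def not_(goal):
        return not prove(goal)       # negation as failure

    print(not_("measles"))  # True only under the closed-world assumption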
- rule-based systems get more interesting when there is a
facility for "partial matching" (as with the regular expressions
in ELIZA)
- expert systems evolved rule sets that included "meta rules"
(rules about rules) as a way to exert more control over
problem-solving and run-times
- historical note: expert systems were extremely fashionable
(and fundable) in the mid-80s. This led to the formation of a
mini-industry for "expert system shells" (systems for building
expert systems), which in turn led to the famous debunking paper:
"The expert system shell game".
- summary: expert systems are known to be "brittle" - they are
difficult to maintain and difficult to add onto - and they are
known for "ungraceful" failures (not producing answers, or
producing very bad answers).
Skim: 195-229: Uncertainty
- non-monotonic reasoning (also called "defeasible")
- the basic intuition views this as reasoning about "possible
worlds" where some facts are not indisputable and new facts can
change the state of the universe
- you can think of it as a "set" of logic- or rule-based
systems, where every uncertainty is enumerated in one or another
of the possible worlds
- then problem solving reduces to computing solutions in ALL
the possible worlds to find the best one
- this is why non-monotonic reasoning is criticized for its
"combinatorial explosion".
- special note: abductive reasoning is a new formalism that
relaxes the usual rules of deduction;
- eg. if A implies B, and B
is true, then abduction says we can assume A is true, even
without direct evidence.
- critics call this "reasoning from a faulty premise" but
there is an abductive reasoning community out there.
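- abduction in miniature: run the implications backward and
collect candidate causes (the rules are mine):

    RULES = [("rain", "wet_grass"), ("sprinkler", "wet_grass")]

    def abduce(observation):         # causes whose effect matches
        return [c for c, effect in RULES if effect == observation]

    print(abduce("wet_grass"))  # ['rain', 'sprinkler']: assumed, not proved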
Skim: 231-248: Statistics
- statistical reasoning (sometimes called stochastic
reasoning) divides into two general areas
- probabilities associated with rules
- where, for example, low-grade
fever and a runny nose indicate the common cold, but only
about 80% of the time.
- these systems depend on judgements of domain experts and
reasonably standard set theory and logic.
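- a sketch of one standard combination rule (MYCIN-style
certainty factors, for two positive confidences; the numbers
are mine):

    def combine(cf1, cf2):           # evidence reinforces, capped below 1.0
        return cf1 + cf2 * (1 - cf1)

    print(combine(0.8, 0.5))  # 0.9: fever rule plus runny-nose rule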
- fuzzy logic, where concepts or entities can have graded
(partial) membership in a set
- fuzzy logic is intended to support reasoning on propositions
that have "degrees of truth".
- supporters claim this is a better model of reality
- critics observe that decisions based on fuzzy logic always
depend on threshold values, which effectively reduces to truth
valued logic.
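- a fuzzy-membership sketch that also shows the critics' point:
the final decision still comes down to a threshold (the
membership function is mine):

    def tall(height_cm):             # graded membership in "tall"
        return min(max((height_cm - 160) / 40, 0.0), 1.0)

    def fuzzy_and(a, b):             # standard fuzzy conjunction
        return min(a, b)

    print(tall(180))                 # 0.5: half a member of "tall"
    print(fuzzy_and(tall(180), 0.9)) # 0.5
    print(tall(180) >= 0.5)          # the crisp threshold decision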