0203. Introduction to Artificial Intelligence

Reasoning with Uncertainty

 

1. Introduction

Though there are various kinds of uncertainty in all aspects of a reasoning system (such as term, statement, inference rule, and inference process), the "reasoning with uncertainty" (or "reasoning under uncertainty") research in AI has been focused on the uncertainty of truth value, that is, to allow and process truth values other than "true" and "false".

Generally speaking, to develop a system that reasons with uncertainty means to provide the following:

This is a large and active subfield of AI research. For further information, see Association for Uncertainty in Artificial Intelligence.

 

2. Nonmonotonic logics

A logic is monotonic if the truth of a proposition does not change when new information (axioms) are added to the system --- the set of theorem can only monotonically grows. In contrast, a logic is non-monotonic if the truth of a proposition may change when new information (axioms) is added --- the set of theorem may either grow or shrink.

Nonmonotonic logics are used to formalize plausible reasoning, such as the following inference step:

		Birds typically fly.
		Tweety is a bird.
		--------------------------
		Tweety (presumably) flies.
Such arguments are characteristic of commonsense reasoning, where default rules are applied when case-specific information is not available.

The conclusion of nonmonotonic argument may not be correct. For example, if Tweety is a penguin, it is incorrect to conclude that Tweety flies. Nonmonotonic reasoning often requires jumping to a conclusion and subsequently retracting that conclusion as further information becomes available.

All systems of nonmonotonic reasoning are concerned with the issue of consistency. Inconsistency is resolved by removing the relevant conclusion(s) derived previously by default rules. Simply speaking, the truth value of propositions in a nonmonotonic logic can be classified into the following types:

  1. traditional binary value (true or false), such as "Tweety is a bird";.
  2. accepted defaults, whose conclusions are normally true, such as "Birds typically fly".
  3. conclusions derived from the above two types, such as "Tweety flies"
When an inconsistency is recognized, only the truth value of the last type is changed.

A related issue is belief revision. Revising a knowledge base often follows the principle of minimal change: one conserves as much information as possible.

One approach towards this problem is "truth maintenance system", in which a "justification" for each proposition is stored, so that when some propositions are rejected, some others may need to be removed, too.

Major problems in these approaches: conflicts in defaults and computational efficiency.

Example: "Nixon Diamond".

Further readings.

 

3. Probabilistic reasoning

Basic idea: to use probability theory to represent and process uncertainty. In probabilistic reasoning, the truth value of a proposition is extended from {0, 1} to [0, 1], with binary logic as its special case.

Justification: though no conclusion is absolutely true, the one with the highest probability is preferred. Under certain assumptions, probability theory gives the optimum solutions.

To extend the basic Boolean connectives to probabily functions:

Conditional probability (by division): the probability of B given A is P(B|A) = P(B∧A) / P(A).

Bayes' Theorem is often used to update a system's belief according to new information: P(H|E) = P(E|H) * P(H) / P(E).

Bayesian Networks are directed acyclic graphs in which the nodes represent variables of interest and the links represent informational or causal dependencies among the variables. The strength of dependency is represented by conditional probabilities. Compared to other approaches of probabilistic reasoning, Bayesian network is more efficient, though its actual computational cost is still high for complicated problems.

Applications: diagnostic systems, such as Office Assistant in Microsoft Office.

Limitation of this kind of approach: truth can be unknown, but probability must be known. For a practical application, the necessary data need to be collected and processed to get the required probability values. Also, the calculation must be fast enough.

Considering the uncertainty in probability judgments, some people go further to study imprecise probability and evidence-based confidence.

Further readings.

 

4. Fuzzy logic

Fuzzy logic is a generalization of classical logic, and reflects the impression of human reasoning.

Examples of fuzzy concepts: "young", "furniture", "most", "cloudy", and so on.

According to fuzzy logic, whether an instance belongs to a concept is usually a matter of "yes/no", but a matter of degree. Fuzzy logic uses a degree of membership, which is a real number in [0, 1].

A major difference between this number and probability is: the uncertainty in fuzzy concepts usually does not get reduced with the coming of new information. Compare the following two cases:

Basic fuzzy operations: Typically, in building a fuzzy system, the designer needs to provide all membership functions included in it, by considering how the concepts are used by average people.

The major applications of fuzzy logic so far are in control systems, where expert knowledge is coded into fuzzy rules. Example: Fuzzy Anti-lock Brake Systems (ABS).

Issues:

Other possibilities: an evidence-based interpretation of fuzziness.

Further readings.