Information about First Order Logic
First-order logic (FOL) is a formal deductive system used by mathematicians, philosophers, linguists, and computer scientists. It goes by many names, including: first-order predicate calculus (FOPC), the lower predicate calculus, the language of first-order logic or predicate logic. Unlike natural languages such as English, FOL uses a wholly unambiguous formal language interpreted by mathematical structures. FOL is a system of deduction extending propositional logic by allowing quantification over individuals of a given domain (universe) of discourse. For example, it can be stated in FOL "Every individual has the property P".
While propositional logic deals with simple declarative propositions, first-order logic additionally covers predicates and quantification. Take for example the following sentences: "Socrates is a man", "Plato is a man". In propositional logic these will be two unrelated propositions, denoted for example by p and q. In first-order logic however, both sentences would be connected by the same property: Man(x), where Man(x) means that x is a man. When x=Socrates we get the first proposition - p, and when x=Plato we get the second proposition - q. Such a construction allows for a much more powerful logic when quantifiers are introduced, such as "for every x..." - for example, "for every x, if Man(x), then...". Without quantifiers, every valid argument in FOL is valid in propositional logic, and vice versa.
A first-order theory consists of a set of axioms (usually finite or recursively enumerable) and the statements deducible from them given the underlying deducibility relation. Usually what is meant by 'first-order theory' is some set of axioms together with those of a complete (and sound) axiomatization FOL of first-order logic, closed under the rules of FOL. (Any such system FOL will give rise to the same abstract deducibility relation, so we needn't have a fixed axiomatic system in mind.) A first-order language has sufficient expressive power to formalize two important mathematical theories: ZFC set theory and Peano arithmetic. A first-order language cannot, however, categorically express the notion of countability even though it is expressible in the first-order theory ZFC under the intended interpretation of the symbolism of ZFC. Such ideas can be expressed categorically with second order logic.
to mean "therefore").
This translation is invalid: logic validates arguments according to their structure, and nothing in the structure of this translated argument (C follows from A and B, for arbitrary A, B, C) suggests that it is valid. A translation that preserves the intuitive (and formal) validity of the argument must take into consideration the deeper structure of propositions, such as the essential notions of predication and quantification. Propositional logic deals only with truth-functional validity: any assignment of truth-values to the variables of the argument should make either the conclusion true or at least one of the premises false. Clearly we may (uniformly) assign truth values to the variables of the above argument such that A, B are both true but C is false. Hence the argument is truth-functionally invalid. On the other hand, it is impossible to (uniformly) assign truth values to the argument "A follows from (A and B)" such that (A and B) is true (hence A is true and B is true) and A false.
In contrast, this argument can be easily translated into first-order logic: Where "
" means "for all x", "
" means "implies", means "Socrates is a man", and means "Socrates is mortal".
FOL can also express the existence of something (
), as well as predicates ("functions" that are true or false) with more than one parameter. For example, "there is someone who can be fooled every time" can be expressed as:
Where "
" means "there exists (an) x", "
" means "and", and means "(person) x can be fooled (at time) y".
FOL is used as the basic "building block" for many mathematical theories. FOL provides several built-in rules, such as the axiom
(if P(x) is true for every x then P(x) is true for every x).
Additional non-logical axioms are added to produce specific first-order theories based on the axioms of classical FOL; these theories built on FOL are called classical first-order theories. One example of a classical first-order theory is Peano arithmetic, which adds to axiom
(i.e. for every x there exists y such that y=x+1, where Q(x,y) is interpreted as "y=x+1"). This additional axiom is a non-logical axiom; it is not part of FOL, but instead is an axiom of the theory (an axiom of arithmetic rather than of logic). Axioms of the latter kind are also called axioms of first-order theories.
The axioms of first-order theories are not regarded as truths of logic per se, but rather as truths of the particular theory that usually has associated with it an intended interpretation of its non-logical symbols. (See an analogous idea at logical versus non-logical symbols.) Thus, the axiom about Q(x,y) is true only with the interpretation of the relation Q(x,y) as "y=x+1", and only in the theory of Peano arithmetic.
Classical FOL does not have associated with it an intended interpretation of its non-logical vocabulary (except arguably a symbol denoting identity, depending on whether one regards such a symbol as logical).
Classical set-theory is another example of a first-order theory (a theory built on FOL).
There are several further minor variations listed below:
(forall),
(exists),
(not),
(and),
(or),
(implies), and
(if and only if). The exclusive-or operator "xor" is also common.
The sets of constants, functions, and relations are usually considered to form a language, while the variables, logical operators, and quantifiers are usually considered to belong to the logic. For example, the language of group theory consists of one constant (the identity element), one function of valence 1 (the inverse) one function of valence 2 (the product), and one relation of valence 2 (equality), which would be omitted by authors who include equality in the underlying logic.
For example,
x
y (P(f(x))
(P(x)
Q(f(y),x,z))) is a well-formed formula, if f is a function of valence 1, P a predicate of valence 1 and Q a predicate of valence 3.
x x
is not a well-formed formula.
In Computer Science terminology, a formula implements a built-in "boolean" type, while a term implements all other types.
To see why this condition is necessary, consider the formula φ(x) given by
y y ≤ x ("x is maximal"). If t is a term without y as a free variable, then φ(t) just means t is maximal. However if t is y the formula φ(y) is
y y ≤ y which does not say that y is maximal. The problem is that the free variable
y of t (=y) became bound when we substituted y for x in φ(x). So to form φ(y) we must first change the bound variable y of φ to something else, say z, so that φ(y) is then
z z ≤ y. Forgetting this condition is a notorious cause of errors.
Inference rules are used to prove theorems, which are sentences provable in a theory. If the premises of an inference rule are theorems, then its conclusion is a theorem as well. In other words, inference rules are used to generate "new" theorems from "old" ones. The procedure of generating theorems is known as predicate calculus and is described in the relevant section below.
The inference rule modus ponens is the only one required from propositional logic for the formalization given here. It states that if φ and φ
ψ are both theorems, then ψ is a theorem. This can be written as following;
is a sign that the formula following it is a theorem.
The inference rule called Universal Generalization is characteristic of the predicate calculus. It can be stated as
Generalization is analogous to the Necessitation Rule of modal logic, which is
For any theory, it is of interest to know whether the set of axioms can be generated by an algorithm, or if there is an algorithm which determines whether a well-formed formula is an axiom.
If there is an algorithm to generate all axioms, then the set of axioms is said to be recursively enumerable.
If there is an algorithm which determines after a finite number of steps whether a formula is an axiom or not, then the set of axioms is said to be recursive or decidable. In that case, one may also construct an algorithm to generate all axioms: this algorithm simply builds all possible formulas one by one (with growing length), and for each formula the algorithm determines whether it is an axiom.
Axioms of first-order logic are always decidable. However, in a first-order theory non-logical axioms are not necessarily such.
Another axiom, , for Z in which x does not occur, is sometimes added.
is provable in first order logic, for some finite set of axioms of the theory T. In other words, if one can prove in first order logic that φ follows from the axioms of T. This also means, that we replace the above procedure for finding provable sentences by the following one:
There are many different (but equivalent) ways to define provability. The definition above is a typical example of a "Hilbert style" calculus, which has a lot of different axioms but very few rules of inference. The "Gentzen style" predicate calculus differs in that it has very few axioms but many rules of inference.
One may construct an algorithm which will determine in finite number of steps whether a sentence is provable (a decidable algorithm) only for simple classes of first-order logic.
means "p, or q, or both", that is, it is
inclusive. In English, the word "or" is sometimes inclusive (e.g,
"cream or sugar?"), but sometimes it is exclusive (e.g., "coffee or tea?"
is usually intended to mean one or the other, not both).
Similarly, the English word "some" may mean "at least one, possibly all", but
other times it may mean "not all, possibly none".
The English word "and" should sometimes be translated as "or" (e.g.,
"men and women may apply").
[2]
Mathematically, it is possible to redefine a complete set of new functions that match the formula operators, but this is quite clumsy.[3] A predicate if(c,a,b) can be expressed in FOL if rewritten as , but this is clumsy if the condition c is complex. Many extend FOL to add a special-case predicate named "if(condition, a, b)" (where a and b are formulas) and/or function "ite(condition, a, b)" (where a and b are terms), both of which accept a formula as the condition, and are equal to "a" if condition is true and "b" if it is false. These extensions make FOL easier to use for some problems, and make some kinds of automatic theorem-proving easier.[4] Others extend FOL further so that functions and predicates can accept both terms and formulas at any position.
Single-parameter predicates can be used to implement the notion of types where appropriate. For example, in: , the predicate Man(x) could be considered a kind of "type assertion" (that is, that x must be a Man). Predicates can also be used with the "exists" quantifier to identify types, but this should usually be done with the "and" operator instead, e.g.: ("there exists something that is both a man and is mortal"). It is easy to write , but this would be equivalent to ("there is something that is not a man, and/or there exists something that is mortal"), which is usually not what was intended. Similarly, assertions can be made that one type is a subtype of another type, e.g.: ("for all x, if x is a man, then x is a mammal").
It follows from the Löwenheim–Skolem theorem that it is not possible to characterize finiteness or countability in first-order logic. For example, in first-order logic one cannot assert the least-upper-bound property for sets of real numbers, which states that every bounded, nonempty set of real numbers has a supremum; A second-order logic is needed for that.
First order logic in which no atomic sentence lies in the scope of more than three quantifiers, has the same expressive power as the relation algebra of Tarski and Givant (1987). They also show that FOL with a primitive ordered pair, and a relation algebra including the two ordered pair projection relations, are equivalent.
While propositional logic deals with simple declarative propositions, first-order logic additionally covers predicates and quantification. Take for example the following sentences: "Socrates is a man", "Plato is a man". In propositional logic these will be two unrelated propositions, denoted for example by p and q. In first-order logic however, both sentences would be connected by the same property: Man(x), where Man(x) means that x is a man. When x=Socrates we get the first proposition - p, and when x=Plato we get the second proposition - q. Such a construction allows for a much more powerful logic when quantifiers are introduced, such as "for every x..." - for example, "for every x, if Man(x), then...". Without quantifiers, every valid argument in FOL is valid in propositional logic, and vice versa.
A first-order theory consists of a set of axioms (usually finite or recursively enumerable) and the statements deducible from them given the underlying deducibility relation. Usually what is meant by 'first-order theory' is some set of axioms together with those of a complete (and sound) axiomatization FOL of first-order logic, closed under the rules of FOL. (Any such system FOL will give rise to the same abstract deducibility relation, so we needn't have a fixed axiomatic system in mind.) A first-order language has sufficient expressive power to formalize two important mathematical theories: ZFC set theory and Peano arithmetic. A first-order language cannot, however, categorically express the notion of countability even though it is expressible in the first-order theory ZFC under the intended interpretation of the symbolism of ZFC. Such ideas can be expressed categorically with second order logic.
Why is first-order logic needed?
Propositional logic is not adequate for formalizing valid arguments that rely on the internal structure of the propositions involved. To see this, consider the valid syllogistic argument:- All men are mortal
- Socrates is a man
- Therefore, Socrates is mortal
- A
- B
C
to mean "therefore").
This translation is invalid: logic validates arguments according to their structure, and nothing in the structure of this translated argument (C follows from A and B, for arbitrary A, B, C) suggests that it is valid. A translation that preserves the intuitive (and formal) validity of the argument must take into consideration the deeper structure of propositions, such as the essential notions of predication and quantification. Propositional logic deals only with truth-functional validity: any assignment of truth-values to the variables of the argument should make either the conclusion true or at least one of the premises false. Clearly we may (uniformly) assign truth values to the variables of the above argument such that A, B are both true but C is false. Hence the argument is truth-functionally invalid. On the other hand, it is impossible to (uniformly) assign truth values to the argument "A follows from (A and B)" such that (A and B) is true (hence A is true and B is true) and A false.
In contrast, this argument can be easily translated into first-order logic: Where "
" means "for all x", "
" means "implies", means "Socrates is a man", and means "Socrates is mortal".
FOL can also express the existence of something (
), as well as predicates ("functions" that are true or false) with more than one parameter. For example, "there is someone who can be fooled every time" can be expressed as:
Where "
" means "there exists (an) x", "
" means "and", and means "(person) x can be fooled (at time) y".
Defining first-order logic
A predicate calculus consists of- formation rules (i.e. recursive definitions for forming well-formed formulas).
- transformation rules (i.e. inference rules for deriving theorems).
- axioms or axiom schemata (possibly countably infinite).
FOL is used as the basic "building block" for many mathematical theories. FOL provides several built-in rules, such as the axiom
(if P(x) is true for every x then P(x) is true for every x).
Additional non-logical axioms are added to produce specific first-order theories based on the axioms of classical FOL; these theories built on FOL are called classical first-order theories. One example of a classical first-order theory is Peano arithmetic, which adds to axiom
(i.e. for every x there exists y such that y=x+1, where Q(x,y) is interpreted as "y=x+1"). This additional axiom is a non-logical axiom; it is not part of FOL, but instead is an axiom of the theory (an axiom of arithmetic rather than of logic). Axioms of the latter kind are also called axioms of first-order theories.
The axioms of first-order theories are not regarded as truths of logic per se, but rather as truths of the particular theory that usually has associated with it an intended interpretation of its non-logical symbols. (See an analogous idea at logical versus non-logical symbols.) Thus, the axiom about Q(x,y) is true only with the interpretation of the relation Q(x,y) as "y=x+1", and only in the theory of Peano arithmetic.
Classical FOL does not have associated with it an intended interpretation of its non-logical vocabulary (except arguably a symbol denoting identity, depending on whether one regards such a symbol as logical).
Classical set-theory is another example of a first-order theory (a theory built on FOL).
Vocabulary
Before setting up the formation rules, one has to describe the "vocabulary", which is composed of- A set of predicate variables (or relations) each with some valence (or arity, number of its arguments) ≥1, which are often denoted by uppercase letters P, Q, R,... .
- * For example, P(x) is a predicate variables of valence 1. It can stand for "x is a man", for example.
- * Q(x,y) is a predicate variables of valence 2. It can stand for "x is greater than y" in arithmetic or "x is the father of y", for example.
- * It is possible to allow relations of valence 0; these could be considered as propositional variables. For example, P, which can stand for any statement.
- * By using functions (see below), it is possible to dispense with all predicate variables with valence larger than one. For example, "x>y" (a predicate of valence 2, of the type Q(x,y)) can be replaced by a predicate of valence 1 about the ordered pair (x,y).
- A set of constants, often denoted by lowercase letters at the beginning of the alphabet a, b, c,... .
- * Examples: a may stand for Socrates. In arithmetic, it may stand for 0. In set theory, such a constant may stand for the empty set.
- A set of functions, each of some valence ≥ 1, which are often denoted by lowercase letters f, g, h,... .
- * Examples: f(x) may stand for "the father of x". In arithmetic, it may stand for "-x". In set theory, it may stand for "the power set of x". In arithmetic, f(x,y) may stand for "x+y". In set theory, it may stand for "the union of x and y".
- * A constant is really a function of valence 0. However it is traditional to use the term "function" only for functions of valence at least 1.
- * One can in principle dispense entirely with functions of arity > 2 and predicates of arity > 1 if there is a function symbol of arity 2 representing an ordered pair (or predicate symbols of arity 2 representing the projection relations of an ordered pair). The pair or projections need to satisfy the natural axioms.
- * One can in principle dispense entirely with functions and constants. For example, instead of using a constant 0 one may use a predicate 0(x) (interpreted as "x=0"), and replace every predicate such as P(0,y) with
x 0(x)
P(x,y). A function such as f(x1,x2...) will similarly be replaced by a predicate F(x1,x2...,y) (interpreted as "y=f(x1,x2...)").
- An infinite set of variables, often denoted by lowercase letters at the end of the alphabet x, y, z,... .
- Symbols denoting logical operators (or connectives):
(logical not),
(logical conditional).
- *
(φ
ψ) is logically equivalent to φ
ψ (logical and). φ
ψ can be seen as a shorthand for this. Alternatively, one may add the
symbol as a logical operator to the vocabulary, and appropriate axioms.
- *
φ
ψ is logically equivalent to φ
ψ (logical or). φ
ψ can be seen as a shorthand for this. Alternatively, one may add the
symbol as a logical operator to the vocabulary, and appropriate axioms.
- *Similarly, (φ
ψ)
(ψ
φ) is logically equivalent to φ
ψ (logical biconditional), and one may use the latter as a shorthand for this, or alternatively add this to the vocabulary and add appropriate axioms. Sometimes φ
ψ is written as φ
ψ.
- Symbols denoting quantifiers:
(universal quantification, typically read as "for all").
- *
x
φ) is logically equivalent to
x φ (existential quantification, typically read as "there exists"). The latter can either be used as a shorthand for this, or added to the vocabulary together with appropriate axioms.
- Left and right parenthesis.
- * There are many different conventions about where to put parentheses; for example, one might write
x or (
x). Sometimes one uses colons or full stops instead of parentheses to make formulas unambiguous. One interesting but rather unusual convention is "Polish notation", where one omits all parentheses, and writes
,
, and so on in front of their arguments rather than between them. Polish notation is compact and elegant, but rare because it is hard for humans to read it.
- An identity or equality symbol = is sometimes but not always included in the vocabulary.
- * If equality is considered to be a part of first order logic, then the equality symbol behaves syntactically as a binary predicate. This case is sometimes called first order logic with equality.
There are several further minor variations listed below:
- The set of primitive symbols (operators and quantifiers) often varies. It is possible to include other operators as primitive, such as
(iff),
(and) and
(or), the truth constants T for "true" and F for "false" (these are operators of valence 0), and/or the Sheffer stroke (P | Q, aka NAND). The minimum number of primitive symbols needed is one, but if we restrict ourselves to the operators listed above, we need three, as above.
- Some books and papers use the notation φ
ψ for φ
ψ. This is especially common in proof theory where
is easily confused with the sequent arrow. One also sees ~φ for
φ, φ & ψ for φ
ψ, and a wealth of notations for quantifiers; e.g.,
x φ may be written as (x)φ. This latter notation is common in texts on recursion theory.
- It is often easier in practice to use a simpler notation that supports infix operators and omits unnecessary parentheses. Thus if P is a relation of valence 2, we often write "a P b" instead of "P a b"; for example, we write 1 < 2 instead of <(1 2). Similarly if f is a function of valence 2, we sometimes write "a f b" instead of "f(a b)"; for example, we write 1 + 2 instead of +(1 2). By convention, infix operators tend to use non-alphabetic function names. It is also common to omit some parentheses if this does not lead to ambiguity (leading to defining a precedence).
- Sometimes it is useful to say that "P(x) holds for exactly one x", which can be expressed as x P(x). This notation may be taken to abbreviate a formula such as
x (P(x)
y (P(y)
(x = y))) .
(forall),
(exists),
(not),
(and),
(or),
(implies), and
(if and only if). The exclusive-or operator "xor" is also common.
The sets of constants, functions, and relations are usually considered to form a language, while the variables, logical operators, and quantifiers are usually considered to belong to the logic. For example, the language of group theory consists of one constant (the identity element), one function of valence 1 (the inverse) one function of valence 2 (the product), and one relation of valence 2 (equality), which would be omitted by authors who include equality in the underlying logic.
Formation rules
The formation rules define the terms, formulas and free variables in them as follows.Terms
The set of terms is recursively defined by the following rules:- Any constant is a term.
- Any variable is a term.
- Any expression f(t1,...,tn) of n ≥ 1 arguments (where each argument ti is a term and f is a function symbol of valence n) is a term.
- Closure clause: Nothing else is a term. For example, predicates are not terms.
Formulas
The set of well-formed formulas (usually called wffs or just formulas) is recursively defined by the following rules:- Simple and complex predicates If P is a relation of valence n ≥ 1 and the ai are terms then P(a1,...,an) is well-formed. If equality is considered part of logic, then (a1 = a2) is well formed. All such formulas are said to be atomic.
- Inductive Clause I: If φ is a wff, then
φ is a wff.
- Inductive Clause II: If φ and ψ are wffs, then (φ
ψ) is a wff.
- Inductive Clause III: If φ is a wff and x is a variable, then
x φ is a wff.
- Closure Clause: Nothing else is a wff.
For example,
x
y (P(f(x))
(P(x)
Q(f(y),x,z))) is a well-formed formula, if f is a function of valence 1, P a predicate of valence 1 and Q a predicate of valence 3.
x x
is not a well-formed formula.
In Computer Science terminology, a formula implements a built-in "boolean" type, while a term implements all other types.
Free and Bound Variables
- Atomic formulas If φ is an Atomic formula then x is free in φ if and only if x occurs in φ.
- Inductive Clause I: x is free in
φ if and only if x is free in φ.
- Inductive Clause II: x is free in (φ
ψ) if and only if x is free in φ and does not occur in ψ, or x is free in ψ and does not occur in φ, or x is free in both φ and ψ.
- Inductive Clause III: x is free in
y φ if and only if x is free in φ and x is a different symbol than y.
- Closure Clause: x is bound in φ if and only if x occurs in φ and x is not free in φ.
- For example, in
x
y (P(x)
Q(x,f(x),z)), x and y are bound variables, z is a free variable, and w is neither because it does not occur in the formula.
Example
In mathematics the language of ordered abelian groups has one constant 0, one unary function −, one binary function +, and one binary relation ≤. So:- 0, x, y are atomic terms
- +(x, y), +(x, +(y, −(z))) are terms, usually written as x + y, x + y − z
- =(+(x, y), 0), ≤(+(x, +(y, −(z))), +(x, y)) are atomic formulas, usually written as x + y = 0, x + y - z ≤ x + y,
- (
x
y ≤( +(x, y), z))
(
x =(+(x, y), 0)) is a formula, usually written as (
x
y x + y ≤ z)
(
x x + y = 0).
Substitution
If t is a term and φ(x) is a formula possibly containing x as a free variable, then φ(t) is defined to be the result of replacing all free instances of x by t, provided that no free variable of t becomes bound in this process. If some free variable of t becomes bound, then to substitute t for x it is first necessary to change the names of bound variables of φ to something other than the free variables of t.To see why this condition is necessary, consider the formula φ(x) given by
y y ≤ x ("x is maximal"). If t is a term without y as a free variable, then φ(t) just means t is maximal. However if t is y the formula φ(y) is
y y ≤ y which does not say that y is maximal. The problem is that the free variable
y of t (=y) became bound when we substituted y for x in φ(x). So to form φ(y) we must first change the bound variable y of φ to something else, say z, so that φ(y) is then
z z ≤ y. Forgetting this condition is a notorious cause of errors.
Inference rules
An inference rule relates a fixed number of well-formed formulas, called premises, to one well-formed formula, called conclusion.Inference rules are used to prove theorems, which are sentences provable in a theory. If the premises of an inference rule are theorems, then its conclusion is a theorem as well. In other words, inference rules are used to generate "new" theorems from "old" ones. The procedure of generating theorems is known as predicate calculus and is described in the relevant section below.
The inference rule modus ponens is the only one required from propositional logic for the formalization given here. It states that if φ and φ
ψ are both theorems, then ψ is a theorem. This can be written as following;
- if
and
, then 
is a sign that the formula following it is a theorem.
The inference rule called Universal Generalization is characteristic of the predicate calculus. It can be stated as
- if
, then 
- Notice that the similar-looking formula
is NOT a theorem and is a WRONG statement, unless x does not occur in φ (see Generalization (logic)).
Generalization is analogous to the Necessitation Rule of modal logic, which is
- if
, then 
Axioms
Here follows a description of the axioms of first-order logic. As explained above, a given first-order theory has further, non-logical axioms. The following logical axioms characterize a predicate calculus for this article's example of first-order logic[1].For any theory, it is of interest to know whether the set of axioms can be generated by an algorithm, or if there is an algorithm which determines whether a well-formed formula is an axiom.
If there is an algorithm to generate all axioms, then the set of axioms is said to be recursively enumerable.
If there is an algorithm which determines after a finite number of steps whether a formula is an axiom or not, then the set of axioms is said to be recursive or decidable. In that case, one may also construct an algorithm to generate all axioms: this algorithm simply builds all possible formulas one by one (with growing length), and for each formula the algorithm determines whether it is an axiom.
Axioms of first-order logic are always decidable. However, in a first-order theory non-logical axioms are not necessarily such.
Quantifier axioms
Quantifier axioms change according to how the vocabulary is defined, how the substitution procedure works, what are the formation rules and which inference rules are used. Here follows a specific example of these axioms- PRED-1:
- PRED-2:
- PRED-3:
- PRED-4:
Another axiom, , for Z in which x does not occur, is sometimes added.
Equality and its axioms
There are several different conventions for using equality (or identity) in first order logic. This section summarizes the main ones. The various conventions all give essentially the same results with about the same amount of work, and differ mainly in terminology.- The most common convention for equality is to include the equality symbol as a primitive logical symbol, and add the axioms for equality to the axioms for first order logic. The equality axioms are
- :x = x
- :x = y → f(...,x,...) = f(...,y,...) for any function f
- :x = y → (P(...,x,...) → P(...,y,...)) for any relation P (including the equality relation itself)
- These are, too, axiom schemata: they define an algorithm which decides whether a given formula is an axiom. Thus this is a recursive set of axioms.
- The next most common convention is to include the equality symbol as one of the relations of a theory, and add the equality axioms to the axioms of the theory. In practice this is almost indistinguishable from the previous convention, except in the unusual case of theories with no notion of equality. The axioms are the same, and the only difference is whether one calls some of them logical axioms or axioms of the theory.
- In theories with no functions and a finite number of relations, it is possible to define equality in terms of the relations, by defining the two terms s and t to be equal if any relation is unchanged by changing s to t in any argument.
- :For example, in set theory with one relation
, we may define s = t to be an abbreviation for
x (s
x
t
x)
x (x
s
x
t). This definition of equality then automatically satisfies the axioms for equality.
- :Alternatively, if one does use the equality symbol as a relation of the theory or of logic, then one would have to add axioms. In the previous example, one would have to add the axiom
s
t (
x (x
s
x
t))
s = t.
- In some theories it is possible to give ad hoc definitions of equality. For example, in a theory of partial orders with one relation ≤ we could define s = t to be an abbreviation for s ≤ t
t ≤ s.
Predicate calculus
The predicate calculus is an extension of the propositional calculus that defines which statements of first order logic are provable. It is a formal system used to describe mathematical theories. If the propositional calculus is defined with a suitable set of axioms and the single rule of inference modus ponens (this can be done in many different ways), then the predicate calculus can be defined by appending some additional axioms and the additional inference rule "universal generalization". More precisely, as axioms for the predicate calculus we take:- All tautologies from the propositional calculus (with the proposition variables replaced by formulas).
- The axioms for quantifiers, given above.
- The axioms for equality given above, if equality is regarded as a logical concept.
- An axiom of the predicate calculus is provable in first order logic.
- If the premises of an inference rule are provable in first order logic, then so is its conclusion.
is provable in first order logic, for some finite set of axioms of the theory T. In other words, if one can prove in first order logic that φ follows from the axioms of T. This also means, that we replace the above procedure for finding provable sentences by the following one:
- An axiom of T is provable in T.
- An axiom of the predicate calculus is provable in T.
- If the premises of an inference rule are provable in T, then so is its conclusion.
There are many different (but equivalent) ways to define provability. The definition above is a typical example of a "Hilbert style" calculus, which has a lot of different axioms but very few rules of inference. The "Gentzen style" predicate calculus differs in that it has very few axioms but many rules of inference.
Provable identities
The following identities are examples to sentences provable in first order logic- (where the variable
must not occur free in
)
- (where the variable
must not occur free in
)
Possible additional inference rules
The following set of inference rules is an example to such a set that can be added to the two inference rules discussed above
(If c is a variable, then it must not already be quantified somewhere in P(x))
(x must not appear free in P(c))
Metalogical theorems of first-order logic
Some important metalogical theorems are listed below in bulleted form. What they roughly mean is that a sentence is valid if and only if it is provable. Furthermore, one can construct an algorithm which works as follows: if a sentence is provable, the algorithm will tell us that in a finite but unknown amount of time. If a sentence is unprovable, the algorithm will run forever, and we will not know whether the sentence is unprovable or provable and the algorithm has just not yet told us that. Such an algorithm is called semidecidable.One may construct an algorithm which will determine in finite number of steps whether a sentence is provable (a decidable algorithm) only for simple classes of first-order logic.
- The decision problem for validity is semidecidable; in other words, there is a Turing machine that when given any sentence as input, will halt if and only if the sentence is valid (true in all models).
- * As Gödel's completeness theorem shows, for any valid formula P, P is provable. Conversely, assuming consistency of the logic, any provable formula is valid.
- * The Turing machine can be one which generates all provable formulas in the following manner: for a finite or recursively enumerable set of axioms, such a machine can be one that generates an axiom, then generates a new provable formula by application of axioms and inference rules already generated, then generate another axiom, and so on. Given a sentence as input, the Turing machine simply go on and generates all provable formulas one by one, and will halt if it generates the sentence.
- Unlike the propositional logic, first-order logic is undecidable, provided that the language has at least one predicate of valence at least 2 other than equality. This means that there is no decision procedure that correctly determines whether an arbitrary formula is valid. Because there is a Turing machine as described above, the undecidability is related to the unsolvability of the Halting problem: there is no algorithm which determines after a finite number of steps whether the Turing machine will ever halt for a given sentence as its input, hence whether the sentence is provable. This result was established independently by Church and Turing.
- Monadic predicate logic (i.e., predicate logic with only predicates of one argument and no functions) is decidable.
- The Bernays–Schönfinkel class of first order formulas is also decidable.
Metalogical theorems of first-order theories
It is important to bear in mind that the results quoted above apply only to first-order logic, and not to first-order theories in general. The latter include not only logical axioms, but also non-logical ones, and theorems such as Gödel's completeness theorem do not apply to them. In many first-order theories, and in particular every first-order theory that includes the natural numbers, Gödel's incompleteness theorems show, for example, that there are true statements which cannot be proven.Translating natural language to first-order logic
Concepts expressed in natural language must be "translated" to first-order logic (FOL) before FOL can be used to address them, and there are a number of potential pitfalls in this translation. In FOL,
means "p, or q, or both", that is, it is
inclusive. In English, the word "or" is sometimes inclusive (e.g,
"cream or sugar?"), but sometimes it is exclusive (e.g., "coffee or tea?"
is usually intended to mean one or the other, not both).
Similarly, the English word "some" may mean "at least one, possibly all", but
other times it may mean "not all, possibly none".
The English word "and" should sometimes be translated as "or" (e.g.,
"men and women may apply").
[2]
Limitations of first-order logic
All mathematical notations have their strengths and weaknesses; here are a few such issues with first-order logic.Difficulty representing if-then-else
Oddly enough, FOL with equality (as typically defined) does not include or permit defining an if-then-else predicate or function if(c,a,b), where "c" is a condition expressed as a formula, while a and b are either both terms or both formulas, and its result would be "a" if c is true, and "b" if it is false. The problem is that in FOL, both predicates and functions can only accept terms ("non-booleans") as parameters, but the "obvious" representation of the condition is a formula ("boolean"). This is unfortunate, since many mathematical functions are conveniently expressed in terms of if-then-else, and if-then-else is fundamental for describing most computer programs.Mathematically, it is possible to redefine a complete set of new functions that match the formula operators, but this is quite clumsy.[3] A predicate if(c,a,b) can be expressed in FOL if rewritten as , but this is clumsy if the condition c is complex. Many extend FOL to add a special-case predicate named "if(condition, a, b)" (where a and b are formulas) and/or function "ite(condition, a, b)" (where a and b are terms), both of which accept a formula as the condition, and are equal to "a" if condition is true and "b" if it is false. These extensions make FOL easier to use for some problems, and make some kinds of automatic theorem-proving easier.[4] Others extend FOL further so that functions and predicates can accept both terms and formulas at any position.
Typing (Sorts)
FOL does not include types (sorts) into the notation itself, other than the difference between formulas ("booleans") and terms ("non-booleans"). Some argue that this lack of types is a great advantage [5], but many others find advantages in defining and using types (sorts), such as helping reject some erroneous or undesirable specifications[6]. Those who wish to indicate types must provide such information using the notation available in FOL. Doing so can make such expressions more complex, and can also be easy to get wrong.Single-parameter predicates can be used to implement the notion of types where appropriate. For example, in: , the predicate Man(x) could be considered a kind of "type assertion" (that is, that x must be a Man). Predicates can also be used with the "exists" quantifier to identify types, but this should usually be done with the "and" operator instead, e.g.: ("there exists something that is both a man and is mortal"). It is easy to write , but this would be equivalent to ("there is something that is not a man, and/or there exists something that is mortal"), which is usually not what was intended. Similarly, assertions can be made that one type is a subtype of another type, e.g.: ("for all x, if x is a man, then x is a mammal").
Difficulty in characterizing finiteness or countability
It follows from the Löwenheim–Skolem theorem that it is not possible to characterize finiteness or countability in first-order logic. For example, in first-order logic one cannot assert the least-upper-bound property for sets of real numbers, which states that every bounded, nonempty set of real numbers has a supremum; A second-order logic is needed for that.
Graph reachability cannot be expressed
Many situations can be modeled as a graph of nodes and directed connections (edges). For example, validating many systems requires showing that a "bad" state cannot be reached from a "good" state, and these interconnections of states can often be modelled as a graph. However, it can be proved that reachability cannot be fully expressed in predicate logic. In other words, there is no predicate-logic formula f, with u and v as its only free variables, and R as its only predicate symbol (of arity 2) such that f holds in a directed graph iff there is a path in that graph from the node associated to u to the node associated with v. [7]Comparison with other logics
- Typed first order logic allows variables and terms to have various types (or sorts). If there are only a finite number of types, this does not really differ much from first order logic, because one can describe the types with a finite number of unary predicates and a few axioms. Sometimes there is a special type Ω of truth values, in which case formulas are just terms of type Ω.
- First Order Logic with Domain Conditions adds domain conditions (DCs) to classical first-order logic, enabling the handling of partial functions; these conditions can be proven "on the side" in a manner similar to PVS's type correctness conditions. It also adds if-then-else to keep definitions and proofs manageable (they became too complex without them).[8]
- The SMT-LIB Standard defines a language used by many research groups for satisfiability modulo theories; the full logic is based on FOL with equality, but adds sorts (types), if-then-else for terms and formulas (ite() and if.. then.. else..), a let construct for terms and formulas (let and flet), and a distinct construct declaring a set of listed values as distinct. Its connectives are not, implies, and, or, xor, and iff.[9]
- Weak second-order logic allows quantification over finite subsets.
- Monadic second-order logic allows quantification over subsets, or in other words over unary predicates.
- Second-order logic allows quantification over subsets and relations, or in other words over all predicates. For example, the axiom of extensionality can be stated in second-order logic as x = y ≡def
P (P(x) ↔ P(y)). The strong semantics of second order logic give such sentences a much stronger meaning than first-order semantics.
- Higher order logics allows quantification over higher types than second-order logic permits. These higher types include relations between relations, functions from relations to relations between relations, etc.
- Intuitionistic first order logic uses intuitionistic rather than classical propositional calculus; for example, ¬¬φ need not be equivalent to φ.
- Modal logic has extra modal operators with meanings which can be characterised informally as, for example "it is necessary that φ" and "it is possible that φ".
- In monadic predicate calculus predicates are restricted to having only one argument.
- Infinitary logic allows infinitely long sentences. For example, one may allow a conjunction or disjunction of infinitely many formulas, or quantification over infinitely many variables. Infinitely long sentences arise in areas of mathematics including topology and model theory.
- First order logic with extra quantifiers has new quantifiers Qx,..., with meanings such as "there are many x such that ...". Also see branched quantification and the plural quantifiers of George Boolos and others.
- Predicate Logic with Definitions (PLD, or D-logic) modifies FOL by formally adding syntactic definitions as a type of value (in addition to formulas and terms); these definitions can be used inside terms and formulas[10].
- Independence-friendly logic is characterized by branching quantifiers, which allow one to express independence between quantified variables.
First order logic in which no atomic sentence lies in the scope of more than three quantifiers, has the same expressive power as the relation algebra of Tarski and Givant (1987). They also show that FOL with a primitive ordered pair, and a relation algebra including the two ordered pair projection relations, are equivalent.
Automation
Theorem proving for first-order logic is one of the most mature subfields of automated theorem proving. The logic is expressive enough to allow the specification of arbitrary problems, often in a reasonably natural and intuitive way. On the other hand, it is still semi-decidable, and a number of sound and complete calculi have been developed, enabling fully automated systems. In 1965 J. A. Robinson achieved an important breakthrough with his resolution approach; to prove a theorem it tries to refute the negated theorem, in a goal-directed way, resulting in a much more efficient method to automatically prove theorems in FOL. More expressive logics, such as higher order and modal logics, allow the convenient expression of a wider range of problems than first order logic, but theorem proving for these logics is less well developed.References
1. ^ For another well-worked example, see Metamath proof explorer
2. ^ Suber, Peter, Translation Tips, <[1] (retrieved on 2007-09-20)
3. ^ Otter Example if.in, <[2] (retrieved on 2007-09-21)
4. ^ Manna, Zohar (1974), Mathematical Theory of Computation, McGraw-Hill Computer Science Series, New York, New York: McGraw-Hill Book Company, pp. 77-147, ISBN 0-07-039910-7
5. ^ Leslie Lamport, Lawrence C. Paulson. Should Your Specification Language Be Typed? ACM Transactions on Programming Languages and Systems. 1998. [3]
6. ^ Rushby, John. Subtypes for Specification. 1997. Proceedings of the Sixth European Software Engineering Conference (ESEC/FSE 97). [4]
7. ^ Huth, Michael & Mark Ryan (2004), Logic in Computer Science, 2nd edition, pp. 138-139, ISBN 0-521-54310-X
8. ^ Freek Wiedijk and Jan Zwanenburg. "First Order Logic with Domain Conditions" In Theorem Proving in Higher Order Logics. Book Series "Lecture Notes in Computer Science". Springer Berlin / Heidelberg. ISSN 0302-9743 (Print) 1611-3349 (Online), Volume 2758/2003. ISBN 978-3-540-40664-8. [5]
9. ^ Ranise, Silvio and Cesare Tinelli. The SMT-LIB Standard: Version 1.2 Aug 30, 2006. [6]
10. ^ Makarov, Victor. "Predicate Logic with Definitions". [7]
2. ^ Suber, Peter, Translation Tips, <[1] (retrieved on 2007-09-20)
3. ^ Otter Example if.in, <[2] (retrieved on 2007-09-21)
4. ^ Manna, Zohar (1974), Mathematical Theory of Computation, McGraw-Hill Computer Science Series, New York, New York: McGraw-Hill Book Company, pp. 77-147, ISBN 0-07-039910-7
5. ^ Leslie Lamport, Lawrence C. Paulson. Should Your Specification Language Be Typed? ACM Transactions on Programming Languages and Systems. 1998. [3]
6. ^ Rushby, John. Subtypes for Specification. 1997. Proceedings of the Sixth European Software Engineering Conference (ESEC/FSE 97). [4]
7. ^ Huth, Michael & Mark Ryan (2004), Logic in Computer Science, 2nd edition, pp. 138-139, ISBN 0-521-54310-X
8. ^ Freek Wiedijk and Jan Zwanenburg. "First Order Logic with Domain Conditions" In Theorem Proving in Higher Order Logics. Book Series "Lecture Notes in Computer Science". Springer Berlin / Heidelberg. ISSN 0302-9743 (Print) 1611-3349 (Online), Volume 2758/2003. ISBN 978-3-540-40664-8. [5]
9. ^ Ranise, Silvio and Cesare Tinelli. The SMT-LIB Standard: Version 1.2 Aug 30, 2006. [6]
10. ^ Makarov, Victor. "Predicate Logic with Definitions". [7]
- Jon Barwise and John Etchemendy, 2000. Language Proof and Logic. CSLI (University of Chicago Press) and New York: Seven Bridges Press.
- David Hilbert and Wilhelm Ackermann 1950. Principles of Theoretical Logic (English translation). Chelsea. The 1928 first German edition was titled Grundzüge der theoretischen Logik.
- Wilfrid Hodges, 2001, "Classical Logic I: First Order Logic," in Lou Goble, ed., The Blackwell Guide to Philosophical Logic. Blackwell.
See also
- Automated theorem proving
- List of first-order theories
- List of rules of inference
- Mathematical logic
- Gödel's completeness theorem
- Gödel's incompleteness theorems
- Cylindric algebras
- Table of logic symbols
External links
- Stanford Encyclopedia of Philosophy: "Classical Logic -- by Stewart Shapiro. Covers syntax, model theory, and metatheory for first order logic in the natural deduction style.
- forall x: an introduction to formal logic, by P.D. Magnus, covers formal semantics and proof theory for first-order logic.
- Metamath: an ongoing online project to reconstruct mathematics as a huge first order theory, using first order logic and the axiomatic set theory ZFC. Principia Mathematica modernized and done right.
- Podnieks, Karl. Introduction to mathematical logic.
- Cambridge Mathematics Tripos Notes : "http://john.fremlin.de/schoolwork/logic/index.html -- Notes typeset by John Fremlin. The notes cover part of a past Cambridge Mathematics Tripos course taught to undergraduates students (usually) within their third year. The course is entitled "Logic, Computation and Set Theory" and covers Ordinals and cardinals, Posets and zorn’s Lemma, Propositional logic, Predicate logic, Set theory and Consistency issues related to ZFC and other set theories.
Deduction can refer to one of the following usages:
..... Click the link for more information.
- Deductive reasoning, inference in which the conclusion is of no greater generality than the premises
- Natural deduction, an approach to proof theory that attempts to provide a formal model of logical reasoning as it
..... Click the link for more information.
In logic and mathematics, a propositional calculus (or a sentential calculus) is a formal system in which formulas representing propositions can be formed by combining atomic propositions using logical connectives, and a system of formal proof rules
..... Click the link for more information.
..... Click the link for more information.
In logic and mathematics, a propositional calculus (or a sentential calculus) is a formal system in which formulas representing propositions can be formed by combining atomic propositions using logical connectives, and a system of formal proof rules
..... Click the link for more information.
..... Click the link for more information.
predicate is either a relation or the boolean-valued function that amounts to the characteristic function or the indicator function of such a relation.
A function P: X→ is called a predicate on X. When P is a predicate on X, we sometimes say P is a property of X.
..... Click the link for more information.
A function P: X→ is called a predicate on X. When P is a predicate on X, we sometimes say P is a property of X.
..... Click the link for more information.
In logic and mathematics, a propositional calculus (or a sentential calculus) is a formal system in which formulas representing propositions can be formed by combining atomic propositions using logical connectives, and a system of formal proof rules
..... Click the link for more information.
..... Click the link for more information.
axioms of the theory. Give a set of σ-structures, and define a theory to be the set of sentences in Lσ holding in all these models. For example, the "theory of finite fields" consists of all sentences in the language of fields that are true in
..... Click the link for more information.
..... Click the link for more information.
axiom is a sentence or proposition that is not proved or demonstrated and is considered as self-evident or as an initial necessary consensus for a theory building or acceptation.
..... Click the link for more information.
..... Click the link for more information.
recursively enumerable, computably enumerable, semidecidable or provable if:
..... Click the link for more information.
- There is an algorithm that, when given an input number, eventually halts if and only if the input is an element of S.
..... Click the link for more information.
In mathematical logic, the Peano axioms, also known as the Dedekind-Peano axioms or the Peano postulates, are a set of axioms for the natural numbers presented by the 19th century Italian mathematician Giuseppe Peano.
..... Click the link for more information.
..... Click the link for more information.
In logic, given some formal language L and a set Form(L) (the set of well-formed formulas of L) closed under the rules of the grammar (or syntax) for L, there may be an interpretation intended to apply to some privileged subset Con(L) of the connectives of L.
..... Click the link for more information.
..... Click the link for more information.
In logic and mathematics second-order logic is an extension of first-order logic, which itself is an extension of propositional logic.[1] Second-order logic is in turn extended by higher-order logic and type theory.
..... Click the link for more information.
..... Click the link for more information.
In logic and mathematics, a propositional calculus (or a sentential calculus) is a formal system in which formulas representing propositions can be formed by combining atomic propositions using logical connectives, and a system of formal proof rules
..... Click the link for more information.
..... Click the link for more information.
A syllogism (Greek: συλλογισμός — "conclusion," "inference"), (usually the categorical syllogism
..... Click the link for more information.
..... Click the link for more information.
In logic and mathematics, a propositional calculus (or a sentential calculus) is a formal system in which formulas representing propositions can be formed by combining atomic propositions using logical connectives, and a system of formal proof rules
..... Click the link for more information.
..... Click the link for more information.
WFF (pronounced "wiff") is an abbreviation for well-formed formula. Given a formal grammar, a WFF is any string that is generated by that grammar. To say that a string is a WFF with respect to a given formal grammar is equivalent to saying that belongs to the language
..... Click the link for more information.
..... Click the link for more information.
In logic, a rule of inference is a function from sets of formulae to formulae. The argument is called the premise set (or simply premises) and the value the conclusion.
..... Click the link for more information.
..... Click the link for more information.
countable set is a set with the same cardinality (i.e., number of elements) as some subset of the set of natural numbers. The term was originated by Georg Cantor; it stems from the fact that the natural numbers are often called counting numbers.
..... Click the link for more information.
..... Click the link for more information.
logical axioms versus non-logical ones is based on an idea of separating
..... Click the link for more information.
- factors responsible for a “general” underlying “apparatus?
- from the “specific”, “professional” ones responsible for the peculiarities of the
..... Click the link for more information.
In mathematical logic, the Peano axioms, also known as the Dedekind-Peano axioms or the Peano postulates, are a set of axioms for the natural numbers presented by the 19th century Italian mathematician Giuseppe Peano.
..... Click the link for more information.
..... Click the link for more information.
The set of non-logical symbols is that part of a first-order language which grants its “area of specialty” (rather than its overall expressiveness). An analogous idea can be seen at the distinction of logical axioms from the “own”, “professional”
..... Click the link for more information.
..... Click the link for more information.
In mathematical logic, the Peano axioms, also known as the Dedekind-Peano axioms or the Peano postulates, are a set of axioms for the natural numbers presented by the 19th century Italian mathematician Giuseppe Peano.
..... Click the link for more information.
..... Click the link for more information.
In mathematics, axiomatic set theory is a rigorous reformulation of set theory in first-order logic created to address paradoxes in naive set theory. The basis of set theory was created principally by the German mathematician Georg Cantor at the end of the 19th century.
..... Click the link for more information.
..... Click the link for more information.
In logic, mathematics, and computer science, the arity (synonyms include type, adicity, and rank) of a function or operation is the number of arguments or operands that the function takes.
..... Click the link for more information.
..... Click the link for more information.
In logic and mathematics, a propositional calculus (or a sentential calculus) is a formal system in which formulas representing propositions can be formed by combining atomic propositions using logical connectives, and a system of formal proof rules
..... Click the link for more information.
..... Click the link for more information.
In mathematics, an ordered pair is a collection of two not necessarily distinct objects, one of which is distinguished as the first coordinate (or first entry or
..... Click the link for more information.
..... Click the link for more information.
Arithmetic or arithmetics (from the Greek word αριθμός = number) is the oldest and most elementary branch of mathematics, used by almost everyone, for tasks ranging from simple day-to-day counting to advanced science and business
..... Click the link for more information.
..... Click the link for more information.
Set theory is the mathematical theory of sets, which represent collections of abstract objects. It encompasses the everyday notions, introduced in primary school, often as Venn diagrams, of collections of objects, and the elements of, and membership in, such collections.
..... Click the link for more information.
..... Click the link for more information.
empty set is the unique set which contains no elements. In axiomatic set theory it is postulated to exist by the axiom of empty set. The empty set is also sometimes called the null set
..... Click the link for more information.
..... Click the link for more information.
Arithmetic or arithmetics (from the Greek word αριθμός = number) is the oldest and most elementary branch of mathematics, used by almost everyone, for tasks ranging from simple day-to-day counting to advanced science and business
..... Click the link for more information.
..... Click the link for more information.
Set theory is the mathematical theory of sets, which represent collections of abstract objects. It encompasses the everyday notions, introduced in primary school, often as Venn diagrams, of collections of objects, and the elements of, and membership in, such collections.
..... Click the link for more information.
..... Click the link for more information.
This article is copied from an article on Wikipedia.org - the free encyclopedia created and edited by online user community. The text was not checked or edited by anyone on our staff. Although the vast majority of the wikipedia encyclopedia articles provide accurate and timely information please do not assume the accuracy of any particular article. This article is distributed under the terms of GNU Free Documentation License.
Herod_Archelaus





