isabelle: comparison doc-src/Logics/defining.tex


-:595fda4879b6
+:493308514ea8
 &$|$& {\tt!!} $idts$ {\tt.} $prop$ & (0) \\\\
 $logic$ &=& $prop$ ~~$|$~~ $fun$ \\\\
 $aprop$ &=& $id$ ~~$|$~~ $var$
 ~~$|$~~ $fun@{max_pri}$ {\tt(} $logic$ {\tt,} \dots {\tt,} $logic$ {\tt)} \\\\
 $fun$ &=& $id$ ~~$|$~~ $var$ ~~$|$~~ {\tt(} $fun$ {\tt)} \\
+&$|$& $fun@{max_pri}$ {\tt(} $logic$ {\tt,} \dots {\tt,} $logic$ {\tt)} \\
+&$|$& $fun@{max_pri}$ {\tt::} $type$ \\
 &$|$& \ttindex{\%} $idts$ {\tt.} $logic$ & (0) \\\\
 $idts$ &=& $idt$ ~~$|$~~ $idt@1$ $idts$ \\\\
 $idt$ &=& $id$ ~~$|$~~ {\tt(} $idt$ {\tt)} \\
 &$|$& $id$ \ttindex{::} $type$ & (0) \\\\
-$type$ &=& $tfree$ ~~$|$~~ $tvar$ \\
+$type$ &=& $tfree$ ~~$|$~~ $tvar$ ~~$|$~~ $tfree$ {\tt::} $sort$
-&$|$& $tfree$ {\tt::} $sort$ ~~$|$~~ $tvar$ {\tt::} $sort$ \\
+~~$|$~~ $tvar$ {\tt::} $sort$ \\
 &$|$& $id$ ~~$|$~~ $type@{max_pri}$ $id$
 ~~$|$~~ {\tt(} $type$ {\tt,} \dots {\tt,} $type$ {\tt)} $id$ \\
 &$|$& $type@1$ \ttindex{=>} $type$ & (0) \\
 &$|$& {\tt[}  $type$ {\tt,} \dots {\tt,} $type$ {\tt]} {\tt=>} $type$&(0)\\
 &$|$& {\tt(} $type$ {\tt)} \\\\
 \item[$fun$] Terms potentially of function type.
 \item[$type$] Meta-types.
-\item[$idts$] a list of identifiers, possibly constrained by types. Note
+\item[$idts$] A list of identifiers, possibly constrained by types. Note
 that \verb|x :: nat y| is parsed as \verb|x :: (nat y)|, i.e.\ {\tt y}
 would be treated like a type constructor applied to {\tt nat}.
 \end{description}
 Isabelle is concerned with mathematical languages which have a certain
 minimal vocabulary: identifiers, variables, parentheses, and the lambda
 calculus. Logical types, i.e.\ those of class $logic$, are automatically
 equipped with this basic syntax. More precisely, for any type constructor
-$ty$ with arity $(\dots)c$, where $c$ is a subclass of $logic$, the following
+$ty$ with arity $(\vec{s})c$, where $c$ is a subclass of $logic$, the
-productions are added:
+following productions are added:
 \begin{center}
 \begin{tabular}{rclc}
 $ty$ &=& $id$ ~~$|$~~ $var$ ~~$|$~~ {\tt(} $ty$ {\tt)} \\
 &$|$& $fun@{max_pri}$ {\tt(} $logic$ {\tt,} \dots {\tt,} $logic$ {\tt)}\\
 &$|$& $ty@{max_pri}$ {\tt::} $type$\\\\
 \end{ttbox}
 {\tt Syntax.print_syntax} shows virtually all information contained in a
 syntax, therefore being quite verbose. Its output is divided into labeled
 sections. The syntax proper is represented by {\tt lexicon}, {\tt roots} and
 {\tt prods}. The rest refers to the manifold facilities to apply syntactic
-translations (macro expansion etc.). See \S\ref{sec:macros} and
+translations (macro expansion etc.).
-\S\ref{sec:tr_funs} for more details on translations.
 To simplify coping with the verbosity of {\tt Syntax.print_syntax}, there are
 \ttindex{Syntax.print_gram} to print the syntax proper only and
 \ttindex{Syntax.print_trans} to print the translation related information
 only.
 \item $root_of_type((\tau@1, \dots, \tau@n)ty) = ty$.
 \item $root_of_type(\alpha) = \mtt{logic}$.
 \end{itemize}
-Thereby $\tau@1, \dots, \tau@n$ are types, $ty$ a type infix or ordinary
+Thereby $\tau@1, \dots, \tau@n$ are types, $ty$ an infix or ordinary type
-type constructor and $\alpha$ a type variable or unknown. Note that only
+constructor and $\alpha$ a type variable or unknown. Note that only the
-the outermost type constructor is taken into account.
+outermost type constructor is taken into account.
 \item[\ttindex{prods}]
 The list of productions describing the precedence grammar. Nonterminals
-$A@n$ are rendered in ASCII as {\tt $A$[$n$]}, literal tokens are quoted.
+$A@n$ are rendered in {\sc ascii} as {\tt $A$[$n$]}, literal tokens are
-Note that some productions have strings attached after an {\tt =>}. These
+quoted. Some productions have strings attached after an {\tt =>}. These
 strings later become the heads of parse trees, but they also play a vital
 role when terms are printed (see \S\ref{sec:asts}).
 Productions which do not have strings attached and thus do not create a
 new parse tree node are called {\bf copy productions}\indexbold{copy
 {\rm is written as} ("_constrain" ("_abs" x t) ("fun" 'a 'b))
 \end{ttbox}
 Note that {\tt ()} and {\tt (f)} are both illegal.
-The resemblance of LISP's S-expressions is intentional, but notice the two
+The resemblance of Lisp's S-expressions is intentional, but notice the two
 kinds of atomic symbols: $\Constant x$ and $\Variable x$. This distinction
 has some more obscure reasons and you can ignore it about half of the time.
 You should not take the names ``{\tt Constant}'' and ``{\tt Variable}'' too
 literally. In the later translation to terms, $\Variable x$ may become a
 constant, free or bound variable, even a type constructor or class name; the
 Forms like {\tt (("_abs" x $t$) $u$)} are perfectly legal, but asts are not
 higher-order: the {\tt "_abs"} does not yet bind the {\tt x} in any way,
 though later at the term level, {\tt ("_abs" x $t$)} will become an {\tt Abs}
 node and occurrences of {\tt x} in $t$ will be replaced by {\tt Bound}s. Even
-if non constant heads of applications may seem unusual, asts should be
+if non-constant heads of applications may seem unusual, asts should be
 regarded as first-order. Purists may think of ${\tt (} f~x@1~\ldots~x@n{\tt
 )}$ as a first-order application of some invisible $(n+1)$-ary constant.
 \subsection{Parse trees to asts}
 Some of these examples illustrate why further translations are desirable in
 order to provide some nice standard form of asts before macro expansion takes
 place. Hence the Pure syntax provides predefined parse ast
 translations\index{parse ast translation!of Pure} for ordinary applications,
-type applications, nested abstraction, meta implication and function types.
+type applications, nested abstractions, meta implications and function types.
 Their net effect on some representative input strings is shown in
 Figure~\ref{fig:parse_ast_tr}.
 \begin{figure}[htb]
 \begin{center}
 \item $constrain(x, \tau) = \Appl{\Constant \mtt{"_constrain"}, x, ty}$,
 where $ty$ is the ast encoding of $\tau$. That is: type constructors as
 {\tt Constant}s, type variables as {\tt Variable}s and type applications
 as {\tt Appl}s with the head type constructor as first element.
 Additionally, if \ttindex{show_sorts} is set to {\tt true}, some type
-variables are decorated with an ast encoding their sort.
+variables are decorated with an ast encoding of their sort.
 \end{itemize}
 \medskip
 After an ast has been normalized wrt.\ the print macros, it is transformed
 into the final output string. The built-in {\bf print ast
 non-constant head or without a corresponding production are printed as
 $f(x@1, \ldots, x@l)$ or $(\alpha@1, \ldots, \alpha@l) ty$. A single
 $\Variable x$ is simply printed as $x$.
 Note that the system does {\em not\/} insert blanks automatically. They
-should be part of the mixfix declaration (which provide the user interface
+should be part of the mixfix declaration the production has been derived
-for defining syntax) if they are required to separate tokens. Mixfix
+from, if they are required to separate tokens. Mixfix declarations may also
-declarations may also contain pretty printing annotations. See
+contain pretty printing annotations.
-\S\ref{sec:mixfix} for details.
 \section{Mixfix declarations} \label{sec:mixfix}
 translation into the abstract syntax and a pretty printing scheme, all in
 one. Isabelle syntax definitions are inspired by \OBJ's~\cite{OBJ} {\em
 mixfix\/} syntax. Each mixfix annotation defines a precedence grammar
 production and optionally associates a constant with it.
-There is a general form of mixfix annotation exhibiting the full power of
+There is a general form of mixfix annotation and some shortcuts for common
-extending a theory's syntax, and some shortcuts for common cases like infix
+cases like infix operators.
-operators.
 The general \bfindex{mixfix declaration} as it may occur within the {\tt
 consts} section\index{consts section@{\tt consts} section} of a {\tt .thy}
 file, specifies a constant declaration and a grammar production at the same
 time. It has the form {\tt $c$ ::\ "$\tau$" ("$sy$" $ps$ $p$)} and is
 "-" :: "exp => exp"         ("- _" [3] 3)
 end
 \end{ttbox}
 Note that the {\tt arities} declaration causes {\tt exp} to be added to the
 syntax' roots. If you put the above text into a file {\tt exp.thy} and load
-it via {\tt use_thy "exp"}, you can run some tests:
+it via {\tt use_thy "EXP"}, you can run some tests:
 \begin{ttbox}
 val read_exp = Syntax.test_read (syn_of EXP.thy) "exp";
 read_exp "0 * 0 * 0 * 0 + 0 + 0 + 0";
 {\out tokens: "0" "*" "0" "*" "0" "*" "0" "+" "0" "+" "0" "+" "0"}
 {\out raw: ("+" ("+" ("+" ("*" "0" ("*" "0" ("*" "0" "0"))) "0") "0") "0")}
 "op \(c\)" ::\ "\(\tau\)"   ("(_ \(c\)/ _)" [\(p + 1\), \(p\)] \(p\))
 \end{ttbox}
 Thus, prefixing infixes with \ttindex{op} makes them behave like ordinary
 function symbols. Special characters occurring in $c$ have to be escaped as
-in delimiters. Also note that the expanded forms above are illegal at the
+in delimiters. Also note that the expanded forms above would actually be
-user level because of duplicate declarations of constants.
+illegal at the user level because of duplicate declarations of constants.
 \subsection{Binders}
 A \bfindex{binder} is a variable-binding construct, such as a
 with {\tt\at} to stress their pure syntactic purpose; they should never occur
 within the final well-typed terms. Another consequence is that the user
 cannot refer to such names directly, since they are not legal identifiers.
 The translations cause the replacement of external forms by internal forms
-after parsing and before printing of terms.
+after parsing, and vice versa before printing of terms.
 \end{example}
 This is only a very simple but common instance of a more powerful mechanism.
 As a specification of what is to be translated, it should be comprehensible
 without further explanations. But there are also some snags and other
 $string$}.
 \end{center}
 This specifies a \rmindex{parse rule} ({\tt =>}), a \rmindex{print rule} ({\tt
 <=}) or both ({\tt ==}). The two $string$s preceded by optional parenthesized
-$root$s denote the left-hand and right-hand side of the rule as 'source
+$root$s denote the left-hand and right-hand side of the rule as `source
 code', i.e.\ in the usual syntax of terms.
 Rules are internalized wrt.\ an intermediate signature that is obtained from
 the parent theories' ones by adding all material of all sections preceding
-{\tt translations} in the {\tt .thy} file, especially new syntax defined in
+{\tt translations} in the {\tt .thy} file. In particular, new syntax defined in
 {\tt consts} is already effective.
 Then part of the process that transforms input strings into terms is applied:
 lexing, parsing and parse ast translations (see \S\ref{sec:asts}). Macros
 specified in the parents are {\em not\/} expanded. Also note that the lexer
 should be treated as constants during matching (see below). These names are
 extracted from all class, type and constant declarations made so far.
 \medskip
 The result is two lists of translation rules in internal form, that is pairs
-of asts. They can be viewed using {\tt Syntax.print_syntax}
+of asts. They can be viewed using {\tt Syntax.print_syntax} (sections
-(\ttindex{parse_rules} and \ttindex{print_rules}). For {\tt SET} of
+\ttindex{parse_rules} and \ttindex{print_rules}). For {\tt SET} of
 Example~\ref{ex:set_trans} these are:
 \begin{ttbox}
 parse_rules:
 ("{\at}Collect" x A P)  ->  ("Collect" A ("_abs" x P))
 ("{\at}Replace" y x A Q)  ->  ("Replace" A ("_abs" x ("_abs" y Q)))
 \subsection{Applying rules}
 In the course of parsing and printing terms, asts are generated as an
 intermediate form as pictured in Figure~\ref{fig:parse_print}. These asts are
 normalized wrt.\ the given lists of translation rules in a uniform manner. As
-stated earlier, asts are supposed to be first-order 'terms'. The rewriting
+stated earlier, asts are supposed to be first-order `terms'. The rewriting
 systems derived from {\tt translations} sections essentially resemble
 traditional first-order term rewriting systems. We first examine how a single
 rule is applied.
 Let $t$ be the ast to be normalized and $(l, r)$ some translation rule. A
 \medskip
 Having first-order matching in mind, the second case of $match$ may look a
 bit odd. But this is exactly the place where {\tt Variable}s of non-rule
 asts behave like {\tt Constant}s. The deeper meaning of this is related to
-asts being very 'primitive' in some sense, ignorant of the underlying
+asts being very `primitive' in some sense, ignorant of the underlying
-'semantics', not far removed from parse trees. At this level it is not yet
+`semantics', not far removed from parse trees. At this level it is not yet
 known which $id$s will become constants, bounds, frees, types or classes. As
 $ast_of_pt$ (see \S\ref{sec:asts}) shows, former parse tree heads appear in
 asts as {\tt Constant}s, while $id$s, $var$s, $tfree$s and $tvar$s become
 {\tt Variable}s.
 rewriting systems, but this would often complicate things unnecessarily.
 Therefore, we reveal part of the actual rewriting strategy: The normalizer
 always applies the first matching rule reducing an unspecified redex chosen
 first.
-Thereby, 'first rule' is roughly speaking meant wrt.\ the appearance of the
+Thereby, `first rule' is roughly speaking meant wrt.\ the appearance of the
 rules in the {\tt translations} sections. But this is more tricky than it
 seems: If a given theory is {\em extended}, new rules are simply appended to
 the end. But if theories are {\em merged}, it is not clear which list of
 rules has priority over the other. In fact the merge order is left
 unspecified. This shouldn't cause any problems in practice, since
 \verb|%empty insert. {x}|. This problem arises, because the ast rewriter
 cannot discern constants, frees, bounds etc.\ and looks only for names of
 atoms.
 Thus the names of {\tt Constant}s occurring in the (internal) left-hand side
-of translation rules should be regarded as 'reserved keywords'. It is good
+of translation rules should be regarded as `reserved keywords'. It is good
 practice to choose non-identifiers here like {\tt\at Finset} or sufficiently
 long and strange names.
 \end{example}
 \begin{example} \label{ex:prod_trans}
 Now the second parse rule is where the trick comes in: {\tt _K(B)} is
 introduced during ast rewriting, which later becomes \verb|%x.B| due to a
 parse translation associated with \ttindex{_K}. Note that a leading {\tt _}
 in $id$s is allowed in translation rules, but not in ordinary terms. This
-special behaviour of the lexer is very useful for 'forging' asts containing
+special behaviour of the lexer is very useful for `forging' asts containing
-names that are not directly accessible normally. Unfortunately, there is no
+names that are not directly accessible normally.
-such trick for printing, so we have to add a {\tt ML} section for the print
-translation \ttindex{dependent_tr'}.
+Unfortunately, there is no such trick for printing, so we have to add a {\tt
+ML} section for the print translation \ttindex{dependent_tr'}.
 The parse translation for {\tt _K} is already installed in Pure, and {\tt
 dependent_tr'} is exported by the syntax module for public use. See
 \S\ref{sec:tr_funs} for more of the arcane lore of translation functions.
 \end{example}
 also Figure~\ref{fig:parse_print}): Whenever --- during the transformations
 between parse trees, asts and terms --- a combination of the form
 $(\mtt"c\mtt"~x@1 \ldots x@n)$ is encountered, and a translation function $f$
 of appropriate kind exists for $c$, the result will be $f \mtt[ x@1, \ldots,
 x@n \mtt]$. Thereby, $x@1, \ldots, x@n$ (with $n \ge 0$) are asts for ast
-translations and terms for term translations. A 'combination' at ast level is
+translations and terms for term translations. A `combination' at ast level is
 of the form $\Constant c$ or $\Appl{\Constant c, x@1, \ldots, x@n}$, and at
 term level $\ttfct{Const} (c, \tau)$ or $\ttfct{Const} (c, \tau) \ttrel{\$}
 x@1 \ttrel{\$} \dots \ttrel{\$} x@n$.
 \medskip
 (ast) translations more fundamentally:
 \begin{description}
 \item[Parse (ast) translations] are applied bottom-up, i.e.\ the arguments
 supplied ($x@1, \ldots, x@n$ above) are already in translated form.
 Additionally, they may not fail; exceptions are re-raised after printing
-of an error message.
+an error message.
 \item[Print (ast) translations] are applied top-down, i.e.\ supplied with
 arguments that are partly still in internal form. The result is again fed
 into the translation machinery as a whole. Therefore a print (ast)
 translation should not introduce as head a constant of the same name that
 body $B$ with {\tt Bound}s referring to our {\tt Abs} node replaced by
 $\ttfct{Free} (x', \mtt{dummyT})$.
 We have to be more careful with types here. While types of {\tt Const}s are
 completely ignored, type constraints may be printed for some {\tt Free}s and
-{\tt Var}s (only if \ttindex{show_types} is set to {\tt true}). Variables of
+{\tt Var}s (if \ttindex{show_types} is set to {\tt true}). Variables of type
-type \ttindex{dummyT} are never printed with constraint, though. Thus, a
+\ttindex{dummyT} are never printed with constraint, though. Thus, a
 constraint of $x'$ may only appear at its binding place, since {\tt Free}s of
 $B'$ replacing the appropriate {\tt Bound}s of $B$ via \ttindex{variant_abs}
-have all type {\tt dummyT}.
+have all type {\tt dummyT}. \end{example}
-\end{example}
 \section{Example: some minimal logics} \label{sec:min_logics}
 Trueprop :: "o => prop"   ("_" 5)
 end
 \end{ttbox}
 The constant {\tt Trueprop} (the name is arbitrary) acts as an invisible
 coercion function. Assuming this definition resides in a file {\tt base.thy},
-you have to load it with the command {\tt use_thy "base"}.
+you have to load it with the command {\tt use_thy "Base"}.
 One of the simplest nontrivial logics is {\em minimal logic\/} of
 implication. Its definition in Isabelle needs no advanced features but
 illustrates the overall mechanism quite nicely:
 \begin{ttbox}

changeset 135	493308514ea8
parent 108	e332c5bf9e1f
child 142	6dfae8cddec7