| author | wenzelm | 
| Fri, 20 Jun 2014 20:47:22 +0200 | |
| changeset 57346 | 1d6d44a0583f | 
| parent 56582 | f05b7d6ec592 | 
| child 57478 | fa14d60a8cca | 
| permissions | -rw-r--r-- | 
| 28762 | 1 | theory Inner_Syntax | 
| 42651 | 2 | imports Base Main | 
| 28762 | 3 | begin | 
| 4 | ||
| 28778 | 5 | chapter {* Inner syntax --- the term language \label{ch:inner-syntax} *}
 | 
| 28762 | 6 | |
| 46282 | 7 | text {* The inner syntax of Isabelle provides concrete notation for
 | 
| 8 |   the main entities of the logical framework, notably @{text
 | |
| 9 | "\<lambda>"}-terms with types and type classes. Applications may either | |
| 10 | extend existing syntactic categories by additional notation, or | |
| 11 | define new sub-languages that are linked to the standard term | |
| 12 |   language via some explicit markers.  For example @{verbatim
 | |
| 13 |   FOO}~@{text "foo"} could embed the syntax corresponding for some
 | |
| 14 |   user-defined nonterminal @{text "foo"} --- within the bounds of the
 | |
| 15 | given lexical syntax of Isabelle/Pure. | |
| 16 | ||
| 17 | The most basic way to specify concrete syntax for logical entities | |
| 18 |   works via mixfix annotations (\secref{sec:mixfix}), which may be
 | |
| 19 | usually given as part of the original declaration or via explicit | |
| 20 |   notation commands later on (\secref{sec:notation}).  This already
 | |
| 21 | covers many needs of concrete syntax without having to understand | |
| 22 | the full complexity of inner syntax layers. | |
| 23 | ||
| 24 | Further details of the syntax engine involves the classical | |
| 25 | distinction of lexical language versus context-free grammar (see | |
| 26 |   \secref{sec:pure-syntax}), and various mechanisms for \emph{syntax
 | |
| 48113 | 27 |   transformations} (see \secref{sec:syntax-transformations}).
 | 
| 46282 | 28 | *} | 
| 29 | ||
| 30 | ||
| 28762 | 31 | section {* Printing logical entities *}
 | 
| 32 | ||
| 46284 | 33 | subsection {* Diagnostic commands \label{sec:print-diag} *}
 | 
| 28762 | 34 | |
| 35 | text {*
 | |
| 36 |   \begin{matharray}{rcl}
 | |
| 28766 
accab7594b8e
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28765diff
changeset | 37 |     @{command_def "typ"}@{text "\<^sup>*"} & : & @{text "context \<rightarrow>"} \\
 | 
| 28762 | 38 |     @{command_def "term"}@{text "\<^sup>*"} & : & @{text "context \<rightarrow>"} \\
 | 
| 39 |     @{command_def "prop"}@{text "\<^sup>*"} & : & @{text "context \<rightarrow>"} \\
 | |
| 28766 
accab7594b8e
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28765diff
changeset | 40 |     @{command_def "thm"}@{text "\<^sup>*"} & : & @{text "context \<rightarrow>"} \\
 | 
| 28762 | 41 |     @{command_def "prf"}@{text "\<^sup>*"} & : & @{text "context \<rightarrow>"} \\
 | 
| 42 |     @{command_def "full_prf"}@{text "\<^sup>*"} & : & @{text "context \<rightarrow>"} \\
 | |
| 52430 | 43 |     @{command_def "print_state"}@{text "\<^sup>*"} & : & @{text "any \<rightarrow>"} \\
 | 
| 28762 | 44 |   \end{matharray}
 | 
| 45 | ||
| 46 | These diagnostic commands assist interactive development by printing | |
| 47 | internal logical entities in a human-readable fashion. | |
| 48 | ||
| 55112 
b1a5d603fd12
prefer rail cartouche -- avoid back-slashed quotes;
 wenzelm parents: 
55108diff
changeset | 49 |   @{rail \<open>
 | 
| 48792 | 50 |     @@{command typ} @{syntax modes}? @{syntax type} ('::' @{syntax sort})?
 | 
| 28762 | 51 | ; | 
| 42596 
6c621a9d612a
modernized rail diagrams using @{rail} antiquotation;
 wenzelm parents: 
42358diff
changeset | 52 |     @@{command term} @{syntax modes}? @{syntax term}
 | 
| 28762 | 53 | ; | 
| 42596 
6c621a9d612a
modernized rail diagrams using @{rail} antiquotation;
 wenzelm parents: 
42358diff
changeset | 54 |     @@{command prop} @{syntax modes}? @{syntax prop}
 | 
| 28762 | 55 | ; | 
| 42596 
6c621a9d612a
modernized rail diagrams using @{rail} antiquotation;
 wenzelm parents: 
42358diff
changeset | 56 |     @@{command thm} @{syntax modes}? @{syntax thmrefs}
 | 
| 28762 | 57 | ; | 
| 42596 
6c621a9d612a
modernized rail diagrams using @{rail} antiquotation;
 wenzelm parents: 
42358diff
changeset | 58 |     ( @@{command prf} | @@{command full_prf} ) @{syntax modes}? @{syntax thmrefs}?
 | 
| 28762 | 59 | ; | 
| 52430 | 60 |     @@{command print_state} @{syntax modes}?
 | 
| 28762 | 61 | ; | 
| 42596 
6c621a9d612a
modernized rail diagrams using @{rail} antiquotation;
 wenzelm parents: 
42358diff
changeset | 62 |     @{syntax_def modes}: '(' (@{syntax name} + ) ')'
 | 
| 55112 
b1a5d603fd12
prefer rail cartouche -- avoid back-slashed quotes;
 wenzelm parents: 
55108diff
changeset | 63 | \<close>} | 
| 28762 | 64 | |
| 65 |   \begin{description}
 | |
| 66 | ||
| 48792 | 67 |   \item @{command "typ"}~@{text \<tau>} reads and prints a type expression
 | 
| 68 | according to the current context. | |
| 69 | ||
| 70 |   \item @{command "typ"}~@{text "\<tau> :: s"} uses type-inference to
 | |
| 71 |   determine the most general way to make @{text "\<tau>"} conform to sort
 | |
| 72 |   @{text "s"}.  For concrete @{text "\<tau>"} this checks if the type
 | |
| 73 |   belongs to that sort.  Dummy type parameters ``@{text "_"}''
 | |
| 74 | (underscore) are assigned to fresh type variables with most general | |
| 75 | sorts, according the the principles of type-inference. | |
| 28766 
accab7594b8e
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28765diff
changeset | 76 | |
| 
accab7594b8e
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28765diff
changeset | 77 |   \item @{command "term"}~@{text t} and @{command "prop"}~@{text \<phi>}
 | 
| 
accab7594b8e
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28765diff
changeset | 78 | read, type-check and print terms or propositions according to the | 
| 
accab7594b8e
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28765diff
changeset | 79 |   current theory or proof context; the inferred type of @{text t} is
 | 
| 
accab7594b8e
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28765diff
changeset | 80 | output as well. Note that these commands are also useful in | 
| 
accab7594b8e
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28765diff
changeset | 81 | inspecting the current environment of term abbreviations. | 
| 28762 | 82 | |
| 83 |   \item @{command "thm"}~@{text "a\<^sub>1 \<dots> a\<^sub>n"} retrieves
 | |
| 84 | theorems from the current theory or proof context. Note that any | |
| 85 | attributes included in the theorem specifications are applied to a | |
| 86 | temporary context derived from the current theory or proof; the | |
| 87 |   result is discarded, i.e.\ attributes involved in @{text "a\<^sub>1,
 | |
| 88 | \<dots>, a\<^sub>n"} do not have any permanent effect. | |
| 89 | ||
| 90 |   \item @{command "prf"} displays the (compact) proof term of the
 | |
| 91 | current proof state (if present), or of the given theorems. Note | |
| 92 | that this requires proof terms to be switched on for the current | |
| 93 | object logic (see the ``Proof terms'' section of the Isabelle | |
| 94 | reference manual for information on how to do this). | |
| 95 | ||
| 96 |   \item @{command "full_prf"} is like @{command "prf"}, but displays
 | |
| 97 | the full proof term, i.e.\ also displays information omitted in the | |
| 98 |   compact proof term, which is denoted by ``@{text _}'' placeholders
 | |
| 99 | there. | |
| 100 | ||
| 52430 | 101 |   \item @{command "print_state"} prints the current proof state (if
 | 
| 102 | present), including current facts and goals. | |
| 28766 
accab7594b8e
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28765diff
changeset | 103 | |
| 28762 | 104 |   \end{description}
 | 
| 105 | ||
| 106 |   All of the diagnostic commands above admit a list of @{text modes}
 | |
| 42926 | 107 | to be specified, which is appended to the current print mode; see | 
| 46284 | 108 |   also \secref{sec:print-modes}.  Thus the output behavior may be
 | 
| 109 | modified according particular print mode features. For example, | |
| 52430 | 110 |   @{command "print_state"}~@{text "(latex xsymbols)"} prints the
 | 
| 111 | current proof state with mathematical symbols and special characters | |
| 46284 | 112 |   represented in {\LaTeX} source, according to the Isabelle style
 | 
| 28762 | 113 |   \cite{isabelle-sys}.
 | 
| 114 | ||
| 115 |   Note that antiquotations (cf.\ \secref{sec:antiq}) provide a more
 | |
| 116 | systematic way to include formal items into the printed text | |
| 117 | document. | |
| 118 | *} | |
| 119 | ||
| 120 | ||
| 28763 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 121 | subsection {* Details of printed content *}
 | 
| 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 122 | |
| 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 123 | text {*
 | 
| 42655 | 124 |   \begin{tabular}{rcll}
 | 
| 49699 | 125 |     @{attribute_def show_markup} & : & @{text attribute} \\
 | 
| 42655 | 126 |     @{attribute_def show_types} & : & @{text attribute} & default @{text false} \\
 | 
| 127 |     @{attribute_def show_sorts} & : & @{text attribute} & default @{text false} \\
 | |
| 128 |     @{attribute_def show_consts} & : & @{text attribute} & default @{text false} \\
 | |
| 129 |     @{attribute_def show_abbrevs} & : & @{text attribute} & default @{text true} \\
 | |
| 130 |     @{attribute_def show_brackets} & : & @{text attribute} & default @{text false} \\
 | |
| 42669 
04dfffda5671
more conventional naming scheme: names_long, names_short, names_unique;
 wenzelm parents: 
42655diff
changeset | 131 |     @{attribute_def names_long} & : & @{text attribute} & default @{text false} \\
 | 
| 
04dfffda5671
more conventional naming scheme: names_long, names_short, names_unique;
 wenzelm parents: 
42655diff
changeset | 132 |     @{attribute_def names_short} & : & @{text attribute} & default @{text false} \\
 | 
| 
04dfffda5671
more conventional naming scheme: names_long, names_short, names_unique;
 wenzelm parents: 
42655diff
changeset | 133 |     @{attribute_def names_unique} & : & @{text attribute} & default @{text true} \\
 | 
| 42655 | 134 |     @{attribute_def eta_contract} & : & @{text attribute} & default @{text true} \\
 | 
| 135 |     @{attribute_def goals_limit} & : & @{text attribute} & default @{text 10} \\
 | |
| 136 |     @{attribute_def show_main_goal} & : & @{text attribute} & default @{text false} \\
 | |
| 137 |     @{attribute_def show_hyps} & : & @{text attribute} & default @{text false} \\
 | |
| 138 |     @{attribute_def show_tags} & : & @{text attribute} & default @{text false} \\
 | |
| 139 |     @{attribute_def show_question_marks} & : & @{text attribute} & default @{text true} \\
 | |
| 140 |   \end{tabular}
 | |
| 141 | \medskip | |
| 28763 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 142 | |
| 42655 | 143 | These configuration options control the detail of information that | 
| 144 | is displayed for types, terms, theorems, goals etc. See also | |
| 145 |   \secref{sec:config}.
 | |
| 28765 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 146 | |
| 28763 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 147 |   \begin{description}
 | 
| 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 148 | |
| 49699 | 149 |   \item @{attribute show_markup} controls direct inlining of markup
 | 
| 150 | into the printed representation of formal entities --- notably type | |
| 151 | and sort constraints. This enables Prover IDE users to retrieve | |
| 152 | that information via tooltips or popups while hovering with the | |
| 153 | mouse over the output window, for example. Consequently, this | |
| 154 | option is enabled by default for Isabelle/jEdit, but disabled for | |
| 155 | TTY and Proof~General~/Emacs where document markup would not work. | |
| 156 | ||
| 42655 | 157 |   \item @{attribute show_types} and @{attribute show_sorts} control
 | 
| 158 | printing of type constraints for term variables, and sort | |
| 159 | constraints for type variables. By default, neither of these are | |
| 160 |   shown in output.  If @{attribute show_sorts} is enabled, types are
 | |
| 49699 | 161 | always shown as well. In Isabelle/jEdit, manual setting of these | 
| 162 |   options is normally not required thanks to @{attribute show_markup}
 | |
| 163 | above. | |
| 28763 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 164 | |
| 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 165 | Note that displaying types and sorts may explain why a polymorphic | 
| 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 166 | inference rule fails to resolve with some goal, or why a rewrite | 
| 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 167 | rule does not apply as expected. | 
| 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 168 | |
| 42655 | 169 |   \item @{attribute show_consts} controls printing of types of
 | 
| 170 | constants when displaying a goal state. | |
| 28765 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 171 | |
| 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 172 | Note that the output can be enormous, because polymorphic constants | 
| 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 173 | often occur at several different type instances. | 
| 28763 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 174 | |
| 42655 | 175 |   \item @{attribute show_abbrevs} controls folding of constant
 | 
| 176 | abbreviations. | |
| 40879 
ca132ef44944
configuration option "show_abbrevs" supersedes print mode "no_abbrevs", with inverted meaning;
 wenzelm parents: 
40255diff
changeset | 177 | |
| 42655 | 178 |   \item @{attribute show_brackets} controls bracketing in pretty
 | 
| 179 | printed output. If enabled, all sub-expressions of the pretty | |
| 28765 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 180 | printing tree will be parenthesized, even if this produces malformed | 
| 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 181 | term syntax! This crude way of showing the internal structure of | 
| 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 182 | pretty printed entities may occasionally help to diagnose problems | 
| 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 183 | with operator priorities, for example. | 
| 28763 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 184 | |
| 42669 
04dfffda5671
more conventional naming scheme: names_long, names_short, names_unique;
 wenzelm parents: 
42655diff
changeset | 185 |   \item @{attribute names_long}, @{attribute names_short}, and
 | 
| 
04dfffda5671
more conventional naming scheme: names_long, names_short, names_unique;
 wenzelm parents: 
42655diff
changeset | 186 |   @{attribute names_unique} control the way of printing fully
 | 
| 42358 
b47d41d9f4b5
Name_Space: proper configuration options long_names, short_names, unique_names instead of former unsynchronized references;
 wenzelm parents: 
42279diff
changeset | 187 | qualified internal names in external form. See also | 
| 
b47d41d9f4b5
Name_Space: proper configuration options long_names, short_names, unique_names instead of former unsynchronized references;
 wenzelm parents: 
42279diff
changeset | 188 |   \secref{sec:antiq} for the document antiquotation options of the
 | 
| 
b47d41d9f4b5
Name_Space: proper configuration options long_names, short_names, unique_names instead of former unsynchronized references;
 wenzelm parents: 
42279diff
changeset | 189 | same names. | 
| 
b47d41d9f4b5
Name_Space: proper configuration options long_names, short_names, unique_names instead of former unsynchronized references;
 wenzelm parents: 
42279diff
changeset | 190 | |
| 42655 | 191 |   \item @{attribute eta_contract} controls @{text "\<eta>"}-contracted
 | 
| 192 | printing of terms. | |
| 28763 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 193 | |
| 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 194 |   The @{text \<eta>}-contraction law asserts @{prop "(\<lambda>x. f x) \<equiv> f"},
 | 
| 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 195 |   provided @{text x} is not free in @{text f}.  It asserts
 | 
| 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 196 |   \emph{extensionality} of functions: @{prop "f \<equiv> g"} if @{prop "f x \<equiv>
 | 
| 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 197 |   g x"} for all @{text x}.  Higher-order unification frequently puts
 | 
| 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 198 |   terms into a fully @{text \<eta>}-expanded form.  For example, if @{text
 | 
| 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 199 |   F} has type @{text "(\<tau> \<Rightarrow> \<tau>) \<Rightarrow> \<tau>"} then its expanded form is @{term
 | 
| 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 200 | "\<lambda>h. F (\<lambda>x. h x)"}. | 
| 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 201 | |
| 42655 | 202 |   Enabling @{attribute eta_contract} makes Isabelle perform @{text
 | 
| 28763 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 203 |   \<eta>}-contractions before printing, so that @{term "\<lambda>h. F (\<lambda>x. h x)"}
 | 
| 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 204 |   appears simply as @{text F}.
 | 
| 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 205 | |
| 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 206 |   Note that the distinction between a term and its @{text \<eta>}-expanded
 | 
| 28765 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 207 | form occasionally matters. While higher-order resolution and | 
| 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 208 |   rewriting operate modulo @{text "\<alpha>\<beta>\<eta>"}-conversion, some other tools
 | 
| 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 209 | might look at terms more discretely. | 
| 28763 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 210 | |
| 42655 | 211 |   \item @{attribute goals_limit} controls the maximum number of
 | 
| 51960 
61ac1efe02c3
option "goals_limit", with more uniform description;
 wenzelm parents: 
51657diff
changeset | 212 | subgoals to be printed. | 
| 28763 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 213 | |
| 42655 | 214 |   \item @{attribute show_main_goal} controls whether the main result
 | 
| 215 | to be proven should be displayed. This information might be | |
| 39130 | 216 | relevant for schematic goals, to inspect the current claim that has | 
| 217 | been synthesized so far. | |
| 28763 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 218 | |
| 42655 | 219 |   \item @{attribute show_hyps} controls printing of implicit
 | 
| 220 | hypotheses of local facts. Normally, only those hypotheses are | |
| 221 |   displayed that are \emph{not} covered by the assumptions of the
 | |
| 222 | current context: this situation indicates a fault in some tool being | |
| 223 | used. | |
| 28763 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 224 | |
| 42655 | 225 |   By enabling @{attribute show_hyps}, output of \emph{all} hypotheses
 | 
| 226 | can be enforced, which is occasionally useful for diagnostic | |
| 227 | purposes. | |
| 28763 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 228 | |
| 42655 | 229 |   \item @{attribute show_tags} controls printing of extra annotations
 | 
| 230 | within theorems, such as internal position information, or the case | |
| 231 |   names being attached by the attribute @{attribute case_names}.
 | |
| 28765 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 232 | |
| 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 233 |   Note that the @{attribute tagged} and @{attribute untagged}
 | 
| 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 234 | attributes provide low-level access to the collection of tags | 
| 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 235 | associated with a theorem. | 
| 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 236 | |
| 42655 | 237 |   \item @{attribute show_question_marks} controls printing of question
 | 
| 238 |   marks for schematic variables, such as @{text ?x}.  Only the leading
 | |
| 28765 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 239 | question mark is affected, the remaining text is unchanged | 
| 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 240 | (including proper markup for schematic variables that might be | 
| 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 241 | relevant for user interfaces). | 
| 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 242 | |
| 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 243 |   \end{description}
 | 
| 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 244 | *} | 
| 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 245 | |
| 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 246 | |
| 46284 | 247 | subsection {* Alternative print modes \label{sec:print-modes} *}
 | 
| 248 | ||
| 249 | text {*
 | |
| 250 |   \begin{mldecls}
 | |
| 251 |     @{index_ML print_mode_value: "unit -> string list"} \\
 | |
| 252 |     @{index_ML Print_Mode.with_modes: "string list -> ('a -> 'b) -> 'a -> 'b"} \\
 | |
| 253 |   \end{mldecls}
 | |
| 254 | ||
| 255 |   The \emph{print mode} facility allows to modify various operations
 | |
| 256 |   for printing.  Commands like @{command typ}, @{command term},
 | |
| 257 |   @{command thm} (see \secref{sec:print-diag}) take additional print
 | |
| 258 | modes as optional argument. The underlying ML operations are as | |
| 259 | follows. | |
| 260 | ||
| 261 |   \begin{description}
 | |
| 262 | ||
| 263 |   \item @{ML "print_mode_value ()"} yields the list of currently
 | |
| 264 | active print mode names. This should be understood as symbolic | |
| 265 | representation of certain individual features for printing (with | |
| 266 | precedence from left to right). | |
| 267 | ||
| 268 |   \item @{ML Print_Mode.with_modes}~@{text "modes f x"} evaluates
 | |
| 269 |   @{text "f x"} in an execution context where the print mode is
 | |
| 270 |   prepended by the given @{text "modes"}.  This provides a thread-safe
 | |
| 271 | way to augment print modes. It is also monotonic in the set of mode | |
| 272 | names: it retains the default print mode that certain | |
| 273 | user-interfaces might have installed for their proper functioning! | |
| 274 | ||
| 275 |   \end{description}
 | |
| 276 | ||
| 277 |   \begin{warn}
 | |
| 278 |   The old global reference @{ML print_mode} should never be used
 | |
| 279 | directly in applications. Its main reason for being publicly | |
| 280 | accessible is to support historic versions of Proof~General. | |
| 281 |   \end{warn}
 | |
| 282 | ||
| 283 | \medskip The pretty printer for inner syntax maintains alternative | |
| 284 | mixfix productions for any print mode name invented by the user, say | |
| 285 |   in commands like @{command notation} or @{command abbreviation}.
 | |
| 286 | Mode names can be arbitrary, but the following ones have a specific | |
| 287 | meaning by convention: | |
| 288 | ||
| 289 |   \begin{itemize}
 | |
| 290 | ||
| 291 |   \item @{verbatim "\"\""} (the empty string): default mode;
 | |
| 292 | implicitly active as last element in the list of modes. | |
| 293 | ||
| 294 |   \item @{verbatim input}: dummy print mode that is never active; may
 | |
| 295 | be used to specify notation that is only available for input. | |
| 296 | ||
| 297 |   \item @{verbatim internal} dummy print mode that is never active;
 | |
| 298 | used internally in Isabelle/Pure. | |
| 299 | ||
| 300 |   \item @{verbatim xsymbols}: enable proper mathematical symbols
 | |
| 301 |   instead of ASCII art.\footnote{This traditional mode name stems from
 | |
| 302 | the ``X-Symbol'' package for old versions Proof~General with XEmacs, | |
| 303 | although that package has been superseded by Unicode in recent | |
| 304 | years.} | |
| 305 | ||
| 306 |   \item @{verbatim HTML}: additional mode that is active in HTML
 | |
| 307 | presentation of Isabelle theory sources; allows to provide | |
| 308 | alternative output notation. | |
| 309 | ||
| 310 |   \item @{verbatim latex}: additional mode that is active in {\LaTeX}
 | |
| 311 | document preparation of Isabelle theory sources; allows to provide | |
| 312 | alternative output notation. | |
| 313 | ||
| 314 |   \end{itemize}
 | |
| 315 | *} | |
| 316 | ||
| 317 | ||
| 28765 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 318 | subsection {* Printing limits *}
 | 
| 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 319 | |
| 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 320 | text {*
 | 
| 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 321 |   \begin{mldecls}
 | 
| 36745 | 322 |     @{index_ML Pretty.margin_default: "int Unsynchronized.ref"} \\
 | 
| 28765 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 323 |   \end{mldecls}
 | 
| 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 324 | |
| 56281 | 325 |   \begin{tabular}{rcll}
 | 
| 326 |     @{attribute_def ML_print_depth} & : & @{text attribute} & default 10 \\ %FIXME move?
 | |
| 327 |   \end{tabular}
 | |
| 28765 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 328 | |
| 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 329 |   \begin{description}
 | 
| 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 330 | |
| 36745 | 331 |   \item @{ML Pretty.margin_default} indicates the global default for
 | 
| 332 | the right margin of the built-in pretty printer, with initial value | |
| 333 | 76. Note that user-interfaces typically control margins | |
| 334 | automatically when resizing windows, or even bypass the formatting | |
| 335 | engine of Isabelle/ML altogether and do it within the front end via | |
| 336 | Isabelle/Scala. | |
| 28765 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 337 | |
| 56281 | 338 |   \item @{attribute ML_print_depth} limits the printing depth of the
 | 
| 28765 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 339 | ML toplevel pretty printer; the precise effect depends on the ML | 
| 56281 | 340 | compiler and run-time system. Typically the limit should be less | 
| 28765 
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
 wenzelm parents: 
28763diff
changeset | 341 | than 10. Bigger values such as 100--1000 are useful for debugging. | 
| 28763 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 342 | |
| 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 343 |   \end{description}
 | 
| 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 344 | *} | 
| 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 345 | |
| 
b5e6122ff575
added pretty printing options (from old ref manual);
 wenzelm parents: 
28762diff
changeset | 346 | |
| 46282 | 347 | section {* Mixfix annotations \label{sec:mixfix} *}
 | 
| 28762 | 348 | |
| 349 | text {* Mixfix annotations specify concrete \emph{inner syntax} of
 | |
| 35351 
7425aece4ee3
allow general mixfix syntax for type constructors;
 wenzelm parents: 
32833diff
changeset | 350 | Isabelle types and terms. Locally fixed parameters in toplevel | 
| 46290 | 351 | theorem statements, locale and class specifications also admit | 
| 352 | mixfix annotations in a fairly uniform manner. A mixfix annotation | |
| 50635 | 353 | describes the concrete syntax, the translation to abstract | 
| 46290 | 354 | syntax, and the pretty printing. Special case annotations provide a | 
| 355 | simple means of specifying infix operators and binders. | |
| 356 | ||
| 357 |   Isabelle mixfix syntax is inspired by {\OBJ} \cite{OBJ}.  It allows
 | |
| 358 | to specify any context-free priority grammar, which is more general | |
| 359 | than the fixity declarations of ML and Prolog. | |
| 28762 | 360 | |
| 55112 
b1a5d603fd12
prefer rail cartouche -- avoid back-slashed quotes;
 wenzelm parents: 
55108diff
changeset | 361 |   @{rail \<open>
 | 
| 51654 
8450b944e58a
just one syntax category "mixfix" -- check structure annotation semantically;
 wenzelm parents: 
50636diff
changeset | 362 |     @{syntax_def mixfix}: '('
 | 
| 
8450b944e58a
just one syntax category "mixfix" -- check structure annotation semantically;
 wenzelm parents: 
50636diff
changeset | 363 |       @{syntax template} prios? @{syntax nat}? |
 | 
| 46290 | 364 |       (@'infix' | @'infixl' | @'infixr') @{syntax template} @{syntax nat} |
 | 
| 51654 
8450b944e58a
just one syntax category "mixfix" -- check structure annotation semantically;
 wenzelm parents: 
50636diff
changeset | 365 |       @'binder' @{syntax template} prios? @{syntax nat} |
 | 
| 
8450b944e58a
just one syntax category "mixfix" -- check structure annotation semantically;
 wenzelm parents: 
50636diff
changeset | 366 | @'structure' | 
| 
8450b944e58a
just one syntax category "mixfix" -- check structure annotation semantically;
 wenzelm parents: 
50636diff
changeset | 367 | ')' | 
| 46290 | 368 | ; | 
| 369 | template: string | |
| 46289 | 370 | ; | 
| 42596 
6c621a9d612a
modernized rail diagrams using @{rail} antiquotation;
 wenzelm parents: 
42358diff
changeset | 371 |     prios: '[' (@{syntax nat} + ',') ']'
 | 
| 55112 
b1a5d603fd12
prefer rail cartouche -- avoid back-slashed quotes;
 wenzelm parents: 
55108diff
changeset | 372 | \<close>} | 
| 28762 | 373 | |
| 46290 | 374 |   The string given as @{text template} may include literal text,
 | 
| 375 |   spacing, blocks, and arguments (denoted by ``@{text _}''); the
 | |
| 376 |   special symbol ``@{verbatim "\<index>"}'' (printed as ``@{text "\<index>"}'')
 | |
| 51657 
3db1bbc82d8d
more accurate documentation of "(structure)" mixfix;
 wenzelm parents: 
51654diff
changeset | 377 |   represents an index argument that specifies an implicit @{keyword
 | 
| 
3db1bbc82d8d
more accurate documentation of "(structure)" mixfix;
 wenzelm parents: 
51654diff
changeset | 378 |   "structure"} reference (see also \secref{sec:locale}).  Only locally
 | 
| 
3db1bbc82d8d
more accurate documentation of "(structure)" mixfix;
 wenzelm parents: 
51654diff
changeset | 379 |   fixed variables may be declared as @{keyword "structure"}.
 | 
| 
3db1bbc82d8d
more accurate documentation of "(structure)" mixfix;
 wenzelm parents: 
51654diff
changeset | 380 | |
| 
3db1bbc82d8d
more accurate documentation of "(structure)" mixfix;
 wenzelm parents: 
51654diff
changeset | 381 | Infix and binder declarations provide common abbreviations for | 
| 
3db1bbc82d8d
more accurate documentation of "(structure)" mixfix;
 wenzelm parents: 
51654diff
changeset | 382 | particular mixfix declarations. So in practice, mixfix templates | 
| 
3db1bbc82d8d
more accurate documentation of "(structure)" mixfix;
 wenzelm parents: 
51654diff
changeset | 383 | mostly degenerate to literal text for concrete syntax, such as | 
| 
3db1bbc82d8d
more accurate documentation of "(structure)" mixfix;
 wenzelm parents: 
51654diff
changeset | 384 |   ``@{verbatim "++"}'' for an infix symbol.  *}
 | 
| 28762 | 385 | |
| 46290 | 386 | |
| 387 | subsection {* The general mixfix form *}
 | |
| 388 | ||
| 389 | text {* In full generality, mixfix declarations work as follows.
 | |
| 390 |   Suppose a constant @{text "c :: \<tau>\<^sub>1 \<Rightarrow> \<dots> \<tau>\<^sub>n \<Rightarrow> \<tau>"} is annotated by
 | |
| 391 |   @{text "(mixfix [p\<^sub>1, \<dots>, p\<^sub>n] p)"}, where @{text "mixfix"} is a string
 | |
| 392 |   @{text "d\<^sub>0 _ d\<^sub>1 _ \<dots> _ d\<^sub>n"} consisting of delimiters that surround
 | |
| 393 | argument positions as indicated by underscores. | |
| 28762 | 394 | |
| 395 | Altogether this determines a production for a context-free priority | |
| 396 |   grammar, where for each argument @{text "i"} the syntactic category
 | |
| 46292 | 397 |   is determined by @{text "\<tau>\<^sub>i"} (with priority @{text "p\<^sub>i"}), and the
 | 
| 398 |   result category is determined from @{text "\<tau>"} (with priority @{text
 | |
| 399 | "p"}). Priority specifications are optional, with default 0 for | |
| 400 |   arguments and 1000 for the result.\footnote{Omitting priorities is
 | |
| 401 | prone to syntactic ambiguities unless the delimiter tokens determine | |
| 402 |   fully bracketed notation, as in @{text "if _ then _ else _ fi"}.}
 | |
| 28762 | 403 | |
| 404 |   Since @{text "\<tau>"} may be again a function type, the constant
 | |
| 405 | type scheme may have more argument positions than the mixfix | |
| 406 |   pattern.  Printing a nested application @{text "c t\<^sub>1 \<dots> t\<^sub>m"} for
 | |
| 407 |   @{text "m > n"} works by attaching concrete notation only to the
 | |
| 408 |   innermost part, essentially by printing @{text "(c t\<^sub>1 \<dots> t\<^sub>n) \<dots> t\<^sub>m"}
 | |
| 409 | instead. If a term has fewer arguments than specified in the mixfix | |
| 410 | template, the concrete syntax is ignored. | |
| 411 | ||
| 412 | \medskip A mixfix template may also contain additional directives | |
| 413 | for pretty printing, notably spaces, blocks, and breaks. The | |
| 414 | general template format is a sequence over any of the following | |
| 415 | entities. | |
| 416 | ||
| 28778 | 417 |   \begin{description}
 | 
| 28762 | 418 | |
| 28771 
4510201c6aaf
mixfix annotations: verbatim for special symbols;
 wenzelm parents: 
28770diff
changeset | 419 |   \item @{text "d"} is a delimiter, namely a non-empty sequence of
 | 
| 
4510201c6aaf
mixfix annotations: verbatim for special symbols;
 wenzelm parents: 
28770diff
changeset | 420 | characters other than the following special characters: | 
| 28762 | 421 | |
| 28771 
4510201c6aaf
mixfix annotations: verbatim for special symbols;
 wenzelm parents: 
28770diff
changeset | 422 | \smallskip | 
| 
4510201c6aaf
mixfix annotations: verbatim for special symbols;
 wenzelm parents: 
28770diff
changeset | 423 |   \begin{tabular}{ll}
 | 
| 
4510201c6aaf
mixfix annotations: verbatim for special symbols;
 wenzelm parents: 
28770diff
changeset | 424 |     @{verbatim "'"} & single quote \\
 | 
| 
4510201c6aaf
mixfix annotations: verbatim for special symbols;
 wenzelm parents: 
28770diff
changeset | 425 |     @{verbatim "_"} & underscore \\
 | 
| 
4510201c6aaf
mixfix annotations: verbatim for special symbols;
 wenzelm parents: 
28770diff
changeset | 426 |     @{text "\<index>"} & index symbol \\
 | 
| 
4510201c6aaf
mixfix annotations: verbatim for special symbols;
 wenzelm parents: 
28770diff
changeset | 427 |     @{verbatim "("} & open parenthesis \\
 | 
| 
4510201c6aaf
mixfix annotations: verbatim for special symbols;
 wenzelm parents: 
28770diff
changeset | 428 |     @{verbatim ")"} & close parenthesis \\
 | 
| 
4510201c6aaf
mixfix annotations: verbatim for special symbols;
 wenzelm parents: 
28770diff
changeset | 429 |     @{verbatim "/"} & slash \\
 | 
| 
4510201c6aaf
mixfix annotations: verbatim for special symbols;
 wenzelm parents: 
28770diff
changeset | 430 |   \end{tabular}
 | 
| 
4510201c6aaf
mixfix annotations: verbatim for special symbols;
 wenzelm parents: 
28770diff
changeset | 431 | \medskip | 
| 28762 | 432 | |
| 28771 
4510201c6aaf
mixfix annotations: verbatim for special symbols;
 wenzelm parents: 
28770diff
changeset | 433 |   \item @{verbatim "'"} escapes the special meaning of these
 | 
| 
4510201c6aaf
mixfix annotations: verbatim for special symbols;
 wenzelm parents: 
28770diff
changeset | 434 | meta-characters, producing a literal version of the following | 
| 
4510201c6aaf
mixfix annotations: verbatim for special symbols;
 wenzelm parents: 
28770diff
changeset | 435 | character, unless that is a blank. | 
| 
4510201c6aaf
mixfix annotations: verbatim for special symbols;
 wenzelm parents: 
28770diff
changeset | 436 | |
| 
4510201c6aaf
mixfix annotations: verbatim for special symbols;
 wenzelm parents: 
28770diff
changeset | 437 | A single quote followed by a blank separates delimiters, without | 
| 
4510201c6aaf
mixfix annotations: verbatim for special symbols;
 wenzelm parents: 
28770diff
changeset | 438 | affecting printing, but input tokens may have additional white space | 
| 
4510201c6aaf
mixfix annotations: verbatim for special symbols;
 wenzelm parents: 
28770diff
changeset | 439 | here. | 
| 
4510201c6aaf
mixfix annotations: verbatim for special symbols;
 wenzelm parents: 
28770diff
changeset | 440 | |
| 
4510201c6aaf
mixfix annotations: verbatim for special symbols;
 wenzelm parents: 
28770diff
changeset | 441 |   \item @{verbatim "_"} is an argument position, which stands for a
 | 
| 28762 | 442 | certain syntactic category in the underlying grammar. | 
| 443 | ||
| 28771 
4510201c6aaf
mixfix annotations: verbatim for special symbols;
 wenzelm parents: 
28770diff
changeset | 444 |   \item @{text "\<index>"} is an indexed argument position; this is the place
 | 
| 
4510201c6aaf
mixfix annotations: verbatim for special symbols;
 wenzelm parents: 
28770diff
changeset | 445 | where implicit structure arguments can be attached. | 
| 28762 | 446 | |
| 28771 
4510201c6aaf
mixfix annotations: verbatim for special symbols;
 wenzelm parents: 
28770diff
changeset | 447 |   \item @{text "s"} is a non-empty sequence of spaces for printing.
 | 
| 
4510201c6aaf
mixfix annotations: verbatim for special symbols;
 wenzelm parents: 
28770diff
changeset | 448 | This and the following specifications do not affect parsing at all. | 
| 28762 | 449 | |
| 28771 
4510201c6aaf
mixfix annotations: verbatim for special symbols;
 wenzelm parents: 
28770diff
changeset | 450 |   \item @{verbatim "("}@{text n} opens a pretty printing block.  The
 | 
| 28762 | 451 | optional number specifies how much indentation to add when a line | 
| 452 | break occurs within the block. If the parenthesis is not followed | |
| 453 | by digits, the indentation defaults to 0. A block specified via | |
| 28771 
4510201c6aaf
mixfix annotations: verbatim for special symbols;
 wenzelm parents: 
28770diff
changeset | 454 |   @{verbatim "(00"} is unbreakable.
 | 
| 28762 | 455 | |
| 28771 
4510201c6aaf
mixfix annotations: verbatim for special symbols;
 wenzelm parents: 
28770diff
changeset | 456 |   \item @{verbatim ")"} closes a pretty printing block.
 | 
| 28762 | 457 | |
| 28771 
4510201c6aaf
mixfix annotations: verbatim for special symbols;
 wenzelm parents: 
28770diff
changeset | 458 |   \item @{verbatim "//"} forces a line break.
 | 
| 28762 | 459 | |
| 28771 
4510201c6aaf
mixfix annotations: verbatim for special symbols;
 wenzelm parents: 
28770diff
changeset | 460 |   \item @{verbatim "/"}@{text s} allows a line break.  Here @{text s}
 | 
| 
4510201c6aaf
mixfix annotations: verbatim for special symbols;
 wenzelm parents: 
28770diff
changeset | 461 | stands for the string of spaces (zero or more) right after the | 
| 
4510201c6aaf
mixfix annotations: verbatim for special symbols;
 wenzelm parents: 
28770diff
changeset | 462 |   slash.  These spaces are printed if the break is \emph{not} taken.
 | 
| 28762 | 463 | |
| 28778 | 464 |   \end{description}
 | 
| 28762 | 465 | |
| 466 | The general idea of pretty printing with blocks and breaks is also | |
| 46286 | 467 |   described in \cite{paulson-ml2}; it goes back to \cite{Oppen:1980}.
 | 
| 28762 | 468 | *} | 
| 469 | ||
| 470 | ||
| 46290 | 471 | subsection {* Infixes *}
 | 
| 472 | ||
| 473 | text {* Infix operators are specified by convenient short forms that
 | |
| 474 | abbreviate general mixfix annotations as follows: | |
| 475 | ||
| 476 |   \begin{center}
 | |
| 477 |   \begin{tabular}{lll}
 | |
| 478 | ||
| 46292 | 479 |   @{verbatim "("}@{keyword_def "infix"}~@{verbatim "\""}@{text sy}@{verbatim "\""} @{text "p"}@{verbatim ")"}
 | 
| 46290 | 480 |   & @{text "\<mapsto>"} &
 | 
| 481 |   @{verbatim "(\"(_ "}@{text sy}@{verbatim "/ _)\" ["}@{text "p + 1"}@{verbatim ", "}@{text "p + 1"}@{verbatim "]"}@{text " p"}@{verbatim ")"} \\
 | |
| 46292 | 482 |   @{verbatim "("}@{keyword_def "infixl"}~@{verbatim "\""}@{text sy}@{verbatim "\""} @{text "p"}@{verbatim ")"}
 | 
| 46290 | 483 |   & @{text "\<mapsto>"} &
 | 
| 484 |   @{verbatim "(\"(_ "}@{text sy}@{verbatim "/ _)\" ["}@{text "p"}@{verbatim ", "}@{text "p + 1"}@{verbatim "]"}@{text " p"}@{verbatim ")"} \\
 | |
| 46292 | 485 |   @{verbatim "("}@{keyword_def "infixr"}~@{verbatim "\""}@{text sy}@{verbatim "\""} @{text "p"}@{verbatim ")"}
 | 
| 46290 | 486 |   & @{text "\<mapsto>"} &
 | 
| 487 |   @{verbatim "(\"(_ "}@{text sy}@{verbatim "/ _)\" ["}@{text "p + 1"}@{verbatim ", "}@{text "p"}@{verbatim "]"}@{text " p"}@{verbatim ")"} \\
 | |
| 488 | ||
| 489 |   \end{tabular}
 | |
| 490 |   \end{center}
 | |
| 491 | ||
| 46292 | 492 |   The mixfix template @{verbatim "\"(_ "}@{text sy}@{verbatim "/ _)\""}
 | 
| 493 | specifies two argument positions; the delimiter is preceded by a | |
| 494 | space and followed by a space or line break; the entire phrase is a | |
| 495 | pretty printing block. | |
| 46290 | 496 | |
| 497 |   The alternative notation @{verbatim "op"}~@{text sy} is introduced
 | |
| 498 | in addition. Thus any infix operator may be written in prefix form | |
| 499 | (as in ML), independently of the number of arguments in the term. | |
| 500 | *} | |
| 501 | ||
| 502 | ||
| 503 | subsection {* Binders *}
 | |
| 504 | ||
| 505 | text {* A \emph{binder} is a variable-binding construct such as a
 | |
| 506 |   quantifier.  The idea to formalize @{text "\<forall>x. b"} as @{text "All
 | |
| 507 |   (\<lambda>x. b)"} for @{text "All :: ('a \<Rightarrow> bool) \<Rightarrow> bool"} already goes back
 | |
| 508 |   to \cite{church40}.  Isabelle declarations of certain higher-order
 | |
| 46292 | 509 |   operators may be annotated with @{keyword_def "binder"} annotations
 | 
| 510 | as follows: | |
| 46290 | 511 | |
| 512 |   \begin{center}
 | |
| 513 |   @{text "c :: "}@{verbatim "\""}@{text "(\<tau>\<^sub>1 \<Rightarrow> \<tau>\<^sub>2) \<Rightarrow> \<tau>\<^sub>3"}@{verbatim "\"  ("}@{keyword "binder"}@{verbatim " \""}@{text "sy"}@{verbatim "\" ["}@{text "p"}@{verbatim "] "}@{text "q"}@{verbatim ")"}
 | |
| 514 |   \end{center}
 | |
| 515 | ||
| 516 |   This introduces concrete binder syntax @{text "sy x. b"}, where
 | |
| 517 |   @{text x} is a bound variable of type @{text "\<tau>\<^sub>1"}, the body @{text
 | |
| 518 |   b} has type @{text "\<tau>\<^sub>2"} and the whole term has type @{text "\<tau>\<^sub>3"}.
 | |
| 519 |   The optional integer @{text p} specifies the syntactic priority of
 | |
| 520 |   the body; the default is @{text "q"}, which is also the priority of
 | |
| 521 | the whole construct. | |
| 522 | ||
| 523 | Internally, the binder syntax is expanded to something like this: | |
| 524 |   \begin{center}
 | |
| 525 |   @{text "c_binder :: "}@{verbatim "\""}@{text "idts \<Rightarrow> \<tau>\<^sub>2 \<Rightarrow> \<tau>\<^sub>3"}@{verbatim "\"  (\"(3"}@{text sy}@{verbatim "_./ _)\" [0, "}@{text "p"}@{verbatim "] "}@{text "q"}@{verbatim ")"}
 | |
| 526 |   \end{center}
 | |
| 527 | ||
| 528 |   Here @{syntax (inner) idts} is the nonterminal symbol for a list of
 | |
| 529 | identifiers with optional type constraints (see also | |
| 530 |   \secref{sec:pure-grammar}).  The mixfix template @{verbatim
 | |
| 531 |   "\"(3"}@{text sy}@{verbatim "_./ _)\""} defines argument positions
 | |
| 532 | for the bound identifiers and the body, separated by a dot with | |
| 533 | optional line break; the entire phrase is a pretty printing block of | |
| 534 |   indentation level 3.  Note that there is no extra space after @{text
 | |
| 535 | "sy"}, so it needs to be included user specification if the binder | |
| 536 | syntax ends with a token that may be continued by an identifier | |
| 537 |   token at the start of @{syntax (inner) idts}.
 | |
| 538 | ||
| 539 |   Furthermore, a syntax translation to transforms @{text "c_binder x\<^sub>1
 | |
| 540 |   \<dots> x\<^sub>n b"} into iterated application @{text "c (\<lambda>x\<^sub>1. \<dots> c (\<lambda>x\<^sub>n. b)\<dots>)"}.
 | |
| 541 | This works in both directions, for parsing and printing. *} | |
| 542 | ||
| 543 | ||
| 46282 | 544 | section {* Explicit notation \label{sec:notation} *}
 | 
| 28762 | 545 | |
| 546 | text {*
 | |
| 547 |   \begin{matharray}{rcll}
 | |
| 35413 | 548 |     @{command_def "type_notation"} & : & @{text "local_theory \<rightarrow> local_theory"} \\
 | 
| 549 |     @{command_def "no_type_notation"} & : & @{text "local_theory \<rightarrow> local_theory"} \\
 | |
| 28762 | 550 |     @{command_def "notation"} & : & @{text "local_theory \<rightarrow> local_theory"} \\
 | 
| 551 |     @{command_def "no_notation"} & : & @{text "local_theory \<rightarrow> local_theory"} \\
 | |
| 36508 
03d2a2d0ee4a
allow concrete syntax for local entities within a proof body, either via regular mixfix annotations to 'fix' etc. or the separate 'write' command;
 wenzelm parents: 
35413diff
changeset | 552 |     @{command_def "write"} & : & @{text "proof(state) \<rightarrow> proof(state)"} \\
 | 
| 28762 | 553 |   \end{matharray}
 | 
| 554 | ||
| 46288 | 555 | Commands that introduce new logical entities (terms or types) | 
| 556 | usually allow to provide mixfix annotations on the spot, which is | |
| 557 | convenient for default notation. Nonetheless, the syntax may be | |
| 558 | modified later on by declarations for explicit notation. This | |
| 559 | allows to add or delete mixfix annotations for of existing logical | |
| 560 | entities within the current context. | |
| 561 | ||
| 55112 
b1a5d603fd12
prefer rail cartouche -- avoid back-slashed quotes;
 wenzelm parents: 
55108diff
changeset | 562 |   @{rail \<open>
 | 
| 42596 
6c621a9d612a
modernized rail diagrams using @{rail} antiquotation;
 wenzelm parents: 
42358diff
changeset | 563 |     (@@{command type_notation} | @@{command no_type_notation}) @{syntax target}?
 | 
| 55029 
61a6bf7d4b02
clarified @{rail} syntax: prefer explicit \<newline> symbol;
 wenzelm parents: 
52430diff
changeset | 564 |       @{syntax mode}? \<newline> (@{syntax nameref} @{syntax mixfix} + @'and')
 | 
| 35413 | 565 | ; | 
| 55029 
61a6bf7d4b02
clarified @{rail} syntax: prefer explicit \<newline> symbol;
 wenzelm parents: 
52430diff
changeset | 566 |     (@@{command notation} | @@{command no_notation}) @{syntax target}? @{syntax mode}? \<newline>
 | 
| 51654 
8450b944e58a
just one syntax category "mixfix" -- check structure annotation semantically;
 wenzelm parents: 
50636diff
changeset | 567 |       (@{syntax nameref} @{syntax mixfix} + @'and')
 | 
| 28762 | 568 | ; | 
| 51654 
8450b944e58a
just one syntax category "mixfix" -- check structure annotation semantically;
 wenzelm parents: 
50636diff
changeset | 569 |     @@{command write} @{syntax mode}? (@{syntax nameref} @{syntax mixfix} + @'and')
 | 
| 55112 
b1a5d603fd12
prefer rail cartouche -- avoid back-slashed quotes;
 wenzelm parents: 
55108diff
changeset | 570 | \<close>} | 
| 28762 | 571 | |
| 572 |   \begin{description}
 | |
| 573 | ||
| 35413 | 574 |   \item @{command "type_notation"}~@{text "c (mx)"} associates mixfix
 | 
| 575 | syntax with an existing type constructor. The arity of the | |
| 576 | constructor is retrieved from the context. | |
| 46282 | 577 | |
| 35413 | 578 |   \item @{command "no_type_notation"} is similar to @{command
 | 
| 579 | "type_notation"}, but removes the specified syntax annotation from | |
| 580 | the present context. | |
| 581 | ||
| 28762 | 582 |   \item @{command "notation"}~@{text "c (mx)"} associates mixfix
 | 
| 35413 | 583 | syntax with an existing constant or fixed variable. The type | 
| 584 | declaration of the given entity is retrieved from the context. | |
| 46282 | 585 | |
| 28762 | 586 |   \item @{command "no_notation"} is similar to @{command "notation"},
 | 
| 587 | but removes the specified syntax annotation from the present | |
| 588 | context. | |
| 589 | ||
| 36508 
03d2a2d0ee4a
allow concrete syntax for local entities within a proof body, either via regular mixfix annotations to 'fix' etc. or the separate 'write' command;
 wenzelm parents: 
35413diff
changeset | 590 |   \item @{command "write"} is similar to @{command "notation"}, but
 | 
| 
03d2a2d0ee4a
allow concrete syntax for local entities within a proof body, either via regular mixfix annotations to 'fix' etc. or the separate 'write' command;
 wenzelm parents: 
35413diff
changeset | 591 | works within an Isar proof body. | 
| 
03d2a2d0ee4a
allow concrete syntax for local entities within a proof body, either via regular mixfix annotations to 'fix' etc. or the separate 'write' command;
 wenzelm parents: 
35413diff
changeset | 592 | |
| 28762 | 593 |   \end{description}
 | 
| 594 | *} | |
| 595 | ||
| 28778 | 596 | |
| 597 | section {* The Pure syntax \label{sec:pure-syntax} *}
 | |
| 28769 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 598 | |
| 46282 | 599 | subsection {* Lexical matters \label{sec:inner-lex} *}
 | 
| 600 | ||
| 601 | text {* The inner lexical syntax vaguely resembles the outer one
 | |
| 602 |   (\secref{sec:outer-lex}), but some details are different.  There are
 | |
| 603 | two main categories of inner syntax tokens: | |
| 604 | ||
| 605 |   \begin{enumerate}
 | |
| 606 | ||
| 607 |   \item \emph{delimiters} --- the literal tokens occurring in
 | |
| 608 | productions of the given priority grammar (cf.\ | |
| 609 |   \secref{sec:priority-grammar});
 | |
| 610 | ||
| 611 |   \item \emph{named tokens} --- various categories of identifiers etc.
 | |
| 612 | ||
| 613 |   \end{enumerate}
 | |
| 614 | ||
| 615 | Delimiters override named tokens and may thus render certain | |
| 616 | identifiers inaccessible. Sometimes the logical context admits | |
| 617 | alternative ways to refer to the same entity, potentially via | |
| 618 | qualified names. | |
| 619 | ||
| 620 | \medskip The categories for named tokens are defined once and for | |
| 621 | all as follows, reusing some categories of the outer token syntax | |
| 622 |   (\secref{sec:outer-lex}).
 | |
| 623 | ||
| 624 |   \begin{center}
 | |
| 625 |   \begin{supertabular}{rcl}
 | |
| 626 |     @{syntax_def (inner) id} & = & @{syntax_ref ident} \\
 | |
| 627 |     @{syntax_def (inner) longid} & = & @{syntax_ref longident} \\
 | |
| 628 |     @{syntax_def (inner) var} & = & @{syntax_ref var} \\
 | |
| 629 |     @{syntax_def (inner) tid} & = & @{syntax_ref typefree} \\
 | |
| 630 |     @{syntax_def (inner) tvar} & = & @{syntax_ref typevar} \\
 | |
| 631 |     @{syntax_def (inner) num_token} & = & @{syntax_ref nat}@{text "  |  "}@{verbatim "-"}@{syntax_ref nat} \\
 | |
| 632 |     @{syntax_def (inner) float_token} & = & @{syntax_ref nat}@{verbatim "."}@{syntax_ref nat}@{text "  |  "}@{verbatim "-"}@{syntax_ref nat}@{verbatim "."}@{syntax_ref nat} \\
 | |
| 633 |     @{syntax_def (inner) xnum_token} & = & @{verbatim "#"}@{syntax_ref nat}@{text "  |  "}@{verbatim "#-"}@{syntax_ref nat} \\
 | |
| 46483 | 634 |     @{syntax_def (inner) str_token} & = & @{verbatim "''"} @{text "\<dots>"} @{verbatim "''"} \\
 | 
| 55108 
0b7a0c1fdf7e
inner syntax token language allows regular quoted strings;
 wenzelm parents: 
55033diff
changeset | 635 |     @{syntax_def (inner) string_token} & = & @{verbatim "\""} @{text "\<dots>"} @{verbatim "\""} \\
 | 
| 55033 | 636 |     @{syntax_def (inner) cartouche} & = & @{verbatim "\<open>"} @{text "\<dots>"} @{verbatim "\<close>"} \\
 | 
| 46282 | 637 |   \end{supertabular}
 | 
| 638 |   \end{center}
 | |
| 639 | ||
| 640 |   The token categories @{syntax (inner) num_token}, @{syntax (inner)
 | |
| 55033 | 641 |   float_token}, @{syntax (inner) xnum_token}, @{syntax (inner)
 | 
| 55108 
0b7a0c1fdf7e
inner syntax token language allows regular quoted strings;
 wenzelm parents: 
55033diff
changeset | 642 |   str_token}, @{syntax (inner) string_token}, and @{syntax (inner)
 | 
| 
0b7a0c1fdf7e
inner syntax token language allows regular quoted strings;
 wenzelm parents: 
55033diff
changeset | 643 | cartouche} are not used in Pure. Object-logics may implement | 
| 
0b7a0c1fdf7e
inner syntax token language allows regular quoted strings;
 wenzelm parents: 
55033diff
changeset | 644 | numerals and string literals by adding appropriate syntax | 
| 
0b7a0c1fdf7e
inner syntax token language allows regular quoted strings;
 wenzelm parents: 
55033diff
changeset | 645 | declarations, together with some translation functions (e.g.\ see | 
| 
0b7a0c1fdf7e
inner syntax token language allows regular quoted strings;
 wenzelm parents: 
55033diff
changeset | 646 |   @{file "~~/src/HOL/Tools/string_syntax.ML"}).
 | 
| 46282 | 647 | |
| 648 |   The derived categories @{syntax_def (inner) num_const}, @{syntax_def
 | |
| 649 |   (inner) float_const}, and @{syntax_def (inner) num_const} provide
 | |
| 650 | robust access to the respective tokens: the syntax tree holds a | |
| 651 | syntactic constant instead of a free variable. | |
| 652 | *} | |
| 653 | ||
| 654 | ||
| 28777 | 655 | subsection {* Priority grammars \label{sec:priority-grammar} *}
 | 
| 28769 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 656 | |
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 657 | text {* A context-free grammar consists of a set of \emph{terminal
 | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 658 |   symbols}, a set of \emph{nonterminal symbols} and a set of
 | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 659 |   \emph{productions}.  Productions have the form @{text "A = \<gamma>"},
 | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 660 |   where @{text A} is a nonterminal and @{text \<gamma>} is a string of
 | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 661 | terminals and nonterminals. One designated nonterminal is called | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 662 |   the \emph{root symbol}.  The language defined by the grammar
 | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 663 | consists of all strings of terminals that can be derived from the | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 664 | root symbol by applying productions as rewrite rules. | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 665 | |
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 666 |   The standard Isabelle parser for inner syntax uses a \emph{priority
 | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 667 | grammar}. Each nonterminal is decorated by an integer priority: | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 668 |   @{text "A\<^sup>(\<^sup>p\<^sup>)"}.  In a derivation, @{text "A\<^sup>(\<^sup>p\<^sup>)"} may be rewritten
 | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 669 |   using a production @{text "A\<^sup>(\<^sup>q\<^sup>) = \<gamma>"} only if @{text "p \<le> q"}.  Any
 | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 670 | priority grammar can be translated into a normal context-free | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 671 | grammar by introducing new nonterminals and productions. | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 672 | |
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 673 |   \medskip Formally, a set of context free productions @{text G}
 | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 674 |   induces a derivation relation @{text "\<longrightarrow>\<^sub>G"} as follows.  Let @{text
 | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 675 |   \<alpha>} and @{text \<beta>} denote strings of terminal or nonterminal symbols.
 | 
| 28774 | 676 |   Then @{text "\<alpha> A\<^sup>(\<^sup>p\<^sup>) \<beta> \<longrightarrow>\<^sub>G \<alpha> \<gamma> \<beta>"} holds if and only if @{text G}
 | 
| 677 |   contains some production @{text "A\<^sup>(\<^sup>q\<^sup>) = \<gamma>"} for @{text "p \<le> q"}.
 | |
| 28769 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 678 | |
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 679 | \medskip The following grammar for arithmetic expressions | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 680 | demonstrates how binding power and associativity of operators can be | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 681 | enforced by priorities. | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 682 | |
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 683 |   \begin{center}
 | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 684 |   \begin{tabular}{rclr}
 | 
| 28774 | 685 |   @{text "A\<^sup>(\<^sup>1\<^sup>0\<^sup>0\<^sup>0\<^sup>)"} & @{text "="} & @{verbatim "("} @{text "A\<^sup>(\<^sup>0\<^sup>)"} @{verbatim ")"} \\
 | 
| 28769 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 686 |   @{text "A\<^sup>(\<^sup>1\<^sup>0\<^sup>0\<^sup>0\<^sup>)"} & @{text "="} & @{verbatim 0} \\
 | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 687 |   @{text "A\<^sup>(\<^sup>0\<^sup>)"} & @{text "="} & @{text "A\<^sup>(\<^sup>0\<^sup>)"} @{verbatim "+"} @{text "A\<^sup>(\<^sup>1\<^sup>)"} \\
 | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 688 |   @{text "A\<^sup>(\<^sup>2\<^sup>)"} & @{text "="} & @{text "A\<^sup>(\<^sup>3\<^sup>)"} @{verbatim "*"} @{text "A\<^sup>(\<^sup>2\<^sup>)"} \\
 | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 689 |   @{text "A\<^sup>(\<^sup>3\<^sup>)"} & @{text "="} & @{verbatim "-"} @{text "A\<^sup>(\<^sup>3\<^sup>)"} \\
 | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 690 |   \end{tabular}
 | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 691 |   \end{center}
 | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 692 |   The choice of priorities determines that @{verbatim "-"} binds
 | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 693 |   tighter than @{verbatim "*"}, which binds tighter than @{verbatim
 | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 694 |   "+"}.  Furthermore @{verbatim "+"} associates to the left and
 | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 695 |   @{verbatim "*"} to the right.
 | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 696 | |
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 697 | \medskip For clarity, grammars obey these conventions: | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 698 |   \begin{itemize}
 | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 699 | |
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 700 | \item All priorities must lie between 0 and 1000. | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 701 | |
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 702 | \item Priority 0 on the right-hand side and priority 1000 on the | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 703 | left-hand side may be omitted. | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 704 | |
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 705 |   \item The production @{text "A\<^sup>(\<^sup>p\<^sup>) = \<alpha>"} is written as @{text "A = \<alpha>
 | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 706 | (p)"}, i.e.\ the priority of the left-hand side actually appears in | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 707 | a column on the far right. | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 708 | |
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 709 |   \item Alternatives are separated by @{text "|"}.
 | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 710 | |
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 711 |   \item Repetition is indicated by dots @{text "(\<dots>)"} in an informal
 | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 712 | but obvious way. | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 713 | |
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 714 |   \end{itemize}
 | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 715 | |
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 716 | Using these conventions, the example grammar specification above | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 717 | takes the form: | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 718 |   \begin{center}
 | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 719 |   \begin{tabular}{rclc}
 | 
| 28774 | 720 |     @{text A} & @{text "="} & @{verbatim "("} @{text A} @{verbatim ")"} \\
 | 
| 721 |               & @{text "|"} & @{verbatim 0} & \qquad\qquad \\
 | |
| 28769 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 722 |               & @{text "|"} & @{text A} @{verbatim "+"} @{text "A\<^sup>(\<^sup>1\<^sup>)"} & @{text "(0)"} \\
 | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 723 |               & @{text "|"} & @{text "A\<^sup>(\<^sup>3\<^sup>)"} @{verbatim "*"} @{text "A\<^sup>(\<^sup>2\<^sup>)"} & @{text "(2)"} \\
 | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 724 |               & @{text "|"} & @{verbatim "-"} @{text "A\<^sup>(\<^sup>3\<^sup>)"} & @{text "(3)"} \\
 | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 725 |   \end{tabular}
 | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 726 |   \end{center}
 | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 727 | *} | 
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 728 | |
| 
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
 wenzelm parents: 
28767diff
changeset | 729 | |
| 46290 | 730 | subsection {* The Pure grammar \label{sec:pure-grammar} *}
 | 
| 28770 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 731 | |
| 46287 | 732 | text {* The priority grammar of the @{text "Pure"} theory is defined
 | 
| 733 | approximately like this: | |
| 28774 | 734 | |
| 28770 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 735 |   \begin{center}
 | 
| 28773 | 736 |   \begin{supertabular}{rclr}
 | 
| 28770 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 737 | |
| 28778 | 738 |   @{syntax_def (inner) any} & = & @{text "prop  |  logic"} \\\\
 | 
| 28772 | 739 | |
| 28778 | 740 |   @{syntax_def (inner) prop} & = & @{verbatim "("} @{text prop} @{verbatim ")"} \\
 | 
| 28772 | 741 |     & @{text "|"} & @{text "prop\<^sup>(\<^sup>4\<^sup>)"} @{verbatim "::"} @{text type} & @{text "(3)"} \\
 | 
| 50636 
07f47142378e
uniform notation for == and \<equiv> (cf. 3e3c2af5e8a5);
 wenzelm parents: 
50635diff
changeset | 742 |     & @{text "|"} & @{text "any\<^sup>(\<^sup>3\<^sup>)"} @{verbatim "=="} @{text "any\<^sup>(\<^sup>3\<^sup>)"} & @{text "(2)"} \\
 | 
| 
07f47142378e
uniform notation for == and \<equiv> (cf. 3e3c2af5e8a5);
 wenzelm parents: 
50635diff
changeset | 743 |     & @{text "|"} & @{text "any\<^sup>(\<^sup>3\<^sup>)"} @{text "\<equiv>"} @{text "any\<^sup>(\<^sup>3\<^sup>)"} & @{text "(2)"} \\
 | 
| 28856 
5e009a80fe6d
Pure syntax: more coherent treatment of aprop, permanent TERM and &&&;
 wenzelm parents: 
28779diff
changeset | 744 |     & @{text "|"} & @{text "prop\<^sup>(\<^sup>3\<^sup>)"} @{verbatim "&&&"} @{text "prop\<^sup>(\<^sup>2\<^sup>)"} & @{text "(2)"} \\
 | 
| 28772 | 745 |     & @{text "|"} & @{text "prop\<^sup>(\<^sup>2\<^sup>)"} @{verbatim "==>"} @{text "prop\<^sup>(\<^sup>1\<^sup>)"} & @{text "(1)"} \\
 | 
| 28773 | 746 |     & @{text "|"} & @{text "prop\<^sup>(\<^sup>2\<^sup>)"} @{text "\<Longrightarrow>"} @{text "prop\<^sup>(\<^sup>1\<^sup>)"} & @{text "(1)"} \\
 | 
| 28772 | 747 |     & @{text "|"} & @{verbatim "[|"} @{text prop} @{verbatim ";"} @{text "\<dots>"} @{verbatim ";"} @{text prop} @{verbatim "|]"} @{verbatim "==>"} @{text "prop\<^sup>(\<^sup>1\<^sup>)"} & @{text "(1)"} \\
 | 
| 28773 | 748 |     & @{text "|"} & @{text "\<lbrakk>"} @{text prop} @{verbatim ";"} @{text "\<dots>"} @{verbatim ";"} @{text prop} @{text "\<rbrakk>"} @{text "\<Longrightarrow>"} @{text "prop\<^sup>(\<^sup>1\<^sup>)"} & @{text "(1)"} \\
 | 
| 28772 | 749 |     & @{text "|"} & @{verbatim "!!"} @{text idts} @{verbatim "."} @{text prop} & @{text "(0)"} \\
 | 
| 28773 | 750 |     & @{text "|"} & @{text "\<And>"} @{text idts} @{verbatim "."} @{text prop} & @{text "(0)"} \\
 | 
| 751 |     & @{text "|"} & @{verbatim OFCLASS} @{verbatim "("} @{text type} @{verbatim ","} @{text logic} @{verbatim ")"} \\
 | |
| 752 |     & @{text "|"} & @{verbatim SORT_CONSTRAINT} @{verbatim "("} @{text type} @{verbatim ")"} \\
 | |
| 28856 
5e009a80fe6d
Pure syntax: more coherent treatment of aprop, permanent TERM and &&&;
 wenzelm parents: 
28779diff
changeset | 753 |     & @{text "|"} & @{verbatim TERM} @{text logic} \\
 | 
| 28773 | 754 |     & @{text "|"} & @{verbatim PROP} @{text aprop} \\\\
 | 
| 28772 | 755 | |
| 28856 
5e009a80fe6d
Pure syntax: more coherent treatment of aprop, permanent TERM and &&&;
 wenzelm parents: 
28779diff
changeset | 756 |   @{syntax_def (inner) aprop} & = & @{verbatim "("} @{text aprop} @{verbatim ")"} \\
 | 
| 
5e009a80fe6d
Pure syntax: more coherent treatment of aprop, permanent TERM and &&&;
 wenzelm parents: 
28779diff
changeset | 757 |     & @{text "|"} & @{text "id  |  longid  |  var  |  "}@{verbatim "_"}@{text "  |  "}@{verbatim "..."} \\
 | 
| 
5e009a80fe6d
Pure syntax: more coherent treatment of aprop, permanent TERM and &&&;
 wenzelm parents: 
28779diff
changeset | 758 |     & @{text "|"} & @{verbatim CONST} @{text "id  |  "}@{verbatim CONST} @{text "longid"} \\
 | 
| 46287 | 759 |     & @{text "|"} & @{verbatim XCONST} @{text "id  |  "}@{verbatim XCONST} @{text "longid"} \\
 | 
| 28773 | 760 |     & @{text "|"} & @{text "logic\<^sup>(\<^sup>1\<^sup>0\<^sup>0\<^sup>0\<^sup>)  any\<^sup>(\<^sup>1\<^sup>0\<^sup>0\<^sup>0\<^sup>) \<dots> any\<^sup>(\<^sup>1\<^sup>0\<^sup>0\<^sup>0\<^sup>)"} & @{text "(999)"} \\\\
 | 
| 28770 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 761 | |
| 28778 | 762 |   @{syntax_def (inner) logic} & = & @{verbatim "("} @{text logic} @{verbatim ")"} \\
 | 
| 28772 | 763 |     & @{text "|"} & @{text "logic\<^sup>(\<^sup>4\<^sup>)"} @{verbatim "::"} @{text type} & @{text "(3)"} \\
 | 
| 28773 | 764 |     & @{text "|"} & @{text "id  |  longid  |  var  |  "}@{verbatim "_"}@{text "  |  "}@{verbatim "..."} \\
 | 
| 28856 
5e009a80fe6d
Pure syntax: more coherent treatment of aprop, permanent TERM and &&&;
 wenzelm parents: 
28779diff
changeset | 765 |     & @{text "|"} & @{verbatim CONST} @{text "id  |  "}@{verbatim CONST} @{text "longid"} \\
 | 
| 46287 | 766 |     & @{text "|"} & @{verbatim XCONST} @{text "id  |  "}@{verbatim XCONST} @{text "longid"} \\
 | 
| 28773 | 767 |     & @{text "|"} & @{text "logic\<^sup>(\<^sup>1\<^sup>0\<^sup>0\<^sup>0\<^sup>)  any\<^sup>(\<^sup>1\<^sup>0\<^sup>0\<^sup>0\<^sup>) \<dots> any\<^sup>(\<^sup>1\<^sup>0\<^sup>0\<^sup>0\<^sup>)"} & @{text "(999)"} \\
 | 
| 46287 | 768 |     & @{text "|"} & @{text "\<struct> index\<^sup>(\<^sup>1\<^sup>0\<^sup>0\<^sup>0\<^sup>)"} \\
 | 
| 28772 | 769 |     & @{text "|"} & @{verbatim "%"} @{text pttrns} @{verbatim "."} @{text "any\<^sup>(\<^sup>3\<^sup>)"} & @{text "(3)"} \\
 | 
| 28773 | 770 |     & @{text "|"} & @{text \<lambda>} @{text pttrns} @{verbatim "."} @{text "any\<^sup>(\<^sup>3\<^sup>)"} & @{text "(3)"} \\
 | 
| 46287 | 771 |     & @{text "|"} & @{verbatim op} @{verbatim "=="}@{text "  |  "}@{verbatim op} @{text "\<equiv>"}@{text "  |  "}@{verbatim op} @{verbatim "&&&"} \\
 | 
| 772 |     & @{text "|"} & @{verbatim op} @{verbatim "==>"}@{text "  |  "}@{verbatim op} @{text "\<Longrightarrow>"} \\
 | |
| 28772 | 773 |     & @{text "|"} & @{verbatim TYPE} @{verbatim "("} @{text type} @{verbatim ")"} \\\\
 | 
| 774 | ||
| 28778 | 775 |   @{syntax_def (inner) idt} & = & @{verbatim "("} @{text idt} @{verbatim ")"}@{text "  |  id  |  "}@{verbatim "_"} \\
 | 
| 28773 | 776 |     & @{text "|"} & @{text id} @{verbatim "::"} @{text type} & @{text "(0)"} \\
 | 
| 777 |     & @{text "|"} & @{verbatim "_"} @{verbatim "::"} @{text type} & @{text "(0)"} \\\\
 | |
| 28772 | 778 | |
| 46287 | 779 |   @{syntax_def (inner) index} & = & @{verbatim "\<^bsub>"} @{text "logic\<^sup>(\<^sup>0\<^sup>)"} @{verbatim "\<^esub>"}@{text "  |  |  \<index>"} \\\\
 | 
| 780 | ||
| 28778 | 781 |   @{syntax_def (inner) idts} & = & @{text "idt  |  idt\<^sup>(\<^sup>1\<^sup>) idts"} & @{text "(0)"} \\\\
 | 
| 28772 | 782 | |
| 28778 | 783 |   @{syntax_def (inner) pttrn} & = & @{text idt} \\\\
 | 
| 28772 | 784 | |
| 28778 | 785 |   @{syntax_def (inner) pttrns} & = & @{text "pttrn  |  pttrn\<^sup>(\<^sup>1\<^sup>) pttrns"} & @{text "(0)"} \\\\
 | 
| 28774 | 786 | |
| 28778 | 787 |   @{syntax_def (inner) type} & = & @{verbatim "("} @{text type} @{verbatim ")"} \\
 | 
| 28773 | 788 |     & @{text "|"} & @{text "tid  |  tvar  |  "}@{verbatim "_"} \\
 | 
| 789 |     & @{text "|"} & @{text "tid"} @{verbatim "::"} @{text "sort  |  tvar  "}@{verbatim "::"} @{text "sort  |  "}@{verbatim "_"} @{verbatim "::"} @{text "sort"} \\
 | |
| 46287 | 790 |     & @{text "|"} & @{text "type_name  |  type\<^sup>(\<^sup>1\<^sup>0\<^sup>0\<^sup>0\<^sup>) type_name"} \\
 | 
| 791 |     & @{text "|"} & @{verbatim "("} @{text type} @{verbatim ","} @{text "\<dots>"} @{verbatim ","} @{text type} @{verbatim ")"} @{text type_name} \\
 | |
| 28772 | 792 |     & @{text "|"} & @{text "type\<^sup>(\<^sup>1\<^sup>)"} @{verbatim "=>"} @{text type} & @{text "(0)"} \\
 | 
| 28773 | 793 |     & @{text "|"} & @{text "type\<^sup>(\<^sup>1\<^sup>)"} @{text "\<Rightarrow>"} @{text type} & @{text "(0)"} \\
 | 
| 794 |     & @{text "|"} & @{verbatim "["} @{text type} @{verbatim ","} @{text "\<dots>"} @{verbatim ","} @{text type} @{verbatim "]"} @{verbatim "=>"} @{text type} & @{text "(0)"} \\
 | |
| 46287 | 795 |     & @{text "|"} & @{verbatim "["} @{text type} @{verbatim ","} @{text "\<dots>"} @{verbatim ","} @{text type} @{verbatim "]"} @{text "\<Rightarrow>"} @{text type} & @{text "(0)"} \\
 | 
| 796 |   @{syntax_def (inner) type_name} & = & @{text "id  |  longid"} \\\\
 | |
| 28772 | 797 | |
| 46287 | 798 |   @{syntax_def (inner) sort} & = & @{syntax class_name}~@{text "  |  "}@{verbatim "{}"} \\
 | 
| 799 |     & @{text "|"} & @{verbatim "{"} @{syntax class_name} @{verbatim ","} @{text "\<dots>"} @{verbatim ","} @{syntax class_name} @{verbatim "}"} \\
 | |
| 800 |   @{syntax_def (inner) class_name} & = & @{text "id  |  longid"} \\
 | |
| 28773 | 801 |   \end{supertabular}
 | 
| 28770 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 802 |   \end{center}
 | 
| 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 803 | |
| 28774 | 804 |   \medskip Here literal terminals are printed @{verbatim "verbatim"};
 | 
| 805 |   see also \secref{sec:inner-lex} for further token categories of the
 | |
| 806 | inner syntax. The meaning of the nonterminals defined by the above | |
| 807 | grammar is as follows: | |
| 28770 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 808 | |
| 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 809 |   \begin{description}
 | 
| 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 810 | |
| 28778 | 811 |   \item @{syntax_ref (inner) any} denotes any term.
 | 
| 28770 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 812 | |
| 28778 | 813 |   \item @{syntax_ref (inner) prop} denotes meta-level propositions,
 | 
| 814 |   which are terms of type @{typ prop}.  The syntax of such formulae of
 | |
| 815 | the meta-logic is carefully distinguished from usual conventions for | |
| 816 |   object-logics.  In particular, plain @{text "\<lambda>"}-term notation is
 | |
| 817 |   \emph{not} recognized as @{syntax (inner) prop}.
 | |
| 28770 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 818 | |
| 28778 | 819 |   \item @{syntax_ref (inner) aprop} denotes atomic propositions, which
 | 
| 820 |   are embedded into regular @{syntax (inner) prop} by means of an
 | |
| 821 |   explicit @{verbatim PROP} token.
 | |
| 28770 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 822 | |
| 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 823 |   Terms of type @{typ prop} with non-constant head, e.g.\ a plain
 | 
| 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 824 |   variable, are printed in this form.  Constants that yield type @{typ
 | 
| 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 825 | prop} are expected to provide their own concrete syntax; otherwise | 
| 28778 | 826 |   the printed version will appear like @{syntax (inner) logic} and
 | 
| 827 |   cannot be parsed again as @{syntax (inner) prop}.
 | |
| 28770 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 828 | |
| 28778 | 829 |   \item @{syntax_ref (inner) logic} denotes arbitrary terms of a
 | 
| 830 |   logical type, excluding type @{typ prop}.  This is the main
 | |
| 831 |   syntactic category of object-logic entities, covering plain @{text
 | |
| 832 | \<lambda>}-term notation (variables, abstraction, application), plus | |
| 833 | anything defined by the user. | |
| 28770 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 834 | |
| 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 835 | When specifying notation for logical entities, all logical types | 
| 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 836 |   (excluding @{typ prop}) are \emph{collapsed} to this single category
 | 
| 28778 | 837 |   of @{syntax (inner) logic}.
 | 
| 28770 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 838 | |
| 46287 | 839 |   \item @{syntax_ref (inner) index} denotes an optional index term for
 | 
| 51657 
3db1bbc82d8d
more accurate documentation of "(structure)" mixfix;
 wenzelm parents: 
51654diff
changeset | 840 |   indexed syntax.  If omitted, it refers to the first @{keyword_ref
 | 
| 46287 | 841 |   "structure"} variable in the context.  The special dummy ``@{text
 | 
| 842 | "\<index>"}'' serves as pattern variable in mixfix annotations that | |
| 843 | introduce indexed notation. | |
| 844 | ||
| 28778 | 845 |   \item @{syntax_ref (inner) idt} denotes identifiers, possibly
 | 
| 846 | constrained by types. | |
| 28770 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 847 | |
| 28778 | 848 |   \item @{syntax_ref (inner) idts} denotes a sequence of @{syntax_ref
 | 
| 849 | (inner) idt}. This is the most basic category for variables in | |
| 850 |   iterated binders, such as @{text "\<lambda>"} or @{text "\<And>"}.
 | |
| 28770 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 851 | |
| 28778 | 852 |   \item @{syntax_ref (inner) pttrn} and @{syntax_ref (inner) pttrns}
 | 
| 853 | denote patterns for abstraction, cases bindings etc. In Pure, these | |
| 854 |   categories start as a merely copy of @{syntax (inner) idt} and
 | |
| 855 |   @{syntax (inner) idts}, respectively.  Object-logics may add
 | |
| 856 | additional productions for binding forms. | |
| 28770 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 857 | |
| 28778 | 858 |   \item @{syntax_ref (inner) type} denotes types of the meta-logic.
 | 
| 28770 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 859 | |
| 28778 | 860 |   \item @{syntax_ref (inner) sort} denotes meta-level sorts.
 | 
| 28770 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 861 | |
| 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 862 |   \end{description}
 | 
| 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 863 | |
| 28774 | 864 | Here are some further explanations of certain syntax features. | 
| 28773 | 865 | |
| 866 |   \begin{itemize}
 | |
| 28770 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 867 | |
| 28778 | 868 |   \item In @{syntax (inner) idts}, note that @{text "x :: nat y"} is
 | 
| 869 |   parsed as @{text "x :: (nat y)"}, treating @{text y} like a type
 | |
| 870 |   constructor applied to @{text nat}.  To avoid this interpretation,
 | |
| 871 |   write @{text "(x :: nat) y"} with explicit parentheses.
 | |
| 28773 | 872 | |
| 873 |   \item Similarly, @{text "x :: nat y :: nat"} is parsed as @{text "x ::
 | |
| 28770 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 874 |   (nat y :: nat)"}.  The correct form is @{text "(x :: nat) (y ::
 | 
| 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 875 |   nat)"}, or @{text "(x :: nat) y :: nat"} if @{text y} is last in the
 | 
| 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 876 | sequence of identifiers. | 
| 28773 | 877 | |
| 878 | \item Type constraints for terms bind very weakly. For example, | |
| 879 |   @{text "x < y :: nat"} is normally parsed as @{text "(x < y) ::
 | |
| 880 |   nat"}, unless @{text "<"} has a very low priority, in which case the
 | |
| 881 |   input is likely to be ambiguous.  The correct form is @{text "x < (y
 | |
| 882 | :: nat)"}. | |
| 883 | ||
| 884 | \item Constraints may be either written with two literal colons | |
| 885 |   ``@{verbatim "::"}'' or the double-colon symbol @{verbatim "\<Colon>"},
 | |
| 28774 | 886 |   which actually looks exactly the same in some {\LaTeX} styles.
 | 
| 28773 | 887 | |
| 28774 | 888 | \item Dummy variables (written as underscore) may occur in different | 
| 889 | roles. | |
| 28773 | 890 | |
| 891 |   \begin{description}
 | |
| 892 | ||
| 28774 | 893 |   \item A type ``@{text "_"}'' or ``@{text "_ :: sort"}'' acts like an
 | 
| 894 | anonymous inference parameter, which is filled-in according to the | |
| 895 | most general type produced by the type-checking phase. | |
| 28770 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 896 | |
| 28774 | 897 |   \item A bound ``@{text "_"}'' refers to a vacuous abstraction, where
 | 
| 898 | the body does not refer to the binding introduced here. As in the | |
| 899 |   term @{term "\<lambda>x _. x"}, which is @{text "\<alpha>"}-equivalent to @{text
 | |
| 900 | "\<lambda>x y. x"}. | |
| 28773 | 901 | |
| 28774 | 902 |   \item A free ``@{text "_"}'' refers to an implicit outer binding.
 | 
| 903 |   Higher definitional packages usually allow forms like @{text "f x _
 | |
| 904 | = x"}. | |
| 28773 | 905 | |
| 28774 | 906 |   \item A schematic ``@{text "_"}'' (within a term pattern, see
 | 
| 907 |   \secref{sec:term-decls}) refers to an anonymous variable that is
 | |
| 908 | implicitly abstracted over its context of locally bound variables. | |
| 909 |   For example, this allows pattern matching of @{text "{x. f x = g
 | |
| 910 |   x}"} against @{text "{x. _ = _}"}, or even @{text "{_. _ = _}"} by
 | |
| 911 | using both bound and schematic dummies. | |
| 28773 | 912 | |
| 913 |   \end{description}
 | |
| 914 | ||
| 28774 | 915 |   \item The three literal dots ``@{verbatim "..."}'' may be also
 | 
| 916 |   written as ellipsis symbol @{verbatim "\<dots>"}.  In both cases this
 | |
| 917 | refers to a special schematic variable, which is bound in the | |
| 918 | context. This special term abbreviation works nicely with | |
| 919 |   calculational reasoning (\secref{sec:calculation}).
 | |
| 920 | ||
| 46287 | 921 |   \item @{verbatim CONST} ensures that the given identifier is treated
 | 
| 922 | as constant term, and passed through the parse tree in fully | |
| 923 | internalized form. This is particularly relevant for translation | |
| 924 |   rules (\secref{sec:syn-trans}), notably on the RHS.
 | |
| 925 | ||
| 926 |   \item @{verbatim XCONST} is similar to @{verbatim CONST}, but
 | |
| 927 | retains the constant name as given. This is only relevant to | |
| 928 |   translation rules (\secref{sec:syn-trans}), notably on the LHS.
 | |
| 929 | ||
| 28773 | 930 |   \end{itemize}
 | 
| 28770 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 931 | *} | 
| 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 932 | |
| 28777 | 933 | |
| 46282 | 934 | subsection {* Inspecting the syntax *}
 | 
| 28777 | 935 | |
| 46282 | 936 | text {*
 | 
| 937 |   \begin{matharray}{rcl}
 | |
| 938 |     @{command_def "print_syntax"}@{text "\<^sup>*"} & : & @{text "context \<rightarrow>"} \\
 | |
| 939 |   \end{matharray}
 | |
| 28777 | 940 | |
| 46282 | 941 |   \begin{description}
 | 
| 942 | ||
| 943 |   \item @{command "print_syntax"} prints the inner syntax of the
 | |
| 944 | current context. The output can be quite large; the most important | |
| 945 | sections are explained below. | |
| 28777 | 946 | |
| 46282 | 947 |   \begin{description}
 | 
| 28777 | 948 | |
| 46282 | 949 |   \item @{text "lexicon"} lists the delimiters of the inner token
 | 
| 950 |   language; see \secref{sec:inner-lex}.
 | |
| 28777 | 951 | |
| 46282 | 952 |   \item @{text "prods"} lists the productions of the underlying
 | 
| 953 |   priority grammar; see \secref{sec:priority-grammar}.
 | |
| 28777 | 954 | |
| 46282 | 955 |   The nonterminal @{text "A\<^sup>(\<^sup>p\<^sup>)"} is rendered in plain text as @{text
 | 
| 956 | "A[p]"}; delimiters are quoted. Many productions have an extra | |
| 957 |   @{text "\<dots> => name"}.  These names later become the heads of parse
 | |
| 958 | trees; they also guide the pretty printer. | |
| 28777 | 959 | |
| 46282 | 960 |   Productions without such parse tree names are called \emph{copy
 | 
| 961 | productions}. Their right-hand side must have exactly one | |
| 962 | nonterminal symbol (or named token). The parser does not create a | |
| 963 | new parse tree node for copy productions, but simply returns the | |
| 964 | parse tree of the right-hand symbol. | |
| 965 | ||
| 966 | If the right-hand side of a copy production consists of a single | |
| 967 |   nonterminal without any delimiters, then it is called a \emph{chain
 | |
| 968 | production}. Chain productions act as abbreviations: conceptually, | |
| 969 | they are removed from the grammar by adding new productions. | |
| 970 | Priority information attached to chain productions is ignored; only | |
| 971 |   the dummy value @{text "-1"} is displayed.
 | |
| 972 | ||
| 973 |   \item @{text "print modes"} lists the alternative print modes
 | |
| 974 |   provided by this grammar; see \secref{sec:print-modes}.
 | |
| 28777 | 975 | |
| 46282 | 976 |   \item @{text "parse_rules"} and @{text "print_rules"} relate to
 | 
| 977 |   syntax translations (macros); see \secref{sec:syn-trans}.
 | |
| 978 | ||
| 979 |   \item @{text "parse_ast_translation"} and @{text
 | |
| 980 | "print_ast_translation"} list sets of constants that invoke | |
| 981 | translation functions for abstract syntax trees, which are only | |
| 982 |   required in very special situations; see \secref{sec:tr-funs}.
 | |
| 28777 | 983 | |
| 46282 | 984 |   \item @{text "parse_translation"} and @{text "print_translation"}
 | 
| 985 | list the sets of constants that invoke regular translation | |
| 986 |   functions; see \secref{sec:tr-funs}.
 | |
| 29157 | 987 | |
| 46282 | 988 |   \end{description}
 | 
| 989 | ||
| 990 |   \end{description}
 | |
| 28777 | 991 | *} | 
| 28774 | 992 | |
| 28770 
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
 wenzelm parents: 
28769diff
changeset | 993 | |
| 46291 | 994 | subsection {* Ambiguity of parsed expressions *}
 | 
| 995 | ||
| 996 | text {*
 | |
| 997 |   \begin{tabular}{rcll}
 | |
| 46512 
4f9f61f9b535
simplified configuration options for syntax ambiguity;
 wenzelm parents: 
46506diff
changeset | 998 |     @{attribute_def syntax_ambiguity_warning} & : & @{text attribute} & default @{text true} \\
 | 
| 46506 
c7faa011bfa7
simplified configuration options for syntax ambiguity;
 wenzelm parents: 
46494diff
changeset | 999 |     @{attribute_def syntax_ambiguity_limit} & : & @{text attribute} & default @{text 10} \\
 | 
| 46291 | 1000 |   \end{tabular}
 | 
| 1001 | ||
| 1002 | Depending on the grammar and the given input, parsing may be | |
| 1003 | ambiguous. Isabelle lets the Earley parser enumerate all possible | |
| 1004 | parse trees, and then tries to make the best out of the situation. | |
| 1005 | Terms that cannot be type-checked are filtered out, which often | |
| 1006 | leads to a unique result in the end. Unlike regular type | |
| 1007 | reconstruction, which is applied to the whole collection of input | |
| 1008 | terms simultaneously, the filtering stage only treats each given | |
| 1009 | term in isolation. Filtering is also not attempted for individual | |
| 1010 |   types or raw ASTs (as required for @{command translations}).
 | |
| 1011 | ||
| 1012 | Certain warning or error messages are printed, depending on the | |
| 1013 | situation and the given configuration options. Parsing ultimately | |
| 1014 | fails, if multiple results remain after the filtering phase. | |
| 1015 | ||
| 1016 |   \begin{description}
 | |
| 1017 | ||
| 46512 
4f9f61f9b535
simplified configuration options for syntax ambiguity;
 wenzelm parents: 
46506diff
changeset | 1018 |   \item @{attribute syntax_ambiguity_warning} controls output of
 | 
| 
4f9f61f9b535
simplified configuration options for syntax ambiguity;
 wenzelm parents: 
46506diff
changeset | 1019 | explicit warning messages about syntax ambiguity. | 
| 46291 | 1020 | |
| 46506 
c7faa011bfa7
simplified configuration options for syntax ambiguity;
 wenzelm parents: 
46494diff
changeset | 1021 |   \item @{attribute syntax_ambiguity_limit} determines the number of
 | 
| 46291 | 1022 | resulting parse trees that are shown as part of the printed message | 
| 1023 | in case of an ambiguity. | |
| 1024 | ||
| 1025 |   \end{description}
 | |
| 1026 | *} | |
| 1027 | ||
| 1028 | ||
| 48113 | 1029 | section {* Syntax transformations \label{sec:syntax-transformations} *}
 | 
| 1030 | ||
| 1031 | text {* The inner syntax engine of Isabelle provides separate
 | |
| 52413 | 1032 | mechanisms to transform parse trees either via rewrite systems on | 
| 48113 | 1033 |   first-order ASTs (\secref{sec:syn-trans}), or ML functions on ASTs
 | 
| 1034 |   or syntactic @{text "\<lambda>"}-terms (\secref{sec:tr-funs}).  This works
 | |
| 1035 | both for parsing and printing, as outlined in | |
| 1036 |   \figref{fig:parse-print}.
 | |
| 1037 | ||
| 1038 |   \begin{figure}[htbp]
 | |
| 1039 |   \begin{center}
 | |
| 1040 |   \begin{tabular}{cl}
 | |
| 1041 | string & \\ | |
| 1042 |   @{text "\<down>"}     & lexer + parser \\
 | |
| 1043 | parse tree & \\ | |
| 1044 |   @{text "\<down>"}     & parse AST translation \\
 | |
| 1045 | AST & \\ | |
| 1046 |   @{text "\<down>"}     & AST rewriting (macros) \\
 | |
| 1047 | AST & \\ | |
| 1048 |   @{text "\<down>"}     & parse translation \\
 | |
| 1049 | --- pre-term --- & \\ | |
| 1050 |   @{text "\<down>"}     & print translation \\
 | |
| 1051 | AST & \\ | |
| 1052 |   @{text "\<down>"}     & AST rewriting (macros) \\
 | |
| 1053 | AST & \\ | |
| 1054 |   @{text "\<down>"}     & print AST translation \\
 | |
| 1055 | string & | |
| 1056 |   \end{tabular}
 | |
| 1057 |   \end{center}
 | |
| 1058 |   \caption{Parsing and printing with translations}\label{fig:parse-print}
 | |
| 1059 |   \end{figure}
 | |
| 1060 | ||
| 1061 | These intermediate syntax tree formats eventually lead to a pre-term | |
| 1062 | with all names and binding scopes resolved, but most type | |
| 1063 | information still missing. Explicit type constraints might be given by | |
| 1064 | the user, or implicit position information by the system --- both | |
| 48816 | 1065 | need to be passed-through carefully by syntax transformations. | 
| 48113 | 1066 | |
| 1067 |   Pre-terms are further processed by the so-called \emph{check} and
 | |
| 1068 |   \emph{unckeck} phases that are intertwined with type-inference (see
 | |
| 1069 |   also \cite{isabelle-implementation}).  The latter allows to operate
 | |
| 1070 | on higher-order abstract syntax with proper binding and type | |
| 1071 | information already available. | |
| 1072 | ||
| 1073 | As a rule of thumb, anything that manipulates bindings of variables | |
| 1074 | or constants needs to be implemented as syntax transformation (see | |
| 1075 | below). Anything else is better done via check/uncheck: a prominent | |
| 1076 |   example application is the @{command abbreviation} concept of
 | |
| 1077 | Isabelle/Pure. *} | |
| 1078 | ||
| 1079 | ||
| 48115 | 1080 | subsection {* Abstract syntax trees \label{sec:ast} *}
 | 
| 48113 | 1081 | |
| 48114 | 1082 | text {* The ML datatype @{ML_type Ast.ast} explicitly represents the
 | 
| 1083 | intermediate AST format that is used for syntax rewriting | |
| 1084 |   (\secref{sec:syn-trans}).  It is defined in ML as follows:
 | |
| 1085 |   \begin{ttbox}
 | |
| 1086 | datatype ast = | |
| 1087 | Constant of string | | |
| 1088 | Variable of string | | |
| 1089 | Appl of ast list | |
| 1090 |   \end{ttbox}
 | |
| 1091 | ||
| 1092 | An AST is either an atom (constant or variable) or a list of (at | |
| 1093 | least two) subtrees. Occasional diagnostic output of ASTs uses | |
| 1094 | notation that resembles S-expression of LISP. Constant atoms are | |
| 1095 | shown as quoted strings, variable atoms as non-quoted strings and | |
| 1096 | applications as a parenthesized list of subtrees. For example, the | |
| 1097 | AST | |
| 1098 |   @{ML [display] "Ast.Appl
 | |
| 1099 | [Ast.Constant \"_abs\", Ast.Variable \"x\", Ast.Variable \"t\"]"} | |
| 1100 |   is pretty-printed as @{verbatim "(\"_abs\" x t)"}.  Note that
 | |
| 1101 |   @{verbatim "()"} and @{verbatim "(x)"} are excluded as ASTs, because
 | |
| 1102 | they have too few subtrees. | |
| 1103 | ||
| 1104 | \medskip AST application is merely a pro-forma mechanism to indicate | |
| 1105 |   certain syntactic structures.  Thus @{verbatim "(c a b)"} could mean
 | |
| 1106 | either term application or type application, depending on the | |
| 1107 | syntactic context. | |
| 1108 | ||
| 1109 |   Nested application like @{verbatim "((\"_abs\" x t) u)"} is also
 | |
| 1110 | possible, but ASTs are definitely first-order: the syntax constant | |
| 1111 |   @{verbatim "\"_abs\""} does not bind the @{verbatim x} in any way.
 | |
| 1112 | Proper bindings are introduced in later stages of the term syntax, | |
| 1113 |   where @{verbatim "(\"_abs\" x t)"} becomes an @{ML Abs} node and
 | |
| 1114 |   occurrences of @{verbatim x} in @{verbatim t} are replaced by bound
 | |
| 1115 | variables (represented as de-Bruijn indices). | |
| 48113 | 1116 | *} | 
| 1117 | ||
| 1118 | ||
| 48115 | 1119 | subsubsection {* AST constants versus variables *}
 | 
| 48114 | 1120 | |
| 1121 | text {* Depending on the situation --- input syntax, output syntax,
 | |
| 56582 | 1122 |   translation patterns --- the distinction of atomic ASTs as @{ML
 | 
| 48114 | 1123 |   Ast.Constant} versus @{ML Ast.Variable} serves slightly different
 | 
| 1124 | purposes. | |
| 1125 | ||
| 1126 |   Input syntax of a term such as @{text "f a b = c"} does not yet
 | |
| 1127 |   indicate the scopes of atomic entities @{text "f, a, b, c"}: they
 | |
| 1128 | could be global constants or local variables, even bound ones | |
| 1129 |   depending on the context of the term.  @{ML Ast.Variable} leaves
 | |
| 1130 | this choice still open: later syntax layers (or translation | |
| 1131 | functions) may capture such a variable to determine its role | |
| 1132 | specifically, to make it a constant, bound variable, free variable | |
| 1133 | etc. In contrast, syntax translations that introduce already known | |
| 1134 |   constants would rather do it via @{ML Ast.Constant} to prevent
 | |
| 1135 | accidental re-interpretation later on. | |
| 1136 | ||
| 1137 |   Output syntax turns term constants into @{ML Ast.Constant} and
 | |
| 1138 |   variables (free or schematic) into @{ML Ast.Variable}.  This
 | |
| 1139 |   information is precise when printing fully formal @{text "\<lambda>"}-terms.
 | |
| 1140 | ||
| 52413 | 1141 |   \medskip AST translation patterns (\secref{sec:syn-trans}) that
 | 
| 1142 | represent terms cannot distinguish constants and variables | |
| 1143 |   syntactically.  Explicit indication of @{text "CONST c"} inside the
 | |
| 1144 |   term language is required, unless @{text "c"} is known as special
 | |
| 1145 |   \emph{syntax constant} (see also @{command syntax}).  It is also
 | |
| 1146 |   possible to use @{command syntax} declarations (without mixfix
 | |
| 1147 | annotation) to enforce that certain unqualified names are always | |
| 1148 | treated as constant within the syntax machinery. | |
| 48114 | 1149 | |
| 52413 | 1150 | The situation is simpler for ASTs that represent types or sorts, | 
| 1151 | since the concrete syntax already distinguishes type variables from | |
| 1152 |   type constants (constructors).  So @{text "('a, 'b) foo"}
 | |
| 1153 |   corresponds to an AST application of some constant for @{text foo}
 | |
| 1154 |   and variable arguments for @{text "'a"} and @{text "'b"}.  Note that
 | |
| 1155 | the postfix application is merely a feature of the concrete syntax, | |
| 1156 | while in the AST the constructor occurs in head position. *} | |
| 48114 | 1157 | |
| 1158 | ||
| 1159 | subsubsection {* Authentic syntax names *}
 | |
| 1160 | ||
| 1161 | text {* Naming constant entities within ASTs is another delicate
 | |
| 52413 | 1162 | issue. Unqualified names are resolved in the name space tables in | 
| 48114 | 1163 | the last stage of parsing, after all translations have been applied. | 
| 1164 | Since syntax transformations do not know about this later name | |
| 52413 | 1165 | resolution, there can be surprises in boundary cases. | 
| 48114 | 1166 | |
| 1167 |   \emph{Authentic syntax names} for @{ML Ast.Constant} avoid this
 | |
| 1168 | problem: the fully-qualified constant name with a special prefix for | |
| 1169 |   its formal category (@{text "class"}, @{text "type"}, @{text
 | |
| 1170 |   "const"}, @{text "fixed"}) represents the information faithfully
 | |
| 1171 | within the untyped AST format. Accidental overlap with free or | |
| 1172 | bound variables is excluded as well. Authentic syntax names work | |
| 1173 | implicitly in the following situations: | |
| 1174 | ||
| 1175 |   \begin{itemize}
 | |
| 1176 | ||
| 1177 | \item Input of term constants (or fixed variables) that are | |
| 1178 |   introduced by concrete syntax via @{command notation}: the
 | |
| 1179 | correspondence of a particular grammar production to some known term | |
| 1180 | entity is preserved. | |
| 1181 | ||
| 48816 | 1182 | \item Input of type constants (constructors) and type classes --- | 
| 48114 | 1183 | thanks to explicit syntactic distinction independently on the | 
| 1184 | context. | |
| 1185 | ||
| 1186 | \item Output of term constants, type constants, type classes --- | |
| 1187 | this information is already available from the internal term to be | |
| 1188 | printed. | |
| 1189 | ||
| 1190 |   \end{itemize}
 | |
| 1191 | ||
| 1192 | In other words, syntax transformations that operate on input terms | |
| 48816 | 1193 | written as prefix applications are difficult to make robust. | 
| 1194 | Luckily, this case rarely occurs in practice, because syntax forms | |
| 52413 | 1195 | to be translated usually correspond to some concrete notation. *} | 
| 48114 | 1196 | |
| 1197 | ||
| 48113 | 1198 | subsection {* Raw syntax and translations \label{sec:syn-trans} *}
 | 
| 28762 | 1199 | |
| 1200 | text {*
 | |
| 48117 | 1201 |   \begin{tabular}{rcll}
 | 
| 41229 
d797baa3d57c
replaced command 'nonterminals' by slightly modernized version 'nonterminal';
 wenzelm parents: 
40879diff
changeset | 1202 |     @{command_def "nonterminal"} & : & @{text "theory \<rightarrow> theory"} \\
 | 
| 28762 | 1203 |     @{command_def "syntax"} & : & @{text "theory \<rightarrow> theory"} \\
 | 
| 1204 |     @{command_def "no_syntax"} & : & @{text "theory \<rightarrow> theory"} \\
 | |
| 1205 |     @{command_def "translations"} & : & @{text "theory \<rightarrow> theory"} \\
 | |
| 1206 |     @{command_def "no_translations"} & : & @{text "theory \<rightarrow> theory"} \\
 | |
| 48117 | 1207 |     @{attribute_def syntax_ast_trace} & : & @{text attribute} & default @{text false} \\
 | 
| 1208 |     @{attribute_def syntax_ast_stats} & : & @{text attribute} & default @{text false} \\
 | |
| 1209 |   \end{tabular}
 | |
| 28762 | 1210 | |
| 46292 | 1211 | Unlike mixfix notation for existing formal entities | 
| 1212 |   (\secref{sec:notation}), raw syntax declarations provide full access
 | |
| 48115 | 1213 | to the priority grammar of the inner syntax, without any sanity | 
| 1214 | checks. This includes additional syntactic categories (via | |
| 1215 |   @{command nonterminal}) and free-form grammar productions (via
 | |
| 1216 |   @{command syntax}).  Additional syntax translations (or macros, via
 | |
| 1217 |   @{command translations}) are required to turn resulting parse trees
 | |
| 1218 | into proper representations of formal entities again. | |
| 46292 | 1219 | |
| 55112 
b1a5d603fd12
prefer rail cartouche -- avoid back-slashed quotes;
 wenzelm parents: 
55108diff
changeset | 1220 |   @{rail \<open>
 | 
| 42596 
6c621a9d612a
modernized rail diagrams using @{rail} antiquotation;
 wenzelm parents: 
42358diff
changeset | 1221 |     @@{command nonterminal} (@{syntax name} + @'and')
 | 
| 28762 | 1222 | ; | 
| 46494 
ea2ae63336f3
clarified outer syntax "constdecl", which is only local to some rail diagrams;
 wenzelm parents: 
46483diff
changeset | 1223 |     (@@{command syntax} | @@{command no_syntax}) @{syntax mode}? (constdecl +)
 | 
| 28762 | 1224 | ; | 
| 42596 
6c621a9d612a
modernized rail diagrams using @{rail} antiquotation;
 wenzelm parents: 
42358diff
changeset | 1225 |     (@@{command translations} | @@{command no_translations})
 | 
| 
6c621a9d612a
modernized rail diagrams using @{rail} antiquotation;
 wenzelm parents: 
42358diff
changeset | 1226 |       (transpat ('==' | '=>' | '<=' | '\<rightleftharpoons>' | '\<rightharpoonup>' | '\<leftharpoondown>') transpat +)
 | 
| 28762 | 1227 | ; | 
| 1228 | ||
| 46494 
ea2ae63336f3
clarified outer syntax "constdecl", which is only local to some rail diagrams;
 wenzelm parents: 
46483diff
changeset | 1229 |     constdecl: @{syntax name} '::' @{syntax type} @{syntax mixfix}?
 | 
| 
ea2ae63336f3
clarified outer syntax "constdecl", which is only local to some rail diagrams;
 wenzelm parents: 
46483diff
changeset | 1230 | ; | 
| 42596 
6c621a9d612a
modernized rail diagrams using @{rail} antiquotation;
 wenzelm parents: 
42358diff
changeset | 1231 |     mode: ('(' ( @{syntax name} | @'output' | @{syntax name} @'output' ) ')')
 | 
| 28762 | 1232 | ; | 
| 42596 
6c621a9d612a
modernized rail diagrams using @{rail} antiquotation;
 wenzelm parents: 
42358diff
changeset | 1233 |     transpat: ('(' @{syntax nameref} ')')? @{syntax string}
 | 
| 55112 
b1a5d603fd12
prefer rail cartouche -- avoid back-slashed quotes;
 wenzelm parents: 
55108diff
changeset | 1234 | \<close>} | 
| 28762 | 1235 | |
| 1236 |   \begin{description}
 | |
| 46282 | 1237 | |
| 41229 
d797baa3d57c
replaced command 'nonterminals' by slightly modernized version 'nonterminal';
 wenzelm parents: 
40879diff
changeset | 1238 |   \item @{command "nonterminal"}~@{text c} declares a type
 | 
| 28762 | 1239 |   constructor @{text c} (without arguments) to act as purely syntactic
 | 
| 1240 | type: a nonterminal symbol of the inner syntax. | |
| 1241 | ||
| 46292 | 1242 |   \item @{command "syntax"}~@{text "(mode) c :: \<sigma> (mx)"} augments the
 | 
| 1243 | priority grammar and the pretty printer table for the given print | |
| 1244 |   mode (default @{verbatim "\"\""}). An optional keyword @{keyword_ref
 | |
| 1245 | "output"} means that only the pretty printer table is affected. | |
| 1246 | ||
| 1247 |   Following \secref{sec:mixfix}, the mixfix annotation @{text "mx =
 | |
| 1248 |   template ps q"} together with type @{text "\<sigma> = \<tau>\<^sub>1 \<Rightarrow> \<dots> \<tau>\<^sub>n \<Rightarrow> \<tau>"} and
 | |
| 1249 |   specify a grammar production.  The @{text template} contains
 | |
| 1250 |   delimiter tokens that surround @{text "n"} argument positions
 | |
| 1251 |   (@{verbatim "_"}).  The latter correspond to nonterminal symbols
 | |
| 1252 |   @{text "A\<^sub>i"} derived from the argument types @{text "\<tau>\<^sub>i"} as
 | |
| 1253 | follows: | |
| 1254 |   \begin{itemize}
 | |
| 1255 | ||
| 1256 |   \item @{text "prop"} if @{text "\<tau>\<^sub>i = prop"}
 | |
| 1257 | ||
| 1258 |   \item @{text "logic"} if @{text "\<tau>\<^sub>i = (\<dots>)\<kappa>"} for logical type
 | |
| 1259 |   constructor @{text "\<kappa> \<noteq> prop"}
 | |
| 1260 | ||
| 1261 |   \item @{text any} if @{text "\<tau>\<^sub>i = \<alpha>"} for type variables
 | |
| 1262 | ||
| 1263 |   \item @{text "\<kappa>"} if @{text "\<tau>\<^sub>i = \<kappa>"} for nonterminal @{text "\<kappa>"}
 | |
| 1264 | (syntactic type constructor) | |
| 1265 | ||
| 1266 |   \end{itemize}
 | |
| 1267 | ||
| 1268 |   Each @{text "A\<^sub>i"} is decorated by priority @{text "p\<^sub>i"} from the
 | |
| 56582 | 1269 |   given list @{text "ps"}; missing priorities default to 0.
 | 
| 46292 | 1270 | |
| 1271 | The resulting nonterminal of the production is determined similarly | |
| 1272 |   from type @{text "\<tau>"}, with priority @{text "q"} and default 1000.
 | |
| 1273 | ||
| 1274 |   \medskip Parsing via this production produces parse trees @{text
 | |
| 1275 | "t\<^sub>1, \<dots>, t\<^sub>n"} for the argument slots. The resulting parse tree is | |
| 1276 |   composed as @{text "c t\<^sub>1 \<dots> t\<^sub>n"}, by using the syntax constant @{text
 | |
| 1277 | "c"} of the syntax declaration. | |
| 1278 | ||
| 1279 | Such syntactic constants are invented on the spot, without formal | |
| 1280 | check wrt.\ existing declarations. It is conventional to use plain | |
| 1281 |   identifiers prefixed by a single underscore (e.g.\ @{text
 | |
| 1282 | "_foobar"}). Names should be chosen with care, to avoid clashes | |
| 48816 | 1283 | with other syntax declarations. | 
| 46292 | 1284 | |
| 1285 |   \medskip The special case of copy production is specified by @{text
 | |
| 1286 |   "c = "}@{verbatim "\"\""} (empty string).  It means that the
 | |
| 1287 |   resulting parse tree @{text "t"} is copied directly, without any
 | |
| 1288 | further decoration. | |
| 46282 | 1289 | |
| 28762 | 1290 |   \item @{command "no_syntax"}~@{text "(mode) decls"} removes grammar
 | 
| 1291 |   declarations (and translations) resulting from @{text decls}, which
 | |
| 1292 |   are interpreted in the same manner as for @{command "syntax"} above.
 | |
| 46282 | 1293 | |
| 28762 | 1294 |   \item @{command "translations"}~@{text rules} specifies syntactic
 | 
| 48115 | 1295 | translation rules (i.e.\ macros) as first-order rewrite rules on | 
| 48816 | 1296 |   ASTs (\secref{sec:ast}).  The theory context maintains two
 | 
| 48115 | 1297 |   independent lists translation rules: parse rules (@{verbatim "=>"}
 | 
| 1298 |   or @{text "\<rightharpoonup>"}) and print rules (@{verbatim "<="} or @{text "\<leftharpoondown>"}).
 | |
| 1299 | For convenience, both can be specified simultaneously as parse~/ | |
| 1300 |   print rules (@{verbatim "=="} or @{text "\<rightleftharpoons>"}).
 | |
| 1301 | ||
| 28762 | 1302 | Translation patterns may be prefixed by the syntactic category to be | 
| 48115 | 1303 |   used for parsing; the default is @{text logic} which means that
 | 
| 1304 | regular term syntax is used. Both sides of the syntax translation | |
| 1305 | rule undergo parsing and parse AST translations | |
| 1306 |   \secref{sec:tr-funs}, in order to perform some fundamental
 | |
| 1307 |   normalization like @{text "\<lambda>x y. b \<leadsto> \<lambda>x. \<lambda>y. b"}, but other AST
 | |
| 1308 |   translation rules are \emph{not} applied recursively here.
 | |
| 1309 | ||
| 1310 | When processing AST patterns, the inner syntax lexer runs in a | |
| 1311 | different mode that allows identifiers to start with underscore. | |
| 1312 | This accommodates the usual naming convention for auxiliary syntax | |
| 1313 | constants --- those that do not have a logical counter part --- by | |
| 1314 | allowing to specify arbitrary AST applications within the term | |
| 1315 | syntax, independently of the corresponding concrete syntax. | |
| 1316 | ||
| 1317 |   Atomic ASTs are distinguished as @{ML Ast.Constant} versus @{ML
 | |
| 1318 | Ast.Variable} as follows: a qualified name or syntax constant | |
| 1319 |   declared via @{command syntax}, or parse tree head of concrete
 | |
| 1320 |   notation becomes @{ML Ast.Constant}, anything else @{ML
 | |
| 1321 |   Ast.Variable}.  Note that @{text CONST} and @{text XCONST} within
 | |
| 1322 |   the term language (\secref{sec:pure-grammar}) allow to enforce
 | |
| 1323 | treatment as constants. | |
| 1324 | ||
| 1325 |   AST rewrite rules @{text "(lhs, rhs)"} need to obey the following
 | |
| 1326 | side-conditions: | |
| 1327 | ||
| 1328 |   \begin{itemize}
 | |
| 1329 | ||
| 1330 |   \item Rules must be left linear: @{text "lhs"} must not contain
 | |
| 1331 |   repeated variables.\footnote{The deeper reason for this is that AST
 | |
| 1332 | equality is not well-defined: different occurrences of the ``same'' | |
| 1333 | AST could be decorated differently by accidental type-constraints or | |
| 1334 | source position information, for example.} | |
| 1335 | ||
| 1336 |   \item Every variable in @{text "rhs"} must also occur in @{text
 | |
| 1337 | "lhs"}. | |
| 1338 | ||
| 1339 |   \end{itemize}
 | |
| 46282 | 1340 | |
| 28762 | 1341 |   \item @{command "no_translations"}~@{text rules} removes syntactic
 | 
| 1342 | translation rules, which are interpreted in the same manner as for | |
| 1343 |   @{command "translations"} above.
 | |
| 1344 | ||
| 48117 | 1345 |   \item @{attribute syntax_ast_trace} and @{attribute
 | 
| 1346 | syntax_ast_stats} control diagnostic output in the AST normalization | |
| 1347 | process, when translation rules are applied to concrete input or | |
| 1348 | output. | |
| 1349 | ||
| 28762 | 1350 |   \end{description}
 | 
| 46293 | 1351 | |
| 1352 | Raw syntax and translations provides a slightly more low-level | |
| 1353 | access to the grammar and the form of resulting parse trees. It is | |
| 1354 | often possible to avoid this untyped macro mechanism, and use | |
| 1355 |   type-safe @{command abbreviation} or @{command notation} instead.
 | |
| 1356 |   Some important situations where @{command syntax} and @{command
 | |
| 1357 | translations} are really need are as follows: | |
| 1358 | ||
| 1359 |   \begin{itemize}
 | |
| 1360 | ||
| 1361 |   \item Iterated replacement via recursive @{command translations}.
 | |
| 1362 |   For example, consider list enumeration @{term "[a, b, c, d]"} as
 | |
| 1363 |   defined in theory @{theory List} in Isabelle/HOL.
 | |
| 1364 | ||
| 1365 | \item Change of binding status of variables: anything beyond the | |
| 1366 |   built-in @{keyword "binder"} mixfix annotation requires explicit
 | |
| 1367 | syntax translations. For example, consider list filter | |
| 1368 |   comprehension @{term "[x \<leftarrow> xs . P]"} as defined in theory @{theory
 | |
| 1369 | List} in Isabelle/HOL. | |
| 1370 | ||
| 1371 |   \end{itemize}
 | |
| 28762 | 1372 | *} | 
| 1373 | ||
| 48117 | 1374 | subsubsection {* Applying translation rules *}
 | 
| 1375 | ||
| 1376 | text {* As a term is being parsed or printed, an AST is generated as
 | |
| 1377 |   an intermediate form according to \figref{fig:parse-print}.  The AST
 | |
| 1378 | is normalized by applying translation rules in the manner of a | |
| 1379 | first-order term rewriting system. We first examine how a single | |
| 1380 | rule is applied. | |
| 1381 | ||
| 1382 |   Let @{text "t"} be the abstract syntax tree to be normalized and
 | |
| 1383 |   @{text "(lhs, rhs)"} some translation rule.  A subtree @{text "u"}
 | |
| 1384 |   of @{text "t"} is called \emph{redex} if it is an instance of @{text
 | |
| 1385 |   "lhs"}; in this case the pattern @{text "lhs"} is said to match the
 | |
| 1386 |   object @{text "u"}.  A redex matched by @{text "lhs"} may be
 | |
| 1387 |   replaced by the corresponding instance of @{text "rhs"}, thus
 | |
| 1388 |   \emph{rewriting} the AST @{text "t"}.  Matching requires some notion
 | |
| 1389 |   of \emph{place-holders} in rule patterns: @{ML Ast.Variable} serves
 | |
| 1390 | this purpose. | |
| 1391 | ||
| 1392 |   More precisely, the matching of the object @{text "u"} against the
 | |
| 1393 |   pattern @{text "lhs"} is performed as follows:
 | |
| 1394 | ||
| 1395 |   \begin{itemize}
 | |
| 1396 | ||
| 1397 |   \item Objects of the form @{ML Ast.Variable}~@{text "x"} or @{ML
 | |
| 1398 |   Ast.Constant}~@{text "x"} are matched by pattern @{ML
 | |
| 1399 |   Ast.Constant}~@{text "x"}.  Thus all atomic ASTs in the object are
 | |
| 1400 | treated as (potential) constants, and a successful match makes them | |
| 1401 | actual constants even before name space resolution (see also | |
| 1402 |   \secref{sec:ast}).
 | |
| 1403 | ||
| 1404 |   \item Object @{text "u"} is matched by pattern @{ML
 | |
| 1405 |   Ast.Variable}~@{text "x"}, binding @{text "x"} to @{text "u"}.
 | |
| 1406 | ||
| 1407 |   \item Object @{ML Ast.Appl}~@{text "us"} is matched by @{ML
 | |
| 1408 |   Ast.Appl}~@{text "ts"} if @{text "us"} and @{text "ts"} have the
 | |
| 1409 | same length and each corresponding subtree matches. | |
| 1410 | ||
| 1411 | \item In every other case, matching fails. | |
| 1412 | ||
| 1413 |   \end{itemize}
 | |
| 1414 | ||
| 1415 |   A successful match yields a substitution that is applied to @{text
 | |
| 1416 |   "rhs"}, generating the instance that replaces @{text "u"}.
 | |
| 1417 | ||
| 1418 | Normalizing an AST involves repeatedly applying translation rules | |
| 1419 | until none are applicable. This works yoyo-like: top-down, | |
| 1420 | bottom-up, top-down, etc. At each subtree position, rules are | |
| 1421 | chosen in order of appearance in the theory definitions. | |
| 1422 | ||
| 1423 |   The configuration options @{attribute syntax_ast_trace} and
 | |
| 48816 | 1424 |   @{attribute syntax_ast_stats} might help to understand this process
 | 
| 48117 | 1425 | and diagnose problems. | 
| 1426 | ||
| 1427 |   \begin{warn}
 | |
| 1428 | If syntax translation rules work incorrectly, the output of | |
| 48118 | 1429 |   @{command_ref print_syntax} with its \emph{rules} sections reveals the
 | 
| 48117 | 1430 | actual internal forms of AST pattern, without potentially confusing | 
| 1431 | concrete syntax. Recall that AST constants appear as quoted strings | |
| 1432 | and variables without quotes. | |
| 1433 |   \end{warn}
 | |
| 1434 | ||
| 1435 |   \begin{warn}
 | |
| 1436 |   If @{attribute_ref eta_contract} is set to @{text "true"}, terms
 | |
| 1437 |   will be @{text "\<eta>"}-contracted \emph{before} the AST rewriter sees
 | |
| 1438 | them. Thus some abstraction nodes needed for print rules to match | |
| 1439 |   may vanish.  For example, @{text "Ball A (\<lambda>x. P x)"} would contract
 | |
| 1440 |   to @{text "Ball A P"} and the standard print rule would fail to
 | |
| 1441 | apply. This problem can be avoided by hand-written ML translation | |
| 1442 |   functions (see also \secref{sec:tr-funs}), which is in fact the same
 | |
| 1443 |   mechanism used in built-in @{keyword "binder"} declarations.
 | |
| 1444 |   \end{warn}
 | |
| 1445 | *} | |
| 1446 | ||
| 28762 | 1447 | |
| 48113 | 1448 | subsection {* Syntax translation functions \label{sec:tr-funs} *}
 | 
| 28762 | 1449 | |
| 1450 | text {*
 | |
| 1451 |   \begin{matharray}{rcl}
 | |
| 1452 |     @{command_def "parse_ast_translation"} & : & @{text "theory \<rightarrow> theory"} \\
 | |
| 1453 |     @{command_def "parse_translation"} & : & @{text "theory \<rightarrow> theory"} \\
 | |
| 1454 |     @{command_def "print_translation"} & : & @{text "theory \<rightarrow> theory"} \\
 | |
| 1455 |     @{command_def "typed_print_translation"} & : & @{text "theory \<rightarrow> theory"} \\
 | |
| 1456 |     @{command_def "print_ast_translation"} & : & @{text "theory \<rightarrow> theory"} \\
 | |
| 56186 | 1457 |     @{ML_antiquotation_def "class_syntax"} & : & @{text "ML antiquotation"} \\
 | 
| 1458 |     @{ML_antiquotation_def "type_syntax"} & : & @{text "ML antiquotation"} \\
 | |
| 1459 |     @{ML_antiquotation_def "const_syntax"} & : & @{text "ML antiquotation"} \\
 | |
| 1460 |     @{ML_antiquotation_def "syntax_const"} & : & @{text "ML antiquotation"} \\
 | |
| 28762 | 1461 |   \end{matharray}
 | 
| 1462 | ||
| 48118 | 1463 | Syntax translation functions written in ML admit almost arbitrary | 
| 1464 | manipulations of inner syntax, at the expense of some complexity and | |
| 1465 | obscurity in the implementation. | |
| 1466 | ||
| 55112 
b1a5d603fd12
prefer rail cartouche -- avoid back-slashed quotes;
 wenzelm parents: 
55108diff
changeset | 1467 |   @{rail \<open>
 | 
| 42596 
6c621a9d612a
modernized rail diagrams using @{rail} antiquotation;
 wenzelm parents: 
42358diff
changeset | 1468 |   ( @@{command parse_ast_translation} | @@{command parse_translation} |
 | 
| 
6c621a9d612a
modernized rail diagrams using @{rail} antiquotation;
 wenzelm parents: 
42358diff
changeset | 1469 |     @@{command print_translation} | @@{command typed_print_translation} |
 | 
| 52143 | 1470 |     @@{command print_ast_translation}) @{syntax text}
 | 
| 48119 
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
 wenzelm parents: 
48118diff
changeset | 1471 | ; | 
| 
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
 wenzelm parents: 
48118diff
changeset | 1472 |   (@@{ML_antiquotation class_syntax} |
 | 
| 
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
 wenzelm parents: 
48118diff
changeset | 1473 |    @@{ML_antiquotation type_syntax} |
 | 
| 
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
 wenzelm parents: 
48118diff
changeset | 1474 |    @@{ML_antiquotation const_syntax} |
 | 
| 
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
 wenzelm parents: 
48118diff
changeset | 1475 |    @@{ML_antiquotation syntax_const}) name
 | 
| 55112 
b1a5d603fd12
prefer rail cartouche -- avoid back-slashed quotes;
 wenzelm parents: 
55108diff
changeset | 1476 | \<close>} | 
| 28762 | 1477 | |
| 48119 
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
 wenzelm parents: 
48118diff
changeset | 1478 |   \begin{description}
 | 
| 
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
 wenzelm parents: 
48118diff
changeset | 1479 | |
| 
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
 wenzelm parents: 
48118diff
changeset | 1480 |   \item @{command parse_translation} etc. declare syntax translation
 | 
| 
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
 wenzelm parents: 
48118diff
changeset | 1481 | functions to the theory. Any of these commands have a single | 
| 
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
 wenzelm parents: 
48118diff
changeset | 1482 |   @{syntax text} argument that refers to an ML expression of
 | 
| 52413 | 1483 | appropriate type as follows: | 
| 48118 | 1484 | |
| 1485 | \medskip | |
| 48119 
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
 wenzelm parents: 
48118diff
changeset | 1486 |   {\footnotesize
 | 
| 52143 | 1487 |   \begin{tabular}{l}
 | 
| 1488 |   @{command parse_ast_translation} : \\
 | |
| 1489 |   \quad @{ML_type "(string * (Proof.context -> Ast.ast list -> Ast.ast)) list"} \\
 | |
| 1490 |   @{command parse_translation} : \\
 | |
| 1491 |   \quad @{ML_type "(string * (Proof.context -> term list -> term)) list"} \\
 | |
| 1492 |   @{command print_translation} : \\
 | |
| 1493 |   \quad @{ML_type "(string * (Proof.context -> term list -> term)) list"} \\
 | |
| 1494 |   @{command typed_print_translation} : \\
 | |
| 1495 |   \quad @{ML_type "(string * (Proof.context -> typ -> term list -> term)) list"} \\
 | |
| 1496 |   @{command print_ast_translation} : \\
 | |
| 1497 |   \quad @{ML_type "(string * (Proof.context -> Ast.ast list -> Ast.ast)) list"} \\
 | |
| 48118 | 1498 |   \end{tabular}}
 | 
| 1499 | \medskip | |
| 28762 | 1500 | |
| 48816 | 1501 |   The argument list consists of @{text "(c, tr)"} pairs, where @{text
 | 
| 1502 |   "c"} is the syntax name of the formal entity involved, and @{text
 | |
| 1503 |   "tr"} a function that translates a syntax form @{text "c args"} into
 | |
| 52413 | 1504 |   @{text "tr ctxt args"} (depending on the context).  The Isabelle/ML
 | 
| 1505 |   naming convention for parse translations is @{text "c_tr"} and for
 | |
| 1506 |   print translations @{text "c_tr'"}.
 | |
| 48118 | 1507 | |
| 1508 |   The @{command_ref print_syntax} command displays the sets of names
 | |
| 1509 |   associated with the translation functions of a theory under @{text
 | |
| 1510 | "parse_ast_translation"} etc. | |
| 1511 | ||
| 48119 
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
 wenzelm parents: 
48118diff
changeset | 1512 |   \item @{text "@{class_syntax c}"}, @{text "@{type_syntax c}"},
 | 
| 
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
 wenzelm parents: 
48118diff
changeset | 1513 |   @{text "@{const_syntax c}"} inline the authentic syntax name of the
 | 
| 
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
 wenzelm parents: 
48118diff
changeset | 1514 | given formal entities into the ML source. This is the | 
| 
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
 wenzelm parents: 
48118diff
changeset | 1515 | fully-qualified logical name prefixed by a special marker to | 
| 
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
 wenzelm parents: 
48118diff
changeset | 1516 | indicate its kind: thus different logical name spaces are properly | 
| 
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
 wenzelm parents: 
48118diff
changeset | 1517 | distinguished within parse trees. | 
| 
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
 wenzelm parents: 
48118diff
changeset | 1518 | |
| 
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
 wenzelm parents: 
48118diff
changeset | 1519 |   \item @{text "@{const_syntax c}"} inlines the name @{text "c"} of
 | 
| 
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
 wenzelm parents: 
48118diff
changeset | 1520 | the given syntax constant, having checked that it has been declared | 
| 
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
 wenzelm parents: 
48118diff
changeset | 1521 |   via some @{command syntax} commands within the theory context.  Note
 | 
| 
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
 wenzelm parents: 
48118diff
changeset | 1522 | that the usual naming convention makes syntax constants start with | 
| 
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
 wenzelm parents: 
48118diff
changeset | 1523 | underscore, to reduce the chance of accidental clashes with other | 
| 
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
 wenzelm parents: 
48118diff
changeset | 1524 | names occurring in parse trees (unqualified constants etc.). | 
| 
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
 wenzelm parents: 
48118diff
changeset | 1525 | |
| 
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
 wenzelm parents: 
48118diff
changeset | 1526 |   \end{description}
 | 
| 48118 | 1527 | *} | 
| 1528 | ||
| 48119 
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
 wenzelm parents: 
48118diff
changeset | 1529 | |
| 48118 | 1530 | subsubsection {* The translation strategy *}
 | 
| 28762 | 1531 | |
| 48816 | 1532 | text {* The different kinds of translation functions are invoked during
 | 
| 48118 | 1533 | the transformations between parse trees, ASTs and syntactic terms | 
| 1534 |   (cf.\ \figref{fig:parse-print}).  Whenever a combination of the form
 | |
| 1535 |   @{text "c x\<^sub>1 \<dots> x\<^sub>n"} is encountered, and a translation function
 | |
| 1536 |   @{text "f"} of appropriate kind is declared for @{text "c"}, the
 | |
| 1537 |   result is produced by evaluation of @{text "f [x\<^sub>1, \<dots>, x\<^sub>n]"} in ML.
 | |
| 1538 | ||
| 1539 |   For AST translations, the arguments @{text "x\<^sub>1, \<dots>, x\<^sub>n"} are ASTs.  A
 | |
| 1540 |   combination has the form @{ML "Ast.Constant"}~@{text "c"} or @{ML
 | |
| 1541 |   "Ast.Appl"}~@{text "["}@{ML Ast.Constant}~@{text "c, x\<^sub>1, \<dots>, x\<^sub>n]"}.
 | |
| 1542 | For term translations, the arguments are terms and a combination has | |
| 1543 |   the form @{ML Const}~@{text "(c, \<tau>)"} or @{ML Const}~@{text "(c, \<tau>)
 | |
| 1544 | $ x\<^sub>1 $ \<dots> $ x\<^sub>n"}. Terms allow more sophisticated transformations | |
| 1545 | than ASTs do, typically involving abstractions and bound | |
| 1546 |   variables. \emph{Typed} print translations may even peek at the type
 | |
| 52413 | 1547 |   @{text "\<tau>"} of the constant they are invoked on, although some
 | 
| 1548 | information might have been suppressed for term output already. | |
| 48118 | 1549 | |
| 1550 | Regardless of whether they act on ASTs or terms, translation | |
| 1551 | functions called during the parsing process differ from those for | |
| 1552 | printing in their overall behaviour: | |
| 1553 | ||
| 1554 |   \begin{description}
 | |
| 28762 | 1555 | |
| 48118 | 1556 | \item [Parse translations] are applied bottom-up. The arguments are | 
| 1557 | already in translated form. The translations must not fail; | |
| 1558 | exceptions trigger an error message. There may be at most one | |
| 1559 | function associated with any syntactic name. | |
| 46294 | 1560 | |
| 48118 | 1561 | \item [Print translations] are applied top-down. They are supplied | 
| 1562 | with arguments that are partly still in internal form. The result | |
| 1563 | again undergoes translation; therefore a print translation should | |
| 1564 | not introduce as head the very constant that invoked it. The | |
| 1565 |   function may raise exception @{ML Match} to indicate failure; in
 | |
| 1566 | this event it has no effect. Multiple functions associated with | |
| 1567 | some syntactic name are tried in the order of declaration in the | |
| 1568 | theory. | |
| 1569 | ||
| 1570 |   \end{description}
 | |
| 1571 | ||
| 1572 |   Only constant atoms --- constructor @{ML Ast.Constant} for ASTs and
 | |
| 1573 |   @{ML Const} for terms --- can invoke translation functions.  This
 | |
| 1574 | means that parse translations can only be associated with parse tree | |
| 1575 | heads of concrete syntax, or syntactic constants introduced via | |
| 1576 | other translations. For plain identifiers within the term language, | |
| 1577 | the status of constant versus variable is not yet know during | |
| 1578 | parsing. This is in contrast to print translations, where constants | |
| 1579 | are explicitly known from the given term in its fully internal form. | |
| 28762 | 1580 | *} | 
| 1581 | ||
| 52414 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1582 | |
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1583 | subsection {* Built-in syntax transformations *}
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1584 | |
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1585 | text {*
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1586 | Here are some further details of the main syntax transformation | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1587 |   phases of \figref{fig:parse-print}.
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1588 | *} | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1589 | |
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1590 | |
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1591 | subsubsection {* Transforming parse trees to ASTs *}
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1592 | |
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1593 | text {* The parse tree is the raw output of the parser.  It is
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1594 | transformed into an AST according to some basic scheme that may be | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1595 | augmented by AST translation functions as explained in | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1596 |   \secref{sec:tr-funs}.
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1597 | |
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1598 | The parse tree is constructed by nesting the right-hand sides of the | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1599 | productions used to recognize the input. Such parse trees are | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1600 | simply lists of tokens and constituent parse trees, the latter | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1601 | representing the nonterminals of the productions. Ignoring AST | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1602 | translation functions, parse trees are transformed to ASTs by | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1603 | stripping out delimiters and copy productions, while retaining some | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1604 | source position information from input tokens. | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1605 | |
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1606 | The Pure syntax provides predefined AST translations to make the | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1607 |   basic @{text "\<lambda>"}-term structure more apparent within the
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1608 | (first-order) AST representation, and thus facilitate the use of | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1609 |   @{command translations} (see also \secref{sec:syn-trans}).  This
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1610 | covers ordinary term application, type application, nested | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1611 | abstraction, iterated meta implications and function types. The | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1612 | effect is illustrated on some representative input strings is as | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1613 | follows: | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1614 | |
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1615 |   \begin{center}
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1616 |   \begin{tabular}{ll}
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1617 | input source & AST \\ | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1618 | \hline | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1619 |   @{text "f x y z"} & @{verbatim "(f x y z)"} \\
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1620 |   @{text "'a ty"} & @{verbatim "(ty 'a)"} \\
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1621 |   @{text "('a, 'b)ty"} & @{verbatim "(ty 'a 'b)"} \\
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1622 |   @{text "\<lambda>x y z. t"} & @{verbatim "(\"_abs\" x (\"_abs\" y (\"_abs\" z t)))"} \\
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1623 |   @{text "\<lambda>x :: 'a. t"} & @{verbatim "(\"_abs\" (\"_constrain\" x 'a) t)"} \\
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1624 |   @{text "\<lbrakk>P; Q; R\<rbrakk> \<Longrightarrow> S"} & @{verbatim "(\"==>\" P (\"==>\" Q (\"==>\" R S)))"} \\
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1625 |    @{text "['a, 'b, 'c] \<Rightarrow> 'd"} & @{verbatim "(\"fun\" 'a (\"fun\" 'b (\"fun\" 'c 'd)))"} \\
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1626 |   \end{tabular}
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1627 |   \end{center}
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1628 | |
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1629 | Note that type and sort constraints may occur in further places --- | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1630 | translations need to be ready to cope with them. The built-in | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1631 | syntax transformation from parse trees to ASTs insert additional | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1632 | constraints that represent source positions. | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1633 | *} | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1634 | |
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1635 | |
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1636 | subsubsection {* Transforming ASTs to terms *}
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1637 | |
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1638 | text {* After application of macros (\secref{sec:syn-trans}), the AST
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1639 | is transformed into a term. This term still lacks proper type | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1640 | information, but it might contain some constraints consisting of | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1641 |   applications with head @{verbatim "_constrain"}, where the second
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1642 | argument is a type encoded as a pre-term within the syntax. Type | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1643 | inference later introduces correct types, or indicates type errors | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1644 | in the input. | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1645 | |
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1646 | Ignoring parse translations, ASTs are transformed to terms by | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1647 | mapping AST constants to term constants, AST variables to term | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1648 | variables or constants (according to the name space), and AST | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1649 | applications to iterated term applications. | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1650 | |
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1651 | The outcome is still a first-order term. Proper abstractions and | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1652 | bound variables are introduced by parse translations associated with | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1653 |   certain syntax constants.  Thus @{verbatim "(_abs x x)"} eventually
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1654 |   becomes a de-Bruijn term @{verbatim "Abs (\"x\", _, Bound 0)"}.
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1655 | *} | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1656 | |
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1657 | |
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1658 | subsubsection {* Printing of terms *}
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1659 | |
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1660 | text {* The output phase is essentially the inverse of the input
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1661 | phase. Terms are translated via abstract syntax trees into | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1662 | pretty-printed text. | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1663 | |
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1664 | Ignoring print translations, the transformation maps term constants, | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1665 | variables and applications to the corresponding constructs on ASTs. | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1666 | Abstractions are mapped to applications of the special constant | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1667 |   @{verbatim "_abs"} as seen before.  Type constraints are represented
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1668 |   via special @{verbatim "_constrain"} forms, according to various
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1669 | policies of type annotation determined elsewhere. Sort constraints | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1670 | of type variables are handled in a similar fashion. | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1671 | |
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1672 |   After application of macros (\secref{sec:syn-trans}), the AST is
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1673 | finally pretty-printed. The built-in print AST translations reverse | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1674 | the corresponding parse AST translations. | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1675 | |
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1676 | \medskip For the actual printing process, the priority grammar | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1677 |   (\secref{sec:priority-grammar}) plays a vital role: productions are
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1678 | used as templates for pretty printing, with argument slots stemming | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1679 | from nonterminals, and syntactic sugar stemming from literal tokens. | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1680 | |
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1681 |   Each AST application with constant head @{text "c"} and arguments
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1682 |   @{text "t\<^sub>1"}, \dots, @{text "t\<^sub>n"} (for @{text "n = 0"} the AST is
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1683 |   just the constant @{text "c"} itself) is printed according to the
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1684 |   first grammar production of result name @{text "c"}.  The required
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1685 | syntax priority of the argument slot is given by its nonterminal | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1686 |   @{text "A\<^sup>(\<^sup>p\<^sup>)"}.  The argument @{text "t\<^sub>i"} that corresponds to the
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1687 |   position of @{text "A\<^sup>(\<^sup>p\<^sup>)"} is printed recursively, and then put in
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1688 |   parentheses \emph{if} its priority @{text "p"} requires this.  The
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1689 | resulting output is concatenated with the syntactic sugar according | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1690 | to the grammar production. | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1691 | |
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1692 |   If an AST application @{text "(c x\<^sub>1 \<dots> x\<^sub>m)"} has more arguments than
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1693 |   the corresponding production, it is first split into @{text "((c x\<^sub>1
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1694 | \<dots> x\<^sub>n) x\<^sub>n\<^sub>+\<^sub>1 \<dots> x\<^sub>m)"} and then printed recursively as above. | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1695 | |
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1696 | Applications with too few arguments or with non-constant head or | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1697 | without a corresponding production are printed in prefix-form like | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1698 |   @{text "f t\<^sub>1 \<dots> t\<^sub>n"} for terms.
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1699 | |
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1700 |   Multiple productions associated with some name @{text "c"} are tried
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1701 | in order of appearance within the grammar. An occurrence of some | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1702 |   AST variable @{text "x"} is printed as @{text "x"} outright.
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1703 | |
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1704 |   \medskip White space is \emph{not} inserted automatically.  If
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1705 | blanks (or breaks) are required to separate tokens, they need to be | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1706 |   specified in the mixfix declaration (\secref{sec:mixfix}).
 | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1707 | *} | 
| 
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
 wenzelm parents: 
52413diff
changeset | 1708 | |
| 28762 | 1709 | end |