| author | wenzelm |
| Wed, 30 Dec 2015 17:18:32 +0100 | |
| changeset 61978 | 7ab2dc7ba8f8 |
| parent 61656 | cfabbc083977 |
| child 61997 | 4d9518c3d031 |
| permissions | -rw-r--r-- |
| 61656 | 1 |
(*:maxLineLen=78:*) |
2 |
||
| 28762 | 3 |
theory Inner_Syntax |
| 42651 | 4 |
imports Base Main |
| 28762 | 5 |
begin |
6 |
||
| 58618 | 7 |
chapter \<open>Inner syntax --- the term language \label{ch:inner-syntax}\<close>
|
| 28762 | 8 |
|
| 58618 | 9 |
text \<open>The inner syntax of Isabelle provides concrete notation for |
| 61493 | 10 |
the main entities of the logical framework, notably \<open>\<lambda>\<close>-terms with types and type classes. Applications may either |
| 46282 | 11 |
extend existing syntactic categories by additional notation, or |
12 |
define new sub-languages that are linked to the standard term |
|
| 61503 | 13 |
language via some explicit markers. For example \<^verbatim>\<open>FOO\<close>~\<open>foo\<close> could |
14 |
embed the syntax corresponding for some |
|
| 61493 | 15 |
user-defined nonterminal \<open>foo\<close> --- within the bounds of the |
| 46282 | 16 |
given lexical syntax of Isabelle/Pure. |
17 |
||
18 |
The most basic way to specify concrete syntax for logical entities |
|
19 |
works via mixfix annotations (\secref{sec:mixfix}), which may be
|
|
20 |
usually given as part of the original declaration or via explicit |
|
21 |
notation commands later on (\secref{sec:notation}). This already
|
|
22 |
covers many needs of concrete syntax without having to understand |
|
23 |
the full complexity of inner syntax layers. |
|
24 |
||
25 |
Further details of the syntax engine involves the classical |
|
26 |
distinction of lexical language versus context-free grammar (see |
|
| 61477 | 27 |
\secref{sec:pure-syntax}), and various mechanisms for \<^emph>\<open>syntax
|
28 |
transformations\<close> (see \secref{sec:syntax-transformations}).
|
|
| 58618 | 29 |
\<close> |
| 46282 | 30 |
|
31 |
||
| 58618 | 32 |
section \<open>Printing logical entities\<close> |
| 28762 | 33 |
|
| 58618 | 34 |
subsection \<open>Diagnostic commands \label{sec:print-diag}\<close>
|
| 28762 | 35 |
|
| 58618 | 36 |
text \<open> |
| 28762 | 37 |
\begin{matharray}{rcl}
|
| 61493 | 38 |
@{command_def "typ"}\<open>\<^sup>*\<close> & : & \<open>context \<rightarrow>\<close> \\
|
39 |
@{command_def "term"}\<open>\<^sup>*\<close> & : & \<open>context \<rightarrow>\<close> \\
|
|
40 |
@{command_def "prop"}\<open>\<^sup>*\<close> & : & \<open>context \<rightarrow>\<close> \\
|
|
41 |
@{command_def "thm"}\<open>\<^sup>*\<close> & : & \<open>context \<rightarrow>\<close> \\
|
|
42 |
@{command_def "prf"}\<open>\<^sup>*\<close> & : & \<open>context \<rightarrow>\<close> \\
|
|
43 |
@{command_def "full_prf"}\<open>\<^sup>*\<close> & : & \<open>context \<rightarrow>\<close> \\
|
|
44 |
@{command_def "print_state"}\<open>\<^sup>*\<close> & : & \<open>any \<rightarrow>\<close> \\
|
|
| 28762 | 45 |
\end{matharray}
|
46 |
||
47 |
These diagnostic commands assist interactive development by printing |
|
48 |
internal logical entities in a human-readable fashion. |
|
49 |
||
|
55112
b1a5d603fd12
prefer rail cartouche -- avoid back-slashed quotes;
wenzelm
parents:
55108
diff
changeset
|
50 |
@{rail \<open>
|
| 48792 | 51 |
@@{command typ} @{syntax modes}? @{syntax type} ('::' @{syntax sort})?
|
| 28762 | 52 |
; |
|
42596
6c621a9d612a
modernized rail diagrams using @{rail} antiquotation;
wenzelm
parents:
42358
diff
changeset
|
53 |
@@{command term} @{syntax modes}? @{syntax term}
|
| 28762 | 54 |
; |
|
42596
6c621a9d612a
modernized rail diagrams using @{rail} antiquotation;
wenzelm
parents:
42358
diff
changeset
|
55 |
@@{command prop} @{syntax modes}? @{syntax prop}
|
| 28762 | 56 |
; |
|
42596
6c621a9d612a
modernized rail diagrams using @{rail} antiquotation;
wenzelm
parents:
42358
diff
changeset
|
57 |
@@{command thm} @{syntax modes}? @{syntax thmrefs}
|
| 28762 | 58 |
; |
|
42596
6c621a9d612a
modernized rail diagrams using @{rail} antiquotation;
wenzelm
parents:
42358
diff
changeset
|
59 |
( @@{command prf} | @@{command full_prf} ) @{syntax modes}? @{syntax thmrefs}?
|
| 28762 | 60 |
; |
| 52430 | 61 |
@@{command print_state} @{syntax modes}?
|
| 28762 | 62 |
; |
|
42596
6c621a9d612a
modernized rail diagrams using @{rail} antiquotation;
wenzelm
parents:
42358
diff
changeset
|
63 |
@{syntax_def modes}: '(' (@{syntax name} + ) ')'
|
|
55112
b1a5d603fd12
prefer rail cartouche -- avoid back-slashed quotes;
wenzelm
parents:
55108
diff
changeset
|
64 |
\<close>} |
| 28762 | 65 |
|
| 61493 | 66 |
\<^descr> @{command "typ"}~\<open>\<tau>\<close> reads and prints a type expression
|
| 48792 | 67 |
according to the current context. |
68 |
||
| 61493 | 69 |
\<^descr> @{command "typ"}~\<open>\<tau> :: s\<close> uses type-inference to
|
70 |
determine the most general way to make \<open>\<tau>\<close> conform to sort |
|
71 |
\<open>s\<close>. For concrete \<open>\<tau>\<close> this checks if the type |
|
72 |
belongs to that sort. Dummy type parameters ``\<open>_\<close>'' |
|
| 48792 | 73 |
(underscore) are assigned to fresh type variables with most general |
74 |
sorts, according the the principles of type-inference. |
|
|
28766
accab7594b8e
misc tuning and rearrangement of section "Printing logical entities";
wenzelm
parents:
28765
diff
changeset
|
75 |
|
| 61493 | 76 |
\<^descr> @{command "term"}~\<open>t\<close> and @{command "prop"}~\<open>\<phi>\<close>
|
|
28766
accab7594b8e
misc tuning and rearrangement of section "Printing logical entities";
wenzelm
parents:
28765
diff
changeset
|
77 |
read, type-check and print terms or propositions according to the |
| 61493 | 78 |
current theory or proof context; the inferred type of \<open>t\<close> is |
|
28766
accab7594b8e
misc tuning and rearrangement of section "Printing logical entities";
wenzelm
parents:
28765
diff
changeset
|
79 |
output as well. Note that these commands are also useful in |
|
accab7594b8e
misc tuning and rearrangement of section "Printing logical entities";
wenzelm
parents:
28765
diff
changeset
|
80 |
inspecting the current environment of term abbreviations. |
| 28762 | 81 |
|
| 61493 | 82 |
\<^descr> @{command "thm"}~\<open>a\<^sub>1 \<dots> a\<^sub>n\<close> retrieves
|
| 28762 | 83 |
theorems from the current theory or proof context. Note that any |
84 |
attributes included in the theorem specifications are applied to a |
|
85 |
temporary context derived from the current theory or proof; the |
|
| 61493 | 86 |
result is discarded, i.e.\ attributes involved in \<open>a\<^sub>1, |
87 |
\<dots>, a\<^sub>n\<close> do not have any permanent effect. |
|
| 28762 | 88 |
|
| 61439 | 89 |
\<^descr> @{command "prf"} displays the (compact) proof term of the
|
| 28762 | 90 |
current proof state (if present), or of the given theorems. Note |
91 |
that this requires proof terms to be switched on for the current |
|
92 |
object logic (see the ``Proof terms'' section of the Isabelle |
|
93 |
reference manual for information on how to do this). |
|
94 |
||
| 61439 | 95 |
\<^descr> @{command "full_prf"} is like @{command "prf"}, but displays
|
| 28762 | 96 |
the full proof term, i.e.\ also displays information omitted in the |
| 61493 | 97 |
compact proof term, which is denoted by ``\<open>_\<close>'' placeholders |
| 28762 | 98 |
there. |
99 |
||
| 61439 | 100 |
\<^descr> @{command "print_state"} prints the current proof state (if
|
| 52430 | 101 |
present), including current facts and goals. |
|
28766
accab7594b8e
misc tuning and rearrangement of section "Printing logical entities";
wenzelm
parents:
28765
diff
changeset
|
102 |
|
| 28762 | 103 |
|
| 61493 | 104 |
All of the diagnostic commands above admit a list of \<open>modes\<close> |
| 42926 | 105 |
to be specified, which is appended to the current print mode; see |
| 46284 | 106 |
also \secref{sec:print-modes}. Thus the output behavior may be
|
107 |
modified according particular print mode features. For example, |
|
| 61493 | 108 |
@{command "print_state"}~\<open>(latex xsymbols)\<close> prints the
|
| 52430 | 109 |
current proof state with mathematical symbols and special characters |
| 46284 | 110 |
represented in {\LaTeX} source, according to the Isabelle style
|
| 60270 | 111 |
@{cite "isabelle-system"}.
|
| 28762 | 112 |
|
113 |
Note that antiquotations (cf.\ \secref{sec:antiq}) provide a more
|
|
114 |
systematic way to include formal items into the printed text |
|
115 |
document. |
|
| 58618 | 116 |
\<close> |
| 28762 | 117 |
|
118 |
||
| 58618 | 119 |
subsection \<open>Details of printed content\<close> |
|
28763
b5e6122ff575
added pretty printing options (from old ref manual);
wenzelm
parents:
28762
diff
changeset
|
120 |
|
| 58618 | 121 |
text \<open> |
| 42655 | 122 |
\begin{tabular}{rcll}
|
| 61493 | 123 |
@{attribute_def show_markup} & : & \<open>attribute\<close> \\
|
124 |
@{attribute_def show_types} & : & \<open>attribute\<close> & default \<open>false\<close> \\
|
|
125 |
@{attribute_def show_sorts} & : & \<open>attribute\<close> & default \<open>false\<close> \\
|
|
126 |
@{attribute_def show_consts} & : & \<open>attribute\<close> & default \<open>false\<close> \\
|
|
127 |
@{attribute_def show_abbrevs} & : & \<open>attribute\<close> & default \<open>true\<close> \\
|
|
128 |
@{attribute_def show_brackets} & : & \<open>attribute\<close> & default \<open>false\<close> \\
|
|
129 |
@{attribute_def names_long} & : & \<open>attribute\<close> & default \<open>false\<close> \\
|
|
130 |
@{attribute_def names_short} & : & \<open>attribute\<close> & default \<open>false\<close> \\
|
|
131 |
@{attribute_def names_unique} & : & \<open>attribute\<close> & default \<open>true\<close> \\
|
|
132 |
@{attribute_def eta_contract} & : & \<open>attribute\<close> & default \<open>true\<close> \\
|
|
133 |
@{attribute_def goals_limit} & : & \<open>attribute\<close> & default \<open>10\<close> \\
|
|
134 |
@{attribute_def show_main_goal} & : & \<open>attribute\<close> & default \<open>false\<close> \\
|
|
135 |
@{attribute_def show_hyps} & : & \<open>attribute\<close> & default \<open>false\<close> \\
|
|
136 |
@{attribute_def show_tags} & : & \<open>attribute\<close> & default \<open>false\<close> \\
|
|
137 |
@{attribute_def show_question_marks} & : & \<open>attribute\<close> & default \<open>true\<close> \\
|
|
| 42655 | 138 |
\end{tabular}
|
| 61421 | 139 |
\<^medskip> |
|
28763
b5e6122ff575
added pretty printing options (from old ref manual);
wenzelm
parents:
28762
diff
changeset
|
140 |
|
| 42655 | 141 |
These configuration options control the detail of information that |
142 |
is displayed for types, terms, theorems, goals etc. See also |
|
143 |
\secref{sec:config}.
|
|
|
28765
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
wenzelm
parents:
28763
diff
changeset
|
144 |
|
| 61439 | 145 |
\<^descr> @{attribute show_markup} controls direct inlining of markup
|
| 49699 | 146 |
into the printed representation of formal entities --- notably type |
147 |
and sort constraints. This enables Prover IDE users to retrieve |
|
148 |
that information via tooltips or popups while hovering with the |
|
149 |
mouse over the output window, for example. Consequently, this |
|
| 58842 | 150 |
option is enabled by default for Isabelle/jEdit. |
| 49699 | 151 |
|
| 61439 | 152 |
\<^descr> @{attribute show_types} and @{attribute show_sorts} control
|
| 42655 | 153 |
printing of type constraints for term variables, and sort |
154 |
constraints for type variables. By default, neither of these are |
|
155 |
shown in output. If @{attribute show_sorts} is enabled, types are
|
|
| 49699 | 156 |
always shown as well. In Isabelle/jEdit, manual setting of these |
157 |
options is normally not required thanks to @{attribute show_markup}
|
|
158 |
above. |
|
|
28763
b5e6122ff575
added pretty printing options (from old ref manual);
wenzelm
parents:
28762
diff
changeset
|
159 |
|
|
b5e6122ff575
added pretty printing options (from old ref manual);
wenzelm
parents:
28762
diff
changeset
|
160 |
Note that displaying types and sorts may explain why a polymorphic |
|
b5e6122ff575
added pretty printing options (from old ref manual);
wenzelm
parents:
28762
diff
changeset
|
161 |
inference rule fails to resolve with some goal, or why a rewrite |
|
b5e6122ff575
added pretty printing options (from old ref manual);
wenzelm
parents:
28762
diff
changeset
|
162 |
rule does not apply as expected. |
|
b5e6122ff575
added pretty printing options (from old ref manual);
wenzelm
parents:
28762
diff
changeset
|
163 |
|
| 61439 | 164 |
\<^descr> @{attribute show_consts} controls printing of types of
|
| 42655 | 165 |
constants when displaying a goal state. |
|
28765
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
wenzelm
parents:
28763
diff
changeset
|
166 |
|
|
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
wenzelm
parents:
28763
diff
changeset
|
167 |
Note that the output can be enormous, because polymorphic constants |
|
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
wenzelm
parents:
28763
diff
changeset
|
168 |
often occur at several different type instances. |
|
28763
b5e6122ff575
added pretty printing options (from old ref manual);
wenzelm
parents:
28762
diff
changeset
|
169 |
|
| 61439 | 170 |
\<^descr> @{attribute show_abbrevs} controls folding of constant
|
| 42655 | 171 |
abbreviations. |
|
40879
ca132ef44944
configuration option "show_abbrevs" supersedes print mode "no_abbrevs", with inverted meaning;
wenzelm
parents:
40255
diff
changeset
|
172 |
|
| 61439 | 173 |
\<^descr> @{attribute show_brackets} controls bracketing in pretty
|
| 42655 | 174 |
printed output. If enabled, all sub-expressions of the pretty |
|
28765
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
wenzelm
parents:
28763
diff
changeset
|
175 |
printing tree will be parenthesized, even if this produces malformed |
|
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
wenzelm
parents:
28763
diff
changeset
|
176 |
term syntax! This crude way of showing the internal structure of |
|
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
wenzelm
parents:
28763
diff
changeset
|
177 |
pretty printed entities may occasionally help to diagnose problems |
|
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
wenzelm
parents:
28763
diff
changeset
|
178 |
with operator priorities, for example. |
|
28763
b5e6122ff575
added pretty printing options (from old ref manual);
wenzelm
parents:
28762
diff
changeset
|
179 |
|
| 61439 | 180 |
\<^descr> @{attribute names_long}, @{attribute names_short}, and
|
|
42669
04dfffda5671
more conventional naming scheme: names_long, names_short, names_unique;
wenzelm
parents:
42655
diff
changeset
|
181 |
@{attribute names_unique} control the way of printing fully
|
|
42358
b47d41d9f4b5
Name_Space: proper configuration options long_names, short_names, unique_names instead of former unsynchronized references;
wenzelm
parents:
42279
diff
changeset
|
182 |
qualified internal names in external form. See also |
|
b47d41d9f4b5
Name_Space: proper configuration options long_names, short_names, unique_names instead of former unsynchronized references;
wenzelm
parents:
42279
diff
changeset
|
183 |
\secref{sec:antiq} for the document antiquotation options of the
|
|
b47d41d9f4b5
Name_Space: proper configuration options long_names, short_names, unique_names instead of former unsynchronized references;
wenzelm
parents:
42279
diff
changeset
|
184 |
same names. |
|
b47d41d9f4b5
Name_Space: proper configuration options long_names, short_names, unique_names instead of former unsynchronized references;
wenzelm
parents:
42279
diff
changeset
|
185 |
|
| 61493 | 186 |
\<^descr> @{attribute eta_contract} controls \<open>\<eta>\<close>-contracted
|
| 42655 | 187 |
printing of terms. |
|
28763
b5e6122ff575
added pretty printing options (from old ref manual);
wenzelm
parents:
28762
diff
changeset
|
188 |
|
| 61493 | 189 |
The \<open>\<eta>\<close>-contraction law asserts @{prop "(\<lambda>x. f x) \<equiv> f"},
|
190 |
provided \<open>x\<close> is not free in \<open>f\<close>. It asserts |
|
| 61477 | 191 |
\<^emph>\<open>extensionality\<close> of functions: @{prop "f \<equiv> g"} if @{prop "f x \<equiv>
|
| 61493 | 192 |
g x"} for all \<open>x\<close>. Higher-order unification frequently puts |
193 |
terms into a fully \<open>\<eta>\<close>-expanded form. For example, if \<open>F\<close> has type \<open>(\<tau> \<Rightarrow> \<tau>) \<Rightarrow> \<tau>\<close> then its expanded form is @{term
|
|
|
28763
b5e6122ff575
added pretty printing options (from old ref manual);
wenzelm
parents:
28762
diff
changeset
|
194 |
"\<lambda>h. F (\<lambda>x. h x)"}. |
|
b5e6122ff575
added pretty printing options (from old ref manual);
wenzelm
parents:
28762
diff
changeset
|
195 |
|
| 61493 | 196 |
Enabling @{attribute eta_contract} makes Isabelle perform \<open>\<eta>\<close>-contractions before printing, so that @{term "\<lambda>h. F (\<lambda>x. h x)"}
|
197 |
appears simply as \<open>F\<close>. |
|
|
28763
b5e6122ff575
added pretty printing options (from old ref manual);
wenzelm
parents:
28762
diff
changeset
|
198 |
|
| 61493 | 199 |
Note that the distinction between a term and its \<open>\<eta>\<close>-expanded |
|
28765
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
wenzelm
parents:
28763
diff
changeset
|
200 |
form occasionally matters. While higher-order resolution and |
| 61493 | 201 |
rewriting operate modulo \<open>\<alpha>\<beta>\<eta>\<close>-conversion, some other tools |
|
28765
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
wenzelm
parents:
28763
diff
changeset
|
202 |
might look at terms more discretely. |
|
28763
b5e6122ff575
added pretty printing options (from old ref manual);
wenzelm
parents:
28762
diff
changeset
|
203 |
|
| 61439 | 204 |
\<^descr> @{attribute goals_limit} controls the maximum number of
|
|
51960
61ac1efe02c3
option "goals_limit", with more uniform description;
wenzelm
parents:
51657
diff
changeset
|
205 |
subgoals to be printed. |
|
28763
b5e6122ff575
added pretty printing options (from old ref manual);
wenzelm
parents:
28762
diff
changeset
|
206 |
|
| 61439 | 207 |
\<^descr> @{attribute show_main_goal} controls whether the main result
|
| 42655 | 208 |
to be proven should be displayed. This information might be |
| 39130 | 209 |
relevant for schematic goals, to inspect the current claim that has |
210 |
been synthesized so far. |
|
|
28763
b5e6122ff575
added pretty printing options (from old ref manual);
wenzelm
parents:
28762
diff
changeset
|
211 |
|
| 61439 | 212 |
\<^descr> @{attribute show_hyps} controls printing of implicit
|
| 42655 | 213 |
hypotheses of local facts. Normally, only those hypotheses are |
| 61477 | 214 |
displayed that are \<^emph>\<open>not\<close> covered by the assumptions of the |
| 42655 | 215 |
current context: this situation indicates a fault in some tool being |
216 |
used. |
|
|
28763
b5e6122ff575
added pretty printing options (from old ref manual);
wenzelm
parents:
28762
diff
changeset
|
217 |
|
| 61477 | 218 |
By enabling @{attribute show_hyps}, output of \<^emph>\<open>all\<close> hypotheses
|
| 42655 | 219 |
can be enforced, which is occasionally useful for diagnostic |
220 |
purposes. |
|
|
28763
b5e6122ff575
added pretty printing options (from old ref manual);
wenzelm
parents:
28762
diff
changeset
|
221 |
|
| 61439 | 222 |
\<^descr> @{attribute show_tags} controls printing of extra annotations
|
| 42655 | 223 |
within theorems, such as internal position information, or the case |
224 |
names being attached by the attribute @{attribute case_names}.
|
|
|
28765
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
wenzelm
parents:
28763
diff
changeset
|
225 |
|
|
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
wenzelm
parents:
28763
diff
changeset
|
226 |
Note that the @{attribute tagged} and @{attribute untagged}
|
|
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
wenzelm
parents:
28763
diff
changeset
|
227 |
attributes provide low-level access to the collection of tags |
|
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
wenzelm
parents:
28763
diff
changeset
|
228 |
associated with a theorem. |
|
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
wenzelm
parents:
28763
diff
changeset
|
229 |
|
| 61439 | 230 |
\<^descr> @{attribute show_question_marks} controls printing of question
|
| 61493 | 231 |
marks for schematic variables, such as \<open>?x\<close>. Only the leading |
|
28765
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
wenzelm
parents:
28763
diff
changeset
|
232 |
question mark is affected, the remaining text is unchanged |
|
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
wenzelm
parents:
28763
diff
changeset
|
233 |
(including proper markup for schematic variables that might be |
|
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
wenzelm
parents:
28763
diff
changeset
|
234 |
relevant for user interfaces). |
| 58618 | 235 |
\<close> |
|
28765
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
wenzelm
parents:
28763
diff
changeset
|
236 |
|
|
da8f6f4a74be
misc tuning and rearrangement of section "Printing logical entities";
wenzelm
parents:
28763
diff
changeset
|
237 |
|
| 58618 | 238 |
subsection \<open>Alternative print modes \label{sec:print-modes}\<close>
|
| 46284 | 239 |
|
| 58618 | 240 |
text \<open> |
| 46284 | 241 |
\begin{mldecls}
|
242 |
@{index_ML print_mode_value: "unit -> string list"} \\
|
|
243 |
@{index_ML Print_Mode.with_modes: "string list -> ('a -> 'b) -> 'a -> 'b"} \\
|
|
244 |
\end{mldecls}
|
|
245 |
||
| 61477 | 246 |
The \<^emph>\<open>print mode\<close> facility allows to modify various operations |
| 46284 | 247 |
for printing. Commands like @{command typ}, @{command term},
|
248 |
@{command thm} (see \secref{sec:print-diag}) take additional print
|
|
249 |
modes as optional argument. The underlying ML operations are as |
|
250 |
follows. |
|
251 |
||
| 61439 | 252 |
\<^descr> @{ML "print_mode_value ()"} yields the list of currently
|
| 46284 | 253 |
active print mode names. This should be understood as symbolic |
254 |
representation of certain individual features for printing (with |
|
255 |
precedence from left to right). |
|
256 |
||
| 61493 | 257 |
\<^descr> @{ML Print_Mode.with_modes}~\<open>modes f x\<close> evaluates
|
258 |
\<open>f x\<close> in an execution context where the print mode is |
|
259 |
prepended by the given \<open>modes\<close>. This provides a thread-safe |
|
| 46284 | 260 |
way to augment print modes. It is also monotonic in the set of mode |
261 |
names: it retains the default print mode that certain |
|
262 |
user-interfaces might have installed for their proper functioning! |
|
263 |
||
264 |
||
| 61421 | 265 |
\<^medskip> |
266 |
The pretty printer for inner syntax maintains alternative |
|
| 46284 | 267 |
mixfix productions for any print mode name invented by the user, say |
268 |
in commands like @{command notation} or @{command abbreviation}.
|
|
269 |
Mode names can be arbitrary, but the following ones have a specific |
|
270 |
meaning by convention: |
|
271 |
||
| 61503 | 272 |
\<^item> \<^verbatim>\<open>""\<close> (the empty string): default mode; |
| 46284 | 273 |
implicitly active as last element in the list of modes. |
274 |
||
| 61503 | 275 |
\<^item> \<^verbatim>\<open>input\<close>: dummy print mode that is never active; may |
| 46284 | 276 |
be used to specify notation that is only available for input. |
277 |
||
| 61503 | 278 |
\<^item> \<^verbatim>\<open>internal\<close> dummy print mode that is never active; |
| 46284 | 279 |
used internally in Isabelle/Pure. |
280 |
||
| 61503 | 281 |
\<^item> \<^verbatim>\<open>xsymbols\<close>: enable proper mathematical symbols |
| 61572 | 282 |
instead of ASCII art.\<^footnote>\<open>This traditional mode name stems from |
283 |
the ``X-Symbol'' package for classic Proof~General with XEmacs.\<close> |
|
| 46284 | 284 |
|
| 61503 | 285 |
\<^item> \<^verbatim>\<open>latex\<close>: additional mode that is active in {\LaTeX}
|
| 46284 | 286 |
document preparation of Isabelle theory sources; allows to provide |
287 |
alternative output notation. |
|
| 58618 | 288 |
\<close> |
| 46284 | 289 |
|
290 |
||
| 58618 | 291 |
section \<open>Mixfix annotations \label{sec:mixfix}\<close>
|
| 28762 | 292 |
|
| 61477 | 293 |
text \<open>Mixfix annotations specify concrete \<^emph>\<open>inner syntax\<close> of |
|
35351
7425aece4ee3
allow general mixfix syntax for type constructors;
wenzelm
parents:
32833
diff
changeset
|
294 |
Isabelle types and terms. Locally fixed parameters in toplevel |
| 46290 | 295 |
theorem statements, locale and class specifications also admit |
296 |
mixfix annotations in a fairly uniform manner. A mixfix annotation |
|
| 50635 | 297 |
describes the concrete syntax, the translation to abstract |
| 46290 | 298 |
syntax, and the pretty printing. Special case annotations provide a |
299 |
simple means of specifying infix operators and binders. |
|
300 |
||
| 58552 | 301 |
Isabelle mixfix syntax is inspired by {\OBJ} @{cite OBJ}. It allows
|
| 46290 | 302 |
to specify any context-free priority grammar, which is more general |
303 |
than the fixity declarations of ML and Prolog. |
|
| 28762 | 304 |
|
|
55112
b1a5d603fd12
prefer rail cartouche -- avoid back-slashed quotes;
wenzelm
parents:
55108
diff
changeset
|
305 |
@{rail \<open>
|
|
51654
8450b944e58a
just one syntax category "mixfix" -- check structure annotation semantically;
wenzelm
parents:
50636
diff
changeset
|
306 |
@{syntax_def mixfix}: '('
|
| 58761 | 307 |
(@{syntax template} prios? @{syntax nat}? |
|
308 |
(@'infix' | @'infixl' | @'infixr') @{syntax template} @{syntax nat} |
|
|
309 |
@'binder' @{syntax template} prios? @{syntax nat} |
|
|
310 |
@'structure') ')' |
|
| 46290 | 311 |
; |
312 |
template: string |
|
| 46289 | 313 |
; |
|
42596
6c621a9d612a
modernized rail diagrams using @{rail} antiquotation;
wenzelm
parents:
42358
diff
changeset
|
314 |
prios: '[' (@{syntax nat} + ',') ']'
|
|
55112
b1a5d603fd12
prefer rail cartouche -- avoid back-slashed quotes;
wenzelm
parents:
55108
diff
changeset
|
315 |
\<close>} |
| 28762 | 316 |
|
| 61493 | 317 |
The string given as \<open>template\<close> may include literal text, |
318 |
spacing, blocks, and arguments (denoted by ``\<open>_\<close>''); the |
|
| 61503 | 319 |
special symbol ``\<^verbatim>\<open>\<index>\<close>'' (printed as ``\<open>\<index>\<close>'') |
|
51657
3db1bbc82d8d
more accurate documentation of "(structure)" mixfix;
wenzelm
parents:
51654
diff
changeset
|
320 |
represents an index argument that specifies an implicit @{keyword
|
|
3db1bbc82d8d
more accurate documentation of "(structure)" mixfix;
wenzelm
parents:
51654
diff
changeset
|
321 |
"structure"} reference (see also \secref{sec:locale}). Only locally
|
|
3db1bbc82d8d
more accurate documentation of "(structure)" mixfix;
wenzelm
parents:
51654
diff
changeset
|
322 |
fixed variables may be declared as @{keyword "structure"}.
|
|
3db1bbc82d8d
more accurate documentation of "(structure)" mixfix;
wenzelm
parents:
51654
diff
changeset
|
323 |
|
|
3db1bbc82d8d
more accurate documentation of "(structure)" mixfix;
wenzelm
parents:
51654
diff
changeset
|
324 |
Infix and binder declarations provide common abbreviations for |
|
3db1bbc82d8d
more accurate documentation of "(structure)" mixfix;
wenzelm
parents:
51654
diff
changeset
|
325 |
particular mixfix declarations. So in practice, mixfix templates |
|
3db1bbc82d8d
more accurate documentation of "(structure)" mixfix;
wenzelm
parents:
51654
diff
changeset
|
326 |
mostly degenerate to literal text for concrete syntax, such as |
| 61503 | 327 |
``\<^verbatim>\<open>++\<close>'' for an infix symbol. |
328 |
\<close> |
|
| 28762 | 329 |
|
| 46290 | 330 |
|
| 58618 | 331 |
subsection \<open>The general mixfix form\<close> |
| 46290 | 332 |
|
| 58618 | 333 |
text \<open>In full generality, mixfix declarations work as follows. |
| 61493 | 334 |
Suppose a constant \<open>c :: \<tau>\<^sub>1 \<Rightarrow> \<dots> \<tau>\<^sub>n \<Rightarrow> \<tau>\<close> is annotated by |
335 |
\<open>(mixfix [p\<^sub>1, \<dots>, p\<^sub>n] p)\<close>, where \<open>mixfix\<close> is a string |
|
336 |
\<open>d\<^sub>0 _ d\<^sub>1 _ \<dots> _ d\<^sub>n\<close> consisting of delimiters that surround |
|
| 46290 | 337 |
argument positions as indicated by underscores. |
| 28762 | 338 |
|
339 |
Altogether this determines a production for a context-free priority |
|
| 61493 | 340 |
grammar, where for each argument \<open>i\<close> the syntactic category |
341 |
is determined by \<open>\<tau>\<^sub>i\<close> (with priority \<open>p\<^sub>i\<close>), and the |
|
342 |
result category is determined from \<open>\<tau>\<close> (with priority \<open>p\<close>). Priority specifications are optional, with default 0 for |
|
| 61572 | 343 |
arguments and 1000 for the result.\<^footnote>\<open>Omitting priorities is |
| 46292 | 344 |
prone to syntactic ambiguities unless the delimiter tokens determine |
| 61572 | 345 |
fully bracketed notation, as in \<open>if _ then _ else _ fi\<close>.\<close> |
| 28762 | 346 |
|
| 61493 | 347 |
Since \<open>\<tau>\<close> may be again a function type, the constant |
| 28762 | 348 |
type scheme may have more argument positions than the mixfix |
| 61493 | 349 |
pattern. Printing a nested application \<open>c t\<^sub>1 \<dots> t\<^sub>m\<close> for |
350 |
\<open>m > n\<close> works by attaching concrete notation only to the |
|
351 |
innermost part, essentially by printing \<open>(c t\<^sub>1 \<dots> t\<^sub>n) \<dots> t\<^sub>m\<close> |
|
| 28762 | 352 |
instead. If a term has fewer arguments than specified in the mixfix |
353 |
template, the concrete syntax is ignored. |
|
354 |
||
| 61421 | 355 |
\<^medskip> |
356 |
A mixfix template may also contain additional directives |
|
| 28762 | 357 |
for pretty printing, notably spaces, blocks, and breaks. The |
358 |
general template format is a sequence over any of the following |
|
359 |
entities. |
|
360 |
||
| 61493 | 361 |
\<^descr> \<open>d\<close> is a delimiter, namely a non-empty sequence of |
|
28771
4510201c6aaf
mixfix annotations: verbatim for special symbols;
wenzelm
parents:
28770
diff
changeset
|
362 |
characters other than the following special characters: |
| 28762 | 363 |
|
| 61421 | 364 |
\<^medskip> |
|
28771
4510201c6aaf
mixfix annotations: verbatim for special symbols;
wenzelm
parents:
28770
diff
changeset
|
365 |
\begin{tabular}{ll}
|
| 61503 | 366 |
\<^verbatim>\<open>'\<close> & single quote \\ |
367 |
\<^verbatim>\<open>_\<close> & underscore \\ |
|
| 61493 | 368 |
\<open>\<index>\<close> & index symbol \\ |
| 61503 | 369 |
\<^verbatim>\<open>(\<close> & open parenthesis \\ |
370 |
\<^verbatim>\<open>)\<close> & close parenthesis \\ |
|
371 |
\<^verbatim>\<open>/\<close> & slash \\ |
|
|
28771
4510201c6aaf
mixfix annotations: verbatim for special symbols;
wenzelm
parents:
28770
diff
changeset
|
372 |
\end{tabular}
|
| 61421 | 373 |
\<^medskip> |
| 28762 | 374 |
|
| 61503 | 375 |
\<^descr> \<^verbatim>\<open>'\<close> escapes the special meaning of these |
|
28771
4510201c6aaf
mixfix annotations: verbatim for special symbols;
wenzelm
parents:
28770
diff
changeset
|
376 |
meta-characters, producing a literal version of the following |
|
4510201c6aaf
mixfix annotations: verbatim for special symbols;
wenzelm
parents:
28770
diff
changeset
|
377 |
character, unless that is a blank. |
|
4510201c6aaf
mixfix annotations: verbatim for special symbols;
wenzelm
parents:
28770
diff
changeset
|
378 |
|
|
4510201c6aaf
mixfix annotations: verbatim for special symbols;
wenzelm
parents:
28770
diff
changeset
|
379 |
A single quote followed by a blank separates delimiters, without |
|
4510201c6aaf
mixfix annotations: verbatim for special symbols;
wenzelm
parents:
28770
diff
changeset
|
380 |
affecting printing, but input tokens may have additional white space |
|
4510201c6aaf
mixfix annotations: verbatim for special symbols;
wenzelm
parents:
28770
diff
changeset
|
381 |
here. |
|
4510201c6aaf
mixfix annotations: verbatim for special symbols;
wenzelm
parents:
28770
diff
changeset
|
382 |
|
| 61503 | 383 |
\<^descr> \<^verbatim>\<open>_\<close> is an argument position, which stands for a |
| 28762 | 384 |
certain syntactic category in the underlying grammar. |
385 |
||
| 61493 | 386 |
\<^descr> \<open>\<index>\<close> is an indexed argument position; this is the place |
|
28771
4510201c6aaf
mixfix annotations: verbatim for special symbols;
wenzelm
parents:
28770
diff
changeset
|
387 |
where implicit structure arguments can be attached. |
| 28762 | 388 |
|
| 61493 | 389 |
\<^descr> \<open>s\<close> is a non-empty sequence of spaces for printing. |
|
28771
4510201c6aaf
mixfix annotations: verbatim for special symbols;
wenzelm
parents:
28770
diff
changeset
|
390 |
This and the following specifications do not affect parsing at all. |
| 28762 | 391 |
|
| 61503 | 392 |
\<^descr> \<^verbatim>\<open>(\<close>\<open>n\<close> opens a pretty printing block. The |
| 28762 | 393 |
optional number specifies how much indentation to add when a line |
394 |
break occurs within the block. If the parenthesis is not followed |
|
395 |
by digits, the indentation defaults to 0. A block specified via |
|
| 61503 | 396 |
\<^verbatim>\<open>(00\<close> is unbreakable. |
| 28762 | 397 |
|
| 61503 | 398 |
\<^descr> \<^verbatim>\<open>)\<close> closes a pretty printing block. |
| 28762 | 399 |
|
| 61503 | 400 |
\<^descr> \<^verbatim>\<open>//\<close> forces a line break. |
| 28762 | 401 |
|
| 61503 | 402 |
\<^descr> \<^verbatim>\<open>/\<close>\<open>s\<close> allows a line break. Here \<open>s\<close> |
|
28771
4510201c6aaf
mixfix annotations: verbatim for special symbols;
wenzelm
parents:
28770
diff
changeset
|
403 |
stands for the string of spaces (zero or more) right after the |
| 61477 | 404 |
slash. These spaces are printed if the break is \<^emph>\<open>not\<close> taken. |
| 28762 | 405 |
|
406 |
||
407 |
The general idea of pretty printing with blocks and breaks is also |
|
| 58552 | 408 |
described in @{cite "paulson-ml2"}; it goes back to @{cite "Oppen:1980"}.
|
| 58618 | 409 |
\<close> |
| 28762 | 410 |
|
411 |
||
| 58618 | 412 |
subsection \<open>Infixes\<close> |
| 46290 | 413 |
|
| 58618 | 414 |
text \<open>Infix operators are specified by convenient short forms that |
| 46290 | 415 |
abbreviate general mixfix annotations as follows: |
416 |
||
417 |
\begin{center}
|
|
418 |
\begin{tabular}{lll}
|
|
419 |
||
| 61503 | 420 |
\<^verbatim>\<open>(\<close>@{keyword_def "infix"}~\<^verbatim>\<open>"\<close>\<open>sy\<close>\<^verbatim>\<open>"\<close> \<open>p\<close>\<^verbatim>\<open>)\<close>
|
| 61493 | 421 |
& \<open>\<mapsto>\<close> & |
| 61503 | 422 |
\<^verbatim>\<open>("(_\<close>~\<open>sy\<close>\<^verbatim>\<open>/ _)" [\<close>\<open>p + 1\<close>\<^verbatim>\<open>,\<close>~\<open>p + 1\<close>\<^verbatim>\<open>]\<close>~\<open>p\<close>\<^verbatim>\<open>)\<close> \\
|
423 |
\<^verbatim>\<open>(\<close>@{keyword_def "infixl"}~\<^verbatim>\<open>"\<close>\<open>sy\<close>\<^verbatim>\<open>"\<close> \<open>p\<close>\<^verbatim>\<open>)\<close>
|
|
| 61493 | 424 |
& \<open>\<mapsto>\<close> & |
| 61503 | 425 |
\<^verbatim>\<open>("(_\<close>~\<open>sy\<close>\<^verbatim>\<open>/ _)" [\<close>\<open>p\<close>\<^verbatim>\<open>,\<close>~\<open>p + 1\<close>\<^verbatim>\<open>]\<close>~\<open>p\<close>\<^verbatim>\<open>)\<close> \\
|
426 |
\<^verbatim>\<open>(\<close>@{keyword_def "infixr"}~\<^verbatim>\<open>"\<close>\<open>sy\<close>\<^verbatim>\<open>"\<close>~\<open>p\<close>\<^verbatim>\<open>)\<close>
|
|
| 61493 | 427 |
& \<open>\<mapsto>\<close> & |
| 61503 | 428 |
\<^verbatim>\<open>("(_\<close>~\<open>sy\<close>\<^verbatim>\<open>/ _)" [\<close>\<open>p + 1\<close>\<^verbatim>\<open>,\<close>~\<open>p\<close>\<^verbatim>\<open>]\<close>~\<open>p\<close>\<^verbatim>\<open>)\<close> \\
|
| 46290 | 429 |
|
430 |
\end{tabular}
|
|
431 |
\end{center}
|
|
432 |
||
| 61503 | 433 |
The mixfix template \<^verbatim>\<open>"(_\<close>~\<open>sy\<close>\<^verbatim>\<open>/ _)"\<close> |
| 46292 | 434 |
specifies two argument positions; the delimiter is preceded by a |
435 |
space and followed by a space or line break; the entire phrase is a |
|
436 |
pretty printing block. |
|
| 46290 | 437 |
|
| 61503 | 438 |
The alternative notation \<^verbatim>\<open>op\<close>~\<open>sy\<close> is introduced |
| 46290 | 439 |
in addition. Thus any infix operator may be written in prefix form |
440 |
(as in ML), independently of the number of arguments in the term. |
|
| 58618 | 441 |
\<close> |
| 46290 | 442 |
|
443 |
||
| 58618 | 444 |
subsection \<open>Binders\<close> |
| 46290 | 445 |
|
| 61477 | 446 |
text \<open>A \<^emph>\<open>binder\<close> is a variable-binding construct such as a |
| 61493 | 447 |
quantifier. The idea to formalize \<open>\<forall>x. b\<close> as \<open>All |
448 |
(\<lambda>x. b)\<close> for \<open>All :: ('a \<Rightarrow> bool) \<Rightarrow> bool\<close> already goes back
|
|
| 58552 | 449 |
to @{cite church40}. Isabelle declarations of certain higher-order
|
| 46292 | 450 |
operators may be annotated with @{keyword_def "binder"} annotations
|
451 |
as follows: |
|
| 46290 | 452 |
|
453 |
\begin{center}
|
|
| 61503 | 454 |
\<open>c ::\<close>~\<^verbatim>\<open>"\<close>\<open>(\<tau>\<^sub>1 \<Rightarrow> \<tau>\<^sub>2) \<Rightarrow> \<tau>\<^sub>3\<close>\<^verbatim>\<open>" (\<close>@{keyword "binder"}~\<^verbatim>\<open>"\<close>\<open>sy\<close>\<^verbatim>\<open>" [\<close>\<open>p\<close>\<^verbatim>\<open>]\<close>~\<open>q\<close>\<^verbatim>\<open>)\<close>
|
| 46290 | 455 |
\end{center}
|
456 |
||
| 61493 | 457 |
This introduces concrete binder syntax \<open>sy x. b\<close>, where |
458 |
\<open>x\<close> is a bound variable of type \<open>\<tau>\<^sub>1\<close>, the body \<open>b\<close> has type \<open>\<tau>\<^sub>2\<close> and the whole term has type \<open>\<tau>\<^sub>3\<close>. |
|
459 |
The optional integer \<open>p\<close> specifies the syntactic priority of |
|
460 |
the body; the default is \<open>q\<close>, which is also the priority of |
|
| 46290 | 461 |
the whole construct. |
462 |
||
463 |
Internally, the binder syntax is expanded to something like this: |
|
464 |
\begin{center}
|
|
| 61503 | 465 |
\<open>c_binder ::\<close>~\<^verbatim>\<open>"\<close>\<open>idts \<Rightarrow> \<tau>\<^sub>2 \<Rightarrow> \<tau>\<^sub>3\<close>\<^verbatim>\<open>" ("(3\<close>\<open>sy\<close>\<^verbatim>\<open>_./ _)" [0,\<close>~\<open>p\<close>\<^verbatim>\<open>]\<close>~\<open>q\<close>\<^verbatim>\<open>)\<close>
|
| 46290 | 466 |
\end{center}
|
467 |
||
468 |
Here @{syntax (inner) idts} is the nonterminal symbol for a list of
|
|
469 |
identifiers with optional type constraints (see also |
|
| 61503 | 470 |
\secref{sec:pure-grammar}). The mixfix template \<^verbatim>\<open>"(3\<close>\<open>sy\<close>\<^verbatim>\<open>_./ _)"\<close>
|
471 |
defines argument positions |
|
| 46290 | 472 |
for the bound identifiers and the body, separated by a dot with |
473 |
optional line break; the entire phrase is a pretty printing block of |
|
| 61493 | 474 |
indentation level 3. Note that there is no extra space after \<open>sy\<close>, so it needs to be included user specification if the binder |
| 46290 | 475 |
syntax ends with a token that may be continued by an identifier |
476 |
token at the start of @{syntax (inner) idts}.
|
|
477 |
||
| 61493 | 478 |
Furthermore, a syntax translation to transforms \<open>c_binder x\<^sub>1 |
479 |
\<dots> x\<^sub>n b\<close> into iterated application \<open>c (\<lambda>x\<^sub>1. \<dots> c (\<lambda>x\<^sub>n. b)\<dots>)\<close>. |
|
| 58618 | 480 |
This works in both directions, for parsing and printing.\<close> |
| 46290 | 481 |
|
482 |
||
| 58618 | 483 |
section \<open>Explicit notation \label{sec:notation}\<close>
|
| 28762 | 484 |
|
| 58618 | 485 |
text \<open> |
| 28762 | 486 |
\begin{matharray}{rcll}
|
| 61493 | 487 |
@{command_def "type_notation"} & : & \<open>local_theory \<rightarrow> local_theory\<close> \\
|
488 |
@{command_def "no_type_notation"} & : & \<open>local_theory \<rightarrow> local_theory\<close> \\
|
|
489 |
@{command_def "notation"} & : & \<open>local_theory \<rightarrow> local_theory\<close> \\
|
|
490 |
@{command_def "no_notation"} & : & \<open>local_theory \<rightarrow> local_theory\<close> \\
|
|
491 |
@{command_def "write"} & : & \<open>proof(state) \<rightarrow> proof(state)\<close> \\
|
|
| 28762 | 492 |
\end{matharray}
|
493 |
||
| 46288 | 494 |
Commands that introduce new logical entities (terms or types) |
495 |
usually allow to provide mixfix annotations on the spot, which is |
|
496 |
convenient for default notation. Nonetheless, the syntax may be |
|
497 |
modified later on by declarations for explicit notation. This |
|
498 |
allows to add or delete mixfix annotations for of existing logical |
|
499 |
entities within the current context. |
|
500 |
||
|
55112
b1a5d603fd12
prefer rail cartouche -- avoid back-slashed quotes;
wenzelm
parents:
55108
diff
changeset
|
501 |
@{rail \<open>
|
|
59783
00b62aa9f430
tuned syntax diagrams -- no duplication of "target";
wenzelm
parents:
58842
diff
changeset
|
502 |
(@@{command type_notation} | @@{command no_type_notation}) @{syntax mode}? \<newline>
|
|
00b62aa9f430
tuned syntax diagrams -- no duplication of "target";
wenzelm
parents:
58842
diff
changeset
|
503 |
(@{syntax nameref} @{syntax mixfix} + @'and')
|
| 35413 | 504 |
; |
|
59783
00b62aa9f430
tuned syntax diagrams -- no duplication of "target";
wenzelm
parents:
58842
diff
changeset
|
505 |
(@@{command notation} | @@{command no_notation}) @{syntax mode}? \<newline>
|
|
51654
8450b944e58a
just one syntax category "mixfix" -- check structure annotation semantically;
wenzelm
parents:
50636
diff
changeset
|
506 |
(@{syntax nameref} @{syntax mixfix} + @'and')
|
| 28762 | 507 |
; |
|
51654
8450b944e58a
just one syntax category "mixfix" -- check structure annotation semantically;
wenzelm
parents:
50636
diff
changeset
|
508 |
@@{command write} @{syntax mode}? (@{syntax nameref} @{syntax mixfix} + @'and')
|
|
55112
b1a5d603fd12
prefer rail cartouche -- avoid back-slashed quotes;
wenzelm
parents:
55108
diff
changeset
|
509 |
\<close>} |
| 28762 | 510 |
|
| 61493 | 511 |
\<^descr> @{command "type_notation"}~\<open>c (mx)\<close> associates mixfix
|
| 35413 | 512 |
syntax with an existing type constructor. The arity of the |
513 |
constructor is retrieved from the context. |
|
| 46282 | 514 |
|
| 61439 | 515 |
\<^descr> @{command "no_type_notation"} is similar to @{command
|
| 35413 | 516 |
"type_notation"}, but removes the specified syntax annotation from |
517 |
the present context. |
|
518 |
||
| 61493 | 519 |
\<^descr> @{command "notation"}~\<open>c (mx)\<close> associates mixfix
|
| 35413 | 520 |
syntax with an existing constant or fixed variable. The type |
521 |
declaration of the given entity is retrieved from the context. |
|
| 46282 | 522 |
|
| 61439 | 523 |
\<^descr> @{command "no_notation"} is similar to @{command "notation"},
|
| 28762 | 524 |
but removes the specified syntax annotation from the present |
525 |
context. |
|
526 |
||
| 61439 | 527 |
\<^descr> @{command "write"} is similar to @{command "notation"}, but
|
|
36508
03d2a2d0ee4a
allow concrete syntax for local entities within a proof body, either via regular mixfix annotations to 'fix' etc. or the separate 'write' command;
wenzelm
parents:
35413
diff
changeset
|
528 |
works within an Isar proof body. |
| 58618 | 529 |
\<close> |
| 28762 | 530 |
|
| 28778 | 531 |
|
| 58618 | 532 |
section \<open>The Pure syntax \label{sec:pure-syntax}\<close>
|
|
28769
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
533 |
|
| 58618 | 534 |
subsection \<open>Lexical matters \label{sec:inner-lex}\<close>
|
| 46282 | 535 |
|
| 58618 | 536 |
text \<open>The inner lexical syntax vaguely resembles the outer one |
| 46282 | 537 |
(\secref{sec:outer-lex}), but some details are different. There are
|
538 |
two main categories of inner syntax tokens: |
|
539 |
||
| 61477 | 540 |
\<^enum> \<^emph>\<open>delimiters\<close> --- the literal tokens occurring in |
| 46282 | 541 |
productions of the given priority grammar (cf.\ |
542 |
\secref{sec:priority-grammar});
|
|
543 |
||
| 61477 | 544 |
\<^enum> \<^emph>\<open>named tokens\<close> --- various categories of identifiers etc. |
| 46282 | 545 |
|
546 |
||
547 |
Delimiters override named tokens and may thus render certain |
|
548 |
identifiers inaccessible. Sometimes the logical context admits |
|
549 |
alternative ways to refer to the same entity, potentially via |
|
550 |
qualified names. |
|
551 |
||
| 61421 | 552 |
\<^medskip> |
553 |
The categories for named tokens are defined once and for |
|
| 46282 | 554 |
all as follows, reusing some categories of the outer token syntax |
555 |
(\secref{sec:outer-lex}).
|
|
556 |
||
557 |
\begin{center}
|
|
558 |
\begin{supertabular}{rcl}
|
|
559 |
@{syntax_def (inner) id} & = & @{syntax_ref ident} \\
|
|
560 |
@{syntax_def (inner) longid} & = & @{syntax_ref longident} \\
|
|
561 |
@{syntax_def (inner) var} & = & @{syntax_ref var} \\
|
|
562 |
@{syntax_def (inner) tid} & = & @{syntax_ref typefree} \\
|
|
563 |
@{syntax_def (inner) tvar} & = & @{syntax_ref typevar} \\
|
|
|
58410
6d46ad54a2ab
explicit separation of signed and unsigned numerals using existing lexical categories num and xnum
haftmann
parents:
58409
diff
changeset
|
564 |
@{syntax_def (inner) num_token} & = & @{syntax_ref nat} \\
|
| 61503 | 565 |
@{syntax_def (inner) float_token} & = & @{syntax_ref nat}\<^verbatim>\<open>.\<close>@{syntax_ref nat} \\
|
566 |
@{syntax_def (inner) str_token} & = & \<^verbatim>\<open>''\<close> \<open>\<dots>\<close> \<^verbatim>\<open>''\<close> \\
|
|
567 |
@{syntax_def (inner) string_token} & = & \<^verbatim>\<open>"\<close> \<open>\<dots>\<close> \<^verbatim>\<open>"\<close> \\
|
|
| 61493 | 568 |
@{syntax_def (inner) cartouche} & = & @{verbatim "\<open>"} \<open>\<dots>\<close> @{verbatim "\<close>"} \\
|
| 46282 | 569 |
\end{supertabular}
|
570 |
\end{center}
|
|
571 |
||
572 |
The token categories @{syntax (inner) num_token}, @{syntax (inner)
|
|
| 58421 | 573 |
float_token}, @{syntax (inner) str_token}, @{syntax (inner) string_token},
|
574 |
and @{syntax (inner) cartouche} are not used in Pure. Object-logics may
|
|
575 |
implement numerals and string literals by adding appropriate syntax |
|
576 |
declarations, together with some translation functions (e.g.\ see @{file
|
|
577 |
"~~/src/HOL/Tools/string_syntax.ML"}). |
|
| 46282 | 578 |
|
| 58421 | 579 |
The derived categories @{syntax_def (inner) num_const}, and @{syntax_def
|
580 |
(inner) float_const}, provide robust access to the respective tokens: the |
|
581 |
syntax tree holds a syntactic constant instead of a free variable. |
|
| 58618 | 582 |
\<close> |
| 46282 | 583 |
|
584 |
||
| 58618 | 585 |
subsection \<open>Priority grammars \label{sec:priority-grammar}\<close>
|
|
28769
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
586 |
|
| 61477 | 587 |
text \<open>A context-free grammar consists of a set of \<^emph>\<open>terminal |
588 |
symbols\<close>, a set of \<^emph>\<open>nonterminal symbols\<close> and a set of |
|
| 61493 | 589 |
\<^emph>\<open>productions\<close>. Productions have the form \<open>A = \<gamma>\<close>, |
590 |
where \<open>A\<close> is a nonterminal and \<open>\<gamma>\<close> is a string of |
|
|
28769
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
591 |
terminals and nonterminals. One designated nonterminal is called |
| 61477 | 592 |
the \<^emph>\<open>root symbol\<close>. The language defined by the grammar |
|
28769
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
593 |
consists of all strings of terminals that can be derived from the |
|
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
594 |
root symbol by applying productions as rewrite rules. |
|
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
595 |
|
| 61477 | 596 |
The standard Isabelle parser for inner syntax uses a \<^emph>\<open>priority |
597 |
grammar\<close>. Each nonterminal is decorated by an integer priority: |
|
| 61493 | 598 |
\<open>A\<^sup>(\<^sup>p\<^sup>)\<close>. In a derivation, \<open>A\<^sup>(\<^sup>p\<^sup>)\<close> may be rewritten |
599 |
using a production \<open>A\<^sup>(\<^sup>q\<^sup>) = \<gamma>\<close> only if \<open>p \<le> q\<close>. Any |
|
|
28769
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
600 |
priority grammar can be translated into a normal context-free |
|
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
601 |
grammar by introducing new nonterminals and productions. |
|
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
602 |
|
| 61421 | 603 |
\<^medskip> |
| 61493 | 604 |
Formally, a set of context free productions \<open>G\<close> |
605 |
induces a derivation relation \<open>\<longrightarrow>\<^sub>G\<close> as follows. Let \<open>\<alpha>\<close> and \<open>\<beta>\<close> denote strings of terminal or nonterminal symbols. |
|
606 |
Then \<open>\<alpha> A\<^sup>(\<^sup>p\<^sup>) \<beta> \<longrightarrow>\<^sub>G \<alpha> \<gamma> \<beta>\<close> holds if and only if \<open>G\<close> |
|
607 |
contains some production \<open>A\<^sup>(\<^sup>q\<^sup>) = \<gamma>\<close> for \<open>p \<le> q\<close>. |
|
|
28769
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
608 |
|
| 61421 | 609 |
\<^medskip> |
610 |
The following grammar for arithmetic expressions |
|
|
28769
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
611 |
demonstrates how binding power and associativity of operators can be |
|
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
612 |
enforced by priorities. |
|
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
613 |
|
|
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
614 |
\begin{center}
|
|
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
615 |
\begin{tabular}{rclr}
|
| 61503 | 616 |
\<open>A\<^sup>(\<^sup>1\<^sup>0\<^sup>0\<^sup>0\<^sup>)\<close> & \<open>=\<close> & \<^verbatim>\<open>(\<close> \<open>A\<^sup>(\<^sup>0\<^sup>)\<close> \<^verbatim>\<open>)\<close> \\ |
617 |
\<open>A\<^sup>(\<^sup>1\<^sup>0\<^sup>0\<^sup>0\<^sup>)\<close> & \<open>=\<close> & \<^verbatim>\<open>0\<close> \\ |
|
618 |
\<open>A\<^sup>(\<^sup>0\<^sup>)\<close> & \<open>=\<close> & \<open>A\<^sup>(\<^sup>0\<^sup>)\<close> \<^verbatim>\<open>+\<close> \<open>A\<^sup>(\<^sup>1\<^sup>)\<close> \\ |
|
619 |
\<open>A\<^sup>(\<^sup>2\<^sup>)\<close> & \<open>=\<close> & \<open>A\<^sup>(\<^sup>3\<^sup>)\<close> \<^verbatim>\<open>*\<close> \<open>A\<^sup>(\<^sup>2\<^sup>)\<close> \\ |
|
620 |
\<open>A\<^sup>(\<^sup>3\<^sup>)\<close> & \<open>=\<close> & \<^verbatim>\<open>-\<close> \<open>A\<^sup>(\<^sup>3\<^sup>)\<close> \\ |
|
|
28769
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
621 |
\end{tabular}
|
|
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
622 |
\end{center}
|
| 61503 | 623 |
The choice of priorities determines that \<^verbatim>\<open>-\<close> binds |
624 |
tighter than \<^verbatim>\<open>*\<close>, which binds tighter than \<^verbatim>\<open>+\<close>. |
|
625 |
Furthermore \<^verbatim>\<open>+\<close> associates to the left and |
|
626 |
\<^verbatim>\<open>*\<close> to the right. |
|
|
28769
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
627 |
|
| 61421 | 628 |
\<^medskip> |
629 |
For clarity, grammars obey these conventions: |
|
|
28769
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
630 |
|
| 61421 | 631 |
\<^item> All priorities must lie between 0 and 1000. |
|
28769
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
632 |
|
| 61421 | 633 |
\<^item> Priority 0 on the right-hand side and priority 1000 on the |
|
28769
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
634 |
left-hand side may be omitted. |
|
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
635 |
|
| 61493 | 636 |
\<^item> The production \<open>A\<^sup>(\<^sup>p\<^sup>) = \<alpha>\<close> is written as \<open>A = \<alpha> |
637 |
(p)\<close>, i.e.\ the priority of the left-hand side actually appears in |
|
|
28769
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
638 |
a column on the far right. |
|
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
639 |
|
| 61493 | 640 |
\<^item> Alternatives are separated by \<open>|\<close>. |
|
28769
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
641 |
|
| 61493 | 642 |
\<^item> Repetition is indicated by dots \<open>(\<dots>)\<close> in an informal |
|
28769
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
643 |
but obvious way. |
|
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
644 |
|
|
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
645 |
|
|
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
646 |
Using these conventions, the example grammar specification above |
|
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
647 |
takes the form: |
|
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
648 |
\begin{center}
|
|
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
649 |
\begin{tabular}{rclc}
|
| 61503 | 650 |
\<open>A\<close> & \<open>=\<close> & \<^verbatim>\<open>(\<close> \<open>A\<close> \<^verbatim>\<open>)\<close> \\ |
651 |
& \<open>|\<close> & \<^verbatim>\<open>0\<close> & \qquad\qquad \\ |
|
652 |
& \<open>|\<close> & \<open>A\<close> \<^verbatim>\<open>+\<close> \<open>A\<^sup>(\<^sup>1\<^sup>)\<close> & \<open>(0)\<close> \\ |
|
653 |
& \<open>|\<close> & \<open>A\<^sup>(\<^sup>3\<^sup>)\<close> \<^verbatim>\<open>*\<close> \<open>A\<^sup>(\<^sup>2\<^sup>)\<close> & \<open>(2)\<close> \\ |
|
654 |
& \<open>|\<close> & \<^verbatim>\<open>-\<close> \<open>A\<^sup>(\<^sup>3\<^sup>)\<close> & \<open>(3)\<close> \\ |
|
|
28769
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
655 |
\end{tabular}
|
|
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
656 |
\end{center}
|
| 58618 | 657 |
\<close> |
|
28769
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
658 |
|
|
8fc228f21861
added section "Priority grammars" (variant from old ref manual);
wenzelm
parents:
28767
diff
changeset
|
659 |
|
| 58618 | 660 |
subsection \<open>The Pure grammar \label{sec:pure-grammar}\<close>
|
|
28770
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
wenzelm
parents:
28769
diff
changeset
|
661 |
|
| 61493 | 662 |
text \<open>The priority grammar of the \<open>Pure\<close> theory is defined |
| 46287 | 663 |
approximately like this: |
| 28774 | 664 |
|
|
28770
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
wenzelm
parents:
28769
diff
changeset
|
665 |
\begin{center}
|
| 28773 | 666 |
\begin{supertabular}{rclr}
|
|
28770
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
wenzelm
parents:
28769
diff
changeset
|
667 |
|
| 61493 | 668 |
@{syntax_def (inner) any} & = & \<open>prop | logic\<close> \\\\
|
| 28772 | 669 |
|
| 61503 | 670 |
@{syntax_def (inner) prop} & = & \<^verbatim>\<open>(\<close> \<open>prop\<close> \<^verbatim>\<open>)\<close> \\
|
671 |
& \<open>|\<close> & \<open>prop\<^sup>(\<^sup>4\<^sup>)\<close> \<^verbatim>\<open>::\<close> \<open>type\<close> & \<open>(3)\<close> \\ |
|
672 |
& \<open>|\<close> & \<open>any\<^sup>(\<^sup>3\<^sup>)\<close> \<^verbatim>\<open>==\<close> \<open>any\<^sup>(\<^sup>3\<^sup>)\<close> & \<open>(2)\<close> \\ |
|
| 61493 | 673 |
& \<open>|\<close> & \<open>any\<^sup>(\<^sup>3\<^sup>)\<close> \<open>\<equiv>\<close> \<open>any\<^sup>(\<^sup>3\<^sup>)\<close> & \<open>(2)\<close> \\ |
| 61503 | 674 |
& \<open>|\<close> & \<open>prop\<^sup>(\<^sup>3\<^sup>)\<close> \<^verbatim>\<open>&&&\<close> \<open>prop\<^sup>(\<^sup>2\<^sup>)\<close> & \<open>(2)\<close> \\ |
675 |
& \<open>|\<close> & \<open>prop\<^sup>(\<^sup>2\<^sup>)\<close> \<^verbatim>\<open>==>\<close> \<open>prop\<^sup>(\<^sup>1\<^sup>)\<close> & \<open>(1)\<close> \\ |
|
| 61493 | 676 |
& \<open>|\<close> & \<open>prop\<^sup>(\<^sup>2\<^sup>)\<close> \<open>\<Longrightarrow>\<close> \<open>prop\<^sup>(\<^sup>1\<^sup>)\<close> & \<open>(1)\<close> \\ |
| 61503 | 677 |
& \<open>|\<close> & \<^verbatim>\<open>[|\<close> \<open>prop\<close> \<^verbatim>\<open>;\<close> \<open>\<dots>\<close> \<^verbatim>\<open>;\<close> \<open>prop\<close> \<^verbatim>\<open>|]\<close> \<^verbatim>\<open>==>\<close> \<open>prop\<^sup>(\<^sup>1\<^sup>)\<close> & \<open>(1)\<close> \\ |
678 |
& \<open>|\<close> & \<open>\<lbrakk>\<close> \<open>prop\<close> \<^verbatim>\<open>;\<close> \<open>\<dots>\<close> \<^verbatim>\<open>;\<close> \<open>prop\<close> \<open>\<rbrakk>\<close> \<open>\<Longrightarrow>\<close> \<open>prop\<^sup>(\<^sup>1\<^sup>)\<close> & \<open>(1)\<close> \\ |
|
679 |
& \<open>|\<close> & \<^verbatim>\<open>!!\<close> \<open>idts\<close> \<^verbatim>\<open>.\<close> \<open>prop\<close> & \<open>(0)\<close> \\ |
|
680 |
& \<open>|\<close> & \<open>\<And>\<close> \<open>idts\<close> \<^verbatim>\<open>.\<close> \<open>prop\<close> & \<open>(0)\<close> \\ |
|
681 |
& \<open>|\<close> & \<^verbatim>\<open>OFCLASS\<close> \<^verbatim>\<open>(\<close> \<open>type\<close> \<^verbatim>\<open>,\<close> \<open>logic\<close> \<^verbatim>\<open>)\<close> \\ |
|
682 |
& \<open>|\<close> & \<^verbatim>\<open>SORT_CONSTRAINT\<close> \<^verbatim>\<open>(\<close> \<open>type\<close> \<^verbatim>\<open>)\<close> \\ |
|
683 |
& \<open>|\<close> & \<^verbatim>\<open>TERM\<close> \<open>logic\<close> \\ |
|
684 |
& \<open>|\<close> & \<^verbatim>\<open>PROP\<close> \<open>aprop\<close> \\\\ |
|
| 28772 | 685 |
|
| 61503 | 686 |
@{syntax_def (inner) aprop} & = & \<^verbatim>\<open>(\<close> \<open>aprop\<close> \<^verbatim>\<open>)\<close> \\
|
687 |
& \<open>|\<close> & \<open>id | longid | var |\<close>~~\<^verbatim>\<open>_\<close>~~\<open>|\<close>~~\<^verbatim>\<open>...\<close> \\ |
|
688 |
& \<open>|\<close> & \<^verbatim>\<open>CONST\<close> \<open>id |\<close>~~\<^verbatim>\<open>CONST\<close> \<open>longid\<close> \\ |
|
689 |
& \<open>|\<close> & \<^verbatim>\<open>XCONST\<close> \<open>id |\<close>~~\<^verbatim>\<open>XCONST\<close> \<open>longid\<close> \\ |
|
| 61493 | 690 |
& \<open>|\<close> & \<open>logic\<^sup>(\<^sup>1\<^sup>0\<^sup>0\<^sup>0\<^sup>) any\<^sup>(\<^sup>1\<^sup>0\<^sup>0\<^sup>0\<^sup>) \<dots> any\<^sup>(\<^sup>1\<^sup>0\<^sup>0\<^sup>0\<^sup>)\<close> & \<open>(999)\<close> \\\\ |
|
28770
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
wenzelm
parents:
28769
diff
changeset
|
691 |
|
| 61503 | 692 |
@{syntax_def (inner) logic} & = & \<^verbatim>\<open>(\<close> \<open>logic\<close> \<^verbatim>\<open>)\<close> \\
|
693 |
& \<open>|\<close> & \<open>logic\<^sup>(\<^sup>4\<^sup>)\<close> \<^verbatim>\<open>::\<close> \<open>type\<close> & \<open>(3)\<close> \\ |
|
694 |
& \<open>|\<close> & \<open>id | longid | var |\<close>~~\<^verbatim>\<open>_\<close>~~\<open>|\<close>~~\<^verbatim>\<open>...\<close> \\ |
|
695 |
& \<open>|\<close> & \<^verbatim>\<open>CONST\<close> \<open>id |\<close>~~\<^verbatim>\<open>CONST\<close> \<open>longid\<close> \\ |
|
696 |
& \<open>|\<close> & \<^verbatim>\<open>XCONST\<close> \<open>id |\<close>~~\<^verbatim>\<open>XCONST\<close> \<open>longid\<close> \\ |
|
| 61493 | 697 |
& \<open>|\<close> & \<open>logic\<^sup>(\<^sup>1\<^sup>0\<^sup>0\<^sup>0\<^sup>) any\<^sup>(\<^sup>1\<^sup>0\<^sup>0\<^sup>0\<^sup>) \<dots> any\<^sup>(\<^sup>1\<^sup>0\<^sup>0\<^sup>0\<^sup>)\<close> & \<open>(999)\<close> \\ |
698 |
& \<open>|\<close> & \<open>\<struct> index\<^sup>(\<^sup>1\<^sup>0\<^sup>0\<^sup>0\<^sup>)\<close> \\ |
|
| 61503 | 699 |
& \<open>|\<close> & \<^verbatim>\<open>%\<close> \<open>pttrns\<close> \<^verbatim>\<open>.\<close> \<open>any\<^sup>(\<^sup>3\<^sup>)\<close> & \<open>(3)\<close> \\ |
700 |
& \<open>|\<close> & \<open>\<lambda>\<close> \<open>pttrns\<close> \<^verbatim>\<open>.\<close> \<open>any\<^sup>(\<^sup>3\<^sup>)\<close> & \<open>(3)\<close> \\ |
|
701 |
& \<open>|\<close> & \<^verbatim>\<open>op\<close> \<^verbatim>\<open>==\<close>~~\<open>|\<close>~~\<^verbatim>\<open>op\<close> \<open>\<equiv>\<close>~~\<open>|\<close>~~\<^verbatim>\<open>op\<close> \<^verbatim>\<open>&&&\<close> \\ |
|
702 |
& \<open>|\<close> & \<^verbatim>\<open>op\<close> \<^verbatim>\<open>==>\<close>~~\<open>|\<close>~~\<^verbatim>\<open>op\<close> \<open>\<Longrightarrow>\<close> \\ |
|
703 |
& \<open>|\<close> & \<^verbatim>\<open>TYPE\<close> \<^verbatim>\<open>(\<close> \<open>type\<close> \<^verbatim>\<open>)\<close> \\\\ |
|
| 28772 | 704 |
|
| 61503 | 705 |
@{syntax_def (inner) idt} & = & \<^verbatim>\<open>(\<close> \<open>idt\<close> \<^verbatim>\<open>)\<close>~~\<open>| id |\<close>~~\<^verbatim>\<open>_\<close> \\
|
706 |
& \<open>|\<close> & \<open>id\<close> \<^verbatim>\<open>::\<close> \<open>type\<close> & \<open>(0)\<close> \\ |
|
707 |
& \<open>|\<close> & \<^verbatim>\<open>_\<close> \<^verbatim>\<open>::\<close> \<open>type\<close> & \<open>(0)\<close> \\\\ |
|
| 28772 | 708 |
|
| 61503 | 709 |
@{syntax_def (inner) index} & = & \<^verbatim>\<open>\<^bsub>\<close> \<open>logic\<^sup>(\<^sup>0\<^sup>)\<close> \<^verbatim>\<open>\<^esub>\<close>~~\<open>| | \<index>\<close> \\\\
|
| 46287 | 710 |
|
| 61493 | 711 |
@{syntax_def (inner) idts} & = & \<open>idt | idt\<^sup>(\<^sup>1\<^sup>) idts\<close> & \<open>(0)\<close> \\\\
|
| 28772 | 712 |
|
| 61493 | 713 |
@{syntax_def (inner) pttrn} & = & \<open>idt\<close> \\\\
|
| 28772 | 714 |
|
| 61493 | 715 |
@{syntax_def (inner) pttrns} & = & \<open>pttrn | pttrn\<^sup>(\<^sup>1\<^sup>) pttrns\<close> & \<open>(0)\<close> \\\\
|
| 28774 | 716 |
|
| 61503 | 717 |
@{syntax_def (inner) type} & = & \<^verbatim>\<open>(\<close> \<open>type\<close> \<^verbatim>\<open>)\<close> \\
|
718 |
& \<open>|\<close> & \<open>tid | tvar |\<close>~~\<^verbatim>\<open>_\<close> \\ |
|
719 |
& \<open>|\<close> & \<open>tid\<close> \<^verbatim>\<open>::\<close> \<open>sort | tvar\<close>~~\<^verbatim>\<open>::\<close> \<open>sort |\<close>~~\<^verbatim>\<open>_\<close> \<^verbatim>\<open>::\<close> \<open>sort\<close> \\ |
|
| 61493 | 720 |
& \<open>|\<close> & \<open>type_name | type\<^sup>(\<^sup>1\<^sup>0\<^sup>0\<^sup>0\<^sup>) type_name\<close> \\ |
| 61503 | 721 |
& \<open>|\<close> & \<^verbatim>\<open>(\<close> \<open>type\<close> \<^verbatim>\<open>,\<close> \<open>\<dots>\<close> \<^verbatim>\<open>,\<close> \<open>type\<close> \<^verbatim>\<open>)\<close> \<open>type_name\<close> \\ |
722 |
& \<open>|\<close> & \<open>type\<^sup>(\<^sup>1\<^sup>)\<close> \<^verbatim>\<open>=>\<close> \<open>type\<close> & \<open>(0)\<close> \\ |
|
| 61493 | 723 |
& \<open>|\<close> & \<open>type\<^sup>(\<^sup>1\<^sup>)\<close> \<open>\<Rightarrow>\<close> \<open>type\<close> & \<open>(0)\<close> \\ |
| 61503 | 724 |
& \<open>|\<close> & \<^verbatim>\<open>[\<close> \<open>type\<close> \<^verbatim>\<open>,\<close> \<open>\<dots>\<close> \<^verbatim>\<open>,\<close> \<open>type\<close> \<^verbatim>\<open>]\<close> \<^verbatim>\<open>=>\<close> \<open>type\<close> & \<open>(0)\<close> \\ |
725 |
& \<open>|\<close> & \<^verbatim>\<open>[\<close> \<open>type\<close> \<^verbatim>\<open>,\<close> \<open>\<dots>\<close> \<^verbatim>\<open>,\<close> \<open>type\<close> \<^verbatim>\<open>]\<close> \<open>\<Rightarrow>\<close> \<open>type\<close> & \<open>(0)\<close> \\ |
|
| 61493 | 726 |
@{syntax_def (inner) type_name} & = & \<open>id | longid\<close> \\\\
|
| 28772 | 727 |
|
| 61503 | 728 |
@{syntax_def (inner) sort} & = & @{syntax class_name}~~\<open>|\<close>~~\<^verbatim>\<open>{}\<close> \\
|
729 |
& \<open>|\<close> & \<^verbatim>\<open>{\<close> @{syntax class_name} \<^verbatim>\<open>,\<close> \<open>\<dots>\<close> \<^verbatim>\<open>,\<close> @{syntax class_name} \<^verbatim>\<open>}\<close> \\
|
|
| 61493 | 730 |
@{syntax_def (inner) class_name} & = & \<open>id | longid\<close> \\
|
| 28773 | 731 |
\end{supertabular}
|
|
28770
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
wenzelm
parents:
28769
diff
changeset
|
732 |
\end{center}
|
|
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
wenzelm
parents:
28769
diff
changeset
|
733 |
|
| 61421 | 734 |
\<^medskip> |
| 61503 | 735 |
Here literal terminals are printed \<^verbatim>\<open>verbatim\<close>; |
| 28774 | 736 |
see also \secref{sec:inner-lex} for further token categories of the
|
737 |
inner syntax. The meaning of the nonterminals defined by the above |
|
738 |
grammar is as follows: |
|
|
28770
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
wenzelm
parents:
28769
diff
changeset
|
739 |
|
| 61439 | 740 |
\<^descr> @{syntax_ref (inner) any} denotes any term.
|
|
28770
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
wenzelm
parents:
28769
diff
changeset
|
741 |
|
| 61439 | 742 |
\<^descr> @{syntax_ref (inner) prop} denotes meta-level propositions,
|
| 28778 | 743 |
which are terms of type @{typ prop}. The syntax of such formulae of
|
744 |
the meta-logic is carefully distinguished from usual conventions for |
|
| 61493 | 745 |
object-logics. In particular, plain \<open>\<lambda>\<close>-term notation is |
| 61477 | 746 |
\<^emph>\<open>not\<close> recognized as @{syntax (inner) prop}.
|
|
28770
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
wenzelm
parents:
28769
diff
changeset
|
747 |
|
| 61439 | 748 |
\<^descr> @{syntax_ref (inner) aprop} denotes atomic propositions, which
|
| 28778 | 749 |
are embedded into regular @{syntax (inner) prop} by means of an
|
| 61503 | 750 |
explicit \<^verbatim>\<open>PROP\<close> token. |
|
28770
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
wenzelm
parents:
28769
diff
changeset
|
751 |
|
|
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
wenzelm
parents:
28769
diff
changeset
|
752 |
Terms of type @{typ prop} with non-constant head, e.g.\ a plain
|
|
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
wenzelm
parents:
28769
diff
changeset
|
753 |
variable, are printed in this form. Constants that yield type @{typ
|
|
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
wenzelm
parents:
28769
diff
changeset
|
754 |
prop} are expected to provide their own concrete syntax; otherwise |
| 28778 | 755 |
the printed version will appear like @{syntax (inner) logic} and
|
756 |
cannot be parsed again as @{syntax (inner) prop}.
|
|
|
28770
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
wenzelm
parents:
28769
diff
changeset
|
757 |
|
| 61439 | 758 |
\<^descr> @{syntax_ref (inner) logic} denotes arbitrary terms of a
|
| 28778 | 759 |
logical type, excluding type @{typ prop}. This is the main
|
| 61493 | 760 |
syntactic category of object-logic entities, covering plain \<open>\<lambda>\<close>-term notation (variables, abstraction, application), plus |
| 28778 | 761 |
anything defined by the user. |
|
28770
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
wenzelm
parents:
28769
diff
changeset
|
762 |
|
|
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
wenzelm
parents:
28769
diff
changeset
|
763 |
When specifying notation for logical entities, all logical types |
| 61477 | 764 |
(excluding @{typ prop}) are \<^emph>\<open>collapsed\<close> to this single category
|
| 28778 | 765 |
of @{syntax (inner) logic}.
|
|
28770
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
wenzelm
parents:
28769
diff
changeset
|
766 |
|
| 61439 | 767 |
\<^descr> @{syntax_ref (inner) index} denotes an optional index term for
|
|
51657
3db1bbc82d8d
more accurate documentation of "(structure)" mixfix;
wenzelm
parents:
51654
diff
changeset
|
768 |
indexed syntax. If omitted, it refers to the first @{keyword_ref
|
| 61493 | 769 |
"structure"} variable in the context. The special dummy ``\<open>\<index>\<close>'' serves as pattern variable in mixfix annotations that |
| 46287 | 770 |
introduce indexed notation. |
771 |
||
| 61439 | 772 |
\<^descr> @{syntax_ref (inner) idt} denotes identifiers, possibly
|
| 28778 | 773 |
constrained by types. |
|
28770
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
wenzelm
parents:
28769
diff
changeset
|
774 |
|
| 61439 | 775 |
\<^descr> @{syntax_ref (inner) idts} denotes a sequence of @{syntax_ref
|
| 28778 | 776 |
(inner) idt}. This is the most basic category for variables in |
| 61493 | 777 |
iterated binders, such as \<open>\<lambda>\<close> or \<open>\<And>\<close>. |
|
28770
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
wenzelm
parents:
28769
diff
changeset
|
778 |
|
| 61439 | 779 |
\<^descr> @{syntax_ref (inner) pttrn} and @{syntax_ref (inner) pttrns}
|
| 28778 | 780 |
denote patterns for abstraction, cases bindings etc. In Pure, these |
781 |
categories start as a merely copy of @{syntax (inner) idt} and
|
|
782 |
@{syntax (inner) idts}, respectively. Object-logics may add
|
|
783 |
additional productions for binding forms. |
|
|
28770
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
wenzelm
parents:
28769
diff
changeset
|
784 |
|
| 61439 | 785 |
\<^descr> @{syntax_ref (inner) type} denotes types of the meta-logic.
|
|
28770
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
wenzelm
parents:
28769
diff
changeset
|
786 |
|
| 61439 | 787 |
\<^descr> @{syntax_ref (inner) sort} denotes meta-level sorts.
|
|
28770
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
wenzelm
parents:
28769
diff
changeset
|
788 |
|
|
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
wenzelm
parents:
28769
diff
changeset
|
789 |
|
| 28774 | 790 |
Here are some further explanations of certain syntax features. |
| 28773 | 791 |
|
| 61493 | 792 |
\<^item> In @{syntax (inner) idts}, note that \<open>x :: nat y\<close> is
|
793 |
parsed as \<open>x :: (nat y)\<close>, treating \<open>y\<close> like a type |
|
794 |
constructor applied to \<open>nat\<close>. To avoid this interpretation, |
|
795 |
write \<open>(x :: nat) y\<close> with explicit parentheses. |
|
| 28773 | 796 |
|
| 61493 | 797 |
\<^item> Similarly, \<open>x :: nat y :: nat\<close> is parsed as \<open>x :: |
798 |
(nat y :: nat)\<close>. The correct form is \<open>(x :: nat) (y :: |
|
799 |
nat)\<close>, or \<open>(x :: nat) y :: nat\<close> if \<open>y\<close> is last in the |
|
|
28770
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
wenzelm
parents:
28769
diff
changeset
|
800 |
sequence of identifiers. |
| 28773 | 801 |
|
| 61421 | 802 |
\<^item> Type constraints for terms bind very weakly. For example, |
| 61493 | 803 |
\<open>x < y :: nat\<close> is normally parsed as \<open>(x < y) :: |
804 |
nat\<close>, unless \<open><\<close> has a very low priority, in which case the |
|
805 |
input is likely to be ambiguous. The correct form is \<open>x < (y |
|
806 |
:: nat)\<close>. |
|
| 28773 | 807 |
|
| 61421 | 808 |
\<^item> Dummy variables (written as underscore) may occur in different |
| 28774 | 809 |
roles. |
| 28773 | 810 |
|
| 61493 | 811 |
\<^descr> A type ``\<open>_\<close>'' or ``\<open>_ :: sort\<close>'' acts like an |
| 61458 | 812 |
anonymous inference parameter, which is filled-in according to the |
813 |
most general type produced by the type-checking phase. |
|
|
28770
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
wenzelm
parents:
28769
diff
changeset
|
814 |
|
| 61493 | 815 |
\<^descr> A bound ``\<open>_\<close>'' refers to a vacuous abstraction, where |
| 61458 | 816 |
the body does not refer to the binding introduced here. As in the |
| 61493 | 817 |
term @{term "\<lambda>x _. x"}, which is \<open>\<alpha>\<close>-equivalent to \<open>\<lambda>x y. x\<close>.
|
| 28773 | 818 |
|
| 61493 | 819 |
\<^descr> A free ``\<open>_\<close>'' refers to an implicit outer binding. |
820 |
Higher definitional packages usually allow forms like \<open>f x _ |
|
821 |
= x\<close>. |
|
| 28773 | 822 |
|
| 61493 | 823 |
\<^descr> A schematic ``\<open>_\<close>'' (within a term pattern, see |
| 61458 | 824 |
\secref{sec:term-decls}) refers to an anonymous variable that is
|
825 |
implicitly abstracted over its context of locally bound variables. |
|
| 61493 | 826 |
For example, this allows pattern matching of \<open>{x. f x = g
|
827 |
x}\<close> against \<open>{x. _ = _}\<close>, or even \<open>{_. _ = _}\<close> by
|
|
| 61458 | 828 |
using both bound and schematic dummies. |
| 28773 | 829 |
|
| 61503 | 830 |
\<^descr> The three literal dots ``\<^verbatim>\<open>...\<close>'' may be also |
831 |
written as ellipsis symbol \<^verbatim>\<open>\<dots>\<close>. In both cases this |
|
| 28774 | 832 |
refers to a special schematic variable, which is bound in the |
833 |
context. This special term abbreviation works nicely with |
|
834 |
calculational reasoning (\secref{sec:calculation}).
|
|
835 |
||
| 61503 | 836 |
\<^descr> \<^verbatim>\<open>CONST\<close> ensures that the given identifier is treated |
| 46287 | 837 |
as constant term, and passed through the parse tree in fully |
838 |
internalized form. This is particularly relevant for translation |
|
839 |
rules (\secref{sec:syn-trans}), notably on the RHS.
|
|
840 |
||
| 61503 | 841 |
\<^descr> \<^verbatim>\<open>XCONST\<close> is similar to \<^verbatim>\<open>CONST\<close>, but |
| 46287 | 842 |
retains the constant name as given. This is only relevant to |
843 |
translation rules (\secref{sec:syn-trans}), notably on the LHS.
|
|
| 58618 | 844 |
\<close> |
|
28770
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
wenzelm
parents:
28769
diff
changeset
|
845 |
|
| 28777 | 846 |
|
| 58618 | 847 |
subsection \<open>Inspecting the syntax\<close> |
| 28777 | 848 |
|
| 58618 | 849 |
text \<open> |
| 46282 | 850 |
\begin{matharray}{rcl}
|
| 61493 | 851 |
@{command_def "print_syntax"}\<open>\<^sup>*\<close> & : & \<open>context \<rightarrow>\<close> \\
|
| 46282 | 852 |
\end{matharray}
|
| 28777 | 853 |
|
| 61439 | 854 |
\<^descr> @{command "print_syntax"} prints the inner syntax of the
|
| 46282 | 855 |
current context. The output can be quite large; the most important |
856 |
sections are explained below. |
|
| 28777 | 857 |
|
| 61493 | 858 |
\<^descr> \<open>lexicon\<close> lists the delimiters of the inner token |
| 61458 | 859 |
language; see \secref{sec:inner-lex}.
|
| 28777 | 860 |
|
| 61493 | 861 |
\<^descr> \<open>prods\<close> lists the productions of the underlying |
| 61458 | 862 |
priority grammar; see \secref{sec:priority-grammar}.
|
| 28777 | 863 |
|
| 61493 | 864 |
The nonterminal \<open>A\<^sup>(\<^sup>p\<^sup>)\<close> is rendered in plain text as \<open>A[p]\<close>; delimiters are quoted. Many productions have an extra |
865 |
\<open>\<dots> => name\<close>. These names later become the heads of parse |
|
| 61458 | 866 |
trees; they also guide the pretty printer. |
| 28777 | 867 |
|
| 61477 | 868 |
Productions without such parse tree names are called \<^emph>\<open>copy |
869 |
productions\<close>. Their right-hand side must have exactly one |
|
| 61458 | 870 |
nonterminal symbol (or named token). The parser does not create a |
871 |
new parse tree node for copy productions, but simply returns the |
|
872 |
parse tree of the right-hand symbol. |
|
| 46282 | 873 |
|
| 61458 | 874 |
If the right-hand side of a copy production consists of a single |
| 61477 | 875 |
nonterminal without any delimiters, then it is called a \<^emph>\<open>chain |
876 |
production\<close>. Chain productions act as abbreviations: conceptually, |
|
| 61458 | 877 |
they are removed from the grammar by adding new productions. |
878 |
Priority information attached to chain productions is ignored; only |
|
| 61493 | 879 |
the dummy value \<open>-1\<close> is displayed. |
| 46282 | 880 |
|
| 61493 | 881 |
\<^descr> \<open>print modes\<close> lists the alternative print modes |
| 61458 | 882 |
provided by this grammar; see \secref{sec:print-modes}.
|
| 28777 | 883 |
|
| 61493 | 884 |
\<^descr> \<open>parse_rules\<close> and \<open>print_rules\<close> relate to |
| 61458 | 885 |
syntax translations (macros); see \secref{sec:syn-trans}.
|
| 46282 | 886 |
|
| 61493 | 887 |
\<^descr> \<open>parse_ast_translation\<close> and \<open>print_ast_translation\<close> list sets of constants that invoke |
| 61458 | 888 |
translation functions for abstract syntax trees, which are only |
889 |
required in very special situations; see \secref{sec:tr-funs}.
|
|
| 28777 | 890 |
|
| 61493 | 891 |
\<^descr> \<open>parse_translation\<close> and \<open>print_translation\<close> |
| 61458 | 892 |
list the sets of constants that invoke regular translation |
893 |
functions; see \secref{sec:tr-funs}.
|
|
| 58618 | 894 |
\<close> |
| 28774 | 895 |
|
|
28770
93a372e2dc7a
added section "The Pure grammar" (incomplete version, based on old ref manual);
wenzelm
parents:
28769
diff
changeset
|
896 |
|
| 58618 | 897 |
subsection \<open>Ambiguity of parsed expressions\<close> |
| 46291 | 898 |
|
| 58618 | 899 |
text \<open> |
| 46291 | 900 |
\begin{tabular}{rcll}
|
| 61493 | 901 |
@{attribute_def syntax_ambiguity_warning} & : & \<open>attribute\<close> & default \<open>true\<close> \\
|
902 |
@{attribute_def syntax_ambiguity_limit} & : & \<open>attribute\<close> & default \<open>10\<close> \\
|
|
| 46291 | 903 |
\end{tabular}
|
904 |
||
905 |
Depending on the grammar and the given input, parsing may be |
|
906 |
ambiguous. Isabelle lets the Earley parser enumerate all possible |
|
907 |
parse trees, and then tries to make the best out of the situation. |
|
908 |
Terms that cannot be type-checked are filtered out, which often |
|
909 |
leads to a unique result in the end. Unlike regular type |
|
910 |
reconstruction, which is applied to the whole collection of input |
|
911 |
terms simultaneously, the filtering stage only treats each given |
|
912 |
term in isolation. Filtering is also not attempted for individual |
|
913 |
types or raw ASTs (as required for @{command translations}).
|
|
914 |
||
915 |
Certain warning or error messages are printed, depending on the |
|
916 |
situation and the given configuration options. Parsing ultimately |
|
917 |
fails, if multiple results remain after the filtering phase. |
|
918 |
||
| 61439 | 919 |
\<^descr> @{attribute syntax_ambiguity_warning} controls output of
|
|
46512
4f9f61f9b535
simplified configuration options for syntax ambiguity;
wenzelm
parents:
46506
diff
changeset
|
920 |
explicit warning messages about syntax ambiguity. |
| 46291 | 921 |
|
| 61439 | 922 |
\<^descr> @{attribute syntax_ambiguity_limit} determines the number of
|
| 46291 | 923 |
resulting parse trees that are shown as part of the printed message |
924 |
in case of an ambiguity. |
|
| 58618 | 925 |
\<close> |
| 46291 | 926 |
|
927 |
||
| 58618 | 928 |
section \<open>Syntax transformations \label{sec:syntax-transformations}\<close>
|
| 48113 | 929 |
|
| 58618 | 930 |
text \<open>The inner syntax engine of Isabelle provides separate |
| 52413 | 931 |
mechanisms to transform parse trees either via rewrite systems on |
| 48113 | 932 |
first-order ASTs (\secref{sec:syn-trans}), or ML functions on ASTs
|
| 61493 | 933 |
or syntactic \<open>\<lambda>\<close>-terms (\secref{sec:tr-funs}). This works
|
| 48113 | 934 |
both for parsing and printing, as outlined in |
935 |
\figref{fig:parse-print}.
|
|
936 |
||
937 |
\begin{figure}[htbp]
|
|
938 |
\begin{center}
|
|
939 |
\begin{tabular}{cl}
|
|
940 |
string & \\ |
|
| 61493 | 941 |
\<open>\<down>\<close> & lexer + parser \\ |
| 48113 | 942 |
parse tree & \\ |
| 61493 | 943 |
\<open>\<down>\<close> & parse AST translation \\ |
| 48113 | 944 |
AST & \\ |
| 61493 | 945 |
\<open>\<down>\<close> & AST rewriting (macros) \\ |
| 48113 | 946 |
AST & \\ |
| 61493 | 947 |
\<open>\<down>\<close> & parse translation \\ |
| 48113 | 948 |
--- pre-term --- & \\ |
| 61493 | 949 |
\<open>\<down>\<close> & print translation \\ |
| 48113 | 950 |
AST & \\ |
| 61493 | 951 |
\<open>\<down>\<close> & AST rewriting (macros) \\ |
| 48113 | 952 |
AST & \\ |
| 61493 | 953 |
\<open>\<down>\<close> & print AST translation \\ |
| 48113 | 954 |
string & |
955 |
\end{tabular}
|
|
956 |
\end{center}
|
|
957 |
\caption{Parsing and printing with translations}\label{fig:parse-print}
|
|
958 |
\end{figure}
|
|
959 |
||
960 |
These intermediate syntax tree formats eventually lead to a pre-term |
|
961 |
with all names and binding scopes resolved, but most type |
|
962 |
information still missing. Explicit type constraints might be given by |
|
963 |
the user, or implicit position information by the system --- both |
|
| 48816 | 964 |
need to be passed-through carefully by syntax transformations. |
| 48113 | 965 |
|
| 61477 | 966 |
Pre-terms are further processed by the so-called \<^emph>\<open>check\<close> and |
967 |
\<^emph>\<open>uncheck\<close> phases that are intertwined with type-inference (see |
|
| 58552 | 968 |
also @{cite "isabelle-implementation"}). The latter allows to operate
|
| 48113 | 969 |
on higher-order abstract syntax with proper binding and type |
970 |
information already available. |
|
971 |
||
972 |
As a rule of thumb, anything that manipulates bindings of variables |
|
973 |
or constants needs to be implemented as syntax transformation (see |
|
974 |
below). Anything else is better done via check/uncheck: a prominent |
|
975 |
example application is the @{command abbreviation} concept of
|
|
| 58618 | 976 |
Isabelle/Pure.\<close> |
| 48113 | 977 |
|
978 |
||
| 58618 | 979 |
subsection \<open>Abstract syntax trees \label{sec:ast}\<close>
|
| 48113 | 980 |
|
| 58618 | 981 |
text \<open>The ML datatype @{ML_type Ast.ast} explicitly represents the
|
| 48114 | 982 |
intermediate AST format that is used for syntax rewriting |
983 |
(\secref{sec:syn-trans}). It is defined in ML as follows:
|
|
|
61408
9020a3ba6c9a
@{verbatim [display]} supersedes old alltt/ttbox;
wenzelm
parents:
61143
diff
changeset
|
984 |
@{verbatim [display]
|
|
9020a3ba6c9a
@{verbatim [display]} supersedes old alltt/ttbox;
wenzelm
parents:
61143
diff
changeset
|
985 |
\<open>datatype ast = |
|
9020a3ba6c9a
@{verbatim [display]} supersedes old alltt/ttbox;
wenzelm
parents:
61143
diff
changeset
|
986 |
Constant of string | |
|
9020a3ba6c9a
@{verbatim [display]} supersedes old alltt/ttbox;
wenzelm
parents:
61143
diff
changeset
|
987 |
Variable of string | |
|
9020a3ba6c9a
@{verbatim [display]} supersedes old alltt/ttbox;
wenzelm
parents:
61143
diff
changeset
|
988 |
Appl of ast list\<close>} |
| 48114 | 989 |
|
990 |
An AST is either an atom (constant or variable) or a list of (at |
|
991 |
least two) subtrees. Occasional diagnostic output of ASTs uses |
|
992 |
notation that resembles S-expression of LISP. Constant atoms are |
|
993 |
shown as quoted strings, variable atoms as non-quoted strings and |
|
994 |
applications as a parenthesized list of subtrees. For example, the |
|
995 |
AST |
|
| 58724 | 996 |
@{ML [display] \<open>Ast.Appl [Ast.Constant "_abs", Ast.Variable "x", Ast.Variable "t"]\<close>}
|
| 61503 | 997 |
is pretty-printed as \<^verbatim>\<open>("_abs" x t)\<close>. Note that
|
998 |
\<^verbatim>\<open>()\<close> and \<^verbatim>\<open>(x)\<close> are excluded as ASTs, because |
|
| 48114 | 999 |
they have too few subtrees. |
1000 |
||
| 61421 | 1001 |
\<^medskip> |
1002 |
AST application is merely a pro-forma mechanism to indicate |
|
| 61503 | 1003 |
certain syntactic structures. Thus \<^verbatim>\<open>(c a b)\<close> could mean |
| 48114 | 1004 |
either term application or type application, depending on the |
1005 |
syntactic context. |
|
1006 |
||
| 61503 | 1007 |
Nested application like \<^verbatim>\<open>(("_abs" x t) u)\<close> is also
|
| 48114 | 1008 |
possible, but ASTs are definitely first-order: the syntax constant |
| 61503 | 1009 |
\<^verbatim>\<open>"_abs"\<close> does not bind the \<^verbatim>\<open>x\<close> in any way. |
| 48114 | 1010 |
Proper bindings are introduced in later stages of the term syntax, |
| 61503 | 1011 |
where \<^verbatim>\<open>("_abs" x t)\<close> becomes an @{ML Abs} node and
|
1012 |
occurrences of \<^verbatim>\<open>x\<close> in \<^verbatim>\<open>t\<close> are replaced by bound |
|
| 48114 | 1013 |
variables (represented as de-Bruijn indices). |
| 58618 | 1014 |
\<close> |
| 48113 | 1015 |
|
1016 |
||
| 58618 | 1017 |
subsubsection \<open>AST constants versus variables\<close> |
| 48114 | 1018 |
|
| 58618 | 1019 |
text \<open>Depending on the situation --- input syntax, output syntax, |
| 56582 | 1020 |
translation patterns --- the distinction of atomic ASTs as @{ML
|
| 48114 | 1021 |
Ast.Constant} versus @{ML Ast.Variable} serves slightly different
|
1022 |
purposes. |
|
1023 |
||
| 61493 | 1024 |
Input syntax of a term such as \<open>f a b = c\<close> does not yet |
1025 |
indicate the scopes of atomic entities \<open>f, a, b, c\<close>: they |
|
| 48114 | 1026 |
could be global constants or local variables, even bound ones |
1027 |
depending on the context of the term. @{ML Ast.Variable} leaves
|
|
1028 |
this choice still open: later syntax layers (or translation |
|
1029 |
functions) may capture such a variable to determine its role |
|
1030 |
specifically, to make it a constant, bound variable, free variable |
|
1031 |
etc. In contrast, syntax translations that introduce already known |
|
1032 |
constants would rather do it via @{ML Ast.Constant} to prevent
|
|
1033 |
accidental re-interpretation later on. |
|
1034 |
||
1035 |
Output syntax turns term constants into @{ML Ast.Constant} and
|
|
1036 |
variables (free or schematic) into @{ML Ast.Variable}. This
|
|
| 61493 | 1037 |
information is precise when printing fully formal \<open>\<lambda>\<close>-terms. |
| 48114 | 1038 |
|
| 61421 | 1039 |
\<^medskip> |
1040 |
AST translation patterns (\secref{sec:syn-trans}) that
|
|
| 52413 | 1041 |
represent terms cannot distinguish constants and variables |
| 61493 | 1042 |
syntactically. Explicit indication of \<open>CONST c\<close> inside the |
1043 |
term language is required, unless \<open>c\<close> is known as special |
|
| 61477 | 1044 |
\<^emph>\<open>syntax constant\<close> (see also @{command syntax}). It is also
|
| 52413 | 1045 |
possible to use @{command syntax} declarations (without mixfix
|
1046 |
annotation) to enforce that certain unqualified names are always |
|
1047 |
treated as constant within the syntax machinery. |
|
| 48114 | 1048 |
|
| 52413 | 1049 |
The situation is simpler for ASTs that represent types or sorts, |
1050 |
since the concrete syntax already distinguishes type variables from |
|
| 61493 | 1051 |
type constants (constructors). So \<open>('a, 'b) foo\<close>
|
1052 |
corresponds to an AST application of some constant for \<open>foo\<close> |
|
1053 |
and variable arguments for \<open>'a\<close> and \<open>'b\<close>. Note that |
|
| 52413 | 1054 |
the postfix application is merely a feature of the concrete syntax, |
| 58618 | 1055 |
while in the AST the constructor occurs in head position.\<close> |
| 48114 | 1056 |
|
1057 |
||
| 58618 | 1058 |
subsubsection \<open>Authentic syntax names\<close> |
| 48114 | 1059 |
|
| 58618 | 1060 |
text \<open>Naming constant entities within ASTs is another delicate |
| 52413 | 1061 |
issue. Unqualified names are resolved in the name space tables in |
| 48114 | 1062 |
the last stage of parsing, after all translations have been applied. |
1063 |
Since syntax transformations do not know about this later name |
|
| 52413 | 1064 |
resolution, there can be surprises in boundary cases. |
| 48114 | 1065 |
|
| 61477 | 1066 |
\<^emph>\<open>Authentic syntax names\<close> for @{ML Ast.Constant} avoid this
|
| 48114 | 1067 |
problem: the fully-qualified constant name with a special prefix for |
| 61493 | 1068 |
its formal category (\<open>class\<close>, \<open>type\<close>, \<open>const\<close>, \<open>fixed\<close>) represents the information faithfully |
| 48114 | 1069 |
within the untyped AST format. Accidental overlap with free or |
1070 |
bound variables is excluded as well. Authentic syntax names work |
|
1071 |
implicitly in the following situations: |
|
1072 |
||
| 61421 | 1073 |
\<^item> Input of term constants (or fixed variables) that are |
| 48114 | 1074 |
introduced by concrete syntax via @{command notation}: the
|
1075 |
correspondence of a particular grammar production to some known term |
|
1076 |
entity is preserved. |
|
1077 |
||
| 61421 | 1078 |
\<^item> Input of type constants (constructors) and type classes --- |
| 48114 | 1079 |
thanks to explicit syntactic distinction independently on the |
1080 |
context. |
|
1081 |
||
| 61421 | 1082 |
\<^item> Output of term constants, type constants, type classes --- |
| 48114 | 1083 |
this information is already available from the internal term to be |
1084 |
printed. |
|
1085 |
||
1086 |
||
1087 |
In other words, syntax transformations that operate on input terms |
|
| 48816 | 1088 |
written as prefix applications are difficult to make robust. |
1089 |
Luckily, this case rarely occurs in practice, because syntax forms |
|
| 58618 | 1090 |
to be translated usually correspond to some concrete notation.\<close> |
| 48114 | 1091 |
|
1092 |
||
| 58618 | 1093 |
subsection \<open>Raw syntax and translations \label{sec:syn-trans}\<close>
|
| 28762 | 1094 |
|
| 58618 | 1095 |
text \<open> |
| 48117 | 1096 |
\begin{tabular}{rcll}
|
| 61493 | 1097 |
@{command_def "nonterminal"} & : & \<open>theory \<rightarrow> theory\<close> \\
|
1098 |
@{command_def "syntax"} & : & \<open>theory \<rightarrow> theory\<close> \\
|
|
1099 |
@{command_def "no_syntax"} & : & \<open>theory \<rightarrow> theory\<close> \\
|
|
1100 |
@{command_def "translations"} & : & \<open>theory \<rightarrow> theory\<close> \\
|
|
1101 |
@{command_def "no_translations"} & : & \<open>theory \<rightarrow> theory\<close> \\
|
|
1102 |
@{attribute_def syntax_ast_trace} & : & \<open>attribute\<close> & default \<open>false\<close> \\
|
|
1103 |
@{attribute_def syntax_ast_stats} & : & \<open>attribute\<close> & default \<open>false\<close> \\
|
|
| 48117 | 1104 |
\end{tabular}
|
| 61421 | 1105 |
\<^medskip> |
|
59783
00b62aa9f430
tuned syntax diagrams -- no duplication of "target";
wenzelm
parents:
58842
diff
changeset
|
1106 |
|
| 46292 | 1107 |
Unlike mixfix notation for existing formal entities |
1108 |
(\secref{sec:notation}), raw syntax declarations provide full access
|
|
| 48115 | 1109 |
to the priority grammar of the inner syntax, without any sanity |
1110 |
checks. This includes additional syntactic categories (via |
|
1111 |
@{command nonterminal}) and free-form grammar productions (via
|
|
1112 |
@{command syntax}). Additional syntax translations (or macros, via
|
|
1113 |
@{command translations}) are required to turn resulting parse trees
|
|
1114 |
into proper representations of formal entities again. |
|
| 46292 | 1115 |
|
|
55112
b1a5d603fd12
prefer rail cartouche -- avoid back-slashed quotes;
wenzelm
parents:
55108
diff
changeset
|
1116 |
@{rail \<open>
|
|
42596
6c621a9d612a
modernized rail diagrams using @{rail} antiquotation;
wenzelm
parents:
42358
diff
changeset
|
1117 |
@@{command nonterminal} (@{syntax name} + @'and')
|
| 28762 | 1118 |
; |
|
46494
ea2ae63336f3
clarified outer syntax "constdecl", which is only local to some rail diagrams;
wenzelm
parents:
46483
diff
changeset
|
1119 |
(@@{command syntax} | @@{command no_syntax}) @{syntax mode}? (constdecl +)
|
| 28762 | 1120 |
; |
|
42596
6c621a9d612a
modernized rail diagrams using @{rail} antiquotation;
wenzelm
parents:
42358
diff
changeset
|
1121 |
(@@{command translations} | @@{command no_translations})
|
|
6c621a9d612a
modernized rail diagrams using @{rail} antiquotation;
wenzelm
parents:
42358
diff
changeset
|
1122 |
(transpat ('==' | '=>' | '<=' | '\<rightleftharpoons>' | '\<rightharpoonup>' | '\<leftharpoondown>') transpat +)
|
| 28762 | 1123 |
; |
1124 |
||
|
46494
ea2ae63336f3
clarified outer syntax "constdecl", which is only local to some rail diagrams;
wenzelm
parents:
46483
diff
changeset
|
1125 |
constdecl: @{syntax name} '::' @{syntax type} @{syntax mixfix}?
|
|
ea2ae63336f3
clarified outer syntax "constdecl", which is only local to some rail diagrams;
wenzelm
parents:
46483
diff
changeset
|
1126 |
; |
|
42596
6c621a9d612a
modernized rail diagrams using @{rail} antiquotation;
wenzelm
parents:
42358
diff
changeset
|
1127 |
mode: ('(' ( @{syntax name} | @'output' | @{syntax name} @'output' ) ')')
|
| 28762 | 1128 |
; |
|
42596
6c621a9d612a
modernized rail diagrams using @{rail} antiquotation;
wenzelm
parents:
42358
diff
changeset
|
1129 |
transpat: ('(' @{syntax nameref} ')')? @{syntax string}
|
|
55112
b1a5d603fd12
prefer rail cartouche -- avoid back-slashed quotes;
wenzelm
parents:
55108
diff
changeset
|
1130 |
\<close>} |
| 28762 | 1131 |
|
| 61493 | 1132 |
\<^descr> @{command "nonterminal"}~\<open>c\<close> declares a type
|
1133 |
constructor \<open>c\<close> (without arguments) to act as purely syntactic |
|
| 28762 | 1134 |
type: a nonterminal symbol of the inner syntax. |
1135 |
||
| 61493 | 1136 |
\<^descr> @{command "syntax"}~\<open>(mode) c :: \<sigma> (mx)\<close> augments the
|
| 46292 | 1137 |
priority grammar and the pretty printer table for the given print |
| 61503 | 1138 |
mode (default \<^verbatim>\<open>""\<close>). An optional keyword @{keyword_ref
|
| 46292 | 1139 |
"output"} means that only the pretty printer table is affected. |
1140 |
||
| 61493 | 1141 |
Following \secref{sec:mixfix}, the mixfix annotation \<open>mx =
|
1142 |
template ps q\<close> together with type \<open>\<sigma> = \<tau>\<^sub>1 \<Rightarrow> \<dots> \<tau>\<^sub>n \<Rightarrow> \<tau>\<close> and |
|
1143 |
specify a grammar production. The \<open>template\<close> contains |
|
1144 |
delimiter tokens that surround \<open>n\<close> argument positions |
|
| 61503 | 1145 |
(\<^verbatim>\<open>_\<close>). The latter correspond to nonterminal symbols |
| 61493 | 1146 |
\<open>A\<^sub>i\<close> derived from the argument types \<open>\<tau>\<^sub>i\<close> as |
| 46292 | 1147 |
follows: |
1148 |
||
| 61493 | 1149 |
\<^item> \<open>prop\<close> if \<open>\<tau>\<^sub>i = prop\<close> |
| 46292 | 1150 |
|
| 61493 | 1151 |
\<^item> \<open>logic\<close> if \<open>\<tau>\<^sub>i = (\<dots>)\<kappa>\<close> for logical type |
1152 |
constructor \<open>\<kappa> \<noteq> prop\<close> |
|
| 46292 | 1153 |
|
| 61493 | 1154 |
\<^item> \<open>any\<close> if \<open>\<tau>\<^sub>i = \<alpha>\<close> for type variables |
| 46292 | 1155 |
|
| 61493 | 1156 |
\<^item> \<open>\<kappa>\<close> if \<open>\<tau>\<^sub>i = \<kappa>\<close> for nonterminal \<open>\<kappa>\<close> |
| 61458 | 1157 |
(syntactic type constructor) |
| 46292 | 1158 |
|
| 61493 | 1159 |
Each \<open>A\<^sub>i\<close> is decorated by priority \<open>p\<^sub>i\<close> from the |
1160 |
given list \<open>ps\<close>; missing priorities default to 0. |
|
| 46292 | 1161 |
|
1162 |
The resulting nonterminal of the production is determined similarly |
|
| 61493 | 1163 |
from type \<open>\<tau>\<close>, with priority \<open>q\<close> and default 1000. |
| 46292 | 1164 |
|
| 61421 | 1165 |
\<^medskip> |
| 61493 | 1166 |
Parsing via this production produces parse trees \<open>t\<^sub>1, \<dots>, t\<^sub>n\<close> for the argument slots. The resulting parse tree is |
1167 |
composed as \<open>c t\<^sub>1 \<dots> t\<^sub>n\<close>, by using the syntax constant \<open>c\<close> of the syntax declaration. |
|
| 46292 | 1168 |
|
1169 |
Such syntactic constants are invented on the spot, without formal |
|
1170 |
check wrt.\ existing declarations. It is conventional to use plain |
|
| 61493 | 1171 |
identifiers prefixed by a single underscore (e.g.\ \<open>_foobar\<close>). Names should be chosen with care, to avoid clashes |
| 48816 | 1172 |
with other syntax declarations. |
| 46292 | 1173 |
|
| 61421 | 1174 |
\<^medskip> |
| 61503 | 1175 |
The special case of copy production is specified by \<open>c =\<close>~\<^verbatim>\<open>""\<close> (empty string). |
1176 |
It means that the resulting parse tree \<open>t\<close> is copied directly, without any |
|
| 46292 | 1177 |
further decoration. |
| 46282 | 1178 |
|
| 61493 | 1179 |
\<^descr> @{command "no_syntax"}~\<open>(mode) decls\<close> removes grammar
|
1180 |
declarations (and translations) resulting from \<open>decls\<close>, which |
|
| 28762 | 1181 |
are interpreted in the same manner as for @{command "syntax"} above.
|
| 46282 | 1182 |
|
| 61493 | 1183 |
\<^descr> @{command "translations"}~\<open>rules\<close> specifies syntactic
|
| 48115 | 1184 |
translation rules (i.e.\ macros) as first-order rewrite rules on |
| 48816 | 1185 |
ASTs (\secref{sec:ast}). The theory context maintains two
|
| 61503 | 1186 |
independent lists translation rules: parse rules (\<^verbatim>\<open>=>\<close> |
1187 |
or \<open>\<rightharpoonup>\<close>) and print rules (\<^verbatim>\<open><=\<close> or \<open>\<leftharpoondown>\<close>). |
|
| 48115 | 1188 |
For convenience, both can be specified simultaneously as parse~/ |
| 61503 | 1189 |
print rules (\<^verbatim>\<open>==\<close> or \<open>\<rightleftharpoons>\<close>). |
| 48115 | 1190 |
|
| 28762 | 1191 |
Translation patterns may be prefixed by the syntactic category to be |
| 61493 | 1192 |
used for parsing; the default is \<open>logic\<close> which means that |
| 48115 | 1193 |
regular term syntax is used. Both sides of the syntax translation |
1194 |
rule undergo parsing and parse AST translations |
|
1195 |
\secref{sec:tr-funs}, in order to perform some fundamental
|
|
| 61493 | 1196 |
normalization like \<open>\<lambda>x y. b \<leadsto> \<lambda>x. \<lambda>y. b\<close>, but other AST |
| 61477 | 1197 |
translation rules are \<^emph>\<open>not\<close> applied recursively here. |
| 48115 | 1198 |
|
1199 |
When processing AST patterns, the inner syntax lexer runs in a |
|
1200 |
different mode that allows identifiers to start with underscore. |
|
1201 |
This accommodates the usual naming convention for auxiliary syntax |
|
1202 |
constants --- those that do not have a logical counter part --- by |
|
1203 |
allowing to specify arbitrary AST applications within the term |
|
1204 |
syntax, independently of the corresponding concrete syntax. |
|
1205 |
||
1206 |
Atomic ASTs are distinguished as @{ML Ast.Constant} versus @{ML
|
|
1207 |
Ast.Variable} as follows: a qualified name or syntax constant |
|
1208 |
declared via @{command syntax}, or parse tree head of concrete
|
|
1209 |
notation becomes @{ML Ast.Constant}, anything else @{ML
|
|
| 61493 | 1210 |
Ast.Variable}. Note that \<open>CONST\<close> and \<open>XCONST\<close> within |
| 48115 | 1211 |
the term language (\secref{sec:pure-grammar}) allow to enforce
|
1212 |
treatment as constants. |
|
1213 |
||
| 61493 | 1214 |
AST rewrite rules \<open>(lhs, rhs)\<close> need to obey the following |
| 48115 | 1215 |
side-conditions: |
1216 |
||
| 61493 | 1217 |
\<^item> Rules must be left linear: \<open>lhs\<close> must not contain |
| 61572 | 1218 |
repeated variables.\<^footnote>\<open>The deeper reason for this is that AST |
| 61458 | 1219 |
equality is not well-defined: different occurrences of the ``same'' |
1220 |
AST could be decorated differently by accidental type-constraints or |
|
| 61572 | 1221 |
source position information, for example.\<close> |
| 48115 | 1222 |
|
| 61493 | 1223 |
\<^item> Every variable in \<open>rhs\<close> must also occur in \<open>lhs\<close>. |
| 48115 | 1224 |
|
| 61493 | 1225 |
\<^descr> @{command "no_translations"}~\<open>rules\<close> removes syntactic
|
| 28762 | 1226 |
translation rules, which are interpreted in the same manner as for |
1227 |
@{command "translations"} above.
|
|
1228 |
||
| 61439 | 1229 |
\<^descr> @{attribute syntax_ast_trace} and @{attribute
|
| 48117 | 1230 |
syntax_ast_stats} control diagnostic output in the AST normalization |
1231 |
process, when translation rules are applied to concrete input or |
|
1232 |
output. |
|
1233 |
||
| 46293 | 1234 |
|
1235 |
Raw syntax and translations provides a slightly more low-level |
|
1236 |
access to the grammar and the form of resulting parse trees. It is |
|
1237 |
often possible to avoid this untyped macro mechanism, and use |
|
1238 |
type-safe @{command abbreviation} or @{command notation} instead.
|
|
1239 |
Some important situations where @{command syntax} and @{command
|
|
1240 |
translations} are really need are as follows: |
|
1241 |
||
| 61421 | 1242 |
\<^item> Iterated replacement via recursive @{command translations}.
|
| 46293 | 1243 |
For example, consider list enumeration @{term "[a, b, c, d]"} as
|
1244 |
defined in theory @{theory List} in Isabelle/HOL.
|
|
1245 |
||
| 61421 | 1246 |
\<^item> Change of binding status of variables: anything beyond the |
| 46293 | 1247 |
built-in @{keyword "binder"} mixfix annotation requires explicit
|
1248 |
syntax translations. For example, consider list filter |
|
1249 |
comprehension @{term "[x \<leftarrow> xs . P]"} as defined in theory @{theory
|
|
1250 |
List} in Isabelle/HOL. |
|
| 61458 | 1251 |
\<close> |
| 46293 | 1252 |
|
| 28762 | 1253 |
|
| 58618 | 1254 |
subsubsection \<open>Applying translation rules\<close> |
| 48117 | 1255 |
|
| 58618 | 1256 |
text \<open>As a term is being parsed or printed, an AST is generated as |
| 48117 | 1257 |
an intermediate form according to \figref{fig:parse-print}. The AST
|
1258 |
is normalized by applying translation rules in the manner of a |
|
1259 |
first-order term rewriting system. We first examine how a single |
|
1260 |
rule is applied. |
|
1261 |
||
| 61493 | 1262 |
Let \<open>t\<close> be the abstract syntax tree to be normalized and |
1263 |
\<open>(lhs, rhs)\<close> some translation rule. A subtree \<open>u\<close> |
|
1264 |
of \<open>t\<close> is called \<^emph>\<open>redex\<close> if it is an instance of \<open>lhs\<close>; in this case the pattern \<open>lhs\<close> is said to match the |
|
1265 |
object \<open>u\<close>. A redex matched by \<open>lhs\<close> may be |
|
1266 |
replaced by the corresponding instance of \<open>rhs\<close>, thus |
|
1267 |
\<^emph>\<open>rewriting\<close> the AST \<open>t\<close>. Matching requires some notion |
|
| 61477 | 1268 |
of \<^emph>\<open>place-holders\<close> in rule patterns: @{ML Ast.Variable} serves
|
| 48117 | 1269 |
this purpose. |
1270 |
||
| 61493 | 1271 |
More precisely, the matching of the object \<open>u\<close> against the |
1272 |
pattern \<open>lhs\<close> is performed as follows: |
|
| 48117 | 1273 |
|
| 61493 | 1274 |
\<^item> Objects of the form @{ML Ast.Variable}~\<open>x\<close> or @{ML
|
1275 |
Ast.Constant}~\<open>x\<close> are matched by pattern @{ML
|
|
1276 |
Ast.Constant}~\<open>x\<close>. Thus all atomic ASTs in the object are |
|
| 48117 | 1277 |
treated as (potential) constants, and a successful match makes them |
1278 |
actual constants even before name space resolution (see also |
|
1279 |
\secref{sec:ast}).
|
|
1280 |
||
| 61493 | 1281 |
\<^item> Object \<open>u\<close> is matched by pattern @{ML
|
1282 |
Ast.Variable}~\<open>x\<close>, binding \<open>x\<close> to \<open>u\<close>. |
|
| 48117 | 1283 |
|
| 61493 | 1284 |
\<^item> Object @{ML Ast.Appl}~\<open>us\<close> is matched by @{ML
|
1285 |
Ast.Appl}~\<open>ts\<close> if \<open>us\<close> and \<open>ts\<close> have the |
|
| 48117 | 1286 |
same length and each corresponding subtree matches. |
1287 |
||
| 61421 | 1288 |
\<^item> In every other case, matching fails. |
| 48117 | 1289 |
|
1290 |
||
| 61493 | 1291 |
A successful match yields a substitution that is applied to \<open>rhs\<close>, generating the instance that replaces \<open>u\<close>. |
| 48117 | 1292 |
|
1293 |
Normalizing an AST involves repeatedly applying translation rules |
|
1294 |
until none are applicable. This works yoyo-like: top-down, |
|
1295 |
bottom-up, top-down, etc. At each subtree position, rules are |
|
1296 |
chosen in order of appearance in the theory definitions. |
|
1297 |
||
1298 |
The configuration options @{attribute syntax_ast_trace} and
|
|
| 48816 | 1299 |
@{attribute syntax_ast_stats} might help to understand this process
|
| 48117 | 1300 |
and diagnose problems. |
1301 |
||
1302 |
\begin{warn}
|
|
1303 |
If syntax translation rules work incorrectly, the output of |
|
| 61477 | 1304 |
@{command_ref print_syntax} with its \<^emph>\<open>rules\<close> sections reveals the
|
| 48117 | 1305 |
actual internal forms of AST pattern, without potentially confusing |
1306 |
concrete syntax. Recall that AST constants appear as quoted strings |
|
1307 |
and variables without quotes. |
|
1308 |
\end{warn}
|
|
1309 |
||
1310 |
\begin{warn}
|
|
| 61493 | 1311 |
If @{attribute_ref eta_contract} is set to \<open>true\<close>, terms
|
1312 |
will be \<open>\<eta>\<close>-contracted \<^emph>\<open>before\<close> the AST rewriter sees |
|
| 48117 | 1313 |
them. Thus some abstraction nodes needed for print rules to match |
| 61493 | 1314 |
may vanish. For example, \<open>Ball A (\<lambda>x. P x)\<close> would contract |
1315 |
to \<open>Ball A P\<close> and the standard print rule would fail to |
|
| 48117 | 1316 |
apply. This problem can be avoided by hand-written ML translation |
1317 |
functions (see also \secref{sec:tr-funs}), which is in fact the same
|
|
1318 |
mechanism used in built-in @{keyword "binder"} declarations.
|
|
1319 |
\end{warn}
|
|
| 58618 | 1320 |
\<close> |
| 48117 | 1321 |
|
| 28762 | 1322 |
|
| 58618 | 1323 |
subsection \<open>Syntax translation functions \label{sec:tr-funs}\<close>
|
| 28762 | 1324 |
|
| 58618 | 1325 |
text \<open> |
| 28762 | 1326 |
\begin{matharray}{rcl}
|
| 61493 | 1327 |
@{command_def "parse_ast_translation"} & : & \<open>theory \<rightarrow> theory\<close> \\
|
1328 |
@{command_def "parse_translation"} & : & \<open>theory \<rightarrow> theory\<close> \\
|
|
1329 |
@{command_def "print_translation"} & : & \<open>theory \<rightarrow> theory\<close> \\
|
|
1330 |
@{command_def "typed_print_translation"} & : & \<open>theory \<rightarrow> theory\<close> \\
|
|
1331 |
@{command_def "print_ast_translation"} & : & \<open>theory \<rightarrow> theory\<close> \\
|
|
1332 |
@{ML_antiquotation_def "class_syntax"} & : & \<open>ML antiquotation\<close> \\
|
|
1333 |
@{ML_antiquotation_def "type_syntax"} & : & \<open>ML antiquotation\<close> \\
|
|
1334 |
@{ML_antiquotation_def "const_syntax"} & : & \<open>ML antiquotation\<close> \\
|
|
1335 |
@{ML_antiquotation_def "syntax_const"} & : & \<open>ML antiquotation\<close> \\
|
|
| 28762 | 1336 |
\end{matharray}
|
1337 |
||
| 48118 | 1338 |
Syntax translation functions written in ML admit almost arbitrary |
1339 |
manipulations of inner syntax, at the expense of some complexity and |
|
1340 |
obscurity in the implementation. |
|
1341 |
||
|
55112
b1a5d603fd12
prefer rail cartouche -- avoid back-slashed quotes;
wenzelm
parents:
55108
diff
changeset
|
1342 |
@{rail \<open>
|
|
42596
6c621a9d612a
modernized rail diagrams using @{rail} antiquotation;
wenzelm
parents:
42358
diff
changeset
|
1343 |
( @@{command parse_ast_translation} | @@{command parse_translation} |
|
|
6c621a9d612a
modernized rail diagrams using @{rail} antiquotation;
wenzelm
parents:
42358
diff
changeset
|
1344 |
@@{command print_translation} | @@{command typed_print_translation} |
|
| 52143 | 1345 |
@@{command print_ast_translation}) @{syntax text}
|
|
48119
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
wenzelm
parents:
48118
diff
changeset
|
1346 |
; |
|
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
wenzelm
parents:
48118
diff
changeset
|
1347 |
(@@{ML_antiquotation class_syntax} |
|
|
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
wenzelm
parents:
48118
diff
changeset
|
1348 |
@@{ML_antiquotation type_syntax} |
|
|
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
wenzelm
parents:
48118
diff
changeset
|
1349 |
@@{ML_antiquotation const_syntax} |
|
|
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
wenzelm
parents:
48118
diff
changeset
|
1350 |
@@{ML_antiquotation syntax_const}) name
|
|
55112
b1a5d603fd12
prefer rail cartouche -- avoid back-slashed quotes;
wenzelm
parents:
55108
diff
changeset
|
1351 |
\<close>} |
| 28762 | 1352 |
|
| 61439 | 1353 |
\<^descr> @{command parse_translation} etc. declare syntax translation
|
|
48119
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
wenzelm
parents:
48118
diff
changeset
|
1354 |
functions to the theory. Any of these commands have a single |
|
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
wenzelm
parents:
48118
diff
changeset
|
1355 |
@{syntax text} argument that refers to an ML expression of
|
| 52413 | 1356 |
appropriate type as follows: |
| 48118 | 1357 |
|
| 61421 | 1358 |
\<^medskip> |
|
48119
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
wenzelm
parents:
48118
diff
changeset
|
1359 |
{\footnotesize
|
| 52143 | 1360 |
\begin{tabular}{l}
|
1361 |
@{command parse_ast_translation} : \\
|
|
1362 |
\quad @{ML_type "(string * (Proof.context -> Ast.ast list -> Ast.ast)) list"} \\
|
|
1363 |
@{command parse_translation} : \\
|
|
1364 |
\quad @{ML_type "(string * (Proof.context -> term list -> term)) list"} \\
|
|
1365 |
@{command print_translation} : \\
|
|
1366 |
\quad @{ML_type "(string * (Proof.context -> term list -> term)) list"} \\
|
|
1367 |
@{command typed_print_translation} : \\
|
|
1368 |
\quad @{ML_type "(string * (Proof.context -> typ -> term list -> term)) list"} \\
|
|
1369 |
@{command print_ast_translation} : \\
|
|
1370 |
\quad @{ML_type "(string * (Proof.context -> Ast.ast list -> Ast.ast)) list"} \\
|
|
| 48118 | 1371 |
\end{tabular}}
|
| 61421 | 1372 |
\<^medskip> |
| 28762 | 1373 |
|
| 61493 | 1374 |
The argument list consists of \<open>(c, tr)\<close> pairs, where \<open>c\<close> is the syntax name of the formal entity involved, and \<open>tr\<close> a function that translates a syntax form \<open>c args\<close> into |
1375 |
\<open>tr ctxt args\<close> (depending on the context). The Isabelle/ML |
|
1376 |
naming convention for parse translations is \<open>c_tr\<close> and for |
|
1377 |
print translations \<open>c_tr'\<close>. |
|
| 48118 | 1378 |
|
1379 |
The @{command_ref print_syntax} command displays the sets of names
|
|
| 61493 | 1380 |
associated with the translation functions of a theory under \<open>parse_ast_translation\<close> etc. |
| 48118 | 1381 |
|
| 61493 | 1382 |
\<^descr> \<open>@{class_syntax c}\<close>, \<open>@{type_syntax c}\<close>,
|
1383 |
\<open>@{const_syntax c}\<close> inline the authentic syntax name of the
|
|
|
48119
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
wenzelm
parents:
48118
diff
changeset
|
1384 |
given formal entities into the ML source. This is the |
|
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
wenzelm
parents:
48118
diff
changeset
|
1385 |
fully-qualified logical name prefixed by a special marker to |
|
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
wenzelm
parents:
48118
diff
changeset
|
1386 |
indicate its kind: thus different logical name spaces are properly |
|
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
wenzelm
parents:
48118
diff
changeset
|
1387 |
distinguished within parse trees. |
|
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
wenzelm
parents:
48118
diff
changeset
|
1388 |
|
| 61493 | 1389 |
\<^descr> \<open>@{const_syntax c}\<close> inlines the name \<open>c\<close> of
|
|
48119
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
wenzelm
parents:
48118
diff
changeset
|
1390 |
the given syntax constant, having checked that it has been declared |
|
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
wenzelm
parents:
48118
diff
changeset
|
1391 |
via some @{command syntax} commands within the theory context. Note
|
|
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
wenzelm
parents:
48118
diff
changeset
|
1392 |
that the usual naming convention makes syntax constants start with |
|
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
wenzelm
parents:
48118
diff
changeset
|
1393 |
underscore, to reduce the chance of accidental clashes with other |
|
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
wenzelm
parents:
48118
diff
changeset
|
1394 |
names occurring in parse trees (unqualified constants etc.). |
| 58618 | 1395 |
\<close> |
| 48118 | 1396 |
|
|
48119
55c305e29f4b
cover @{class_syntax}, @{type_syntax}, @{const_syntax}, @{syntax_const} in isar-ref, in contrast to other ML antiquotations in implementation manual;
wenzelm
parents:
48118
diff
changeset
|
1397 |
|
| 58618 | 1398 |
subsubsection \<open>The translation strategy\<close> |
| 28762 | 1399 |
|
| 58618 | 1400 |
text \<open>The different kinds of translation functions are invoked during |
| 48118 | 1401 |
the transformations between parse trees, ASTs and syntactic terms |
1402 |
(cf.\ \figref{fig:parse-print}). Whenever a combination of the form
|
|
| 61493 | 1403 |
\<open>c x\<^sub>1 \<dots> x\<^sub>n\<close> is encountered, and a translation function |
1404 |
\<open>f\<close> of appropriate kind is declared for \<open>c\<close>, the |
|
1405 |
result is produced by evaluation of \<open>f [x\<^sub>1, \<dots>, x\<^sub>n]\<close> in ML. |
|
| 48118 | 1406 |
|
| 61493 | 1407 |
For AST translations, the arguments \<open>x\<^sub>1, \<dots>, x\<^sub>n\<close> are ASTs. A |
1408 |
combination has the form @{ML "Ast.Constant"}~\<open>c\<close> or @{ML
|
|
1409 |
"Ast.Appl"}~\<open>[\<close>@{ML Ast.Constant}~\<open>c, x\<^sub>1, \<dots>, x\<^sub>n]\<close>.
|
|
| 48118 | 1410 |
For term translations, the arguments are terms and a combination has |
| 61493 | 1411 |
the form @{ML Const}~\<open>(c, \<tau>)\<close> or @{ML Const}~\<open>(c, \<tau>)
|
1412 |
$ x\<^sub>1 $ \<dots> $ x\<^sub>n\<close>. Terms allow more sophisticated transformations |
|
| 48118 | 1413 |
than ASTs do, typically involving abstractions and bound |
| 61477 | 1414 |
variables. \<^emph>\<open>Typed\<close> print translations may even peek at the type |
| 61493 | 1415 |
\<open>\<tau>\<close> of the constant they are invoked on, although some |
| 52413 | 1416 |
information might have been suppressed for term output already. |
| 48118 | 1417 |
|
1418 |
Regardless of whether they act on ASTs or terms, translation |
|
1419 |
functions called during the parsing process differ from those for |
|
1420 |
printing in their overall behaviour: |
|
1421 |
||
| 61439 | 1422 |
\<^descr>[Parse translations] are applied bottom-up. The arguments are |
| 48118 | 1423 |
already in translated form. The translations must not fail; |
1424 |
exceptions trigger an error message. There may be at most one |
|
1425 |
function associated with any syntactic name. |
|
| 46294 | 1426 |
|
| 61439 | 1427 |
\<^descr>[Print translations] are applied top-down. They are supplied |
| 48118 | 1428 |
with arguments that are partly still in internal form. The result |
1429 |
again undergoes translation; therefore a print translation should |
|
1430 |
not introduce as head the very constant that invoked it. The |
|
1431 |
function may raise exception @{ML Match} to indicate failure; in
|
|
1432 |
this event it has no effect. Multiple functions associated with |
|
1433 |
some syntactic name are tried in the order of declaration in the |
|
1434 |
theory. |
|
1435 |
||
1436 |
||
1437 |
Only constant atoms --- constructor @{ML Ast.Constant} for ASTs and
|
|
1438 |
@{ML Const} for terms --- can invoke translation functions. This
|
|
1439 |
means that parse translations can only be associated with parse tree |
|
1440 |
heads of concrete syntax, or syntactic constants introduced via |
|
1441 |
other translations. For plain identifiers within the term language, |
|
1442 |
the status of constant versus variable is not yet know during |
|
1443 |
parsing. This is in contrast to print translations, where constants |
|
1444 |
are explicitly known from the given term in its fully internal form. |
|
| 58618 | 1445 |
\<close> |
| 28762 | 1446 |
|
|
52414
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1447 |
|
| 58618 | 1448 |
subsection \<open>Built-in syntax transformations\<close> |
|
52414
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1449 |
|
| 58618 | 1450 |
text \<open> |
|
52414
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1451 |
Here are some further details of the main syntax transformation |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1452 |
phases of \figref{fig:parse-print}.
|
| 58618 | 1453 |
\<close> |
|
52414
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1454 |
|
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1455 |
|
| 58618 | 1456 |
subsubsection \<open>Transforming parse trees to ASTs\<close> |
|
52414
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1457 |
|
| 58618 | 1458 |
text \<open>The parse tree is the raw output of the parser. It is |
|
52414
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1459 |
transformed into an AST according to some basic scheme that may be |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1460 |
augmented by AST translation functions as explained in |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1461 |
\secref{sec:tr-funs}.
|
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1462 |
|
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1463 |
The parse tree is constructed by nesting the right-hand sides of the |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1464 |
productions used to recognize the input. Such parse trees are |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1465 |
simply lists of tokens and constituent parse trees, the latter |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1466 |
representing the nonterminals of the productions. Ignoring AST |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1467 |
translation functions, parse trees are transformed to ASTs by |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1468 |
stripping out delimiters and copy productions, while retaining some |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1469 |
source position information from input tokens. |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1470 |
|
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1471 |
The Pure syntax provides predefined AST translations to make the |
| 61493 | 1472 |
basic \<open>\<lambda>\<close>-term structure more apparent within the |
|
52414
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1473 |
(first-order) AST representation, and thus facilitate the use of |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1474 |
@{command translations} (see also \secref{sec:syn-trans}). This
|
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1475 |
covers ordinary term application, type application, nested |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1476 |
abstraction, iterated meta implications and function types. The |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1477 |
effect is illustrated on some representative input strings is as |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1478 |
follows: |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1479 |
|
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1480 |
\begin{center}
|
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1481 |
\begin{tabular}{ll}
|
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1482 |
input source & AST \\ |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1483 |
\hline |
| 61503 | 1484 |
\<open>f x y z\<close> & \<^verbatim>\<open>(f x y z)\<close> \\ |
1485 |
\<open>'a ty\<close> & \<^verbatim>\<open>(ty 'a)\<close> \\ |
|
1486 |
\<open>('a, 'b)ty\<close> & \<^verbatim>\<open>(ty 'a 'b)\<close> \\
|
|
1487 |
\<open>\<lambda>x y z. t\<close> & \<^verbatim>\<open>("_abs" x ("_abs" y ("_abs" z t)))\<close> \\
|
|
1488 |
\<open>\<lambda>x :: 'a. t\<close> & \<^verbatim>\<open>("_abs" ("_constrain" x 'a) t)\<close> \\
|
|
1489 |
\<open>\<lbrakk>P; Q; R\<rbrakk> \<Longrightarrow> S\<close> & \<^verbatim>\<open>("Pure.imp" P ("Pure.imp" Q ("Pure.imp" R S)))\<close> \\
|
|
1490 |
\<open>['a, 'b, 'c] \<Rightarrow> 'd\<close> & \<^verbatim>\<open>("fun" 'a ("fun" 'b ("fun" 'c 'd)))\<close> \\
|
|
|
52414
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1491 |
\end{tabular}
|
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1492 |
\end{center}
|
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1493 |
|
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1494 |
Note that type and sort constraints may occur in further places --- |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1495 |
translations need to be ready to cope with them. The built-in |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1496 |
syntax transformation from parse trees to ASTs insert additional |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1497 |
constraints that represent source positions. |
| 58618 | 1498 |
\<close> |
|
52414
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1499 |
|
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1500 |
|
| 58618 | 1501 |
subsubsection \<open>Transforming ASTs to terms\<close> |
|
52414
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1502 |
|
| 58618 | 1503 |
text \<open>After application of macros (\secref{sec:syn-trans}), the AST
|
|
52414
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1504 |
is transformed into a term. This term still lacks proper type |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1505 |
information, but it might contain some constraints consisting of |
| 61503 | 1506 |
applications with head \<^verbatim>\<open>_constrain\<close>, where the second |
|
52414
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1507 |
argument is a type encoded as a pre-term within the syntax. Type |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1508 |
inference later introduces correct types, or indicates type errors |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1509 |
in the input. |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1510 |
|
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1511 |
Ignoring parse translations, ASTs are transformed to terms by |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1512 |
mapping AST constants to term constants, AST variables to term |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1513 |
variables or constants (according to the name space), and AST |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1514 |
applications to iterated term applications. |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1515 |
|
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1516 |
The outcome is still a first-order term. Proper abstractions and |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1517 |
bound variables are introduced by parse translations associated with |
| 61503 | 1518 |
certain syntax constants. Thus \<^verbatim>\<open>("_abs" x x)\<close> eventually
|
1519 |
becomes a de-Bruijn term \<^verbatim>\<open>Abs ("x", _, Bound 0)\<close>.
|
|
| 58618 | 1520 |
\<close> |
|
52414
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1521 |
|
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1522 |
|
| 58618 | 1523 |
subsubsection \<open>Printing of terms\<close> |
|
52414
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1524 |
|
| 58618 | 1525 |
text \<open>The output phase is essentially the inverse of the input |
|
52414
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1526 |
phase. Terms are translated via abstract syntax trees into |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1527 |
pretty-printed text. |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1528 |
|
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1529 |
Ignoring print translations, the transformation maps term constants, |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1530 |
variables and applications to the corresponding constructs on ASTs. |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1531 |
Abstractions are mapped to applications of the special constant |
| 61503 | 1532 |
\<^verbatim>\<open>_abs\<close> as seen before. Type constraints are represented |
1533 |
via special \<^verbatim>\<open>_constrain\<close> forms, according to various |
|
|
52414
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1534 |
policies of type annotation determined elsewhere. Sort constraints |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1535 |
of type variables are handled in a similar fashion. |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1536 |
|
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1537 |
After application of macros (\secref{sec:syn-trans}), the AST is
|
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1538 |
finally pretty-printed. The built-in print AST translations reverse |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1539 |
the corresponding parse AST translations. |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1540 |
|
| 61421 | 1541 |
\<^medskip> |
1542 |
For the actual printing process, the priority grammar |
|
|
52414
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1543 |
(\secref{sec:priority-grammar}) plays a vital role: productions are
|
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1544 |
used as templates for pretty printing, with argument slots stemming |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1545 |
from nonterminals, and syntactic sugar stemming from literal tokens. |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1546 |
|
| 61493 | 1547 |
Each AST application with constant head \<open>c\<close> and arguments |
1548 |
\<open>t\<^sub>1\<close>, \dots, \<open>t\<^sub>n\<close> (for \<open>n = 0\<close> the AST is |
|
1549 |
just the constant \<open>c\<close> itself) is printed according to the |
|
1550 |
first grammar production of result name \<open>c\<close>. The required |
|
|
52414
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1551 |
syntax priority of the argument slot is given by its nonterminal |
| 61493 | 1552 |
\<open>A\<^sup>(\<^sup>p\<^sup>)\<close>. The argument \<open>t\<^sub>i\<close> that corresponds to the |
1553 |
position of \<open>A\<^sup>(\<^sup>p\<^sup>)\<close> is printed recursively, and then put in |
|
1554 |
parentheses \<^emph>\<open>if\<close> its priority \<open>p\<close> requires this. The |
|
|
52414
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1555 |
resulting output is concatenated with the syntactic sugar according |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1556 |
to the grammar production. |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1557 |
|
| 61493 | 1558 |
If an AST application \<open>(c x\<^sub>1 \<dots> x\<^sub>m)\<close> has more arguments than |
1559 |
the corresponding production, it is first split into \<open>((c x\<^sub>1 |
|
1560 |
\<dots> x\<^sub>n) x\<^sub>n\<^sub>+\<^sub>1 \<dots> x\<^sub>m)\<close> and then printed recursively as above. |
|
|
52414
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1561 |
|
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1562 |
Applications with too few arguments or with non-constant head or |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1563 |
without a corresponding production are printed in prefix-form like |
| 61493 | 1564 |
\<open>f t\<^sub>1 \<dots> t\<^sub>n\<close> for terms. |
|
52414
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1565 |
|
| 61493 | 1566 |
Multiple productions associated with some name \<open>c\<close> are tried |
|
52414
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1567 |
in order of appearance within the grammar. An occurrence of some |
| 61493 | 1568 |
AST variable \<open>x\<close> is printed as \<open>x\<close> outright. |
|
52414
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1569 |
|
| 61421 | 1570 |
\<^medskip> |
| 61477 | 1571 |
White space is \<^emph>\<open>not\<close> inserted automatically. If |
|
52414
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1572 |
blanks (or breaks) are required to separate tokens, they need to be |
|
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1573 |
specified in the mixfix declaration (\secref{sec:mixfix}).
|
| 58618 | 1574 |
\<close> |
|
52414
8429123bc58a
more on built-in syntax transformations, based on reduced version of old material;
wenzelm
parents:
52413
diff
changeset
|
1575 |
|
| 28762 | 1576 |
end |