author | wenzelm |
Mon, 18 Oct 2010 12:33:13 +0100 | |
changeset 39863 | c0de5386017e |
parent 39861 | b8d89db3e238 |
child 39864 | f3b4fde34cd1 |
permissions | -rw-r--r-- |
29755 | 1 |
theory Prelim |
2 |
imports Base |
|
3 |
begin |
|
18537 | 4 |
|
5 |
chapter {* Preliminaries *} |
|
6 |
||
20429 | 7 |
section {* Contexts \label{sec:context} *} |
18537 | 8 |
|
20429 | 9 |
text {* |
20451 | 10 |
A logical context represents the background that is required for |
11 |
formulating statements and composing proofs. It acts as a medium to |
|
12 |
produce formal content, depending on earlier material (declarations, |
|
13 |
results etc.). |
|
18537 | 14 |
|
20451 | 15 |
For example, derivations within the Isabelle/Pure logic can be |
16 |
described as a judgment @{text "\<Gamma> \<turnstile>\<^sub>\<Theta> \<phi>"}, which means that a |
|
20429 | 17 |
proposition @{text "\<phi>"} is derivable from hypotheses @{text "\<Gamma>"} |
18 |
within the theory @{text "\<Theta>"}. There are logical reasons for |
|
20451 | 19 |
keeping @{text "\<Theta>"} and @{text "\<Gamma>"} separate: theories can be |
20 |
liberal about supporting type constructors and schematic |
|
21 |
polymorphism of constants and axioms, while the inner calculus of |
|
22 |
@{text "\<Gamma> \<turnstile> \<phi>"} is strictly limited to Simple Type Theory (with |
|
23 |
fixed type variables in the assumptions). |
|
18537 | 24 |
|
20429 | 25 |
\medskip Contexts and derivations are linked by the following key |
26 |
principles: |
|
27 |
||
28 |
\begin{itemize} |
|
29 |
||
30 |
\item Transfer: monotonicity of derivations admits results to be |
|
20451 | 31 |
transferred into a \emph{larger} context, i.e.\ @{text "\<Gamma> \<turnstile>\<^sub>\<Theta> |
32 |
\<phi>"} implies @{text "\<Gamma>' \<turnstile>\<^sub>\<Theta>\<^sub>' \<phi>"} for contexts @{text "\<Theta>' |
|
33 |
\<supseteq> \<Theta>"} and @{text "\<Gamma>' \<supseteq> \<Gamma>"}. |
|
18537 | 34 |
|
20429 | 35 |
\item Export: discharge of hypotheses admits results to be exported |
20451 | 36 |
into a \emph{smaller} context, i.e.\ @{text "\<Gamma>' \<turnstile>\<^sub>\<Theta> \<phi>"} |
37 |
implies @{text "\<Gamma> \<turnstile>\<^sub>\<Theta> \<Delta> \<Longrightarrow> \<phi>"} where @{text "\<Gamma>' \<supseteq> \<Gamma>"} and |
|
38 |
@{text "\<Delta> = \<Gamma>' - \<Gamma>"}. Note that @{text "\<Theta>"} remains unchanged here, |
|
39 |
only the @{text "\<Gamma>"} part is affected. |
|
20429 | 40 |
|
41 |
\end{itemize} |
|
18537 | 42 |
|
20451 | 43 |
\medskip By modeling the main characteristics of the primitive |
44 |
@{text "\<Theta>"} and @{text "\<Gamma>"} above, and abstracting over any |
|
45 |
particular logical content, we arrive at the fundamental notions of |
|
46 |
\emph{theory context} and \emph{proof context} in Isabelle/Isar. |
|
47 |
These implement a certain policy to manage arbitrary \emph{context |
|
48 |
data}. There is a strongly-typed mechanism to declare new kinds of |
|
20429 | 49 |
data at compile time. |
18537 | 50 |
|
20451 | 51 |
The internal bootstrap process of Isabelle/Pure eventually reaches a |
52 |
stage where certain data slots provide the logical content of @{text |
|
53 |
"\<Theta>"} and @{text "\<Gamma>"} sketched above, but this does not stop there! |
|
54 |
Various additional data slots support all kinds of mechanisms that |
|
55 |
are not necessarily part of the core logic. |
|
18537 | 56 |
|
20429 | 57 |
For example, there would be data for canonical introduction and |
58 |
elimination rules for arbitrary operators (depending on the |
|
59 |
object-logic and application), which enables users to perform |
|
20451 | 60 |
standard proof steps implicitly (cf.\ the @{text "rule"} method |
61 |
\cite{isabelle-isar-ref}). |
|
18537 | 62 |
|
20451 | 63 |
\medskip Thus Isabelle/Isar is able to bring forth more and more |
64 |
concepts successively. In particular, an object-logic like |
|
65 |
Isabelle/HOL continues the Isabelle/Pure setup by adding specific |
|
66 |
components for automated reasoning (classical reasoner, tableau |
|
67 |
prover, structured induction etc.) and derived specification |
|
68 |
mechanisms (inductive predicates, recursive functions etc.). All of |
|
69 |
this is ultimately based on the generic data management by theory |
|
70 |
and proof contexts introduced here. |
|
18537 | 71 |
*} |
72 |
||
73 |
||
74 |
subsection {* Theory context \label{sec:context-theory} *} |
|
75 |
||
34921 | 76 |
text {* A \emph{theory} is a data container with explicit name and |
77 |
unique identifier. Theories are related by a (nominal) sub-theory |
|
20451 | 78 |
relation, which corresponds to the dependency graph of the original |
79 |
construction; each theory is derived from a certain sub-graph of |
|
34921 | 80 |
ancestor theories. To this end, the system maintains a set of |
81 |
symbolic ``identification stamps'' within each theory. |
|
18537 | 82 |
|
34921 | 83 |
In order to avoid the full-scale overhead of explicit sub-theory |
84 |
identification of arbitrary intermediate stages, a theory is |
|
85 |
switched into @{text "draft"} mode under certain circumstances. A |
|
86 |
draft theory acts like a linear type, where updates invalidate |
|
87 |
earlier versions. An invalidated draft is called \emph{stale}. |
|
20429 | 88 |
|
34921 | 89 |
The @{text "checkpoint"} operation produces a safe stepping stone |
90 |
that will survive the next update without becoming stale: both the |
|
91 |
old and the new theory remain valid and are related by the |
|
92 |
sub-theory relation. Checkpointing essentially recovers purely |
|
93 |
functional theory values, at the expense of some extra internal |
|
94 |
bookkeeping. |
|
20447 | 95 |
|
96 |
The @{text "copy"} operation produces an auxiliary version that has |
|
97 |
the same data content, but is unrelated to the original: updates of |
|
98 |
the copy do not affect the original, neither does the sub-theory |
|
99 |
relation hold. |
|
20429 | 100 |
|
34921 | 101 |
The @{text "merge"} operation produces the least upper bound of two |
102 |
theories, which actually degenerates into absorption of one theory |
|
103 |
into the other (according to the nominal sub-theory relation). |
|
104 |
||
105 |
The @{text "begin"} operation starts a new theory by importing |
|
106 |
several parent theories and entering a special mode of nameless |
|
107 |
incremental updates, until the final @{text "end"} operation is |
|
108 |
performed. |
|
109 |
||
20447 | 110 |
\medskip The example in \figref{fig:ex-theory} below shows a theory |
20451 | 111 |
graph derived from @{text "Pure"}, with theory @{text "Length"} |
112 |
importing @{text "Nat"} and @{text "List"}. The body of @{text |
|
113 |
"Length"} consists of a sequence of updates, working mostly on |
|
34921 | 114 |
drafts internally, while transaction boundaries of Isar top-level |
115 |
commands (\secref{sec:isar-toplevel}) are guaranteed to be safe |
|
116 |
checkpoints. |
|
20447 | 117 |
|
118 |
\begin{figure}[htb] |
|
119 |
\begin{center} |
|
20429 | 120 |
\begin{tabular}{rcccl} |
20447 | 121 |
& & @{text "Pure"} \\ |
122 |
& & @{text "\<down>"} \\ |
|
123 |
& & @{text "FOL"} \\ |
|
18537 | 124 |
& $\swarrow$ & & $\searrow$ & \\ |
21852 | 125 |
@{text "Nat"} & & & & @{text "List"} \\ |
18537 | 126 |
& $\searrow$ & & $\swarrow$ \\ |
20447 | 127 |
& & @{text "Length"} \\ |
26864 | 128 |
& & \multicolumn{3}{l}{~~@{keyword "imports"}} \\ |
129 |
& & \multicolumn{3}{l}{~~@{keyword "begin"}} \\ |
|
18537 | 130 |
& & $\vdots$~~ \\ |
20447 | 131 |
& & @{text "\<bullet>"}~~ \\ |
132 |
& & $\vdots$~~ \\ |
|
133 |
& & @{text "\<bullet>"}~~ \\ |
|
134 |
& & $\vdots$~~ \\ |
|
26864 | 135 |
& & \multicolumn{3}{l}{~~@{command "end"}} \\ |
20429 | 136 |
\end{tabular} |
20451 | 137 |
\caption{A theory definition depending on ancestors}\label{fig:ex-theory} |
20447 | 138 |
\end{center} |
139 |
\end{figure} |
|
20451 | 140 |
|
141 |
\medskip There is a separate notion of \emph{theory reference} for |
|
142 |
maintaining a live link to an evolving theory context: updates on |
|
39821 | 143 |
drafts are propagated automatically. Dynamic updating stops when |
144 |
the next @{text "checkpoint"} is reached. |
|
20451 | 145 |
|
146 |
Derived entities may store a theory reference in order to indicate |
|
39821 | 147 |
the formal context from which they are derived. This implicitly |
148 |
assumes monotonic reasoning, because the referenced context may |
|
149 |
become larger without further notice. |
|
18537 | 150 |
*} |
151 |
||
20430 | 152 |
text %mlref {* |
20447 | 153 |
\begin{mldecls} |
154 |
@{index_ML_type theory} \\ |
|
39837 | 155 |
@{index_ML Theory.eq_thy: "theory * theory -> bool"} \\ |
20447 | 156 |
@{index_ML Theory.subthy: "theory * theory -> bool"} \\ |
157 |
@{index_ML Theory.checkpoint: "theory -> theory"} \\ |
|
20547 | 158 |
@{index_ML Theory.copy: "theory -> theory"} \\ |
34921 | 159 |
@{index_ML Theory.merge: "theory * theory -> theory"} \\ |
160 |
@{index_ML Theory.begin_theory: "string -> theory list -> theory"} \\ |
|
39837 | 161 |
@{index_ML Theory.parents_of: "theory -> theory list"} \\ |
162 |
@{index_ML Theory.ancestors_of: "theory -> theory list"} \\ |
|
20547 | 163 |
\end{mldecls} |
164 |
\begin{mldecls} |
|
20447 | 165 |
@{index_ML_type theory_ref} \\ |
166 |
@{index_ML Theory.deref: "theory_ref -> theory"} \\ |
|
24137
8d7896398147
replaced Theory.self_ref by Theory.check_thy, which now produces a checked ref;
wenzelm
parents:
22869
diff
changeset
|
167 |
@{index_ML Theory.check_thy: "theory -> theory_ref"} \\ |
20447 | 168 |
\end{mldecls} |
169 |
||
170 |
\begin{description} |
|
171 |
||
20451 | 172 |
\item @{ML_type theory} represents theory contexts. This is |
39821 | 173 |
essentially a linear type, with explicit runtime checking. |
174 |
Primitive theory operations destroy the original version, which then |
|
175 |
becomes ``stale''. This can be prevented by explicit checkpointing, |
|
176 |
which the system does at least at the boundary of toplevel command |
|
177 |
transactions \secref{sec:isar-toplevel}. |
|
20447 | 178 |
|
39837 | 179 |
\item @{ML "Theory.eq_thy"}~@{text "(thy\<^sub>1, thy\<^sub>2)"} check strict |
180 |
identity of two theories. |
|
181 |
||
34921 | 182 |
\item @{ML "Theory.subthy"}~@{text "(thy\<^sub>1, thy\<^sub>2)"} compares theories |
183 |
according to the intrinsic graph structure of the construction. |
|
184 |
This sub-theory relation is a nominal approximation of inclusion |
|
185 |
(@{text "\<subseteq>"}) of the corresponding content (according to the |
|
186 |
semantics of the ML modules that implement the data). |
|
20447 | 187 |
|
188 |
\item @{ML "Theory.checkpoint"}~@{text "thy"} produces a safe |
|
34921 | 189 |
stepping stone in the linear development of @{text "thy"}. This |
190 |
changes the old theory, but the next update will result in two |
|
191 |
related, valid theories. |
|
20447 | 192 |
|
193 |
\item @{ML "Theory.copy"}~@{text "thy"} produces a variant of @{text |
|
34921 | 194 |
"thy"} with the same data. The copy is not related to the original, |
195 |
but the original is unchanged. |
|
196 |
||
197 |
\item @{ML "Theory.merge"}~@{text "(thy\<^sub>1, thy\<^sub>2)"} absorbs one theory |
|
198 |
into the other, without changing @{text "thy\<^sub>1"} or @{text "thy\<^sub>2"}. |
|
199 |
This version of ad-hoc theory merge fails for unrelated theories! |
|
200 |
||
201 |
\item @{ML "Theory.begin_theory"}~@{text "name parents"} constructs |
|
39825
f9066b94bf07
eliminated fancy \ML logo for the sake of simpler source text (less dependence on LaTeX);
wenzelm
parents:
39821
diff
changeset
|
202 |
a new theory based on the given parents. This ML function is |
34921 | 203 |
normally not invoked directly. |
20447 | 204 |
|
39837 | 205 |
\item @{ML "Theory.parents_of"}~@{text "thy"} returns the direct |
206 |
ancestors of @{text thy}. |
|
207 |
||
208 |
\item @{ML "Theory.ancestors_of"}~@{text "thy"} returns all |
|
209 |
ancestors of @{text thy} (not including @{text thy} itself). |
|
210 |
||
20451 | 211 |
\item @{ML_type theory_ref} represents a sliding reference to an |
212 |
always valid theory; updates on the original are propagated |
|
20447 | 213 |
automatically. |
214 |
||
24137
8d7896398147
replaced Theory.self_ref by Theory.check_thy, which now produces a checked ref;
wenzelm
parents:
22869
diff
changeset
|
215 |
\item @{ML "Theory.deref"}~@{text "thy_ref"} turns a @{ML_type |
8d7896398147
replaced Theory.self_ref by Theory.check_thy, which now produces a checked ref;
wenzelm
parents:
22869
diff
changeset
|
216 |
"theory_ref"} into an @{ML_type "theory"} value. As the referenced |
8d7896398147
replaced Theory.self_ref by Theory.check_thy, which now produces a checked ref;
wenzelm
parents:
22869
diff
changeset
|
217 |
theory evolves monotonically over time, later invocations of @{ML |
20451 | 218 |
"Theory.deref"} may refer to a larger context. |
20447 | 219 |
|
24137
8d7896398147
replaced Theory.self_ref by Theory.check_thy, which now produces a checked ref;
wenzelm
parents:
22869
diff
changeset
|
220 |
\item @{ML "Theory.check_thy"}~@{text "thy"} produces a @{ML_type |
8d7896398147
replaced Theory.self_ref by Theory.check_thy, which now produces a checked ref;
wenzelm
parents:
22869
diff
changeset
|
221 |
"theory_ref"} from a valid @{ML_type "theory"} value. |
8d7896398147
replaced Theory.self_ref by Theory.check_thy, which now produces a checked ref;
wenzelm
parents:
22869
diff
changeset
|
222 |
|
20447 | 223 |
\end{description} |
20430 | 224 |
*} |
225 |
||
39832 | 226 |
text %mlantiq {* |
227 |
\begin{matharray}{rcl} |
|
228 |
@{ML_antiquotation_def "theory"} & : & @{text ML_antiquotation} \\ |
|
229 |
@{ML_antiquotation_def "theory_ref"} & : & @{text ML_antiquotation} \\ |
|
230 |
\end{matharray} |
|
231 |
||
232 |
\begin{rail} |
|
233 |
('theory' | 'theory\_ref') nameref? |
|
234 |
; |
|
235 |
\end{rail} |
|
236 |
||
237 |
\begin{description} |
|
238 |
||
239 |
\item @{text "@{theory}"} refers to the background theory of the |
|
240 |
current context --- as abstract value. |
|
241 |
||
242 |
\item @{text "@{theory A}"} refers to an explicitly named ancestor |
|
243 |
theory @{text "A"} of the background theory of the current context |
|
244 |
--- as abstract value. |
|
245 |
||
246 |
\item @{text "@{theory_ref}"} is similar to @{text "@{theory}"}, but |
|
247 |
produces a @{ML_type theory_ref} via @{ML "Theory.check_thy"} as |
|
248 |
explained above. |
|
249 |
||
250 |
\end{description} |
|
251 |
*} |
|
252 |
||
18537 | 253 |
|
254 |
subsection {* Proof context \label{sec:context-proof} *} |
|
255 |
||
34921 | 256 |
text {* A proof context is a container for pure data with a |
39821 | 257 |
back-reference to the theory from which it is derived. The @{text |
258 |
"init"} operation creates a proof context from a given theory. |
|
34921 | 259 |
Modifications to draft theories are propagated to the proof context |
260 |
as usual, but there is also an explicit @{text "transfer"} operation |
|
261 |
to force resynchronization with more substantial updates to the |
|
262 |
underlying theory. |
|
20429 | 263 |
|
34921 | 264 |
Entities derived in a proof context need to record logical |
20447 | 265 |
requirements explicitly, since there is no separate context |
34921 | 266 |
identification or symbolic inclusion as for theories. For example, |
267 |
hypotheses used in primitive derivations (cf.\ \secref{sec:thms}) |
|
268 |
are recorded separately within the sequent @{text "\<Gamma> \<turnstile> \<phi>"}, just to |
|
269 |
make double sure. Results could still leak into an alien proof |
|
270 |
context due to programming errors, but Isabelle/Isar includes some |
|
271 |
extra validity checks in critical positions, notably at the end of a |
|
272 |
sub-proof. |
|
20429 | 273 |
|
20451 | 274 |
Proof contexts may be manipulated arbitrarily, although the common |
275 |
discipline is to follow block structure as a mental model: a given |
|
276 |
context is extended consecutively, and results are exported back |
|
34921 | 277 |
into the original context. Note that an Isar proof state models |
20451 | 278 |
block-structured reasoning explicitly, using a stack of proof |
34921 | 279 |
contexts internally. For various technical reasons, the background |
280 |
theory of an Isar proof state must not be changed while the proof is |
|
281 |
still under construction! |
|
18537 | 282 |
*} |
283 |
||
20449 | 284 |
text %mlref {* |
285 |
\begin{mldecls} |
|
286 |
@{index_ML_type Proof.context} \\ |
|
36611 | 287 |
@{index_ML ProofContext.init_global: "theory -> Proof.context"} \\ |
20449 | 288 |
@{index_ML ProofContext.theory_of: "Proof.context -> theory"} \\ |
289 |
@{index_ML ProofContext.transfer: "theory -> Proof.context -> Proof.context"} \\ |
|
290 |
\end{mldecls} |
|
291 |
||
292 |
\begin{description} |
|
293 |
||
294 |
\item @{ML_type Proof.context} represents proof contexts. Elements |
|
295 |
of this type are essentially pure values, with a sliding reference |
|
296 |
to the background theory. |
|
297 |
||
36611 | 298 |
\item @{ML ProofContext.init_global}~@{text "thy"} produces a proof context |
20449 | 299 |
derived from @{text "thy"}, initializing all data. |
300 |
||
301 |
\item @{ML ProofContext.theory_of}~@{text "ctxt"} selects the |
|
20451 | 302 |
background theory from @{text "ctxt"}, dereferencing its internal |
303 |
@{ML_type theory_ref}. |
|
20449 | 304 |
|
305 |
\item @{ML ProofContext.transfer}~@{text "thy ctxt"} promotes the |
|
306 |
background theory of @{text "ctxt"} to the super theory @{text |
|
307 |
"thy"}. |
|
308 |
||
309 |
\end{description} |
|
310 |
*} |
|
311 |
||
39832 | 312 |
text %mlantiq {* |
313 |
\begin{matharray}{rcl} |
|
314 |
@{ML_antiquotation_def "context"} & : & @{text ML_antiquotation} \\ |
|
315 |
\end{matharray} |
|
316 |
||
317 |
\begin{description} |
|
318 |
||
319 |
\item @{text "@{context}"} refers to \emph{the} context at |
|
320 |
compile-time --- as abstract value. Independently of (local) theory |
|
321 |
or proof mode, this always produces a meaningful result. |
|
322 |
||
323 |
This is probably the most common antiquotation in interactive |
|
324 |
experimentation with ML inside Isar. |
|
325 |
||
326 |
\end{description} |
|
327 |
*} |
|
328 |
||
20430 | 329 |
|
20451 | 330 |
subsection {* Generic contexts \label{sec:generic-context} *} |
20429 | 331 |
|
20449 | 332 |
text {* |
333 |
A generic context is the disjoint sum of either a theory or proof |
|
20451 | 334 |
context. Occasionally, this enables uniform treatment of generic |
20450 | 335 |
context data, typically extra-logical information. Operations on |
20449 | 336 |
generic contexts include the usual injections, partial selections, |
337 |
and combinators for lifting operations on either component of the |
|
338 |
disjoint sum. |
|
339 |
||
340 |
Moreover, there are total operations @{text "theory_of"} and @{text |
|
341 |
"proof_of"} to convert a generic context into either kind: a theory |
|
20451 | 342 |
can always be selected from the sum, while a proof context might |
34921 | 343 |
have to be constructed by an ad-hoc @{text "init"} operation, which |
344 |
incurs a small runtime overhead. |
|
20449 | 345 |
*} |
20430 | 346 |
|
20449 | 347 |
text %mlref {* |
348 |
\begin{mldecls} |
|
349 |
@{index_ML_type Context.generic} \\ |
|
350 |
@{index_ML Context.theory_of: "Context.generic -> theory"} \\ |
|
351 |
@{index_ML Context.proof_of: "Context.generic -> Proof.context"} \\ |
|
352 |
\end{mldecls} |
|
353 |
||
354 |
\begin{description} |
|
20430 | 355 |
|
20449 | 356 |
\item @{ML_type Context.generic} is the direct sum of @{ML_type |
20451 | 357 |
"theory"} and @{ML_type "Proof.context"}, with the datatype |
358 |
constructors @{ML "Context.Theory"} and @{ML "Context.Proof"}. |
|
20449 | 359 |
|
360 |
\item @{ML Context.theory_of}~@{text "context"} always produces a |
|
361 |
theory from the generic @{text "context"}, using @{ML |
|
362 |
"ProofContext.theory_of"} as required. |
|
363 |
||
364 |
\item @{ML Context.proof_of}~@{text "context"} always produces a |
|
365 |
proof context from the generic @{text "context"}, using @{ML |
|
36611 | 366 |
"ProofContext.init_global"} as required (note that this re-initializes the |
20451 | 367 |
context data with each invocation). |
20449 | 368 |
|
369 |
\end{description} |
|
370 |
*} |
|
20437 | 371 |
|
20476 | 372 |
|
373 |
subsection {* Context data \label{sec:context-data} *} |
|
20447 | 374 |
|
33524
a08e6c1cbc04
updated functor Theory_Data, Proof_Data, Generic_Data;
wenzelm
parents:
33174
diff
changeset
|
375 |
text {* The main purpose of theory and proof contexts is to manage |
a08e6c1cbc04
updated functor Theory_Data, Proof_Data, Generic_Data;
wenzelm
parents:
33174
diff
changeset
|
376 |
arbitrary (pure) data. New data types can be declared incrementally |
a08e6c1cbc04
updated functor Theory_Data, Proof_Data, Generic_Data;
wenzelm
parents:
33174
diff
changeset
|
377 |
at compile time. There are separate declaration mechanisms for any |
a08e6c1cbc04
updated functor Theory_Data, Proof_Data, Generic_Data;
wenzelm
parents:
33174
diff
changeset
|
378 |
of the three kinds of contexts: theory, proof, generic. |
20449 | 379 |
|
33524
a08e6c1cbc04
updated functor Theory_Data, Proof_Data, Generic_Data;
wenzelm
parents:
33174
diff
changeset
|
380 |
\paragraph{Theory data} declarations need to implement the following |
a08e6c1cbc04
updated functor Theory_Data, Proof_Data, Generic_Data;
wenzelm
parents:
33174
diff
changeset
|
381 |
SML signature: |
20449 | 382 |
|
383 |
\medskip |
|
384 |
\begin{tabular}{ll} |
|
22869 | 385 |
@{text "\<type> T"} & representing type \\ |
386 |
@{text "\<val> empty: T"} & empty default value \\ |
|
387 |
@{text "\<val> extend: T \<rightarrow> T"} & re-initialize on import \\ |
|
388 |
@{text "\<val> merge: T \<times> T \<rightarrow> T"} & join on import \\ |
|
20449 | 389 |
\end{tabular} |
390 |
\medskip |
|
391 |
||
39861
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
392 |
The @{text "empty"} value acts as initial default for \emph{any} |
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
393 |
theory that does not declare actual data content; @{text "extend"} |
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
394 |
is acts like a unitary version of @{text "merge"}. |
20449 | 395 |
|
34921 | 396 |
Implementing @{text "merge"} can be tricky. The general idea is |
397 |
that @{text "merge (data\<^sub>1, data\<^sub>2)"} inserts those parts of @{text |
|
398 |
"data\<^sub>2"} into @{text "data\<^sub>1"} that are not yet present, while |
|
399 |
keeping the general order of things. The @{ML Library.merge} |
|
400 |
function on plain lists may serve as canonical template. |
|
401 |
||
402 |
Particularly note that shared parts of the data must not be |
|
403 |
duplicated by naive concatenation, or a theory graph that is like a |
|
404 |
chain of diamonds would cause an exponential blowup! |
|
405 |
||
33524
a08e6c1cbc04
updated functor Theory_Data, Proof_Data, Generic_Data;
wenzelm
parents:
33174
diff
changeset
|
406 |
\paragraph{Proof context data} declarations need to implement the |
a08e6c1cbc04
updated functor Theory_Data, Proof_Data, Generic_Data;
wenzelm
parents:
33174
diff
changeset
|
407 |
following SML signature: |
20449 | 408 |
|
409 |
\medskip |
|
410 |
\begin{tabular}{ll} |
|
22869 | 411 |
@{text "\<type> T"} & representing type \\ |
412 |
@{text "\<val> init: theory \<rightarrow> T"} & produce initial value \\ |
|
20449 | 413 |
\end{tabular} |
414 |
\medskip |
|
415 |
||
39861
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
416 |
The @{text "init"} operation is supposed to produce a pure value |
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
417 |
from the given background theory and should be somehow |
34921 | 418 |
``immediate''. Whenever a proof context is initialized, which |
419 |
happens frequently, the the system invokes the @{text "init"} |
|
39821 | 420 |
operation of \emph{all} theory data slots ever declared. This also |
421 |
means that one needs to be economic about the total number of proof |
|
422 |
data declarations in the system, i.e.\ each ML module should declare |
|
423 |
at most one, sometimes two data slots for its internal use. |
|
424 |
Repeated data declarations to simulate a record type should be |
|
425 |
avoided! |
|
20449 | 426 |
|
20451 | 427 |
\paragraph{Generic data} provides a hybrid interface for both theory |
33524
a08e6c1cbc04
updated functor Theory_Data, Proof_Data, Generic_Data;
wenzelm
parents:
33174
diff
changeset
|
428 |
and proof data. The @{text "init"} operation for proof contexts is |
a08e6c1cbc04
updated functor Theory_Data, Proof_Data, Generic_Data;
wenzelm
parents:
33174
diff
changeset
|
429 |
predefined to select the current data value from the background |
a08e6c1cbc04
updated functor Theory_Data, Proof_Data, Generic_Data;
wenzelm
parents:
33174
diff
changeset
|
430 |
theory. |
20449 | 431 |
|
39821 | 432 |
\bigskip Any of the above data declarations over type @{text "T"} |
433 |
result in an ML structure with the following signature: |
|
20449 | 434 |
|
435 |
\medskip |
|
436 |
\begin{tabular}{ll} |
|
437 |
@{text "get: context \<rightarrow> T"} \\ |
|
438 |
@{text "put: T \<rightarrow> context \<rightarrow> context"} \\ |
|
439 |
@{text "map: (T \<rightarrow> T) \<rightarrow> context \<rightarrow> context"} \\ |
|
440 |
\end{tabular} |
|
441 |
\medskip |
|
442 |
||
39861
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
443 |
These other operations provide exclusive access for the particular |
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
444 |
kind of context (theory, proof, or generic context). This interface |
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
445 |
observes the ML discipline for types and scopes: there is no other |
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
446 |
way to access the corresponding data slot of a context. By keeping |
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
447 |
these operations private, an Isabelle/ML module may maintain |
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
448 |
abstract values authentically. *} |
20447 | 449 |
|
20450 | 450 |
text %mlref {* |
451 |
\begin{mldecls} |
|
33524
a08e6c1cbc04
updated functor Theory_Data, Proof_Data, Generic_Data;
wenzelm
parents:
33174
diff
changeset
|
452 |
@{index_ML_functor Theory_Data} \\ |
a08e6c1cbc04
updated functor Theory_Data, Proof_Data, Generic_Data;
wenzelm
parents:
33174
diff
changeset
|
453 |
@{index_ML_functor Proof_Data} \\ |
a08e6c1cbc04
updated functor Theory_Data, Proof_Data, Generic_Data;
wenzelm
parents:
33174
diff
changeset
|
454 |
@{index_ML_functor Generic_Data} \\ |
20450 | 455 |
\end{mldecls} |
456 |
||
457 |
\begin{description} |
|
458 |
||
33524
a08e6c1cbc04
updated functor Theory_Data, Proof_Data, Generic_Data;
wenzelm
parents:
33174
diff
changeset
|
459 |
\item @{ML_functor Theory_Data}@{text "(spec)"} declares data for |
20450 | 460 |
type @{ML_type theory} according to the specification provided as |
20451 | 461 |
argument structure. The resulting structure provides data init and |
462 |
access operations as described above. |
|
20450 | 463 |
|
33524
a08e6c1cbc04
updated functor Theory_Data, Proof_Data, Generic_Data;
wenzelm
parents:
33174
diff
changeset
|
464 |
\item @{ML_functor Proof_Data}@{text "(spec)"} is analogous to |
a08e6c1cbc04
updated functor Theory_Data, Proof_Data, Generic_Data;
wenzelm
parents:
33174
diff
changeset
|
465 |
@{ML_functor Theory_Data} for type @{ML_type Proof.context}. |
20450 | 466 |
|
33524
a08e6c1cbc04
updated functor Theory_Data, Proof_Data, Generic_Data;
wenzelm
parents:
33174
diff
changeset
|
467 |
\item @{ML_functor Generic_Data}@{text "(spec)"} is analogous to |
a08e6c1cbc04
updated functor Theory_Data, Proof_Data, Generic_Data;
wenzelm
parents:
33174
diff
changeset
|
468 |
@{ML_functor Theory_Data} for type @{ML_type Context.generic}. |
20450 | 469 |
|
470 |
\end{description} |
|
471 |
*} |
|
472 |
||
34928 | 473 |
text %mlex {* |
474 |
The following artificial example demonstrates theory |
|
475 |
data: we maintain a set of terms that are supposed to be wellformed |
|
476 |
wrt.\ the enclosing theory. The public interface is as follows: |
|
477 |
*} |
|
478 |
||
479 |
ML {* |
|
480 |
signature WELLFORMED_TERMS = |
|
481 |
sig |
|
482 |
val get: theory -> term list |
|
483 |
val add: term -> theory -> theory |
|
484 |
end; |
|
485 |
*} |
|
486 |
||
39861
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
487 |
text {* The implementation uses private theory data internally, and |
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
488 |
only exposes an operation that involves explicit argument checking |
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
489 |
wrt.\ the given theory. *} |
34928 | 490 |
|
491 |
ML {* |
|
492 |
structure Wellformed_Terms: WELLFORMED_TERMS = |
|
493 |
struct |
|
494 |
||
495 |
structure Terms = Theory_Data |
|
496 |
( |
|
39687 | 497 |
type T = term Ord_List.T; |
34928 | 498 |
val empty = []; |
499 |
val extend = I; |
|
500 |
fun merge (ts1, ts2) = |
|
39687 | 501 |
Ord_List.union Term_Ord.fast_term_ord ts1 ts2; |
39861
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
502 |
); |
34928 | 503 |
|
504 |
val get = Terms.get; |
|
505 |
||
506 |
fun add raw_t thy = |
|
39821 | 507 |
let |
508 |
val t = Sign.cert_term thy raw_t; |
|
509 |
in |
|
510 |
Terms.map (Ord_List.insert Term_Ord.fast_term_ord t) thy |
|
511 |
end; |
|
34928 | 512 |
|
513 |
end; |
|
514 |
*} |
|
515 |
||
39861
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
516 |
text {* We use @{ML_type "term Ord_List.T"} for reasonably efficient |
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
517 |
representation of a set of terms: all operations are linear in the |
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
518 |
number of stored elements. Here we assume that users of this module |
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
519 |
do not care about the declaration order, since that data structure |
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
520 |
forces its own arrangement of elements. |
34928 | 521 |
|
522 |
Observe how the @{verbatim merge} operation joins the data slots of |
|
39687 | 523 |
the two constituents: @{ML Ord_List.union} prevents duplication of |
34928 | 524 |
common data from different branches, thus avoiding the danger of |
39821 | 525 |
exponential blowup. Plain list append etc.\ must never be used for |
526 |
theory data merges! |
|
34928 | 527 |
|
528 |
\medskip Our intended invariant is achieved as follows: |
|
529 |
\begin{enumerate} |
|
530 |
||
531 |
\item @{ML Wellformed_Terms.add} only admits terms that have passed |
|
532 |
the @{ML Sign.cert_term} check of the given theory at that point. |
|
533 |
||
534 |
\item Wellformedness in the sense of @{ML Sign.cert_term} is |
|
535 |
monotonic wrt.\ the sub-theory relation. So our data can move |
|
536 |
upwards in the hierarchy (via extension or merges), and maintain |
|
537 |
wellformedness without further checks. |
|
538 |
||
539 |
\end{enumerate} |
|
540 |
||
541 |
Note that all basic operations of the inference kernel (which |
|
542 |
includes @{ML Sign.cert_term}) observe this monotonicity principle, |
|
543 |
but other user-space tools don't. For example, fully-featured |
|
544 |
type-inference via @{ML Syntax.check_term} (cf.\ |
|
545 |
\secref{sec:term-check}) is not necessarily monotonic wrt.\ the |
|
546 |
background theory, since constraints of term constants can be |
|
39821 | 547 |
modified by later declarations, for example. |
34928 | 548 |
|
549 |
In most cases, user-space context data does not have to take such |
|
550 |
invariants too seriously. The situation is different in the |
|
551 |
implementation of the inference kernel itself, which uses the very |
|
552 |
same data mechanisms for types, constants, axioms etc. |
|
553 |
*} |
|
554 |
||
20447 | 555 |
|
26872 | 556 |
section {* Names \label{sec:names} *} |
20451 | 557 |
|
34925 | 558 |
text {* In principle, a name is just a string, but there are various |
559 |
conventions for representing additional structure. For example, |
|
34927
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
560 |
``@{text "Foo.bar.baz"}'' is considered as a long name consisting of |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
561 |
qualifier @{text "Foo.bar"} and base name @{text "baz"}. The |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
562 |
individual constituents of a name may have further substructure, |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
563 |
e.g.\ the string ``\verb,\,\verb,<alpha>,'' encodes as a single |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
564 |
symbol. |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
565 |
|
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
566 |
\medskip Subsequently, we shall introduce specific categories of |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
567 |
names. Roughly speaking these correspond to logical entities as |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
568 |
follows: |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
569 |
\begin{itemize} |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
570 |
|
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
571 |
\item Basic names (\secref{sec:basic-name}): free and bound |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
572 |
variables. |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
573 |
|
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
574 |
\item Indexed names (\secref{sec:indexname}): schematic variables. |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
575 |
|
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
576 |
\item Long names (\secref{sec:long-name}): constants of any kind |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
577 |
(type constructors, term constants, other concepts defined in user |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
578 |
space). Such entities are typically managed via name spaces |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
579 |
(\secref{sec:name-space}). |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
580 |
|
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
581 |
\end{itemize} |
20451 | 582 |
*} |
20437 | 583 |
|
584 |
||
39863 | 585 |
subsection {* Strings of symbols \label{sec:symbols} *} |
20437 | 586 |
|
34925 | 587 |
text {* A \emph{symbol} constitutes the smallest textual unit in |
588 |
Isabelle --- raw ML characters are normally not encountered at all! |
|
589 |
Isabelle strings consist of a sequence of symbols, represented as a |
|
590 |
packed string or an exploded list of strings. Each symbol is in |
|
591 |
itself a small string, which has either one of the following forms: |
|
20437 | 592 |
|
20451 | 593 |
\begin{enumerate} |
20437 | 594 |
|
37533
d775bd70f571
explicit treatment of UTF8 character sequences as Isabelle symbols;
wenzelm
parents:
36611
diff
changeset
|
595 |
\item a single ASCII character ``@{text "c"}'', for example |
d775bd70f571
explicit treatment of UTF8 character sequences as Isabelle symbols;
wenzelm
parents:
36611
diff
changeset
|
596 |
``\verb,a,'', |
d775bd70f571
explicit treatment of UTF8 character sequences as Isabelle symbols;
wenzelm
parents:
36611
diff
changeset
|
597 |
|
d775bd70f571
explicit treatment of UTF8 character sequences as Isabelle symbols;
wenzelm
parents:
36611
diff
changeset
|
598 |
\item a codepoint according to UTF8 (non-ASCII byte sequence), |
20437 | 599 |
|
20488 | 600 |
\item a regular symbol ``\verb,\,\verb,<,@{text "ident"}\verb,>,'', |
20476 | 601 |
for example ``\verb,\,\verb,<alpha>,'', |
20437 | 602 |
|
20488 | 603 |
\item a control symbol ``\verb,\,\verb,<^,@{text "ident"}\verb,>,'', |
20476 | 604 |
for example ``\verb,\,\verb,<^bold>,'', |
20437 | 605 |
|
20488 | 606 |
\item a raw symbol ``\verb,\,\verb,<^raw:,@{text text}\verb,>,'' |
34925 | 607 |
where @{text text} consists of printable characters excluding |
20476 | 608 |
``\verb,.,'' and ``\verb,>,'', for example |
609 |
``\verb,\,\verb,<^raw:$\sum_{i = 1}^n$>,'', |
|
20437 | 610 |
|
20488 | 611 |
\item a numbered raw control symbol ``\verb,\,\verb,<^raw,@{text |
20476 | 612 |
n}\verb,>, where @{text n} consists of digits, for example |
20451 | 613 |
``\verb,\,\verb,<^raw42>,''. |
20437 | 614 |
|
20451 | 615 |
\end{enumerate} |
20437 | 616 |
|
39861
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
617 |
The @{text "ident"} syntax for symbol names is @{text "letter |
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
618 |
(letter | digit)\<^sup>*"}, where @{text "letter = A..Za..z"} and @{text |
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
619 |
"digit = 0..9"}. There are infinitely many regular symbols and |
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
620 |
control symbols, but a fixed collection of standard symbols is |
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
621 |
treated specifically. For example, ``\verb,\,\verb,<alpha>,'' is |
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
622 |
classified as a letter, which means it may occur within regular |
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
623 |
Isabelle identifiers. |
20437 | 624 |
|
37533
d775bd70f571
explicit treatment of UTF8 character sequences as Isabelle symbols;
wenzelm
parents:
36611
diff
changeset
|
625 |
The character set underlying Isabelle symbols is 7-bit ASCII, but |
d775bd70f571
explicit treatment of UTF8 character sequences as Isabelle symbols;
wenzelm
parents:
36611
diff
changeset
|
626 |
8-bit character sequences are passed-through unchanged. Unicode/UCS |
d775bd70f571
explicit treatment of UTF8 character sequences as Isabelle symbols;
wenzelm
parents:
36611
diff
changeset
|
627 |
data in UTF-8 encoding is processed in a non-strict fashion, such |
d775bd70f571
explicit treatment of UTF8 character sequences as Isabelle symbols;
wenzelm
parents:
36611
diff
changeset
|
628 |
that well-formed code sequences are recognized |
d775bd70f571
explicit treatment of UTF8 character sequences as Isabelle symbols;
wenzelm
parents:
36611
diff
changeset
|
629 |
accordingly.\footnote{Note that ISO-Latin-1 differs from UTF-8 only |
d775bd70f571
explicit treatment of UTF8 character sequences as Isabelle symbols;
wenzelm
parents:
36611
diff
changeset
|
630 |
in some special punctuation characters that even have replacements |
d775bd70f571
explicit treatment of UTF8 character sequences as Isabelle symbols;
wenzelm
parents:
36611
diff
changeset
|
631 |
within the standard collection of Isabelle symbols. Text consisting |
d775bd70f571
explicit treatment of UTF8 character sequences as Isabelle symbols;
wenzelm
parents:
36611
diff
changeset
|
632 |
of ASCII plus accented letters can be processed in either encoding.} |
d775bd70f571
explicit treatment of UTF8 character sequences as Isabelle symbols;
wenzelm
parents:
36611
diff
changeset
|
633 |
Unicode provides its own collection of mathematical symbols, but |
d775bd70f571
explicit treatment of UTF8 character sequences as Isabelle symbols;
wenzelm
parents:
36611
diff
changeset
|
634 |
within the core Isabelle/ML world there is no link to the standard |
d775bd70f571
explicit treatment of UTF8 character sequences as Isabelle symbols;
wenzelm
parents:
36611
diff
changeset
|
635 |
collection of Isabelle regular symbols. |
20476 | 636 |
|
637 |
\medskip Output of Isabelle symbols depends on the print mode |
|
29758 | 638 |
(\secref{print-mode}). For example, the standard {\LaTeX} setup of |
639 |
the Isabelle document preparation system would present |
|
20451 | 640 |
``\verb,\,\verb,<alpha>,'' as @{text "\<alpha>"}, and |
641 |
``\verb,\,\verb,<^bold>,\verb,\,\verb,<alpha>,'' as @{text |
|
34925 | 642 |
"\<^bold>\<alpha>"}. On-screen rendering usually works by mapping a finite |
643 |
subset of Isabelle symbols to suitable Unicode characters. |
|
20451 | 644 |
*} |
20437 | 645 |
|
646 |
text %mlref {* |
|
647 |
\begin{mldecls} |
|
34921 | 648 |
@{index_ML_type "Symbol.symbol": string} \\ |
20437 | 649 |
@{index_ML Symbol.explode: "string -> Symbol.symbol list"} \\ |
650 |
@{index_ML Symbol.is_letter: "Symbol.symbol -> bool"} \\ |
|
651 |
@{index_ML Symbol.is_digit: "Symbol.symbol -> bool"} \\ |
|
652 |
@{index_ML Symbol.is_quasi: "Symbol.symbol -> bool"} \\ |
|
20547 | 653 |
@{index_ML Symbol.is_blank: "Symbol.symbol -> bool"} \\ |
654 |
\end{mldecls} |
|
655 |
\begin{mldecls} |
|
20437 | 656 |
@{index_ML_type "Symbol.sym"} \\ |
657 |
@{index_ML Symbol.decode: "Symbol.symbol -> Symbol.sym"} \\ |
|
658 |
\end{mldecls} |
|
659 |
||
660 |
\begin{description} |
|
661 |
||
20488 | 662 |
\item @{ML_type "Symbol.symbol"} represents individual Isabelle |
34921 | 663 |
symbols. |
20437 | 664 |
|
20476 | 665 |
\item @{ML "Symbol.explode"}~@{text "str"} produces a symbol list |
39821 | 666 |
from the packed form. This function supersedes @{ML |
20476 | 667 |
"String.explode"} for virtually all purposes of manipulating text in |
34925 | 668 |
Isabelle!\footnote{The runtime overhead for exploded strings is |
669 |
mainly that of the list structure: individual symbols that happen to |
|
39821 | 670 |
be a singleton string do not require extra memory in Poly/ML.} |
20437 | 671 |
|
672 |
\item @{ML "Symbol.is_letter"}, @{ML "Symbol.is_digit"}, @{ML |
|
20476 | 673 |
"Symbol.is_quasi"}, @{ML "Symbol.is_blank"} classify standard |
674 |
symbols according to fixed syntactic conventions of Isabelle, cf.\ |
|
675 |
\cite{isabelle-isar-ref}. |
|
20437 | 676 |
|
677 |
\item @{ML_type "Symbol.sym"} is a concrete datatype that represents |
|
20488 | 678 |
the different kinds of symbols explicitly, with constructors @{ML |
37533
d775bd70f571
explicit treatment of UTF8 character sequences as Isabelle symbols;
wenzelm
parents:
36611
diff
changeset
|
679 |
"Symbol.Char"}, @{ML "Symbol.Sym"}, @{ML "Symbol.UTF8"}, @{ML |
d775bd70f571
explicit treatment of UTF8 character sequences as Isabelle symbols;
wenzelm
parents:
36611
diff
changeset
|
680 |
"Symbol.Ctrl"}, @{ML "Symbol.Raw"}. |
20437 | 681 |
|
682 |
\item @{ML "Symbol.decode"} converts the string representation of a |
|
20451 | 683 |
symbol into the datatype version. |
20437 | 684 |
|
685 |
\end{description} |
|
34925 | 686 |
|
687 |
\paragraph{Historical note.} In the original SML90 standard the |
|
688 |
primitive ML type @{ML_type char} did not exists, and the basic @{ML |
|
689 |
"explode: string -> string list"} operation would produce a list of |
|
690 |
singleton strings as in Isabelle/ML today. When SML97 came out, |
|
34927
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
691 |
Isabelle did not adopt its slightly anachronistic 8-bit characters, |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
692 |
but the idea of exploding a string into a list of small strings was |
34925 | 693 |
extended to ``symbols'' as explained above. Thus Isabelle sources |
694 |
can refer to an infinite store of user-defined symbols, without |
|
695 |
having to worry about the multitude of Unicode encodings. |
|
20437 | 696 |
*} |
697 |
||
698 |
||
34927
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
699 |
subsection {* Basic names \label{sec:basic-name} *} |
20476 | 700 |
|
701 |
text {* |
|
702 |
A \emph{basic name} essentially consists of a single Isabelle |
|
703 |
identifier. There are conventions to mark separate classes of basic |
|
29761 | 704 |
names, by attaching a suffix of underscores: one underscore means |
705 |
\emph{internal name}, two underscores means \emph{Skolem name}, |
|
706 |
three underscores means \emph{internal Skolem name}. |
|
20476 | 707 |
|
708 |
For example, the basic name @{text "foo"} has the internal version |
|
709 |
@{text "foo_"}, with Skolem versions @{text "foo__"} and @{text |
|
710 |
"foo___"}, respectively. |
|
711 |
||
20488 | 712 |
These special versions provide copies of the basic name space, apart |
713 |
from anything that normally appears in the user text. For example, |
|
714 |
system generated variables in Isar proof contexts are usually marked |
|
34926 | 715 |
as internal, which prevents mysterious names like @{text "xaa"} to |
716 |
appear in human-readable text. |
|
20476 | 717 |
|
20488 | 718 |
\medskip Manipulating binding scopes often requires on-the-fly |
719 |
renamings. A \emph{name context} contains a collection of already |
|
720 |
used names. The @{text "declare"} operation adds names to the |
|
721 |
context. |
|
20476 | 722 |
|
20488 | 723 |
The @{text "invents"} operation derives a number of fresh names from |
724 |
a given starting point. For example, the first three names derived |
|
725 |
from @{text "a"} are @{text "a"}, @{text "b"}, @{text "c"}. |
|
20476 | 726 |
|
727 |
The @{text "variants"} operation produces fresh names by |
|
20488 | 728 |
incrementing tentative names as base-26 numbers (with digits @{text |
729 |
"a..z"}) until all clashes are resolved. For example, name @{text |
|
730 |
"foo"} results in variants @{text "fooa"}, @{text "foob"}, @{text |
|
731 |
"fooc"}, \dots, @{text "fooaa"}, @{text "fooab"} etc.; each renaming |
|
732 |
step picks the next unused variant from this sequence. |
|
20476 | 733 |
*} |
734 |
||
735 |
text %mlref {* |
|
736 |
\begin{mldecls} |
|
737 |
@{index_ML Name.internal: "string -> string"} \\ |
|
20547 | 738 |
@{index_ML Name.skolem: "string -> string"} \\ |
739 |
\end{mldecls} |
|
740 |
\begin{mldecls} |
|
20476 | 741 |
@{index_ML_type Name.context} \\ |
742 |
@{index_ML Name.context: Name.context} \\ |
|
743 |
@{index_ML Name.declare: "string -> Name.context -> Name.context"} \\ |
|
744 |
@{index_ML Name.invents: "Name.context -> string -> int -> string list"} \\ |
|
745 |
@{index_ML Name.variants: "string list -> Name.context -> string list * Name.context"} \\ |
|
746 |
\end{mldecls} |
|
34926 | 747 |
\begin{mldecls} |
748 |
@{index_ML Variable.names_of: "Proof.context -> Name.context"} \\ |
|
749 |
\end{mldecls} |
|
20476 | 750 |
|
751 |
\begin{description} |
|
752 |
||
753 |
\item @{ML Name.internal}~@{text "name"} produces an internal name |
|
754 |
by adding one underscore. |
|
755 |
||
756 |
\item @{ML Name.skolem}~@{text "name"} produces a Skolem name by |
|
757 |
adding two underscores. |
|
758 |
||
759 |
\item @{ML_type Name.context} represents the context of already used |
|
760 |
names; the initial value is @{ML "Name.context"}. |
|
761 |
||
20488 | 762 |
\item @{ML Name.declare}~@{text "name"} enters a used name into the |
763 |
context. |
|
20437 | 764 |
|
20488 | 765 |
\item @{ML Name.invents}~@{text "context name n"} produces @{text |
766 |
"n"} fresh names derived from @{text "name"}. |
|
767 |
||
768 |
\item @{ML Name.variants}~@{text "names context"} produces fresh |
|
29761 | 769 |
variants of @{text "names"}; the result is entered into the context. |
20476 | 770 |
|
34926 | 771 |
\item @{ML Variable.names_of}~@{text "ctxt"} retrieves the context |
772 |
of declared type and term variable names. Projecting a proof |
|
773 |
context down to a primitive name context is occasionally useful when |
|
774 |
invoking lower-level operations. Regular management of ``fresh |
|
775 |
variables'' is done by suitable operations of structure @{ML_struct |
|
776 |
Variable}, which is also able to provide an official status of |
|
777 |
``locally fixed variable'' within the logical environment (cf.\ |
|
778 |
\secref{sec:variables}). |
|
779 |
||
20476 | 780 |
\end{description} |
781 |
*} |
|
782 |
||
39857 | 783 |
text %mlex {* The following simple examples demonstrate how to produce |
784 |
fresh names from the initial @{ML Name.context}. *} |
|
785 |
||
786 |
ML {* |
|
787 |
Name.invents Name.context "a" 5; |
|
788 |
#1 (Name.variants ["x", "x", "a", "a", "'a", "'a"] Name.context); |
|
789 |
*} |
|
790 |
||
791 |
text {* \medskip The same works reletively to the formal context as |
|
39861
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
792 |
follows. *} |
39857 | 793 |
|
794 |
locale ex = fixes a b c :: 'a |
|
795 |
begin |
|
796 |
||
797 |
ML {* |
|
798 |
val names = Variable.names_of @{context}; |
|
799 |
Name.invents names "a" 5; |
|
800 |
#1 (Name.variants ["x", "x", "a", "a", "'a", "'a"] names); |
|
801 |
*} |
|
802 |
||
803 |
end |
|
804 |
||
20476 | 805 |
|
34927
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
806 |
subsection {* Indexed names \label{sec:indexname} *} |
20476 | 807 |
|
808 |
text {* |
|
809 |
An \emph{indexed name} (or @{text "indexname"}) is a pair of a basic |
|
20488 | 810 |
name and a natural number. This representation allows efficient |
811 |
renaming by incrementing the second component only. The canonical |
|
812 |
way to rename two collections of indexnames apart from each other is |
|
813 |
this: determine the maximum index @{text "maxidx"} of the first |
|
814 |
collection, then increment all indexes of the second collection by |
|
815 |
@{text "maxidx + 1"}; the maximum index of an empty collection is |
|
816 |
@{text "-1"}. |
|
20476 | 817 |
|
34927
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
818 |
Occasionally, basic names are injected into the same pair type of |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
819 |
indexed names: then @{text "(x, -1)"} is used to encode the basic |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
820 |
name @{text "x"}. |
20488 | 821 |
|
822 |
\medskip Isabelle syntax observes the following rules for |
|
823 |
representing an indexname @{text "(x, i)"} as a packed string: |
|
20476 | 824 |
|
825 |
\begin{itemize} |
|
826 |
||
20479 | 827 |
\item @{text "?x"} if @{text "x"} does not end with a digit and @{text "i = 0"}, |
20476 | 828 |
|
829 |
\item @{text "?xi"} if @{text "x"} does not end with a digit, |
|
830 |
||
20488 | 831 |
\item @{text "?x.i"} otherwise. |
20476 | 832 |
|
833 |
\end{itemize} |
|
20470 | 834 |
|
34927
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
835 |
Indexnames may acquire large index numbers after several maxidx |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
836 |
shifts have been applied. Results are usually normalized towards |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
837 |
@{text "0"} at certain checkpoints, notably at the end of a proof. |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
838 |
This works by producing variants of the corresponding basic name |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
839 |
components. For example, the collection @{text "?x1, ?x7, ?x42"} |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
840 |
becomes @{text "?x, ?xa, ?xb"}. |
20476 | 841 |
*} |
842 |
||
843 |
text %mlref {* |
|
844 |
\begin{mldecls} |
|
39861
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
845 |
@{index_ML_type indexname: "string * int"} \\ |
20476 | 846 |
\end{mldecls} |
847 |
||
848 |
\begin{description} |
|
849 |
||
850 |
\item @{ML_type indexname} represents indexed names. This is an |
|
851 |
abbreviation for @{ML_type "string * int"}. The second component is |
|
852 |
usually non-negative, except for situations where @{text "(x, -1)"} |
|
34926 | 853 |
is used to inject basic names into this type. Other negative |
854 |
indexes should not be used. |
|
20476 | 855 |
|
856 |
\end{description} |
|
857 |
*} |
|
858 |
||
859 |
||
34927
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
860 |
subsection {* Long names \label{sec:long-name} *} |
20476 | 861 |
|
34927
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
862 |
text {* A \emph{long name} consists of a sequence of non-empty name |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
863 |
components. The packed representation uses a dot as separator, as |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
864 |
in ``@{text "A.b.c"}''. The last component is called \emph{base |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
865 |
name}, the remaining prefix is called \emph{qualifier} (which may be |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
866 |
empty). The qualifier can be understood as the access path to the |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
867 |
named entity while passing through some nested block-structure, |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
868 |
although our free-form long names do not really enforce any strict |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
869 |
discipline. |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
870 |
|
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
871 |
For example, an item named ``@{text "A.b.c"}'' may be understood as |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
872 |
a local entity @{text "c"}, within a local structure @{text "b"}, |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
873 |
within a global structure @{text "A"}. In practice, long names |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
874 |
usually represent 1--3 levels of qualification. User ML code should |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
875 |
not make any assumptions about the particular structure of long |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
876 |
names! |
20437 | 877 |
|
20476 | 878 |
The empty name is commonly used as an indication of unnamed |
34927
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
879 |
entities, or entities that are not entered into the corresponding |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
880 |
name space, whenever this makes any sense. The basic operations on |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
881 |
long names map empty names again to empty names. |
20437 | 882 |
*} |
883 |
||
20476 | 884 |
text %mlref {* |
885 |
\begin{mldecls} |
|
30365 | 886 |
@{index_ML Long_Name.base_name: "string -> string"} \\ |
887 |
@{index_ML Long_Name.qualifier: "string -> string"} \\ |
|
888 |
@{index_ML Long_Name.append: "string -> string -> string"} \\ |
|
889 |
@{index_ML Long_Name.implode: "string list -> string"} \\ |
|
890 |
@{index_ML Long_Name.explode: "string -> string list"} \\ |
|
20547 | 891 |
\end{mldecls} |
34927
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
892 |
|
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
893 |
\begin{description} |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
894 |
|
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
895 |
\item @{ML Long_Name.base_name}~@{text "name"} returns the base name |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
896 |
of a long name. |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
897 |
|
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
898 |
\item @{ML Long_Name.qualifier}~@{text "name"} returns the qualifier |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
899 |
of a long name. |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
900 |
|
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
901 |
\item @{ML Long_Name.append}~@{text "name\<^isub>1 name\<^isub>2"} appends two long |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
902 |
names. |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
903 |
|
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
904 |
\item @{ML Long_Name.implode}~@{text "names"} and @{ML |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
905 |
Long_Name.explode}~@{text "name"} convert between the packed string |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
906 |
representation and the explicit list form of long names. |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
907 |
|
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
908 |
\end{description} |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
909 |
*} |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
910 |
|
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
911 |
|
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
912 |
subsection {* Name spaces \label{sec:name-space} *} |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
913 |
|
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
914 |
text {* A @{text "name space"} manages a collection of long names, |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
915 |
together with a mapping between partially qualified external names |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
916 |
and fully qualified internal names (in both directions). Note that |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
917 |
the corresponding @{text "intern"} and @{text "extern"} operations |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
918 |
are mostly used for parsing and printing only! The @{text |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
919 |
"declare"} operation augments a name space according to the accesses |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
920 |
determined by a given binding, and a naming policy from the context. |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
921 |
|
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
922 |
\medskip A @{text "binding"} specifies details about the prospective |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
923 |
long name of a newly introduced formal entity. It consists of a |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
924 |
base name, prefixes for qualification (separate ones for system |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
925 |
infrastructure and user-space mechanisms), a slot for the original |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
926 |
source position, and some additional flags. |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
927 |
|
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
928 |
\medskip A @{text "naming"} provides some additional details for |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
929 |
producing a long name from a binding. Normally, the naming is |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
930 |
implicit in the theory or proof context. The @{text "full"} |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
931 |
operation (and its variants for different context types) produces a |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
932 |
fully qualified internal name to be entered into a name space. The |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
933 |
main equation of this ``chemical reaction'' when binding new |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
934 |
entities in a context is as follows: |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
935 |
|
39861
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
936 |
\medskip |
34927
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
937 |
\begin{tabular}{l} |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
938 |
@{text "binding + naming \<longrightarrow> long name + name space accesses"} |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
939 |
\end{tabular} |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
940 |
|
39861
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
941 |
\bigskip As a general principle, there is a separate name space for |
34927
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
942 |
each kind of formal entity, e.g.\ fact, logical constant, type |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
943 |
constructor, type class. It is usually clear from the occurrence in |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
944 |
concrete syntax (or from the scope) which kind of entity a name |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
945 |
refers to. For example, the very same name @{text "c"} may be used |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
946 |
uniformly for a constant, type constructor, and type class. |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
947 |
|
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
948 |
There are common schemes to name derived entities systematically |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
949 |
according to the name of the main logical entity involved, e.g.\ |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
950 |
fact @{text "c.intro"} for a canonical introduction rule related to |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
951 |
constant @{text "c"}. This technique of mapping names from one |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
952 |
space into another requires some care in order to avoid conflicts. |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
953 |
In particular, theorem names derived from a type constructor or type |
39839 | 954 |
class should get an additional suffix in addition to the usual |
955 |
qualification. This leads to the following conventions for derived |
|
956 |
names: |
|
957 |
||
958 |
\medskip |
|
959 |
\begin{tabular}{ll} |
|
960 |
logical entity & fact name \\\hline |
|
961 |
constant @{text "c"} & @{text "c.intro"} \\ |
|
962 |
type @{text "c"} & @{text "c_type.intro"} \\ |
|
963 |
class @{text "c"} & @{text "c_class.intro"} \\ |
|
964 |
\end{tabular} |
|
34927
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
965 |
*} |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
966 |
|
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
967 |
text %mlref {* |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
968 |
\begin{mldecls} |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
969 |
@{index_ML_type binding} \\ |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
970 |
@{index_ML Binding.empty: binding} \\ |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
971 |
@{index_ML Binding.name: "string -> binding"} \\ |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
972 |
@{index_ML Binding.qualify: "bool -> string -> binding -> binding"} \\ |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
973 |
@{index_ML Binding.prefix: "bool -> string -> binding -> binding"} \\ |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
974 |
@{index_ML Binding.conceal: "binding -> binding"} \\ |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
975 |
@{index_ML Binding.str_of: "binding -> string"} \\ |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
976 |
\end{mldecls} |
20547 | 977 |
\begin{mldecls} |
33174 | 978 |
@{index_ML_type Name_Space.naming} \\ |
979 |
@{index_ML Name_Space.default_naming: Name_Space.naming} \\ |
|
980 |
@{index_ML Name_Space.add_path: "string -> Name_Space.naming -> Name_Space.naming"} \\ |
|
981 |
@{index_ML Name_Space.full_name: "Name_Space.naming -> binding -> string"} \\ |
|
20547 | 982 |
\end{mldecls} |
983 |
\begin{mldecls} |
|
33174 | 984 |
@{index_ML_type Name_Space.T} \\ |
985 |
@{index_ML Name_Space.empty: "string -> Name_Space.T"} \\ |
|
986 |
@{index_ML Name_Space.merge: "Name_Space.T * Name_Space.T -> Name_Space.T"} \\ |
|
987 |
@{index_ML Name_Space.declare: "bool -> Name_Space.naming -> binding -> Name_Space.T -> |
|
988 |
string * Name_Space.T"} \\ |
|
989 |
@{index_ML Name_Space.intern: "Name_Space.T -> string -> string"} \\ |
|
990 |
@{index_ML Name_Space.extern: "Name_Space.T -> string -> string"} \\ |
|
34927
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
991 |
@{index_ML Name_Space.is_concealed: "Name_Space.T -> string -> bool"} |
20476 | 992 |
\end{mldecls} |
20437 | 993 |
|
20476 | 994 |
\begin{description} |
995 |
||
34927
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
996 |
\item @{ML_type binding} represents the abstract concept of name |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
997 |
bindings. |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
998 |
|
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
999 |
\item @{ML Binding.empty} is the empty binding. |
20476 | 1000 |
|
34927
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
1001 |
\item @{ML Binding.name}~@{text "name"} produces a binding with base |
39832 | 1002 |
name @{text "name"}. Note that this lacks proper source position |
1003 |
information; see also the ML antiquotation @{ML_antiquotation |
|
1004 |
binding}. |
|
34927
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
1005 |
|
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
1006 |
\item @{ML Binding.qualify}~@{text "mandatory name binding"} |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
1007 |
prefixes qualifier @{text "name"} to @{text "binding"}. The @{text |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
1008 |
"mandatory"} flag tells if this name component always needs to be |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
1009 |
given in name space accesses --- this is mostly @{text "false"} in |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
1010 |
practice. Note that this part of qualification is typically used in |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
1011 |
derived specification mechanisms. |
20437 | 1012 |
|
34927
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
1013 |
\item @{ML Binding.prefix} is similar to @{ML Binding.qualify}, but |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
1014 |
affects the system prefix. This part of extra qualification is |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
1015 |
typically used in the infrastructure for modular specifications, |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
1016 |
notably ``local theory targets'' (see also \chref{ch:local-theory}). |
20437 | 1017 |
|
34927
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
1018 |
\item @{ML Binding.conceal}~@{text "binding"} indicates that the |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
1019 |
binding shall refer to an entity that serves foundational purposes |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
1020 |
only. This flag helps to mark implementation details of |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
1021 |
specification mechanism etc. Other tools should not depend on the |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
1022 |
particulars of concealed entities (cf.\ @{ML |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
1023 |
Name_Space.is_concealed}). |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
1024 |
|
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
1025 |
\item @{ML Binding.str_of}~@{text "binding"} produces a string |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
1026 |
representation for human-readable output, together with some formal |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
1027 |
markup that might get used in GUI front-ends, for example. |
20476 | 1028 |
|
33174 | 1029 |
\item @{ML_type Name_Space.naming} represents the abstract concept of |
20476 | 1030 |
a naming policy. |
20437 | 1031 |
|
33174 | 1032 |
\item @{ML Name_Space.default_naming} is the default naming policy. |
20476 | 1033 |
In a theory context, this is usually augmented by a path prefix |
1034 |
consisting of the theory name. |
|
1035 |
||
33174 | 1036 |
\item @{ML Name_Space.add_path}~@{text "path naming"} augments the |
20488 | 1037 |
naming policy by extending its path component. |
20437 | 1038 |
|
33174 | 1039 |
\item @{ML Name_Space.full_name}~@{text "naming binding"} turns a |
30281
9ad15d8ed311
renamed NameSpace.base to NameSpace.base_name (in accordance with "full_name");
wenzelm
parents:
30272
diff
changeset
|
1040 |
name binding (usually a basic name) into the fully qualified |
29008 | 1041 |
internal name, according to the given naming policy. |
20476 | 1042 |
|
33174 | 1043 |
\item @{ML_type Name_Space.T} represents name spaces. |
20476 | 1044 |
|
33174 | 1045 |
\item @{ML Name_Space.empty}~@{text "kind"} and @{ML Name_Space.merge}~@{text |
20488 | 1046 |
"(space\<^isub>1, space\<^isub>2)"} are the canonical operations for |
1047 |
maintaining name spaces according to theory data management |
|
33174 | 1048 |
(\secref{sec:context-data}); @{text "kind"} is a formal comment |
1049 |
to characterize the purpose of a name space. |
|
20437 | 1050 |
|
33174 | 1051 |
\item @{ML Name_Space.declare}~@{text "strict naming bindings |
1052 |
space"} enters a name binding as fully qualified internal name into |
|
1053 |
the name space, with external accesses determined by the naming |
|
1054 |
policy. |
|
20476 | 1055 |
|
33174 | 1056 |
\item @{ML Name_Space.intern}~@{text "space name"} internalizes a |
20476 | 1057 |
(partially qualified) external name. |
20437 | 1058 |
|
20488 | 1059 |
This operation is mostly for parsing! Note that fully qualified |
20476 | 1060 |
names stemming from declarations are produced via @{ML |
33174 | 1061 |
"Name_Space.full_name"} and @{ML "Name_Space.declare"} |
29008 | 1062 |
(or their derivatives for @{ML_type theory} and |
20488 | 1063 |
@{ML_type Proof.context}). |
20437 | 1064 |
|
33174 | 1065 |
\item @{ML Name_Space.extern}~@{text "space name"} externalizes a |
20476 | 1066 |
(fully qualified) internal name. |
1067 |
||
30281
9ad15d8ed311
renamed NameSpace.base to NameSpace.base_name (in accordance with "full_name");
wenzelm
parents:
30272
diff
changeset
|
1068 |
This operation is mostly for printing! User code should not rely on |
9ad15d8ed311
renamed NameSpace.base to NameSpace.base_name (in accordance with "full_name");
wenzelm
parents:
30272
diff
changeset
|
1069 |
the precise result too much. |
20476 | 1070 |
|
34927
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
1071 |
\item @{ML Name_Space.is_concealed}~@{text "space name"} indicates |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
1072 |
whether @{text "name"} refers to a strictly private entity that |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
1073 |
other tools are supposed to ignore! |
c4c02ac736a6
more details on long names, binding/naming, name space;
wenzelm
parents:
34926
diff
changeset
|
1074 |
|
20476 | 1075 |
\end{description} |
1076 |
*} |
|
30272 | 1077 |
|
39832 | 1078 |
text %mlantiq {* |
1079 |
\begin{matharray}{rcl} |
|
1080 |
@{ML_antiquotation_def "binding"} & : & @{text ML_antiquotation} \\ |
|
1081 |
\end{matharray} |
|
1082 |
||
1083 |
\begin{rail} |
|
1084 |
'binding' name |
|
1085 |
; |
|
1086 |
\end{rail} |
|
1087 |
||
1088 |
\begin{description} |
|
1089 |
||
1090 |
\item @{text "@{binding name}"} produces a binding with base name |
|
1091 |
@{text "name"} and the source position taken from the concrete |
|
1092 |
syntax of this antiquotation. In many situations this is more |
|
1093 |
appropriate than the more basic @{ML Binding.name} function. |
|
1094 |
||
1095 |
\end{description} |
|
1096 |
*} |
|
1097 |
||
39833 | 1098 |
text %mlex {* The following example yields the source position of some |
1099 |
concrete binding inlined into the text. |
|
1100 |
*} |
|
1101 |
||
1102 |
ML {* Binding.pos_of @{binding here} *} |
|
1103 |
||
39861
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
1104 |
text {* \medskip That position can be also printed in a message as |
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
1105 |
follows. *} |
39833 | 1106 |
|
1107 |
ML_command {* |
|
1108 |
writeln |
|
1109 |
("Look here" ^ Position.str_of (Binding.pos_of @{binding here})) |
|
1110 |
*} |
|
1111 |
||
39861
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
1112 |
text {* This illustrates a key virtue of formalized bindings as |
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
1113 |
opposed to raw specifications of base names: the system can use this |
b8d89db3e238
use continental paragraph style, which works better with mixture of (in)formal text;
wenzelm
parents:
39857
diff
changeset
|
1114 |
additional information for advanced feedback given to the user. *} |
39833 | 1115 |
|
18537 | 1116 |
end |