(* $Id$ *)

theory prelim imports base begin

chapter {* Preliminaries *}

section {* Contexts \label{sec:context} *}

text {*
A logical context represents the background that is required for
formulating statements and composing proofs. It acts as a medium to
produce formal content, depending on earlier material (declarations,
results etc.).

For example, derivations within the Isabelle/Pure logic can be
described as a judgment @{text "\<Gamma> \<turnstile>\<^sub>\<Theta> \<phi>"}, which means that a
proposition @{text "\<phi>"} is derivable from hypotheses @{text "\<Gamma>"}
within the theory @{text "\<Theta>"}. There are logical reasons for
keeping @{text "\<Theta>"} and @{text "\<Gamma>"} separate: theories can be
liberal about supporting type constructors and schematic
polymorphism of constants and axioms, while the inner calculus of
@{text "\<Gamma> \<turnstile> \<phi>"} is strictly limited to Simple Type Theory (with
fixed type variables in the assumptions).

\medskip Contexts and derivations are linked by the following key
principles:

\begin{itemize}

\item Transfer: monotonicity of derivations admits results to be
transferred into a \emph{larger} context, i.e.\ @{text "\<Gamma> \<turnstile>\<^sub>\<Theta>
\<phi>"} implies @{text "\<Gamma>' \<turnstile>\<^sub>\<Theta>\<^sub>' \<phi>"} for contexts @{text "\<Theta>'
\<supseteq> \<Theta>"} and @{text "\<Gamma>' \<supseteq> \<Gamma>"}.

\item Export: discharge of hypotheses admits results to be exported
into a \emph{smaller} context, i.e.\ @{text "\<Gamma>' \<turnstile>\<^sub>\<Theta> \<phi>"}
implies @{text "\<Gamma> \<turnstile>\<^sub>\<Theta> \<Delta> \<Longrightarrow> \<phi>"} where @{text "\<Gamma>' \<supseteq> \<Gamma>"} and
@{text "\<Delta> = \<Gamma>' - \<Gamma>"}. Note that @{text "\<Theta>"} remains unchanged here,
only the @{text "\<Gamma>"} part is affected.

\end{itemize}
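
\medskip As a small worked instance of the two principles (the
propositions @{text "A"}, @{text "B"}, @{text "C"} are hypothetical,
chosen only for illustration), take @{text "\<Gamma> = {A}"} and @{text
"\<Gamma>' = {A, B}"}, so that @{text "\<Delta> = \<Gamma>' - \<Gamma> = {B}"}. Export then
reads:

\begin{center}
@{text "A, B \<turnstile>\<^sub>\<Theta> \<phi>"} \quad implies \quad @{text "A \<turnstile>\<^sub>\<Theta> B \<Longrightarrow> \<phi>"}
\end{center}

\noindent Transfer may subsequently move the exported result into a
larger context again, e.g.\ @{text "A, C \<turnstile>\<^sub>\<Theta>\<^sub>' B \<Longrightarrow> \<phi>"} for any
@{text "\<Theta>' \<supseteq> \<Theta>"}.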

\medskip By modeling the main characteristics of the primitive
@{text "\<Theta>"} and @{text "\<Gamma>"} above, and abstracting over any
particular logical content, we arrive at the fundamental notions of
\emph{theory context} and \emph{proof context} in Isabelle/Isar.
These implement a certain policy to manage arbitrary \emph{context
data}. There is a strongly-typed mechanism to declare new kinds of
data at compile time.

The internal bootstrap process of Isabelle/Pure eventually reaches a
stage where certain data slots provide the logical content of @{text
"\<Theta>"} and @{text "\<Gamma>"} sketched above, but this does not stop there!
Various additional data slots support all kinds of mechanisms that
are not necessarily part of the core logic.

For example, there would be data for canonical introduction and
elimination rules for arbitrary operators (depending on the
object-logic and application), which enables users to perform
standard proof steps implicitly (cf.\ the @{text "rule"} method
\cite{isabelle-isar-ref}).

\medskip Thus Isabelle/Isar is able to bring forth more and more
concepts successively. In particular, an object-logic like
Isabelle/HOL continues the Isabelle/Pure setup by adding specific
components for automated reasoning (classical reasoner, tableau
prover, structured induction etc.) and derived specification
mechanisms (inductive predicates, recursive functions etc.). All of
this is ultimately based on the generic data management by theory
and proof contexts introduced here.
*}


subsection {* Theory context \label{sec:context-theory} *}

text {*
\glossary{Theory}{FIXME}

A \emph{theory} is a data container with an explicit name and a unique
identifier. Theories are related by a (nominal) sub-theory
relation, which corresponds to the dependency graph of the original
construction; each theory is derived from a certain sub-graph of
ancestor theories.

The @{text "merge"} operation produces the least upper bound of two
theories, which actually degenerates into absorption of one theory
into the other (due to the nominal sub-theory relation).

The @{text "begin"} operation starts a new theory by importing
several parent theories and entering a special @{text "draft"} mode,
which is sustained until the final @{text "end"} operation. A draft
theory acts like a linear type, where updates invalidate earlier
versions. An invalidated draft is called ``stale''.

The @{text "checkpoint"} operation produces an intermediate stepping
stone that will survive the next update: both the original and the
changed theory remain valid and are related by the sub-theory
relation. Checkpointing essentially recovers purely functional
theory values, at the expense of some extra internal bookkeeping.

The @{text "copy"} operation produces an auxiliary version that has
the same data content, but is unrelated to the original: updates of
the copy do not affect the original, neither does the sub-theory
relation hold.

\medskip The example in \figref{fig:ex-theory} below shows a theory
graph derived from @{text "Pure"}, with theory @{text "Length"}
importing @{text "Nat"} and @{text "List"}. The body of @{text
"Length"} consists of a sequence of updates, working mostly on
drafts. Intermediate checkpoints may occur as well, due to the
history mechanism provided by the Isar top-level, cf.\
\secref{sec:isar-toplevel}.

\begin{figure}[htb]
\begin{center}
\begin{tabular}{rcccl}
& & @{text "Pure"} \\
& & @{text "\<down>"} \\
& & @{text "FOL"} \\
& $\swarrow$ & & $\searrow$ & \\
@{text "Nat"} & & & & @{text "List"} \\
& $\searrow$ & & $\swarrow$ \\
& & @{text "Length"} \\
& & \multicolumn{3}{l}{~~$\isarkeyword{imports}$} \\
& & \multicolumn{3}{l}{~~$\isarkeyword{begin}$} \\
& & $\vdots$~~ \\
& & @{text "\<bullet>"}~~ \\
& & $\vdots$~~ \\
& & @{text "\<bullet>"}~~ \\
& & $\vdots$~~ \\
& & \multicolumn{3}{l}{~~$\isarkeyword{end}$} \\
\end{tabular}
\caption{A theory definition depending on ancestors}\label{fig:ex-theory}
\end{center}
\end{figure}

\medskip There is a separate notion of \emph{theory reference} for
maintaining a live link to an evolving theory context: updates on
drafts are propagated automatically. Dynamic updating stops after
an explicit @{text "end"} only.

Derived entities may store a theory reference in order to indicate
the context they belong to. This implicitly assumes monotonic
reasoning, because the referenced context may become larger without
further notice.
*}

text %mlref {*
\begin{mldecls}
@{index_ML_type theory} \\
@{index_ML Theory.subthy: "theory * theory -> bool"} \\
@{index_ML Theory.merge: "theory * theory -> theory"} \\
@{index_ML Theory.checkpoint: "theory -> theory"} \\
@{index_ML Theory.copy: "theory -> theory"} \\[1ex]
@{index_ML_type theory_ref} \\
@{index_ML Theory.self_ref: "theory -> theory_ref"} \\
@{index_ML Theory.deref: "theory_ref -> theory"} \\
\end{mldecls}

\begin{description}

\item @{ML_type theory} represents theory contexts. This is
essentially a linear type! Most operations destroy the original
version, which then becomes ``stale''.

\item @{ML "Theory.subthy"}~@{text "(thy\<^sub>1, thy\<^sub>2)"}
compares theories according to the inherent graph structure of the
construction. This sub-theory relation is a nominal approximation
of inclusion (@{text "\<subseteq>"}) of the corresponding content.

\item @{ML "Theory.merge"}~@{text "(thy\<^sub>1, thy\<^sub>2)"}
absorbs one theory into the other. This fails for unrelated
theories!

\item @{ML "Theory.checkpoint"}~@{text "thy"} produces a safe
stepping stone in the linear development of @{text "thy"}. The next
update will result in two related, valid theories.

\item @{ML "Theory.copy"}~@{text "thy"} produces a variant of @{text
"thy"} that holds a copy of the same data. The result is not
related to the original; the original is unchanged.

\item @{ML_type theory_ref} represents a sliding reference to an
always valid theory; updates on the original are propagated
automatically.

\item @{ML "Theory.self_ref"}~@{text "thy"} and @{ML
"Theory.deref"}~@{text "thy_ref"} convert between @{ML_type
"theory"} and @{ML_type "theory_ref"}. As the referenced theory
evolves monotonically over time, later invocations of @{ML
"Theory.deref"} may refer to a larger context.

\end{description}
*}
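
text {*
\medskip The checkpoint discipline described above may be sketched as
follows. This is an illustration only, built from the operations
declared above; the name @{text "checkpointed_update"} and its
argument @{text "update"} (standing for any function on theories) are
hypothetical and not part of the Isabelle sources.
*}

ML {*
(* update: hypothetical placeholder for any operation of type theory -> theory *)
fun checkpointed_update (update: theory -> theory) (thy: theory) =
  let
    val thy1 = Theory.checkpoint thy;  (* safe stepping stone *)
    val thy2 = update thy1;            (* thy1 is meant to survive this update *)
  in (thy1, thy2) end;
*}

text {*
\noindent According to the description of @{ML "Theory.checkpoint"}
above, both components of the result should remain valid, and
@{ML "Theory.subthy"} applied to the pair is expected to hold.
Without the checkpoint, the original draft would have to be
considered stale after the update.
*}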


subsection {* Proof context \label{sec:context-proof} *}

text {*
\glossary{Proof context}{The static context of a structured proof,
acts like a local ``theory'' of the current portion of Isar proof
text, generalizes the idea of local hypotheses @{text "\<Gamma>"} in
judgments @{text "\<Gamma> \<turnstile> \<phi>"} of natural deduction calculi. There is a
generic notion of introducing and discharging hypotheses.
Arbitrary auxiliary context data may be adjoined.}

A proof context is a container for pure data with a back-reference
to the theory it belongs to. The @{text "init"} operation creates a
proof context from a given theory. Modifications to draft theories
are propagated to the proof context as usual, but there is also an
explicit @{text "transfer"} operation to force resynchronization
with more substantial updates to the underlying theory. The actual
context data does not require any special bookkeeping, thanks to the
lack of destructive features.

Entities derived in a proof context need to record inherent logical
requirements explicitly, since there is no separate context
identification as for theories. For example, hypotheses used in
primitive derivations (cf.\ \secref{sec:thms}) are recorded
separately within the sequent @{text "\<Gamma> \<turnstile> \<phi>"}, just to make double
sure. Results could still leak into an alien proof context due to
programming errors, but Isabelle/Isar includes some extra validity
checks in critical positions, notably at the end of a sub-proof.

Proof contexts may be manipulated arbitrarily, although the common
discipline is to follow block structure as a mental model: a given
context is extended consecutively, and results are exported back
into the original context. Note that the Isar proof states model
block-structured reasoning explicitly, using a stack of proof
contexts internally, cf.\ \secref{sec:isar-proof-state}.
*}

text %mlref {*
\begin{mldecls}
@{index_ML_type Proof.context} \\
@{index_ML ProofContext.init: "theory -> Proof.context"} \\
@{index_ML ProofContext.theory_of: "Proof.context -> theory"} \\
@{index_ML ProofContext.transfer: "theory -> Proof.context -> Proof.context"} \\
\end{mldecls}

\begin{description}

\item @{ML_type Proof.context} represents proof contexts. Elements
of this type are essentially pure values, with a sliding reference
to the background theory.

\item @{ML ProofContext.init}~@{text "thy"} produces a proof context
derived from @{text "thy"}, initializing all data.

\item @{ML ProofContext.theory_of}~@{text "ctxt"} selects the
background theory from @{text "ctxt"}, dereferencing its internal
@{ML_type theory_ref}.

\item @{ML ProofContext.transfer}~@{text "thy ctxt"} promotes the
background theory of @{text "ctxt"} to the super theory @{text
"thy"}.

\end{description}
*}


subsection {* Generic contexts \label{sec:generic-context} *}

text {*
A generic context is the disjoint sum of either a theory or proof
context. Occasionally, this enables uniform treatment of generic
context data, typically extra-logical information. Operations on
generic contexts include the usual injections, partial selections,
and combinators for lifting operations on either component of the
disjoint sum.

Moreover, there are total operations @{text "theory_of"} and @{text
"proof_of"} to convert a generic context into either kind: a theory
can always be selected from the sum, while a proof context might
have to be constructed by an ad-hoc @{text "init"} operation.
*}

text %mlref {*
\begin{mldecls}
@{index_ML_type Context.generic} \\
@{index_ML Context.theory_of: "Context.generic -> theory"} \\
@{index_ML Context.proof_of: "Context.generic -> Proof.context"} \\
\end{mldecls}

\begin{description}

\item @{ML_type Context.generic} is the direct sum of @{ML_type
"theory"} and @{ML_type "Proof.context"}, with the datatype
constructors @{ML "Context.Theory"} and @{ML "Context.Proof"}.

\item @{ML Context.theory_of}~@{text "context"} always produces a
theory from the generic @{text "context"}, using @{ML
"ProofContext.theory_of"} as required.

\item @{ML Context.proof_of}~@{text "context"} always produces a
proof context from the generic @{text "context"}, using @{ML
"ProofContext.init"} as required (note that this re-initializes the
context data with each invocation).

\end{description}
*}
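
text {*
\medskip The two total selectors can be restated by case analysis on
the datatype constructors named above. The following is merely a
sketch of the behaviour described in the text, not the actual library
code; the names @{text "theory_of_sketch"} and @{text
"proof_of_sketch"} are hypothetical.
*}

ML {*
(* a theory is either present directly, or selected from the proof context *)
fun theory_of_sketch (Context.Theory thy) = thy
  | theory_of_sketch (Context.Proof ctxt) = ProofContext.theory_of ctxt;

(* a proof context is either present directly, or built afresh from the theory *)
fun proof_of_sketch (Context.Theory thy) = ProofContext.init thy
  | proof_of_sketch (Context.Proof ctxt) = ctxt;
*}

text {*
\noindent The first clause of @{text "proof_of_sketch"} makes the
asymmetry visible: every call on a theory goes through @{ML
"ProofContext.init"}, so proof data is initialized anew each time.
*}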


subsection {* Context data \label{sec:context-data} *}

text {*
The main purpose of theory and proof contexts is to manage arbitrary
data. New data types can be declared incrementally at compile time.
There are separate declaration mechanisms for any of the three kinds
of contexts: theory, proof, generic.

\paragraph{Theory data} may refer to destructive entities, which are
maintained in direct correspondence to the linear evolution of
theory values, including explicit copies.\footnote{Most existing
instances of destructive theory data are merely historical relics
(e.g.\ the destructive theorem storage, and destructive hints for
the Simplifier and Classical rules).} A theory data declaration
needs to implement the following specification (depending on type
@{text "T"}):

\medskip
\begin{tabular}{ll}
@{text "name: string"} \\
@{text "empty: T"} & initial value \\
@{text "copy: T \<rightarrow> T"} & refresh impure data \\
@{text "extend: T \<rightarrow> T"} & re-initialize on import \\
@{text "merge: T \<times> T \<rightarrow> T"} & join on import \\
@{text "print: T \<rightarrow> unit"} & diagnostic output \\
\end{tabular}
\medskip

\noindent The @{text "name"} acts as a comment for diagnostic
messages; @{text "copy"} is just the identity for pure data; @{text
"extend"} acts like a unitary version of @{text "merge"}, both
should also include the functionality of @{text "copy"} for impure
data.

\paragraph{Proof context data} is purely functional. A declaration
needs to implement the following specification:

\medskip
\begin{tabular}{ll}
@{text "name: string"} \\
@{text "init: theory \<rightarrow> T"} & produce initial value \\
@{text "print: T \<rightarrow> unit"} & diagnostic output \\
\end{tabular}
\medskip

\noindent The @{text "init"} operation is supposed to produce a pure
value from the given background theory. The remainder is analogous
to theory data.

\paragraph{Generic data} provides a hybrid interface for both theory
and proof data. The declaration is essentially the same as for
(pure) theory data, without @{text "copy"}, though. The @{text
"init"} operation for proof contexts merely selects the current data
value from the background theory.

\bigskip In any case, a data declaration of type @{text "T"} results
in the following interface:

\medskip
\begin{tabular}{ll}
@{text "init: theory \<rightarrow> theory"} \\
@{text "get: context \<rightarrow> T"} \\
@{text "put: T \<rightarrow> context \<rightarrow> context"} \\
@{text "map: (T \<rightarrow> T) \<rightarrow> context \<rightarrow> context"} \\
@{text "print: context \<rightarrow> unit"}
\end{tabular}
\medskip

\noindent Here @{text "init"} needs to be applied to the current
theory context once, in order to register the initial setup. The
other operations provide access for the particular kind of context
(theory, proof, or generic context). Note that this is a safe
interface: there is no other way to access the corresponding data
slot of a context. By keeping these operations private, a component
may maintain abstract values authentically, without other components
interfering.
*}
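
text {*
\medskip To make the theory data specification more concrete, the
following sketch restates the table above as a plain ML signature and
gives one pure instance. The signature @{text
"THEORY_DATA_SPEC_SKETCH"} and the structure @{text
"Names_Spec_Sketch"} are hypothetical: they mirror the specification
as described here and are not claimed to coincide with the actual
argument signature of the declaration functors.
*}

ML {*
(* hypothetical signature mirroring the theory data specification table *)
signature THEORY_DATA_SPEC_SKETCH =
sig
  type T
  val name: string
  val empty: T
  val copy: T -> T
  val extend: T -> T
  val merge: T * T -> T
  val print: T -> unit
end;

(* example instance: a collection of strings, kept as a duplicate-free list *)
structure Names_Spec_Sketch: THEORY_DATA_SPEC_SKETCH =
struct
  type T = string list;
  val name = "sketch/names";
  val empty = [];
  fun copy xs = xs;      (* identity, since the data is pure *)
  fun extend xs = xs;    (* nothing to re-initialize on import *)
  fun merge (xs, ys) =   (* join: keep xs, add members of ys not already in xs *)
    xs @ List.filter (fn y => not (List.exists (fn x => x = y) xs)) ys;
  fun print xs = TextIO.print (String.concatWith ", " xs ^ "\n");
end;
*}

text {*
\noindent Because the data is pure, both @{text "copy"} and @{text
"extend"} degenerate to the identity, and only @{text "merge"}
carries real content.
*}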

text %mlref {*
\begin{mldecls}
@{index_ML_functor TheoryDataFun} \\
@{index_ML_functor ProofDataFun} \\
@{index_ML_functor GenericDataFun} \\
\end{mldecls}

\begin{description}

\item @{ML_functor TheoryDataFun}@{text "(spec)"} declares data for
type @{ML_type theory} according to the specification provided as
argument structure. The resulting structure provides data init and
access operations as described above.

\item @{ML_functor ProofDataFun}@{text "(spec)"} is analogous to
@{ML_functor TheoryDataFun} for type @{ML_type Proof.context}.

\item @{ML_functor GenericDataFun}@{text "(spec)"} is analogous to
@{ML_functor TheoryDataFun} for type @{ML_type Context.generic}.

\end{description}
*}


section {* Names *}

text {*
In principle, a name is just a string, but there are various
conventions for encoding additional structure. For example, ``@{text
"Foo.bar.baz"}'' is considered as a qualified name consisting of
three basic name components. The individual constituents of a name
may have further substructure, e.g.\ the string
``\verb,\,\verb,<alpha>,'' encodes a single symbol.
*}


subsection {* Strings of symbols *}

text {*
\glossary{Symbol}{The smallest unit of text in Isabelle, subsumes
plain ASCII characters as well as an infinite collection of named
symbols (for greek, math etc.).}

A \emph{symbol} constitutes the smallest textual unit in Isabelle
--- raw characters are normally not encountered at all. Isabelle
strings consist of a sequence of symbols, represented as a packed
string or a list of strings. Each symbol is in itself a small
string, which has one of the following forms:

\begin{enumerate}

\item a single ASCII character ``@{text "c"}'', for example
``\verb,a,'',

\item a regular symbol ``\verb,\,\verb,<,@{text "ident"}\verb,>,'',
for example ``\verb,\,\verb,<alpha>,'',

\item a control symbol ``\verb,\,\verb,<^,@{text "ident"}\verb,>,'',
for example ``\verb,\,\verb,<^bold>,'',

\item a raw symbol ``\verb,\,\verb,<^raw:,@{text text}\verb,>,''
where @{text text} consists of printable characters excluding
``\verb,.,'' and ``\verb,>,'', for example
``\verb,\,\verb,<^raw:$\sum_{i = 1}^n$>,'',

\item a numbered raw control symbol ``\verb,\,\verb,<^raw,@{text
n}\verb,>,'' where @{text n} consists of digits, for example
``\verb,\,\verb,<^raw42>,''.

\end{enumerate}

\noindent The @{text "ident"} syntax for symbol names is @{text
"letter (letter | digit)\<^sup>*"}, where @{text "letter =
A..Za..z"} and @{text "digit = 0..9"}. There are infinitely many
regular symbols and control symbols, but a fixed collection of
standard symbols is treated specifically. For example,
``\verb,\,\verb,<alpha>,'' is classified as a letter, which means it
may occur within regular Isabelle identifiers.

Since the character set underlying Isabelle symbols is 7-bit ASCII
and 8-bit characters are passed through transparently, Isabelle may
also process Unicode/UCS data in UTF-8 encoding. Unicode provides
its own collection of mathematical symbols, but there is no built-in
link to the standard collection of Isabelle.

\medskip Output of Isabelle symbols depends on the print mode
(\secref{FIXME}). For example, the standard {\LaTeX} setup of the
Isabelle document preparation system would present
``\verb,\,\verb,<alpha>,'' as @{text "\<alpha>"}, and
``\verb,\,\verb,<^bold>,\verb,\,\verb,<alpha>,'' as @{text
"\<^bold>\<alpha>"}.
*}

text %mlref {*
\begin{mldecls}
@{index_ML_type "Symbol.symbol"} \\
@{index_ML Symbol.explode: "string -> Symbol.symbol list"} \\
@{index_ML Symbol.is_letter: "Symbol.symbol -> bool"} \\
@{index_ML Symbol.is_digit: "Symbol.symbol -> bool"} \\
@{index_ML Symbol.is_quasi: "Symbol.symbol -> bool"} \\
@{index_ML Symbol.is_blank: "Symbol.symbol -> bool"} \\[1ex]
@{index_ML_type "Symbol.sym"} \\
@{index_ML Symbol.decode: "Symbol.symbol -> Symbol.sym"} \\
\end{mldecls}

\begin{description}

\item @{ML_type "Symbol.symbol"} represents individual Isabelle
symbols; this is an alias for @{ML_type "string"}.

\item @{ML "Symbol.explode"}~@{text "str"} produces a symbol list
from the packed form. This function supersedes @{ML
"String.explode"} for virtually all purposes of manipulating text in
Isabelle!

\item @{ML "Symbol.is_letter"}, @{ML "Symbol.is_digit"}, @{ML
"Symbol.is_quasi"}, @{ML "Symbol.is_blank"} classify standard
symbols according to fixed syntactic conventions of Isabelle, cf.\
\cite{isabelle-isar-ref}.

\item @{ML_type "Symbol.sym"} is a concrete datatype that represents
the different kinds of symbols explicitly, with constructors @{ML
"Symbol.Char"}, @{ML "Symbol.Sym"}, @{ML "Symbol.Ctrl"}, @{ML
"Symbol.Raw"}.

\item @{ML "Symbol.decode"} converts the string representation of a
symbol into the datatype version.

\end{description}
*}
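
text {*
\medskip The difference between characters and symbols can be
illustrated by a simplified stand-alone model of symbol explosion.
The function @{text "explode_symbols"} below is hypothetical and
deliberately naive: it treats any backslash followed by an opening
angle bracket, up to the next closing angle bracket, as one symbol,
without checking the @{text "ident"} syntax or the restrictions on
raw symbols. It is not the implementation of @{ML "Symbol.explode"}.
*}

ML {*
(* hypothetical sketch: split a packed string into symbol-like chunks *)
fun explode_symbols (s: string) : string list =
  let
    val n = String.size s;
    (* position of the first closing angle bracket at or after i, if any *)
    fun find_close i =
      if i >= n then NONE
      else if String.sub (s, i) = #">" then SOME i
      else find_close (i + 1);
    fun scan i acc =
      if i >= n then List.rev acc
      else if i + 1 < n andalso String.sub (s, i) = #"\\"
          andalso String.sub (s, i + 1) = #"<" then
        (case find_close (i + 2) of
          SOME j => scan (j + 1) (String.substring (s, i, j - i + 1) :: acc)
        | NONE => scan (i + 1) (String.str (String.sub (s, i)) :: acc))
      else scan (i + 1) (String.str (String.sub (s, i)) :: acc);
  in scan 0 [] end;

(* explode_symbols "a\\<alpha>b" yields the three symbols
   "a", "\\<alpha>", "b", where String.explode yields ten characters *)
*}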


subsection {* Basic names \label{sec:basic-names} *}

text {*
A \emph{basic name} essentially consists of a single Isabelle
identifier. There are conventions to mark separate classes of basic
names, by attaching a suffix of underscores (@{text "_"}): one
underscore means \emph{internal name}, two underscores means
\emph{Skolem name}, three underscores means \emph{internal Skolem
name}.

For example, the basic name @{text "foo"} has the internal version
@{text "foo_"}, with Skolem versions @{text "foo__"} and @{text
"foo___"}, respectively.

These special versions provide copies of the basic name space, apart
from anything that normally appears in the user text. For example,
system generated variables in Isar proof contexts are usually marked
as internal, which prevents mysterious name references like @{text
"xaa"} from appearing in the text.

\medskip Manipulating binding scopes often requires on-the-fly
renamings. A \emph{name context} contains a collection of already
used names. The @{text "declare"} operation adds names to the
context.

The @{text "invents"} operation derives a number of fresh names from
a given starting point. For example, the first three names derived
from @{text "a"} are @{text "a"}, @{text "b"}, @{text "c"}.

The @{text "variants"} operation produces fresh names by
incrementing tentative names as base-26 numbers (with digits @{text
"a..z"}) until all clashes are resolved. For example, name @{text
"foo"} results in variants @{text "fooa"}, @{text "foob"}, @{text
"fooc"}, \dots, @{text "fooaa"}, @{text "fooab"} etc.; each renaming
step picks the next unused variant from this sequence.
*}

text %mlref {*
\begin{mldecls}
@{index_ML Name.internal: "string -> string"} \\
@{index_ML Name.skolem: "string -> string"} \\[1ex]
@{index_ML_type Name.context} \\
@{index_ML Name.context: Name.context} \\
@{index_ML Name.declare: "string -> Name.context -> Name.context"} \\
@{index_ML Name.invents: "Name.context -> string -> int -> string list"} \\
@{index_ML Name.variants: "string list -> Name.context -> string list * Name.context"} \\
\end{mldecls}

\begin{description}

\item @{ML Name.internal}~@{text "name"} produces an internal name
by adding one underscore.

\item @{ML Name.skolem}~@{text "name"} produces a Skolem name by
adding two underscores.

\item @{ML_type Name.context} represents the context of already used
names; the initial value is @{ML "Name.context"}.

\item @{ML Name.declare}~@{text "name"} enters a used name into the
context.

\item @{ML Name.invents}~@{text "context name n"} produces @{text
"n"} fresh names derived from @{text "name"}.

\item @{ML Name.variants}~@{text "names context"} produces fresh
variants of @{text "names"}; the result is entered into the context.

\end{description}
*}
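
text {*
\medskip The variant sequence described above can be modeled in a few
lines of stand-alone ML. This is a sketch under simplifying
assumptions: the name context is a plain list of used names, and the
functions @{text "bump_suffix"} and @{text "variant_sketch"} are
hypothetical. It follows the sequence given in the text and is not
claimed to match @{ML "Name.variants"} in every detail.
*}

ML {*
(* increment a reversed suffix over the digits a..z, with carry *)
fun bump_rev [] = [#"a"]
  | bump_rev (#"z" :: cs) = #"a" :: bump_rev cs
  | bump_rev (c :: cs) = Char.succ c :: cs;

(* next suffix in the sequence "", a, b, ..., z, aa, ab, ... *)
fun bump_suffix (suffix: string) : string =
  String.implode (List.rev (bump_rev (List.rev (String.explode suffix))));

(* first unused name among name, namea, nameb, ... *)
fun variant_sketch (used: string list) (name: string) : string =
  let
    fun is_used x = List.exists (fn y => y = x) used;
    fun search suffix =
      let val x = name ^ suffix
      in if is_used x then search (bump_suffix suffix) else x end;
  in search "" end;

(* variant_sketch ["foo", "fooa"] "foo" yields "foob" *)
*}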


subsection {* Indexed names *}

text {*
An \emph{indexed name} (or @{text "indexname"}) is a pair of a basic
name and a natural number. This representation allows efficient
renaming by incrementing the second component only. The canonical
way to rename two collections of indexnames apart from each other is
this: determine the maximum index @{text "maxidx"} of the first
collection, then increment all indexes of the second collection by
@{text "maxidx + 1"}; the maximum index of an empty collection is
@{text "-1"}.

Occasionally, basic names and indexed names are injected into the
same pair type: the (improper) indexname @{text "(x, -1)"} is used
to encode basic names.

\medskip Isabelle syntax observes the following rules for
representing an indexname @{text "(x, i)"} as a packed string:

\begin{itemize}

\item @{text "?x"} if @{text "x"} does not end with a digit and @{text "i = 0"},

\item @{text "?xi"} if @{text "x"} does not end with a digit,

\item @{text "?x.i"} otherwise.

\end{itemize}

Indexnames may acquire large index numbers over time. Results are
normalized towards @{text "0"} at certain checkpoints, notably at
the end of a proof. This works by producing variants of the
corresponding basic name components. For example, the collection
@{text "?x1, ?x7, ?x42"} becomes @{text "?x, ?xa, ?xb"}.
*}
|
|
622 |
|
|
623 |
text %mlref {*
|
|
624 |
\begin{mldecls}
|
|
625 |
@{index_ML_type indexname} \\
|
|
626 |
\end{mldecls}
|
|
627 |
|
|
628 |
\begin{description}
|
|
629 |
|
|
630 |
\item @{ML_type indexname} represents indexed names. This is an
|
|
631 |
abbreviation for @{ML_type "string * int"}. The second component is
|
|
632 |
usually non-negative, except for situations where @{text "(x, -1)"}
|
20488
|
633 |
is used to embed basic names into this type.
|
20476
|
634 |
|
|
635 |
\end{description}
|
|
636 |
*}
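
text {*
  \medskip The rules for representing a proper indexname as a packed
  string, as given earlier in this section, can be written down as a
  small function.  The following is a sketch in plain Standard ML
  under the assumption @{text "i \<ge> 0"}; the names @{text
  "ends_with_digit"} and @{text "string_of_ixn"} are hypothetical and
  merely serve the illustration.
*}

ML {*
  (* does the string end with a digit?  (false for the empty string) *)
  fun ends_with_digit s =
    size s > 0 andalso Char.isDigit (String.sub (s, size s - 1));

  (* packed string form of an indexname (x, i), assuming i >= 0 *)
  fun string_of_ixn (x, i) =
    if ends_with_digit x then "?" ^ x ^ "." ^ Int.toString i
    else if i = 0 then "?" ^ x
    else "?" ^ x ^ Int.toString i;

  val examples =
    List.map string_of_ixn [("x", 0), ("x", 7), ("x1", 0), ("x1", 7)];
  (* examples = ["?x", "?x7", "?x1.0", "?x1.7"] *)
*}

text {*
  The separating dot in the last case keeps the representation
  unambiguous: without it, @{text "(x1, 7)"} and @{text "(x, 17)"}
  would both be written @{text "?x17"}.
*}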
|
|
637 |
|
|
638 |
|
|
639 |
subsection {* Qualified names and name spaces *}
|
|
640 |
|
|
641 |
text {*
|
|
642 |
A \emph{qualified name} consists of a non-empty sequence of basic
|
20488
|
643 |
name components. The packed representation uses a dot as separator,
|
|
644 |
as in ``@{text "A.b.c"}''. The last component is called the \emph{base}
|
|
645 |
name, the remaining prefix the \emph{qualifier} (which may be empty).
|
|
646 |
The idea of qualified names is to encode nested structures by
|
|
647 |
recording the access paths as qualifiers. For example, an item
|
|
648 |
named ``@{text "A.b.c"}'' may be understood as a local entity @{text
|
|
649 |
"c"}, within a local structure @{text "b"}, within a global
|
|
650 |
structure @{text "A"}. Typically, name space hierarchies consist of
|
|
651 |
1--2 levels of qualification, but this need not always be the case.
|
20437
|
652 |
|
20476
|
653 |
The empty name is commonly used as an indication of unnamed
|
20488
|
654 |
entities, whenever this makes any sense. The basic operations on
|
|
655 |
qualified names are smart enough to pass through such improper names
|
20476
|
656 |
unchanged.
|
|
657 |
|
|
658 |
\medskip A @{text "naming"} policy tells how to turn a name
|
|
659 |
specification into a fully qualified internal name (by the @{text
|
20488
|
660 |
"full"} operation), and how fully qualified names may be accessed
|
|
661 |
externally. For example, the default naming policy is to prefix an
|
|
662 |
implicit path: @{text "full x"} produces @{text "path.x"}, and the
|
|
663 |
standard accesses for @{text "path.x"} include both @{text "x"} and
|
|
664 |
@{text "path.x"}. Normally, the naming is implicit in the theory or
|
|
665 |
proof context; there are separate versions of the corresponding
@{text "full"} operation for theories and proof contexts.
|
20437
|
666 |
|
20476
|
667 |
\medskip A @{text "name space"} manages a collection of fully
|
|
668 |
internalized names, together with a mapping between external names
|
|
669 |
and internal names (in both directions). The corresponding @{text
|
|
670 |
"intern"} and @{text "extern"} operations are mostly used for
|
|
671 |
parsing and printing only! The @{text "declare"} operation augments
|
20488
|
672 |
a name space according to the accesses determined by the naming
|
|
673 |
policy.
|
20476
|
674 |
|
20488
|
675 |
\medskip As a general principle, there is a separate name space for
|
|
676 |
each kind of formal entity, e.g.\ logical constant, type
|
|
677 |
constructor, type class, theorem. It is usually clear from the
|
|
678 |
occurrence in concrete syntax (or from the scope) which kind of
|
|
679 |
entity a name refers to. For example, the very same name @{text
|
|
680 |
"c"} may be used uniformly for a constant, type constructor, and
|
|
681 |
type class.
|
20476
|
682 |
|
20479
|
683 |
There are common schemes to name theorems systematically, according
|
20488
|
684 |
to the name of the main logical entity involved, e.g.\ @{text
|
|
685 |
"c.intro"} for a canonical theorem related to constant @{text "c"}.
|
|
686 |
This technique of mapping names from one space into another requires
|
|
687 |
some care in order to avoid conflicts. In particular, theorem names
|
|
688 |
derived from a type constructor or type class are better suffixed in
|
|
689 |
addition to the usual qualification, e.g.\ @{text "c_type.intro"}
|
|
690 |
and @{text "c_class.intro"} for theorems related to type @{text "c"}
|
|
691 |
and class @{text "c"}, respectively.
|
20437
|
692 |
*}
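
text {*
  \medskip The default naming policy described above (an implicit path
  prefix, with accesses via partially qualified names) may be modelled
  in plain Standard ML as follows.  This is a deliberately simplified
  sketch: the names @{text "policy"}, @{text "pack_name"}, @{text
  "full_name"}, and @{text "accesses"} are hypothetical, and the
  actual policy of the system may admit further accesses beyond the
  suffixes computed here.
*}

ML {*
  (* a naming policy, modelled as nothing but its implicit path *)
  type policy = string list;

  (* packed form of a list of name components *)
  val pack_name = String.concatWith ".";

  (* fully qualified internal name: prefix the implicit path *)
  fun full_name (path: policy) x = pack_name (path @ [x]);

  (* all non-empty suffixes of a list, longest first *)
  fun suffixes [] = []
    | suffixes (xs as _ :: rest) = xs :: suffixes rest;

  (* external accesses: every suffix of the component list *)
  fun accesses name =
    List.map pack_name (suffixes (String.fields (fn c => c = #".") name));

  val internal = full_name ["path"] "x";   (* "path.x" *)
  val external = accesses internal;        (* ["path.x", "x"] *)
*}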
|
|
693 |
|
20476
|
694 |
text %mlref {*
|
|
695 |
\begin{mldecls}
|
|
696 |
@{index_ML NameSpace.base: "string -> string"} \\
|
20488
|
697 |
@{index_ML NameSpace.drop_base: "string -> string"} \\ %FIXME qualifier
|
20476
|
698 |
@{index_ML NameSpace.append: "string -> string -> string"} \\
|
|
699 |
@{index_ML NameSpace.pack: "string list -> string"} \\
|
|
700 |
@{index_ML NameSpace.unpack: "string -> string list"} \\[1ex]
|
|
701 |
@{index_ML_type NameSpace.naming} \\
|
|
702 |
@{index_ML NameSpace.default_naming: NameSpace.naming} \\
|
|
703 |
@{index_ML NameSpace.add_path: "string -> NameSpace.naming -> NameSpace.naming"} \\
|
|
704 |
@{index_ML NameSpace.full: "NameSpace.naming -> string -> string"} \\[1ex]
|
|
705 |
@{index_ML_type NameSpace.T} \\
|
|
706 |
@{index_ML NameSpace.empty: NameSpace.T} \\
|
|
707 |
@{index_ML NameSpace.merge: "NameSpace.T * NameSpace.T -> NameSpace.T"} \\
|
|
708 |
@{index_ML NameSpace.declare: "NameSpace.naming -> string -> NameSpace.T -> NameSpace.T"} \\
|
|
709 |
@{index_ML NameSpace.intern: "NameSpace.T -> string -> string"} \\
|
|
710 |
@{index_ML NameSpace.extern: "NameSpace.T -> string -> string"} \\
|
|
711 |
\end{mldecls}
|
20437
|
712 |
|
20476
|
713 |
\begin{description}
|
|
714 |
|
|
715 |
\item @{ML NameSpace.base}~@{text "name"} returns the base name of a
|
|
716 |
qualified name.
|
|
717 |
|
|
718 |
\item @{ML NameSpace.drop_base}~@{text "name"} returns the qualifier
|
|
719 |
of a qualified name.
|
20437
|
720 |
|
20476
|
721 |
\item @{ML NameSpace.append}~@{text "name\<^isub>1 name\<^isub>2"}
|
|
722 |
appends two qualified names.
|
20437
|
723 |
|
20488
|
724 |
\item @{ML NameSpace.pack}~@{text "names"} and @{ML
|
|
725 |
NameSpace.unpack}~@{text "name"} convert between the packed string
|
|
726 |
representation and the explicit list form of qualified names.
|
20476
|
727 |
|
|
728 |
\item @{ML_type NameSpace.naming} represents the abstract concept of
|
|
729 |
a naming policy.
|
20437
|
730 |
|
20476
|
731 |
\item @{ML NameSpace.default_naming} is the default naming policy.
|
|
732 |
In a theory context, this is usually augmented by a path prefix
|
|
733 |
consisting of the theory name.
|
|
734 |
|
|
735 |
\item @{ML NameSpace.add_path}~@{text "path naming"} augments the
|
20488
|
736 |
naming policy by extending its path component.
|
20437
|
737 |
|
20476
|
738 |
\item @{ML NameSpace.full}~@{text "naming name"} turns a name
|
|
739 |
specification (usually a basic name) into the fully qualified
|
|
740 |
internal version, according to the given naming policy.
|
|
741 |
|
|
742 |
\item @{ML_type NameSpace.T} represents name spaces.
|
|
743 |
|
|
744 |
\item @{ML NameSpace.empty} and @{ML NameSpace.merge}~@{text
|
20488
|
745 |
"(space\<^isub>1, space\<^isub>2)"} are the canonical operations for
|
|
746 |
maintaining name spaces according to theory data management
|
|
747 |
(\secref{sec:context-data}).
|
20437
|
748 |
|
20476
|
749 |
\item @{ML NameSpace.declare}~@{text "naming name space"} enters a
|
20488
|
750 |
fully qualified name into the name space, with external accesses
|
|
751 |
determined by the naming policy.
|
20476
|
752 |
|
|
753 |
\item @{ML NameSpace.intern}~@{text "space name"} internalizes a
|
|
754 |
(partially qualified) external name.
|
20437
|
755 |
|
20488
|
756 |
This operation is mostly for parsing! Note that fully qualified
|
20476
|
757 |
names stemming from declarations are produced via @{ML
|
20488
|
758 |
"NameSpace.full"} (or its derivatives for @{ML_type theory} and
|
|
759 |
@{ML_type Proof.context}).
|
20437
|
760 |
|
20476
|
761 |
\item @{ML NameSpace.extern}~@{text "space name"} externalizes a
|
|
762 |
(fully qualified) internal name.
|
|
763 |
|
20488
|
764 |
This operation is mostly for printing! Note that unqualified names are
|
20476
|
765 |
produced via @{ML NameSpace.base}.
|
|
766 |
|
|
767 |
\end{description}
|
|
768 |
*}
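
text {*
  \medskip The following sketch exercises the operations listed above,
  relying only on the signatures given in this section.  The path
  @{text "foo"} and the name @{text "x"} are arbitrary choices made
  for the example.  The results noted in the comments are those
  suggested by the descriptions above; the outcome of @{ML
  NameSpace.intern} and @{ML NameSpace.extern} is left unannotated,
  since it depends on the accesses actually admitted by the naming
  policy.
*}

ML {*
  (* basic operations on packed qualified names *)
  val base = NameSpace.base "A.b.c";            (* "c" *)
  val qualifier = NameSpace.drop_base "A.b.c";  (* "A.b" *)
  val name = NameSpace.append "A" "b.c";        (* "A.b.c" *)
  val comps = NameSpace.unpack "A.b.c";         (* ["A", "b", "c"] *)
  val name' = NameSpace.pack comps;             (* "A.b.c" *)

  (* default naming policy, augmented by the explicit path "foo" *)
  val naming = NameSpace.add_path "foo" NameSpace.default_naming;

  (* fully qualified internal version of "x" according to this policy *)
  val full_x = NameSpace.full naming "x";

  (* enter the internal name into an otherwise empty name space *)
  val space = NameSpace.declare naming full_x NameSpace.empty;

  (* map between the external and internal view (parsing / printing) *)
  val internal = NameSpace.intern space "x";
  val external = NameSpace.extern space internal;
*}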
|
20437
|
769 |
|
18537
|
770 |
end
|