doc-src/TutorialI/CodeGen/CodeGen.thy
author nipkow
Wed, 19 Apr 2000 11:56:06 +0200
changeset 8744 22fa8b16c3ae
child 8771 026f37a86ea7
permissions -rw-r--r--
*** empty log message ***
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
8744
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
     1
(*<*)
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
     2
theory CodeGen = Main:
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
     3
(*>*)
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
     4
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
     5
text{*\noindent
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
     6
The task is to develop a compiler from a generic type of expressions (built
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
     7
up from variables, constants and binary operations) to a stack machine.  This
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
     8
generic type of expressions is a generalization of the boolean expressions in
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
     9
\S\ref{sec:boolex}.  This time we do not commit ourselves to a particular
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    10
type of variables or values but make them type parameters.  Neither is there
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    11
a fixed set of binary operations: instead the expression contains the
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    12
appropriate function itself.
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    13
*}
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    14
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    15
types 'v binop = "'v \\<Rightarrow> 'v \\<Rightarrow> 'v";
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    16
datatype ('a,'v)expr = Cex 'v
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    17
                     | Vex 'a
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    18
                     | Bex "'v binop"  "('a,'v)expr"  "('a,'v)expr";
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    19
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    20
text{*\noindent
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    21
The three constructors represent constants, variables and the combination of
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    22
two subexpressions with a binary operation.
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    23
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    24
The value of an expression w.r.t.\ an environment that maps variables to
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    25
values is easily defined:
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    26
*}
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    27
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    28
consts value :: "('a \\<Rightarrow> 'v) \\<Rightarrow> ('a,'v)expr \\<Rightarrow> 'v";
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    29
primrec
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    30
"value env (Cex v) = v"
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    31
"value env (Vex a) = env a"
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    32
"value env (Bex f e1 e2) = f (value env e1) (value env e2)";
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    33
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    34
text{*
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    35
The stack machine has three instructions: load a constant value onto the
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    36
stack, load the contents of a certain address onto the stack, and apply a
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    37
binary operation to the two topmost elements of the stack, replacing them by
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    38
the result. As for \isa{expr}, addresses and values are type parameters:
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    39
*}
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    40
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    41
datatype ('a,'v) instr = Const 'v
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    42
                       | Load 'a
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    43
                       | Apply "'v binop";
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    44
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    45
text{*
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    46
The execution of the stack machine is modelled by a function \isa{exec}
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    47
that takes a store (modelled as a function from addresses to values, just
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    48
like the environment for evaluating expressions), a stack (modelled as a
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    49
list) of values, and a list of instructions, and returns the stack at the end
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    50
of the execution---the store remains unchanged:
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    51
*}
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    52
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    53
consts exec :: "('a\\<Rightarrow>'v) \\<Rightarrow> 'v list \\<Rightarrow> ('a,'v)instr list \\<Rightarrow> 'v list";
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    54
primrec
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    55
"exec s vs [] = vs"
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    56
"exec s vs (i#is) = (case i of
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    57
    Const v  \\<Rightarrow> exec s (v#vs) is
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    58
  | Load a   \\<Rightarrow> exec s ((s a)#vs) is
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    59
  | Apply f  \\<Rightarrow> exec s ( (f (hd vs) (hd(tl vs)))#(tl(tl vs)) ) is)";
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    60
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    61
text{*\noindent
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    62
Recall that \isa{hd} and \isa{tl}
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    63
return the first element and the remainder of a list.
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    64
Because all functions are total, \isa{hd} is defined even for the empty
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    65
list, although we do not know what the result is. Thus our model of the
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    66
machine always terminates properly, although the above definition does not
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    67
tell us much about the result in situations where \isa{Apply} was executed
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    68
with fewer than two elements on the stack.
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    69
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    70
The compiler is a function from expressions to a list of instructions. Its
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    71
definition is pretty much obvious:
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    72
*}
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    73
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    74
consts comp :: "('a,'v)expr \\<Rightarrow> ('a,'v)instr list";
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    75
primrec
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    76
"comp (Cex v)       = [Const v]"
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    77
"comp (Vex a)       = [Load a]"
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    78
"comp (Bex f e1 e2) = (comp e2) @ (comp e1) @ [Apply f]";
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    79
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    80
text{*
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    81
Now we have to prove the correctness of the compiler, i.e.\ that the
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    82
execution of a compiled expression results in the value of the expression:
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    83
*}
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    84
theorem "exec s [] (comp e) = [value s e]";
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    85
(*<*)oops;(*>*)
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    86
text{*\noindent
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    87
This theorem needs to be generalized to
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    88
*}
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    89
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    90
theorem "\\<forall>vs. exec s vs (comp e) = (value s e) # vs";
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    91
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    92
txt{*\noindent
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    93
which is proved by induction on \isa{e} followed by simplification, once
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    94
we have the following lemma about executing the concatenation of two
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    95
instruction sequences:
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    96
*}
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    97
(*<*)oops;(*>*)
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    98
lemma exec_app[simp]:
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
    99
  "\\<forall>vs. exec s vs (xs@ys) = exec s (exec s vs xs) ys"; 
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
   100
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
   101
txt{*\noindent
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
   102
This requires induction on \isa{xs} and ordinary simplification for the
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
   103
base cases. In the induction step, simplification leaves us with a formula
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
   104
that contains two \isa{case}-expressions over instructions. Thus we add
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
   105
automatic case splitting as well, which finishes the proof:
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
   106
*}
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
   107
apply(induct_tac xs, simp, simp split: instr.split).;
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
   108
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
   109
text{*\noindent
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
   110
Note that because \isaindex{auto} performs simplification, it can
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
   111
also be modified in the same way \isa{simp} can. Thus the proof can be
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
   112
rewritten as
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
   113
*}
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
   114
(*<*)
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
   115
lemmas [simp del] = exec_app;
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
   116
lemma [simp]: "\\<forall>vs. exec s vs (xs@ys) = exec s (exec s vs xs) ys"; 
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
   117
(*>*)
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
   118
apply(induct_tac xs, auto split: instr.split).;
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
   119
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
   120
text{*\noindent
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
   121
Although this is more compact, it is less clear for the reader of the proof.
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
   122
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
   123
We could now go back and prove \isa{exec s [] (comp e) = [value s e]}
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
   124
merely by simplification with the generalized version we just proved.
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
   125
However, this is unnecessary because the generalized version fully subsumes
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
   126
its instance.
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
   127
*}
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
   128
(*<*)
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
   129
theorem "\\<forall>vs. exec s vs (comp e) = (value s e) # vs";
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
   130
apply(induct_tac e, auto).;
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
   131
end
22fa8b16c3ae *** empty log message ***
nipkow
parents:
diff changeset
   132
(*>*)