doc-src/TutorialI/CodeGen/CodeGen.thy
author nipkow
Tue, 12 Sep 2000 15:43:15 +0200
changeset 9933 9feb1e0c4cb3
parent 9844 8016321c7de1
child 10171 59d6633835fa
permissions -rw-r--r--
*** empty log message ***

(*<*)
theory CodeGen = Main:
(*>*)

section{*Case study: compiling expressions*}

text{*\label{sec:ExprCompiler}
The task is to develop a compiler from a generic type of expressions (built
up from variables, constants and binary operations) to a stack machine.  This
generic type of expressions is a generalization of the boolean expressions in
\S\ref{sec:boolex}.  This time we do not commit ourselves to a particular
type of variables or values but make them type parameters.  Neither is there
a fixed set of binary operations: instead the expression contains the
appropriate function itself.
*}

types 'v binop = "'v \<Rightarrow> 'v \<Rightarrow> 'v";
datatype ('a,'v)expr = Cex 'v
                     | Vex 'a
                     | Bex "'v binop"  "('a,'v)expr"  "('a,'v)expr";

text{*\noindent
The three constructors represent constants, variables and the application of
a binary operation to two subexpressions.

The value of an expression w.r.t.\ an environment that maps variables to
values is easily defined:
*}

consts value :: "('a,'v)expr \<Rightarrow> ('a \<Rightarrow> 'v) \<Rightarrow> 'v";
primrec
"value (Cex v) env = v"
"value (Vex a) env = env a"
"value (Bex f e1 e2) env = f (value e1 env) (value e2 env)";
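
text{*\noindent
As a small sanity check, which is not part of the original development, the
defining equations can be tried out on a sample expression. The variables
@{text"f"}, @{text"a"} and @{text"x"} below are arbitrary and chosen only for
illustration; the sketch assumes that the @{text"primrec"} equations are
available as simplification rules, so that @{text"simp"} should suffice:
*}

lemma "value (Bex f (Cex a) (Vex x)) env = f a (env x)";
by simp;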

text{*
The stack machine has three instructions: load a constant value onto the
stack, load the contents of a certain address onto the stack, and apply a
binary operation to the two topmost elements of the stack, replacing them by
the result. As for @{text"expr"}, addresses and values are type parameters:
*}

datatype ('a,'v) instr = Const 'v
                       | Load 'a
                       | Apply "'v binop";

text{*
The execution of the stack machine is modelled by a function
@{text"exec"} that takes a list of instructions, a store (modelled as a
function from addresses to values, just like the environment for
evaluating expressions), and a stack (modelled as a list) of values,
and returns the stack at the end of the execution---the store remains
unchanged:
*}

consts exec :: "('a,'v)instr list \<Rightarrow> ('a\<Rightarrow>'v) \<Rightarrow> 'v list \<Rightarrow> 'v list";
primrec
"exec [] s vs = vs"
"exec (i#is) s vs = (case i of
    Const v  \<Rightarrow> exec is s (v#vs)
  | Load a   \<Rightarrow> exec is s ((s a)#vs)
  | Apply f  \<Rightarrow> exec is s ((f (hd vs) (hd(tl vs)))#(tl(tl vs))))";
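
text{*\noindent
To see the stack discipline at work, here is a sketch (an illustration added
here, not part of the original development) that runs a three-instruction
program on the empty stack. The variables @{text"a"}, @{text"b"} and
@{text"f"} are arbitrary. Note the order of the arguments: @{text"b"} is
pushed last and is therefore on top of the stack, so it becomes the first
argument of @{text"f"}. Simplification with the defining equations should be
enough:
*}

lemma "exec [Const a, Const b, Apply f] s [] = [f b a]";
by simp;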

text{*\noindent
Recall that @{term"hd"} and @{term"tl"}
return the first element and the remainder of a list.
Because all functions are total, @{term"hd"} is defined even for the empty
list, although we do not know what the result is. Thus our model of the
machine always terminates properly, although the above definition does not
tell us much about the result in situations where @{term"Apply"} was executed
with fewer than two elements on the stack.

The compiler is a function from expressions to a list of instructions. Its
definition is pretty much obvious:
*}

consts comp :: "('a,'v)expr \<Rightarrow> ('a,'v)instr list";
primrec
"comp (Cex v)       = [Const v]"
"comp (Vex a)       = [Load a]"
"comp (Bex f e1 e2) = (comp e2) @ (comp e1) @ [Apply f]";
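
text{*\noindent
The code for @{text"e2"} is emitted before that for @{text"e1"} because
@{text"Apply f"} takes its first argument from the top of the stack, so the
value of @{text"e1"} has to be pushed last. A minimal illustration, with
arbitrary variables @{text"f"}, @{text"a"} and @{text"x"} that are not part of
the original development, and assuming that @{text"simp"} unfolds
@{text"comp"} and list append:
*}

lemma "comp (Bex f (Cex a) (Vex x)) = [Load x, Const a, Apply f]";
by simp;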

text{*
Now we have to prove the correctness of the compiler, i.e.\ that the
execution of a compiled expression results in the value of the expression:
*}
theorem "exec (comp e) s [] = [value e s]";
(*<*)oops;(*>*)
text{*\noindent
This theorem needs to be generalized to
*}

theorem "\<forall>vs. exec (comp e) s vs = (value e s) # vs";

txt{*\noindent
which is proved by induction on @{term"e"} followed by simplification, once
we have the following lemma about executing the concatenation of two
instruction sequences:
*}
(*<*)oops;(*>*)
lemma exec_app[simp]:
  "\<forall>vs. exec (xs@ys) s vs = exec ys s (exec xs s vs)";

txt{*\noindent
This requires induction on @{term"xs"} and ordinary simplification for the
base cases. In the induction step, simplification leaves us with a formula
that contains two @{text"case"}-expressions over instructions. Thus we add
automatic case splitting as well, which finishes the proof:
*}
by(induct_tac xs, simp, simp split: instr.split);

text{*\noindent
Note that because \isaindex{auto} performs simplification, it can
also be modified in the same way @{text simp} can. Thus the proof can be
rewritten as
*}
(*<*)
declare exec_app[simp del];
lemma [simp]: "\<forall>vs. exec (xs@ys) s vs = exec ys s (exec xs s vs)";
(*>*)
by(induct_tac xs, auto split: instr.split);

text{*\noindent
Although this is more compact, it is less clear for the reader of the proof.

We could now go back and prove \isa{exec (comp e) s [] = [value e s]}
merely by simplification with the generalized version we just proved.
However, this is unnecessary because the generalized version fully subsumes
its instance.
*}
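
text{*
For a concrete expression the correctness statement can also be checked
directly. The following sketch is an addition for illustration only; the
variables @{text"f"}, @{text"a"} and @{text"x"} are arbitrary. It should go
through by plain simplification, since @{text"comp"} and @{text"exec"} can
simply be unfolded on a closed instruction list:
*}

lemma "exec (comp (Bex f (Cex a) (Vex x))) s [] = [f a (s x)]";
by simp;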
(*<*)
theorem "\<forall>vs. exec (comp e) s vs = (value e s) # vs";
by(induct_tac e, auto);
end
(*>*)