author | blanchet |
Tue, 02 Feb 2010 11:38:38 +0100 | |
changeset 34982 | 7b8c366e34a2 |
parent 34126 | 8a2c5d7aff51 |
child 34998 | 5e492a862b34 |
permissions | -rw-r--r-- |
33191 | 1 |
\documentclass[a4paper,12pt]{article} |
2 |
\usepackage[T1]{fontenc} |
|
3 |
\usepackage{amsmath} |
|
4 |
\usepackage{amssymb} |
|
33564
75ce0f60617a
fixed minor problems with Nitpick's documentation
blanchet
parents:
33561
diff
changeset
|
5 |
\usepackage[english,french]{babel} |
33191 | 6 |
\usepackage{color} |
7 |
\usepackage{graphicx} |
|
8 |
%\usepackage{mathpazo} |
|
9 |
\usepackage{multicol} |
|
10 |
\usepackage{stmaryrd} |
|
11 |
%\usepackage[scaled=.85]{beramono} |
|
12 |
\usepackage{../iman,../pdfsetup} |
|
13 |
||
14 |
%\oddsidemargin=4.6mm |
|
15 |
%\evensidemargin=4.6mm |
|
16 |
%\textwidth=150mm |
|
17 |
%\topmargin=4.6mm |
|
18 |
%\headheight=0mm |
|
19 |
%\headsep=0mm |
|
20 |
%\textheight=234mm |
|
21 |
||
22 |
\def\Colon{\mathord{:\mkern-1.5mu:}} |
|
23 |
%\def\lbrakk{\mathopen{\lbrack\mkern-3.25mu\lbrack}} |
|
24 |
%\def\rbrakk{\mathclose{\rbrack\mkern-3.255mu\rbrack}} |
|
25 |
\def\lparr{\mathopen{(\mkern-4mu\mid}} |
|
26 |
\def\rparr{\mathclose{\mid\mkern-4mu)}} |
|
27 |
||
28 |
\def\unk{{?}} |
|
34982
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
29 |
\def\undef{(\lambda x.\; \unk)} |
33191 | 30 |
%\def\unr{\textit{others}} |
31 |
\def\unr{\ldots} |
|
32 |
\def\Abs#1{\hbox{\rm{\flqq}}{\,#1\,}\hbox{\rm{\frqq}}} |
|
33 |
\def\Q{{\smash{\lower.2ex\hbox{$\scriptstyle?$}}}} |
|
34 |
||
35 |
\hyphenation{Mini-Sat size-change First-Steps grand-parent nit-pick |
|
36 |
counter-example counter-examples data-type data-types co-data-type |
|
37 |
co-data-types in-duc-tive co-in-duc-tive} |
|
38 |
||
39 |
\urlstyle{tt} |
|
40 |
||
41 |
\begin{document} |
|
42 |
||
33564
75ce0f60617a
fixed minor problems with Nitpick's documentation
blanchet
parents:
33561
diff
changeset
|
43 |
\selectlanguage{english} |
75ce0f60617a
fixed minor problems with Nitpick's documentation
blanchet
parents:
33561
diff
changeset
|
44 |
|
33191 | 45 |
\title{\includegraphics[scale=0.5]{isabelle_nitpick} \\[4ex] |
46 |
Picking Nits \\[\smallskipamount] |
|
33887 | 47 |
\Large A User's Guide to Nitpick for Isabelle/HOL} |
33191 | 48 |
\author{\hbox{} \\ |
49 |
Jasmin Christian Blanchette \\ |
|
33887 | 50 |
{\normalsize Institut f\"ur Informatik, Technische Universit\"at M\"unchen} \\ |
33191 | 51 |
\hbox{}} |
52 |
||
53 |
\maketitle |
|
54 |
||
55 |
\tableofcontents |
|
56 |
||
57 |
\setlength{\parskip}{.7em plus .2em minus .1em} |
|
58 |
\setlength{\parindent}{0pt} |
|
59 |
\setlength{\abovedisplayskip}{\parskip} |
|
60 |
\setlength{\abovedisplayshortskip}{.9\parskip} |
|
61 |
\setlength{\belowdisplayskip}{\parskip} |
|
62 |
\setlength{\belowdisplayshortskip}{.9\parskip} |
|
63 |
||
64 |
% General-purpose enum environment with correct spacing |
|
65 |
\newenvironment{enum}% |
|
66 |
{\begin{list}{}{% |
|
67 |
\setlength{\topsep}{.1\parskip}% |
|
68 |
\setlength{\partopsep}{.1\parskip}% |
|
69 |
\setlength{\itemsep}{\parskip}% |
|
70 |
\advance\itemsep by-\parsep}} |
|
71 |
{\end{list}} |
|
72 |
||
73 |
\def\pre{\begingroup\vskip0pt plus1ex\advance\leftskip by\leftmargin |
|
74 |
\advance\rightskip by\leftmargin} |
|
75 |
\def\post{\vskip0pt plus1ex\endgroup} |
|
76 |
||
77 |
\def\prew{\pre\advance\rightskip by-\leftmargin} |
|
78 |
\def\postw{\post} |
|
79 |
||
80 |
\section{Introduction} |
|
81 |
\label{introduction} |
|
82 |
||
83 |
Nitpick \cite{blanchette-nipkow-2009} is a counterexample generator for |
|
84 |
Isabelle/HOL \cite{isa-tutorial} that is designed to handle formulas |
|
85 |
combining (co)in\-duc\-tive datatypes, (co)in\-duc\-tively defined predicates, and |
|
86 |
quantifiers. It builds on Kodkod \cite{torlak-jackson-2007}, a highly optimized |
|
87 |
first-order relational model finder developed by the Software Design Group at |
|
88 |
MIT. It is conceptually similar to Refute \cite{weber-2008}, from which it |
|
89 |
borrows many ideas and code fragments, but it benefits from Kodkod's |
|
90 |
optimizations and a new encoding scheme. The name Nitpick is shamelessly |
|
91 |
appropriated from a now retired Alloy precursor. |
|
92 |
||
93 |
Nitpick is easy to use---you simply enter \textbf{nitpick} after a putative |
|
94 |
theorem and wait a few seconds. Nonetheless, there are situations where knowing |
|
95 |
how it works under the hood and how it reacts to various options helps |
|
96 |
increase the test coverage. This manual also explains how to install the tool on |
|
97 |
your workstation. Should the motivation fail you, think of the many hours of |
|
98 |
hard work Nitpick will save you. Proving non-theorems is \textsl{hard work}. |
|
99 |
||
100 |
Another common use of Nitpick is to find out whether the axioms of a locale are |
|
101 |
satisfiable, while the locale is being developed. To check this, it suffices to |
|
102 |
write |
|
103 |
||
104 |
\prew |
|
105 |
\textbf{lemma}~``$\textit{False}$'' \\ |
|
106 |
\textbf{nitpick}~[\textit{show\_all}] |
|
107 |
\postw |
|
108 |
||
109 |
after the locale's \textbf{begin} keyword. To falsify \textit{False}, Nitpick |
|
110 |
must find a model for the axioms. If it finds no model, we have an indication |
|
111 |
that the axioms might be unsatisfiable. |
|
112 |
||
33195
0efe26262e73
updated Nitpick manual to reflect the latest Stand der Dinge
blanchet
parents:
33193
diff
changeset
|
113 |
Nitpick requires the Kodkodi package for Isabelle as well as a Java 1.5 virtual |
0efe26262e73
updated Nitpick manual to reflect the latest Stand der Dinge
blanchet
parents:
33193
diff
changeset
|
114 |
machine called \texttt{java}. The examples presented in this manual can be found |
0efe26262e73
updated Nitpick manual to reflect the latest Stand der Dinge
blanchet
parents:
33193
diff
changeset
|
115 |
in Isabelle's \texttt{src/HOL/Nitpick\_Examples/Manual\_Nits.thy} theory. |
0efe26262e73
updated Nitpick manual to reflect the latest Stand der Dinge
blanchet
parents:
33193
diff
changeset
|
116 |
|
33561
ab01b72715ef
introduced Auto Nitpick in addition to Auto Quickcheck;
blanchet
parents:
33559
diff
changeset
|
117 |
Throughout this manual, we will explicitly invoke the \textbf{nitpick} command. |
ab01b72715ef
introduced Auto Nitpick in addition to Auto Quickcheck;
blanchet
parents:
33559
diff
changeset
|
118 |
Nitpick also provides an automatic mode that can be enabled using the |
ab01b72715ef
introduced Auto Nitpick in addition to Auto Quickcheck;
blanchet
parents:
33559
diff
changeset
|
119 |
``Auto Nitpick'' option from the ``Isabelle'' menu in Proof General. In this |
ab01b72715ef
introduced Auto Nitpick in addition to Auto Quickcheck;
blanchet
parents:
33559
diff
changeset
|
120 |
mode, Nitpick is run on every newly entered theorem, much like Auto Quickcheck. |
ab01b72715ef
introduced Auto Nitpick in addition to Auto Quickcheck;
blanchet
parents:
33559
diff
changeset
|
121 |
The collective time limit for Auto Nitpick and Auto Quickcheck can be set using |
ab01b72715ef
introduced Auto Nitpick in addition to Auto Quickcheck;
blanchet
parents:
33559
diff
changeset
|
122 |
the ``Auto Counterexample Time Limit'' option. |
ab01b72715ef
introduced Auto Nitpick in addition to Auto Quickcheck;
blanchet
parents:
33559
diff
changeset
|
123 |
|
33191 | 124 |
\newbox\boxA |
125 |
\setbox\boxA=\hbox{\texttt{nospam}} |
|
126 |
||
127 |
The known bugs and limitations at the time of writing are listed in |
|
128 |
\S\ref{known-bugs-and-limitations}. Comments and bug reports concerning Nitpick |
|
129 |
or this manual should be directed to |
|
130 |
\texttt{blan{\color{white}nospam}\kern-\wd\boxA{}chette@\allowbreak |
|
131 |
in.\allowbreak tum.\allowbreak de}. |
|
132 |
||
133 |
\vskip2.5\smallskipamount |
|
134 |
||
135 |
\textbf{Acknowledgment.} The author would like to thank Mark Summerfield for |
|
136 |
suggesting several textual improvements. |
|
137 |
% and Perry James for reporting a typo. |
|
138 |
||
139 |
\section{First Steps} |
|
140 |
\label{first-steps} |
|
141 |
||
142 |
This section introduces Nitpick by presenting small examples. If possible, you |
|
143 |
should try out the examples on your workstation. Your theory file should start |
|
144 |
the standard way: |
|
145 |
||
146 |
\prew |
|
147 |
\textbf{theory}~\textit{Scratch} \\ |
|
148 |
\textbf{imports}~\textit{Main} \\ |
|
149 |
\textbf{begin} |
|
150 |
\postw |
|
151 |
||
152 |
The results presented here were obtained using the JNI version of MiniSat and |
|
153 |
with multithreading disabled to reduce nondeterminism. This was done by adding |
|
154 |
the line |
|
155 |
||
156 |
\prew |
|
157 |
\textbf{nitpick\_params} [\textit{sat\_solver}~= \textit{MiniSatJNI}, \,\textit{max\_threads}~= 1] |
|
158 |
\postw |
|
159 |
||
160 |
after the \textbf{begin} keyword. The JNI version of MiniSat is bundled with |
|
161 |
Kodkodi and is precompiled for the major platforms. Other SAT solvers can also |
|
162 |
be installed, as explained in \S\ref{optimizations}. If you have already |
|
163 |
configured SAT solvers in Isabelle (e.g., for Refute), these will also be |
|
164 |
available to Nitpick. |
|
165 |
||
166 |
\subsection{Propositional Logic} |
|
167 |
\label{propositional-logic} |
|
168 |
||
169 |
Let's start with a trivial example from propositional logic: |
|
170 |
||
171 |
\prew |
|
172 |
\textbf{lemma}~``$P \longleftrightarrow Q$'' \\ |
|
173 |
\textbf{nitpick} |
|
174 |
\postw |
|
175 |
||
176 |
You should get the following output: |
|
177 |
||
178 |
\prew |
|
179 |
\slshape |
|
180 |
Nitpick found a counterexample: \\[2\smallskipamount] |
|
181 |
\hbox{}\qquad Free variables: \nopagebreak \\ |
|
182 |
\hbox{}\qquad\qquad $P = \textit{True}$ \\ |
|
183 |
\hbox{}\qquad\qquad $Q = \textit{False}$ |
|
184 |
\postw |
|
185 |
||
186 |
Nitpick can also be invoked on individual subgoals, as in the example below: |
|
187 |
||
188 |
\prew |
|
189 |
\textbf{apply}~\textit{auto} \\[2\smallskipamount] |
|
190 |
{\slshape goal (2 subgoals): \\ |
|
34982
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
191 |
\phantom{0}1. $P\,\Longrightarrow\, Q$ \\ |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
192 |
\phantom{0}2. $Q\,\Longrightarrow\, P$} \\[2\smallskipamount] |
33191 | 193 |
\textbf{nitpick}~1 \\[2\smallskipamount] |
194 |
{\slshape Nitpick found a counterexample: \\[2\smallskipamount] |
|
195 |
\hbox{}\qquad Free variables: \nopagebreak \\ |
|
196 |
\hbox{}\qquad\qquad $P = \textit{True}$ \\ |
|
197 |
\hbox{}\qquad\qquad $Q = \textit{False}$} \\[2\smallskipamount] |
|
198 |
\textbf{nitpick}~2 \\[2\smallskipamount] |
|
199 |
{\slshape Nitpick found a counterexample: \\[2\smallskipamount] |
|
200 |
\hbox{}\qquad Free variables: \nopagebreak \\ |
|
201 |
\hbox{}\qquad\qquad $P = \textit{False}$ \\ |
|
202 |
\hbox{}\qquad\qquad $Q = \textit{True}$} \\[2\smallskipamount] |
|
203 |
\textbf{oops} |
|
204 |
\postw |
|
205 |
||
206 |
\subsection{Type Variables} |
|
207 |
\label{type-variables} |
|
208 |
||
209 |
If you are left unimpressed by the previous example, don't worry. The next |
|
210 |
one is more mind- and computer-boggling: |
|
211 |
||
212 |
\prew |
|
213 |
\textbf{lemma} ``$P~x\,\Longrightarrow\, P~(\textrm{THE}~y.\;P~y)$'' |
|
214 |
\postw |
|
215 |
\pagebreak[2] %% TYPESETTING |
|
216 |
||
217 |
The putative lemma involves the definite description operator, {THE}, presented |
|
218 |
in section 5.10.1 of the Isabelle tutorial \cite{isa-tutorial}. The |
|
219 |
operator is defined by the axiom $(\textrm{THE}~x.\; x = a) = a$. The putative |
|
220 |
lemma is merely asserting the indefinite description operator axiom with {THE} |
|
221 |
substituted for {SOME}. |
|
222 |
||
223 |
The free variable $x$ and the bound variable $y$ have type $'a$. For formulas |
|
224 |
containing type variables, Nitpick enumerates the possible domains for each type |
|
225 |
variable, up to a given cardinality (8 by default), looking for a finite |
|
226 |
countermodel: |
|
227 |
||
228 |
\prew |
|
229 |
\textbf{nitpick} [\textit{verbose}] \\[2\smallskipamount] |
|
230 |
\slshape |
|
231 |
Trying 8 scopes: \nopagebreak \\ |
|
232 |
\hbox{}\qquad \textit{card}~$'a$~= 1; \\ |
|
233 |
\hbox{}\qquad \textit{card}~$'a$~= 2; \\ |
|
234 |
\hbox{}\qquad $\qquad\vdots$ \\[.5\smallskipamount] |
|
235 |
\hbox{}\qquad \textit{card}~$'a$~= 8. \\[2\smallskipamount] |
|
236 |
Nitpick found a counterexample for \textit{card} $'a$~= 3: \\[2\smallskipamount] |
|
237 |
\hbox{}\qquad Free variables: \nopagebreak \\ |
|
238 |
\hbox{}\qquad\qquad $P = \{a_2,\, a_3\}$ \\ |
|
239 |
\hbox{}\qquad\qquad $x = a_3$ \\[2\smallskipamount] |
|
240 |
Total time: 580 ms. |
|
241 |
\postw |
|
242 |
||
243 |
Nitpick found a counterexample in which $'a$ has cardinality 3. (For |
|
244 |
cardinalities 1 and 2, the formula holds.) In the counterexample, the three |
|
245 |
values of type $'a$ are written $a_1$, $a_2$, and $a_3$. |
|
246 |
||
247 |
The message ``Trying $n$ scopes: {\ldots}''\ is shown only if the option |
|
248 |
\textit{verbose} is enabled. You can specify \textit{verbose} each time you |
|
249 |
invoke \textbf{nitpick}, or you can set it globally using the command |
|
250 |
||
251 |
\prew |
|
252 |
\textbf{nitpick\_params} [\textit{verbose}] |
|
253 |
\postw |
|
254 |
||
255 |
This command also displays the current default values for all of the options |
|
256 |
supported by Nitpick. The options are listed in \S\ref{option-reference}. |
|
257 |
||
258 |
\subsection{Constants} |
|
259 |
\label{constants} |
|
260 |
||
261 |
By just looking at Nitpick's output, it might not be clear why the |
|
262 |
counterexample in \S\ref{type-variables} is genuine. Let's invoke Nitpick again, |
|
263 |
this time telling it to show the values of the constants that occur in the |
|
264 |
formula: |
|
265 |
||
266 |
\prew |
|
267 |
\textbf{lemma}~``$P~x\,\Longrightarrow\, P~(\textrm{THE}~y.\;P~y)$'' \\ |
|
268 |
\textbf{nitpick}~[\textit{show\_consts}] \\[2\smallskipamount] |
|
269 |
\slshape |
|
270 |
Nitpick found a counterexample for \textit{card} $'a$~= 3: \\[2\smallskipamount] |
|
271 |
\hbox{}\qquad Free variables: \nopagebreak \\ |
|
272 |
\hbox{}\qquad\qquad $P = \{a_2,\, a_3\}$ \\ |
|
273 |
\hbox{}\qquad\qquad $x = a_3$ \\ |
|
274 |
\hbox{}\qquad Constant: \nopagebreak \\ |
|
275 |
\hbox{}\qquad\qquad $\textit{The}~\textsl{fallback} = a_1$ |
|
276 |
\postw |
|
277 |
||
278 |
We can see more clearly now. Since the predicate $P$ isn't true for a unique |
|
279 |
value, $\textrm{THE}~y.\;P~y$ can denote any value of type $'a$, even |
|
280 |
$a_1$. Since $P~a_1$ is false, the entire formula is falsified. |
|
281 |
||
282 |
As an optimization, Nitpick's preprocessor introduced the special constant |
|
283 |
``\textit{The} fallback'' corresponding to $\textrm{THE}~y.\;P~y$ (i.e., |
|
284 |
$\mathit{The}~(\lambda y.\;P~y)$) when there doesn't exist a unique $y$ |
|
285 |
satisfying $P~y$. We disable this optimization by passing the |
|
286 |
\textit{full\_descrs} option: |
|
287 |
||
288 |
\prew |
|
289 |
\textbf{nitpick}~[\textit{full\_descrs},\, \textit{show\_consts}] \\[2\smallskipamount] |
|
290 |
\slshape |
|
291 |
Nitpick found a counterexample for \textit{card} $'a$~= 3: \\[2\smallskipamount] |
|
292 |
\hbox{}\qquad Free variables: \nopagebreak \\ |
|
293 |
\hbox{}\qquad\qquad $P = \{a_2,\, a_3\}$ \\ |
|
294 |
\hbox{}\qquad\qquad $x = a_3$ \\ |
|
295 |
\hbox{}\qquad Constant: \nopagebreak \\ |
|
296 |
\hbox{}\qquad\qquad $\hbox{\slshape THE}~y.\;P~y = a_1$ |
|
297 |
\postw |
|
298 |
||
299 |
As the result of another optimization, Nitpick directly assigned a value to the |
|
300 |
subterm $\textrm{THE}~y.\;P~y$, rather than to the \textit{The} constant. If we |
|
301 |
disable this second optimization by using the command |
|
302 |
||
303 |
\prew |
|
304 |
\textbf{nitpick}~[\textit{dont\_specialize},\, \textit{full\_descrs},\, |
|
305 |
\textit{show\_consts}] |
|
306 |
\postw |
|
307 |
||
308 |
we finally get \textit{The}: |
|
309 |
||
310 |
\prew |
|
311 |
\slshape Constant: \nopagebreak \\ |
|
312 |
\hbox{}\qquad $\mathit{The} = \undef{} |
|
313 |
(\!\begin{aligned}[t]% |
|
314 |
& \{\} := a_3,\> \{a_3\} := a_3,\> \{a_2\} := a_2, \\[-2pt] %% TYPESETTING |
|
315 |
& \{a_2, a_3\} := a_1,\> \{a_1\} := a_1,\> \{a_1, a_3\} := a_3, \\[-2pt] |
|
316 |
& \{a_1, a_2\} := a_3,\> \{a_1, a_2, a_3\} := a_3)\end{aligned}$ |
|
317 |
\postw |
|
318 |
||
319 |
Notice that $\textit{The}~(\lambda y.\;P~y) = \textit{The}~\{a_2, a_3\} = a_1$, |
|
34982
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
320 |
just like before.\footnote{The Isabelle/HOL notation $f(x := |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
321 |
y)$ denotes the function that maps $x$ to $y$ and that otherwise behaves like |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
322 |
$f$.} |
33191 | 323 |
|
324 |
Our misadventures with THE suggest adding `$\exists!x{.}$' (``there exists a |
|
325 |
unique $x$ such that'') at the front of our putative lemma's assumption: |
|
326 |
||
327 |
\prew |
|
328 |
\textbf{lemma}~``$\exists {!}x.\; P~x\,\Longrightarrow\, P~(\textrm{THE}~y.\;P~y)$'' |
|
329 |
\postw |
|
330 |
||
331 |
The fix appears to work: |
|
332 |
||
333 |
\prew |
|
334 |
\textbf{nitpick} \\[2\smallskipamount] |
|
335 |
\slshape Nitpick found no counterexample. |
|
336 |
\postw |
|
337 |
||
338 |
We can further increase our confidence in the formula by exhausting all |
|
339 |
cardinalities up to 50: |
|
340 |
||
341 |
\prew |
|
342 |
\textbf{nitpick} [\textit{card} $'a$~= 1--50]\footnote{The symbol `--' |
|
343 |
can be entered as \texttt{-} (hyphen) or |
|
344 |
\texttt{\char`\\\char`\<midarrow\char`\>}.} \\[2\smallskipamount] |
|
345 |
\slshape Nitpick found no counterexample. |
|
346 |
\postw |
|
347 |
||
348 |
Let's see if Sledgehammer \cite{sledgehammer-2009} can find a proof: |
|
349 |
||
350 |
\prew |
|
351 |
\textbf{sledgehammer} \\[2\smallskipamount] |
|
352 |
{\slshape Sledgehammer: external prover ``$e$'' for subgoal 1: \\ |
|
353 |
$\exists{!}x.\; P~x\,\Longrightarrow\, P~(\hbox{\slshape THE}~y.\; P~y)$ \\ |
|
354 |
Try this command: \textrm{apply}~(\textit{metis~the\_equality})} \\[2\smallskipamount] |
|
355 |
\textbf{apply}~(\textit{metis~the\_equality\/}) \nopagebreak \\[2\smallskipamount] |
|
356 |
{\slshape No subgoals!}% \\[2\smallskipamount] |
|
357 |
%\textbf{done} |
|
358 |
\postw |
|
359 |
||
360 |
This must be our lucky day. |
|
361 |
||
362 |
\subsection{Skolemization} |
|
363 |
\label{skolemization} |
|
364 |
||
365 |
Are all invertible functions onto? Let's find out: |
|
366 |
||
367 |
\prew |
|
368 |
\textbf{lemma} ``$\exists g.\; \forall x.~g~(f~x) = x |
|
369 |
\,\Longrightarrow\, \forall y.\; \exists x.~y = f~x$'' \\ |
|
370 |
\textbf{nitpick} \\[2\smallskipamount] |
|
371 |
\slshape |
|
372 |
Nitpick found a counterexample for \textit{card} $'a$~= 2 and \textit{card} $'b$~=~1: \\[2\smallskipamount] |
|
373 |
\hbox{}\qquad Free variable: \nopagebreak \\ |
|
374 |
\hbox{}\qquad\qquad $f = \undef{}(b_1 := a_1)$ \\ |
|
375 |
\hbox{}\qquad Skolem constants: \nopagebreak \\ |
|
376 |
\hbox{}\qquad\qquad $g = \undef{}(a_1 := b_1,\> a_2 := b_1)$ \\ |
|
377 |
\hbox{}\qquad\qquad $y = a_2$ |
|
378 |
\postw |
|
379 |
||
380 |
Although $f$ is the only free variable occurring in the formula, Nitpick also |
|
381 |
displays values for the bound variables $g$ and $y$. These values are available |
|
382 |
to Nitpick because it performs skolemization as a preprocessing step. |
|
383 |
||
384 |
In the previous example, skolemization only affected the outermost quantifiers. |
|
385 |
This is not always the case, as illustrated below: |
|
386 |
||
387 |
\prew |
|
388 |
\textbf{lemma} ``$\exists x.\; \forall f.\; f~x = x$'' \\ |
|
389 |
\textbf{nitpick} \\[2\smallskipamount] |
|
390 |
\slshape |
|
391 |
Nitpick found a counterexample for \textit{card} $'a$~= 2: \\[2\smallskipamount] |
|
392 |
\hbox{}\qquad Skolem constant: \nopagebreak \\ |
|
393 |
\hbox{}\qquad\qquad $\lambda x.\; f = |
|
394 |
\undef{}(\!\begin{aligned}[t] |
|
395 |
& a_1 := \undef{}(a_1 := a_2,\> a_2 := a_1), \\[-2pt] |
|
396 |
& a_2 := \undef{}(a_1 := a_1,\> a_2 := a_1))\end{aligned}$ |
|
397 |
\postw |
|
398 |
||
399 |
The variable $f$ is bound within the scope of $x$; therefore, $f$ depends on |
|
400 |
$x$, as suggested by the notation $\lambda x.\,f$. If $x = a_1$, then $f$ is the |
|
401 |
function that maps $a_1$ to $a_2$ and vice versa; otherwise, $x = a_2$ and $f$ |
|
402 |
maps both $a_1$ and $a_2$ to $a_1$. In both cases, $f~x \not= x$. |
|
403 |
||
404 |
The source of the Skolem constants is sometimes more obscure: |
|
405 |
||
406 |
\prew |
|
407 |
\textbf{lemma} ``$\mathit{refl}~r\,\Longrightarrow\, \mathit{sym}~r$'' \\ |
|
408 |
\textbf{nitpick} \\[2\smallskipamount] |
|
409 |
\slshape |
|
410 |
Nitpick found a counterexample for \textit{card} $'a$~= 2: \\[2\smallskipamount] |
|
411 |
\hbox{}\qquad Free variable: \nopagebreak \\ |
|
412 |
\hbox{}\qquad\qquad $r = \{(a_1, a_1),\, (a_2, a_1),\, (a_2, a_2)\}$ \\ |
|
413 |
\hbox{}\qquad Skolem constants: \nopagebreak \\ |
|
414 |
\hbox{}\qquad\qquad $\mathit{sym}.x = a_2$ \\ |
|
415 |
\hbox{}\qquad\qquad $\mathit{sym}.y = a_1$ |
|
416 |
\postw |
|
417 |
||
418 |
What happened here is that Nitpick expanded the \textit{sym} constant to its |
|
419 |
definition: |
|
420 |
||
421 |
\prew |
|
422 |
$\mathit{sym}~r \,\equiv\, |
|
423 |
\forall x\> y.\,\> (x, y) \in r \longrightarrow (y, x) \in r.$ |
|
424 |
\postw |
|
425 |
||
426 |
As their names suggest, the Skolem constants $\mathit{sym}.x$ and |
|
427 |
$\mathit{sym}.y$ are simply the bound variables $x$ and $y$ |
|
428 |
from \textit{sym}'s definition. |
|
429 |
||
430 |
Although skolemization is a useful optimization, you can disable it by invoking |
|
431 |
Nitpick with \textit{dont\_skolemize}. See \S\ref{optimizations} for details. |
|
432 |
||
433 |
\subsection{Natural Numbers and Integers} |
|
434 |
\label{natural-numbers-and-integers} |
|
435 |
||
436 |
Because of the axiom of infinity, the type \textit{nat} does not admit any |
|
34124
c4628a1dcf75
added support for binary nat/int representation to Nitpick
blanchet
parents:
34038
diff
changeset
|
437 |
finite models. To deal with this, Nitpick's approach is to consider finite |
c4628a1dcf75
added support for binary nat/int representation to Nitpick
blanchet
parents:
34038
diff
changeset
|
438 |
subsets $N$ of \textit{nat} and maps all numbers $\notin N$ to the undefined |
c4628a1dcf75
added support for binary nat/int representation to Nitpick
blanchet
parents:
34038
diff
changeset
|
439 |
value (displayed as `$\unk$'). The type \textit{int} is handled similarly. |
c4628a1dcf75
added support for binary nat/int representation to Nitpick
blanchet
parents:
34038
diff
changeset
|
440 |
Internally, undefined values lead to a three-valued logic. |
33191 | 441 |
|
442 |
Here is an example involving \textit{int}: |
|
443 |
||
444 |
\prew |
|
445 |
\textbf{lemma} ``$\lbrakk i \le j;\> n \le (m{\Colon}\mathit{int})\rbrakk \,\Longrightarrow\, i * n + j * m \le i * m + j * n$'' \\ |
|
446 |
\textbf{nitpick} \\[2\smallskipamount] |
|
447 |
\slshape Nitpick found a counterexample: \\[2\smallskipamount] |
|
448 |
\hbox{}\qquad Free variables: \nopagebreak \\ |
|
449 |
\hbox{}\qquad\qquad $i = 0$ \\ |
|
450 |
\hbox{}\qquad\qquad $j = 1$ \\ |
|
451 |
\hbox{}\qquad\qquad $m = 1$ \\ |
|
452 |
\hbox{}\qquad\qquad $n = 0$ |
|
453 |
\postw |
|
454 |
||
34124
c4628a1dcf75
added support for binary nat/int representation to Nitpick
blanchet
parents:
34038
diff
changeset
|
455 |
Internally, Nitpick uses either a unary or a binary representation of numbers. |
c4628a1dcf75
added support for binary nat/int representation to Nitpick
blanchet
parents:
34038
diff
changeset
|
456 |
The unary representation is more efficient but only suitable for numbers very |
c4628a1dcf75
added support for binary nat/int representation to Nitpick
blanchet
parents:
34038
diff
changeset
|
457 |
close to zero. By default, Nitpick attempts to choose the more appropriate |
c4628a1dcf75
added support for binary nat/int representation to Nitpick
blanchet
parents:
34038
diff
changeset
|
458 |
encoding by inspecting the formula at hand. This behavior can be overridden by |
c4628a1dcf75
added support for binary nat/int representation to Nitpick
blanchet
parents:
34038
diff
changeset
|
459 |
passing either \textit{unary\_ints} or \textit{binary\_ints} as option. For |
c4628a1dcf75
added support for binary nat/int representation to Nitpick
blanchet
parents:
34038
diff
changeset
|
460 |
binary notation, the number of bits to use can be specified using |
c4628a1dcf75
added support for binary nat/int representation to Nitpick
blanchet
parents:
34038
diff
changeset
|
461 |
the \textit{bits} option. For example: |
c4628a1dcf75
added support for binary nat/int representation to Nitpick
blanchet
parents:
34038
diff
changeset
|
462 |
|
c4628a1dcf75
added support for binary nat/int representation to Nitpick
blanchet
parents:
34038
diff
changeset
|
463 |
\prew |
c4628a1dcf75
added support for binary nat/int representation to Nitpick
blanchet
parents:
34038
diff
changeset
|
464 |
\textbf{nitpick} [\textit{binary\_ints}, \textit{bits}${} = 16$] |
c4628a1dcf75
added support for binary nat/int representation to Nitpick
blanchet
parents:
34038
diff
changeset
|
465 |
\postw |
c4628a1dcf75
added support for binary nat/int representation to Nitpick
blanchet
parents:
34038
diff
changeset
|
466 |
|
33191 | 467 |
With infinite types, we don't always have the luxury of a genuine counterexample |
468 |
and must often content ourselves with a potential one. The tedious task of |
|
469 |
finding out whether the potential counterexample is in fact genuine can be |
|
34124
c4628a1dcf75
added support for binary nat/int representation to Nitpick
blanchet
parents:
34038
diff
changeset
|
470 |
outsourced to \textit{auto} by passing \textit{check\_potential}. For example: |
33191 | 471 |
|
472 |
\prew |
|
473 |
\textbf{lemma} ``$\forall n.\; \textit{Suc}~n \mathbin{\not=} n \,\Longrightarrow\, P$'' \\ |
|
34124
c4628a1dcf75
added support for binary nat/int representation to Nitpick
blanchet
parents:
34038
diff
changeset
|
474 |
\textbf{nitpick} [\textit{card~nat}~= 100, \textit{check\_potential}] \\[2\smallskipamount] |
33191 | 475 |
\slshape Nitpick found a potential counterexample: \\[2\smallskipamount] |
476 |
\hbox{}\qquad Free variable: \nopagebreak \\ |
|
477 |
\hbox{}\qquad\qquad $P = \textit{False}$ \\[2\smallskipamount] |
|
478 |
Confirmation by ``\textit{auto}'': The above counterexample is genuine. |
|
479 |
\postw |
|
480 |
||
481 |
You might wonder why the counterexample is first reported as potential. The root |
|
482 |
of the problem is that the bound variable in $\forall n.\; \textit{Suc}~n |
|
483 |
\mathbin{\not=} n$ ranges over an infinite type. If Nitpick finds an $n$ such |
|
484 |
that $\textit{Suc}~n \mathbin{=} n$, it evaluates the assumption to |
|
485 |
\textit{False}; but otherwise, it does not know anything about values of $n \ge |
|
486 |
\textit{card~nat}$ and must therefore evaluate the assumption to $\unk$, not |
|
487 |
\textit{True}. Since the assumption can never be satisfied, the putative lemma |
|
488 |
can never be falsified. |
|
489 |
||
490 |
Incidentally, if you distrust the so-called genuine counterexamples, you can |
|
491 |
enable \textit{check\_\allowbreak genuine} to verify them as well. However, be |
|
34124
c4628a1dcf75
added support for binary nat/int representation to Nitpick
blanchet
parents:
34038
diff
changeset
|
492 |
aware that \textit{auto} will usually fail to prove that the counterexample is |
33191 | 493 |
genuine or spurious. |
494 |
||
495 |
Some conjectures involving elementary number theory make Nitpick look like a |
|
496 |
giant with feet of clay: |
|
497 |
||
498 |
\prew |
|
499 |
\textbf{lemma} ``$P~\textit{Suc}$'' \\ |
|
500 |
\textbf{nitpick} [\textit{card} = 1--6] \\[2\smallskipamount] |
|
501 |
\slshape |
|
502 |
Nitpick found no counterexample. |
|
503 |
\postw |
|
504 |
||
34124
c4628a1dcf75
added support for binary nat/int representation to Nitpick
blanchet
parents:
34038
diff
changeset
|
505 |
On any finite set $N$, \textit{Suc} is a partial function; for example, if $N = |
c4628a1dcf75
added support for binary nat/int representation to Nitpick
blanchet
parents:
34038
diff
changeset
|
506 |
\{0, 1, \ldots, k\}$, then \textit{Suc} is $\{0 \mapsto 1,\, 1 \mapsto 2,\, |
c4628a1dcf75
added support for binary nat/int representation to Nitpick
blanchet
parents:
34038
diff
changeset
|
507 |
\ldots,\, k \mapsto \unk\}$, which evaluates to $\unk$ when passed as |
c4628a1dcf75
added support for binary nat/int representation to Nitpick
blanchet
parents:
34038
diff
changeset
|
508 |
argument to $P$. As a result, $P~\textit{Suc}$ is always $\unk$. The next |
c4628a1dcf75
added support for binary nat/int representation to Nitpick
blanchet
parents:
34038
diff
changeset
|
509 |
example is similar: |
33191 | 510 |
|
511 |
\prew |
|
512 |
\textbf{lemma} ``$P~(\textit{op}~{+}\Colon |
|
513 |
\textit{nat}\mathbin{\Rightarrow}\textit{nat}\mathbin{\Rightarrow}\textit{nat})$'' \\ |
|
514 |
\textbf{nitpick} [\textit{card nat} = 1] \\[2\smallskipamount] |
|
515 |
{\slshape Nitpick found a counterexample:} \\[2\smallskipamount] |
|
516 |
\hbox{}\qquad Free variable: \nopagebreak \\ |
|
517 |
\hbox{}\qquad\qquad $P = \{\}$ \\[2\smallskipamount] |
|
518 |
\textbf{nitpick} [\textit{card nat} = 2] \\[2\smallskipamount] |
|
519 |
{\slshape Nitpick found no counterexample.} |
|
520 |
\postw |
|
521 |
||
522 |
The problem here is that \textit{op}~+ is total when \textit{nat} is taken to be |
|
523 |
$\{0\}$ but becomes partial as soon as we add $1$, because $1 + 1 \notin \{0, |
|
524 |
1\}$. |
|
525 |
||
526 |
Because numbers are infinite and are approximated using a three-valued logic, |
|
527 |
there is usually no need to systematically enumerate domain sizes. If Nitpick |
|
528 |
cannot find a genuine counterexample for \textit{card~nat}~= $k$, it is very |
|
529 |
unlikely that one could be found for smaller domains. (The $P~(\textit{op}~{+})$ |
|
530 |
example above is an exception to this principle.) Nitpick nonetheless enumerates |
|
531 |
all cardinalities from 1 to 8 for \textit{nat}, mainly because smaller |
|
532 |
cardinalities are fast to handle and give rise to simpler counterexamples. This |
|
533 |
is explained in more detail in \S\ref{scope-monotonicity}. |
|
534 |
||
535 |
\subsection{Inductive Datatypes} |
|
536 |
\label{inductive-datatypes} |
|
537 |
||
538 |
Like natural numbers and integers, inductive datatypes with recursive |
|
539 |
constructors admit no finite models and must be approximated by a subterm-closed |
|
540 |
subset. For example, using a cardinality of 10 for ${'}a~\textit{list}$, |
|
541 |
Nitpick looks for all counterexamples that can be built using at most 10 |
|
542 |
different lists. |
|
543 |
||
544 |
Let's see with an example involving \textit{hd} (which returns the first element |
|
545 |
of a list) and $@$ (which concatenates two lists): |
|
546 |
||
547 |
\prew |
|
548 |
\textbf{lemma} ``$\textit{hd}~(\textit{xs} \mathbin{@} [y, y]) = \textit{hd}~\textit{xs}$'' \\ |
|
549 |
\textbf{nitpick} \\[2\smallskipamount] |
|
550 |
\slshape Nitpick found a counterexample for \textit{card} $'a$~= 3: \\[2\smallskipamount] |
|
551 |
\hbox{}\qquad Free variables: \nopagebreak \\ |
|
552 |
\hbox{}\qquad\qquad $\textit{xs} = []$ \\ |
|
553 |
\hbox{}\qquad\qquad $\textit{y} = a_3$ |
|
554 |
\postw |
|
555 |
||
556 |
To see why the counterexample is genuine, we enable \textit{show\_consts} |
|
557 |
and \textit{show\_\allowbreak datatypes}: |
|
558 |
||
559 |
\prew |
|
560 |
{\slshape Datatype:} \\ |
|
561 |
\hbox{}\qquad $'a$~\textit{list}~= $\{[],\, [a_3, a_3],\, [a_3],\, \unr\}$ \\ |
|
562 |
{\slshape Constants:} \\ |
|
34982
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
563 |
\hbox{}\qquad $\lambda x_1.\; x_1 \mathbin{@} [y, y] = \undef([] := [a_3, a_3])$ \\ |
33191 | 564 |
\hbox{}\qquad $\textit{hd} = \undef([] := a_2,\> [a_3, a_3] := a_3,\> [a_3] := a_3)$ |
565 |
\postw |
|
566 |
||
567 |
Since $\mathit{hd}~[]$ is undefined in the logic, it may be given any value, |
|
568 |
including $a_2$. |
|
569 |
||
570 |
The second constant, $\lambda x_1.\; x_1 \mathbin{@} [y, y]$, is simply the |
|
571 |
append operator whose second argument is fixed to be $[y, y]$. Appending $[a_3, |
|
572 |
a_3]$ to $[a_3]$ would normally give $[a_3, a_3, a_3]$, but this value is not |
|
573 |
representable in the subset of $'a$~\textit{list} considered by Nitpick, which |
|
574 |
is shown under the ``Datatype'' heading; hence the result is $\unk$. Similarly, |
|
575 |
appending $[a_3, a_3]$ to itself gives $\unk$. |
|
576 |
||
577 |
Given \textit{card}~$'a = 3$ and \textit{card}~$'a~\textit{list} = 3$, Nitpick |
|
578 |
considers the following subsets: |
|
579 |
||
580 |
\kern-.5\smallskipamount %% TYPESETTING |
|
581 |
||
582 |
\prew |
|
583 |
\begin{multicols}{3} |
|
584 |
$\{[],\, [a_1],\, [a_2]\}$; \\ |
|
585 |
$\{[],\, [a_1],\, [a_3]\}$; \\ |
|
586 |
$\{[],\, [a_2],\, [a_3]\}$; \\ |
|
587 |
$\{[],\, [a_1],\, [a_1, a_1]\}$; \\ |
|
588 |
$\{[],\, [a_1],\, [a_2, a_1]\}$; \\ |
|
589 |
$\{[],\, [a_1],\, [a_3, a_1]\}$; \\ |
|
590 |
$\{[],\, [a_2],\, [a_1, a_2]\}$; \\ |
|
591 |
$\{[],\, [a_2],\, [a_2, a_2]\}$; \\ |
|
592 |
$\{[],\, [a_2],\, [a_3, a_2]\}$; \\ |
|
593 |
$\{[],\, [a_3],\, [a_1, a_3]\}$; \\ |
|
594 |
$\{[],\, [a_3],\, [a_2, a_3]\}$; \\ |
|
595 |
$\{[],\, [a_3],\, [a_3, a_3]\}$. |
|
596 |
\end{multicols} |
|
597 |
\postw |
|
598 |
||
599 |
\kern-2\smallskipamount %% TYPESETTING |
|
600 |
||
601 |
All subterm-closed subsets of $'a~\textit{list}$ consisting of three values |
|
602 |
are listed and only those. As an example of a non-subterm-closed subset, |
|
603 |
consider $\mathcal{S} = \{[],\, [a_1],\,\allowbreak [a_1, a_3]\}$, and observe |
|
604 |
that $[a_1, a_3]$ (i.e., $a_1 \mathbin{\#} [a_3]$) has $[a_3] \notin |
|
605 |
\mathcal{S}$ as a subterm. |
|
606 |
||
607 |
Here's another m\"ochtegern-lemma that Nitpick can refute without a blink: |
|
608 |
||
609 |
\prew |
|
610 |
\textbf{lemma} ``$\lbrakk \textit{length}~\textit{xs} = 1;\> \textit{length}~\textit{ys} = 1 |
|
611 |
\rbrakk \,\Longrightarrow\, \textit{xs} = \textit{ys}$'' |
|
612 |
\\ |
|
613 |
\textbf{nitpick} [\textit{show\_datatypes}] \\[2\smallskipamount] |
|
614 |
\slshape Nitpick found a counterexample for \textit{card} $'a$~= 3: \\[2\smallskipamount] |
|
615 |
\hbox{}\qquad Free variables: \nopagebreak \\ |
|
616 |
\hbox{}\qquad\qquad $\textit{xs} = [a_2]$ \\ |
|
617 |
\hbox{}\qquad\qquad $\textit{ys} = [a_3]$ \\ |
|
618 |
\hbox{}\qquad Datatypes: \\ |
|
619 |
\hbox{}\qquad\qquad $\textit{nat} = \{0,\, 1,\, 2,\, \unr\}$ \\ |
|
620 |
\hbox{}\qquad\qquad $'a$~\textit{list} = $\{[],\, [a_3],\, [a_2],\, \unr\}$ |
|
621 |
\postw |
|
622 |
||
623 |
Because datatypes are approximated using a three-valued logic, there is usually |
|
624 |
no need to systematically enumerate cardinalities: If Nitpick cannot find a |
|
625 |
genuine counterexample for \textit{card}~$'a~\textit{list}$~= 10, it is very |
|
626 |
unlikely that one could be found for smaller cardinalities. |
|
627 |
||
628 |
\subsection{Typedefs, Records, Rationals, and Reals} |
|
629 |
\label{typedefs-records-rationals-and-reals} |
|
630 |
||
631 |
Nitpick generally treats types declared using \textbf{typedef} as datatypes |
|
632 |
whose single constructor is the corresponding \textit{Abs\_\kern.1ex} function. |
|
633 |
For example: |
|
634 |
||
635 |
\prew |
|
636 |
\textbf{typedef}~\textit{three} = ``$\{0\Colon\textit{nat},\, 1,\, 2\}$'' \\ |
|
637 |
\textbf{by}~\textit{blast} \\[2\smallskipamount] |
|
638 |
\textbf{definition}~$A \mathbin{\Colon} \textit{three}$ \textbf{where} ``\kern-.1em$A \,\equiv\, \textit{Abs\_\allowbreak three}~0$'' \\ |
|
639 |
\textbf{definition}~$B \mathbin{\Colon} \textit{three}$ \textbf{where} ``$B \,\equiv\, \textit{Abs\_three}~1$'' \\ |
|
640 |
\textbf{definition}~$C \mathbin{\Colon} \textit{three}$ \textbf{where} ``$C \,\equiv\, \textit{Abs\_three}~2$'' \\[2\smallskipamount] |
|
641 |
\textbf{lemma} ``$\lbrakk P~A;\> P~B\rbrakk \,\Longrightarrow\, P~x$'' \\ |
|
642 |
\textbf{nitpick} [\textit{show\_datatypes}] \\[2\smallskipamount] |
|
643 |
\slshape Nitpick found a counterexample: \\[2\smallskipamount] |
|
644 |
\hbox{}\qquad Free variables: \nopagebreak \\ |
|
645 |
\hbox{}\qquad\qquad $P = \{\Abs{1},\, \Abs{0}\}$ \\ |
|
646 |
\hbox{}\qquad\qquad $x = \Abs{2}$ \\ |
|
647 |
\hbox{}\qquad Datatypes: \\ |
|
648 |
\hbox{}\qquad\qquad $\textit{nat} = \{0,\, 1,\, 2,\, \unr\}$ \\ |
|
649 |
\hbox{}\qquad\qquad $\textit{three} = \{\Abs{2},\, \Abs{1},\, \Abs{0},\, \unr\}$ |
|
650 |
\postw |
|
651 |
||
652 |
%% MARK |
|
653 |
In the output above, $\Abs{n}$ abbreviates $\textit{Abs\_three}~n$. |
|
654 |
||
655 |
%% MARK |
|
656 |
Records, which are implemented as \textbf{typedef}s behind the scenes, are |
|
657 |
handled in much the same way: |
|
658 |
||
659 |
\prew |
|
660 |
\textbf{record} \textit{point} = \\ |
|
661 |
\hbox{}\quad $\textit{Xcoord} \mathbin{\Colon} \textit{int}$ \\ |
|
662 |
\hbox{}\quad $\textit{Ycoord} \mathbin{\Colon} \textit{int}$ \\[2\smallskipamount] |
|
663 |
\textbf{lemma} ``$\textit{Xcoord}~(p\Colon\textit{point}) = \textit{Xcoord}~(q\Colon\textit{point})$'' \\ |
|
664 |
\textbf{nitpick} [\textit{show\_datatypes}] \\[2\smallskipamount] |
|
665 |
\slshape Nitpick found a counterexample: \\[2\smallskipamount] |
|
666 |
\hbox{}\qquad Free variables: \nopagebreak \\ |
|
667 |
\hbox{}\qquad\qquad $p = \lparr\textit{Xcoord} = 0,\> \textit{Ycoord} = 0\rparr$ \\ |
|
668 |
\hbox{}\qquad\qquad $q = \lparr\textit{Xcoord} = 1,\> \textit{Ycoord} = 1\rparr$ \\ |
|
669 |
\hbox{}\qquad Datatypes: \\ |
|
670 |
\hbox{}\qquad\qquad $\textit{int} = \{0,\, 1,\, \unr\}$ \\ |
|
671 |
\hbox{}\qquad\qquad $\textit{point} = \{\lparr\textit{Xcoord} = 1,\> |
|
672 |
\textit{Ycoord} = 1\rparr,\> \lparr\textit{Xcoord} = 0,\> \textit{Ycoord} = 0\rparr,\, \unr\}$\kern-1pt %% QUIET |
|
673 |
\postw |
|
674 |
||
675 |
Finally, Nitpick provides rudimentary support for rationals and reals using a |
|
676 |
similar approach: |
|
677 |
||
678 |
\prew |
|
679 |
\textbf{lemma} ``$4 * x + 3 * (y\Colon\textit{real}) \not= 1/2$'' \\ |
|
680 |
\textbf{nitpick} [\textit{show\_datatypes}] \\[2\smallskipamount] |
|
681 |
\slshape Nitpick found a counterexample: \\[2\smallskipamount] |
|
682 |
\hbox{}\qquad Free variables: \nopagebreak \\ |
|
683 |
\hbox{}\qquad\qquad $x = 1/2$ \\ |
|
684 |
\hbox{}\qquad\qquad $y = -1/2$ \\ |
|
685 |
\hbox{}\qquad Datatypes: \\ |
|
686 |
\hbox{}\qquad\qquad $\textit{nat} = \{0,\, 1,\, 2,\, 3,\, 4,\, 5,\, 6,\, 7,\, \unr\}$ \\ |
|
687 |
\hbox{}\qquad\qquad $\textit{int} = \{0,\, 1,\, 2,\, 3,\, 4,\, -3,\, -2,\, -1,\, \unr\}$ \\ |
|
688 |
\hbox{}\qquad\qquad $\textit{real} = \{1,\, 0,\, 4,\, -3/2,\, 3,\, 2,\, 1/2,\, -1/2,\, \unr\}$ |
|
689 |
\postw |
|
690 |
||
691 |
\subsection{Inductive and Coinductive Predicates} |
|
692 |
\label{inductive-and-coinductive-predicates} |
|
693 |
||
694 |
Inductively defined predicates (and sets) are particularly problematic for |
|
695 |
counterexample generators. They can make Quickcheck~\cite{berghofer-nipkow-2004} |
|
696 |
loop forever and Refute~\cite{weber-2008} run out of resources. The crux of |
|
697 |
the problem is that they are defined using a least fixed point construction. |
|
698 |
||
699 |
Nitpick's philosophy is that not all inductive predicates are equal. Consider |
|
700 |
the \textit{even} predicate below: |
|
701 |
||
702 |
\prew |
|
703 |
\textbf{inductive}~\textit{even}~\textbf{where} \\ |
|
704 |
``\textit{even}~0'' $\,\mid$ \\ |
|
705 |
``\textit{even}~$n\,\Longrightarrow\, \textit{even}~(\textit{Suc}~(\textit{Suc}~n))$'' |
|
706 |
\postw |
|
707 |
||
708 |
This predicate enjoys the desirable property of being well-founded, which means |
|
709 |
that the introduction rules don't give rise to infinite chains of the form |
|
710 |
||
711 |
\prew |
|
712 |
$\cdots\,\Longrightarrow\, \textit{even}~k'' |
|
713 |
\,\Longrightarrow\, \textit{even}~k' |
|
714 |
\,\Longrightarrow\, \textit{even}~k.$ |
|
715 |
\postw |
|
716 |
||
717 |
For \textit{even}, this is obvious: Any chain ending at $k$ will be of length |
|
718 |
$k/2 + 1$: |
|
719 |
||
720 |
\prew |
|
721 |
$\textit{even}~0\,\Longrightarrow\, \textit{even}~2\,\Longrightarrow\, \cdots |
|
722 |
\,\Longrightarrow\, \textit{even}~(k - 2) |
|
723 |
\,\Longrightarrow\, \textit{even}~k.$ |
|
724 |
\postw |
|
725 |
||
726 |
Wellfoundedness is desirable because it enables Nitpick to use a very efficient |
|
727 |
fixed point computation.% |
|
728 |
\footnote{If an inductive predicate is |
|
729 |
well-founded, then it has exactly one fixed point, which is simultaneously the |
|
730 |
least and the greatest fixed point. In these circumstances, the computation of |
|
731 |
the least fixed point amounts to the computation of an arbitrary fixed point, |
|
732 |
which can be performed using a straightforward recursive equation.} |
|
733 |
Moreover, Nitpick can prove wellfoundedness of most well-founded predicates, |
|
734 |
just as Isabelle's \textbf{function} package usually discharges termination |
|
735 |
proof obligations automatically. |
|
736 |
||
737 |
Let's try an example: |
|
738 |
||
739 |
\prew |
|
740 |
\textbf{lemma} ``$\exists n.\; \textit{even}~n \mathrel{\land} \textit{even}~(\textit{Suc}~n)$'' \\ |
|
34126 | 741 |
\textbf{nitpick}~[\textit{card nat}~= 100, \textit{unary\_ints}, \textit{verbose}] \\[2\smallskipamount] |
33191 | 742 |
\slshape The inductive predicate ``\textit{even}'' was proved well-founded. |
743 |
Nitpick can compute it efficiently. \\[2\smallskipamount] |
|
744 |
Trying 1 scope: \\ |
|
745 |
\hbox{}\qquad \textit{card nat}~= 100. \\[2\smallskipamount] |
|
746 |
Nitpick found a potential counterexample for \textit{card nat}~= 100: \\[2\smallskipamount] |
|
747 |
\hbox{}\qquad Empty assignment \\[2\smallskipamount] |
|
748 |
Nitpick could not find a better counterexample. \\[2\smallskipamount] |
|
749 |
Total time: 2274 ms. |
|
750 |
\postw |
|
751 |
||
752 |
No genuine counterexample is possible because Nitpick cannot rule out the |
|
753 |
existence of a natural number $n \ge 100$ such that both $\textit{even}~n$ and |
|
754 |
$\textit{even}~(\textit{Suc}~n)$ are true. To help Nitpick, we can bound the |
|
755 |
existential quantifier: |
|
756 |
||
757 |
\prew |
|
758 |
\textbf{lemma} ``$\exists n \mathbin{\le} 99.\; \textit{even}~n \mathrel{\land} \textit{even}~(\textit{Suc}~n)$'' \\ |
|
34126 | 759 |
\textbf{nitpick}~[\textit{card nat}~= 100, \textit{unary\_ints}] \\[2\smallskipamount] |
33191 | 760 |
\slshape Nitpick found a counterexample: \\[2\smallskipamount] |
761 |
\hbox{}\qquad Empty assignment |
|
762 |
\postw |
|
763 |
||
764 |
So far we were blessed by the wellfoundedness of \textit{even}. What happens if |
|
765 |
we use the following definition instead? |
|
766 |
||
767 |
\prew |
|
768 |
\textbf{inductive} $\textit{even}'$ \textbf{where} \\ |
|
769 |
``$\textit{even}'~(0{\Colon}\textit{nat})$'' $\,\mid$ \\ |
|
770 |
``$\textit{even}'~2$'' $\,\mid$ \\ |
|
771 |
``$\lbrakk\textit{even}'~m;\> \textit{even}'~n\rbrakk \,\Longrightarrow\, \textit{even}'~(m + n)$'' |
|
772 |
\postw |
|
773 |
||
774 |
This definition is not well-founded: From $\textit{even}'~0$ and |
|
775 |
$\textit{even}'~0$, we can derive that $\textit{even}'~0$. Nonetheless, the |
|
776 |
predicates $\textit{even}$ and $\textit{even}'$ are equivalent. |
|
777 |
||
778 |
Let's check a property involving $\textit{even}'$. To make up for the |
|
779 |
foreseeable computational hurdles entailed by non-wellfoundedness, we decrease |
|
780 |
\textit{nat}'s cardinality to a mere 10: |
|
781 |
||
782 |
\prew |
|
783 |
\textbf{lemma}~``$\exists n \in \{0, 2, 4, 6, 8\}.\; |
|
784 |
\lnot\;\textit{even}'~n$'' \\ |
|
785 |
\textbf{nitpick}~[\textit{card nat}~= 10,\, \textit{verbose},\, \textit{show\_consts}] \\[2\smallskipamount] |
|
786 |
\slshape |
|
787 |
The inductive predicate ``$\textit{even}'\!$'' could not be proved well-founded. |
|
788 |
Nitpick might need to unroll it. \\[2\smallskipamount] |
|
789 |
Trying 6 scopes: \\ |
|
790 |
\hbox{}\qquad \textit{card nat}~= 10 and \textit{iter} $\textit{even}'$~= 0; \\ |
|
791 |
\hbox{}\qquad \textit{card nat}~= 10 and \textit{iter} $\textit{even}'$~= 1; \\ |
|
792 |
\hbox{}\qquad \textit{card nat}~= 10 and \textit{iter} $\textit{even}'$~= 2; \\ |
|
793 |
\hbox{}\qquad \textit{card nat}~= 10 and \textit{iter} $\textit{even}'$~= 4; \\ |
|
794 |
\hbox{}\qquad \textit{card nat}~= 10 and \textit{iter} $\textit{even}'$~= 8; \\ |
|
795 |
\hbox{}\qquad \textit{card nat}~= 10 and \textit{iter} $\textit{even}'$~= 9. \\[2\smallskipamount] |
|
796 |
Nitpick found a counterexample for \textit{card nat}~= 10 and \textit{iter} $\textit{even}'$~= 2: \\[2\smallskipamount] |
|
797 |
\hbox{}\qquad Constant: \nopagebreak \\ |
|
798 |
\hbox{}\qquad\qquad $\lambda i.\; \textit{even}'$ = $\undef(\!\begin{aligned}[t] |
|
799 |
& 2 := \{0, 2, 4, 6, 8, 1^\Q, 3^\Q, 5^\Q, 7^\Q, 9^\Q\}, \\[-2pt] |
|
800 |
& 1 := \{0, 2, 4, 1^\Q, 3^\Q, 5^\Q, 6^\Q, 7^\Q, 8^\Q, 9^\Q\}, \\[-2pt] |
|
801 |
& 0 := \{0, 2, 1^\Q, 3^\Q, 4^\Q, 5^\Q, 6^\Q, 7^\Q, 8^\Q, 9^\Q\})\end{aligned}$ \\[2\smallskipamount] |
|
802 |
Total time: 1140 ms. |
|
803 |
\postw |
|
804 |
||
805 |
Nitpick's output is very instructive. First, it tells us that the predicate is |
|
806 |
unrolled, meaning that it is computed iteratively from the empty set. Then it |
|
807 |
lists six scopes specifying different bounds on the numbers of iterations:\ 0, |
|
808 |
1, 2, 4, 8, and~9. |
|
809 |
||
810 |
The output also shows how each iteration contributes to $\textit{even}'$. The |
|
811 |
notation $\lambda i.\; \textit{even}'$ indicates that the value of the |
|
812 |
predicate depends on an iteration counter. Iteration 0 provides the basis |
|
813 |
elements, $0$ and $2$. Iteration 1 contributes $4$ ($= 2 + 2$). Iteration 2 |
|
814 |
throws $6$ ($= 2 + 4 = 4 + 2$) and $8$ ($= 4 + 4$) into the mix. Further |
|
815 |
iterations would not contribute any new elements. |
|
816 |
||
817 |
Some values are marked with superscripted question |
|
818 |
marks~(`\lower.2ex\hbox{$^\Q$}'). These are the elements for which the |
|
819 |
predicate evaluates to $\unk$. Thus, $\textit{even}'$ evaluates to either |
|
820 |
\textit{True} or $\unk$, never \textit{False}. |
|
821 |
||
822 |
When unrolling a predicate, Nitpick tries 0, 1, 2, 4, 8, 12, 16, and 24 |
|
823 |
iterations. However, these numbers are bounded by the cardinality of the |
|
824 |
predicate's domain. With \textit{card~nat}~= 10, no more than 9 iterations are |
|
825 |
ever needed to compute the value of a \textit{nat} predicate. You can specify |
|
826 |
the number of iterations using the \textit{iter} option, as explained in |
|
827 |
\S\ref{scope-of-search}. |
|
828 |
||
829 |
In the next formula, $\textit{even}'$ occurs both positively and negatively: |
|
830 |
||
831 |
\prew |
|
832 |
\textbf{lemma} ``$\textit{even}'~(n - 2) \,\Longrightarrow\, \textit{even}'~n$'' \\ |
|
34124
c4628a1dcf75
added support for binary nat/int representation to Nitpick
blanchet
parents:
34038
diff
changeset
|
833 |
\textbf{nitpick} [\textit{card nat} = 10, \textit{show\_consts}] \\[2\smallskipamount] |
33191 | 834 |
\slshape Nitpick found a counterexample: \\[2\smallskipamount] |
835 |
\hbox{}\qquad Free variable: \nopagebreak \\ |
|
836 |
\hbox{}\qquad\qquad $n = 1$ \\ |
|
837 |
\hbox{}\qquad Constants: \nopagebreak \\ |
|
838 |
\hbox{}\qquad\qquad $\lambda i.\; \textit{even}'$ = $\undef(\!\begin{aligned}[t] |
|
839 |
& 0 := \{0, 2, 1^\Q, 3^\Q, 4^\Q, 5^\Q, 6^\Q, 7^\Q, 8^\Q, 9^\Q\})\end{aligned}$ \\ |
|
840 |
\hbox{}\qquad\qquad $\textit{even}' \subseteq \{0, 2, 4, 6, 8, \unr\}$ |
|
841 |
\postw |
|
842 |
||
843 |
Notice the special constraint $\textit{even}' \subseteq \{0,\, 2,\, 4,\, 6,\, |
|
844 |
8,\, \unr\}$ in the output, whose right-hand side represents an arbitrary |
|
845 |
fixed point (not necessarily the least one). It is used to falsify |
|
846 |
$\textit{even}'~n$. In contrast, the unrolled predicate is used to satisfy |
|
847 |
$\textit{even}'~(n - 2)$. |
|
848 |
||
849 |
Coinductive predicates are handled dually. For example: |
|
850 |
||
851 |
\prew |
|
852 |
\textbf{coinductive} \textit{nats} \textbf{where} \\ |
|
853 |
``$\textit{nats}~(x\Colon\textit{nat}) \,\Longrightarrow\, \textit{nats}~x$'' \\[2\smallskipamount] |
|
854 |
\textbf{lemma} ``$\textit{nats} = \{0, 1, 2, 3, 4\}$'' \\ |
|
855 |
\textbf{nitpick}~[\textit{card nat} = 10,\, \textit{show\_consts}] \\[2\smallskipamount] |
|
856 |
\slshape Nitpick found a counterexample: |
|
857 |
\\[2\smallskipamount] |
|
858 |
\hbox{}\qquad Constants: \nopagebreak \\ |
|
859 |
\hbox{}\qquad\qquad $\lambda i.\; \textit{nats} = \undef(0 := \{\!\begin{aligned}[t] |
|
860 |
& 0^\Q, 1^\Q, 2^\Q, 3^\Q, 4^\Q, 5^\Q, 6^\Q, 7^\Q, 8^\Q, 9^\Q, \\[-2pt] |
|
861 |
& \unr\})\end{aligned}$ \\ |
|
862 |
\hbox{}\qquad\qquad $nats \supseteq \{9, 5^\Q, 6^\Q, 7^\Q, 8^\Q, \unr\}$ |
|
863 |
\postw |
|
864 |
||
865 |
As a special case, Nitpick uses Kodkod's transitive closure operator to encode |
|
866 |
negative occurrences of non-well-founded ``linear inductive predicates,'' i.e., |
|
867 |
inductive predicates for which each the predicate occurs in at most one |
|
868 |
assumption of each introduction rule. For example: |
|
869 |
||
870 |
\prew |
|
871 |
\textbf{inductive} \textit{odd} \textbf{where} \\ |
|
872 |
``$\textit{odd}~1$'' $\,\mid$ \\ |
|
873 |
``$\lbrakk \textit{odd}~m;\>\, \textit{even}~n\rbrakk \,\Longrightarrow\, \textit{odd}~(m + n)$'' \\[2\smallskipamount] |
|
874 |
\textbf{lemma}~``$\textit{odd}~n \,\Longrightarrow\, \textit{odd}~(n - 2)$'' \\ |
|
875 |
\textbf{nitpick}~[\textit{card nat} = 10,\, \textit{show\_consts}] \\[2\smallskipamount] |
|
876 |
\slshape Nitpick found a counterexample: |
|
877 |
\\[2\smallskipamount] |
|
878 |
\hbox{}\qquad Free variable: \nopagebreak \\ |
|
879 |
\hbox{}\qquad\qquad $n = 1$ \\ |
|
880 |
\hbox{}\qquad Constants: \nopagebreak \\ |
|
881 |
\hbox{}\qquad\qquad $\textit{even} = \{0, 2, 4, 6, 8, \unr\}$ \\ |
|
882 |
\hbox{}\qquad\qquad $\textit{odd}_{\textsl{base}} = \{1, \unr\}$ \\ |
|
883 |
\hbox{}\qquad\qquad $\textit{odd}_{\textsl{step}} = \! |
|
884 |
\!\begin{aligned}[t] |
|
885 |
& \{(0, 0), (0, 2), (0, 4), (0, 6), (0, 8), (1, 1), (1, 3), (1, 5), \\[-2pt] |
|
886 |
& \phantom{\{} (1, 7), (1, 9), (2, 2), (2, 4), (2, 6), (2, 8), (3, 3), |
|
887 |
(3, 5), \\[-2pt] |
|
888 |
& \phantom{\{} (3, 7), (3, 9), (4, 4), (4, 6), (4, 8), (5, 5), (5, 7), (5, 9), \\[-2pt] |
|
889 |
& \phantom{\{} (6, 6), (6, 8), (7, 7), (7, 9), (8, 8), (9, 9), \unr\}\end{aligned}$ \\ |
|
890 |
\hbox{}\qquad\qquad $\textit{odd} \subseteq \{1, 3, 5, 7, 9, 8^\Q, \unr\}$ |
|
891 |
\postw |
|
892 |
||
893 |
\noindent |
|
894 |
In the output, $\textit{odd}_{\textrm{base}}$ represents the base elements and |
|
895 |
$\textit{odd}_{\textrm{step}}$ is a transition relation that computes new |
|
896 |
elements from known ones. The set $\textit{odd}$ consists of all the values |
|
897 |
reachable through the reflexive transitive closure of |
|
898 |
$\textit{odd}_{\textrm{step}}$ starting with any element from |
|
899 |
$\textit{odd}_{\textrm{base}}$, namely 1, 3, 5, 7, and 9. Using Kodkod's |
|
900 |
transitive closure to encode linear predicates is normally either more thorough |
|
901 |
or more efficient than unrolling (depending on the value of \textit{iter}), but |
|
902 |
for those cases where it isn't you can disable it by passing the |
|
903 |
\textit{dont\_star\_linear\_preds} option. |
|
904 |
||
905 |
\subsection{Coinductive Datatypes} |
|
906 |
\label{coinductive-datatypes} |
|
907 |
||
908 |
While Isabelle regrettably lacks a high-level mechanism for defining coinductive |
|
909 |
datatypes, the \textit{Coinductive\_List} theory provides a coinductive ``lazy |
|
910 |
list'' datatype, $'a~\textit{llist}$, defined the hard way. Nitpick supports |
|
911 |
these lazy lists seamlessly and provides a hook, described in |
|
912 |
\S\ref{registration-of-coinductive-datatypes}, to register custom coinductive |
|
913 |
datatypes. |
|
914 |
||
915 |
(Co)intuitively, a coinductive datatype is similar to an inductive datatype but |
|
916 |
allows infinite objects. Thus, the infinite lists $\textit{ps}$ $=$ $[a, a, a, |
|
917 |
\ldots]$, $\textit{qs}$ $=$ $[a, b, a, b, \ldots]$, and $\textit{rs}$ $=$ $[0, |
|
918 |
1, 2, 3, \ldots]$ can be defined as lazy lists using the |
|
919 |
$\textit{LNil}\mathbin{\Colon}{'}a~\textit{llist}$ and |
|
920 |
$\textit{LCons}\mathbin{\Colon}{'}a \mathbin{\Rightarrow} {'}a~\textit{llist} |
|
921 |
\mathbin{\Rightarrow} {'}a~\textit{llist}$ constructors. |
|
922 |
||
923 |
Although it is otherwise no friend of infinity, Nitpick can find counterexamples |
|
924 |
involving cyclic lists such as \textit{ps} and \textit{qs} above as well as |
|
925 |
finite lists: |
|
926 |
||
927 |
\prew |
|
928 |
\textbf{lemma} ``$\textit{xs} \not= \textit{LCons}~a~\textit{xs}$'' \\ |
|
929 |
\textbf{nitpick} \\[2\smallskipamount] |
|
930 |
\slshape Nitpick found a counterexample for {\itshape card}~$'a$ = 1: \\[2\smallskipamount] |
|
931 |
\hbox{}\qquad Free variables: \nopagebreak \\ |
|
932 |
\hbox{}\qquad\qquad $\textit{a} = a_1$ \\ |
|
933 |
\hbox{}\qquad\qquad $\textit{xs} = \textsl{THE}~\omega.\; \omega = \textit{LCons}~a_1~\omega$ |
|
934 |
\postw |
|
935 |
||
936 |
The notation $\textrm{THE}~\omega.\; \omega = t(\omega)$ stands |
|
937 |
for the infinite term $t(t(t(\ldots)))$. Hence, \textit{xs} is simply the |
|
938 |
infinite list $[a_1, a_1, a_1, \ldots]$. |
|
939 |
||
940 |
The next example is more interesting: |
|
941 |
||
942 |
\prew |
|
943 |
\textbf{lemma}~``$\lbrakk\textit{xs} = \textit{LCons}~a~\textit{xs};\>\, |
|
944 |
\textit{ys} = \textit{iterates}~(\lambda b.\> a)~b\rbrakk \,\Longrightarrow\, \textit{xs} = \textit{ys}$'' \\ |
|
945 |
\textbf{nitpick} [\textit{verbose}] \\[2\smallskipamount] |
|
946 |
\slshape The type ``\kern1pt$'a$'' passed the monotonicity test. Nitpick might be able to skip |
|
947 |
some scopes. \\[2\smallskipamount] |
|
948 |
Trying 8 scopes: \\ |
|
949 |
\hbox{}\qquad \textit{card} $'a$~= 1, \textit{card} ``\kern1pt$'a~\textit{list}$''~= 1, |
|
950 |
and \textit{bisim\_depth}~= 0. \\ |
|
951 |
\hbox{}\qquad $\qquad\vdots$ \\[.5\smallskipamount] |
|
952 |
\hbox{}\qquad \textit{card} $'a$~= 8, \textit{card} ``\kern1pt$'a~\textit{list}$''~= 8, |
|
953 |
and \textit{bisim\_depth}~= 7. \\[2\smallskipamount] |
|
954 |
Nitpick found a counterexample for {\itshape card}~$'a$ = 2, |
|
955 |
\textit{card}~``\kern1pt$'a~\textit{list}$''~= 2, and \textit{bisim\_\allowbreak |
|
956 |
depth}~= 1: |
|
957 |
\\[2\smallskipamount] |
|
958 |
\hbox{}\qquad Free variables: \nopagebreak \\ |
|
959 |
\hbox{}\qquad\qquad $\textit{a} = a_2$ \\ |
|
960 |
\hbox{}\qquad\qquad $\textit{b} = a_1$ \\ |
|
961 |
\hbox{}\qquad\qquad $\textit{xs} = \textsl{THE}~\omega.\; \omega = \textit{LCons}~a_2~\omega$ \\ |
|
962 |
\hbox{}\qquad\qquad $\textit{ys} = \textit{LCons}~a_1~(\textsl{THE}~\omega.\; \omega = \textit{LCons}~a_2~\omega)$ \\[2\smallskipamount] |
|
963 |
Total time: 726 ms. |
|
964 |
\postw |
|
965 |
||
966 |
The lazy list $\textit{xs}$ is simply $[a_2, a_2, a_2, \ldots]$, whereas |
|
967 |
$\textit{ys}$ is $[a_1, a_2, a_2, a_2, \ldots]$, i.e., a lasso-shaped list with |
|
968 |
$[a_1]$ as its stem and $[a_2]$ as its cycle. In general, the list segment |
|
969 |
within the scope of the {THE} binder corresponds to the lasso's cycle, whereas |
|
970 |
the segment leading to the binder is the stem. |
|
971 |
||
972 |
A salient property of coinductive datatypes is that two objects are considered |
|
973 |
equal if and only if they lead to the same observations. For example, the lazy |
|
974 |
lists $\textrm{THE}~\omega.\; \omega = |
|
975 |
\textit{LCons}~a~(\textit{LCons}~b~\omega)$ and |
|
976 |
$\textit{LCons}~a~(\textrm{THE}~\omega.\; \omega = |
|
977 |
\textit{LCons}~b~(\textit{LCons}~a~\omega))$ are identical, because both lead |
|
978 |
to the sequence of observations $a$, $b$, $a$, $b$, \hbox{\ldots} (or, |
|
979 |
equivalently, both encode the infinite list $[a, b, a, b, \ldots]$). This |
|
980 |
concept of equality for coinductive datatypes is called bisimulation and is |
|
981 |
defined coinductively. |
|
982 |
||
983 |
Internally, Nitpick encodes the coinductive bisimilarity predicate as part of |
|
984 |
the Kodkod problem to ensure that distinct objects lead to different |
|
985 |
observations. This precaution is somewhat expensive and often unnecessary, so it |
|
986 |
can be disabled by setting the \textit{bisim\_depth} option to $-1$. The |
|
987 |
bisimilarity check is then performed \textsl{after} the counterexample has been |
|
988 |
found to ensure correctness. If this after-the-fact check fails, the |
|
989 |
counterexample is tagged as ``likely genuine'' and Nitpick recommends to try |
|
990 |
again with \textit{bisim\_depth} set to a nonnegative integer. Disabling the |
|
991 |
check for the previous example saves approximately 150~milli\-seconds; the speed |
|
992 |
gains can be more significant for larger scopes. |
|
993 |
||
994 |
The next formula illustrates the need for bisimilarity (either as a Kodkod |
|
995 |
predicate or as an after-the-fact check) to prevent spurious counterexamples: |
|
996 |
||
997 |
\prew |
|
998 |
\textbf{lemma} ``$\lbrakk xs = \textit{LCons}~a~\textit{xs};\>\, \textit{ys} = \textit{LCons}~a~\textit{ys}\rbrakk |
|
999 |
\,\Longrightarrow\, \textit{xs} = \textit{ys}$'' \\ |
|
34124
c4628a1dcf75
added support for binary nat/int representation to Nitpick
blanchet
parents:
34038
diff
changeset
|
1000 |
\textbf{nitpick} [\textit{bisim\_depth} = $-1$, \textit{show\_datatypes}] \\[2\smallskipamount] |
33191 | 1001 |
\slshape Nitpick found a likely genuine counterexample for $\textit{card}~'a$ = 2: \\[2\smallskipamount] |
1002 |
\hbox{}\qquad Free variables: \nopagebreak \\ |
|
1003 |
\hbox{}\qquad\qquad $a = a_2$ \\ |
|
1004 |
\hbox{}\qquad\qquad $\textit{xs} = \textsl{THE}~\omega.\; \omega = |
|
1005 |
\textit{LCons}~a_2~\omega$ \\ |
|
1006 |
\hbox{}\qquad\qquad $\textit{ys} = \textsl{THE}~\omega.\; \omega = \textit{LCons}~a_2~\omega$ \\ |
|
1007 |
\hbox{}\qquad Codatatype:\strut \nopagebreak \\ |
|
1008 |
\hbox{}\qquad\qquad $'a~\textit{llist} = |
|
1009 |
\{\!\begin{aligned}[t] |
|
1010 |
& \textsl{THE}~\omega.\; \omega = \textit{LCons}~a_2~\omega, \\[-2pt] |
|
1011 |
& \textsl{THE}~\omega.\; \omega = \textit{LCons}~a_2~\omega,\> \unr\}\end{aligned}$ |
|
1012 |
\\[2\smallskipamount] |
|
1013 |
Try again with ``\textit{bisim\_depth}'' set to a nonnegative value to confirm |
|
1014 |
that the counterexample is genuine. \\[2\smallskipamount] |
|
1015 |
{\upshape\textbf{nitpick}} \\[2\smallskipamount] |
|
1016 |
\slshape Nitpick found no counterexample. |
|
1017 |
\postw |
|
1018 |
||
1019 |
In the first \textbf{nitpick} invocation, the after-the-fact check discovered |
|
1020 |
that the two known elements of type $'a~\textit{llist}$ are bisimilar. |
|
1021 |
||
1022 |
A compromise between leaving out the bisimilarity predicate from the Kodkod |
|
1023 |
problem and performing the after-the-fact check is to specify a lower |
|
1024 |
nonnegative \textit{bisim\_depth} value than the default one provided by |
|
1025 |
Nitpick. In general, a value of $K$ means that Nitpick will require all lists to |
|
1026 |
be distinguished from each other by their prefixes of length $K$. Be aware that |
|
1027 |
setting $K$ to a too low value can overconstrain Nitpick, preventing it from |
|
1028 |
finding any counterexamples. |
|
1029 |
||
1030 |
\subsection{Boxing} |
|
1031 |
\label{boxing} |
|
1032 |
||
1033 |
Nitpick normally maps function and product types directly to the corresponding |
|
1034 |
Kodkod concepts. As a consequence, if $'a$ has cardinality 3 and $'b$ has |
|
1035 |
cardinality 4, then $'a \times {'}b$ has cardinality 12 ($= 4 \times 3$) and $'a |
|
1036 |
\Rightarrow {'}b$ has cardinality 64 ($= 4^3$). In some circumstances, it pays |
|
1037 |
off to treat these types in the same way as plain datatypes, by approximating |
|
1038 |
them by a subset of a given cardinality. This technique is called ``boxing'' and |
|
1039 |
is particularly useful for functions passed as arguments to other functions, for |
|
1040 |
high-arity functions, and for large tuples. Under the hood, boxing involves |
|
1041 |
wrapping occurrences of the types $'a \times {'}b$ and $'a \Rightarrow {'}b$ in |
|
1042 |
isomorphic datatypes, as can be seen by enabling the \textit{debug} option. |
|
1043 |
||
1044 |
To illustrate boxing, we consider a formalization of $\lambda$-terms represented |
|
1045 |
using de Bruijn's notation: |
|
1046 |
||
1047 |
\prew |
|
1048 |
\textbf{datatype} \textit{tm} = \textit{Var}~\textit{nat}~$\mid$~\textit{Lam}~\textit{tm} $\mid$ \textit{App~tm~tm} |
|
1049 |
\postw |
|
1050 |
||
1051 |
The $\textit{lift}~t~k$ function increments all variables with indices greater |
|
1052 |
than or equal to $k$ by one: |
|
1053 |
||
1054 |
\prew |
|
1055 |
\textbf{primrec} \textit{lift} \textbf{where} \\ |
|
1056 |
``$\textit{lift}~(\textit{Var}~j)~k = \textit{Var}~(\textrm{if}~j < k~\textrm{then}~j~\textrm{else}~j + 1)$'' $\mid$ \\ |
|
1057 |
``$\textit{lift}~(\textit{Lam}~t)~k = \textit{Lam}~(\textit{lift}~t~(k + 1))$'' $\mid$ \\ |
|
1058 |
``$\textit{lift}~(\textit{App}~t~u)~k = \textit{App}~(\textit{lift}~t~k)~(\textit{lift}~u~k)$'' |
|
1059 |
\postw |
|
1060 |
||
1061 |
The $\textit{loose}~t~k$ predicate returns \textit{True} if and only if |
|
1062 |
term $t$ has a loose variable with index $k$ or more: |
|
1063 |
||
1064 |
\prew |
|
1065 |
\textbf{primrec}~\textit{loose} \textbf{where} \\ |
|
1066 |
``$\textit{loose}~(\textit{Var}~j)~k = (j \ge k)$'' $\mid$ \\ |
|
1067 |
``$\textit{loose}~(\textit{Lam}~t)~k = \textit{loose}~t~(\textit{Suc}~k)$'' $\mid$ \\ |
|
1068 |
``$\textit{loose}~(\textit{App}~t~u)~k = (\textit{loose}~t~k \mathrel{\lor} \textit{loose}~u~k)$'' |
|
1069 |
\postw |
|
1070 |
||
1071 |
Next, the $\textit{subst}~\sigma~t$ function applies the substitution $\sigma$ |
|
1072 |
on $t$: |
|
1073 |
||
1074 |
\prew |
|
1075 |
\textbf{primrec}~\textit{subst} \textbf{where} \\ |
|
1076 |
``$\textit{subst}~\sigma~(\textit{Var}~j) = \sigma~j$'' $\mid$ \\ |
|
1077 |
``$\textit{subst}~\sigma~(\textit{Lam}~t) = {}$\phantom{''} \\ |
|
1078 |
\phantom{``}$\textit{Lam}~(\textit{subst}~(\lambda n.\> \textrm{case}~n~\textrm{of}~0 \Rightarrow \textit{Var}~0 \mid \textit{Suc}~m \Rightarrow \textit{lift}~(\sigma~m)~1)~t)$'' $\mid$ \\ |
|
1079 |
``$\textit{subst}~\sigma~(\textit{App}~t~u) = \textit{App}~(\textit{subst}~\sigma~t)~(\textit{subst}~\sigma~u)$'' |
|
1080 |
\postw |
|
1081 |
||
1082 |
A substitution is a function that maps variable indices to terms. Observe that |
|
1083 |
$\sigma$ is a function passed as argument and that Nitpick can't optimize it |
|
1084 |
away, because the recursive call for the \textit{Lam} case involves an altered |
|
1085 |
version. Also notice the \textit{lift} call, which increments the variable |
|
1086 |
indices when moving under a \textit{Lam}. |
|
1087 |
||
1088 |
A reasonable property to expect of substitution is that it should leave closed |
|
1089 |
terms unchanged. Alas, even this simple property does not hold: |
|
1090 |
||
1091 |
\pre |
|
1092 |
\textbf{lemma}~``$\lnot\,\textit{loose}~t~0 \,\Longrightarrow\, \textit{subst}~\sigma~t = t$'' \\ |
|
1093 |
\textbf{nitpick} [\textit{verbose}] \\[2\smallskipamount] |
|
1094 |
\slshape |
|
1095 |
Trying 8 scopes: \nopagebreak \\ |
|
1096 |
\hbox{}\qquad \textit{card~nat}~= 1, \textit{card tm}~= 1, and \textit{card} ``$\textit{nat} \Rightarrow \textit{tm}$'' = 1; \\ |
|
1097 |
\hbox{}\qquad \textit{card~nat}~= 2, \textit{card tm}~= 2, and \textit{card} ``$\textit{nat} \Rightarrow \textit{tm}$'' = 2; \\ |
|
1098 |
\hbox{}\qquad $\qquad\vdots$ \\[.5\smallskipamount] |
|
1099 |
\hbox{}\qquad \textit{card~nat}~= 8, \textit{card tm}~= 8, and \textit{card} ``$\textit{nat} \Rightarrow \textit{tm}$'' = 8. \\[2\smallskipamount] |
|
1100 |
Nitpick found a counterexample for \textit{card~nat}~= 6, \textit{card~tm}~= 6, |
|
1101 |
and \textit{card}~``$\textit{nat} \Rightarrow \textit{tm}$''~= 6: \\[2\smallskipamount] |
|
1102 |
\hbox{}\qquad Free variables: \nopagebreak \\ |
|
1103 |
\hbox{}\qquad\qquad $\sigma = \undef(\!\begin{aligned}[t] |
|
1104 |
& 0 := \textit{Var}~0,\> |
|
1105 |
1 := \textit{Var}~0,\> |
|
1106 |
2 := \textit{Var}~0, \\[-2pt] |
|
1107 |
& 3 := \textit{Var}~0,\> |
|
1108 |
4 := \textit{Var}~0,\> |
|
1109 |
5 := \textit{Var}~0)\end{aligned}$ \\ |
|
1110 |
\hbox{}\qquad\qquad $t = \textit{Lam}~(\textit{Lam}~(\textit{Var}~1))$ \\[2\smallskipamount] |
|
1111 |
Total time: $4679$ ms. |
|
1112 |
\postw |
|
1113 |
||
1114 |
Using \textit{eval}, we find out that $\textit{subst}~\sigma~t = |
|
1115 |
\textit{Lam}~(\textit{Lam}~(\textit{Var}~0))$. Using the traditional |
|
1116 |
$\lambda$-term notation, $t$~is |
|
1117 |
$\lambda x\, y.\> x$ whereas $\textit{subst}~\sigma~t$ is $\lambda x\, y.\> y$. |
|
1118 |
The bug is in \textit{subst}: The $\textit{lift}~(\sigma~m)~1$ call should be |
|
1119 |
replaced with $\textit{lift}~(\sigma~m)~0$. |
|
1120 |
||
1121 |
An interesting aspect of Nitpick's verbose output is that it assigned inceasing |
|
1122 |
cardinalities from 1 to 8 to the type $\textit{nat} \Rightarrow \textit{tm}$. |
|
1123 |
For the formula of interest, knowing 6 values of that type was enough to find |
|
1124 |
the counterexample. Without boxing, $46\,656$ ($= 6^6$) values must be |
|
1125 |
considered, a hopeless undertaking: |
|
1126 |
||
1127 |
\prew |
|
1128 |
\textbf{nitpick} [\textit{dont\_box}] \\[2\smallskipamount] |
|
1129 |
{\slshape Nitpick ran out of time after checking 4 of 8 scopes.} |
|
1130 |
\postw |
|
1131 |
||
1132 |
{\looseness=-1 |
|
1133 |
Boxing can be enabled or disabled globally or on a per-type basis using the |
|
1134 |
\textit{box} option. Moreover, setting the cardinality of a function or |
|
1135 |
product type implicitly enables boxing for that type. Nitpick usually performs |
|
1136 |
reasonable choices about which types should be boxed, but option tweaking |
|
1137 |
sometimes helps. |
|
1138 |
||
1139 |
} |
|
1140 |
||
1141 |
\subsection{Scope Monotonicity} |
|
1142 |
\label{scope-monotonicity} |
|
1143 |
||
1144 |
The \textit{card} option (together with \textit{iter}, \textit{bisim\_depth}, |
|
1145 |
and \textit{max}) controls which scopes are actually tested. In general, to |
|
1146 |
exhaust all models below a certain cardinality bound, the number of scopes that |
|
1147 |
Nitpick must consider increases exponentially with the number of type variables |
|
1148 |
(and \textbf{typedecl}'d types) occurring in the formula. Given the default |
|
1149 |
cardinality specification of 1--8, no fewer than $8^4 = 4096$ scopes must be |
|
1150 |
considered for a formula involving $'a$, $'b$, $'c$, and $'d$. |
|
1151 |
||
1152 |
Fortunately, many formulas exhibit a property called \textsl{scope |
|
1153 |
monotonicity}, meaning that if the formula is falsifiable for a given scope, |
|
1154 |
it is also falsifiable for all larger scopes \cite[p.~165]{jackson-2006}. |
|
1155 |
||
1156 |
Consider the formula |
|
1157 |
||
1158 |
\prew |
|
1159 |
\textbf{lemma}~``$\textit{length~xs} = \textit{length~ys} \,\Longrightarrow\, \textit{rev}~(\textit{zip~xs~ys}) = \textit{zip~xs}~(\textit{rev~ys})$'' |
|
1160 |
\postw |
|
1161 |
||
1162 |
where \textit{xs} is of type $'a~\textit{list}$ and \textit{ys} is of type |
|
1163 |
$'b~\textit{list}$. A priori, Nitpick would need to consider 512 scopes to |
|
1164 |
exhaust the specification \textit{card}~= 1--8. However, our intuition tells us |
|
1165 |
that any counterexample found with a small scope would still be a counterexample |
|
1166 |
in a larger scope---by simply ignoring the fresh $'a$ and $'b$ values provided |
|
1167 |
by the larger scope. Nitpick comes to the same conclusion after a careful |
|
1168 |
inspection of the formula and the relevant definitions: |
|
1169 |
||
1170 |
\prew |
|
1171 |
\textbf{nitpick}~[\textit{verbose}] \\[2\smallskipamount] |
|
1172 |
\slshape |
|
1173 |
The types ``\kern1pt$'a$'' and ``\kern1pt$'b$'' passed the monotonicity test. |
|
1174 |
Nitpick might be able to skip some scopes. |
|
1175 |
\\[2\smallskipamount] |
|
1176 |
Trying 8 scopes: \\ |
|
1177 |
\hbox{}\qquad \textit{card} $'a$~= 1, \textit{card} $'b$~= 1, |
|
1178 |
\textit{card} \textit{nat}~= 1, \textit{card} ``$('a \times {'}b)$ |
|
1179 |
\textit{list}''~= 1, \\ |
|
1180 |
\hbox{}\qquad\quad \textit{card} ``\kern1pt$'a$ \textit{list}''~= 1, and |
|
1181 |
\textit{card} ``\kern1pt$'b$ \textit{list}''~= 1. \\ |
|
1182 |
\hbox{}\qquad \textit{card} $'a$~= 2, \textit{card} $'b$~= 2, |
|
1183 |
\textit{card} \textit{nat}~= 2, \textit{card} ``$('a \times {'}b)$ |
|
1184 |
\textit{list}''~= 2, \\ |
|
1185 |
\hbox{}\qquad\quad \textit{card} ``\kern1pt$'a$ \textit{list}''~= 2, and |
|
1186 |
\textit{card} ``\kern1pt$'b$ \textit{list}''~= 2. \\ |
|
1187 |
\hbox{}\qquad $\qquad\vdots$ \\[.5\smallskipamount] |
|
1188 |
\hbox{}\qquad \textit{card} $'a$~= 8, \textit{card} $'b$~= 8, |
|
1189 |
\textit{card} \textit{nat}~= 8, \textit{card} ``$('a \times {'}b)$ |
|
1190 |
\textit{list}''~= 8, \\ |
|
1191 |
\hbox{}\qquad\quad \textit{card} ``\kern1pt$'a$ \textit{list}''~= 8, and |
|
1192 |
\textit{card} ``\kern1pt$'b$ \textit{list}''~= 8. |
|
1193 |
\\[2\smallskipamount] |
|
1194 |
Nitpick found a counterexample for |
|
1195 |
\textit{card} $'a$~= 5, \textit{card} $'b$~= 5, |
|
1196 |
\textit{card} \textit{nat}~= 5, \textit{card} ``$('a \times {'}b)$ |
|
1197 |
\textit{list}''~= 5, \textit{card} ``\kern1pt$'a$ \textit{list}''~= 5, and |
|
1198 |
\textit{card} ``\kern1pt$'b$ \textit{list}''~= 5: |
|
1199 |
\\[2\smallskipamount] |
|
1200 |
\hbox{}\qquad Free variables: \nopagebreak \\ |
|
1201 |
\hbox{}\qquad\qquad $\textit{xs} = [a_4, a_5]$ \\ |
|
1202 |
\hbox{}\qquad\qquad $\textit{ys} = [b_3, b_3]$ \\[2\smallskipamount] |
|
1203 |
Total time: 1636 ms. |
|
1204 |
\postw |
|
1205 |
||
1206 |
In theory, it should be sufficient to test a single scope: |
|
1207 |
||
1208 |
\prew |
|
1209 |
\textbf{nitpick}~[\textit{card}~= 8] |
|
1210 |
\postw |
|
1211 |
||
1212 |
However, this is often less efficient in practice and may lead to overly complex |
|
1213 |
counterexamples. |
|
1214 |
||
1215 |
If the monotonicity check fails but we believe that the formula is monotonic (or |
|
1216 |
we don't mind missing some counterexamples), we can pass the |
|
1217 |
\textit{mono} option. To convince yourself that this option is risky, |
|
1218 |
simply consider this example from \S\ref{skolemization}: |
|
1219 |
||
1220 |
\prew |
|
1221 |
\textbf{lemma} ``$\exists g.\; \forall x\Colon 'b.~g~(f~x) = x |
|
1222 |
\,\Longrightarrow\, \forall y\Colon {'}a.\; \exists x.~y = f~x$'' \\ |
|
1223 |
\textbf{nitpick} [\textit{mono}] \\[2\smallskipamount] |
|
1224 |
{\slshape Nitpick found no counterexample.} \\[2\smallskipamount] |
|
1225 |
\textbf{nitpick} \\[2\smallskipamount] |
|
1226 |
\slshape |
|
1227 |
Nitpick found a counterexample for \textit{card} $'a$~= 2 and \textit{card} $'b$~=~1: \\ |
|
1228 |
\hbox{}\qquad $\vdots$ |
|
1229 |
\postw |
|
1230 |
||
1231 |
(It turns out the formula holds if and only if $\textit{card}~'a \le |
|
1232 |
\textit{card}~'b$.) Although this is rarely advisable, the automatic |
|
1233 |
monotonicity checks can be disabled by passing \textit{non\_mono} |
|
1234 |
(\S\ref{optimizations}). |
|
1235 |
||
1236 |
As insinuated in \S\ref{natural-numbers-and-integers} and |
|
1237 |
\S\ref{inductive-datatypes}, \textit{nat}, \textit{int}, and inductive datatypes |
|
1238 |
are normally monotonic and treated as such. The same is true for record types, |
|
1239 |
\textit{rat}, \textit{real}, and some \textbf{typedef}'d types. Thus, given the |
|
1240 |
cardinality specification 1--8, a formula involving \textit{nat}, \textit{int}, |
|
1241 |
\textit{int~list}, \textit{rat}, and \textit{rat~list} will lead Nitpick to |
|
1242 |
consider only 8~scopes instead of $32\,768$. |
|
1243 |
||
34982
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1244 |
\subsection{Inductive Properties} |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1245 |
\label{inductive-properties} |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1246 |
|
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1247 |
Inductive properties are a particular pain to prove, because the failure to |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1248 |
establish an induction step can mean several things: |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1249 |
% |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1250 |
\begin{enumerate} |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1251 |
\item The property is invalid. |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1252 |
\item The property is valid but is too weak to support the induction step. |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1253 |
\item The property is valid and strong enough; it's just that we haven't found |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1254 |
the proof yet. |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1255 |
\end{enumerate} |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1256 |
% |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1257 |
Depending on which scenario applies, we would take the appropriate course of |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1258 |
action: |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1259 |
% |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1260 |
\begin{enumerate} |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1261 |
\item Repair the statement of the property so that it becomes valid. |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1262 |
\item Generalize the property and/or prove auxiliary properties. |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1263 |
\item Work harder on a proof. |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1264 |
\end{enumerate} |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1265 |
% |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1266 |
How can we distinguish between the three scenarios? Nitpick's normal mode of |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1267 |
operation can often detect scenario 1, and Isabelle's automatic tactics help with |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1268 |
scenario 3. Using appropriate techniques, it is also often possible to use |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1269 |
Nitpick to identify scenario 2. Consider the following transition system, |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1270 |
in which natural numbers represent states: |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1271 |
|
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1272 |
\prew |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1273 |
\textbf{inductive\_set}~\textit{reach}~\textbf{where} \\ |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1274 |
``$(4\Colon\textit{nat}) \in \textit{reach\/}$'' $\mid$ \\ |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1275 |
``$\lbrakk n < 4;\> n \in \textit{reach\/}\rbrakk \,\Longrightarrow\, 3 * n + 1 \in \textit{reach\/}$'' $\mid$ \\ |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1276 |
``$n \in \textit{reach} \,\Longrightarrow n + 2 \in \textit{reach\/}$'' |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1277 |
\postw |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1278 |
|
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1279 |
We will try to prove that only even numbers are reachable: |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1280 |
|
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1281 |
\prew |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1282 |
\textbf{lemma}~``$n \in \textit{reach} \,\Longrightarrow\, 2~\textrm{dvd}~n$'' |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1283 |
\postw |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1284 |
|
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1285 |
Does this property hold? Nitpick cannot find a counterexample within 30 seconds, |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1286 |
so let's attempt a proof by induction: |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1287 |
|
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1288 |
\prew |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1289 |
\textbf{apply}~(\textit{induct~set}{:}~\textit{reach\/}) \\ |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1290 |
\textbf{apply}~\textit{auto} |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1291 |
\postw |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1292 |
|
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1293 |
This leaves us in the following proof state: |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1294 |
|
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1295 |
\prew |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1296 |
{\slshape goal (2 subgoals): \\ |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1297 |
\phantom{0}1. ${\bigwedge}n.\;\, \lbrakk n \in \textit{reach\/};\, n < 4;\, 2~\textsl{dvd}~n\rbrakk \,\Longrightarrow\, 2~\textsl{dvd}~\textit{Suc}~(3 * n)$ \\ |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1298 |
\phantom{0}2. ${\bigwedge}n.\;\, \lbrakk n \in \textit{reach\/};\, 2~\textsl{dvd}~n\rbrakk \,\Longrightarrow\, 2~\textsl{dvd}~\textit{Suc}~(\textit{Suc}~n)$ |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1299 |
} |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1300 |
\postw |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1301 |
|
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1302 |
If we run Nitpick on the first subgoal, it still won't find any |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1303 |
counterexample; and yet, \textit{auto} fails to go further, and \textit{arith} |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1304 |
is helpless. However, notice the $n \in \textit{reach}$ assumption, which |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1305 |
strengthens the induction hypothesis but is not immediately usable in the proof. |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1306 |
If we remove it and invoke Nitpick, this time we get a counterexample: |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1307 |
|
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1308 |
\prew |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1309 |
\textbf{apply}~(\textit{thin\_tac}~``$n \in \textit{reach\/}$'') \\ |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1310 |
\textbf{nitpick} \\[2\smallskipamount] |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1311 |
\slshape Nitpick found a counterexample: \\[2\smallskipamount] |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1312 |
\hbox{}\qquad Skolem constant: \nopagebreak \\ |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1313 |
\hbox{}\qquad\qquad $n = 0$ |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1314 |
\postw |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1315 |
|
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1316 |
Indeed, 0 < 4, 2 divides 0, but 2 does not divide 1. We can use this information |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1317 |
to strength the lemma: |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1318 |
|
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1319 |
\prew |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1320 |
\textbf{lemma}~``$n \in \textit{reach} \,\Longrightarrow\, 2~\textrm{dvd}~n \mathrel{\lor} n \not= 0$'' |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1321 |
\postw |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1322 |
|
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1323 |
Unfortunately, the proof by induction still gets stuck, except that Nitpick now |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1324 |
finds the counterexample $n = 2$. We generalize the lemma further to |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1325 |
|
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1326 |
\prew |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1327 |
\textbf{lemma}~``$n \in \textit{reach} \,\Longrightarrow\, 2~\textrm{dvd}~n \mathrel{\lor} n \ge 4$'' |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1328 |
\postw |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1329 |
|
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1330 |
and this time \textit{arith} can finish off the subgoals. |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1331 |
|
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1332 |
A similar technique can be employed for structural induction. The |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1333 |
following mini-formalization of full binary trees will serve as illustration: |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1334 |
|
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1335 |
\prew |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1336 |
\textbf{datatype} $\kern1pt'a$~\textit{bin\_tree} = $\textit{Leaf}~{\kern1pt'a}$ $\mid$ $\textit{Branch}$ ``\kern1pt$'a$ \textit{bin\_tree}'' ``\kern1pt$'a$ \textit{bin\_tree}'' \\[2\smallskipamount] |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1337 |
\textbf{primrec}~\textit{labels}~\textbf{where} \\ |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1338 |
``$\textit{labels}~(\textit{Leaf}~a) = \{a\}$'' $\mid$ \\ |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1339 |
``$\textit{labels}~(\textit{Branch}~t~u) = \textit{labels}~t \mathrel{\cup} \textit{labels}~u$'' \\[2\smallskipamount] |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1340 |
\textbf{primrec}~\textit{swap}~\textbf{where} \\ |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1341 |
``$\textit{swap}~(\textit{Leaf}~c)~a~b =$ \\ |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1342 |
\phantom{``}$(\textrm{if}~c = a~\textrm{then}~\textit{Leaf}~b~\textrm{else~if}~c = b~\textrm{then}~\textit{Leaf}~a~\textrm{else}~\textit{Leaf}~c)$'' $\mid$ \\ |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1343 |
``$\textit{swap}~(\textit{Branch}~t~u)~a~b = \textit{Branch}~(\textit{swap}~t~a~b)~(\textit{swap}~u~a~b)$'' |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1344 |
\postw |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1345 |
|
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1346 |
The \textit{labels} function returns the set of labels occurring on leaves of a |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1347 |
tree, and \textit{swap} exchanges two labels. Intuitively, if two distinct |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1348 |
labels $a$ and $b$ occur in a tree $t$, they should also occur in the tree |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1349 |
obtained by swapping $a$ and $b$: |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1350 |
|
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1351 |
\prew |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1352 |
\textbf{lemma} $``\lbrakk a \in \textit{labels}~t;\, b \in \textit{labels}~t;\, a \not= b\rbrakk {}$ \\ |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1353 |
\phantom{\textbf{lemma} ``}$\,{\Longrightarrow}{\;\,} \textit{labels}~(\textit{swap}~t~a~b) = \textit{labels}~t$'' |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1354 |
\postw |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1355 |
|
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1356 |
Nitpick can't find any counterexample, so we proceed with induction |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1357 |
(this time favoring a more structured style): |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1358 |
|
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1359 |
\prew |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1360 |
\textbf{proof}~(\textit{induct}~$t$) \\ |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1361 |
\hbox{}\quad \textbf{case}~\textit{Leaf}~\textbf{thus}~\textit{?case}~\textbf{by}~\textit{simp} \\ |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1362 |
\textbf{next} \\ |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1363 |
\hbox{}\quad \textbf{case}~$(\textit{Branch}~t~u)$~\textbf{thus} \textit{?case} |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1364 |
\postw |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1365 |
|
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1366 |
Nitpick can't find any counterexample at this point either, but it makes the |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1367 |
following suggestion: |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1368 |
|
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1369 |
\prew |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1370 |
\slshape |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1371 |
Hint: To check that the induction hypothesis is general enough, try the following command: |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1372 |
\textbf{nitpick}~[\textit{non\_std} ``${\kern1pt'a}~\textit{bin\_tree}$'', \textit{show\_consts}]. |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1373 |
\postw |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1374 |
|
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1375 |
If we follow the hint, we get a ``nonstandard'' counterexample for the step: |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1376 |
|
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1377 |
\prew |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1378 |
\slshape Nitpick found a nonstandard counterexample for \textit{card} $'a$ = 4: \\[2\smallskipamount] |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1379 |
\hbox{}\qquad Free variables: \nopagebreak \\ |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1380 |
\hbox{}\qquad\qquad $a = a_4$ \\ |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1381 |
\hbox{}\qquad\qquad $b = a_3$ \\ |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1382 |
\hbox{}\qquad\qquad $t = \xi_3$ \\ |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1383 |
\hbox{}\qquad\qquad $u = \xi_4$ \\ |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1384 |
\hbox{}\qquad {\slshape Constants:} \nopagebreak \\ |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1385 |
\hbox{}\qquad\qquad $\textit{labels} = \undef |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1386 |
(\!\begin{aligned}[t]% |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1387 |
& \xi_3 := \{a_4\},\> \xi_4 := \{a_1, a_3\}, \\[-2pt] %% TYPESETTING |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1388 |
& \textit{Branch}~\xi_3~\xi_3 := \{a_4\}, \\[-2pt] |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1389 |
& \textit{Branch}~\xi_3~\xi_4 := \{a_1, a_3, a_4\})\end{aligned}$ \\ |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1390 |
\hbox{}\qquad\qquad $\lambda x_1.\> \textit{swap}~x_1~a~b = \undef |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1391 |
(\!\begin{aligned}[t]% |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1392 |
& \xi_3 := \xi_3,\> \xi_4 := \xi_3, \\[-2pt] |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1393 |
& \textit{Branch}~\xi_3~\xi_3 := \textit{Branch}~\xi_3~\xi_3, \\[-2pt] |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1394 |
& \textit{Branch}~\xi_4~\xi_3 := \textit{Branch}~\xi_3~\xi_3)\end{aligned}$ \\[2\smallskipamount] |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1395 |
The existence of a nonstandard model suggests that the induction hypothesis is not general enough or perhaps |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1396 |
even wrong. See the ``Inductive Properties'' section of the Nitpick manual for details (``\textit{isabelle doc nitpick}''). |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1397 |
\postw |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1398 |
|
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1399 |
Reading the Nitpick manual is a most excellent idea. |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1400 |
But what's going on? The \textit{non\_std} ``${\kern1pt'a}~\textit{bin\_tree}$'' |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1401 |
option told the tool to look for nonstandard models of binary trees, which |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1402 |
means that new ``nonstandard'' trees $\xi_1, \xi_2, \ldots$, are now allowed in |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1403 |
addition to the standard trees generated by the \textit{Leaf} and |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1404 |
\textit{Branch} constructors.% |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1405 |
\footnote{Notice the similarity between allowing nonstandard trees here and |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1406 |
allowing unreachable states in the preceding example (by removing the ``$n \in |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1407 |
\textit{reach\/}$'' assumption). In both cases, we effectively enlarge the |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1408 |
set of objects over which the induction is performed while doing the step |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1409 |
so as to test the induction hypothesis's strength.} |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1410 |
The new trees are so nonstandard that we know nothing about them, except what |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1411 |
the induction hypothesis states and what can be proved about all trees without |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1412 |
relying on induction or case distinction. The key observation is, |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1413 |
% |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1414 |
\begin{quote} |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1415 |
\textsl{If the induction |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1416 |
hypothesis is strong enough, the induction step will hold even for nonstandard |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1417 |
objects, and Nitpick won't find any nonstandard counterexample.} |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1418 |
\end{quote} |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1419 |
% |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1420 |
But here, Nitpick did find some nonstandard trees $t = \xi_3$ |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1421 |
and $u = \xi_4$ such that $a \in \textit{labels}~t$, $b \notin |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1422 |
\textit{labels}~t$, $a \notin \textit{labels}~u$, and $b \in \textit{labels}~u$. |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1423 |
Because neither tree contains both $a$ and $b$, the induction hypothesis tells |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1424 |
us nothing about the labels of $\textit{swap}~t~a~b$ and $\textit{swap}~u~a~b$, |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1425 |
and as a result we know nothing about the labels of the tree |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1426 |
$\textit{swap}~(\textit{Branch}~t~u)~a~b$, which by definition equals |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1427 |
$\textit{Branch}$ $(\textit{swap}~t~a~b)$ $(\textit{swap}~u~a~b)$, whose |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1428 |
labels are $\textit{labels}$ $(\textit{swap}~t~a~b) \mathrel{\cup} |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1429 |
\textit{labels}$ $(\textit{swap}~u~a~b)$. |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1430 |
|
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1431 |
The solution is to ensure that we always know what the labels of the subtrees |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1432 |
are in the inductive step, by covering the cases where $a$ and/or~$b$ is not in |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1433 |
$t$ in the statement of the lemma: |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1434 |
|
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1435 |
\prew |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1436 |
\textbf{lemma} ``$\textit{labels}~(\textit{swap}~t~a~b) = {}$ \\ |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1437 |
\phantom{\textbf{lemma} ``}$(\textrm{if}~a \in \textit{labels}~t~\textrm{then}$ \nopagebreak \\ |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1438 |
\phantom{\textbf{lemma} ``(\quad}$\textrm{if}~b \in \textit{labels}~t~\textrm{then}~\textit{labels}~t~\textrm{else}~(\textit{labels}~t - \{a\}) \mathrel{\cup} \{b\}$ \\ |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1439 |
\phantom{\textbf{lemma} ``(}$\textrm{else}$ \\ |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1440 |
\phantom{\textbf{lemma} ``(\quad}$\textrm{if}~b \in \textit{labels}~t~\textrm{then}~(\textit{labels}~t - \{b\}) \mathrel{\cup} \{a\}~\textrm{else}~\textit{labels}~t)$'' |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and did other fixes to Nitpick
blanchet
parents:
34126
diff
changeset
|
1441 |
\postw |
7b8c366e34a2
added support for nonstandard models to Nitpick (based on an idea by Koen Claessen) and d |