author | bulwahn |
Tue, 08 Nov 2011 10:48:58 +0100 | |
changeset 45409 | 5abb0e738b00 |
parent 45380 | c33a37ccd187 |
child 45516 | b2c8422833da |
permissions | -rw-r--r-- |
\documentclass[a4paper,12pt]{article}
\usepackage[T1]{fontenc}
\usepackage{amsmath}
\usepackage{amssymb}
\usepackage[english,french]{babel}
\usepackage{color}
\usepackage{footmisc}
\usepackage{graphicx}
%\usepackage{mathpazo}
\usepackage{multicol}
\usepackage{stmaryrd}
%\usepackage[scaled=.85]{beramono}
\usepackage{../../lib/texinputs/isabelle,../iman,../pdfsetup}

\def\qty#1{\ensuremath{\left<\mathit{#1\/}\right>}}
\def\qtybf#1{$\mathbf{\left<\textbf{\textit{#1\/}}\right>}$}

%\oddsidemargin=4.6mm
%\evensidemargin=4.6mm
%\textwidth=150mm
%\topmargin=4.6mm
%\headheight=0mm
%\headsep=0mm
%\textheight=234mm

\def\Colon{\mathord{:\mkern-1.5mu:}}
%\def\lbrakk{\mathopen{\lbrack\mkern-3.25mu\lbrack}}
%\def\rbrakk{\mathclose{\rbrack\mkern-3.255mu\rbrack}}
\def\lparr{\mathopen{(\mkern-4mu\mid}}
\def\rparr{\mathclose{\mid\mkern-4mu)}}

\def\unk{{?}}
\def\undef{(\lambda x.\; \unk)}
%\def\unr{\textit{others}}
\def\unr{\ldots}
\def\Abs#1{\hbox{\rm{\flqq}}{\,#1\,}\hbox{\rm{\frqq}}}
\def\Q{{\smash{\lower.2ex\hbox{$\scriptstyle?$}}}}

\urlstyle{tt}

\begin{document}

\selectlanguage{english}

\title{\includegraphics[scale=0.5]{isabelle_sledgehammer} \\[4ex]
Hammering Away \\[\smallskipamount]
\Large A User's Guide to Sledgehammer for Isabelle/HOL}
\author{\hbox{} \\
Jasmin Christian Blanchette \\
{\normalsize Institut f\"ur Informatik, Technische Universit\"at M\"unchen} \\[4\smallskipamount]
{\normalsize with contributions from} \\[4\smallskipamount]
Lawrence C. Paulson \\
{\normalsize Computer Laboratory, University of Cambridge} \\
\hbox{}}

\maketitle

\tableofcontents

\setlength{\parskip}{.7em plus .2em minus .1em}
\setlength{\parindent}{0pt}
\setlength{\abovedisplayskip}{\parskip}
\setlength{\abovedisplayshortskip}{.9\parskip}
\setlength{\belowdisplayskip}{\parskip}
\setlength{\belowdisplayshortskip}{.9\parskip}

% General-purpose enum environment with correct spacing
\newenvironment{enum}%
{\begin{list}{}{%
\setlength{\topsep}{.1\parskip}%
\setlength{\partopsep}{.1\parskip}%
\setlength{\itemsep}{\parskip}%
\advance\itemsep by-\parsep}}
{\end{list}}

\def\pre{\begingroup\vskip0pt plus1ex\advance\leftskip by\leftmargin
\advance\rightskip by\leftmargin}
\def\post{\vskip0pt plus1ex\endgroup}

\def\prew{\pre\advance\rightskip by-\leftmargin}
\def\postw{\post}

\section{Introduction}
\label{introduction}

Sledgehammer is a tool that applies automatic theorem provers (ATPs)
and satisfiability-modulo-theories (SMT) solvers to the current goal. The
supported ATPs are E \cite{schulz-2002}, E-SInE \cite{sine}, E-ToFoF
\cite{tofof}, LEO-II \cite{leo2}, Satallax \cite{satallax}, SNARK \cite{snark},
SPASS \cite{weidenbach-et-al-2009}, Vampire \cite{riazanov-voronkov-2002}, and
Waldmeister \cite{waldmeister}. The ATPs are run either locally or remotely via
the System\-On\-TPTP web service \cite{sutcliffe-2000}. In addition to the ATPs,
the SMT solver Z3 \cite{z3} is used by default, and you can tell Sledgehammer
to try CVC3 \cite{cvc3} and Yices \cite{yices} as well; these are run either
locally or on a server at the TU M\"unchen.

The problem passed to the automatic provers consists of your current goal
together with a heuristic selection of hundreds of facts (theorems) from the
current theory context, filtered by relevance. Because jobs are run in the
background, you can continue to work on your proof by other means. Provers can
be run in parallel. Any reply (which may arrive half a minute later) will appear
in the Proof General response buffer.

The result of a successful proof search is some source text that usually (but
not always) reconstructs the proof within Isabelle. For ATPs, the reconstructed
proof relies on the general-purpose \textit{metis} proof method, which
integrates the Metis ATP in Isabelle/HOL with explicit inferences going through
the kernel. Thus its results are correct by construction.

In this manual, we will explicitly invoke the \textbf{sledgehammer} command.
Sledgehammer also provides an automatic mode that can be enabled via the ``Auto
Sledgehammer'' option in Proof General's ``Isabelle'' menu. In this mode,
Sledgehammer is run on every newly entered theorem. The time limit for Auto
Sledgehammer and other automatic tools can be set using the ``Auto Tools Time
Limit'' option.

\newbox\boxA
\setbox\boxA=\hbox{\texttt{nospam}}

\newcommand\authoremail{\texttt{blan{\color{white}nospam}\kern-\wd\boxA{}chette@\allowbreak
in.\allowbreak tum.\allowbreak de}}

To run Sledgehammer, you must make sure that the theory \textit{Sledgehammer} is
imported. This is rarely a problem in practice, since it is part of
\textit{Main}. Examples of Sledgehammer use can be found in Isabelle's
\texttt{src/HOL/Metis\_Examples} directory.
Comments and bug reports concerning Sledgehammer or this manual should be
directed to the author at \authoremail.

\vskip2.5\smallskipamount

%\textbf{Acknowledgment.} The author would like to thank Mark Summerfield for
%suggesting several textual improvements.

\section{Installation}
\label{installation}

Sledgehammer is part of Isabelle, so you don't need to install it. However, it
relies on third-party automatic theorem provers (ATPs) and SMT solvers.

\subsection{Installing ATPs}

Currently, E, LEO-II, Satallax, SPASS, and Vampire can be run locally; in
addition, E, E-SInE, E-ToFoF, iProver, iProver-Eq, LEO-II, Satallax, SNARK,
Waldmeister, and Vampire are available remotely via System\-On\-TPTP
\cite{sutcliffe-2000}. If you want better performance, you should at least
install E and SPASS locally.

There are three main ways to install ATPs on your machine:

\begin{enum}
\item[$\bullet$] If you installed an official Isabelle package with everything
inside, it should already include properly set up executables for E and SPASS,
ready to use.%
\footnote{Vampire's license prevents us from doing the same for this otherwise
wonderful tool.}

\item[$\bullet$] Alternatively, you can download the Isabelle-aware E and SPASS
binary packages from Isabelle's download page. Extract the archives, then add a
line to your \texttt{\$ISABELLE\_HOME\_USER/etc/components}%
\footnote{The variable \texttt{\$ISABELLE\_HOME\_USER} is set by Isabelle at
startup. Its value can be retrieved by invoking \texttt{isabelle}
\texttt{getenv} \texttt{ISABELLE\_HOME\_USER} on the command line.}
file with the absolute
path to E or SPASS. For example, if the \texttt{components} file does not exist
yet and you extracted SPASS to \texttt{/usr/local/spass-3.7}, create the
\texttt{components} file with the single line

\prew
\texttt{/usr/local/spass-3.7}
\postw

in it.

\item[$\bullet$] If you prefer to build E or SPASS yourself, or obtained a
Vampire executable from somewhere (e.g., \url{http://www.vprover.org/}),
set the environment variable \texttt{E\_HOME}, \texttt{SPASS\_HOME}, or
\texttt{VAMPIRE\_HOME} to the directory that contains the \texttt{eproof},
\texttt{SPASS}, or \texttt{vampire} executable. Sledgehammer has been tested
with E 1.0 to 1.4, SPASS 3.5 and 3.7, and Vampire 0.6, 1.0, and 1.8%
\footnote{Following the rewrite of Vampire, the counter for version numbers was
reset to 0; hence the (new) Vampire versions 0.6, 1.0, and 1.8 are more recent
than, say, Vampire 9.0 or 11.5.}%
. Since the ATPs' output formats are neither documented nor stable, other
versions of the ATPs might or might not work well with Sledgehammer. Ideally,
also set \texttt{E\_VERSION}, \texttt{SPASS\_VERSION}, or
\texttt{VAMPIRE\_VERSION} to the ATP's version number (e.g., ``1.4'').
\end{enum}
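
As an illustration of the third option, the variables could be set as follows.
This is only a sketch: the directory \texttt{/usr/local/eprover-1.4} is a
hypothetical location that must be replaced by the directory actually
containing \texttt{eproof}, and putting the assignments in the
\texttt{\$ISABELLE\_HOME\_USER/etc/settings} file (rather than in the
environment from which Isabelle is launched) is an assumption.

\prew
\texttt{E\_HOME=/usr/local/eprover-1.4} \\
\texttt{E\_VERSION=1.4}
\postw

The value seen by Isabelle can then be inspected by invoking \texttt{isabelle}
\texttt{getenv} \texttt{E\_HOME} on the command line.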

To check whether E and SPASS are successfully installed, follow the example in
\S\ref{first-steps}. If the remote versions of E and SPASS are used (identified
by the prefix ``\emph{remote\_}''), or if the local versions fail to solve the
easy goal presented there, this is a sign that something is wrong with your
installation.

Remote ATP invocation via the System\-On\-TPTP web service requires Perl with the
World Wide Web Library (\texttt{libwww-perl}) installed. If you must use a proxy
server to access the Internet, set the \texttt{http\_proxy} environment variable
to the proxy, either in the environment in which Isabelle is launched or in your
\texttt{\$ISABELLE\_HOME\_USER/etc/settings} file. Here are a few examples:

\prew
\texttt{http\_proxy=http://proxy.example.org} \\
\texttt{http\_proxy=http://proxy.example.org:8080} \\
\texttt{http\_proxy=http://joeblow:pAsSwRd@proxy.example.org}
\postw

\subsection{Installing SMT Solvers}

CVC3, Yices, and Z3 can be run locally or (for CVC3 and Z3) remotely on a TU
M\"unchen server. If you want better performance and the ability to replay
proofs that rely on the \emph{smt} proof method, you should at least run Z3
locally.

There are two main ways of installing SMT solvers locally:

\begin{enum}
\item[$\bullet$] If you installed an official Isabelle package with everything
inside, it should already include properly set up executables for CVC3 and Z3,
ready to use.%
\footnote{Yices's license prevents us from doing the same for this otherwise
wonderful tool.}
For Z3, you additionally need to set the environment variable
\texttt{Z3\_NON\_COMMERCIAL} to ``yes'' to confirm that you are a noncommercial
user.

\item[$\bullet$] Otherwise, follow the instructions documented in the \emph{SMT}
theory (\texttt{\$ISABELLE\_HOME/src/HOL/SMT.thy}).
\end{enum}
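
For the Z3 confirmation mentioned in the first item, a minimal sketch is the
following line, placed either in the environment from which Isabelle is
launched or, as assumed here, in the
\texttt{\$ISABELLE\_HOME\_USER/etc/settings} file:

\prew
\texttt{Z3\_NON\_COMMERCIAL=yes}
\postw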
|
230 |
||
36926 | 231 |
\section{First Steps} |
232 |
\label{first-steps} |
|
233 |
||
234 |
To illustrate Sledgehammer in context, let us start a theory file and |
|
235 |
attempt to prove a simple lemma: |
|
236 |
||
237 |
\prew |
|
238 |
\textbf{theory}~\textit{Scratch} \\ |
|
239 |
\textbf{imports}~\textit{Main} \\ |
|
240 |
\textbf{begin} \\[2\smallskipamount] |
|
241 |
% |
|
42945 | 242 |
\textbf{lemma} ``$[a] = [b] \,\Longrightarrow\, a = b$'' \\ |
36926 | 243 |
\textbf{sledgehammer} |
244 |
\postw |
|
245 |
||
37517
19ba7ec5f1e3
steal some of http://isabelle.in.tum.de/sledgehammer.html and add it to the docs
blanchet
parents:
37498
diff
changeset
|
246 |
Instead of issuing the \textbf{sledgehammer} command, you can also find |
19ba7ec5f1e3
steal some of http://isabelle.in.tum.de/sledgehammer.html and add it to the docs
blanchet
parents:
37498
diff
changeset
|
247 |
Sledgehammer in the ``Commands'' submenu of the ``Isabelle'' menu in Proof |
19ba7ec5f1e3
steal some of http://isabelle.in.tum.de/sledgehammer.html and add it to the docs
blanchet
parents:
37498
diff
changeset
|
248 |
General or press the Emacs key sequence C-c C-a C-s. |
19ba7ec5f1e3
steal some of http://isabelle.in.tum.de/sledgehammer.html and add it to the docs
blanchet
parents:
37498
diff
changeset
|
249 |
Either way, Sledgehammer produces the following output after a few seconds: |
36926 | 250 |
|
\prew
\slshape
Sledgehammer: ``\textit{e}'' on goal \\
$[a] = [b] \,\Longrightarrow\, a = b$ \\
Try this: \textbf{by} (\textit{metis last\_ConsL}) (64 ms). \\[3\smallskipamount]
%
Sledgehammer: ``\textit{vampire}'' on goal \\
$[a] = [b] \,\Longrightarrow\, a = b$ \\
Try this: \textbf{by} (\textit{metis hd.simps}) (14 ms). \\[3\smallskipamount]
%
Sledgehammer: ``\textit{spass}'' on goal \\
$[a] = [b] \,\Longrightarrow\, a = b$ \\
Try this: \textbf{by} (\textit{metis list.inject}) (17 ms). \\[3\smallskipamount]
%
Sledgehammer: ``\textit{remote\_waldmeister}'' on goal \\
$[a] = [b] \,\Longrightarrow\, a = b$ \\
Try this: \textbf{by} (\textit{metis hd.simps}) (15 ms). \\[3\smallskipamount]
%
Sledgehammer: ``\textit{remote\_e\_sine}'' on goal \\
$[a] = [b] \,\Longrightarrow\, a = b$ \\
Try this: \textbf{by} (\textit{metis hd.simps}) (18 ms). \\[3\smallskipamount]
%
Sledgehammer: ``\textit{remote\_z3}'' on goal \\
$[a] = [b] \,\Longrightarrow\, a = b$ \\
Try this: \textbf{by} (\textit{metis list.inject}) (20 ms).
\postw

Sledgehammer ran E, E-SInE, SPASS, Vampire, Waldmeister, and Z3 in parallel.
Depending on which provers are installed and how many processor cores are
available, some of the provers might be missing or present with a
\textit{remote\_} prefix. Waldmeister is run only for unit equational problems,
where the goal's conclusion is a (universally quantified) equation.

For each successful prover, Sledgehammer gives a one-liner proof that uses
the \textit{metis} or \textit{smt} proof method. Approximate timings are shown
in parentheses, indicating how fast the call is. You can click the proof to
insert it into the theory text.
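
For instance, assuming the suggestion attributed to SPASS in the output above
is the one inserted, the theory text would then read as follows (any of the
other suggestions could be used in the same way):

\prew
\textbf{lemma} ``$[a] = [b] \,\Longrightarrow\, a = b$'' \\
\textbf{by} (\textit{metis list.inject})
\postw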

In addition, you can ask Sledgehammer for an Isar text proof by passing the
\textit{isar\_proof} option (\S\ref{output-format}):

\prew
\textbf{sledgehammer} [\textit{isar\_proof}]
\postw

When Isar proof construction is successful, it can yield proofs that are more
readable and also faster than the \textit{metis} or \textit{smt} one-liners.
This feature is experimental and is only available for ATPs.

\section{Hints}
\label{hints}

This section presents a few hints that should help you get the most out of
Sledgehammer. Frequently (and infrequently) asked questions are answered in
\S\ref{frequently-asked-questions}.

\newcommand\point[1]{\medskip\par{\sl\bfseries#1}\par\nopagebreak}

\point{Presimplify the goal}

For best results, first simplify your problem by calling \textit{auto} or at
least \textit{safe} followed by \textit{simp\_all}. The SMT solvers provide
arithmetic decision procedures, but the ATPs typically do not (or if they do,
Sledgehammer does not use them yet). Apart from Waldmeister, the ATPs are not
especially good at heavy rewriting, but because they regard equations as
undirected, they often prove theorems that require the reverse orientation of a
\textit{simp} rule. Higher-order problems can be tackled, but the success rate
is better for first-order problems. Hence, you may get better results if you
first simplify the problem to remove higher-order features.
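
As a sketch of this advice, the goal can be broken down and simplified before
Sledgehammer is invoked on whatever remains. The particular sequence of methods
is only an example: a method that fails to make progress can simply be dropped,
and if the methods already solve the goal, there is nothing left for
Sledgehammer to do.

\prew
\textbf{apply}~\textit{safe} \\
\textbf{apply}~\textit{simp\_all} \\
\textbf{sledgehammer}
\postw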

\point{Make sure at least E, SPASS, Vampire, and Z3 are installed}

Locally installed provers are faster and more reliable than those running on
servers. See \S\ref{installation} for details on how to install them.

\point{Familiarize yourself with the most important options}

Sledgehammer's options are fully documented in \S\ref{command-syntax}. Many of
the options are very specialized, but serious users of the tool should at least
familiarize themselves with the following ones:

\begin{enum}
\item[$\bullet$] \textbf{\textit{provers}} (\S\ref{mode-of-operation}) specifies
the automatic provers (ATPs and SMT solvers) that should be run whenever
Sledgehammer is invoked (e.g., ``\textit{provers}~= \textit{e spass
remote\_vampire}''). For convenience, you can omit ``\textit{provers}~=''
and simply write the prover names as a space-separated list (e.g., ``\textit{e
spass remote\_vampire}'').

\item[$\bullet$] \textbf{\textit{max\_relevant}} (\S\ref{relevance-filter})
specifies the maximum number of facts that should be passed to the provers. By
default, the value is prover-dependent but varies between about 150 and 1000. If
the provers time out, you can try lowering this value to, say, 100 or 50 and see
if that helps.

\item[$\bullet$] \textbf{\textit{isar\_proof}} (\S\ref{output-format}) specifies
that Isar proofs should be generated, instead of one-liner \textit{metis} or
\textit{smt} proofs. The length of the Isar proofs can be controlled by setting
\textit{isar\_shrink\_factor} (\S\ref{output-format}).

\item[$\bullet$] \textbf{\textit{timeout}} (\S\ref{timeouts}) controls the
provers' time limit. It is set to 30 seconds by default, but since Sledgehammer
runs asynchronously you should not hesitate to raise this limit to 60 or 120
seconds if you are the kind of user who can think clearly while ATPs are active.
\end{enum}
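
These options can be combined in a single invocation. The following sketch uses
illustrative values only, and the prover names assume that E and SPASS are
installed locally:

\prew
\textbf{sledgehammer} [\textit{provers}~= \textit{e spass},
\textit{max\_relevant}~= 50, \textit{timeout}~= 60]
\postw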

Options can be set globally using \textbf{sledgehammer\_params}
(\S\ref{command-syntax}). The command also prints the list of all available
options with their current values. Fact selection can be influenced by specifying
``$(\textit{add}{:}~\textit{my\_facts})$'' after the \textbf{sledgehammer} call
to ensure that certain facts are included, or simply ``$(\textit{my\_facts})$''
to force Sledgehammer to run only with $\textit{my\_facts}$.
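
For example, to make a particular choice of provers and a longer time limit the
default for subsequent invocations, one could write the following (the values
are illustrative only):

\prew
\textbf{sledgehammer\_params} [\textit{provers}~= \textit{e spass},
\textit{timeout}~= 60]
\postw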

\section{Frequently Asked Questions}
\label{frequently-asked-questions}

This section answers frequently (and infrequently) asked questions about
Sledgehammer. It is a good idea to skim over it now even if you don't have any
questions at this stage. And if you have any further questions not listed here,
send them to the author at \authoremail.

\point{Why does Metis fail to reconstruct the proof?}

There are many possible reasons. If Metis runs seemingly forever, that is a sign
that the proof is too difficult for it. Metis's search is complete, so it should
eventually find the proof, but that's little consolation. There are several
possible solutions:

\begin{enum}
\item[$\bullet$] Try the \textit{isar\_proof} option (\S\ref{output-format}) to
obtain a step-by-step Isar proof where each step is justified by \textit{metis}.
Since the steps are fairly small, \textit{metis} is more likely to be able to
replay them.

\item[$\bullet$] Try the \textit{smt} proof method instead of \textit{metis}. It
is usually stronger, but you need to either have Z3 available to replay the
proofs, trust the SMT solver, or use certificates. See the documentation in the
\emph{SMT} theory (\texttt{\$ISABELLE\_HOME/src/HOL/SMT.thy}) for details.

\item[$\bullet$] Try the \textit{blast} or \textit{auto} proof methods, passing
the necessary facts via \textbf{unfolding}, \textbf{using}, \textit{intro}{:},
\textit{elim}{:}, \textit{dest}{:}, or \textit{simp}{:}, as appropriate.
\end{enum}
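
As a sketch of the last two alternatives, a failing call such as \textbf{by}
(\textit{metis list.inject}) could be replaced by either of the following
lines. The fact \textit{list.inject} is borrowed from the example in
\S\ref{first-steps} purely for illustration; whether either variant succeeds
depends on the goal at hand.

\prew
\textbf{by} (\textit{smt list.inject}) \\
\textbf{by} (\textit{auto} \textit{simp}: \textit{list.inject})
\postw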

In some rare cases, \textit{metis} fails fairly quickly, and you get the error
message

\prew
\slshape
Proof reconstruction failed.
\postw

This message usually indicates that Sledgehammer found a type-incorrect proof.
This was a frequent issue with older versions of Sledgehammer, which did not
supply enough typing information to the ATPs by default. If you notice many
unsound proofs and are not using \textit{type\_enc} (\S\ref{problem-encoding}),
contact the author at \authoremail.

\point{How can I tell whether a generated proof is sound?}

First, if \textit{metis} can reconstruct it, the proof is sound (assuming
Isabelle's inference kernel is sound). If it fails or runs seemingly forever,
you can try

\prew
\textbf{apply}~\textbf{--} \\
\textbf{sledgehammer} [\textit{sound}] (\textit{metis\_facts})
\postw

where \textit{metis\_facts} is the list of facts appearing in the suggested
\textit{metis} call. The automatic provers should be able to re-find the proof
quickly if it is sound, and the \textit{sound} option (\S\ref{problem-encoding})
ensures that no unsound proofs are found.

\point{Which facts are passed to the automatic provers?}

The relevance filter assigns a score to every available fact (lemma, theorem,
definition, or axiom)\ based upon how many constants that fact shares with the
conjecture. This process iterates to include facts relevant to those just
accepted, but with a decay factor to ensure termination. The constants are
weighted to give unusual ones greater significance. The relevance filter copes
best when the conjecture contains some unusual constants; if all the constants
are common, it is unable to discriminate among the hundreds of facts that are
picked up. The relevance filter is also memoryless: It has no information about
how many times a particular fact has been used in a proof, and it cannot learn.

The number of facts included in a problem varies from prover to prover, since
some provers get overwhelmed more easily than others. You can show the number of
facts given using the \textit{verbose} option (\S\ref{output-format}) and the
actual facts using \textit{debug} (\S\ref{output-format}).
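
For example, to see how many facts are passed to each prover:

\prew
\textbf{sledgehammer} [\textit{verbose}]
\postw

Replacing \textit{verbose} by \textit{debug} shows the facts themselves.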

Sledgehammer is good at finding short proofs combining a handful of existing
lemmas. If you are looking for longer proofs, you must typically restrict the
number of facts, by setting the \textit{max\_relevant} option
(\S\ref{relevance-filter}) to, say, 25 or 50.

You can also influence which facts are actually selected in a number of ways. If
you simply want to ensure that a fact is included, you can specify it using the
``$(\textit{add}{:}~\textit{my\_facts})$'' syntax. For example:
%
\prew
\textbf{sledgehammer} (\textit{add}: \textit{hd.simps} \textit{tl.simps})
\postw
%
The specified facts then replace the least relevant facts that would otherwise be
included; the other selected facts remain the same.
If you want to direct the selection in a particular direction, you can specify
the facts via \textbf{using}:
%
\prew
\textbf{using} \textit{hd.simps} \textit{tl.simps} \\
\textbf{sledgehammer}
\postw
%
The facts are then more likely to be selected than otherwise, and if they are
selected at iteration $j$ they also influence which facts are selected at
iterations $j + 1$, $j + 2$, etc. To give them even more weight, try
%
\prew
\textbf{using} \textit{hd.simps} \textit{tl.simps} \\
\textbf{apply}~\textbf{--} \\
\textbf{sledgehammer}
\postw

\point{Why are the generated Isar proofs so ugly/detailed/broken?}

The current implementation is experimental and explodes exponentially in the
worst case. Work on a new implementation has begun. There is a large body of
research into transforming resolution proofs into natural deduction proofs (such
as Isar proofs), which we hope to leverage. In the meantime, a workaround is to
set the \textit{isar\_shrink\_factor} option (\S\ref{output-format}) to a larger
value or to try several provers and keep the nicest-looking proof.

\point{What are the \textit{full\_types} and \textit{no\_types} arguments to
Metis?}

The \textit{metis}~(\textit{full\_types}) proof method is the fully typed
version of Metis. It is somewhat slower than \textit{metis}, but the proof
search is fully typed, and it also includes more powerful rules such as the
axiom ``$x = \mathit{True} \mathrel{\lor} x = \mathit{False}$'' for reasoning in
higher-order places (e.g., in set comprehensions). The method kicks in
automatically as a fallback when \textit{metis} fails, and it is sometimes
generated by Sledgehammer instead of \textit{metis} if the proof obviously
requires type information or if \textit{metis} failed when Sledgehammer
preplayed the proof. (By default, Sledgehammer tries to run \textit{metis} with
various options for up to 4 seconds to ensure that the generated one-line proofs
actually work and to display timing information. This can be configured using
the \textit{preplay\_timeout} option (\S\ref{timeouts}).)

At the other end of the soundness spectrum, \textit{metis} (\textit{no\_types})
uses no type information at all during the proof search, which is more efficient
but often fails. Calls to \textit{metis} (\textit{no\_types}) are occasionally
generated by Sledgehammer.

Incidentally, if you see the warning

\prew
\slshape
Metis: Falling back on ``\textit{metis} (\textit{full\_types})''.
\postw

for a successful \textit{metis} proof, you can advantageously pass the
\textit{full\_types} option to \textit{metis} directly.

\point{Are generated proofs minimal?}

Automatic provers frequently use many more facts than are necessary.
Sledgehammer includes a minimization tool that takes a set of facts returned by a
given prover and repeatedly calls the same prover, \textit{metis}, or
\textit{smt} with subsets of those axioms in order to find a minimal set.
Reducing the number of axioms typically improves Metis's speed and success rate,
while also removing superfluous clutter from the proof scripts.

In earlier versions of Sledgehammer, generated proofs were systematically
accompanied by a suggestion to invoke the minimization tool. This step is now
performed implicitly if it can be done in a reasonable amount of time (something
that can be guessed from the number of facts in the original proof and the time
it took to find it or replay it).

In addition, some provers (e.g., Yices) do not provide proofs or sometimes
produce incomplete proofs. The minimizer is then invoked to find out which facts
are actually needed from the (large) set of facts that was initially given to
the prover. Finally, if a prover returns a proof with lots of facts, the
minimizer is invoked automatically since Metis would be unlikely to re-find the
proof.

\point{A strange error occurred---what should I do?}

Sledgehammer tries to give informative error messages. Please report any strange
error to the author at \authoremail. This applies doubly if you get the message

\prew
\slshape
The prover found a type-unsound proof involving ``\textit{foo}'',
``\textit{bar}'', and ``\textit{baz}'' even though a supposedly type-sound
encoding was used (or, less likely, your axioms are inconsistent). You might
want to report this to the Isabelle developers.
\postw

\point{Auto can solve it---why not Sledgehammer?}

Problems can be easy for \textit{auto} and difficult for automatic provers, but
the reverse is also true, so don't be discouraged if your first attempts fail.
Because the system refers to all theorems known to Isabelle, it is particularly
suitable when your goal has a short proof from lemmas that you don't know about.

\point{Why are there so many options?}

Sledgehammer's philosophy is that it should work out of the box, without user
guidance. Many of the options are meant to be used mostly by the Sledgehammer
developers for experimentation purposes. Of course, feel free to experiment with
them if you are so inclined.

\section{Command Syntax}
\label{command-syntax}

Sledgehammer can be invoked at any point when there is an open goal by entering
the \textbf{sledgehammer} command in the theory file. Its general syntax is as
follows:

\prew
\textbf{sledgehammer} \qty{subcommand}$^?$ \qty{options}$^?$ \qty{facts\_override}$^?$ \qty{num}$^?$
\postw

For convenience, Sledgehammer is also available in the ``Commands'' submenu of
the ``Isabelle'' menu in Proof General or by pressing the Emacs key sequence C-c
C-a C-s. This is equivalent to entering the \textbf{sledgehammer} command with
no arguments in the theory text.

In the general syntax, the \qty{subcommand} may be any of the following:

\begin{enum}
\item[$\bullet$] \textbf{\textit{run} (the default):} Runs Sledgehammer on
subgoal number \qty{num} (1 by default), with the given options and facts.

\item[$\bullet$] \textbf{\textit{min}:} Attempts to minimize the facts
specified in the \qty{facts\_override} argument to obtain a simpler proof
involving fewer facts. The options and goal number are as for \textit{run}.

\item[$\bullet$] \textbf{\textit{messages}:} Redisplays recent messages issued
by Sledgehammer. This allows you to examine results that might have been lost
due to Sledgehammer's asynchronous nature. The \qty{num} argument specifies a
limit on the number of messages to display (5 by default).

\item[$\bullet$] \textbf{\textit{supported\_provers}:} Prints the list of
automatic provers supported by Sledgehammer. See \S\ref{installation} and
\S\ref{mode-of-operation} for more information on how to install automatic
provers.

\item[$\bullet$] \textbf{\textit{running\_provers}:} Prints information about
currently running automatic provers, including elapsed runtime and remaining
time until timeout.

\item[$\bullet$] \textbf{\textit{kill\_provers}:} Terminates all running
automatic provers.

\item[$\bullet$] \textbf{\textit{refresh\_tptp}:} Refreshes the list of remote
ATPs available at System\-On\-TPTP \cite{sutcliffe-2000}.
\end{enum}
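
As an illustration of the subcommand syntax, the invocations below minimize two
facts with E, redisplay up to ten recent messages, and list the supported
provers. The fact names \textit{hd.simps} and \textit{tl.simps} and the choice
of E are merely placeholders:

\prew
\textbf{sledgehammer} \textit{min} [\textit{prover} = \textit{e}] (\textit{hd.simps} \textit{tl.simps}) \\
\textbf{sledgehammer} \textit{messages} 10 \\
\textbf{sledgehammer} \textit{supported\_provers}
\postw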

Sledgehammer's behavior can be influenced by various \qty{options}, which can be
specified in brackets after the \textbf{sledgehammer} command. The
\qty{options} are a list of key--value pairs of the form ``[$k_1 = v_1,
\ldots, k_n = v_n$]''. For Boolean options, ``= \textit{true}'' is optional. For
example:

\prew
\textbf{sledgehammer} [\textit{isar\_proof}, \,\textit{timeout} = 120]
\postw

Default values can be set using \textbf{sledgehammer\_\allowbreak params}:

\prew
\textbf{sledgehammer\_params} \qty{options}
\postw

The supported options are described in \S\ref{option-reference}.
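
For example, the following declaration, given here only as a sketch with
arbitrarily chosen values, makes E and SPASS the default provers and sets the
default timeout to 60 seconds for subsequent invocations:

\prew
\textbf{sledgehammer\_params} [\textit{provers} = \textit{e} \textit{spass}, \,\textit{timeout} = 60]
\postw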

The \qty{facts\_override} argument lets you alter the set of facts that go
through the relevance filter. It may be of the form ``(\qty{facts})'', where
\qty{facts} is a space-separated list of Isabelle facts (theorems, local
assumptions, etc.), in which case the relevance filter is bypassed and the given
facts are used. It may also be of the form ``(\textit{add}:\ \qty{facts\/_{\mathrm{1}}})'',
``(\textit{del}:\ \qty{facts\/_{\mathrm{2}}})'', or ``(\textit{add}:\ \qty{facts\/_{\mathrm{1}}}\
\textit{del}:\ \qty{facts\/_{\mathrm{2}}})'', where the relevance filter is instructed to
proceed as usual except that it should consider \qty{facts\/_{\mathrm{1}}}
highly relevant and \qty{facts\/_{\mathrm{2}}} fully irrelevant.
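
To illustrate the two forms with the facts \textit{hd.simps} and
\textit{tl.simps} (chosen arbitrarily), the first invocation below bypasses the
relevance filter and passes exactly these two facts, whereas the second one
keeps the filter but favors \textit{hd.simps} and excludes \textit{tl.simps}:

\prew
\textbf{sledgehammer} (\textit{hd.simps} \textit{tl.simps}) \\
\textbf{sledgehammer} (\textit{add}: \textit{hd.simps} \textit{del}: \textit{tl.simps})
\postw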

You can instruct Sledgehammer to run automatically on newly entered theorems by
enabling the ``Auto Sledgehammer'' option in Proof General's ``Isabelle'' menu.
For automatic runs, only the first prover set using \textit{provers}
(\S\ref{mode-of-operation}) is considered, fewer facts are passed to the prover,
\textit{slicing} (\S\ref{mode-of-operation}) is disabled, \textit{sound}
(\S\ref{problem-encoding}) is enabled, \textit{verbose} (\S\ref{output-format})
and \textit{debug} (\S\ref{output-format}) are disabled, and \textit{timeout}
(\S\ref{timeouts}) is superseded by the ``Auto Tools Time Limit'' in Proof
General's ``Isabelle'' menu. Sledgehammer's output is also more concise.

The \textit{metis} proof method has the syntax

\prew
\textbf{\textit{metis}}~(\qty{type\_enc})${}^?$~\qty{facts}${}^?$
\postw

where \qty{type\_enc} is a type encoding specification with the same semantics
as Sledgehammer's \textit{type\_enc} option (\S\ref{problem-encoding}) and
\qty{facts} is a list of arbitrary facts. In addition to the values listed in
\S\ref{problem-encoding}, \qty{type\_enc} may also be \textit{full\_types}, in
which case an appropriate type-sound encoding is chosen, \textit{partial\_types}
(the default type-unsound encoding), or \textit{no\_types}, a synonym for
\textit{erased}.
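
For example, assuming \textit{hd.simps} and \textit{tl.simps} are the relevant
facts, each of the following alternative calls is well formed; they use the
default encoding, a type-sound encoding, and no type information, respectively:

\prew
\textbf{by}~(\textit{metis}~\textit{hd.simps}~\textit{tl.simps}) \\
\textbf{by}~(\textit{metis}~(\textit{full\_types})~\textit{hd.simps}~\textit{tl.simps}) \\
\textbf{by}~(\textit{metis}~(\textit{no\_types})~\textit{hd.simps}~\textit{tl.simps})
\postw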
\section{Option Reference}
\label{option-reference}

\def\defl{\{}
\def\defr{\}}

\def\flushitem#1{\item[]\noindent\kern-\leftmargin \textbf{#1}}
\def\optrue#1#2{\flushitem{\textit{#1} $\bigl[$= \qtybf{bool}$\bigr]$\enskip \defl\textit{true}\defr\hfill (neg.: \textit{#2})}\nopagebreak\\[\parskip]}
\def\opfalse#1#2{\flushitem{\textit{#1} $\bigl[$= \qtybf{bool}$\bigr]$\enskip \defl\textit{false}\defr\hfill (neg.: \textit{#2})}\nopagebreak\\[\parskip]}
\def\opsmart#1#2{\flushitem{\textit{#1} $\bigl[$= \qtybf{smart\_bool}$\bigr]$\enskip \defl\textit{smart}\defr\hfill (neg.: \textit{#2})}\nopagebreak\\[\parskip]}
\def\opnodefault#1#2{\flushitem{\textit{#1} = \qtybf{#2}} \nopagebreak\\[\parskip]}
\def\opnodefaultbrk#1#2{\flushitem{$\bigl[$\textit{#1} =$\bigr]$ \qtybf{#2}} \nopagebreak\\[\parskip]}
\def\opdefault#1#2#3{\flushitem{\textit{#1} = \qtybf{#2}\enskip \defl\textit{#3}\defr} \nopagebreak\\[\parskip]}
\def\oparg#1#2#3{\flushitem{\textit{#1} \qtybf{#2} = \qtybf{#3}} \nopagebreak\\[\parskip]}
\def\opargbool#1#2#3{\flushitem{\textit{#1} \qtybf{#2} $\bigl[$= \qtybf{bool}$\bigr]$\hfill (neg.: \textit{#3})}\nopagebreak\\[\parskip]}
\def\opargboolorsmart#1#2#3{\flushitem{\textit{#1} \qtybf{#2} $\bigl[$= \qtybf{smart\_bool}$\bigr]$\hfill (neg.: \textit{#3})}\nopagebreak\\[\parskip]}

Sledgehammer's options are categorized as follows:\ mode of operation
(\S\ref{mode-of-operation}), problem encoding (\S\ref{problem-encoding}),
relevance filter (\S\ref{relevance-filter}), output format
(\S\ref{output-format}), authentication (\S\ref{authentication}), and timeouts
(\S\ref{timeouts}).

The descriptions below refer to the following syntactic quantities:

\begin{enum}
\item[$\bullet$] \qtybf{string}: A string.
\item[$\bullet$] \qtybf{bool\/}: \textit{true} or \textit{false}.
\item[$\bullet$] \qtybf{smart\_bool\/}: \textit{true}, \textit{false}, or
\textit{smart}.
\item[$\bullet$] \qtybf{int\/}: An integer.
%\item[$\bullet$] \qtybf{float\/}: A floating-point number (e.g., 2.5).
\item[$\bullet$] \qtybf{float\_pair\/}: A pair of floating-point numbers
(e.g., 0.6 0.95).
\item[$\bullet$] \qtybf{smart\_int\/}: An integer or \textit{smart}.
\item[$\bullet$] \qtybf{float\_or\_none\/}: A floating-point number (e.g., 60 or
0.5) expressing a number of seconds, or the keyword \textit{none} ($\infty$
seconds).
\end{enum}

Default values are indicated in curly brackets (\textrm{\{\}}). Boolean options
have a negated counterpart (e.g., \textit{blocking} vs.\
\textit{non\_blocking}). When setting them, ``= \textit{true}'' may be omitted.
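
For instance, taking \textit{blocking} as the example option, the first two
invocations below are equivalent to each other, as are the last two:

\prew
\textbf{sledgehammer} [\textit{blocking}] \\
\textbf{sledgehammer} [\textit{blocking} = \textit{true}] \\
\textbf{sledgehammer} [\textit{non\_blocking}] \\
\textbf{sledgehammer} [\textit{blocking} = \textit{false}]
\postw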

\subsection{Mode of Operation}
\label{mode-of-operation}

\begin{enum}
\opnodefaultbrk{provers}{string}
Specifies the automatic provers to use as a space-separated list (e.g.,
``\textit{e}~\textit{spass}~\textit{remote\_vampire}''). The following local
provers are supported:

\begin{enum}
\item[$\bullet$] \textbf{\textit{cvc3}:} CVC3 is an SMT solver developed by
Clark Barrett, Cesare Tinelli, and their colleagues \cite{cvc3}. To use CVC3,
set the environment variable \texttt{CVC3\_SOLVER} to the complete path of the
executable, including the file name. Sledgehammer has been tested with version
2.2.

\item[$\bullet$] \textbf{\textit{e}:} E is a first-order resolution prover
developed by Stephan Schulz \cite{schulz-2002}. To use E, set the environment
variable \texttt{E\_HOME} to the directory that contains the \texttt{eproof}
executable, or install the prebuilt E package from Isabelle's download page. See
\S\ref{installation} for details.

\item[$\bullet$] \textbf{\textit{leo2}:} LEO-II is an automatic
higher-order prover developed by Christoph Benzm\"uller et al.\ \cite{leo2},
with support for the TPTP many-typed higher-order syntax (THF0). Sledgehammer
requires version 1.2.9 or above.

\item[$\bullet$] \textbf{\textit{metis}:} Although it is much less powerful than
the external provers, Metis itself can be used for proof search.

\item[$\bullet$] \textbf{\textit{metis\_full\_types}:} Fully typed version of
Metis, corresponding to \textit{metis} (\textit{full\_types}).

\item[$\bullet$] \textbf{\textit{metis\_no\_types}:} Untyped version of Metis,
corresponding to \textit{metis} (\textit{no\_types}).

\item[$\bullet$] \textbf{\textit{satallax}:} Satallax is an automatic
higher-order prover developed by Chad Brown et al.\ \cite{satallax}, with
support for the TPTP many-typed higher-order syntax (THF0). Sledgehammer
requires version 2.2 or above.

\item[$\bullet$] \textbf{\textit{smt}:} The \textit{smt} proof method with the
current settings (typically, Z3 with proof reconstruction).

\item[$\bullet$] \textbf{\textit{spass}:} SPASS is a first-order resolution
prover developed by Christoph Weidenbach et al.\ \cite{weidenbach-et-al-2009}.
To use SPASS, set the environment variable \texttt{SPASS\_HOME} to the directory
that contains the \texttt{SPASS} executable, or install the prebuilt SPASS
package from Isabelle's download page. Sledgehammer requires version 3.5 or
above. See \S\ref{installation} for details.

\item[$\bullet$] \textbf{\textit{vampire}:} Vampire is a first-order resolution
prover developed by Andrei Voronkov and his colleagues
\cite{riazanov-voronkov-2002}. To use Vampire, set the environment variable
\texttt{VAMPIRE\_HOME} to the directory that contains the \texttt{vampire}
executable and \texttt{VAMPIRE\_VERSION} to the version number (e.g., ``1.8'').
Sledgehammer has been tested with versions 0.6, 1.0, and 1.8. Vampire 1.8
supports the TPTP many-typed first-order format (TFF0).

\item[$\bullet$] \textbf{\textit{yices}:} Yices is an SMT solver developed at
SRI \cite{yices}. To use Yices, set the environment variable
\texttt{YICES\_SOLVER} to the complete path of the executable, including the
file name. Sledgehammer has been tested with version 1.0.

\item[$\bullet$] \textbf{\textit{z3}:} Z3 is an SMT solver developed at
Microsoft Research \cite{z3}. To use Z3, set the environment variable
\texttt{Z3\_SOLVER} to the complete path of the executable, including the file
name, and set \texttt{Z3\_NON\_COMMERCIAL} to ``yes'' to confirm that you are a
noncommercial user. Sledgehammer has been tested with versions 2.7 to 2.18.

\item[$\bullet$] \textbf{\textit{z3\_tptp}:} This version of Z3 pretends to be
an ATP, exploiting Z3's support for the TPTP untyped and many-typed first-order
formats (FOF and TFF0). It is included for experimental purposes. It requires
version 3.0 or above.
\end{enum}
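
As a rough sketch, on a Unix-like system the environment variables mentioned
above could be set along the following lines before Isabelle is started. All
paths are hypothetical placeholders and must be adapted to the local
installation:

\prew
\texttt{export E\_HOME=/opt/provers/E/PROVER} \\
\texttt{export SPASS\_HOME=/opt/provers/SPASS/bin} \\
\texttt{export VAMPIRE\_HOME=/opt/provers/vampire} \\
\texttt{export VAMPIRE\_VERSION=1.8} \\
\texttt{export Z3\_SOLVER=/opt/provers/z3/bin/z3} \\
\texttt{export Z3\_NON\_COMMERCIAL=yes}
\postw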

In addition, the following remote provers are supported:

\begin{enum}
\item[$\bullet$] \textbf{\textit{remote\_cvc3}:} The remote version of CVC3 runs
on servers at the TU M\"unchen (or wherever \texttt{REMOTE\_SMT\_URL} is set to
point).

\item[$\bullet$] \textbf{\textit{remote\_e}:} The remote version of E runs
on Geoff Sutcliffe's Miami servers \cite{sutcliffe-2000}.

\item[$\bullet$] \textbf{\textit{remote\_e\_sine}:} E-SInE is a metaprover
developed by Kry\v stof Hoder \cite{sine} based on E. The remote version of
E-SInE runs on Geoff Sutcliffe's Miami servers.

\item[$\bullet$] \textbf{\textit{remote\_e\_tofof}:} E-ToFoF is a metaprover
developed by Geoff Sutcliffe \cite{tofof} based on E. This ATP supports the
TPTP many-typed first-order format (TFF0). The remote version of E-ToFoF runs
on Geoff Sutcliffe's Miami servers.

\item[$\bullet$] \textbf{\textit{remote\_iprover}:} iProver is a pure
instantiation-based prover developed by Konstantin Korovin \cite{korovin-2009}. The
remote version of iProver runs on Geoff Sutcliffe's Miami servers
\cite{sutcliffe-2000}.

\item[$\bullet$] \textbf{\textit{remote\_iprover\_eq}:} iProver-Eq is an
instantiation-based prover with native support for equality developed by
Konstantin Korovin and Christoph Sticksel \cite{korovin-sticksel-2010}. The
remote version of iProver-Eq runs on Geoff Sutcliffe's Miami servers
\cite{sutcliffe-2000}.

\item[$\bullet$] \textbf{\textit{remote\_leo2}:} The remote version of LEO-II
runs on Geoff Sutcliffe's Miami servers \cite{sutcliffe-2000}.

\item[$\bullet$] \textbf{\textit{remote\_satallax}:} The remote version of
Satallax runs on Geoff Sutcliffe's Miami servers \cite{sutcliffe-2000}.

\item[$\bullet$] \textbf{\textit{remote\_snark}:} SNARK is a first-order
resolution prover developed by Stickel et al.\ \cite{snark}. It supports the
TPTP many-typed first-order format (TFF0). The remote version of SNARK runs on
Geoff Sutcliffe's Miami servers.

\item[$\bullet$] \textbf{\textit{remote\_vampire}:} The remote version of
Vampire runs on Geoff Sutcliffe's Miami servers. Version 1.8 is used.

\item[$\bullet$] \textbf{\textit{remote\_waldmeister}:} Waldmeister is a unit
equality prover developed by Hillenbrand et al.\ \cite{waldmeister}. It can be
used to prove universally quantified equations using unconditional equations,
corresponding to the TPTP CNF UEQ division. The remote version of Waldmeister
runs on Geoff Sutcliffe's Miami servers.

\item[$\bullet$] \textbf{\textit{remote\_z3}:} The remote version of Z3 runs on
servers at the TU M\"unchen (or wherever \texttt{REMOTE\_SMT\_URL} is set to
point).

\item[$\bullet$] \textbf{\textit{remote\_z3\_tptp}:} The remote version of ``Z3
with TPTP syntax'' runs on Geoff Sutcliffe's Miami servers.
\end{enum}

By default, Sledgehammer runs E, E-SInE, SPASS, Vampire, Z3 (or whatever
the SMT module's \textit{smt\_solver} configuration option is set to), and (if
appropriate) Waldmeister in parallel---either locally or remotely, depending on
the number of processor cores available. For historical reasons, the default
value of this option can be overridden using the option ``Sledgehammer:
Provers'' in Proof General's ``Isabelle'' menu.

It is generally a good idea to run several provers in parallel. Running E,
SPASS, and Vampire for 5~seconds yields a similar success rate to running the
most effective of these for 120~seconds \cite{boehme-nipkow-2010}.

For the \textit{min} subcommand, the default prover is \textit{metis}. If
several provers are set, the first one is used.

\opnodefault{prover}{string}
Alias for \textit{provers}.

%\opnodefault{atps}{string}
%Legacy alias for \textit{provers}.

%\opnodefault{atp}{string}
%Legacy alias for \textit{provers}.

\opfalse{blocking}{non\_blocking}
Specifies whether the \textbf{sledgehammer} command should operate
synchronously. The asynchronous (non-blocking) mode lets the user start proving
the putative theorem manually while Sledgehammer looks for a proof, but it can
also be more confusing. Irrespective of the value of this option, Sledgehammer
is always run synchronously for the new jEdit-based user interface or if
\textit{debug} (\S\ref{output-format}) is enabled.

\optrue{slicing}{no\_slicing}
Specifies whether the time allocated to a prover should be sliced into several
segments, each of which has its own set of possibly prover-dependent options.
For SPASS and Vampire, the first slice tries the fast but incomplete
set-of-support (SOS) strategy, whereas the second slice runs without it. For E,
up to three slices are tried, with different weighted search strategies and
numbers of facts. For SMT solvers, several slices are tried with the same
options each time but fewer and fewer facts. According to benchmarks with a
timeout of 30~seconds, slicing is a valuable optimization, and you should
probably leave it enabled unless you are conducting experiments. This option is
implicitly disabled for (short) automatic runs.

\nopagebreak
{\small See also \textit{verbose} (\S\ref{output-format}).}

\opfalse{overlord}{no\_overlord}
Specifies whether Sledgehammer should put its temporary files in
\texttt{\$ISA\-BELLE\_\allowbreak HOME\_\allowbreak USER}, which is useful for
debugging Sledgehammer but also unsafe if several instances of the tool are run
simultaneously. The files are identified by the prefix \texttt{prob\_}; you may
safely remove them after Sledgehammer has run.

\nopagebreak
{\small See also \textit{debug} (\S\ref{output-format}).}
\end{enum}
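
As a sketch of how the options in this subsection combine, the following
invocations assume that options are passed to the \textbf{sledgehammer} command
in square brackets and that the lowercase names \textit{e} and \textit{spass}
denote the local E and SPASS provers. The particular selections are
illustrative, not recommendations.

\begin{quote}
\textbf{sledgehammer} [\textit{provers} = \textit{e spass remote\_vampire}] \\
\textbf{sledgehammer} [\textit{blocking}, \textit{no\_slicing}] \\
\textbf{sledgehammer} [\textit{overlord}]
\end{quote}

The first line restricts the run to three provers, the second runs
synchronously with slicing disabled (as one might do when conducting
experiments), and the third keeps the temporary files with the
\texttt{prob\_} prefix for inspection.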
|
\subsection{Problem Encoding}
\label{problem-encoding}

\begin{enum}
\opdefault{type\_enc}{string}{smart}
Specifies the type encoding to use in ATP problems. Some of the type encodings
are unsound, meaning that they can give rise to spurious proofs
(unreconstructible using \textit{metis}). The supported type encodings are
listed below, with an indication of their soundness in parentheses:

\begin{enum}
\item[$\bullet$] \textbf{\textit{erased} (very unsound):} No type information is
supplied to the ATP. Types are simply erased.

\item[$\bullet$] \textbf{\textit{poly\_guards} (sound):} Types are encoded using
a predicate \textit{has\_\allowbreak type\/}$(\tau, t)$ that guards bound
variables. Constants are annotated with their types, supplied as additional
arguments, to resolve overloading.

\item[$\bullet$] \textbf{\textit{poly\_tags} (sound):} Each term and subterm is
tagged with its type using a function $\mathit{type\/}(\tau, t)$.

\item[$\bullet$] \textbf{\textit{poly\_args} (unsound):}
As with \textit{poly\_guards}, constants are annotated with their types to
resolve overloading, but otherwise no type information is encoded. This
coincides with the default encoding used by the \textit{metis} command.

\item[$\bullet$]
\textbf{%
\textit{raw\_mono\_guards}, \textit{raw\_mono\_tags} (sound); \\
\textit{raw\_mono\_args} (unsound):} \\
Similar to \textit{poly\_guards}, \textit{poly\_tags}, and \textit{poly\_args},
respectively, but the problem is additionally monomorphized, meaning that type
variables are instantiated with heuristically chosen ground types.
Monomorphization can simplify reasoning but also leads to larger fact bases,
which can slow down the ATPs.

\item[$\bullet$]
\textbf{%
\textit{mono\_guards}, \textit{mono\_tags} (sound);
\textit{mono\_args} (unsound):} \\
Similar to
\textit{raw\_mono\_guards}, \textit{raw\_mono\_tags}, and
\textit{raw\_mono\_args}, respectively, but types are mangled in constant names
instead of being supplied as ground term arguments. The binary predicate
$\mathit{has\_type\/}(\tau, t)$ becomes a unary predicate
$\mathit{has\_type\_}\tau(t)$, and the binary function
$\mathit{type\/}(\tau, t)$ becomes a unary function
$\mathit{type\_}\tau(t)$.

\item[$\bullet$] \textbf{\textit{mono\_simple} (sound):} Exploits simple
first-order types if the prover supports the TFF0 or THF0 syntax; otherwise,
falls back on \textit{mono\_guards}. The problem is monomorphized.

\item[$\bullet$] \textbf{\textit{mono\_simple\_higher} (sound):} Exploits simple
higher-order types if the prover supports the THF0 syntax; otherwise, falls back
on \textit{mono\_simple} or \textit{mono\_guards}. The problem is monomorphized.

\item[$\bullet$]
\textbf{%
\textit{poly\_guards}?, \textit{poly\_tags}?, \textit{raw\_mono\_guards}?, \\
\textit{raw\_mono\_tags}?, \textit{mono\_guards}?, \textit{mono\_tags}?, \\
\textit{mono\_simple}? (quasi-sound):} \\
The type encodings \textit{poly\_guards}, \textit{poly\_tags},
\textit{raw\_mono\_guards}, \textit{raw\_mono\_tags}, \textit{mono\_guards},
\textit{mono\_tags}, and \textit{mono\_simple} are fully
typed and sound. For each of these, Sledgehammer also provides a lighter,
virtually sound variant identified by a question mark (`\hbox{?}')\ that detects
and erases monotonic types, notably infinite types. (For \textit{mono\_simple},
the types are not actually erased but rather replaced by a shared uniform type
of individuals.) As argument to the \textit{metis} proof method, the question
mark is replaced by a \hbox{``\textit{\_query}''} suffix. If the \emph{sound}
option is enabled, these encodings are fully sound.

\item[$\bullet$]
\textbf{%
\textit{poly\_guards}??, \textit{poly\_tags}??, \textit{raw\_mono\_guards}??, \\
\textit{raw\_mono\_tags}??, \textit{mono\_guards}??, \textit{mono\_tags}?? \\
(quasi-sound):} \\
Even lighter versions of the `\hbox{?}' encodings. As argument to the
\textit{metis} proof method, the `\hbox{??}' suffix is replaced by
\hbox{``\textit{\_query\_query}''}.

\item[$\bullet$]
\textbf{%
\textit{poly\_guards}@?, \textit{poly\_tags}@?, \textit{raw\_mono\_guards}@?, \\
\textit{raw\_mono\_tags}@? (quasi-sound):} \\
Alternative versions of the `\hbox{??}' encodings. As argument to the
\textit{metis} proof method, the `\hbox{@?}' suffix is replaced by
\hbox{``\textit{\_at\_query}''}.

\item[$\bullet$]
\textbf{%
\textit{poly\_guards}!, \textit{poly\_tags}!, \textit{raw\_mono\_guards}!, \\
\textit{raw\_mono\_tags}!, \textit{mono\_guards}!, \textit{mono\_tags}!, \\
\textit{mono\_simple}!, \textit{mono\_simple\_higher}! (mildly unsound):} \\
The type encodings \textit{poly\_guards}, \textit{poly\_tags},
\textit{raw\_mono\_guards}, \textit{raw\_mono\_tags}, \textit{mono\_guards},
\textit{mono\_tags}, \textit{mono\_simple}, and \textit{mono\_simple\_higher}
also admit a mildly unsound (but very efficient) variant identified by an
exclamation mark (`\hbox{!}') that detects and erases all types except
those that are clearly finite (e.g., \textit{bool}). (For \textit{mono\_simple}
and \textit{mono\_simple\_higher}, the types are not actually erased but rather
replaced by a shared uniform type of individuals.) As argument to the
\textit{metis} proof method, the exclamation mark is replaced by the suffix
\hbox{``\textit{\_bang}''}.

\item[$\bullet$]
\textbf{%
\textit{poly\_guards}!!, \textit{poly\_tags}!!, \textit{raw\_mono\_guards}!!, \\
\textit{raw\_mono\_tags}!!, \textit{mono\_guards}!!, \textit{mono\_tags}!! \\
(mildly unsound):} \\
Even lighter versions of the `\hbox{!}' encodings. As argument to the
\textit{metis} proof method, the `\hbox{!!}' suffix is replaced by
\hbox{``\textit{\_bang\_bang}''}.

\item[$\bullet$]
\textbf{%
\textit{poly\_guards}@!, \textit{poly\_tags}@!, \textit{raw\_mono\_guards}@!, \\
\textit{raw\_mono\_tags}@! (mildly unsound):} \\
Alternative versions of the `\hbox{!!}' encodings. As argument to the
\textit{metis} proof method, the `\hbox{@!}' suffix is replaced by
\hbox{``\textit{\_at\_bang}''}.

\item[$\bullet$] \textbf{\textit{smart}:} The actual encoding used depends on
the ATP and should be the most efficient virtually sound encoding for that ATP.
\end{enum}

For SMT solvers, the type encoding is always \textit{mono\_simple}, irrespective
of the value of this option.

\nopagebreak
{\small See also \textit{max\_new\_mono\_instances} (\S\ref{relevance-filter})
and \textit{max\_mono\_iters} (\S\ref{relevance-filter}).}

\opfalse{sound}{unsound}
Specifies whether Sledgehammer should run in its fully sound mode. In that mode,
quasi-sound type encodings (which are the default) are made fully sound, at the
cost of some clutter in the generated problems. This option is ignored if
\textit{type\_enc} is explicitly set to an unsound encoding.
\end{enum}
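
To make the difference between the guard-based and tag-based encodings
concrete, here is a small worked illustration. The formula, the predicate $p$,
and the type $\tau$ are hypothetical, and $p$ is assumed not to be overloaded,
so that it needs no type arguments. The problems actually generated by
Sledgehammer may differ in detail. For a statement saying that $p$ holds for
every $x$ of type $\tau$, the encodings described above would give roughly:

\begin{align*}
\textit{erased}: &\quad \forall x.\; p(x) \\
\textit{poly\_guards}: &\quad \forall x.\; \mathit{has\_type\/}(\tau, x) \longrightarrow p(x) \\
\textit{poly\_tags}: &\quad \forall x.\; p(\mathit{type\/}(\tau, x)) \\
\textit{mono\_guards}: &\quad \forall x.\; \mathit{has\_type\_}\tau(x) \longrightarrow p(x) \\
\textit{mono\_tags}: &\quad \forall x.\; p(\mathit{type\_}\tau(x))
\end{align*}

The erased form no longer records that $x$ ranges over $\tau$, which is why it
can give rise to spurious proofs. Assuming options are passed in square
brackets, an encoding could be selected explicitly, or the fully sound mode
requested, as follows:

\begin{quote}
\textbf{sledgehammer} [\textit{type\_enc} = \textit{poly\_guards}] \\
\textbf{sledgehammer} [\textit{sound}]
\end{quote}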

\subsection{Relevance Filter}
\label{relevance-filter}

\begin{enum}
\opdefault{relevance\_thresholds}{float\_pair}{\upshape 0.45~0.85}
Specifies the thresholds above which facts are considered relevant by the
relevance filter. The first threshold is used for the first iteration of the
relevance filter and the second threshold is used for the last iteration (if it
is reached). The effective threshold is quadratically interpolated for the other
iterations. Each threshold ranges from 0 to 1, where 0 means that all theorems
are relevant and 1 means that only theorems that refer to previously seen
constants are relevant.

\opdefault{max\_relevant}{smart\_int}{smart}
Specifies the maximum number of facts that may be returned by the relevance
filter. If the option is set to \textit{smart}, it is set to a value that was
empirically found to be appropriate for the prover. A typical value would be
250.

\opdefault{max\_new\_mono\_instances}{int}{\upshape 200}
Specifies the maximum number of monomorphic instances to generate beyond
\textit{max\_relevant}. The higher this limit is, the more monomorphic instances
are potentially generated. Whether monomorphization takes place depends on the
type encoding used.

\nopagebreak
{\small See also \textit{type\_enc} (\S\ref{problem-encoding}).}

\opdefault{max\_mono\_iters}{int}{\upshape 3}
Specifies the maximum number of iterations for the monomorphization fixpoint
construction. The higher this limit is, the more monomorphic instances are
potentially generated. Whether monomorphization takes place depends on the
type encoding used.

\nopagebreak
{\small See also \textit{type\_enc} (\S\ref{problem-encoding}).}
\end{enum}
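
The relevance filter options might be adjusted as in the following sketch. It
assumes that options are passed in square brackets and that a float pair is
written as two space-separated numbers, as the default value suggests. The
numbers themselves are arbitrary illustrations.

\begin{quote}
\textbf{sledgehammer} [\textit{relevance\_thresholds} = 0.3 0.7,
\textit{max\_relevant} = 150] \\
\textbf{sledgehammer} [\textit{max\_new\_mono\_instances} = 100,
\textit{max\_mono\_iters} = 2]
\end{quote}

Lower thresholds let more facts pass the filter in each iteration, whereas a
lower \textit{max\_relevant} caps how many facts are handed to the prover. The
second line only has an effect if the type encoding in use monomorphizes the
problem.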

\subsection{Output Format}
\label{output-format}

\begin{enum}

\opfalse{verbose}{quiet}
Specifies whether the \textbf{sledgehammer} command should explain what it does.
This option is implicitly disabled for automatic runs.

\opfalse{debug}{no\_debug}
Specifies whether Sledgehammer should display additional debugging information
beyond what \textit{verbose} already displays. Enabling \textit{debug} also
enables \textit{verbose} and \textit{blocking} (\S\ref{mode-of-operation})
behind the scenes. The \textit{debug} option is implicitly disabled for
automatic runs.

\nopagebreak
{\small See also \textit{overlord} (\S\ref{mode-of-operation}).}

\opfalse{isar\_proof}{no\_isar\_proof}
Specifies whether Isar proofs should be output in addition to one-liner
\textit{metis} proofs. Isar proof construction is still experimental and often
fails; however, Isar proofs are usually faster and sometimes more robust than
\textit{metis} proofs.

\opdefault{isar\_shrink\_factor}{int}{\upshape 1}
Specifies the granularity of the Isar proof. A value of $n$ indicates that each
Isar proof step should correspond to a group of up to $n$ consecutive proof
steps in the ATP proof.
\end{enum}
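
As an illustration of the output options, and assuming that options are passed
in square brackets, one might write:

\begin{quote}
\textbf{sledgehammer} [\textit{verbose}] \\
\textbf{sledgehammer} [\textit{isar\_proof}, \textit{isar\_shrink\_factor} = 2]
\end{quote}

With a shrink factor of 2, each Isar step may cover up to two consecutive ATP
steps. A hypothetical ATP proof of ten steps would therefore be grouped into at
least five Isar steps, leaving aside any other simplification that
Sledgehammer may perform.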
|
\subsection{Authentication}
\label{authentication}

\begin{enum}
\opnodefault{expect}{string}
Specifies the expected outcome, which must be one of the following:

\begin{enum}
\item[$\bullet$] \textbf{\textit{some}:} Sledgehammer found a (potentially
unsound) proof.
\item[$\bullet$] \textbf{\textit{none}:} Sledgehammer found no proof.
\item[$\bullet$] \textbf{\textit{timeout}:} Sledgehammer timed out.
\item[$\bullet$] \textbf{\textit{unknown}:} Sledgehammer encountered some
problem.
\end{enum}

Sledgehammer emits an error (if \textit{blocking} is enabled) or a warning
(otherwise) if the actual outcome differs from the expected outcome. This option
is useful for regression testing.

\nopagebreak
{\small See also \textit{blocking} (\S\ref{mode-of-operation}) and
\textit{timeout} (\S\ref{timeouts}).}
\end{enum}
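
A regression test based on \textit{expect} could look like the following
sketch, which assumes that options are passed in square brackets. The timeout
value is an arbitrary choice.

\begin{quote}
\textbf{sledgehammer} [\textit{expect} = \textit{some}, \textit{blocking},
\textit{timeout} = 10]
\end{quote}

Because \textit{blocking} is enabled here, a run that finds no proof within the
time limit would be reported as an error rather than as a warning.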
|
\subsection{Timeouts}
\label{timeouts}

\begin{enum}
\opdefault{timeout}{float\_or\_none}{\upshape 30}
Specifies the maximum number of seconds that the automatic provers should spend
searching for a proof. This excludes problem preparation and is a soft limit.
For historical reasons, the default value of this option can be overridden using
the option ``Sledgehammer: Time Limit'' in Proof General's ``Isabelle'' menu.

\opdefault{preplay\_timeout}{float\_or\_none}{\upshape 4}
Specifies the maximum number of seconds that \textit{metis} or \textit{smt}
should spend trying to ``preplay'' the found proof. If this option is set to 0,
no preplaying takes place, and no timing information is displayed next to the
suggested \textit{metis} calls.
\end{enum}
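
The two timeouts can be set together, as in this sketch (again assuming the
square-bracket option syntax; the value 60 is only an example):

\begin{quote}
\textbf{sledgehammer} [\textit{timeout} = 60, \textit{preplay\_timeout} = 0]
\end{quote}

This gives the provers up to 60~seconds each as a soft limit and skips
preplaying, so the suggested \textit{metis} calls appear without timing
information.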

\let\em=\sl
\bibliography{../manual}{}
\bibliographystyle{abbrv}

\end{document}