author | blanchet |
Mon, 19 Jul 2021 10:38:14 +0200 | |
changeset 74046 | 462d652ad910 |
parent 74005 | 14de47e29fe4 |
child 74047 | a2b470e315ee |
permissions | -rw-r--r-- |
38047 | 1 |
(* Title: HOL/Tools/ATP/atp_systems.ML |
28592 | 2 |
Author: Fabian Immler, TU Muenchen |
36371
8c83ea1a7740
move the Sledgehammer menu options to "sledgehammer_isar.ML"
blanchet
parents:
36370
diff
changeset
|
3 |
Author: Jasmin Blanchette, TU Muenchen |
28592 | 4 |
|
36376 | 5 |
Setup for supported ATPs. |
28592 | 6 |
*) |
7 |
||
72400 | 8 |
signature SLEDGEHAMMER_ATP_SYSTEMS = |
28592 | 9 |
sig |
47038
2409b484e1cc
continued implementation of term ordering attributes
blanchet
parents:
47034
diff
changeset
|
10 |
type term_order = ATP_Problem.term_order |
45301
866b075aa99b
added sorted DFG output for coming version of SPASS
blanchet
parents:
45300
diff
changeset
|
11 |
type atp_format = ATP_Problem.atp_format |
53586
bd5fa6425993
prefixed types and some functions with "atp_" for disambiguation
blanchet
parents:
53515
diff
changeset
|
12 |
type atp_formula_role = ATP_Problem.atp_formula_role |
bd5fa6425993
prefixed types and some functions with "atp_" for disambiguation
blanchet
parents:
53515
diff
changeset
|
13 |
type atp_failure = ATP_Proof.atp_failure |
38023 | 14 |
|
51011 | 15 |
type slice_spec = (int * string) * atp_format * string * string * bool |
40059
6ad9081665db
use consistent terminology in Sledgehammer: "prover = ATP or SMT solver or ..."
blanchet
parents:
39491
diff
changeset
|
16 |
type atp_config = |
73374 | 17 |
{exec : string list * string list, |
73432 | 18 |
arguments : Proof.context -> bool -> string -> Time.time -> Path.T -> |
73568
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
19 |
term_order * (unit -> (string * int) list) * (unit -> (string * real) list) -> string list, |
42578
1eaf4d437d4c
define type system in ATP module so that ATPs can suggest a type system
blanchet
parents:
42577
diff
changeset
|
20 |
proof_delims : (string * string) list, |
53586
bd5fa6425993
prefixed types and some functions with "atp_" for disambiguation
blanchet
parents:
53515
diff
changeset
|
21 |
known_failures : (atp_failure * string) list, |
bd5fa6425993
prefixed types and some functions with "atp_" for disambiguation
blanchet
parents:
53515
diff
changeset
|
22 |
prem_role : atp_formula_role, |
48716
1d2a12bb0640
stop distinguishing between complete and incomplete slices, since this is very fragile and has hardly any useful semantics to users
blanchet
parents:
48715
diff
changeset
|
23 |
best_slices : Proof.context -> (real * (slice_spec * string)) list, |
47962
137883567114
lower the monomorphization thresholds for less scalable provers
blanchet
parents:
47955
diff
changeset
|
24 |
best_max_mono_iters : int, |
137883567114
lower the monomorphization thresholds for less scalable provers
blanchet
parents:
47955
diff
changeset
|
25 |
best_max_new_mono_instances : int} |
38023 | 26 |
|
47962
137883567114
lower the monomorphization thresholds for less scalable provers
blanchet
parents:
47955
diff
changeset
|
27 |
val default_max_mono_iters : int |
137883567114
lower the monomorphization thresholds for less scalable provers
blanchet
parents:
47955
diff
changeset
|
28 |
val default_max_new_mono_instances : int |
44099 | 29 |
val force_sos : bool Config.T |
47032 | 30 |
val term_order : string Config.T |
43566
a818d5a34cca
filter out some tautologies using an ATP, especially for those theories that are known for producing such things
blanchet
parents:
43529
diff
changeset
|
31 |
val e_smartN : string |
a818d5a34cca
filter out some tautologies using an ATP, especially for those theories that are known for producing such things
blanchet
parents:
43529
diff
changeset
|
32 |
val e_autoN : string |
a818d5a34cca
filter out some tautologies using an ATP, especially for those theories that are known for producing such things
blanchet
parents:
43529
diff
changeset
|
33 |
val e_fun_weightN : string |
a818d5a34cca
filter out some tautologies using an ATP, especially for those theories that are known for producing such things
blanchet
parents:
43529
diff
changeset
|
34 |
val e_sym_offset_weightN : string |
47032 | 35 |
val e_selection_heuristic : string Config.T |
42646
4781fcd53572
replaced some Unsynchronized.refs with Config.Ts
blanchet
parents:
42643
diff
changeset
|
36 |
val e_default_fun_weight : real Config.T |
4781fcd53572
replaced some Unsynchronized.refs with Config.Ts
blanchet
parents:
42643
diff
changeset
|
37 |
val e_fun_weight_base : real Config.T |
4781fcd53572
replaced some Unsynchronized.refs with Config.Ts
blanchet
parents:
42643
diff
changeset
|
38 |
val e_fun_weight_span : real Config.T |
4781fcd53572
replaced some Unsynchronized.refs with Config.Ts
blanchet
parents:
42643
diff
changeset
|
39 |
val e_default_sym_offs_weight : real Config.T |
4781fcd53572
replaced some Unsynchronized.refs with Config.Ts
blanchet
parents:
42643
diff
changeset
|
40 |
val e_sym_offs_weight_base : real Config.T |
4781fcd53572
replaced some Unsynchronized.refs with Config.Ts
blanchet
parents:
42643
diff
changeset
|
41 |
val e_sym_offs_weight_span : real Config.T |
50950 | 42 |
val spass_H1SOS : string |
43 |
val spass_H2 : string |
|
44 |
val spass_H2LR0LT0 : string |
|
45 |
val spass_H2NuVS0 : string |
|
46 |
val spass_H2NuVS0Red2 : string |
|
47 |
val spass_H2SOS : string |
|
68563 | 48 |
val is_vampire_noncommercial_license_accepted : unit -> bool option |
73435
1cc848548f21
invoke remote ATP via SystemOnTPTP.run_systems from Isabelle/Scala (without perl);
wenzelm
parents:
73432
diff
changeset
|
49 |
val isabelle_scala_function: string list * string list |
57671
dc5e1b1db9ba
avoid 'eproof' and 'eproof_ram' scripts if possible (i.e. if 'eprover' can produce reasonable enough proofs for one-liner reconstruction)
blanchet
parents:
57636
diff
changeset
|
50 |
val remote_atp : string -> string -> string list -> (string * string) list -> |
dc5e1b1db9ba
avoid 'eproof' and 'eproof_ram' scripts if possible (i.e. if 'eprover' can produce reasonable enough proofs for one-liner reconstruction)
blanchet
parents:
57636
diff
changeset
|
51 |
(atp_failure * string) list -> atp_formula_role -> (Proof.context -> slice_spec * string) -> |
dc5e1b1db9ba
avoid 'eproof' and 'eproof_ram' scripts if possible (i.e. if 'eprover' can produce reasonable enough proofs for one-liner reconstruction)
blanchet
parents:
57636
diff
changeset
|
52 |
string * (unit -> atp_config) |
47606
06dde48a1503
true delayed evaluation of "SPASS_VERSION" environment variable
blanchet
parents:
47506
diff
changeset
|
53 |
val add_atp : string * (unit -> atp_config) -> theory -> theory |
06dde48a1503
true delayed evaluation of "SPASS_VERSION" environment variable
blanchet
parents:
47506
diff
changeset
|
54 |
val get_atp : theory -> string -> (unit -> atp_config) |
41727
ab3f6d76fb23
available_provers ~> supported_provers (for clarity)
blanchet
parents:
41725
diff
changeset
|
55 |
val supported_atps : theory -> string list |
40059
6ad9081665db
use consistent terminology in Sledgehammer: "prover = ATP or SMT solver or ..."
blanchet
parents:
39491
diff
changeset
|
56 |
val is_atp_installed : theory -> string -> bool |
35867 | 57 |
val refresh_systems_on_tptp : unit -> unit |
47055
16e2633f3b4b
made "spass" a "metaprover" that uses either the new SPASS or the old SPASS, to preserve backward compatibility and prepare for the upcoming release
blanchet
parents:
47053
diff
changeset
|
58 |
val effective_term_order : Proof.context -> string -> term_order |
28592 | 59 |
end; |
60 |
||
72400 | 61 |
structure Sledgehammer_ATP_Systems : SLEDGEHAMMER_ATP_SYSTEMS = |
28592 | 62 |
struct |
28596
fcd463a6b6de
tuned interfaces -- plain prover function, without thread;
wenzelm
parents:
28592
diff
changeset
|
63 |
|
42577
78414ec6fa4e
made the format (TFF or FOF) of the TPTP problem a global argument of the problem again and have the ATPs report which formats they support
blanchet
parents:
42571
diff
changeset
|
64 |
open ATP_Problem |
39491
2416666e6f94
refactoring: move ATP proof and error extraction code to "ATP_Proof" module
blanchet
parents:
39375
diff
changeset
|
65 |
open ATP_Proof |
46320 | 66 |
open ATP_Problem_Generate |
32864
a226f29d4bdc
re-organized signature of AtpWrapper structure: records instead of unnamed parameters and return values,
boehmes
parents:
32740
diff
changeset
|
67 |
|
52073
ccb292952774
started adding agsyHOL as an experimental prover
blanchet
parents:
51998
diff
changeset
|
68 |
|
40059
6ad9081665db
use consistent terminology in Sledgehammer: "prover = ATP or SMT solver or ..."
blanchet
parents:
39491
diff
changeset
|
69 |
(* ATP configuration *) |
32864
a226f29d4bdc
re-organized signature of AtpWrapper structure: records instead of unnamed parameters and return values,
boehmes
parents:
32740
diff
changeset
|
70 |
|
47962
137883567114
lower the monomorphization thresholds for less scalable provers
blanchet
parents:
47955
diff
changeset
|
71 |
val default_max_mono_iters = 3 (* FUDGE *) |
53515
f5b678b155f6
adjusted number of generated monomorphic instances for new monomorphizer based on new evaluation (E, SPASS, Vampire)
blanchet
parents:
53225
diff
changeset
|
72 |
val default_max_new_mono_instances = 100 (* FUDGE *) |
47962
137883567114
lower the monomorphization thresholds for less scalable provers
blanchet
parents:
47955
diff
changeset
|
73 |
|
51011 | 74 |
type slice_spec = (int * string) * atp_format * string * string * bool |
46409
d4754183ccce
made option available to users (mostly for experiments)
blanchet
parents:
46407
diff
changeset
|
75 |
|
40059
6ad9081665db
use consistent terminology in Sledgehammer: "prover = ATP or SMT solver or ..."
blanchet
parents:
39491
diff
changeset
|
76 |
type atp_config = |
73374 | 77 |
{exec : string list * string list, |
73432 | 78 |
arguments : Proof.context -> bool -> string -> Time.time -> Path.T -> |
73568
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
79 |
term_order * (unit -> (string * int) list) * (unit -> (string * real) list) -> string list, |
42578
1eaf4d437d4c
define type system in ATP module so that ATPs can suggest a type system
blanchet
parents:
42577
diff
changeset
|
80 |
proof_delims : (string * string) list, |
53586
bd5fa6425993
prefixed types and some functions with "atp_" for disambiguation
blanchet
parents:
53515
diff
changeset
|
81 |
known_failures : (atp_failure * string) list, |
bd5fa6425993
prefixed types and some functions with "atp_" for disambiguation
blanchet
parents:
53515
diff
changeset
|
82 |
prem_role : atp_formula_role, |
48716
1d2a12bb0640
stop distinguishing between complete and incomplete slices, since this is very fragile and has hardly any useful semantics to users
blanchet
parents:
48715
diff
changeset
|
83 |
best_slices : Proof.context -> (real * (slice_spec * string)) list, |
47962
137883567114
lower the monomorphization thresholds for less scalable provers
blanchet
parents:
47955
diff
changeset
|
84 |
best_max_mono_iters : int, |
137883567114
lower the monomorphization thresholds for less scalable provers
blanchet
parents:
47955
diff
changeset
|
85 |
best_max_new_mono_instances : int} |
28596
fcd463a6b6de
tuned interfaces -- plain prover function, without thread;
wenzelm
parents:
28592
diff
changeset
|
86 |
|
72401
2783779b7dd3
removed obsolete unmaintained experimental prover Pirate
blanchet
parents:
72400
diff
changeset
|
87 |
(* "best_slices" must be found empirically, taking a holistic approach since the |
2783779b7dd3
removed obsolete unmaintained experimental prover Pirate
blanchet
parents:
72400
diff
changeset
|
88 |
ATPs are run in parallel. Each slice has the format |
51011 | 89 |
|
90 |
(time_frac, ((max_facts, fact_filter), format, type_enc, |
|
91 |
lam_trans, uncurried_aliases), extra) |
|
92 |
||
93 |
where |
|
94 |
||
95 |
time_frac = faction of the time available given to the slice (which should |
|
96 |
add up to 1.0) |
|
97 |
||
98 |
extra = extra information to the prover (e.g., SOS or no SOS). |
|
42723 | 99 |
|
100 |
The last slice should be the most "normal" one, because it will get all the |
|
43569
b342cd125533
removed "full_types" option from Sledgehammer, now that virtually sound encodings are used as the default anyway
blanchet
parents:
43567
diff
changeset
|
101 |
time available if the other slices fail early and also because it is used if |
b342cd125533
removed "full_types" option from Sledgehammer, now that virtually sound encodings are used as the default anyway
blanchet
parents:
43567
diff
changeset
|
102 |
slicing is disabled (e.g., by the minimizer). *) |
42710
84fcce345b5d
further improved type system setup based on Judgment Days
blanchet
parents:
42709
diff
changeset
|
103 |
|
51011 | 104 |
val mepoN = "mepo" |
105 |
val mashN = "mash" |
|
106 |
val meshN = "mesh" |
|
107 |
||
52073
ccb292952774
started adding agsyHOL as an experimental prover
blanchet
parents:
51998
diff
changeset
|
108 |
val tstp_proof_delims = |
ccb292952774
started adding agsyHOL as an experimental prover
blanchet
parents:
51998
diff
changeset
|
109 |
[("% SZS output start CNFRefutation", "% SZS output end CNFRefutation"), |
ccb292952774
started adding agsyHOL as an experimental prover
blanchet
parents:
51998
diff
changeset
|
110 |
("% SZS output start Refutation", "% SZS output end Refutation"), |
ccb292952774
started adding agsyHOL as an experimental prover
blanchet
parents:
51998
diff
changeset
|
111 |
("% SZS output start Proof", "% SZS output end Proof")] |
ccb292952774
started adding agsyHOL as an experimental prover
blanchet
parents:
51998
diff
changeset
|
112 |
|
45203 | 113 |
fun known_szs_failures wrap = |
114 |
[(Unprovable, wrap "CounterSatisfiable"), |
|
115 |
(Unprovable, wrap "Satisfiable"), |
|
116 |
(GaveUp, wrap "GaveUp"), |
|
117 |
(GaveUp, wrap "Unknown"), |
|
118 |
(GaveUp, wrap "Incomplete"), |
|
119 |
(ProofMissing, wrap "Theorem"), |
|
120 |
(ProofMissing, wrap "Unsatisfiable"), |
|
121 |
(TimedOut, wrap "Timeout"), |
|
122 |
(Inappropriate, wrap "Inappropriate"), |
|
123 |
(OutOfResources, wrap "ResourceOut"), |
|
124 |
(OutOfResources, wrap "MemoryOut"), |
|
125 |
(Interrupted, wrap "Forced"), |
|
126 |
(Interrupted, wrap "User")] |
|
127 |
||
128 |
val known_szs_status_failures = known_szs_failures (prefix "SZS status ") |
|
129 |
val known_says_failures = known_szs_failures (prefix " says ") |
|
130 |
||
38023 | 131 |
structure Data = Theory_Data |
132 |
( |
|
47606
06dde48a1503
true delayed evaluation of "SPASS_VERSION" environment variable
blanchet
parents:
47506
diff
changeset
|
133 |
type T = ((unit -> atp_config) * stamp) Symtab.table |
38023 | 134 |
val empty = Symtab.empty |
135 |
val extend = I |
|
46407
30e9720cc0b9
optimization: slice caching in case two consecutive slices are nearly identical
blanchet
parents:
46402
diff
changeset
|
136 |
fun merge data : T = |
30e9720cc0b9
optimization: slice caching in case two consecutive slices are nearly identical
blanchet
parents:
46402
diff
changeset
|
137 |
Symtab.merge (eq_snd (op =)) data |
63692 | 138 |
handle Symtab.DUP name => error ("Duplicate ATP: " ^ quote name) |
38023 | 139 |
) |
38017
3ad3e3ca2451
move Sledgehammer-specific code out of "Sledgehammer_TPTP_Format"
blanchet
parents:
38015
diff
changeset
|
140 |
|
43981
404ae49ce29f
give E at least two seconds -- anything else risks causing too early timeouts in the minimizer, because of too conservative time computations in E and eproof scripts
blanchet
parents:
43850
diff
changeset
|
141 |
fun to_secs min time = Int.max (min, (Time.toMilliseconds time + 999) div 1000) |
36142
f5e15e9aae10
make Sledgehammer "minimize" output less confusing + round up (not down) time limits to nearest second
blanchet
parents:
36064
diff
changeset
|
142 |
|
43473
fb2713b803e6
deal with ATP time slices in a more flexible/robust fashion
blanchet
parents:
43467
diff
changeset
|
143 |
val sosN = "sos" |
fb2713b803e6
deal with ATP time slices in a more flexible/robust fashion
blanchet
parents:
43467
diff
changeset
|
144 |
val no_sosN = "no_sos" |
fb2713b803e6
deal with ATP time slices in a more flexible/robust fashion
blanchet
parents:
43467
diff
changeset
|
145 |
|
69593 | 146 |
val force_sos = Attrib.setup_config_bool \<^binding>\<open>atp_force_sos\<close> (K false) |
44099 | 147 |
|
47032 | 148 |
val smartN = "smart" |
47073
c73f7b0c7ebc
generate weights and precedences for predicates as well
blanchet
parents:
47055
diff
changeset
|
149 |
(* val kboN = "kbo" *) |
47032 | 150 |
val lpoN = "lpo" |
47034
77da780ddd6b
implement term order attribute (for experiments)
blanchet
parents:
47033
diff
changeset
|
151 |
val xweightsN = "_weights" |
77da780ddd6b
implement term order attribute (for experiments)
blanchet
parents:
47033
diff
changeset
|
152 |
val xprecN = "_prec" |
77da780ddd6b
implement term order attribute (for experiments)
blanchet
parents:
47033
diff
changeset
|
153 |
val xsimpN = "_simp" (* SPASS-specific *) |
47032 | 154 |
|
47038
2409b484e1cc
continued implementation of term ordering attributes
blanchet
parents:
47034
diff
changeset
|
155 |
(* Possible values for "atp_term_order": |
47049 | 156 |
"smart", "(kbo|lpo)(_weights)?(_prec|_simp)?" *) |
47032 | 157 |
val term_order = |
69593 | 158 |
Attrib.setup_config_string \<^binding>\<open>atp_term_order\<close> (K smartN) |
47032 | 159 |
|
52073
ccb292952774
started adding agsyHOL as an experimental prover
blanchet
parents:
51998
diff
changeset
|
160 |
|
ccb292952774
started adding agsyHOL as an experimental prover
blanchet
parents:
51998
diff
changeset
|
161 |
(* agsyHOL *) |
ccb292952774
started adding agsyHOL as an experimental prover
blanchet
parents:
51998
diff
changeset
|
162 |
|
ccb292952774
started adding agsyHOL as an experimental prover
blanchet
parents:
51998
diff
changeset
|
163 |
val agsyhol_config : atp_config = |
73374 | 164 |
{exec = (["AGSYHOL_HOME"], ["agsyHOL"]), |
73432 | 165 |
arguments = fn _ => fn _ => fn _ => fn timeout => fn problem => fn _ => |
73568
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
166 |
["--proof --time-out " ^ string_of_int (to_secs 1 timeout) ^ " " ^ File.bash_path problem], |
52073
ccb292952774
started adding agsyHOL as an experimental prover
blanchet
parents:
51998
diff
changeset
|
167 |
proof_delims = tstp_proof_delims, |
ccb292952774
started adding agsyHOL as an experimental prover
blanchet
parents:
51998
diff
changeset
|
168 |
known_failures = known_szs_status_failures, |
ccb292952774
started adding agsyHOL as an experimental prover
blanchet
parents:
51998
diff
changeset
|
169 |
prem_role = Hypothesis, |
ccb292952774
started adding agsyHOL as an experimental prover
blanchet
parents:
51998
diff
changeset
|
170 |
best_slices = |
ccb292952774
started adding agsyHOL as an experimental prover
blanchet
parents:
51998
diff
changeset
|
171 |
(* FUDGE *) |
72588 | 172 |
K [(1.0, (((60, ""), THF (Without_FOOL, Monomorphic, THF_Without_Choice), "mono_native_higher", keep_lamsN, false), ""))], |
52073
ccb292952774
started adding agsyHOL as an experimental prover
blanchet
parents:
51998
diff
changeset
|
173 |
best_max_mono_iters = default_max_mono_iters - 1 (* FUDGE *), |
53515
f5b678b155f6
adjusted number of generated monomorphic instances for new monomorphizer based on new evaluation (E, SPASS, Vampire)
blanchet
parents:
53225
diff
changeset
|
174 |
best_max_new_mono_instances = default_max_new_mono_instances} |
52073
ccb292952774
started adding agsyHOL as an experimental prover
blanchet
parents:
51998
diff
changeset
|
175 |
|
ccb292952774
started adding agsyHOL as an experimental prover
blanchet
parents:
51998
diff
changeset
|
176 |
val agsyhol = (agsyholN, fn () => agsyhol_config) |
ccb292952774
started adding agsyHOL as an experimental prover
blanchet
parents:
51998
diff
changeset
|
177 |
|
ccb292952774
started adding agsyHOL as an experimental prover
blanchet
parents:
51998
diff
changeset
|
178 |
|
46643
a88bccd2b567
added support for Alt-Ergo through Why3 (mostly for experimental purposes, e.g. polymorphism vs. monomorphization)
blanchet
parents:
46481
diff
changeset
|
179 |
(* Alt-Ergo *) |
a88bccd2b567
added support for Alt-Ergo through Why3 (mostly for experimental purposes, e.g. polymorphism vs. monomorphization)
blanchet
parents:
46481
diff
changeset
|
180 |
|
a88bccd2b567
added support for Alt-Ergo through Why3 (mostly for experimental purposes, e.g. polymorphism vs. monomorphization)
blanchet
parents:
46481
diff
changeset
|
181 |
val alt_ergo_config : atp_config = |
73374 | 182 |
{exec = (["WHY3_HOME"], ["why3"]), |
73432 | 183 |
arguments = fn _ => fn _ => fn _ => fn timeout => fn problem => fn _ => |
73568
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
184 |
["--format tptp --prover 'Alt-Ergo,0.95.2,' --timelimit " ^ string_of_int (to_secs 1 timeout) ^ |
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
185 |
" " ^ File.bash_path problem], |
46643
a88bccd2b567
added support for Alt-Ergo through Why3 (mostly for experimental purposes, e.g. polymorphism vs. monomorphization)
blanchet
parents:
46481
diff
changeset
|
186 |
proof_delims = [], |
a88bccd2b567
added support for Alt-Ergo through Why3 (mostly for experimental purposes, e.g. polymorphism vs. monomorphization)
blanchet
parents:
46481
diff
changeset
|
187 |
known_failures = |
a88bccd2b567
added support for Alt-Ergo through Why3 (mostly for experimental purposes, e.g. polymorphism vs. monomorphization)
blanchet
parents:
46481
diff
changeset
|
188 |
[(ProofMissing, ": Valid"), |
a88bccd2b567
added support for Alt-Ergo through Why3 (mostly for experimental purposes, e.g. polymorphism vs. monomorphization)
blanchet
parents:
46481
diff
changeset
|
189 |
(TimedOut, ": Timeout"), |
a88bccd2b567
added support for Alt-Ergo through Why3 (mostly for experimental purposes, e.g. polymorphism vs. monomorphization)
blanchet
parents:
46481
diff
changeset
|
190 |
(GaveUp, ": Unknown")], |
47976 | 191 |
prem_role = Hypothesis, |
46643
a88bccd2b567
added support for Alt-Ergo through Why3 (mostly for experimental purposes, e.g. polymorphism vs. monomorphization)
blanchet
parents:
46481
diff
changeset
|
192 |
best_slices = fn _ => |
a88bccd2b567
added support for Alt-Ergo through Why3 (mostly for experimental purposes, e.g. polymorphism vs. monomorphization)
blanchet
parents:
46481
diff
changeset
|
193 |
(* FUDGE *) |
72588 | 194 |
[(1.0, (((100, ""), TFF (Without_FOOL, Polymorphic), "poly_native", liftingN, false), ""))], |
47962
137883567114
lower the monomorphization thresholds for less scalable provers
blanchet
parents:
47955
diff
changeset
|
195 |
best_max_mono_iters = default_max_mono_iters, |
137883567114
lower the monomorphization thresholds for less scalable provers
blanchet
parents:
47955
diff
changeset
|
196 |
best_max_new_mono_instances = default_max_new_mono_instances} |
46643
a88bccd2b567
added support for Alt-Ergo through Why3 (mostly for experimental purposes, e.g. polymorphism vs. monomorphization)
blanchet
parents:
46481
diff
changeset
|
197 |
|
47646 | 198 |
val alt_ergo = (alt_ergoN, fn () => alt_ergo_config) |
46643
a88bccd2b567
added support for Alt-Ergo through Why3 (mostly for experimental purposes, e.g. polymorphism vs. monomorphization)
blanchet
parents:
46481
diff
changeset
|
199 |
|
a88bccd2b567
added support for Alt-Ergo through Why3 (mostly for experimental purposes, e.g. polymorphism vs. monomorphization)
blanchet
parents:
46481
diff
changeset
|
200 |
|
40059
6ad9081665db
use consistent terminology in Sledgehammer: "prover = ATP or SMT solver or ..."
blanchet
parents:
39491
diff
changeset
|
201 |
(* E *) |
28596
fcd463a6b6de
tuned interfaces -- plain prover function, without thread;
wenzelm
parents:
28592
diff
changeset
|
202 |
|
43473
fb2713b803e6
deal with ATP time slices in a more flexible/robust fashion
blanchet
parents:
43467
diff
changeset
|
203 |
val e_smartN = "smart" |
42646
4781fcd53572
replaced some Unsynchronized.refs with Config.Ts
blanchet
parents:
42643
diff
changeset
|
204 |
val e_autoN = "auto" |
4781fcd53572
replaced some Unsynchronized.refs with Config.Ts
blanchet
parents:
42643
diff
changeset
|
205 |
val e_fun_weightN = "fun_weight" |
4781fcd53572
replaced some Unsynchronized.refs with Config.Ts
blanchet
parents:
42643
diff
changeset
|
206 |
val e_sym_offset_weightN = "sym_offset_weight" |
41725
7cca2de89296
added support for bleeding-edge E weighting function "SymOffsetsWeight"
blanchet
parents:
41335
diff
changeset
|
207 |
|
47032 | 208 |
val e_selection_heuristic = |
69593 | 209 |
Attrib.setup_config_string \<^binding>\<open>atp_e_selection_heuristic\<close> (K e_smartN) |
41770 | 210 |
(* FUDGE *) |
42646
4781fcd53572
replaced some Unsynchronized.refs with Config.Ts
blanchet
parents:
42643
diff
changeset
|
211 |
val e_default_fun_weight = |
69593 | 212 |
Attrib.setup_config_real \<^binding>\<open>atp_e_default_fun_weight\<close> (K 20.0) |
42646
4781fcd53572
replaced some Unsynchronized.refs with Config.Ts
blanchet
parents:
42643
diff
changeset
|
213 |
val e_fun_weight_base = |
69593 | 214 |
Attrib.setup_config_real \<^binding>\<open>atp_e_fun_weight_base\<close> (K 0.0) |
42646
4781fcd53572
replaced some Unsynchronized.refs with Config.Ts
blanchet
parents:
42643
diff
changeset
|
215 |
val e_fun_weight_span = |
69593 | 216 |
Attrib.setup_config_real \<^binding>\<open>atp_e_fun_weight_span\<close> (K 40.0) |
42646
4781fcd53572
replaced some Unsynchronized.refs with Config.Ts
blanchet
parents:
42643
diff
changeset
|
217 |
val e_default_sym_offs_weight = |
69593 | 218 |
Attrib.setup_config_real \<^binding>\<open>atp_e_default_sym_offs_weight\<close> (K 1.0) |
42646
4781fcd53572
replaced some Unsynchronized.refs with Config.Ts
blanchet
parents:
42643
diff
changeset
|
219 |
val e_sym_offs_weight_base = |
69593 | 220 |
Attrib.setup_config_real \<^binding>\<open>atp_e_sym_offs_weight_base\<close> (K ~20.0) |
42646
4781fcd53572
replaced some Unsynchronized.refs with Config.Ts
blanchet
parents:
42643
diff
changeset
|
221 |
val e_sym_offs_weight_span = |
69593 | 222 |
Attrib.setup_config_real \<^binding>\<open>atp_e_sym_offs_weight_span\<close> (K 60.0) |
41725
7cca2de89296
added support for bleeding-edge E weighting function "SymOffsetsWeight"
blanchet
parents:
41335
diff
changeset
|
223 |
|
47038
2409b484e1cc
continued implementation of term ordering attributes
blanchet
parents:
47034
diff
changeset
|
224 |
fun e_selection_heuristic_case heuristic fw sow = |
2409b484e1cc
continued implementation of term ordering attributes
blanchet
parents:
47034
diff
changeset
|
225 |
if heuristic = e_fun_weightN then fw |
2409b484e1cc
continued implementation of term ordering attributes
blanchet
parents:
47034
diff
changeset
|
226 |
else if heuristic = e_sym_offset_weightN then sow |
2409b484e1cc
continued implementation of term ordering attributes
blanchet
parents:
47034
diff
changeset
|
227 |
else raise Fail ("unexpected " ^ quote heuristic) |
41725
7cca2de89296
added support for bleeding-edge E weighting function "SymOffsetsWeight"
blanchet
parents:
41335
diff
changeset
|
228 |
|
47038
2409b484e1cc
continued implementation of term ordering attributes
blanchet
parents:
47034
diff
changeset
|
229 |
fun scaled_e_selection_weight ctxt heuristic w = |
2409b484e1cc
continued implementation of term ordering attributes
blanchet
parents:
47034
diff
changeset
|
230 |
w * Config.get ctxt (e_selection_heuristic_case heuristic |
47029 | 231 |
e_fun_weight_span e_sym_offs_weight_span) |
47038
2409b484e1cc
continued implementation of term ordering attributes
blanchet
parents:
47034
diff
changeset
|
232 |
+ Config.get ctxt (e_selection_heuristic_case heuristic |
47029 | 233 |
e_fun_weight_base e_sym_offs_weight_base) |
41725
7cca2de89296
added support for bleeding-edge E weighting function "SymOffsetsWeight"
blanchet
parents:
41335
diff
changeset
|
234 |
|> Real.ceil |> signed_string_of_int |
41313
a96ac4d180b7
optionally supply constant weights to E -- turned off by default until properly parameterized
blanchet
parents:
41269
diff
changeset
|
235 |
|
47038
2409b484e1cc
continued implementation of term ordering attributes
blanchet
parents:
47034
diff
changeset
|
236 |
fun e_selection_weight_arguments ctxt heuristic sel_weights = |
51631 | 237 |
if heuristic = e_fun_weightN orelse heuristic = e_sym_offset_weightN then |
43622 | 238 |
(* supplied by Stephan Schulz *) |
41314
2dc1dfc1bc69
use the options provided by Stephan Schulz -- much better
blanchet
parents:
41313
diff
changeset
|
239 |
"--split-clauses=4 --split-reuse-defs --simul-paramod --forward-context-sr \ |
2dc1dfc1bc69
use the options provided by Stephan Schulz -- much better
blanchet
parents:
41313
diff
changeset
|
240 |
\--destructive-er-aggressive --destructive-er --presat-simplify \ |
47505
e33d957ae2bf
avoid option introduced in E 1.2 when invoking older versions of E
blanchet
parents:
47499
diff
changeset
|
241 |
\--prefer-initial-clauses -winvfreqrank -c1 -Ginvfreqconjmax -F1 \ |
e33d957ae2bf
avoid option introduced in E 1.2 when invoking older versions of E
blanchet
parents:
47499
diff
changeset
|
242 |
\--delete-bad-limit=150000000 -WSelectMaxLComplexAvoidPosPred -H'(4*" ^ |
47038
2409b484e1cc
continued implementation of term ordering attributes
blanchet
parents:
47034
diff
changeset
|
243 |
e_selection_heuristic_case heuristic "FunWeight" "SymOffsetWeight" ^ |
48376
416e4123baf3
use "eproof_ram" script if available (plug-in replacement for "eproof", but faster)
blanchet
parents:
48232
diff
changeset
|
244 |
"(SimulateSOS," ^ |
47038
2409b484e1cc
continued implementation of term ordering attributes
blanchet
parents:
47034
diff
changeset
|
245 |
(e_selection_heuristic_case heuristic |
47029 | 246 |
e_default_fun_weight e_default_sym_offs_weight |
42646
4781fcd53572
replaced some Unsynchronized.refs with Config.Ts
blanchet
parents:
42643
diff
changeset
|
247 |
|> Config.get ctxt |> Real.ceil |> signed_string_of_int) ^ |
41314
2dc1dfc1bc69
use the options provided by Stephan Schulz -- much better
blanchet
parents:
41313
diff
changeset
|
248 |
",20,1.5,1.5,1" ^ |
47030 | 249 |
(sel_weights () |
47029 | 250 |
|> map (fn (s, w) => "," ^ s ^ ":" ^ |
47038
2409b484e1cc
continued implementation of term ordering attributes
blanchet
parents:
47034
diff
changeset
|
251 |
scaled_e_selection_weight ctxt heuristic w) |
42646
4781fcd53572
replaced some Unsynchronized.refs with Config.Ts
blanchet
parents:
42643
diff
changeset
|
252 |
|> implode) ^ |
41314
2dc1dfc1bc69
use the options provided by Stephan Schulz -- much better
blanchet
parents:
41313
diff
changeset
|
253 |
"),3*ConjectureGeneralSymbolWeight(PreferNonGoals,200,100,200,50,50,1,100,\ |
2dc1dfc1bc69
use the options provided by Stephan Schulz -- much better
blanchet
parents:
41313
diff
changeset
|
254 |
\1.5,1.5,1),1*Clauseweight(PreferProcessed,1,1,1),1*\ |
57672 | 255 |
\FIFOWeight(PreferProcessed))' " |
51631 | 256 |
else |
57672 | 257 |
"-xAuto " |
41313
a96ac4d180b7
optionally supply constant weights to E -- turned off by default until properly parameterized
blanchet
parents:
41269
diff
changeset
|
258 |
|
70939
3218999b3715
folded experimental Ehoh into E now that E 2.3 has been released
blanchet
parents:
70938
diff
changeset
|
259 |
val e_ord_weights = map (fn (s, w) => s ^ ":" ^ string_of_int w) #> space_implode "," |
47038
2409b484e1cc
continued implementation of term ordering attributes
blanchet
parents:
47034
diff
changeset
|
260 |
fun e_ord_precedence [_] = "" |
2409b484e1cc
continued implementation of term ordering attributes
blanchet
parents:
47034
diff
changeset
|
261 |
| e_ord_precedence info = info |> map fst |> space_implode "<" |
2409b484e1cc
continued implementation of term ordering attributes
blanchet
parents:
47034
diff
changeset
|
262 |
|
47039
1b36a05a070d
added "metis_advisory_simp" option to orient as many equations as possible in Metis the right way (cf. "More SPASS with Isabelle")
blanchet
parents:
47038
diff
changeset
|
263 |
fun e_term_order_info_arguments false false _ = "" |
1b36a05a070d
added "metis_advisory_simp" option to orient as many equations as possible in Metis the right way (cf. "More SPASS with Isabelle")
blanchet
parents:
47038
diff
changeset
|
264 |
| e_term_order_info_arguments gen_weights gen_prec ord_info = |
47038
2409b484e1cc
continued implementation of term ordering attributes
blanchet
parents:
47034
diff
changeset
|
265 |
let val ord_info = ord_info () in |
57672 | 266 |
(if gen_weights then "--order-weights='" ^ e_ord_weights ord_info ^ "' " else "") ^ |
267 |
(if gen_prec then "--precedence='" ^ e_ord_precedence ord_info ^ "' " else "") |
|
47038
2409b484e1cc
continued implementation of term ordering attributes
blanchet
parents:
47034
diff
changeset
|
268 |
end |
2409b484e1cc
continued implementation of term ordering attributes
blanchet
parents:
47034
diff
changeset
|
269 |
|
40059
6ad9081665db
use consistent terminology in Sledgehammer: "prover = ATP or SMT solver or ..."
blanchet
parents:
39491
diff
changeset
|
270 |
val e_config : atp_config = |
73973 | 271 |
{exec = (["E_HOME"], ["eprover-ho", "eprover"]), |
73432 | 272 |
arguments = fn ctxt => fn _ => fn heuristic => fn timeout => fn problem => |
57008 | 273 |
fn ({is_lpo, gen_weights, gen_prec, ...}, ord_info, sel_weights) => |
73568
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
274 |
["--auto-schedule --tstp-in --tstp-out --silent " ^ |
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
275 |
e_selection_weight_arguments ctxt heuristic sel_weights ^ |
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
276 |
e_term_order_info_arguments gen_weights gen_prec ord_info ^ |
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
277 |
"--term-ordering=" ^ (if is_lpo then "LPO4" else "KBO6") ^ " " ^ |
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
278 |
"--cpu-limit=" ^ string_of_int (to_secs 2 timeout) ^ |
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
279 |
" --proof-object=1 " ^ |
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
280 |
File.bash_path problem], |
52073
ccb292952774
started adding agsyHOL as an experimental prover
blanchet
parents:
51998
diff
changeset
|
281 |
proof_delims = |
ccb292952774
started adding agsyHOL as an experimental prover
blanchet
parents:
51998
diff
changeset
|
282 |
[("# SZS output start CNFRefutation", "# SZS output end CNFRefutation")] @ |
ccb292952774
started adding agsyHOL as an experimental prover
blanchet
parents:
51998
diff
changeset
|
283 |
tstp_proof_delims, |
36265
41c9e755e552
distinguish between the different ATP errors in the user interface;
blanchet
parents:
36264
diff
changeset
|
284 |
known_failures = |
45203 | 285 |
[(TimedOut, "Failure: Resource limit exceeded (time)"), |
47972 | 286 |
(TimedOut, "time limit exceeded")] @ |
287 |
known_szs_status_failures, |
|
47976 | 288 |
prem_role = Conjecture, |
42646
4781fcd53572
replaced some Unsynchronized.refs with Config.Ts
blanchet
parents:
42643
diff
changeset
|
289 |
best_slices = fn ctxt => |
70939
3218999b3715
folded experimental Ehoh into E now that E 2.3 has been released
blanchet
parents:
70938
diff
changeset
|
290 |
let |
3218999b3715
folded experimental Ehoh into E now that E 2.3 has been released
blanchet
parents:
70938
diff
changeset
|
291 |
val heuristic = Config.get ctxt e_selection_heuristic |
3218999b3715
folded experimental Ehoh into E now that E 2.3 has been released
blanchet
parents:
70938
diff
changeset
|
292 |
val (format, enc) = |
73974 | 293 |
if string_ord (getenv "E_VERSION", "2.7") <> LESS then |
73970
34c8cf767fa3
adjusted E setup to avoid generating FOOL with 2.5 (where 'ite' is missing)
blanchet
parents:
73859
diff
changeset
|
294 |
(THF (With_FOOL, Monomorphic, THF_Without_Choice), "mono_native_higher_fool") |
73974 | 295 |
else if string_ord (getenv "E_VERSION", "2.6") <> LESS then |
296 |
(THF (Without_FOOL, Monomorphic, THF_Without_Choice), "mono_native_higher") |
|
72588 | 297 |
else |
73974 | 298 |
(THF (Without_FOOL, Monomorphic, THF_Lambda_Free), "mono_native_higher") |
70939
3218999b3715
folded experimental Ehoh into E now that E 2.3 has been released
blanchet
parents:
70938
diff
changeset
|
299 |
in |
43474
423cd1ecf714
optimized E's time slicing, based on latest exhaustive Judgment Day results
blanchet
parents:
43473
diff
changeset
|
300 |
(* FUDGE *) |
47038
2409b484e1cc
continued implementation of term ordering attributes
blanchet
parents:
47034
diff
changeset
|
301 |
if heuristic = e_smartN then |
70939
3218999b3715
folded experimental Ehoh into E now that E 2.3 has been released
blanchet
parents:
70938
diff
changeset
|
302 |
[(0.15, (((128, meshN), format, enc, combsN, false), e_fun_weightN)), |
3218999b3715
folded experimental Ehoh into E now that E 2.3 has been released
blanchet
parents:
70938
diff
changeset
|
303 |
(0.15, (((128, mashN), format, enc, combsN, false), e_sym_offset_weightN)), |
3218999b3715
folded experimental Ehoh into E now that E 2.3 has been released
blanchet
parents:
70938
diff
changeset
|
304 |
(0.15, (((91, mepoN), format, enc, combsN, false), e_autoN)), |
3218999b3715
folded experimental Ehoh into E now that E 2.3 has been released
blanchet
parents:
70938
diff
changeset
|
305 |
(0.15, (((1000, meshN), format, "poly_guards??", combsN, false), e_sym_offset_weightN)), |
3218999b3715
folded experimental Ehoh into E now that E 2.3 has been released
blanchet
parents:
70938
diff
changeset
|
306 |
(0.15, (((256, mepoN), format, enc, liftingN, false), e_fun_weightN)), |
3218999b3715
folded experimental Ehoh into E now that E 2.3 has been released
blanchet
parents:
70938
diff
changeset
|
307 |
(0.25, (((64, mashN), format, enc, combsN, false), e_fun_weightN))] |
43473
fb2713b803e6
deal with ATP time slices in a more flexible/robust fashion
blanchet
parents:
43467
diff
changeset
|
308 |
else |
70939
3218999b3715
folded experimental Ehoh into E now that E 2.3 has been released
blanchet
parents:
70938
diff
changeset
|
309 |
[(1.0, (((500, ""), format, enc, combsN, false), heuristic))] |
47962
137883567114
lower the monomorphization thresholds for less scalable provers
blanchet
parents:
47955
diff
changeset
|
310 |
end, |
137883567114
lower the monomorphization thresholds for less scalable provers
blanchet
parents:
47955
diff
changeset
|
311 |
best_max_mono_iters = default_max_mono_iters, |
137883567114
lower the monomorphization thresholds for less scalable provers
blanchet
parents:
47955
diff
changeset
|
312 |
best_max_new_mono_instances = default_max_new_mono_instances} |
38454
9043eefe8d71
detect old Vampire and give a nicer error message
blanchet
parents:
38433
diff
changeset
|
313 |
|
47646 | 314 |
val e = (eN, fn () => e_config) |
28596
fcd463a6b6de
tuned interfaces -- plain prover function, without thread;
wenzelm
parents:
28592
diff
changeset
|
315 |
|
fcd463a6b6de
tuned interfaces -- plain prover function, without thread;
wenzelm
parents:
28592
diff
changeset
|
316 |
|
48700 | 317 |
(* iProver *) |
318 |
||
319 |
val iprover_config : atp_config = |
|
73374 | 320 |
{exec = (["IPROVER_HOME"], ["iproveropt", "iprover"]), |
73432 | 321 |
arguments = fn _ => fn _ => fn _ => fn timeout => fn problem => fn _ => |
74046
462d652ad910
use Vampire's clausifier with iProver, now that E's is no longer supported
blanchet
parents:
74005
diff
changeset
|
322 |
["--clausifier \"$VAMPIRE_HOME\"/vampire " ^ |
462d652ad910
use Vampire's clausifier with iProver, now that E's is no longer supported
blanchet
parents:
74005
diff
changeset
|
323 |
"--clausifier_options \"--mode clausify\" " ^ |
73568
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
324 |
"--time_out_real " ^ string_of_real (Time.toReal timeout) ^ " " ^ File.bash_path problem], |
48700 | 325 |
proof_delims = tstp_proof_delims, |
326 |
known_failures = |
|
327 |
[(ProofIncomplete, "% SZS output start CNFRefutation")] @ |
|
328 |
known_szs_status_failures, |
|
329 |
prem_role = Hypothesis, |
|
330 |
best_slices = |
|
331 |
(* FUDGE *) |
|
51011 | 332 |
K [(1.0, (((150, ""), FOF, "mono_guards??", liftingN, false), ""))], |
48700 | 333 |
best_max_mono_iters = default_max_mono_iters, |
334 |
best_max_new_mono_instances = default_max_new_mono_instances} |
|
335 |
||
336 |
val iprover = (iproverN, fn () => iprover_config) |
|
337 |
||
338 |
||
44099 | 339 |
(* LEO-II *) |
340 |
||
341 |
val leo2_config : atp_config = |
|
73374 | 342 |
{exec = (["LEO2_HOME"], ["leo.opt", "leo"]), |
73432 | 343 |
arguments = fn _ => fn full_proofs => fn _ => fn timeout => fn problem => fn _ => |
73568
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
344 |
["--foatp e --atp e=\"$E_HOME\"/eprover \ |
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
345 |
\--atp epclextract=\"$E_HOME\"/epclextract \ |
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
346 |
\--proofoutput 1 --timeout " ^ string_of_int (to_secs 1 timeout) ^ " " ^ |
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
347 |
(if full_proofs then "--notReplLeibnizEQ --notReplAndrewsEQ --notUseExtCnfCmbd " else "") ^ |
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
348 |
File.bash_path problem], |
44099 | 349 |
proof_delims = tstp_proof_delims, |
45207 | 350 |
known_failures = |
47974
08d2dcc2dab9
improved LEO-II definition handling -- still hoping for a fix directly in LEO-II
blanchet
parents:
47972
diff
changeset
|
351 |
[(TimedOut, "CPU time limit exceeded, terminating"), |
47972 | 352 |
(GaveUp, "No.of.Axioms")] @ |
353 |
known_szs_status_failures, |
|
47976 | 354 |
prem_role = Hypothesis, |
47914
94f37848b7c9
LEO-II's "--sos" option confusingly disables rather than enables SOS, and SOS seems to be ignored anyway; also, pass a number of facts that's more appropriate for each prover
blanchet
parents:
47912
diff
changeset
|
355 |
best_slices = |
44099 | 356 |
(* FUDGE *) |
72588 | 357 |
K [(1.0, (((40, ""), THF (Without_FOOL, Monomorphic, THF_Without_Choice), "mono_native_higher", keep_lamsN, false), ""))], |
47962
137883567114
lower the monomorphization thresholds for less scalable provers
blanchet
parents:
47955
diff
changeset
|
358 |
best_max_mono_iters = default_max_mono_iters - 1 (* FUDGE *), |
53515
f5b678b155f6
adjusted number of generated monomorphic instances for new monomorphizer based on new evaluation (E, SPASS, Vampire)
blanchet
parents:
53225
diff
changeset
|
359 |
best_max_new_mono_instances = default_max_new_mono_instances} |
39491
2416666e6f94
refactoring: move ATP proof and error extraction code to "ATP_Proof" module
blanchet
parents:
39375
diff
changeset
|
360 |
|
47646 | 361 |
val leo2 = (leo2N, fn () => leo2_config) |
44099 | 362 |
|
363 |
||
67021
41f1f8c4259b
integrated Leo-III in Sledgehammer (thanks to Alexander Steen for the patch)
blanchet
parents:
66544
diff
changeset
|
364 |
(* Leo-III *) |
41f1f8c4259b
integrated Leo-III in Sledgehammer (thanks to Alexander Steen for the patch)
blanchet
parents:
66544
diff
changeset
|
365 |
|
69717 | 366 |
(* Include choice? Disabled now since it's disabled for Satallax as well. *) |
67021
41f1f8c4259b
integrated Leo-III in Sledgehammer (thanks to Alexander Steen for the patch)
blanchet
parents:
66544
diff
changeset
|
367 |
val leo3_config : atp_config = |
73374 | 368 |
{exec = (["LEO3_HOME"], ["leo3"]), |
73432 | 369 |
arguments = fn _ => fn full_proofs => fn _ => fn timeout => fn problem => fn _ => |
73568
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
370 |
[File.bash_path problem ^ " " ^ "--atp cvc=\"$CVC4_SOLVER\" --atp e=\"$E_HOME\"/eprover \ |
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
371 |
\-p -t " ^ string_of_int (to_secs 1 timeout) ^ " " ^ |
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
372 |
(if full_proofs then "--nleq --naeq " else "")], |
67021
41f1f8c4259b
integrated Leo-III in Sledgehammer (thanks to Alexander Steen for the patch)
blanchet
parents:
66544
diff
changeset
|
373 |
proof_delims = tstp_proof_delims, |
41f1f8c4259b
integrated Leo-III in Sledgehammer (thanks to Alexander Steen for the patch)
blanchet
parents:
66544
diff
changeset
|
374 |
known_failures = known_szs_status_failures, |
41f1f8c4259b
integrated Leo-III in Sledgehammer (thanks to Alexander Steen for the patch)
blanchet
parents:
66544
diff
changeset
|
375 |
prem_role = Hypothesis, |
41f1f8c4259b
integrated Leo-III in Sledgehammer (thanks to Alexander Steen for the patch)
blanchet
parents:
66544
diff
changeset
|
376 |
best_slices = |
41f1f8c4259b
integrated Leo-III in Sledgehammer (thanks to Alexander Steen for the patch)
blanchet
parents:
66544
diff
changeset
|
377 |
(* FUDGE *) |
72588 | 378 |
K [(1.0, (((150, ""), THF (Without_FOOL, Polymorphic, THF_Without_Choice), "mono_native_higher", keep_lamsN, false), ""))], |
67021
41f1f8c4259b
integrated Leo-III in Sledgehammer (thanks to Alexander Steen for the patch)
blanchet
parents:
66544
diff
changeset
|
379 |
best_max_mono_iters = default_max_mono_iters - 1 (* FUDGE *), |
41f1f8c4259b
integrated Leo-III in Sledgehammer (thanks to Alexander Steen for the patch)
blanchet
parents:
66544
diff
changeset
|
380 |
best_max_new_mono_instances = default_max_new_mono_instances} |
41f1f8c4259b
integrated Leo-III in Sledgehammer (thanks to Alexander Steen for the patch)
blanchet
parents:
66544
diff
changeset
|
381 |
|
41f1f8c4259b
integrated Leo-III in Sledgehammer (thanks to Alexander Steen for the patch)
blanchet
parents:
66544
diff
changeset
|
382 |
val leo3 = (leo3N, fn () => leo3_config) |
41f1f8c4259b
integrated Leo-III in Sledgehammer (thanks to Alexander Steen for the patch)
blanchet
parents:
66544
diff
changeset
|
383 |
|
41f1f8c4259b
integrated Leo-III in Sledgehammer (thanks to Alexander Steen for the patch)
blanchet
parents:
66544
diff
changeset
|
384 |
|
44099 | 385 |
(* Satallax *) |
386 |
||
52097 | 387 |
(* Choice is disabled until there is proper reconstruction for it. *) |
44099 | 388 |
val satallax_config : atp_config = |
73374 | 389 |
{exec = (["SATALLAX_HOME"], ["satallax.opt", "satallax"]), |
73432 | 390 |
arguments = fn _ => fn _ => fn _ => fn timeout => fn problem => fn _ => |
73568
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
391 |
[(case getenv "E_HOME" of |
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
392 |
"" => "" |
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
393 |
| home => "-E " ^ home ^ "/eprover ") ^ |
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
394 |
"-p tstp -t " ^ string_of_int (to_secs 1 timeout) ^ " " ^ File.bash_path problem], |
45162 | 395 |
proof_delims = |
57707
0242e9578828
imported patch satallax_proof_support_Sledgehammer
fleury
parents:
57672
diff
changeset
|
396 |
[("% SZS output start Proof", "% SZS output end Proof")], |
45203 | 397 |
known_failures = known_szs_status_failures, |
47981 | 398 |
prem_role = Hypothesis, |
44416
cabd06b69c18
added formats to the slice and use TFF for remote Vampire
blanchet
parents:
44391
diff
changeset
|
399 |
best_slices = |
44754 | 400 |
(* FUDGE *) |
72588 | 401 |
K [(1.0, (((150, ""), THF (Without_FOOL, Monomorphic, THF_Without_Choice), "mono_native_higher", keep_lamsN, false), ""))], |
47962
137883567114
lower the monomorphization thresholds for less scalable provers
blanchet
parents:
47955
diff
changeset
|
402 |
best_max_mono_iters = default_max_mono_iters - 1 (* FUDGE *), |
53515
f5b678b155f6
adjusted number of generated monomorphic instances for new monomorphizer based on new evaluation (E, SPASS, Vampire)
blanchet
parents:
53225
diff
changeset
|
403 |
best_max_new_mono_instances = default_max_new_mono_instances} |
44099 | 404 |
|
47646 | 405 |
val satallax = (satallaxN, fn () => satallax_config) |
44099 | 406 |
|
407 |
||
408 |
(* SPASS *) |
|
42725
64dea91bbe0e
added "force_sos" options to control SPASS's and Vampire's use of SOS in experiments + added corresponding Mirabelle options
blanchet
parents:
42723
diff
changeset
|
409 |
|
48005
eeede26f2721
killed SPASS 3.5/3.7 FLOTTER hack -- requires users to upgrade to SPASS 3.8
blanchet
parents:
48004
diff
changeset
|
410 |
val spass_H1SOS = "-Heuristic=1 -SOS" |
50333
20c69b00e73c
tweak SPASS default a tiny bit, so that a more interesting heuristic is chosen when "slicing=false" (for experiments)
blanchet
parents:
49991
diff
changeset
|
411 |
val spass_H2 = "-Heuristic=2" |
48005
eeede26f2721
killed SPASS 3.5/3.7 FLOTTER hack -- requires users to upgrade to SPASS 3.8
blanchet
parents:
48004
diff
changeset
|
412 |
val spass_H2LR0LT0 = "-Heuristic=2 -LR=0 -LT=0" |
eeede26f2721
killed SPASS 3.5/3.7 FLOTTER hack -- requires users to upgrade to SPASS 3.8
blanchet
parents:
48004
diff
changeset
|
413 |
val spass_H2NuVS0 = "-Heuristic=2 -RNuV=1 -Sorts=0" |
eeede26f2721
killed SPASS 3.5/3.7 FLOTTER hack -- requires users to upgrade to SPASS 3.8
blanchet
parents:
48004
diff
changeset
|
414 |
val spass_H2NuVS0Red2 = "-Heuristic=2 -RNuV=1 -Sorts=0 -RFRew=2 -RBRew=2 -RTaut=2" |
50333
20c69b00e73c
tweak SPASS default a tiny bit, so that a more interesting heuristic is chosen when "slicing=false" (for experiments)
blanchet
parents:
49991
diff
changeset
|
415 |
val spass_H2SOS = "-Heuristic=2 -SOS" |
47055
16e2633f3b4b
made "spass" a "metaprover" that uses either the new SPASS or the old SPASS, to preserve backward compatibility and prepare for the upcoming release
blanchet
parents:
47053
diff
changeset
|
416 |
|
48005
eeede26f2721
killed SPASS 3.5/3.7 FLOTTER hack -- requires users to upgrade to SPASS 3.8
blanchet
parents:
48004
diff
changeset
|
417 |
val spass_config : atp_config = |
73375 | 418 |
let |
419 |
val format = DFG Monomorphic |
|
420 |
in |
|
421 |
{exec = (["SPASS_HOME"], ["SPASS"]), |
|
73432 | 422 |
arguments = fn _ => fn full_proofs => fn extra_options => fn timeout => fn problem => fn _ => |
73568
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
423 |
["-Isabelle=1 " ^ (if full_proofs then "-CNFRenaming=0 -Splits=0 " else "") ^ |
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
424 |
"-TimeLimit=" ^ string_of_int (to_secs 1 timeout) ^ " " ^ File.bash_path problem |
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
425 |
|> extra_options <> "" ? prefix (extra_options ^ " ")], |
73375 | 426 |
proof_delims = [("Here is a proof", "Formulae used in the proof")], |
427 |
known_failures = |
|
428 |
[(GaveUp, "SPASS beiseite: Completion found"), |
|
429 |
(TimedOut, "SPASS beiseite: Ran out of time"), |
|
430 |
(OutOfResources, "SPASS beiseite: Maximal number of loops exceeded"), |
|
431 |
(MalformedInput, "Undefined symbol"), |
|
432 |
(MalformedInput, "Free Variable"), |
|
433 |
(Unprovable, "No formulae and clauses found in input file"), |
|
73436
e92f2e44e4d8
removed spurious references to perl / libwww-perl;
wenzelm
parents:
73435
diff
changeset
|
434 |
(InternalError, "Please report this error")], |
73375 | 435 |
prem_role = Conjecture, |
436 |
best_slices = fn _ => |
|
437 |
(* FUDGE *) |
|
438 |
[(0.1667, (((150, meshN), format, "mono_native", combsN, true), "")), |
|
439 |
(0.1667, (((500, meshN), format, "mono_native", liftingN, true), spass_H2SOS)), |
|
440 |
(0.1666, (((50, meshN), format, "mono_native", liftingN, true), spass_H2LR0LT0)), |
|
441 |
(0.1000, (((250, meshN), format, "mono_native", combsN, true), spass_H2NuVS0)), |
|
442 |
(0.1000, (((1000, mepoN), format, "mono_native", liftingN, true), spass_H1SOS)), |
|
443 |
(0.1000, (((150, meshN), format, "poly_guards??", liftingN, false), spass_H2NuVS0Red2)), |
|
444 |
(0.1000, (((300, meshN), format, "mono_native", combsN, true), spass_H2SOS)), |
|
445 |
(0.1000, (((100, meshN), format, "mono_native", combs_and_liftingN, true), spass_H2))], |
|
446 |
best_max_mono_iters = default_max_mono_iters, |
|
447 |
best_max_new_mono_instances = default_max_new_mono_instances} |
|
448 |
end |
|
38454
9043eefe8d71
detect old Vampire and give a nicer error message
blanchet
parents:
38433
diff
changeset
|
449 |
|
48005
eeede26f2721
killed SPASS 3.5/3.7 FLOTTER hack -- requires users to upgrade to SPASS 3.8
blanchet
parents:
48004
diff
changeset
|
450 |
val spass = (spassN, fn () => spass_config) |
38454
9043eefe8d71
detect old Vampire and give a nicer error message
blanchet
parents:
38433
diff
changeset
|
451 |
|
52073
ccb292952774
started adding agsyHOL as an experimental prover
blanchet
parents:
51998
diff
changeset
|
452 |
|
37509
f39464d971c4
factor out TPTP format output into file of its own, to facilitate further changes
blanchet
parents:
37506
diff
changeset
|
453 |
(* Vampire *) |
f39464d971c4
factor out TPTP format output into file of its own, to facilitate further changes
blanchet
parents:
37506
diff
changeset
|
454 |
|
68563 | 455 |
fun is_vampire_noncommercial_license_accepted () = |
456 |
let |
|
69593 | 457 |
val flag = Options.default_string \<^system_option>\<open>vampire_noncommercial\<close> |
68563 | 458 |
|> String.map Char.toLower |
459 |
in |
|
460 |
if flag = "yes" then |
|
461 |
SOME true |
|
462 |
else if flag = "no" then |
|
463 |
SOME false |
|
464 |
else |
|
465 |
NONE |
|
466 |
end |
|
467 |
||
468 |
fun check_vampire_noncommercial () = |
|
469 |
(case is_vampire_noncommercial_license_accepted () of |
|
470 |
SOME true => () |
|
471 |
| SOME false => |
|
472 |
error (Pretty.string_of (Pretty.para |
|
473 |
"The Vampire prover may be used only for noncommercial applications")) |
|
474 |
| NONE => |
|
475 |
error (Pretty.string_of (Pretty.para |
|
476 |
"The Vampire prover is not activated; to activate it, set the Isabelle system option \ |
|
477 |
\\"vampire_noncommercial\" to \"yes\" (e.g. via the Isabelle/jEdit menu Plugin Options \ |
|
478 |
\/ Isabelle / General)"))) |
|
44420 | 479 |
|
68563 | 480 |
val vampire_basic_options = "--proof tptp --output_axiom_names on $VAMPIRE_EXTRA_OPTIONS" |
58084 | 481 |
|
482 |
val vampire_full_proof_options = |
|
71793 | 483 |
" --proof_extra free --forced_options avatar=off:equality_proxy=off:general_splitting=off:inequality_splitting=0:naming=0" |
58084 | 484 |
|
40059
6ad9081665db
use consistent terminology in Sledgehammer: "prover = ATP or SMT solver or ..."
blanchet
parents:
39491
diff
changeset
|
485 |
val vampire_config : atp_config = |
73375 | 486 |
let |
487 |
val format = TFF (Without_FOOL, Monomorphic) |
|
488 |
in |
|
489 |
{exec = (["VAMPIRE_HOME"], ["vampire"]), |
|
73432 | 490 |
arguments = fn _ => fn full_proofs => fn sos => fn timeout => fn problem => fn _ => |
73375 | 491 |
(check_vampire_noncommercial (); |
73568
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
492 |
[vampire_basic_options ^ (if full_proofs then " " ^ vampire_full_proof_options else "") ^ |
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
493 |
" -t " ^ string_of_int (to_secs 1 timeout) ^ " --input_file " ^ File.bash_path problem |
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
494 |
|> sos = sosN ? prefix "--sos on "]), |
73375 | 495 |
proof_delims = |
496 |
[("=========== Refutation ==========", |
|
497 |
"======= End of refutation =======")] @ |
|
498 |
tstp_proof_delims, |
|
499 |
known_failures = |
|
500 |
[(GaveUp, "UNPROVABLE"), |
|
501 |
(GaveUp, "CANNOT PROVE"), |
|
502 |
(Unprovable, "Satisfiability detected"), |
|
503 |
(Unprovable, "Termination reason: Satisfiable"), |
|
504 |
(Interrupted, "Aborted by signal SIGINT")] @ |
|
505 |
known_szs_status_failures, |
|
506 |
prem_role = Hypothesis, |
|
507 |
best_slices = fn ctxt => |
|
508 |
(* FUDGE *) |
|
509 |
[(0.333, (((500, meshN), format, "mono_native", combs_or_liftingN, false), sosN)), |
|
510 |
(0.333, (((150, meshN), format, "poly_tags??", combs_or_liftingN, false), sosN)), |
|
511 |
(0.334, (((50, meshN), format, "mono_native", combs_or_liftingN, false), no_sosN))] |
|
512 |
|> Config.get ctxt force_sos ? (hd #> apfst (K 1.0) #> single), |
|
513 |
best_max_mono_iters = default_max_mono_iters, |
|
514 |
best_max_new_mono_instances = 2 * default_max_new_mono_instances (* FUDGE *)} |
|
515 |
end |
|
38454
9043eefe8d71
detect old Vampire and give a nicer error message
blanchet
parents:
38433
diff
changeset
|
516 |
|
47646 | 517 |
val vampire = (vampireN, fn () => vampire_config) |
37509
f39464d971c4
factor out TPTP format output into file of its own, to facilitate further changes
blanchet
parents:
37506
diff
changeset
|
518 |
|
68563 | 519 |
|
48803
ffa31bf5c662
tone down "z3_tptp", now that Z3 (starting with 4.1) no longer supports TPTP TFF0
blanchet
parents:
48801
diff
changeset
|
520 |
(* Z3 with TPTP syntax (half experimental, half legacy) *) |
41740
4b09f8b9e012
added "Z3 as an ATP" support to Sledgehammer locally
blanchet
parents:
41738
diff
changeset
|
521 |
|
44423
f74707e12d30
exploit TFF format in Z3 used as ATP, and renamed it "z3_tptp"
blanchet
parents:
44422
diff
changeset
|
522 |
val z3_tptp_config : atp_config = |
73375 | 523 |
let |
524 |
val format = TFF (Without_FOOL, Monomorphic) |
|
525 |
in |
|
526 |
{exec = (["Z3_TPTP_HOME"], ["z3_tptp"]), |
|
73432 | 527 |
arguments = fn _ => fn _ => fn _ => fn timeout => fn problem => fn _ => |
73568
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
528 |
["-proof -t:" ^ string_of_int (to_secs 1 timeout) ^ " -file:" ^ File.bash_path problem], |
73375 | 529 |
proof_delims = [("SZS status Theorem", "")], |
530 |
known_failures = known_szs_status_failures, |
|
531 |
prem_role = Hypothesis, |
|
532 |
best_slices = |
|
533 |
(* FUDGE *) |
|
534 |
K [(0.5, (((250, meshN), format, "mono_native", combsN, false), "")), |
|
535 |
(0.25, (((125, mepoN), format, "mono_native", combsN, false), "")), |
|
536 |
(0.125, (((62, mashN), format, "mono_native", combsN, false), "")), |
|
537 |
(0.125, (((31, meshN), format, "mono_native", combsN, false), ""))], |
|
538 |
best_max_mono_iters = default_max_mono_iters, |
|
539 |
best_max_new_mono_instances = 2 * default_max_new_mono_instances (* FUDGE *)} |
|
540 |
end |
|
41740
4b09f8b9e012
added "Z3 as an ATP" support to Sledgehammer locally
blanchet
parents:
41738
diff
changeset
|
541 |
|
47646 | 542 |
val z3_tptp = (z3_tptpN, fn () => z3_tptp_config) |
41740
4b09f8b9e012
added "Z3 as an ATP" support to Sledgehammer locally
blanchet
parents:
41738
diff
changeset
|
543 |
|
44590 | 544 |
|
69717 | 545 |
(* Zipperposition *) |
546 |
||
72174 | 547 |
val zipperposition_blsimp = "--mode ho-pragmatic --max-inferences 3 --ho-max-app-projections 0 --ho-max-elims 0 --ho-max-rigid-imitations 2 --ho-max-identifications 0 --ho-unif-max-depth 2 --boolean-reasoning no-cases --ext-rules ext-family --ext-rules-max-depth 1 --kbo-weight-fun invdocc --ho-prim-enum tf --ho-prim-enum-early-bird true --tptp-def-as-rewrite --ho-unif-level pragmatic-framework -q '1|const|conjecture-relative-var(1,s,f)' -q '1|prefer-processed|pnrefined(1,1,1,2,2,2,0.5)' -q '1|prefer-sos|staggered(1)' -q '2|prefer-fo|default' -q '1|prefer-neg-unit|orient-lmax(2,1,2,1,1)' -q '2|prefer-easy-ho|conjecture-relative-struct(1.5,3.5,2,3)' --ho-elim-leibniz 2 --ho-fixpoint-decider true --ho-pattern-decider false --ho-solid-decider true --ho-max-solidification 12 --select e-selection11 --solve-formulas true --sup-at-vars false --sup-at-var-headed false --lazy-cnf true --lazy-cnf-kind simp --lazy-cnf-renaming-threshold 4 --sine 50 --sine-tolerance 1.7 --sine-depth-max 3 --sine-depth-min 1 --sine-trim-implications true --ho-selection-restriction none --sup-from-var-headed false --sine-trim-implications true" |
548 |
val zipperposition_s6 = "--tptp-def-as-rewrite --rewrite-before-cnf true --mode ho-competitive --boolean-reasoning no-cases --ext-rules off --ho-prim-enum none --recognize-injectivity true --ho-elim-leibniz off --ho-unif-level full-framework --no-max-vars -q '3|const|conjecture-relative-var(1.02,l,f)' -q '1|prefer-ho-steps|conjecture-relative-var(1,s,f)' -q '1|prefer-processed|fifo' -q '3|by-app-var-num|pnrefined(2,1,1,1,2,2,2)' --select ho-selection5 --prec-gen-fun unary_first --solid-subsumption false --ignore-orphans false --ho-solid-decider true --ho-fixpoint-decider true --ho-pattern-decider true --sup-at-vars false --sup-at-var-headed false --sup-from-var-headed false --ho-neg-ext-simpl true" |
|
549 |
val zipperposition_cdots = "--mode ho-competitive --boolean-reasoning cases-simpl --ext-rules ext-family --ext-rules-max-depth 1 --ho-prim-enum pragmatic --ho-prim-max 1 --bool-subterm-selection A --avatar off --recognize-injectivity true --ho-elim-leibniz 1 --ho-unif-level full-framework --no-max-vars -q '6|prefer-sos|pnrefined(1,1,1,2,2,2,0.5)' -q '6|const|conjecture-relative-var(1.02,l,f)' -q '1|prefer-processed|fifo' -q '1|prefer-non-goals|conjecture-relative-var(1,l,f)' -q '4|prefer-easy-ho|conjecture-relative-var(1.01,s,f)' --select e-selection7 --ho-choice-inst true --sine 50 --sine-tolerance 2 --sine-depth-max 4 --sine-depth-min 1 --scan-clause-ac true --lambdasup 0 --kbo-weight-fun invfreqrank" |
|
550 |
||
57154 | 551 |
val zipperposition_config : atp_config = |
73375 | 552 |
let |
73859
bc263f1f68cd
added support for TFX's and THF's $ite to Sledgehammer
desharna
parents:
73568
diff
changeset
|
553 |
val format = THF (With_FOOL, Polymorphic, THF_Without_Choice) |
73375 | 554 |
in |
555 |
{exec = (["ZIPPERPOSITION_HOME"], ["zipperposition"]), |
|
73432 | 556 |
arguments = fn _ => fn _ => fn extra_options => fn timeout => fn problem => fn _ => |
73568
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
557 |
["--input tptp --output tptp --timeout " ^ string_of_int (to_secs 1 timeout) ^ " " ^ File.bash_path problem |
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
558 |
|> extra_options <> "" ? prefix (extra_options ^ " ")], |
73375 | 559 |
proof_delims = tstp_proof_delims, |
560 |
known_failures = known_szs_status_failures, |
|
561 |
prem_role = Hypothesis, |
|
562 |
best_slices = fn _ => |
|
563 |
(* FUDGE *) |
|
73859
bc263f1f68cd
added support for TFX's and THF's $ite to Sledgehammer
desharna
parents:
73568
diff
changeset
|
564 |
[(0.333, (((128, "meshN"), format, "mono_native_higher_fool", keep_lamsN, false), zipperposition_blsimp)), |
bc263f1f68cd
added support for TFX's and THF's $ite to Sledgehammer
desharna
parents:
73568
diff
changeset
|
565 |
(0.333, (((32, "meshN"), format, "poly_native_higher_fool", keep_lamsN, false), zipperposition_s6)), |
bc263f1f68cd
added support for TFX's and THF's $ite to Sledgehammer
desharna
parents:
73568
diff
changeset
|
566 |
(0.334, (((512, "meshN"), format, "mono_native_higher_fool", keep_lamsN, false), zipperposition_cdots))], |
73375 | 567 |
best_max_mono_iters = default_max_mono_iters, |
568 |
best_max_new_mono_instances = default_max_new_mono_instances} |
|
569 |
end |
|
57154 | 570 |
|
571 |
val zipperposition = (zipperpositionN, fn () => zipperposition_config) |
|
572 |
||
573 |
||
40059
6ad9081665db
use consistent terminology in Sledgehammer: "prover = ATP or SMT solver or ..."
blanchet
parents:
39491
diff
changeset
|
574 |
(* Remote ATP invocation via SystemOnTPTP *) |
28596
fcd463a6b6de
tuned interfaces -- plain prover function, without thread;
wenzelm
parents:
28592
diff
changeset
|
575 |
|
73426
bd8bce50b9d4
use SystemOnTPTP.list_systems from Isabelle/Scala, with dynamic URL option and more elementary error messages;
wenzelm
parents:
73375
diff
changeset
|
576 |
val no_remote_systems = {url = "", systems = [] : string list} |
bd8bce50b9d4
use SystemOnTPTP.list_systems from Isabelle/Scala, with dynamic URL option and more elementary error messages;
wenzelm
parents:
73375
diff
changeset
|
577 |
val remote_systems = Synchronized.var "atp_remote_systems" no_remote_systems |
31835 | 578 |
|
49984 | 579 |
fun get_remote_systems () = |
73426
bd8bce50b9d4
use SystemOnTPTP.list_systems from Isabelle/Scala, with dynamic URL option and more elementary error messages;
wenzelm
parents:
73375
diff
changeset
|
580 |
Timeout.apply (seconds 10.0) SystemOnTPTP.list_systems () |
bd8bce50b9d4
use SystemOnTPTP.list_systems from Isabelle/Scala, with dynamic URL option and more elementary error messages;
wenzelm
parents:
73375
diff
changeset
|
581 |
handle ERROR msg => (warning msg; no_remote_systems) |
bd8bce50b9d4
use SystemOnTPTP.list_systems from Isabelle/Scala, with dynamic URL option and more elementary error messages;
wenzelm
parents:
73375
diff
changeset
|
582 |
| Timeout.TIMEOUT _ => no_remote_systems |
31835 | 583 |
|
49984 | 584 |
fun find_remote_system name [] systems = |
42537
25ceb855a18b
improve version handling -- prefer versions of ToFoF, SInE, and SNARK that are known to work
blanchet
parents:
42535
diff
changeset
|
585 |
find_first (String.isPrefix (name ^ "---")) systems |
49984 | 586 |
| find_remote_system name (version :: versions) systems = |
38690
38a926e033ad
make remote ATP versions more robust, by starting with "preferred" version numbers and falling back on any version
blanchet
parents:
38685
diff
changeset
|
587 |
case find_first (String.isPrefix (name ^ "---" ^ version)) systems of |
49984 | 588 |
NONE => find_remote_system name versions systems |
38690
38a926e033ad
make remote ATP versions more robust, by starting with "preferred" version numbers and falling back on any version
blanchet
parents:
38685
diff
changeset
|
589 |
| res => res |
38a926e033ad
make remote ATP versions more robust, by starting with "preferred" version numbers and falling back on any version
blanchet
parents:
38685
diff
changeset
|
590 |
|
49984 | 591 |
fun get_remote_system name versions = |
73426
bd8bce50b9d4
use SystemOnTPTP.list_systems from Isabelle/Scala, with dynamic URL option and more elementary error messages;
wenzelm
parents:
73375
diff
changeset
|
592 |
Synchronized.change_result remote_systems (fn remote => |
bd8bce50b9d4
use SystemOnTPTP.list_systems from Isabelle/Scala, with dynamic URL option and more elementary error messages;
wenzelm
parents:
73375
diff
changeset
|
593 |
(if #url remote <> SystemOnTPTP.get_url () orelse null (#systems remote) |
bd8bce50b9d4
use SystemOnTPTP.list_systems from Isabelle/Scala, with dynamic URL option and more elementary error messages;
wenzelm
parents:
73375
diff
changeset
|
594 |
then get_remote_systems () else remote) |> ` #systems) |
bd8bce50b9d4
use SystemOnTPTP.list_systems from Isabelle/Scala, with dynamic URL option and more elementary error messages;
wenzelm
parents:
73375
diff
changeset
|
595 |
|> `(find_remote_system name versions) |
32864
a226f29d4bdc
re-organized signature of AtpWrapper structure: records instead of unnamed parameters and return values,
boehmes
parents:
32740
diff
changeset
|
596 |
|
49984 | 597 |
fun the_remote_system name versions = |
54788
a898e15b522a
primitive support for SPASS-Pirate (Daniel Wand's polymorphic SPASS prototype)
blanchet
parents:
54197
diff
changeset
|
598 |
(case get_remote_system name versions of |
42955 | 599 |
(SOME sys, _) => sys |
63692 | 600 |
| (NONE, []) => error "SystemOnTPTP is currently not available" |
42955 | 601 |
| (NONE, syss) => |
54788
a898e15b522a
primitive support for SPASS-Pirate (Daniel Wand's polymorphic SPASS prototype)
blanchet
parents:
54197
diff
changeset
|
602 |
(case syss |> filter_out (String.isPrefix "%") |> filter_out (curry (op =) "") of |
63692 | 603 |
[] => error "SystemOnTPTP is currently not available" |
604 |
| [msg] => error ("SystemOnTPTP is currently not available: " ^ msg) |
|
46480 | 605 |
| syss => |
54788
a898e15b522a
primitive support for SPASS-Pirate (Daniel Wand's polymorphic SPASS prototype)
blanchet
parents:
54197
diff
changeset
|
606 |
error ("System " ^ quote name ^ " is not available at SystemOnTPTP.\n(Available systems: " ^ |
a898e15b522a
primitive support for SPASS-Pirate (Daniel Wand's polymorphic SPASS prototype)
blanchet
parents:
54197
diff
changeset
|
607 |
commas_quote syss ^ ".)"))) |
31835 | 608 |
|
72174 | 609 |
val max_remote_secs = 1000 (* give Geoff Sutcliffe's servers a break *) |
41148 | 610 |
|
73435
1cc848548f21
invoke remote ATP via SystemOnTPTP.run_systems from Isabelle/Scala (without perl);
wenzelm
parents:
73432
diff
changeset
|
611 |
val isabelle_scala_function = (["SCALA_HOME"], ["bin/scala"]) |
1cc848548f21
invoke remote ATP via SystemOnTPTP.run_systems from Isabelle/Scala (without perl);
wenzelm
parents:
73432
diff
changeset
|
612 |
|
58084 | 613 |
fun remote_config system_name system_versions proof_delims known_failures prem_role best_slice = |
73435
1cc848548f21
invoke remote ATP via SystemOnTPTP.run_systems from Isabelle/Scala (without perl);
wenzelm
parents:
73432
diff
changeset
|
614 |
{exec = isabelle_scala_function, |
73432 | 615 |
arguments = fn _ => fn _ => fn command => fn timeout => fn problem => fn _ => |
73435
1cc848548f21
invoke remote ATP via SystemOnTPTP.run_systems from Isabelle/Scala (without perl);
wenzelm
parents:
73432
diff
changeset
|
616 |
[the_remote_system system_name system_versions, |
1cc848548f21
invoke remote ATP via SystemOnTPTP.run_systems from Isabelle/Scala (without perl);
wenzelm
parents:
73432
diff
changeset
|
617 |
Isabelle_System.absolute_path problem, |
73568
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
618 |
command, string_of_int (Int.min (max_remote_secs, to_secs 1 timeout) * 1000)], |
42962
3b50fdeb6cfc
started adding support for THF output (but no lambdas)
blanchet
parents:
42955
diff
changeset
|
619 |
proof_delims = union (op =) tstp_proof_delims proof_delims, |
73436
e92f2e44e4d8
removed spurious references to perl / libwww-perl;
wenzelm
parents:
73435
diff
changeset
|
620 |
known_failures = known_failures @ known_says_failures, |
47976 | 621 |
prem_role = prem_role, |
48716
1d2a12bb0640
stop distinguishing between complete and incomplete slices, since this is very fragile and has hardly any useful semantics to users
blanchet
parents:
48715
diff
changeset
|
622 |
best_slices = fn ctxt => [(1.0, best_slice ctxt)], |
47962
137883567114
lower the monomorphization thresholds for less scalable provers
blanchet
parents:
47955
diff
changeset
|
623 |
best_max_mono_iters = default_max_mono_iters, |
58084 | 624 |
best_max_new_mono_instances = default_max_new_mono_instances} : atp_config |
42443
724e612ba248
implemented general slicing for ATPs, especially E 1.2w and above
blanchet
parents:
42332
diff
changeset
|
625 |
|
43500
4c357b7aa710
provide appropriate type system and number of fact defaults for remote ATPs
blanchet
parents:
43497
diff
changeset
|
626 |
fun remotify_config system_name system_versions best_slice |
58084 | 627 |
({proof_delims, known_failures, prem_role, ...} : atp_config) = |
628 |
remote_config system_name system_versions proof_delims known_failures prem_role best_slice |
|
38023 | 629 |
|
58084 | 630 |
fun remote_atp name system_name system_versions proof_delims known_failures prem_role best_slice = |
631 |
(remote_prefix ^ name, fn () => |
|
632 |
remote_config system_name system_versions proof_delims known_failures prem_role best_slice) |
|
43500
4c357b7aa710
provide appropriate type system and number of fact defaults for remote ATPs
blanchet
parents:
43497
diff
changeset
|
633 |
fun remotify_atp (name, config) system_name system_versions best_slice = |
58084 | 634 |
(remote_prefix ^ name, remotify_config system_name system_versions best_slice o config) |
28592 | 635 |
|
57269
1df6f747f164
changed type encoding for new Waldmeister, to trigger filtering of 'dangerous' lemmas
blanchet
parents:
57265
diff
changeset
|
636 |
fun gen_remote_waldmeister name type_enc = |
57265 | 637 |
remote_atp name "Waldmeister" ["710"] tstp_proof_delims |
638 |
([(OutOfResources, "Too many function symbols"), |
|
639 |
(Inappropriate, "**** Unexpected end of file."), |
|
640 |
(Crashed, "Unrecoverable Segmentation Fault")] |
|
641 |
@ known_szs_status_failures) |
|
57264 | 642 |
Hypothesis |
57269
1df6f747f164
changed type encoding for new Waldmeister, to trigger filtering of 'dangerous' lemmas
blanchet
parents:
57265
diff
changeset
|
643 |
(K (((50, ""), CNF_UEQ, type_enc, combsN, false), "") (* FUDGE *)) |
57264 | 644 |
|
52094 | 645 |
val remote_agsyhol = |
646 |
remotify_atp agsyhol "agsyHOL" ["1.0", "1"] |
|
72588 | 647 |
(K (((60, ""), THF (Without_FOOL, Monomorphic, THF_Without_Choice), "mono_native_higher", keep_lamsN, false), "") (* FUDGE *)) |
70937 | 648 |
val remote_alt_ergo = |
649 |
remotify_atp alt_ergo "Alt-Ergo" ["0.95.2"] |
|
72588 | 650 |
(K (((250, ""), TFF (Without_FOOL, Polymorphic), "poly_native", keep_lamsN, false), "") (* FUDGE *)) |
43500
4c357b7aa710
provide appropriate type system and number of fact defaults for remote ATPs
blanchet
parents:
43497
diff
changeset
|
651 |
val remote_e = |
63768 | 652 |
remotify_atp e "E" ["2.0", "1.9.1", "1.8"] |
72588 | 653 |
(K (((750, ""), TFF (Without_FOOL, Monomorphic), "mono_native", combsN, false), "") (* FUDGE *)) |
48700 | 654 |
val remote_iprover = |
52094 | 655 |
remotify_atp iprover "iProver" ["0.99"] |
58084 | 656 |
(K (((150, ""), FOF, "mono_guards??", liftingN, false), "") (* FUDGE *)) |
44099 | 657 |
val remote_leo2 = |
52094 | 658 |
remotify_atp leo2 "LEO-II" ["1.5.0", "1.4", "1.3", "1.2", "1"] |
72588 | 659 |
(K (((40, ""), THF (Without_FOOL, Monomorphic, THF_Without_Choice), "mono_native_higher", liftingN, false), "") (* FUDGE *)) |
67021
41f1f8c4259b
integrated Leo-III in Sledgehammer (thanks to Alexander Steen for the patch)
blanchet
parents:
66544
diff
changeset
|
660 |
val remote_leo3 = |
41f1f8c4259b
integrated Leo-III in Sledgehammer (thanks to Alexander Steen for the patch)
blanchet
parents:
66544
diff
changeset
|
661 |
remotify_atp leo3 "Leo-III" ["1.1"] |
72588 | 662 |
(K (((150, ""), THF (Without_FOOL, Polymorphic, THF_Without_Choice), "poly_native_higher", keep_lamsN, false), "") (* FUDGE *)) |
57269
1df6f747f164
changed type encoding for new Waldmeister, to trigger filtering of 'dangerous' lemmas
blanchet
parents:
57265
diff
changeset
|
663 |
val remote_waldmeister = gen_remote_waldmeister waldmeisterN "raw_mono_tags??" |
70940
ce3a05ad07b7
added support for Zipperposition on SystemOnTPTP
blanchet
parents:
70939
diff
changeset
|
664 |
val remote_zipperposition = |
72174 | 665 |
remotify_atp zipperposition "Zipperpin" ["2.0"] |
72588 | 666 |
(K (((512, ""), THF (Without_FOOL, Monomorphic, THF_Without_Choice), "mono_native_higher", keep_lamsN, false), "") (* FUDGE *)) |
667 |
||
668 |
||
669 |
(* Dummy prover *) |
|
670 |
||
671 |
fun dummy_config prem_role format type_enc uncurried_aliases : atp_config = |
|
73374 | 672 |
{exec = (["ISABELLE_ATP"], ["scripts/dummy_atp"]), |
73568
bdba138d462d
clarified signature: more structured arguments, notably for remote provers;
wenzelm
parents:
73437
diff
changeset
|
673 |
arguments = K (K (K (K (K (K []))))), |
72588 | 674 |
proof_delims = [], |
675 |
known_failures = known_szs_status_failures, |
|
676 |
prem_role = prem_role, |
|
677 |
best_slices = |
|
678 |
K [(1.0, (((200, ""), format, type_enc, |
|
679 |
if is_format_higher_order format then keep_lamsN |
|
680 |
else combsN, uncurried_aliases), ""))], |
|
681 |
best_max_mono_iters = default_max_mono_iters, |
|
682 |
best_max_new_mono_instances = default_max_new_mono_instances} |
|
683 |
||
684 |
val dummy_tfx_format = TFF (With_FOOL, Polymorphic) |
|
685 |
||
686 |
val dummy_tfx_config = dummy_config Hypothesis dummy_tfx_format "mono_native_fool" false |
|
687 |
val dummy_tfx = (dummy_tfxN, fn () => dummy_tfx_config) |
|
38454
9043eefe8d71
detect old Vampire and give a nicer error message
blanchet
parents:
38433
diff
changeset
|
688 |
|
52073
ccb292952774
started adding agsyHOL as an experimental prover
blanchet
parents:
51998
diff
changeset
|
689 |
|
38454
9043eefe8d71
detect old Vampire and give a nicer error message
blanchet
parents:
38433
diff
changeset
|
690 |
(* Setup *) |
9043eefe8d71
detect old Vampire and give a nicer error message
blanchet
parents:
38433
diff
changeset
|
691 |
|
40059
6ad9081665db
use consistent terminology in Sledgehammer: "prover = ATP or SMT solver or ..."
blanchet
parents:
39491
diff
changeset
|
692 |
fun add_atp (name, config) thy = |
6ad9081665db
use consistent terminology in Sledgehammer: "prover = ATP or SMT solver or ..."
blanchet
parents:
39491
diff
changeset
|
693 |
Data.map (Symtab.update_new (name, (config, stamp ()))) thy |
63692 | 694 |
handle Symtab.DUP name => error ("Duplicate ATP: " ^ quote name) |
40059
6ad9081665db
use consistent terminology in Sledgehammer: "prover = ATP or SMT solver or ..."
blanchet
parents:
39491
diff
changeset
|
695 |
|
6ad9081665db
use consistent terminology in Sledgehammer: "prover = ATP or SMT solver or ..."
blanchet
parents:
39491
diff
changeset
|
696 |
fun get_atp thy name = |
54788
a898e15b522a
primitive support for SPASS-Pirate (Daniel Wand's polymorphic SPASS prototype)
blanchet
parents:
54197
diff
changeset
|
697 |
fst (the (Symtab.lookup (Data.get thy) name)) |
63692 | 698 |
handle Option.Option => error ("Unknown ATP: " ^ name) |
40059
6ad9081665db
use consistent terminology in Sledgehammer: "prover = ATP or SMT solver or ..."
blanchet
parents:
39491
diff
changeset
|
699 |
|
41727
ab3f6d76fb23
available_provers ~> supported_provers (for clarity)
blanchet
parents:
41725
diff
changeset
|
700 |
val supported_atps = Symtab.keys o Data.get |
36371
8c83ea1a7740
move the Sledgehammer menu options to "sledgehammer_isar.ML"
blanchet
parents:
36370
diff
changeset
|
701 |
|
40059
6ad9081665db
use consistent terminology in Sledgehammer: "prover = ATP or SMT solver or ..."
blanchet
parents:
39491
diff
changeset
|
702 |
fun is_atp_installed thy name = |
48376
416e4123baf3
use "eproof_ram" script if available (plug-in replacement for "eproof", but faster)
blanchet
parents:
48232
diff
changeset
|
703 |
let val {exec, ...} = get_atp thy name () in |
73374 | 704 |
exists (fn var => getenv var <> "") (fst exec) |
40059
6ad9081665db
use consistent terminology in Sledgehammer: "prover = ATP or SMT solver or ..."
blanchet
parents:
39491
diff
changeset
|
705 |
end |
36371
8c83ea1a7740
move the Sledgehammer menu options to "sledgehammer_isar.ML"
blanchet
parents:
36370
diff
changeset
|
706 |
|
40059
6ad9081665db
use consistent terminology in Sledgehammer: "prover = ATP or SMT solver or ..."
blanchet
parents:
39491
diff
changeset
|
707 |
fun refresh_systems_on_tptp () = |
49984 | 708 |
Synchronized.change remote_systems (fn _ => get_remote_systems ()) |
40059
6ad9081665db
use consistent terminology in Sledgehammer: "prover = ATP or SMT solver or ..."
blanchet
parents:
39491
diff
changeset
|
709 |
|
47055
16e2633f3b4b
made "spass" a "metaprover" that uses either the new SPASS or the old SPASS, to preserve backward compatibility and prepare for the upcoming release
blanchet
parents:
47053
diff
changeset
|
710 |
fun effective_term_order ctxt atp = |
16e2633f3b4b
made "spass" a "metaprover" that uses either the new SPASS or the old SPASS, to preserve backward compatibility and prepare for the upcoming release
blanchet
parents:
47053
diff
changeset
|
711 |
let val ord = Config.get ctxt term_order in |
16e2633f3b4b
made "spass" a "metaprover" that uses either the new SPASS or the old SPASS, to preserve backward compatibility and prepare for the upcoming release
blanchet
parents:
47053
diff
changeset
|
712 |
if ord = smartN then |
54788
a898e15b522a
primitive support for SPASS-Pirate (Daniel Wand's polymorphic SPASS prototype)
blanchet
parents:
54197
diff
changeset
|
713 |
{is_lpo = false, gen_weights = (atp = spassN), gen_prec = (atp = spassN), |
72401
2783779b7dd3
removed obsolete unmaintained experimental prover Pirate
blanchet
parents:
72400
diff
changeset
|
714 |
gen_simp = false} |
47055
16e2633f3b4b
made "spass" a "metaprover" that uses either the new SPASS or the old SPASS, to preserve backward compatibility and prepare for the upcoming release
blanchet
parents:
47053
diff
changeset
|
715 |
else |
16e2633f3b4b
made "spass" a "metaprover" that uses either the new SPASS or the old SPASS, to preserve backward compatibility and prepare for the upcoming release
blanchet
parents:
47053
diff
changeset
|
716 |
let val is_lpo = String.isSubstring lpoN ord in |
54788
a898e15b522a
primitive support for SPASS-Pirate (Daniel Wand's polymorphic SPASS prototype)
blanchet
parents:
54197
diff
changeset
|
717 |
{is_lpo = is_lpo, gen_weights = not is_lpo andalso String.isSubstring xweightsN ord, |
a898e15b522a
primitive support for SPASS-Pirate (Daniel Wand's polymorphic SPASS prototype)
blanchet
parents:
54197
diff
changeset
|
718 |
gen_prec = String.isSubstring xprecN ord, gen_simp = String.isSubstring xsimpN ord} |
47055
16e2633f3b4b
made "spass" a "metaprover" that uses either the new SPASS or the old SPASS, to preserve backward compatibility and prepare for the upcoming release
blanchet
parents:
47053
diff
changeset
|
719 |
end |
16e2633f3b4b
made "spass" a "metaprover" that uses either the new SPASS or the old SPASS, to preserve backward compatibility and prepare for the upcoming release
blanchet
parents:
47053
diff
changeset
|
720 |
end |
16e2633f3b4b
made "spass" a "metaprover" that uses either the new SPASS or the old SPASS, to preserve backward compatibility and prepare for the upcoming release
blanchet
parents:
47053
diff
changeset
|
721 |
|
52073
ccb292952774
started adding agsyHOL as an experimental prover
blanchet
parents:
51998
diff
changeset
|
722 |
val atps = |
72403
4a3169d8885c
removed support for obsolete prover SNARK and underperforming prover E-Par
blanchet
parents:
72401
diff
changeset
|
723 |
[agsyhol, alt_ergo, e, iprover, leo2, leo3, satallax, spass, vampire, z3_tptp, zipperposition, |
4a3169d8885c
removed support for obsolete prover SNARK and underperforming prover E-Par
blanchet
parents:
72401
diff
changeset
|
724 |
remote_agsyhol, remote_alt_ergo, remote_e, remote_iprover, remote_leo2, remote_leo3, |
74005
14de47e29fe4
get rid of remote_vampire since it's hard, if possible at all, to follow Vampire's online options
blanchet
parents:
73974
diff
changeset
|
725 |
remote_waldmeister, remote_zipperposition, dummy_tfx] |
47055
16e2633f3b4b
made "spass" a "metaprover" that uses either the new SPASS or the old SPASS, to preserve backward compatibility and prepare for the upcoming release
blanchet
parents:
47053
diff
changeset
|
726 |
|
57262 | 727 |
val _ = Theory.setup (fold add_atp atps) |
35867 | 728 |
|
28592 | 729 |
end; |