src/HOL/Tools/Sledgehammer/MaSh/src/mash.py
author blanchet
Wed, 12 Dec 2012 00:14:58 +0100
changeset 50482 d7be7ccf428b
parent 50441 1e71f9d3cd57
child 50619 b958a94cf811
permissions -rwxr-xr-x
updated version of MaSh learner engine
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
     1
#!/usr/bin/python
50222
40e3c3be6bca added file headers
blanchet
parents: 50220
diff changeset
     2
#     Title:      HOL/Tools/Sledgehammer/MaSh/src/mash.py
40e3c3be6bca added file headers
blanchet
parents: 50220
diff changeset
     3
#     Author:     Daniel Kuehlwein, ICIS, Radboud University Nijmegen
40e3c3be6bca added file headers
blanchet
parents: 50220
diff changeset
     4
#     Copyright   2012
40e3c3be6bca added file headers
blanchet
parents: 50220
diff changeset
     5
#
40e3c3be6bca added file headers
blanchet
parents: 50220
diff changeset
     6
# Entry point for MaSh (Machine Learning for Sledgehammer).
40e3c3be6bca added file headers
blanchet
parents: 50220
diff changeset
     7
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
     8
'''
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
     9
MaSh - Machine Learning for Sledgehammer
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    10
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    11
MaSh allows to use different machine learning algorithms to predict relevant fact for Sledgehammer.
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    12
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    13
Created on July 12, 2012
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    14
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    15
@author: Daniel Kuehlwein
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    16
'''
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    17
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    18
import logging,datetime,string,os,sys
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    19
from argparse import ArgumentParser,RawDescriptionHelpFormatter
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    20
from time import time
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    21
from stats import Statistics
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    22
from dictionaries import Dictionaries
50399
52d9720f7a48 made Python code compile again (by Daniel K.)
blanchet
parents: 50388
diff changeset
    23
#from fullNaiveBayes import NBClassifier
50482
d7be7ccf428b updated version of MaSh learner engine
blanchet
parents: 50441
diff changeset
    24
from sparseNaiveBayes import sparseNBClassifier
d7be7ccf428b updated version of MaSh learner engine
blanchet
parents: 50441
diff changeset
    25
#from naiveBayes import sparseNBClassifier
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    26
from snow import SNoW
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    27
from predefined import Predefined
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    28
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    29
# Set up command-line parser
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    30
parser = ArgumentParser(description='MaSh - Machine Learning for Sledgehammer.  \n\n\
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    31
MaSh allows to use different machine learning algorithms to predict relevant facts for Sledgehammer.\n\n\
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    32
--------------- Example Usage ---------------\n\
50434
960a3429615c more MaSh tweaking -- in particular, export the same facts in "MaSh_Export" as are later tried in "MaSh_Eval"
blanchet
parents: 50399
diff changeset
    33
First initialize:\n./mash.py -l test.log -o ../tmp/ --init --inputDir ../data/Jinja/ \n\
960a3429615c more MaSh tweaking -- in particular, export the same facts in "MaSh_Export" as are later tried in "MaSh_Eval"
blanchet
parents: 50399
diff changeset
    34
Then create predictions:\n./mash.py -i ../data/Jinja/mash_commands -p ../data/Jinja/mash_suggestions -l test.log -o ../tmp/ --statistics\n\
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    35
\n\n\
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    36
Author: Daniel Kuehlwein, July 2012',formatter_class=RawDescriptionHelpFormatter)
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    37
parser.add_argument('-i','--inputFile',help='File containing all problems to be solved.')
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    38
parser.add_argument('-o','--outputDir', default='../tmp/',help='Directory where all created files are stored. Default=../tmp/.')
50388
a5b666e0c3c2 added weights to MaSh (by Daniel Kuehlwein)
blanchet
parents: 50222
diff changeset
    39
parser.add_argument('-p','--predictions',default='../tmp/%s.predictions' % datetime.datetime.now(),
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    40
                    help='File where the predictions stored. Default=../tmp/dateTime.predictions.')
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    41
parser.add_argument('--numberOfPredictions',default=200,help="Number of premises to write in the output. Default=200.",type=int)
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    42
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    43
parser.add_argument('--init',default=False,action='store_true',help="Initialize Mash. Requires --inputDir to be defined. Default=False.")
50434
960a3429615c more MaSh tweaking -- in particular, export the same facts in "MaSh_Export" as are later tried in "MaSh_Eval"
blanchet
parents: 50399
diff changeset
    44
parser.add_argument('--inputDir',default='../data/Jinja/',\
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    45
                    help='Directory containing all the input data. MaSh expects the following files: mash_features,mash_dependencies,mash_accessibility')
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    46
parser.add_argument('--depFile', default='mash_dependencies',
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    47
                    help='Name of the file with the premise dependencies. The file must be in inputDir. Default = mash_dependencies')
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    48
parser.add_argument('--saveModel',default=False,action='store_true',help="Stores the learned Model at the end of a prediction run. Default=False.")
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    49
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    50
parser.add_argument('--nb',default=False,action='store_true',help="Use Naive Bayes for learning. This is the default learning method.")
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    51
parser.add_argument('--snow',default=False,action='store_true',help="Use SNoW's naive bayes instead of Naive Bayes for learning.")
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    52
parser.add_argument('--predef',default=False,action='store_true',\
50399
52d9720f7a48 made Python code compile again (by Daniel K.)
blanchet
parents: 50388
diff changeset
    53
                    help="Use predefined predictions. Used only for comparison with the actual learning. Expects mash_mepo_suggestions in inputDir.")
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    54
parser.add_argument('--statistics',default=False,action='store_true',help="Create and show statistics for the top CUTOFF predictions.\
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    55
                    WARNING: This will make the program a lot slower! Default=False.")
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    56
parser.add_argument('--saveStats',default=None,help="If defined, stores the statistics in the filename provided.")
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    57
parser.add_argument('--cutOff',default=500,help="Option for statistics. Only consider the first cutOff predictions. Default=500.",type=int)
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    58
parser.add_argument('-l','--log', default='../tmp/%s.log' % datetime.datetime.now(), help='Log file name. Default=../tmp/dateTime.log')
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    59
parser.add_argument('-q','--quiet',default=False,action='store_true',help="If enabled, only print warnings. Default=False.")
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    60
50388
a5b666e0c3c2 added weights to MaSh (by Daniel Kuehlwein)
blanchet
parents: 50222
diff changeset
    61
def main(argv = sys.argv[1:]):
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    62
    # Initializing command-line arguments
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    63
    args = parser.parse_args(argv)
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    64
50388
a5b666e0c3c2 added weights to MaSh (by Daniel Kuehlwein)
blanchet
parents: 50222
diff changeset
    65
    # Set up logging
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    66
    logging.basicConfig(level=logging.DEBUG,
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    67
                        format='%(asctime)s %(name)-12s %(levelname)-8s %(message)s',
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    68
                        datefmt='%d-%m %H:%M:%S',
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    69
                        filename=args.log,
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    70
                        filemode='w')
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    71
    console = logging.StreamHandler(sys.stdout)
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    72
    console.setLevel(logging.INFO)
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    73
    formatter = logging.Formatter('# %(message)s')
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    74
    console.setFormatter(formatter)
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    75
    logging.getLogger('').addHandler(console)
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    76
    logger = logging.getLogger('main.py')
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    77
    if args.quiet:
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    78
        logger.setLevel(logging.WARNING)
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    79
        console.setLevel(logging.WARNING)
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    80
    if not os.path.exists(args.outputDir):
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    81
        os.makedirs(args.outputDir)
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    82
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    83
    logger.info('Using the following settings: %s',args)
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    84
    # Pick algorithm
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    85
    if args.nb:
50441
1e71f9d3cd57 more changes to MaSh Python program (by Daniel K.)
blanchet
parents: 50434
diff changeset
    86
        logger.info('Using sparse Naive Bayes for learning.')
50482
d7be7ccf428b updated version of MaSh learner engine
blanchet
parents: 50441
diff changeset
    87
        model = sparseNBClassifier()
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    88
        modelFile = os.path.join(args.outputDir,'NB.pickle')
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    89
    elif args.snow:
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    90
        logger.info('Using naive bayes (SNoW) for learning.')
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    91
        model = SNoW()
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    92
        modelFile = os.path.join(args.outputDir,'SNoW.pickle')
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    93
    elif args.predef:
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    94
        logger.info('Using predefined predictions.')
50399
52d9720f7a48 made Python code compile again (by Daniel K.)
blanchet
parents: 50388
diff changeset
    95
        #predictionFile = os.path.join(args.inputDir,'mash_meng_paulson_suggestions') 
52d9720f7a48 made Python code compile again (by Daniel K.)
blanchet
parents: 50388
diff changeset
    96
        predictionFile = os.path.join(args.inputDir,'mash_mepo_suggestions')
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    97
        model = Predefined(predictionFile)
50399
52d9720f7a48 made Python code compile again (by Daniel K.)
blanchet
parents: 50388
diff changeset
    98
        modelFile = os.path.join(args.outputDir,'mepo.pickle')        
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
    99
    else:
50441
1e71f9d3cd57 more changes to MaSh Python program (by Daniel K.)
blanchet
parents: 50434
diff changeset
   100
        logger.info('No algorithm specified. Using sparse Naive Bayes.')
50482
d7be7ccf428b updated version of MaSh learner engine
blanchet
parents: 50441
diff changeset
   101
        model = sparseNBClassifier()
50388
a5b666e0c3c2 added weights to MaSh (by Daniel Kuehlwein)
blanchet
parents: 50222
diff changeset
   102
        modelFile = os.path.join(args.outputDir,'NB.pickle')
a5b666e0c3c2 added weights to MaSh (by Daniel Kuehlwein)
blanchet
parents: 50222
diff changeset
   103
    dictsFile = os.path.join(args.outputDir,'dicts.pickle')
a5b666e0c3c2 added weights to MaSh (by Daniel Kuehlwein)
blanchet
parents: 50222
diff changeset
   104
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   105
    # Initializing model
50388
a5b666e0c3c2 added weights to MaSh (by Daniel Kuehlwein)
blanchet
parents: 50222
diff changeset
   106
    if args.init:
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   107
        logger.info('Initializing Model.')
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   108
        startTime = time()
50388
a5b666e0c3c2 added weights to MaSh (by Daniel Kuehlwein)
blanchet
parents: 50222
diff changeset
   109
a5b666e0c3c2 added weights to MaSh (by Daniel Kuehlwein)
blanchet
parents: 50222
diff changeset
   110
        # Load all data
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   111
        dicts = Dictionaries()
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   112
        dicts.init_all(args.inputDir,depFileName=args.depFile)
50388
a5b666e0c3c2 added weights to MaSh (by Daniel Kuehlwein)
blanchet
parents: 50222
diff changeset
   113
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   114
        # Create Model
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   115
        trainData = dicts.featureDict.keys()
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   116
        if args.predef:
50441
1e71f9d3cd57 more changes to MaSh Python program (by Daniel K.)
blanchet
parents: 50434
diff changeset
   117
            model.initializeModel(trainData,dicts)
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   118
        else:
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   119
            model.initializeModel(trainData,dicts)
50388
a5b666e0c3c2 added weights to MaSh (by Daniel Kuehlwein)
blanchet
parents: 50222
diff changeset
   120
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   121
        model.save(modelFile)
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   122
        dicts.save(dictsFile)
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   123
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   124
        logger.info('All Done. %s seconds needed.',round(time()-startTime,2))
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   125
        return 0
50388
a5b666e0c3c2 added weights to MaSh (by Daniel Kuehlwein)
blanchet
parents: 50222
diff changeset
   126
    # Create predictions and/or update model
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   127
    else:
50399
52d9720f7a48 made Python code compile again (by Daniel K.)
blanchet
parents: 50388
diff changeset
   128
        lineCounter = 1
52d9720f7a48 made Python code compile again (by Daniel K.)
blanchet
parents: 50388
diff changeset
   129
        statementCounter = 1
52d9720f7a48 made Python code compile again (by Daniel K.)
blanchet
parents: 50388
diff changeset
   130
        computeStats = False
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   131
        dicts = Dictionaries()
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   132
        # Load Files
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   133
        if os.path.isfile(dictsFile):
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   134
            dicts.load(dictsFile)
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   135
        if os.path.isfile(modelFile):
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   136
            model.load(modelFile)
50388
a5b666e0c3c2 added weights to MaSh (by Daniel Kuehlwein)
blanchet
parents: 50222
diff changeset
   137
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   138
        # IO Streams
50441
1e71f9d3cd57 more changes to MaSh Python program (by Daniel K.)
blanchet
parents: 50434
diff changeset
   139
        OS = open(args.predictions,'w')
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   140
        IS = open(args.inputFile,'r')
50388
a5b666e0c3c2 added weights to MaSh (by Daniel Kuehlwein)
blanchet
parents: 50222
diff changeset
   141
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   142
        # Statistics
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   143
        if args.statistics:
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   144
            stats = Statistics(args.cutOff)
50388
a5b666e0c3c2 added weights to MaSh (by Daniel Kuehlwein)
blanchet
parents: 50222
diff changeset
   145
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   146
        predictions = None
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   147
        #Reading Input File
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   148
        for line in IS:
50399
52d9720f7a48 made Python code compile again (by Daniel K.)
blanchet
parents: 50388
diff changeset
   149
#           try:
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   150
            if True:
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   151
                if line.startswith('!'):
50441
1e71f9d3cd57 more changes to MaSh Python program (by Daniel K.)
blanchet
parents: 50434
diff changeset
   152
                    problemId = dicts.parse_fact(line)                    
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   153
                    # Statistics
50399
52d9720f7a48 made Python code compile again (by Daniel K.)
blanchet
parents: 50388
diff changeset
   154
                    if args.statistics and computeStats:
52d9720f7a48 made Python code compile again (by Daniel K.)
blanchet
parents: 50388
diff changeset
   155
                        computeStats = False
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   156
                        acc = dicts.accessibleDict[problemId]
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   157
                        if args.predef:
50399
52d9720f7a48 made Python code compile again (by Daniel K.)
blanchet
parents: 50388
diff changeset
   158
                            predictions = model.predict(problemId)
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   159
                        else:
50441
1e71f9d3cd57 more changes to MaSh Python program (by Daniel K.)
blanchet
parents: 50434
diff changeset
   160
                            if args.snow:
1e71f9d3cd57 more changes to MaSh Python program (by Daniel K.)
blanchet
parents: 50434
diff changeset
   161
                                predictions,_predictionsValues = model.predict(dicts.featureDict[problemId],dicts.expand_accessibles(acc),dicts)
1e71f9d3cd57 more changes to MaSh Python program (by Daniel K.)
blanchet
parents: 50434
diff changeset
   162
                            else:
1e71f9d3cd57 more changes to MaSh Python program (by Daniel K.)
blanchet
parents: 50434
diff changeset
   163
                                predictions,_predictionsValues = model.predict(dicts.featureDict[problemId],dicts.expand_accessibles(acc))                        
50399
52d9720f7a48 made Python code compile again (by Daniel K.)
blanchet
parents: 50388
diff changeset
   164
                        stats.update(predictions,dicts.dependenciesDict[problemId],statementCounter)
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   165
                        if not stats.badPreds == []:
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   166
                            bp = string.join([str(dicts.idNameDict[x]) for x in stats.badPreds], ',')
50388
a5b666e0c3c2 added weights to MaSh (by Daniel Kuehlwein)
blanchet
parents: 50222
diff changeset
   167
                            logger.debug('Bad predictions: %s',bp)
50399
52d9720f7a48 made Python code compile again (by Daniel K.)
blanchet
parents: 50388
diff changeset
   168
                    statementCounter += 1
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   169
                    # Update Dependencies, p proves p
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   170
                    dicts.dependenciesDict[problemId] = [problemId]+dicts.dependenciesDict[problemId]
50441
1e71f9d3cd57 more changes to MaSh Python program (by Daniel K.)
blanchet
parents: 50434
diff changeset
   171
                    if args.snow:
1e71f9d3cd57 more changes to MaSh Python program (by Daniel K.)
blanchet
parents: 50434
diff changeset
   172
                        model.update(problemId,dicts.featureDict[problemId],dicts.dependenciesDict[problemId],dicts)
1e71f9d3cd57 more changes to MaSh Python program (by Daniel K.)
blanchet
parents: 50434
diff changeset
   173
                    else:
1e71f9d3cd57 more changes to MaSh Python program (by Daniel K.)
blanchet
parents: 50434
diff changeset
   174
                        model.update(problemId,dicts.featureDict[problemId],dicts.dependenciesDict[problemId])
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   175
                elif line.startswith('p'):
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   176
                    # Overwrite old proof.
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   177
                    problemId,newDependencies = dicts.parse_overwrite(line)
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   178
                    newDependencies = [problemId]+newDependencies
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   179
                    model.overwrite(problemId,newDependencies,dicts)
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   180
                    dicts.dependenciesDict[problemId] = newDependencies
50399
52d9720f7a48 made Python code compile again (by Daniel K.)
blanchet
parents: 50388
diff changeset
   181
                elif line.startswith('?'):                    
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   182
                    startTime = time()
50399
52d9720f7a48 made Python code compile again (by Daniel K.)
blanchet
parents: 50388
diff changeset
   183
                    computeStats = True
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   184
                    if args.predef:
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   185
                        continue
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   186
                    name,features,accessibles = dicts.parse_problem(line)
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   187
                    # Create predictions
50388
a5b666e0c3c2 added weights to MaSh (by Daniel Kuehlwein)
blanchet
parents: 50222
diff changeset
   188
                    logger.info('Starting computation for problem on line %s',lineCounter)
50441
1e71f9d3cd57 more changes to MaSh Python program (by Daniel K.)
blanchet
parents: 50434
diff changeset
   189
                    if args.snow:
1e71f9d3cd57 more changes to MaSh Python program (by Daniel K.)
blanchet
parents: 50434
diff changeset
   190
                        predictions,predictionValues = model.predict(features,accessibles,dicts)
1e71f9d3cd57 more changes to MaSh Python program (by Daniel K.)
blanchet
parents: 50434
diff changeset
   191
                    else:
1e71f9d3cd57 more changes to MaSh Python program (by Daniel K.)
blanchet
parents: 50434
diff changeset
   192
                        predictions,predictionValues = model.predict(features,accessibles)
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   193
                    assert len(predictions) == len(predictionValues)
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   194
                    logger.info('Done. %s seconds needed.',round(time()-startTime,2))
50399
52d9720f7a48 made Python code compile again (by Daniel K.)
blanchet
parents: 50388
diff changeset
   195
                    # Output        
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   196
                    predictionNames = [str(dicts.idNameDict[p]) for p in predictions[:args.numberOfPredictions]]
50388
a5b666e0c3c2 added weights to MaSh (by Daniel Kuehlwein)
blanchet
parents: 50222
diff changeset
   197
                    predictionValues = [str(x) for x in predictionValues[:args.numberOfPredictions]]
a5b666e0c3c2 added weights to MaSh (by Daniel Kuehlwein)
blanchet
parents: 50222
diff changeset
   198
                    predictionsStringList = ['%s=%s' % (predictionNames[i],predictionValues[i]) for i in range(len(predictionNames))]
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   199
                    predictionsString = string.join(predictionsStringList,' ')
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   200
                    outString = '%s: %s' % (name,predictionsString)
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   201
                    OS.write('%s\n' % outString)
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   202
                else:
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   203
                    logger.warning('Unspecified input format: \n%s',line)
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   204
                    sys.exit(-1)
50399
52d9720f7a48 made Python code compile again (by Daniel K.)
blanchet
parents: 50388
diff changeset
   205
                lineCounter += 1
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   206
            """
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   207
            except:
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   208
                logger.warning('An error occurred on line %s .',line)
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   209
                lineCounter += 1
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   210
                continue
50388
a5b666e0c3c2 added weights to MaSh (by Daniel Kuehlwein)
blanchet
parents: 50222
diff changeset
   211
            """
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   212
        OS.close()
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   213
        IS.close()
50388
a5b666e0c3c2 added weights to MaSh (by Daniel Kuehlwein)
blanchet
parents: 50222
diff changeset
   214
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   215
        # Statistics
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   216
        if args.statistics:
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   217
            stats.printAvg()
50388
a5b666e0c3c2 added weights to MaSh (by Daniel Kuehlwein)
blanchet
parents: 50222
diff changeset
   218
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   219
        # Save
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   220
        if args.saveModel:
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   221
            model.save(modelFile)
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   222
        dicts.save(dictsFile)
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   223
        if not args.saveStats == None:
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   224
            statsFile = os.path.join(args.outputDir,args.saveStats)
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   225
            stats.save(statsFile)
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   226
    return 0
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   227
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   228
if __name__ == '__main__':
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   229
    # Example:
50434
960a3429615c more MaSh tweaking -- in particular, export the same facts in "MaSh_Export" as are later tried in "MaSh_Eval"
blanchet
parents: 50399
diff changeset
   230
    # Jinja
960a3429615c more MaSh tweaking -- in particular, export the same facts in "MaSh_Export" as are later tried in "MaSh_Eval"
blanchet
parents: 50399
diff changeset
   231
    #args = ['-l','testIsabelle.log','-o','../tmp/','--statistics','--init','--inputDir','../data/Jinja/','--predef']
960a3429615c more MaSh tweaking -- in particular, export the same facts in "MaSh_Export" as are later tried in "MaSh_Eval"
blanchet
parents: 50399
diff changeset
   232
    #args = ['-i', '../data/Jinja/mash_commands','-p','../tmp/testIsabelle.pred','-l','testIsabelle.log','--predef','-o','../tmp/','--statistics','--saveStats','../tmp/natATPMP.stats']
960a3429615c more MaSh tweaking -- in particular, export the same facts in "MaSh_Export" as are later tried in "MaSh_Eval"
blanchet
parents: 50399
diff changeset
   233
    #args = ['-l','testNB.log','-o','../tmp/','--statistics','--init','--inputDir','../data/Jinja/']
960a3429615c more MaSh tweaking -- in particular, export the same facts in "MaSh_Export" as are later tried in "MaSh_Eval"
blanchet
parents: 50399
diff changeset
   234
    #args = ['-i', '../data/Jinja/mash_commands','-p','../tmp/testNB.pred','-l','../tmp/testNB.log','--nb','-o','../tmp/','--statistics','--saveStats','../tmp/natATPNB.stats','--cutOff','500']
50399
52d9720f7a48 made Python code compile again (by Daniel K.)
blanchet
parents: 50388
diff changeset
   235
    # List
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   236
    #args = ['-l','testIsabelle.log','-o','../tmp/','--statistics','--init','--inputDir','../data/List/','--isabelle']
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   237
    #args = ['-i', '../data/List/mash_commands','-p','../tmp/testIsabelle.pred','-l','testIsabelle.log','--isabelle','-o','../tmp/','--statistics']
50399
52d9720f7a48 made Python code compile again (by Daniel K.)
blanchet
parents: 50388
diff changeset
   238
    # Huffmann
52d9720f7a48 made Python code compile again (by Daniel K.)
blanchet
parents: 50388
diff changeset
   239
    #args = ['-l','testNB.log','-o','../tmp/','--statistics','--init','--inputDir','../data/Huffman/','--depFile','mash_atp_dependencies']
52d9720f7a48 made Python code compile again (by Daniel K.)
blanchet
parents: 50388
diff changeset
   240
    #args = ['-l','testNB.log','-o','../tmp/','--statistics','--init','--inputDir','../data/Huffman/']
52d9720f7a48 made Python code compile again (by Daniel K.)
blanchet
parents: 50388
diff changeset
   241
    #args = ['-i', '../data/Huffman/mash_commands','-p','../tmp/testNB.pred','-l','testNB.log','--nb','-o','../tmp/','--statistics']
52d9720f7a48 made Python code compile again (by Daniel K.)
blanchet
parents: 50388
diff changeset
   242
    # Jinja
50441
1e71f9d3cd57 more changes to MaSh Python program (by Daniel K.)
blanchet
parents: 50434
diff changeset
   243
    # ISAR
50399
52d9720f7a48 made Python code compile again (by Daniel K.)
blanchet
parents: 50388
diff changeset
   244
    #args = ['-l','testNB.log','-o','../tmp/','--statistics','--init','--inputDir','../data/Jinja/']    
52d9720f7a48 made Python code compile again (by Daniel K.)
blanchet
parents: 50388
diff changeset
   245
    #args = ['-i', '../data/Jinja/mash_commands','-p','../tmp/testNB.pred','-l','../tmp/testNB.log','--nb','-o','../tmp/','--statistics','--saveStats','../tmp/JinjaIsarNB.stats','--cutOff','500']
50441
1e71f9d3cd57 more changes to MaSh Python program (by Daniel K.)
blanchet
parents: 50434
diff changeset
   246
    #args = ['-l','testIsabelle.log','-o','../tmp/','--statistics','--init','--inputDir','../data/Jinja/','--predef']
1e71f9d3cd57 more changes to MaSh Python program (by Daniel K.)
blanchet
parents: 50434
diff changeset
   247
    #args = ['-i', '../data/Jinja/mash_commands','-p','../tmp/JinjaMePo.pred','-l','testIsabelle.log','--predef','-o','../tmp/','--statistics','--saveStats','../tmp/JinjaMePo.stats']
1e71f9d3cd57 more changes to MaSh Python program (by Daniel K.)
blanchet
parents: 50434
diff changeset
   248
    #args = ['-l','testNB.log','-o','../tmp/','--statistics','--init','--inputDir','../data/Jinja/','--depFile','mash_atp_dependencies','--snow']    
1e71f9d3cd57 more changes to MaSh Python program (by Daniel K.)
blanchet
parents: 50434
diff changeset
   249
    #args = ['-i', '../data/Jinja/mash_commands','-p','../tmp/testNB.pred','-l','../tmp/testNB.log','--nb','-o','../tmp/','--statistics','--saveStats','../tmp/JinjaIsarNB.stats','--cutOff','500','--depFile','mash_atp_dependencies']
1e71f9d3cd57 more changes to MaSh Python program (by Daniel K.)
blanchet
parents: 50434
diff changeset
   250
1e71f9d3cd57 more changes to MaSh Python program (by Daniel K.)
blanchet
parents: 50434
diff changeset
   251
    # ATP
1e71f9d3cd57 more changes to MaSh Python program (by Daniel K.)
blanchet
parents: 50434
diff changeset
   252
    #args = ['-l','testNB.log','-o','../tmp/','--statistics','--init','--inputDir','../data/Jinja/','--depFile','mash_atp_dependencies']    
1e71f9d3cd57 more changes to MaSh Python program (by Daniel K.)
blanchet
parents: 50434
diff changeset
   253
    #args = ['-i', '../data/Jinja/mash_commands','-p','../tmp/testNB.pred','-l','../tmp/testNB.log','--nb','-o','../tmp/','--statistics','--saveStats','../tmp/JinjaIsarNB.stats','--cutOff','500','--depFile','mash_atp_dependencies']
1e71f9d3cd57 more changes to MaSh Python program (by Daniel K.)
blanchet
parents: 50434
diff changeset
   254
    #args = ['-l','testIsabelle.log','-o','../tmp/','--statistics','--init','--inputDir','../data/Jinja/','--predef','--depFile','mash_atp_dependencies']
1e71f9d3cd57 more changes to MaSh Python program (by Daniel K.)
blanchet
parents: 50434
diff changeset
   255
    #args = ['-i', '../data/Jinja/mash_commands','-p','../tmp/JinjaMePo.pred','-l','testIsabelle.log','--predef','-o','../tmp/','--statistics','--saveStats','../tmp/JinjaMePo.stats','--depFile','mash_atp_dependencies']
1e71f9d3cd57 more changes to MaSh Python program (by Daniel K.)
blanchet
parents: 50434
diff changeset
   256
    #args = ['-l','testNB.log','-o','../tmp/','--statistics','--init','--inputDir','../data/Jinja/','--depFile','mash_atp_dependencies','--snow']    
1e71f9d3cd57 more changes to MaSh Python program (by Daniel K.)
blanchet
parents: 50434
diff changeset
   257
    #args = ['-i', '../data/Jinja/mash_commands','-p','../tmp/testNB.pred','-l','../tmp/testNB.log','--snow','-o','../tmp/','--statistics','--saveStats','../tmp/JinjaIsarNB.stats','--cutOff','500','--depFile','mash_atp_dependencies']
1e71f9d3cd57 more changes to MaSh Python program (by Daniel K.)
blanchet
parents: 50434
diff changeset
   258
50399
52d9720f7a48 made Python code compile again (by Daniel K.)
blanchet
parents: 50388
diff changeset
   259
52d9720f7a48 made Python code compile again (by Daniel K.)
blanchet
parents: 50388
diff changeset
   260
    
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   261
    #startTime = time()
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   262
    #sys.exit(main(args))
50388
a5b666e0c3c2 added weights to MaSh (by Daniel Kuehlwein)
blanchet
parents: 50222
diff changeset
   263
    #print 'New ' + str(round(time()-startTime,2))
50220
90280d85cd03 moved MaSh's Python code into Isabelle
blanchet
parents:
diff changeset
   264
    sys.exit(main())