The WEKA datafiles for this tutorial can be found here.
Decision Trees: In order to probe decision tree you will use the Id3 classifier. Type the following command
java weka.classifiers.trees.Id3 -t PATH/weather.nominal.arff(note that the option -t calls the training file according the PATH location of this file in your machine)
Support Vector Machines: In order to probe the SVM classifier, type the following command
java weka.classifiers.functions.SMO -t PATH/weather.nominal.arff
Neural Networks: In order to probe the NNs classifier, type the following command
java weka.classifiers.functions.VotedPerceptron -t PATH/weather.nominal.arff
Naive Bayes: In order to probe the NB classifier, type the following command
java weka.classifiers.bayes.NaiveBayes -t PATH/weather.nominal.arff
java weka.classifiers.TYPE.CLASSIFIER_NAME -t PATH/weather.nominal.arff -d PATH/modelname.model
You should generate a file that contains the model; this can be named for example in the form:
weather_Id3.model
weather_SVM.model
weather_NN.model
weather_NB.model
e.g. by
java weka.classifiers.trees.Id3 -t PATH/weather.nominal.arff -d PATH/weather_Id3.model
In order to use the stored model to classify new examples, use the file "test_weather.arff" (save this file in the same folder than weather.nominal.arff and *.model files). In this file you have two examples without classification. Then classify these examples using the models previously generated in the following way:
java weka.classifiers.~.classifier_name -T PATH/test_weather.arff -l PATH/modelname.model -p 0
In this case you use the options: -T that calls a test file (test_weather.arff); and -l that call the model file to be used. Compare the results obtained using the four models generated.
Classify the examples in the "Breast_Cancer.arff" dataset (benign and malignant cells) using the four classifiers mentioned in the exercise 1, and compare the results.
NOTE: This dataset contains numerical data, so you you can not use Id3 classifier (Id3 only support nominal attributes). In this case try decision trees with J48 classifier with the following command
java weka.classifiers.trees.J48 -t PATH/Breast_Cancer.arff
Classify the examples in this dataset (ALL or AML class) using the four classifiers mentioned in the exercise 1, and compare the results.
Interpretation: Go to PubMed and search the selected genes, do they have any biological meaning? Can you identify the unknown gene function? (Try using other bioinformatics tools)