Sunteți pe pagina 1din 1

Assignment Number 1: Machine Learning Course

(Dr. S. Nadeem Ahsan)

Due Date: 11-Feb-2012

1. Run the MultilayerPerceptron classifier on the iris.arff dataset. Use the default
parameter settings, and use the training set as the test option. Include in your
submission the printed results (weights and statistics) from WEKA. In your report
draw graphically the neural network topology (input nodes, hidden nodes, output
nodes, connections) used by the classifier. Do not show the weights on your drawing.

2. WEKA's default parameter settings (among others) for MultilayerPerceptron are -L


0.3 -M 0.2 -N 500 -H a. Explain in your own words what these mean.

3. Using WEKA's experimenter application, perform the following experiment.


a. Choose a "New" experiment.
b. For the Results Destination section, select ARFF file and provide a file name
in which to store the experimental results.
c. For Experiment Type, choose the default settings: cross-validation with 10
folds and classification.
d. For Iteration Control, choose the default settings: 10 iterations and data sets
first.
e. Select the following four datasets that come with WEKA: contact-lenses, iris,
labor, and weather.
f. Select the following classifiers with default parameter settings:
ConjunctiveRule, J48 and MultilayerPerceptron.
g. Run the experiment.
h. Analyze the results by loading the ARFF results file,

4. Apply the machine learning classification technique Libsvm (MATLAB tool for
support vector machine) on the attached data set (TomacatDataForAssignement1).
You have to perform classification for bugs (last column of the given data set).
Discuss the obtain results in your own words (Hints: First convert the xls/csv data file
into libsvm format, and then train the classifier without using the first three columns of
the dataset. For classification you may replace the bug values with 0 and 1).

5. Apply the machine learning regression technique i.e., neural networks (in MATLAB)
on the attached data set (TomacatDataForAssignement1). Discuss the obtain results in
your own words (Hint: First convert the xls/csv data file into libsvm format, and then
train the neural-network without using the first three columns of the dataset).

6. Draw scatter plot and perform the correlation analysis between metrics value
(columns: 4-to-23) with the number of bugs (last column). Also perform the
correlation among the metrics columns to identify the correlated matrices. Comments,
if the data set have co-linearity.

S-ar putea să vă placă și