Documente Academic
Documente Profesional
Documente Cultură
T – playing chess
P – percentage of wins
E – 1000 recorded whole games
Concept learning
Decision tree learning
Neural networks
Genetic algorithms
Instance based learning
Bayesian learning
Clustering
W(aikato) E(nvironment)
for K(nowlegde) A(nalysis)
Explorer
Experimenter
Knowledge flow
Simple Command Line Interface
Java interface
Train set
Test set
Train set
Test set
The amount of the data is ‘enough’
Test Set
1/3
2/3
Train Set
Train set
Test set
The amount of the data is limited
Cross
Validation
0 24 0 0 1 ski 0 96 0 0 4 ski
0 0 24 0 1 scuba 0 0 96 0 4 scuba
0 0 0 25 0 ml 0 0 0 100 0 ml
Unsupervised learning
Unsupervised learning
Implies metric to calculate the ‘similarity’
between the instances.
Classifier
classifyInstance
or
distributionForInstance
Classifier
buildClassifier
routine which evaluates the
classifyInstance generated model on an unseen test
or dataset
distributionForInstance
Classifier
buildClassifier
classifyInstance
or
a routine which generates a
distributionForInstance
probability distribution for all classes
Used filters
StringToWordVector
NumericToBinary
StringToWordVector with IDFTransform option
Attribute Selection method
ChiSquaredAttributeEval
Used classifiers
J48( C4.5)
Naive Bayes
IBk (kNN)
Results
J48 NB 1NN 3NN