Jussi Tohka
Outline
Unsupervised classification problem
The K-means algorithm
The EM-algorithm
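An algorithm to estimate the unknown cluster centres (means) based on the data.
Aims to minimize
$$J(\mu_1, \dots, \mu_K) = \sum_{i=1}^{K} \sum_{x_j \in D_i} \| x_j - \mu_i \|^2,$$
where $D_i$ is the set of samples $x_j$ assigned to cluster $i$ according to the nearest cluster centre $\mu_i$.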
The cost function is not suitable for many problems, e.g. for separating clusters of different sizes. A generalization: fuzzy K-means clustering.
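The K-means iteration described above can be sketched in a few lines of Python. This is a minimal illustration, not the implementation used for the slides: the function names `kmeans` and `squared_distance`, the random choice of initial centres from the data, and the stopping rule (stop when the assignments no longer change) are assumptions made for the example.

```python
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two points given as sequences."""
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b))


def kmeans(points, k, max_iter=100, seed=0):
    """Alternate nearest-centre assignment and mean updates.

    Returns (centres, labels, cost), where cost is the sum of squared
    distances from each point to the centre of its assigned cluster.
    """
    rng = random.Random(seed)
    # Assumption: initial centres are k distinct samples drawn from the data.
    centres = [list(p) for p in rng.sample(points, k)]
    labels = None
    for _ in range(max_iter):
        # Assignment step: each point goes to its nearest centre.
        new_labels = [
            min(range(k), key=lambda i: squared_distance(p, centres[i]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments unchanged, so the centres would not move
        labels = new_labels
        # Update step: each centre becomes the mean of its assigned points.
        for i in range(k):
            members = [p for p, lab in zip(points, labels) if lab == i]
            if members:  # an empty cluster keeps its previous centre
                centres[i] = [sum(c) / len(members) for c in zip(*members)]
    cost = sum(squared_distance(p, centres[lab]) for p, lab in zip(points, labels))
    return centres, labels, cost


if __name__ == "__main__":
    data = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
    centres, labels, cost = kmeans(data, k=2)
    print(centres, labels, cost)
```

Each of the two steps can only lower the cost or leave it unchanged, which is why the loop terminates. The result is a local minimum of the cost and may depend on the initial centres.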
[Figures: successive K-means iterations on an example data set, ending with "Convergence!"]
[Figures: example data sets shown in two panels, "True labels" and "K-means result" (labelled "Clustering result" for the first data set)]
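The likelihood of the samples $x_1, \dots, x_n$ under a finite mixture of normal densities is
$$\prod_{j=1}^{n} \sum_{i=1}^{K} P(\omega_i)\, p(x_j \mid \mu_i, \Sigma_i),$$
where $p(x \mid \mu_i, \Sigma_i)$ is the normal density with the mean $\mu_i$ and covariance matrix $\Sigma_i$, and $P(\omega_i)$ is the prior probability of the class $\omega_i$.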
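As a small illustration of the mixture likelihood, the sketch below evaluates its logarithm for one-dimensional data. It assumes the usual form of the mixture density, a sum of normal densities weighted by the class priors. The function names `normal_pdf` and `mixture_log_likelihood` and the restriction to one dimension (a variance instead of a covariance matrix) are choices made for this example only.

```python
import math


def normal_pdf(x, mean, var):
    """Density of a one-dimensional normal distribution at x."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def mixture_log_likelihood(data, priors, means, variances):
    """Log-likelihood of the data under a mixture of normal densities.

    Each sample contributes log(sum_i P_i * N(x | mean_i, var_i)).
    """
    total = 0.0
    for x in data:
        density = sum(
            p * normal_pdf(x, m, v) for p, m, v in zip(priors, means, variances)
        )
        total += math.log(density)
    return total


if __name__ == "__main__":
    data = [-2.0, -2.1, 2.0, 2.1]
    # Parameters that match the two groups score higher than a poor fit.
    print(mixture_log_likelihood(data, [0.5, 0.5], [-2.0, 2.0], [1.0, 1.0]))
    print(mixture_log_likelihood(data, [0.5, 0.5], [0.0, 0.0], [1.0, 1.0]))
```

Working with the logarithm turns the product over samples into a sum, which avoids numerical underflow for larger data sets.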
Maximization of the above likelihood can be done with an EM-algorithm. This algorithm is also described in the book but it is not called an EM-algorithm there.
1. Initialize $\mu_i^0$, $\Sigma_i^0$ and $P^0(\omega_i)$ for each class $\omega_i$.
2. (E-step) Compute the probabilities $p_{ij}^{t+1}$ of the sample $x_j$ belonging to the class $\omega_i$ based on the parameter values from the previous iteration.
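$$p_{ij}^{t+1} = \frac{P^t(\omega_i)\, p(x_j \mid \mu_i^t, \Sigma_i^t)}{\sum_{k=1}^{K} P^t(\omega_k)\, p(x_j \mid \mu_k^t, \Sigma_k^t)}$$
3. (M-step) Update the parameter values using these probabilities as weights, where $n$ is the number of samples:
$$P^{t+1}(\omega_i) = \frac{1}{n} \sum_{j=1}^{n} p_{ij}^{t+1}, \qquad \mu_i^{t+1} = \frac{\sum_{j=1}^{n} p_{ij}^{t+1} x_j}{\sum_{j=1}^{n} p_{ij}^{t+1}},$$
$$\Sigma_i^{t+1} = \frac{\sum_{j=1}^{n} p_{ij}^{t+1} (x_j - \mu_i^{t+1})(x_j - \mu_i^{t+1})^T}{\sum_{j=1}^{n} p_{ij}^{t+1}}$$
4. Repeat steps 2 and 3 until convergence.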
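The E-step and the parameter update that follows it can be sketched for one-dimensional data as below. This is a hedged illustration under stated assumptions: the update formulas are the standard ones for a mixture of normal densities, the function name `em_gmm_1d` is invented for the example, the fixed number of iterations stands in for a proper convergence test, and the small lower bound on the variance is an added safeguard that is not part of the algorithm itself.

```python
import math


def normal_pdf(x, mean, var):
    """Density of a one-dimensional normal distribution at x."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def em_gmm_1d(data, means, variances, priors, n_iter=50):
    """EM for a one-dimensional mixture of normal densities.

    The arguments means, variances and priors are the initial values.
    Returns the estimated (priors, means, variances).
    """
    means, variances, priors = list(means), list(variances), list(priors)
    n, k = len(data), len(means)
    for _ in range(n_iter):
        # E-step: probability of each sample belonging to each class,
        # computed from the parameter values of the previous iteration.
        resp = []
        for x in data:
            weighted = [
                priors[i] * normal_pdf(x, means[i], variances[i]) for i in range(k)
            ]
            total = sum(weighted)
            resp.append([w / total for w in weighted])
        # M-step: re-estimate the parameters with these probabilities as weights.
        for i in range(k):
            weight = sum(r[i] for r in resp)
            priors[i] = weight / n
            means[i] = sum(r[i] * x for r, x in zip(resp, data)) / weight
            var = sum(r[i] * (x - means[i]) ** 2 for r, x in zip(resp, data)) / weight
            variances[i] = max(var, 1e-6)  # assumption: floor to avoid collapse
    return priors, means, variances


if __name__ == "__main__":
    data = [-0.2, 0.0, 0.2, 9.8, 10.0, 10.2]
    print(em_gmm_1d(data, means=[1.0, 8.0], variances=[4.0, 4.0], priors=[0.5, 0.5]))
```

Unlike K-means, each sample contributes to every class in proportion to its membership probability, and each class has its own variance and prior, which lets the model fit clusters of different size and spread.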
EM versus K-means
[Figures: for each example data set, panels "True labels", "K-means result" and "EM result"]
Application: MRI-segmentation
[Figure: panels "Ground truth", "EM, T1+T2" and "K-means, T1+T2"]
[Figure: panels "Ground truth", "EM, T1+T2" and "EM, only T1"]