(Multivariate Techniques)
Reference Materials
Available articles (websites)
Lecture-notes/power-points
and the book:
Multivariate Analysis, by Joseph Hair, William Black, Barry Babin, Rolph Anderson, Ronald Dixon
Multivariate Techniques
Dependence Techniques: Multiple Regression, Discriminant Analysis, Conjoint Analysis
Interdependence Techniques: Factor Analysis, Cluster Analysis, Multidimensional Scaling
Person   a     b     c     d     e
X1       2     8     9     1     8.5
X2       4     2     3     5     1
The data in this table are plotted in the figure on the next slide.
Measures of Distance
Definition (Measures of distance):
The Euclidean distance between the two points i and j is the hypotenuse of the triangle ABC:
$D(i, j) = \sqrt{A^2 + B^2} = \sqrt{(X_{1i} - X_{1j})^2 + (X_{2i} - X_{2j})^2}$
Observation i is closer (more similar) to j than to observation k if $D(i, j) < D(i, k)$.
An alternative measure is the squared Euclidean distance:
$D^2(i, j) = A^2 + B^2 = (X_{1i} - X_{1j})^2 + (X_{2i} - X_{2j})^2$
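The two measures can be checked numerically on the five persons from the table. The following is a minimal Python sketch; the dictionary `points` and the function names `euclidean` and `squared_euclidean` are illustrative choices, not part of the slides.

```python
import math

# Coordinates (X1, X2) of the five persons from the table.
points = {"a": (2, 4), "b": (8, 2), "c": (9, 3), "d": (1, 5), "e": (8.5, 1)}

def squared_euclidean(p, q):
    """Sum of squared coordinate differences: A^2 + B^2."""
    return sum((pi - qi) ** 2 for pi, qi in zip(p, q))

def euclidean(p, q):
    """Length of the hypotenuse: sqrt(A^2 + B^2)."""
    return math.sqrt(squared_euclidean(p, q))

d_ab = euclidean(points["a"], points["b"])
d_ad = euclidean(points["a"], points["d"])

# a is closer (more similar) to d than to b because D(a, d) < D(a, b).
print(round(d_ab, 3), round(d_ad, 3))
```

Both measures rank pairs in the same order, so the squared version can be used when only comparisons matter and the square root is not needed.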
Figure 1
Nearest neighbor method, Step 1:
For example, the distance between a and b is $D(a, b) = \sqrt{(2 - 8)^2 + (4 - 2)^2} = \sqrt{40} \approx 6.325$.
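Step 1 amounts to computing the distance between every pair of observations and picking the smallest one. A short Python sketch of that search follows, assuming Python 3.8 or later for `math.dist`; the names `points`, `distances` and `closest_pair` are illustrative.

```python
import math
from itertools import combinations

# Coordinates (X1, X2) of the five persons from the table.
points = {"a": (2, 4), "b": (8, 2), "c": (9, 3), "d": (1, 5), "e": (8.5, 1)}

# Euclidean distance for every unordered pair of observations.
distances = {
    (i, j): math.dist(points[i], points[j])
    for i, j in combinations(sorted(points), 2)
}

# The pair with the smallest distance is grouped first.
closest_pair = min(distances, key=distances.get)
print(closest_pair, round(distances[closest_pair], 3))
```

With these data the closest pair comes out as b and e, at a distance of about 1.118.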
Figure 2
Figure 3
Figure 4
The groupings and the distances at which they took place are also
shown in the tree diagram (dendrogram) of Figure 5.
Figure 5 (dendrogram)
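The sequence of groupings that a dendrogram records can be reproduced with a small agglomerative loop. The sketch below is one possible Python rendering of the nearest neighbor (single linkage) rule, not the implementation used for the slides; the function name `single_linkage` is hypothetical, and ties are broken by taking the first pair found, which is an assumption.

```python
import math

# Coordinates (X1, X2) of the five persons from the table.
points = {"a": (2, 4), "b": (8, 2), "c": (9, 3), "d": (1, 5), "e": (8.5, 1)}

def single_linkage(points):
    """Return the merge history as (cluster_1, cluster_2, distance) tuples."""
    clusters = [(name,) for name in sorted(points)]
    history = []
    while len(clusters) > 1:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                # Nearest neighbor rule: the distance between two clusters is
                # the smallest distance between any two of their members.
                d = min(math.dist(points[p], points[q])
                        for p in clusters[i] for q in clusters[j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        d, i, j = best
        history.append((clusters[i], clusters[j], d))
        merged = clusters[i] + clusters[j]
        # Replace the two merged clusters with their union.
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    return history

history = single_linkage(points)
for left, right, d in history:
    print("".join(left), "+", "".join(right), "at distance", round(d, 3))
```

Each printed line corresponds to one junction of the dendrogram: the two clusters joined and the height at which they join.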
The four clusters remaining after Step 1 and the distances between these
clusters are shown in Figure 6(a).
Figure 6
In Step 2, the nearest clusters (a) and (d) are grouped into cluster (ad).
The remaining steps are similarly executed.
Figure 7
Thus, the centroid of cluster 1 is C1 (3.67, 3.67) and that of cluster 2 is
C2 (8.75, 2) in Figure 7(b).
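A centroid is the coordinate-wise mean of the observations in a cluster. The Python sketch below assumes that cluster 1 contains a, b and d and that cluster 2 contains c and e; the slides do not state the memberships here, but this assignment reproduces the centroids quoted above. The function name `centroid` is illustrative.

```python
# Coordinates (X1, X2) of the five persons from the table.
points = {"a": (2, 4), "b": (8, 2), "c": (9, 3), "d": (1, 5), "e": (8.5, 1)}

def centroid(members):
    """Mean of each coordinate over the cluster's members."""
    xs = [points[m][0] for m in members]
    ys = [points[m][1] for m in members]
    return (sum(xs) / len(xs), sum(ys) / len(ys))

# Assumed memberships (not stated in the slides), chosen because they
# reproduce the centroids C1 and C2 quoted in the text.
c1 = centroid(["a", "b", "d"])
c2 = centroid(["c", "e"])

print(tuple(round(v, 2) for v in c1), c2)
```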
Obs.     a     b     c     d
b        0
c        1     0
d        1     0.5   0
         0     1     1     0