Documente Academic
Documente Profesional
Documente Cultură
N
COMPONENTE PRINCIPALE
IIN
NT
TR
RO
OD
DU
UC
CE
ER
RE
E
Studiem cu ajutorul acestei metode un tabel indivizi x variabile, n cazul
n care toate variabilele sunt numerice.
3UH]HQW P PDL vQWkL R DERUGDUH H[SORUDWRDUH FDUH SHUPLWH GHVFULHUHD
lor.
(VWHGH DVHPHQL SRVLELO RE LQHUHD XQHL UHSUH]HQW UL VLPXOWDQH SH R KDUW
DLQGLYL]LORULDYDULDELOHORU
0DLPXOWHVWHQHFHVDUV FRPSOHW PUHSUH]HQWDUHDJUDILF DGDWHORUFXR
tipologie a indivizilor.
3UH]HQW P GH DVHPHQL PHWRGD GH FODVLILFDUH DVFHQGHQW LHUDUKLF FDUH
IRORVHWHFULWHULXOOXL:DUGIRDUWHELQHDGDSWDWODWUDWDUHDGDWHORUQXPHULFH
UHSUH]LQW
PRGHOH
GH
PDLQL
GLQ
DQXO
LDU
FRORDQHOH
LPH
Tabelul 1
Nr. Crt.
Model
Cilindree
Putere
9LWH]
Greutate
Lungime
LPH
Putere
Cilindree
Greutate
Lungime
LPH
GH
3HXJHRW 5DOO\H 6HDW ,EL]D 6;L L &LWURsQ$; 6SRUW DX YLWH]H
PDULLSXWHUHPDUHvQU
1LVVDQ9DQHWWHL9:&DUDYHOOHVHFDUDFWHUL]HD] SULQYLWH]HPLFL
Figura 2
Graficele n stea pentru indivizi
5HQDXOWDUHRSXWHUHPLF vQUDSRUWFXFLOLQGUHHDVD$FHVWDHVWH
un diesel.
QJHQHUDOPXO LPHDFDUDFWHULVWLFLORUHYROXHD] vQDFHODLVHQV
*UDILFHOHvQVWHDFUHVFUHJXODWGHOD PDLQLOHPLFLSUHFXP)RUG)LHVWDL
)LDW8QRODFHOHPDLPDULSUHFXP%09,5RYHULL5HQDXOW
Media
Dispersia
Abaterea
medie
Minim
Maxim
Greutate
Lungime
S WUDWLF
&RUHOD LL
Variabile
Cilindree
Putere
9LWH]
-1),
GHRDUHFH HVWH YRUED GH R DQDOL] JHRPHWULF D GDWHORU L QX H[LVW LQIHUHQ
VWDWLVWLF
6WDWLVWLFD LQIHUHQ LDO VWXGLD] XQ HDQWLRQ L WUDJH FRQFOX]LL SHQWU
vQWUHDJDPXO LPH
3HQWUX D DYHD R YL]LXQH FRPSOHW D GDWHORU L D LQWHU UHOD LLORU vQWUH
Figura 3
*UDILFXOFRUHOD LLORULQWHU
-variabile
F
H[LVW XQ IDFWRU P ULPH WDOLH L F OD R SULP DQDOL] PDLQLOH SRW IL
RUGRQDWH GH OD FHOH PDL PLFL OD FHOH PDL PDUL $FHDVWD VH YHGH GH DOWIHO L
LHUDUKLF
DVFHQGHQW
D
YDULDELOHORU
OXkQG
GUHSW
LQGLFH
-o
GH
pozitive).
'DF FRUHOD LLOH YDULDELOHORU DX YDORUL QHJDWLYH OX P vQ FRQVLGHUD LH
0.894.
QDSDWUDHWDS /
&RU/
LPHDvQWkOQHWHJUXSXO&LOLQGUHH*UHXWDWH/XQJLPH
LPH/XQJLPH
L vQ VIkULW vQ D FLQFHD HWDS FHOH GRX JUXSXUL 3XWHUH 9LWH] L
&LOLQGUHH*UHXWDWH/XQJLPH/
LPHIX]LRQHD]
YDULDELOHOHXQXLJUXSLFHOHDOHFHOXLODOWJUXSvQPRPHQWXOUHJUXS ULL
Figura 4
&ODVLILFDUHDLHUDUKLF
DVFHQGHQW
DYDULDELOHORU
0HWRGDFRUHOD LLORUPD[LPH
WXWXURU
YDULDELOHORU
LQFOXVLY
HD
vQV L
XWLOL]kQG
S WUDWHOH
1
4.29
(1 + 0.8612 + 0.6932 + 0.905 2 + 0.864 2 + 0.709 2 ) =
= 0.715
6
6
ia
7DEHOXO FRQ LQH VLPLODULWDWHD ILHF UHL YDULDELOH FX vQWUHDJD PXO LPH D
variabilelor:
Tabelul 3
lelor.
GHFHOHODOWH
izi x
xij
LPHDFDUDFWHULVWLFLORU
X j pentru individul i,
pentru individul i, x j , s 2j =
s j PHGLDGLVSHUVLDLDEDWHUHDPHGLHS
WUDWLF DYDULDELOHL
1 n
( xij x j ) 2
n i =1
Xj.
3ULQFLSDOH
IUDQFH]H
L
e n acest cadru
geometric.
-
UH]XOWDWHOHSURJUDPHORUIUDQFH]HGHDQDOL] vQFRPSRQHQWHSULQFLSDOH
FRUHVSXQGDFHVWHLDERUG UL
&ULWHULXO LQHU LHL HVWH vQ DFHODL WLPS PXOW PDL FRPSOH[ GHFkW FHOHODOWH
GRX FULWHULL SURSXVH GH +RWHOOLQJ FULWHULXO FRUHOD LHL L FULWHULXO
dispersiei.
3UH]HQWDUHD$&3FRQIRUPDERUG
ULLJHRPHWULFHDOXL3HDUVRQ
1RUXOGHSXQFWHDVRFLDWGDWHORULFDUDFWHULVWLFLO
e sale
xi de
caracteristici ( xi1 ,.....xip ) ale individului i este considerat drept un punct ntr-un
VSD LXFXSGLPHQVLXQL
&HQWUXOGHJUHXWDWHDOQRUXOXL1HVWHSXQFWXOJDOHF UXLFRRUGRQDWHVXQW
1 n
xi = ( x1 ,...., x j ,.., x p ) = x .
n i =1
I (N , g) =
1 n p
( xij x j ) 2 .
n i =1 j =1
,QHU LD WRWDO SRDWH IL FDOFXODW GLUHFW ILLQG HJDO FX VXPD GLVSHUVLLO
or
YDULDELOHORUGLQSUREOHP
I (N , g) =
1 n 2
1 n p
d
(
x
,
g
)
=
i
( xij x j ) 2 =
n i =1
n i =1 j =1
p
1 n
2
(
x
x
)
=
s 2j
ij
j
n
j =1
i =1
j =1
=
2E LQHPSHQWUXH[HPSOX
I(N,g)=267072+1441+609+50824+1638+56=321640.
6HREVHUY F LQHU LDQRUXOXLVHGDWRUHD] vQSULQFLSDOFLOLQGUHHL
$FHDVWD GLQ FDX]D DOHJHULL XQLW
urat
FLOLQGUHHD vQ OLWUL LPSRUWDQ D H[DJHUDW D FLOLQGUHHL vQ FDOFXOXO LQHU LHL DU IL
GLVS UXW
Q SUDFWLF DGHVHRUL HVWH SUHIHUDELO V RE LQHP R GHVFULHUH D GDWHORU
LQGHSHQGHQW GHDOHJHUHDXQLW
LORU
X *j =
X j xj
sj
X j L VH DVRFLD]
GHPHGLHLGLVSHUVLH
1RXOWDEHOVWXGLDWHVWHIRUPDWGLQFDQWLW
xij* =
LOH
xij x j
sj
QXP UXOSDOYDULDELOHORU
3ULPDD[
SULQFLSDO
LSULPDFRPSRQHQW
SULQFLSDO
norului de puncte N * .
3ULPDD[
SULQFLSDO
1 V
LQHU LHL
I ( N * , ) norului N * UDSRUWDW
1 n
I ( N * , ) = d 2 ( xi* , yi ) unde yi
n i =1
*
punctului xi pe dreapta .
Dreapta 1 FDXW
a norului N * .
V PLQLPL]H]H
HVWH
SURLHF LD
RUWRJRQDO
I ( N * , ) LVHQXPHWHSULPDD[
P ( xi* )
SULQFLSDO
YHFWRUSURSULXQRUPDWDOPDWULFHL5DFRUHOD LLORUvQWUHYDULDELOHOH
u1 ,
X j , asociat
1 HVWHYL]XDOL]DW
vQILJXUD
Tabelul 4
9DORULLYHFWRULSURSULLDLPDWULFHLGHFRUHOD LL
Figura 5
&
XWDUHDSULPHLD[HSULQFLSDOH
3HQWUXH[HPSOXOFXPDLQLOHDPRE LQXW
1 = 4.6745
u1 = (0.4434;0.4182;0.3497;0.4252;0.4246;0.3811).
3ULPDFRPSRQHQW
SULQFLSDO
Prima FRPSRQHQW
SULQFLSDO
Y1 HVWH R QRX
xi* pe axa
FXSURGXVXOVFDODUvQWUHYHFWRULL
x ij x j
j =1
sj
Y1 (i ) = Oy i = u1 j (
u1 L xi* :
FX
Y1 (Rover)=0.44*1.49+0.41*1.67+0.34*1.58+0.43*1.13+0.43*1.17+0.38*0.83=3.19
Global, Y1 se scrie deci:
Y1 = 0.44Cilindree* + 0.41Putere * + 0.34Viteza * +
+ 0.43Greutate* + 0.43Lungime* + 0.38Latime* .
WUDWHOHGLVWDQ HORUSkQ
LS
Tabelul 5
ODRULJLne, componentele principale
WUDWHOHFRVLQXVXULORU
Y1 HVWH FHQWUDW
variabile centrate.
6HSRDWHDU WDF GLVSHUVLDVDHVWHHJDO FX
1 :
1
1
Y12 (i ) = d 2 ( y i ,0) = I ({ y1 ,...., y n },0) = 1 .
n i =1
n i =1
Dispersia primei componente principale Y1 HVWH HJDO FX LQHU
Dispersie (Y1 ) =
LD QRUXOXL
Xj
L FRUHVSRQGHQ D SULQFLSDO
Y1 pot fi
Y1 ID
HJDO FX
1 p
cor 2 ( X j , Y1 ) = 1
p j =1
p
4.656
= 0.776 comparabil cu 0.715 al
6
X j L Y1 DSDUvQSULPDFRORDQ
DWDEHOXOXL
Tabelul 6.
&RUHOD LLYDULDELOH
-componente principale
PULPD FRPSRQHQW
SULQFLSDO
YDULDELOHOH HD SRDWH IL LQWHUSUHWDW FD XQ IDFWRU GH P ULPH FODVkQG PDLQLOH
de la cele mai mici ( Y1 (Fiat Uno)= -3.76; Y1 (Ford Fiesta)= - 3.50) la cele mai
mari ( Y1 (Renault 25)=3.44; Y1 (BMV530i)=3.95).
&DOLWDWHDJOREDO DSULPHLFRPSRQHQWHSULQFLSDOH
3HQWUX D P VXUD FDOLWDWHD JOREDO D SULPHL FRPSRQHQWH SULQFLSDOH
FRQVLGHUDW FD UH]XPDW DO GDWHORU VH IRORVHWH IRUPXOD GH GHVFRPSXQHUH D
LQHU LHLWRWDOH
D YHFWRUXOXL
xi* pe dreapta 1 ,
1 n 2 *
1 n 2
1 n 2 *
d
(
x
,
0
)
=
d
(
y
,
0
)
+
d ( xi , yi )
i
i n
n i =1
n i =1
i =1
,QHU LDWRWDO
I ( N * ,0) =
1 n 2 *
d ( xi ,0) = p
n i =1
VHGHVFRPSXQHGHFLvQGRX S U L
1 n 2
d ( yi ,0) = I ({ y1 ,...., yn },0) UHSUH]LQW LQHU LD WRWDO D
n i =1
norului { y1 ,...., yn } D SURLHF LLORU SXQFWHORU xi* pe axa 1 $FHDVW
primul termen
1 LHVWHHJDO
1
d 2 ( xi* , yi ) = I ( N * , 1 ) UHSUH]LQW
n i =1
a norului n jurul axei 1
al doilea termen
PentruH[HPSOXOFXPDLQLOHRE LQHP
LQHU LDWRWDO S
LQHU LDH[SOLFDW GH 1 = 1 =4.656
LQHU LDUH]LGXDO S- 1 =1.344
FX
1
p
GHPXO LPHDGHYDULDELOH
QH[HPSOXSDUWHDGHLQHU LHH[SOLFDW GH
1 HVWHHJDO
FX
4.656
= 0.776. Se
6
ULLLQGLYL]LORUSHSULPDD[
SULQFLSDO
Y1
1 VH P VRDU
*
WUDWXOXLFRVLQXVXOXLXQJKLXOXLIRUPDWGHYHFWRUXO xi cu axa 1 :
cos 2 ( xi* , 1 ) =
FX D
jutorul
d 2 ( yi ,0)
Y1 (i ) 2
=
.
d 2 ( xi* ,0) d 2 ( xi* ,0)
Y1 ( Rover ) = 3.19
d 2 ( Rover,0) = 1.49 2 + 1.67 2 + 1.582 + 1.132 + 1.17 2 +
+ 0.832 = 10.8
cos 2 ( Rover , 1 ) =
10.18
= 0.94
10.80
5RYHUHVWHELQHUHSUH]HQWDWSHD[DSULQFLSDO
1 .
3 WUDWHOH GLVWDQ HORU ILHF UXL LQGLYLG OD RULJLQH L S WUDWHOH FRVLQXVXULORU
SULQFLSDO
LDGRXDFRPSRQHQW
SULQFLSDO
LOH FHOHL GH
-a doua componente
principale.
$GRXDD[
SULQFLSDO
6HFDXW RD[
2 RUWRJRQDO
FX
1 LFDUHV
PLQLPL]H]HLQHU LD
I ( N * , ) .
GH
vectorul u2 YHFWRU SURSULX QRUPDW GLQ PDWULFHD GH FRUHOD LL 5 DVRFLDW OD D
doua cea mai mare valoare proprie 2 .
Valoarea proprie 2 L YHFWRUXO SURSULX u2 SHQWUX H[HPSOXO FX PDLQLOH
se DIO vQ 7DEHOXO & XWDUHD FHOHL GH-a doua axe principale 2 este
YL]XDOL]DW vQ)LJXUD
Figura 6
&
6 QRW P FX
XWDUHDFHOHLGH
zi L ai SURLHF
LLOH SXQFWXOXL
deducem:
(1)
unde
1 n 2 *
d ( xi , ai )
n i =1
*
LDQRUXOXL N
n raport cu planul (1 , 2 ) . 6HSRDWHGHPRQVWUDF
I ( N * , (1 , 2 )) =
HVWHLQHU
I ( N , (1 , 2 )) HVWH PLQLP
*
posibile.
Planul (1 , 2 ) VH QXPHWH SULPXO SODQ SULQFLSDO (VWH SODQXO FDUH WUHFH
cel mai bine posibil prin mijlocul norului N * vQVHQVXOFULWHULXOXLLQHU LHL
$GRXDFRPSRQHQW
SULQFLSDO
Y2 HVWH R YDULDELO
OXQJLPHDDOJHEULF DVHJPHQWX
lui [0, zi ]
xij x j
j =1
sj
Y2 (i) = u2 j (
) .
Y2 HVWHFHQWUDW
LGHGLVSHUVLHHJDO FX
2 .
Putem scrie:
1 n
1 n 2
2
Y
(
i
)
=
2
d ( zi ,0) =
n i =1
n i =1
= I ({z1 ,...., z n },0) = 2
Disp (Y2 ) =
Y1 L Y2 HVWH HJDO
(2)
FXDMXWRUXOIRUPXOHL
cor ( X J , Y2 ) = 2 u 2 j
&RUHOD LLOH GLQWUH YDULDELOHOH L FRPSRQHQWD SULQFLSDO
Y2 din exemplul
nostru sunt datH vQ 7DEHOXO 3XWHP REVHUYD F Y2 HVWH FRUHODW
YDULDELOHOH
PRWRU&LOLQGUHH
3XWHUH
9LWH]
YDULDELOHOHFRQIRUW*UHXWDWH/XQJLPH/
$ GRXD FRPSRQHQW SULQFLSDO
Y2
L
FRUHODW
SR]LWLY FX
QHJDWLY
FX
LPH
DFHOHLGH DGRXDFRPSRQHQW
SULQFLSDO
LDSULPHORU
FRPSRQHQWHSULQFLSDOH
'LQ HFXD LLOH L VH GHGXFH F SDUWHD GH LQHU LH H[SOLFDW GH D GRX
(1 , 2 ) este
(1 + 2 )
.
p
n exemplu, 2 H[SOLF
H[SOLF
&DOLWDWHDUHSUH]HQW
(1 , 2 )
GLQLQHU LDWRWDO
ULLLQGLYL]LORUSHDGRXDD[
SULQFLSDO
LSHSULPXO
plan principal
&DOLWDWHDUHSUH]HQW ULLILHF UXLSXQFW
SDUWH
Pe 2 :
cos 2 ( x i* , 2 ) =
d 2 ( z i ,0)
Y2 (i ) 2
=
d 2 ( xi* ,0) d 2 ( xi* ,0)
Pe (1 , 2 ) :
cos 2 ( xi* , (1 , 2 )) =
Rezultate generale
Extinznd
PXO LPH
GH
S
D[H
SULQFLSDOH
1 ,......., p
Figura 7
Axele principale. Componentele principale
(OHUHSUH]LQW FRRUGRQDWHOH
h LQHFRUHODWHvQWUHHOH
*
i
xi* = Yh (i )u k
h =1
Formulele carH XUPHD] VXQW IRDUWH LPSRUWDQWH L VH GHGXF GLUHFW GLQ
procesul de construire al componentelor principale:
Formula de reconstituire a datelor:
p
xij* = Yh (i )u hj
(3)
h =1
)RUPXODGHUHFRQVWLWXLUHDPDWULFHLFRUHOD LLORUGLQWUHYDULDELOH
(4)
h =1
WUDWXOXLGLVWDQ HLXQXLSXQFWODRULJLQH
)RUPXODGHGHVFRPSXQHUHDS
de unde se deduce:
p
(i)
cos ( x ,
2
h =1
*
i
(ii)
h =1
&DOFXOXOFRUHOD LLORUvQWUHYDULDELOHOH
) =1
=p
X j LFRPSRQHQWHOHSULQFLSDOH Yh
cor ( X j , Yh ) = h u hj
(5)
X 1 ,...., X p
H[SOLFDW GHD[DSULQFLSDO
1
p
cor
j =1
( X j ,Yh ) =
p
p
Yh cu variabilele
h .
'LVWDQ DOXL0DKDODQRELV
3HQWUXDP VXUDGLVWDQ DGLQWUHXQLQGLYLGLFHQWUXOGHJUHXWDWHDOQRUXOXL
VHXWLOL]HD] DGHVHDGLVWDQ DOXL0DKDODQRELV
(D VH GHILQHWH vQ IHOXO XUP WRU VH FRQVWUXLHVF PDL vQWkL FRPSRQHQWHOH
principale Z h preferabil pentru datele de origine dect pentru datele centrateUHGXVH 3HQWUX DFHDVWD VH XWLOL]HD] YHFWRULL SURSULL vh din matricea
de
covariaQ D YDULDELOHORU X j L VH FDOFXOHD] YDULDELOHOH Z h cu ajutorul
formulei:
Z h (i ) =
'LVWDQ DOXL0DKDODQRELV
v
j =1
hj
( x ij x j )
x DOQRUXOXLIRUPDWGLQGDWHOHGHRULJLQHVHGHILQHWHFXDMXWRUXOIRUPXOHL
p
d ( xi , x ) = Z h* (i ) 2
2
M
h =1
5HSUH]HQW ULJUDILFH
(VWHYRUEDGHUHSUH]HQW ULJUDILFHDOHLQGLYL]LORULYDULDELOHORU
Harta indivizilor
3URLHF LLOH SXQFWHORU
Ai = (Y1 (i ), Y2 (i )) QH G
-a
OXQJXO SULPHL D[H vQ IXQF LH GH PRGHOXO ORU GH OD FHOH PDL PLFL )LDW 8QR
)RUG)LHVWDODFHOHPDLPDUL5HQDXOW%09LLGH
D GRXD D[H vQ IXQF LH GH FDUDFWHULVWLFD ORU GH OD PDLQLOH IDPLOLDOH 1LVV
Figura 8
3ULPXOSODQSULQFLSDOLFHUFXOFRUHOD LLORU
Harta variabilelor
Variabilele sunt reprezentate ntr-un plan cu ajutorul punctelor:
B j = ( cor ( X j , Y1 ), cor ( X j ,Y2 )) Se RE LQH UHSUH]HQWDUHD JUDILF GLQ )LJXUD
QXPLW FHUFXOGHFRUHOD LL
(VWH YL]XDOL]DW ELQH IDSWXO F SULPD FRPSRQHQW SULQFLSDO FRUHODW
SR]LWLYFXWRDWHYDULDELOHOHSUREOHPHLHVWHXQIDFWRUGHWDOLHP ULPHLF
DGRXDFRPSRQHQW SULQFLSDO RSXQkQG9LWH]D3XWHUHOD/
LPH/XQJLPH
RE LQH
Variabile
Cilindree
Putere
9LWH]
Greutate
Lungime
/
LPH
Rj
0.96
0.98
0.97
0.92
0.97
0.93
7RDWHYDULDELOHOHVXQWELQHUHSUH]HQWDWHSHFHUFXOGHFRUHOD LL
3DUWHD GH LQHU LH H[SOLFDW GH SULPXO SODQ SULQFLSDO ILLQG IRDUWH PDUH
FRUHOD LLOHvQWU
cor ( X j , X l ) = h u hj u hl
FDUHGHYLQHGDF LQHPFRQWGH
h =1
X j L X l
SULQ
, FRUHOD
LDvQWUH
X j L X l se scrie aproximativ:
FHUFXO GH FRUHOD LL vQ IXQF LH GH OXQJLPHD YHFWRULORU
YDULDELOH L D
FRVLQXVXULORUXQJKLXULORUGLQWUHDFHWLYHFWRUL
6H SRDWH YHULILFD GH H[HPSOX F GHQGRJUDPD GLQ )LJXUD H[SULP ELQH
SR]L LDYHFWRU
ilor-YDULDELOHGLQFHUFXOGHFRUHOD LLXQLLvQUDSRUWFXFHLODO L
Biplotul
Lundu-QH FkWHYD SUHFDX LXQL vQ FHHD FH SULYHWH VFDUD GH UHSUH]HQWDUH
HVWH SRVLELO V VXSUDSXQHP FHOH GRX JUDILFH SULPXO SODQ SULQFLSDO L FHUFXO
GHFRUHOD LLRE LQkQGDVWIHORUHSUH]HQWDUHvPERJ
LW
xij* = Yh (i )uhj
h =1
de reconstituLUHDGDWHORUSHUPLWHRE LQHUHDXQHLEXQHDSUR[LP
ULDSXQFWHORU
*
ij XWLOL]kQGGRDUSULPHOHGRX GLPHQVLXQL
*
xij* = Yh (i )u hj Notnd Yh =
h =1
utiliznd faptulF
Yh
h
FRPSRQHQWD SULQFLSDO
IRUPXO GHYLQH
Yh
UHGXV L
2675 1906.1
= 1.49 bine reconstituit prin
516.79
Y1* ( Rover )cor (Cilindree, Y1 ) + Y2* ( Rover ) cor (Cilindree, Y2 ) =
1
1
3.19 0.96 +
0.77 0.03 = 1.44.
4.656
0.9152
produsul scalar dintre vectorii Ai* = (Y1* (i ),Y2* (i )) L B j = ( cor ( X j , Y1 ), cor ( X j ,Y2 ))
*
1RW P Pij SURLHF LDYHFWRUXOXL Ai pe axa ( B j ) JHQHUDW GHYHFWRUXO B j .
$FHVWHQRWD LLVXQWYizualizate n Figura 9.
/XQJLPHDDOJHEULF
OPij =
OPij
HVWHHJDO FX
Y (i)cor(X ,Y )
*
h
Figura 9
3XQFWHLQGLYL]LLD[HYDULDELOH
1XPLWRUXOILLQGHJDOFXFRUHOD LDPXOWLSO
R j ntre X j LSULPHOHGRX
D[H
Q)LJXUDDPFRQVWUXLWELSORWXOUHSUH]HQWDUHDVLPXOWDQ DLQGLYL]LORUL
DYDULDELOHORUvQIHOXOXUP WRU
Figura 10
%LSORWUHSUH]HQWDUHDVLPXOWDQ
DLQGLYL]LORULDYDULDELOHORU
$VWIHO VH SRDWH YHULILFD IDSWXO F SURLHF LD PDLQLORU SH D[D 9LWH]
UHVWLWXLHELQHUHSDUWL LDGDWHORUGHSOHFDUHSURLHF LLOHPDLQLORUFHOHPDLUDSLGH
(BMW 530i, Renault 25, Audi 90 Quatro) se opun bine la cele mai lente (Ford
Fiesta, Nissan Vanette, Fiat Uno, VW Caravelle).
'H DVHPHQHD SURLHF LLOH PDLQLORU SH D[D /
componentelor principaOHGDUVHSLHUGHvQDFHVWFD]GLPHQVLXQHDJHRPHWULF
a problemei.
9RPSUH]HQWDFULWHULXOFRUHOD LHLDSRLDOGLVSHUVLHL
&ULWHULXOFRUHOD LHL
6H FDXW P YDULDELOH
maximizeze criteriul :
F1,....., Fm centrate-UHGXVH
L QHFRUHODWH FDUH V
[ p cor
h =1
j =1
( X j , Fh )]
(7)
X 1 ,....., X p
printr-XQ QXP U PDL PLF GH YDULDELOH F1,....., Fm QHFRUHODWH vQWUH HOH L FDUH V
reprezinte principalele dimensiuni ale fenomenului studiat.
Fh = Yh* =
Yh
PD[LPXOXLHVWHHJDO FX
(1 + .... + m ) / p .
Criteriul dispersiei
6H FDXW P YDULDELOH
Z1,....., Z m de forma Z h = v hj X j
cu vectorii
j =1
LFDUHV PD[LPL]H]HFULWHULXO
Dispersie( Z
h =1
) (8)
v1,....., vm DLPDWULFHLGHFRYDULDQ
vQWUHYDULDELOHOH
x j asociate
SVHRE LQH
v1 + ..... + v p = Dispersie( X j )
j =1
Dispersie(Z
h =1
) = 1 + ....... + m .
LHUDUKLF
$FHDVW PHWRG FRQGXFH OD XQ DOW SURFHGHX GH D UH]XPD GDWHOH
FRQVWUXLUHDXQXLWLSRORJLLVDXSDUWL LLDLQGLYL]LORUvQFODVHDVWIHOFDLQGLYL]LL
FDUH DSDU LQ DFHOHLDL FODVH V ILH DVHP Q WR
n1 ,....., nk indivizi.
G1 ,....., Gk WLSRORJLDFRUHVSXQ]
WRDUHQRUXOXLGHSXQFWHDVRFLDW
k
k
n
n
I ( N , g ) = ( i )d 2 ( gi , g ) + i I (Gi , gi ).
i =1 n
i =1 n
3ULPXO WHUPHQ GLQ GUHDSWD VH QXPHWH LQHU LD LQWHU FODVH L P VRDU IHOXO
G1 ,....., Gk L UHSUH]LQW
tipologie.
Al doilea termHQ GLQ GUHDSWD VH QXPHWH LQHU LD LQWUD-FODVH L P
omogenitatea claselor.
VRDU
&DOLWDWHD WLSRORJLHL VH P VRDU FX DMXWRUXO UDSRUWXOXL GLQWUH LQHU LD LQWHU
FODVHLLQHU LDWRWDO
poate
fi
D (Gi , G j ) =
ni n j
n(ni + n j )
d 2 ( gi , g j )
$FHVW FULWHULX XWLOL]DW SHQWUX P VXUDUHD GLVWDQ HL vQWUH GRX FODVH
Gi L
G j VHQXPHWHFULWHULXOGHDJUHJDUHDOOXL:DUG
Exemplu:
6 OX P
*
G1 = {xCitroenBX
}
G2 = {x*Peugeot 405 }
. Avem
*
*
d 2 ( x CitroenBX
, x Peugeot
405 ) =
*
D 2 ( xCitroenBX
, x*Peugeot 405 ) =
&ODVLILFDUHDLHUDUKLF
1 1
0.189 = 0.00393
24 (1 + 1)
DVFHQGHQW
G1 ,....., Gk LVH
-clase.
/D HWDSD ILQDO QX PDL H[LVW GHFkW R VLQJXU FODV L LQHU LD LQWHU
HVWHGHFLQXO
-clase
6XPDSLHUGHULORULQHU LHLLQWHU-clase a diferitelor etape este deci
HJDO FX LQHU LD WRWDO /D ILHFDUH HWDS VH FDOFXOHD] XQ LQGLFH RE LQXW SULQ
vPS U LUHDSLHUGHULLGHLQHU LHLQWHU FODVHODLQHU LDWRWDO
a indicelui
$SOLFD LH
$P UHDOL]DW R FODVLILFDUH LHUDUKLF DVFHQGHQW D GDWHORU FHQWUDWH
-reduse
GHQGRJUDP OD QLYHOXO OXL &LWURHQ %; DGLF D HOHPHQWXOXL FDUH SUHFHGH SH
FHO ODW&ODVDHVWHQXPHURWDW
/DDGRXDHWDS VHRE LQHFODVDUHJUXSkQG)RUG6LHUUDL3HXJHRW
%UHDNDOF UHLLQGLFHGHDJUHJDUH
$OJRULWPXO XUPHD] DFHODL SURFHGHX SkQ OD XOWLPD HWDS FkQG VH
UHJUXSHD] FODVDD PLFLORUPDLQL&LYLF6HDW)LHVWDLFODVDIRUPDW
GLQUHVWXOHDQWLRQXOXL
&ULWHULXOOXL:DUGFXPXODWGHODXOWLPDLWHUD LHSHUPLWHFDOFXODUHDLQHU LHL
H[SOLFDWHSULQGLIHULWHOHWLSRORJLLFRQVWUXLWHQDGHY UODXOWLPDLWHUD LHDYHP
I(43,46)=D(43,46)=3.07202
ntruct iQHU LD H[SOLFDW , SULQ FODVD IRUPDW
REVHUYD LL HVWH QXO Q FRQVHFLQ
GRX FODVH L HVWH HJDO FX L SDUWHD GH LQHU LH H[SOLFDW HVWH
HJDO FX
Clasa
%0:L
$XGL %0:
7LSR59:&DUDYHOOH
(VSDFH2PHJD9:&DUDYHOOH`L
7LSR55`
,
-I(43,44,45) deducem:
I(43,44,38,42)=D(43,46)+D(44,45)+D(38,42)=
=3.07202+1.42919+0.29270=4.79391.
&UHWHUHDLQHU LHLH[SOLFDWHILLQGPLF DWXQFLFkQGVHWUHFHGHODWLSRORJLD
IRUPDW GLQ FODVH OD WLSRORJLD IRUPDW GLQ FODVH
DGRSW PWLSRORJLDGDWHORUGLQFODVH
Tabelul 7
&ODVLILFDUHDLHUDUKLF
DVFHQGHQW
Elementul
care
precede
Elementul
FDUHXUPHD]
Nr.
elemente
FRQ LQXWH
Criteriul
lui
Ward
Indice(%).
Tabelul 8
0HGLLOHYDULDELOHORUSHFODVHLWHVWXO)LVKHU
Figura 11
Dendograma
Figura 12
Vizualizarea tipologiei din 3 clase
3HQWUX
D
LQWHUSUHWD
FX
PDL
PXOW
SUHFL]LH
DFHDVW
WLSRORJLH
DP
reprezentat-RSHSODQXOSULQFLSDOGLQ)LJXUDLDPFRQVWUXLW7DEHOXOXQGH
YDULDELOHOH VXQW DUDQMDWH vQ RUGLQHD GHVFUHVF WRare a testului Fisher ntre
YDULDELOHLWLSRORJLH
&ODVDPDLQLORUPLFLFRUHVSXQGHFODVHL
Honda Civic, Seat Ibiza Sxi, Citroen AX Sport, Peugeot 205 Rallye,
Peugeot 205, Fiat Uno, Ford Fiesta.
&ODVDPDLQLORUPHGLLFRUHVSXQGHFODVHL
Fiat Tipo, Renault 19,Citroen BX, Peugeot 405, Renault 21, Espace, Opel
Omega, Ford Sierra, Peugeot 405 Break, Nissan Vanette, VW Caravelle.
&ODVDPDLQLORUPDULFRUHVSXQGHFODVHL
Concluzie
V-DP
d (1977), Saporta
special:
(YHULWW L SURFHGXULOH $&(&/86&/867(5)$67&/86 GLQ
programul SAS.
B
BIIB
BLLIIO
OG
GR
RA
AF
FIIE
E
Michel Tenenhaus
Gilbert Saporta,
9LRULFDWHI QHVFX