of social research
Topic 5-7: Factor analysis (II)
Bibliography: Chapter 4
George H. Dunteman. 1989. Principal Components Analysis. Newbury Park,
Ca.: Sage Publications.
Jae-On Kim, Charles W. Mueller. 1978a. Introduction to Factor Analysis. What
It Is and How to Do It. Newbury Park, Ca.: Sage Publications.
Jae-On Kim, Charles W. Mueller. 1978b. Factor Analysis. Statistical Methods
and Practical Issues. Newbury Park, Ca.: Sage Publications.
J. Scott Long. 1983. Confirmatory Factor Analysis. Newbury Park, Ca.: Sage
Publications.
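[Path diagram: the common factors F1, F2, ..., Fn act on the observed variables X1, X2, X3, ..., Xm; each observed variable Xi is also influenced by its own unique factor Ui (U1, U2, U3, ..., Um).]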
        F1     F2     ...    Fn
X1      b11    b12    ...    b1n
X2      b21    b22    ...    b2n
...     ...    ...    ...    ...
Xm      bm1    bm2    ...    bmn

bij = r(Xi, Fj)
1 0 ... 0
0 1 ... 0
...
0 0 ... 1
\[
\mathrm{KMO} = \frac{\sum_{i \neq j} r(X_i, X_j)^2}{\sum_{i \neq j} r(X_i, X_j)^2 + \sum_{i \neq j} a(X_i, X_j)^2}
\]
where a(Xi, Xj) is the partial correlation coefficient between Xi and Xj when all the other variables are controlled for.
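The KMO formula can be checked numerically. The sketch below is only an illustration: the function name `kmo` and the 3-variable matrix `R_demo` are hypothetical, and the partial correlations a(Xi, Xj) are obtained from the inverse of the correlation matrix, which is one standard way to compute them.

```python
import numpy as np

def kmo(R):
    """Overall Kaiser-Meyer-Olkin measure for a correlation matrix R."""
    R = np.asarray(R, dtype=float)
    R_inv = np.linalg.inv(R)
    # Partial correlation of Xi and Xj, controlling for all other variables:
    # a_ij = -p_ij / sqrt(p_ii * p_jj), where P is the inverse of R.
    d = np.sqrt(np.diag(R_inv))
    A = -R_inv / np.outer(d, d)
    off = ~np.eye(R.shape[0], dtype=bool)  # mask selecting the cells with i != j
    r2 = np.sum(R[off] ** 2)               # sum of squared correlations
    a2 = np.sum(A[off] ** 2)               # sum of squared partial correlations
    return r2 / (r2 + a2)

# Hypothetical example: three variables, every pair correlated at 0.5.
R_demo = np.array([[1.0, 0.5, 0.5],
                   [0.5, 1.0, 0.5],
                   [0.5, 0.5, 1.0]])
kmo_demo = kmo(R_demo)
print(round(kmo_demo, 3))
```

Values close to 1 mean that the partial correlations are small relative to the simple correlations, so the variables share enough common variance for a factor model.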
(a) the least squares method,
(b) the maximum likelihood method,
(c) Alpha factoring,
(d) image factoring,
(e) principal axis factoring,
(f) principal component analysis.
The first factor extracted corresponds to the largest eigenvalue; in other words, the first factor extracted is the one that explains the most variance in the observed variables. The next factor extracted explains as much as possible of the variance left unexplained, and so on.
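As an illustration of extraction by decreasing eigenvalue, the sketch below uses the principal components variant (method (f) in the list above), which is not necessarily the method behind the SPSS output in these notes. The function name `extract_components` and the matrix `R_demo` are hypothetical.

```python
import numpy as np

def extract_components(R):
    """Eigen-decompose a correlation matrix and order factors by eigenvalue."""
    R = np.asarray(R, dtype=float)
    eigvals, eigvecs = np.linalg.eigh(R)      # eigh returns ascending order
    order = np.argsort(eigvals)[::-1]         # largest eigenvalue first
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    # Principal component loadings: eigenvector scaled by sqrt(eigenvalue).
    loadings = eigvecs * np.sqrt(np.clip(eigvals, 0.0, None))
    share = eigvals / eigvals.sum()           # proportion of variance explained
    return eigvals, loadings, share

# Hypothetical example: three variables, every pair correlated at 0.5.
R_demo = np.array([[1.0, 0.5, 0.5],
                   [0.5, 1.0, 0.5],
                   [0.5, 0.5, 1.0]])
eigvals, loadings, share = extract_components(R_demo)
print(np.round(eigvals, 3), np.round(share, 3))
```

The eigenvalues of a correlation matrix sum to the number of variables, so each eigenvalue divided by that number is the share of total variance explained by its factor.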
How many factors do we stop at?
How many factors do we need to represent the data?
Figure 10. Obtaining a simple structure by examining the graphical configuration of the variables.
        Factor1   Factor2
X1      0.83      -0.15
X2      0.76      -0.24
X3      0.90      -0.35
X4      0.20       0.80
X5      0.25       0.85
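[Plot: the variables X1-X5 plotted by their loadings, with FACTOR 1 on the horizontal axis and FACTOR 2 on the vertical axis. X4 and X5 lie high on Factor 2; X1, X2 and X3 lie below zero on Factor 2.]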
Response options:
1. Very important
2. Important
3. Fairly important
4. Not very important
5. Not at all important
8. DK, don't know (recoded as missing value)
9. NA, no answer (recoded as missing value)
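The recoding of codes 8 and 9 as missing values can be sketched as follows. This is only an illustration in NumPy, not the SPSS procedure used for the course data; the names `MISSING_CODES` and `recode_missing` and the sample answers are hypothetical.

```python
import numpy as np

MISSING_CODES = (8, 9)  # 8 = don't know, 9 = no answer in the scheme above

def recode_missing(values):
    """Replace the non-substantive codes with NaN so they drop out of the analysis."""
    values = np.asarray(values, dtype=float)
    return np.where(np.isin(values, MISSING_CODES), np.nan, values)

# Hypothetical answers of six respondents to one item.
answers = recode_missing([1, 3, 8, 5, 9, 2])
valid_n = int(np.sum(~np.isnan(answers)))
print(answers, valid_n)
```

Because each item loses a different set of respondents, correlations computed on the valid pairs rest on a different N for each pair of items.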
Observed correlations:

Correlations (Pearson)

Items:
(1) To be born into a rich family?
(2) To know how to get by?
(3) To have connections?
(4) To have luck?
(5) To believe in God?
(6) To be smart/intelligent?
(7) To look good?
(8) To get an education?
(9) To work hard?
(10) To steal?

        (1)     (2)     (3)     (4)     (5)     (6)     (7)     (8)     (9)     (10)
(1)     1       0.168   0.566   0.347   0.097   0.147   0.286   0.067   0.081   0.214
(2)     0.168   1       0.227   0.216   0.115   0.243   0.145   0.209   0.213   0.115
(3)     0.566   0.227   1       0.389   0.162   0.190   0.258   0.069   0.061   0.198
(4)     0.347   0.216   0.389   1       0.287   0.268   0.214   0.124   0.147   0.107
(5)     0.097   0.115   0.162   0.287   1       0.248   0.136   0.241   0.222   -0.037
(6)     0.147   0.243   0.190   0.268   0.248   1       0.240   0.474   0.402   -0.059
(7)     0.286   0.145   0.258   0.214   0.136   0.240   1       0.240   0.157   0.164
(8)     0.067   0.209   0.069   0.124   0.241   0.474   0.240   1       0.523   -0.131
(9)     0.081   0.213   0.061   0.147   0.222   0.402   0.157   0.523   1       -0.104
(10)    0.214   0.115   0.198   0.107   -0.037  -0.059  0.164   -0.131  -0.104  1

Sig. (2-tailed): 0.000 for every pair except (1)-(8) 0.003, (3)-(8) 0.002, (3)-(9) 0.006, (6)-(10) 0.010 and (5)-(10) 0.103. All coefficients are flagged ** (significant at the 0.01 level, 2-tailed) except (5)-(10).

N (pairwise; the diagonal gives the valid N of each item)

        (1)     (2)     (3)     (4)     (5)     (6)     (7)     (8)     (9)     (10)
(1)     2027    1999    2011    2016    2015    2017    1999    2012    2015    1911
(2)     1999    2025    2003    2013    2013    2014    1999    2013    2011    1927
(3)     2011    2003    2028    2023    2014    2021    2002    2016    2015    1917
(4)     2016    2013    2023    2044    2030    2034    2013    2031    2033    1926
(5)     2015    2013    2014    2030    2047    2029    2011    2029    2031    1926
(6)     2017    2014    2021    2034    2029    2041    2016    2029    2029    1926
(7)     1999    1999    2002    2013    2011    2016    2021    2014    2013    1912
(8)     2012    2013    2016    2031    2029    2029    2014    2041    2031    1925
(9)     2015    2011    2015    2033    2031    2029    2013    2031    2042    1926
(10)    1911    1927    1917    1926    1926    1926    1912    1925    1926    1937
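[Path diagram: the factors F1 and F2 and the unique factors U1, U2, ..., U10 act on the observed variables; one path is annotated "indirect effect, mediated by F1". The label rv10 also appears in the diagram.]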
b11 is the loading of F1 (factor loading F1) and b12 is the loading of F2 (factor loading F2). These loadings are presented in the factor loadings matrix (Factor Matrix or Factor Loadings Matrix) in the SPSS output. If we choose a model in which the factors are independent, the correlations between factors and variables reduce to the direct effects, so they are identical to the loadings (factor loadings).
Factor extraction:
We build a model of the relationships between factors and variables such that the difference between the observed correlations and the reproduced correlations is as small as possible.
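A minimal sketch of how reproduced correlations follow from the loadings when the factors are independent. The two rows of loadings appear in the factor matrix of this example; treating them as the rows for "to have connections" and "to be born into a rich family" is an assumption, inferred from the communalities. The observed value 0.566 is the pairwise correlation from the table above, so the residual computed here need not match the SPSS residual exactly.

```python
import numpy as np

# Loadings on (F1, F2). Assumption: row 0 = "to have connections",
# row 1 = "to be born into a rich family" (pairing inferred, not given).
B = np.array([[0.572, 0.503],
              [0.524, 0.490]])

# With independent (orthogonal) factors, reproduced correlations are B B^T.
# The diagonal holds the communalities (sums of squared loadings).
R_hat = B @ B.T

observed = 0.566                   # observed pairwise correlation of the two items
residual = observed - R_hat[0, 1]  # observed minus reproduced

print(np.round(R_hat, 3), round(residual, 3))
```

The off-diagonal value comes out at about 0.546 and the diagonal at about 0.580 and 0.515, which agrees with the reproduced correlation and the communalities reported for these two items.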
Reproduced Correlations

Items:
(1) To be born into a rich family?
(2) To know how to get by?
(3) To have connections?
(4) To have luck?
(5) To believe in God?
(6) To be smart/intelligent?
(7) To look good?
(8) To get an education?
(9) To work hard?
(10) To steal?

Reproduced Correlation

        (1)       (2)       (3)       (4)       (5)        (6)        (7)       (8)       (9)       (10)
(1)     0.515b    0.197     0.546     0.354     0.135      0.137      0.261     0.04165   0.04790   0.215
(2)     0.197     0.143b    0.215     0.182     0.139      0.214      0.159     0.214     0.187     0.03333
(3)     0.546     0.215     0.580b    0.380     0.151      0.160      0.283     0.06187   0.06590   0.224
(4)     0.354     0.182     0.380     0.276b    0.153      0.206      0.221     0.165     0.149     0.114
(5)     0.135     0.139     0.151     0.153     0.148b     0.245      0.145     0.267     0.231     -0.007395
(6)     0.137     0.214     0.160     0.206     0.245      0.424b     0.211     0.485     0.417     -0.06037
(7)     0.261     0.159     0.283     0.221     0.145      0.211      0.184b    0.194     0.172     0.06622
(8)     0.04165   0.214     0.06187   0.165     0.267      0.485      0.194     0.582b    0.498     -0.126
(9)     0.04790   0.187     0.06590   0.149     0.231      0.417      0.172     0.498     0.426b    -0.103
(10)    0.215     0.03333   0.224     0.114     -0.007395  -0.06037   0.06622   -0.126    -0.103    0.126b

b. Reproduced communalities (diagonal).

Residual (a)
a. Residuals are computed between observed and reproduced correlations. The cell positions of the residuals cannot be recovered from the flattened output; in absolute value they range from about 0.0002 to 0.135.
Factor extraction:
The difference between the observed and the reproduced correlations is measured as the sum of squared differences and interpreted as a chi-square-type measure. The statistical significance of the differences is tested using the chi-square distribution.
Goodness-of-fit Test
Chi-Square    df    Sig.
182.705       26    0.000
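The 26 degrees of freedom can be checked with the usual formula for a maximum likelihood factor model with p observed variables and m factors. Taking p = 10 and m = 2 is an assumption read off the example (ten items, two extracted factors):

\[
df = \frac{(p - m)^2 - (p + m)}{2} = \frac{(10 - 2)^2 - (10 + 2)}{2} = \frac{64 - 12}{2} = 26
\]

A significant chi-square (Sig. = 0.000) indicates that the differences between observed and reproduced correlations are larger than sampling error alone would produce; with around 2000 cases even small residuals can reach significance.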
How were the factors extracted?
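KMO and Bartlett's Test
Kaiser-Meyer-Olkin Measure of Sampling Adequacy         0.756
Bartlett's Test of Sphericity    Approx. Chi-Square     3351.885
                                 df                     45
                                 Sig.                   0.000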
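Bartlett's test of sphericity asks whether the correlation matrix differs from an identity matrix. The sketch below uses the standard approximation based on the determinant of R; the function name `bartlett_sphericity` and the sample size passed to it are hypothetical, and the p-value lookup in the chi-square distribution is left out.

```python
import numpy as np

def bartlett_sphericity(R, n):
    """Approximate chi-square statistic and df for H0: R is an identity matrix."""
    R = np.asarray(R, dtype=float)
    p = R.shape[0]
    sign, logdet = np.linalg.slogdet(R)       # log of det(R), numerically stable
    chi2 = -(n - 1 - (2 * p + 5) / 6.0) * logdet
    df = p * (p - 1) // 2                     # number of distinct correlations
    return chi2, df

# With 10 uncorrelated variables the statistic is zero; n = 1900 is hypothetical.
chi2_id, df_id = bartlett_sphericity(np.eye(10), n=1900)
print(chi2_id, df_id)
```

For ten variables the degrees of freedom are 10 * 9 / 2 = 45, as in the table above.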
Communalities
                                    Initial    Extracted
To have connections?                0.378      0.580
To look good?                       0.165      0.184
To believe in God?                  0.150      0.148
To have luck?                       0.248      0.276
To get an education?                0.382      0.582
To be intelligent?                  0.316      0.424
To steal?                           0.113      0.126
To work hard?                       0.307      0.426
To know how to get by?              0.128      0.143
To be born into a rich family?      0.360      0.515
Extraction Method: Maximum Likelihood.
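For maximum likelihood and principal axis extraction, the initial communality of a variable is commonly taken to be its squared multiple correlation with all the other variables. Assuming that is what the "Initial" column reports here, it can be sketched from the inverse of the correlation matrix; the function name `initial_communalities` and the matrix `R_demo` are hypothetical.

```python
import numpy as np

def initial_communalities(R):
    """Squared multiple correlation of each variable with all the others."""
    R_inv = np.linalg.inv(np.asarray(R, dtype=float))
    return 1.0 - 1.0 / np.diag(R_inv)

# Hypothetical example: three variables, every pair correlated at 0.5.
R_demo = np.array([[1.0, 0.5, 0.5],
                   [0.5, 1.0, 0.5],
                   [0.5, 0.5, 1.0]])
smc = initial_communalities(R_demo)
print(np.round(smc, 3))
```

The extracted communalities, by contrast, are the sums of squared loadings on the retained factors.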
Initial Eigenvalues
Factor    Total    % of Variance    Cumulative %
1         2.787    27.866           27.866
2         1.781    17.813           45.679
3         0.966     9.656           55.335
4         0.875     8.752           64.087
5         0.828     8.277           72.364
6         0.680     6.804           79.168
7         0.633     6.329           85.497
8         0.576     5.755           91.252
9         0.452     4.516           95.768
10        0.423     4.232          100.000
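The percentage columns follow directly from the eigenvalues: each eigenvalue is divided by the number of variables (10). The sketch below also counts the eigenvalues above 1, a common rule of thumb (the Kaiser criterion) that these notes do not name explicitly, so treat it as an assumption about how the number of factors was chosen.

```python
import numpy as np

# Initial eigenvalues from the table above.
eigenvalues = np.array([2.787, 1.781, 0.966, 0.875, 0.828,
                        0.680, 0.633, 0.576, 0.452, 0.423])

p = eigenvalues.size                  # number of observed variables
pct = 100.0 * eigenvalues / p         # % of variance per factor
cumulative = np.cumsum(pct)           # cumulative %
n_kaiser = int(np.sum(eigenvalues > 1.0))  # factors with eigenvalue > 1

print(np.round(pct, 2), np.round(cumulative, 2), n_kaiser)
```

With these values two factors pass the threshold, and together they account for about 45.7% of the total variance.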
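Factor Matrix
                                    Factor 1    Factor 2
To have connections?                0.572        0.503
To be smart/intelligent?            0.565       -0.324
To get an education?                0.562       -0.516
To be born into a rich family?      0.524        0.490
To work hard?                       0.492       -0.428
To have luck?                       0.483        0.206
To look good?                       0.421
To know how to get by?              0.378
To believe in God?                  0.367       -0.118
To steal?                                        0.343

The squared loadings in each row sum to the extracted communality of that item (for example, 0.572^2 + 0.503^2 = 0.580 for "To have connections?"). Blank cells are not printed in the output.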
If we assume that the factors are independent, the system of axes is orthogonal and the factor loadings are equal to the Pearson correlation coefficients between variables and factors. Correlation between factors: $r_{F_1F_2} = F_1 \cdot F_2 \cdot \cos 90^\circ = F_1 \cdot F_2 \cdot 0 = 0$.
We can rotate the factor solution while preserving the independence (orthogonality) of the factors.
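A minimal sketch of an orthogonal rotation, using the Factor1/Factor2 loadings of X1-X5 from the Figure 10 example. The function name `rotate` and the angle of -17 degrees are hypothetical; the angle was picked by eye so that the first axis passes close to the X1-X3 cluster, and it is not the result of a rotation criterion such as varimax.

```python
import numpy as np

# Loadings of X1..X5 on Factor1 and Factor2 (Figure 10 example).
B = np.array([[0.83, -0.15],
              [0.76, -0.24],
              [0.90, -0.35],
              [0.20,  0.80],
              [0.25,  0.85]])

def rotate(B, degrees):
    """Post-multiply the loadings by a 2x2 orthogonal rotation matrix."""
    t = np.radians(degrees)
    T = np.array([[np.cos(t), -np.sin(t)],
                  [np.sin(t),  np.cos(t)]])
    return B @ T

B_rot = rotate(B, -17.0)                 # hypothetical angle, chosen by eye
h2_before = np.sum(B ** 2, axis=1)       # communalities before rotation
h2_after = np.sum(B_rot ** 2, axis=1)    # communalities after rotation

print(np.round(B_rot, 2))
```

The rotation leaves the communalities and the reproduced correlations unchanged; only the distribution of each variable's loading across the two factors changes, which is what makes the structure simpler to read.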
If we assume that the factors are correlated, the system of axes is NOT orthogonal but oblique. The factor loadings (the direct effects of each factor) will differ from the correlation coefficients between factors and variables, because part of the correlation is due to the correlation between the factors (indirect effects of the factors through the other factors). Correlation between factors: $r_{F_1F_2} = F_1 \cdot F_2 \cdot \cos\theta$, where $\theta$ is the angle between the factor axes.
We can rotate the factor solution assuming that the factors are correlated (oblique rotation).
\[
r_{X_1X_2} = b_{11} b_{21} + b_{12} b_{22} + b_{11} b_{22}\, r_{F_1F_2} + b_{21} b_{12}\, r_{F_1F_2}
\]
The factor loadings (b11, b21 for F1; b12, b22 for F2; etc.) are partial coefficients, obtained by controlling for the effects of the other factors. They can be interpreted as standardized multiple linear regression coefficients (beta).
In this decomposition of $r_{X_1X_2}$, the term $b_{11} b_{21}$ is the direct effect of F1, the term $b_{12} b_{22}$ is the direct effect of F2, and the two terms that contain $r_{F_1F_2}$ are the indirect effects.
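The decomposition into direct and indirect effects can be verified numerically. In the sketch below all loadings and the factor correlation are hypothetical values chosen for illustration; the matrix form B Phi B^T is the general expression of which the four-term formula is the two-variable, two-factor case.

```python
import numpy as np

# Hypothetical loadings: rows are X1, X2; columns are F1, F2.
b11, b12 = 0.6, 0.3
b21, b22 = 0.5, 0.4
r_f1f2 = 0.3                       # hypothetical correlation between the factors

B = np.array([[b11, b12],
              [b21, b22]])
Phi = np.array([[1.0, r_f1f2],
                [r_f1f2, 1.0]])    # factor correlation matrix

R_hat = B @ Phi @ B.T              # reproduced correlations with correlated factors

direct = b11 * b21 + b12 * b22                   # direct effects of F1 and F2
indirect = (b11 * b22 + b21 * b12) * r_f1f2      # effects passing through r_F1F2

print(round(R_hat[0, 1], 3), round(direct, 3), round(indirect, 3))
```

Setting `r_f1f2` to 0 removes the indirect part and returns the orthogonal case, where the reproduced correlation is the sum of the direct effects only.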
Variance = communality
Interpretation?
Factor scores
II. Building additive scales that take into account the strength of the relationships between variables and factors and give the variables different weights.
Because the observed variables indicate the latent dimension (the factor) under study more strongly or more weakly, they receive different weights in the final index. The weight is given by a score (a number) by which we multiply the value of the respective variable for each case (statistical individual).
This score is provided by the factor analysis and appears as a new variable in the data set (with different values for each object in the sample).
The score can be determined by several methods:
1.
2.
3.
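As an illustration of a weighted additive index, the sketch below uses the regression approach to factor scores, in which the weights are obtained as W = R^-1 B and applied to the standardized variables. This is one widely used method and is offered as an assumption, since the method names in the list above are missing. The function name, the simulated data and the loadings `B_demo` are all hypothetical.

```python
import numpy as np

def regression_factor_scores(X, B):
    """Factor scores as weighted sums of standardized variables (regression method)."""
    X = np.asarray(X, dtype=float)
    Z = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)   # standardize each variable
    R = np.corrcoef(X, rowvar=False)                   # observed correlation matrix
    W = np.linalg.solve(R, np.asarray(B, dtype=float)) # weights, one column per factor
    return Z @ W, W

# Simulated data: 200 cases, 3 indicators driven by one common dimension.
rng = np.random.default_rng(0)
common = rng.normal(size=(200, 1))
X_demo = common @ np.array([[0.8, 0.7, 0.6]]) + rng.normal(scale=0.6, size=(200, 3))

B_demo = np.array([[0.8], [0.7], [0.6]])   # hypothetical loadings on one factor
scores, weights = regression_factor_scores(X_demo, B_demo)
print(np.round(weights.ravel(), 3), scores.shape)
```

Each case receives one score per factor, which can be stored as a new variable in the data set; variables with stronger loadings receive larger weights in that score.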
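[Table fragment from the SPSS output; row and column labels are lost. Values in source order: 0.785, 0.670, -0.137, 0.643, 0.348, 0.269, 0.101, 0.208, 0.755, 0.717, 0.187, -0.194, 0.236, 0.447, 0.348, 0.306]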