Documente Academic
Documente Profesional
Documente Cultură
Proiect Analiza Datelor PDF
Proiect Analiza Datelor PDF
DATELOR
INTRODUCERE
$FHVW SURLHFW DUH FD VFRS V DUDWH LQWHUHVXO L HILFDFLWDWHD SH FDUH OH
SUH]LQW PHWRGHOH VWDWLVWLFH GHVFULSWLYH vQ DQDOL]D WDEHOHORU GH GDWH GH
dimensiuni mari.
([HPSOXO DOHV VH UHIHU OD SRSXOD LD D GH
DQDOL]DWHvQIXQF LHGHYDULDELOHOH
FRQWLQHQW
HFRQRPLFH 3URGXVXO 1D LRQDO %UXW SH ORFXLWRU SHQWUX vQ GRODUL
H[WUDV GLQ SXEOLFD LL DOH % QFLL 0RQGLDOH 3URGXVXO ,QWHULRU %UXW SH
ORFXLWRU SHQWUX vQ GRODUL OD SUH XO L SDU
itatea puterii de
FXPS UDUH GLQ IXUQL]DW GH &HQWUXO GH 6WXGLL 3URVSHFWLYH L GH
)LLQG GDW QDWXUD FDQWLWDWLY D YDULDELOHORU VH YD HIHFWXD vQ SULPD SDUWH
D[HORUSULQFLSDOHDVWIHOFUHDWHSUHFXPLDFDOLW
LQGLYL]LORUvQDFHVWVSD LXFXGRX GLPHQVLX
LLUHSUH]HQW ULLYDULDELOHORUL
ni.
vQDFHVWVSD LX
Q D GRXD SDUWH FX DMXWRUXO XQHL FODVLILF UL YRP RE LQH SDUWL LD FHD PDL
FRHUHQW DDFHVWHLPXO LPLGH
UL
GHVFULSWLY D GDWHORU
progQR]DSRSXOD LHL
'HIDSWYRPvQFHUFDV H[SOLF PDFHDVW YDULDELO vQIXQF LHGHYDULDELOHOH
active.
QH[HPSOXOQRVWUXDPUH LQXW
11 variabile active
683(5),&FRQWLQX
3238/$7,FRQWLQX
1$7$/,7(FRQWLQX
0257$/,7FRQWLQX
8,1)$17,/FRQWLQX
)(&21',7FRQWLQX
,1FRQWLQX
68FRQWLQX
(63(5$1&FRQWLQX
31%FRQWLQX
3,%FRQWLQX
3 variabile ilustrative
1.
2.
7.
5(/,*,21PRGDOLW
&217,1(1PRGDOLW
352-(&7,FRQWLQX
L
L
1.1.2. Indivizii
,QGLYL]LLVWDWLVWLFLVXQW
ULOHGH
ULDXIRVWUH LQXWHSHQWUXDQDOL]DFXR
SRQGHUH XQLIRUP HJDO FX )UDQ D QX SDUWLFLS OD DFHDVW DQDOL] L YD IL
WUDWDW FDXQLQGLYLGVXSOLPHQWDU
1.1.3.
6WDWLVWLFDXQLGLPHQVLRQDO
GH YLD
ULGLFDW
IRUP vQ WULXQJKL SUHFXP &DQDGD *HUPDQLD L ,WDOLD LDU SH GH DOW SDUWH
ULOH FDUH SUH]LQW R QDWDOLWDWH R PRUWDOLWDWH LQIDQWLO L XQ SURFHQW GH PDL
SX LQGH DQL SUHD ULGLFDW ID
a,
$UDELD6DXGLW L&RDVWDGH)LOGH
8Q DOW JUDILF WLS FXWLD FX PXVW
L FODVHD]
e.
SULPDFXDQWLO
Q2 = 0,0490 (mediana)
4
DWUHLDFXDQWLO
E = Q3-Q1*Q3+1,5Q1=0,2676
$FHVW JUDILF DUDW
norma:
Coreea de Sud,
ULOHGH-RV
Japonia,
Belgia.
,QHU LDLF
XWDUHDD[HORUSULQFLSDOH
,QHU LD LQL LDO D QRUXOXL GH SXQFWH HVWH VXPD SRQGHUDW D S WUDWHORU
GLVWDQ HORU LQGLYL]LORU OD FHQWUXO GH JUHXWDWH 6H DUDW F DWXQFL FkQG GDWHOH
LQHU LH HVWH HJDO FX QXP UXO GH YDULDELOH DGLF
$&3 FRQVW vQ GHWHUPLQDUHD D[HORU QXPLWH D[H SULQFLSDOH FDUH YRU
SHUPLWHPD[LPL]DUHDLQHU LHLQRUXOXLGHSXQFWHSURLHFWDW$FHDVW PD[LPL]DUH
QHFHVLW F XWDUHD YDORULORU SURSULL DOH PDWULFLL 90 XQGH 9 HVWH PDWULFHD
doua 13,80 %.
3ULPXO SODQ SULQFLSDO H[SOLF GHFL GLQ LQHU LD WRWDO D QRUXOXL GH
puncte.
+LVWRJUDPD YDORULORU SURSULL IDFH V DSDU R UXSWXU GXS D GRXD YDORDUH
SURSULH$FHVWFULWHULXSHUPLWHGHWHUPLQDUHDQXP UXOXLGHD[HGHLQWHUSUHWDW
variabile.
Aceasta permite determinarea cerculuLGHFRUHOD LL
&DOLWDWHDUHSUH]HQW
ULLYDULDELOHORU
L
31%YDULDELOHFDUHVXQWPDLSX LQELQHUHSUH]HQWDWH
6H SRDWH GD GHFL R LQWHUSUHWDUH SHQWUX D[D DFHDVW D[ RSXQH
WLQHUHOD
ULOH
ULOHE WUkQH
3H D[D YDULDELOHOH FHOH PDL ELQH UHSUH]HQWDWH VXQW 3RSXOD LD L
6XSUDID D
(VWH YRUED GHFL GH R D[ GH WDOLH FDUH RSXQH PDULOH
celelalte.
Global variabilele cele mai bine reprezentate n planul 1-2 sunt Natalitatea
L 3URFHQWXO GH PDL SX LQ GH DQL JUDILF HOH VXQW FHOH PDL DSURSLDWH GH
FHUFXOGHFRUHOD LL
9DULDELOD LOXVWUDWLY 3URMHFWLRQ 3URJQR]D SRSXOD LHL HVWH IRDUWH VWUkQV
FRUHODW FX D[D L IRDUWH VODE FX D[D (D HVWH VODE FRUHODW FX YDULDELOHOH
demografice.
1.2.3. Reprezentarea indivizilor
&RRUGRQDWHOH QHFHVDUH UHSUH]HQW ULL LQGLYL]LORU vQ SULPXO SODQ SULQFLSDO
VXQWIXUQL]DWHvQDQH[DFkWLHOHPHQWHOHQHFHVDUHLQWHUSUHW ULLFRQWULEX LLOH
LQGLYL]LORUODSULPXOSODQSULQFLSDOLFRVLQXVXULOHS WUDWH
&RQWULEX LLOH D[ FX D[ SHUPLW GHWHUPLQDUHD LPSRUWDQ HL LQGLYL]LORU vQ
FRQVWUXLUHD D[HORU 1X HVWH GH GRULW FD XQ LQGLYLG V DLE R FRQWULEX LH
H[FHVLY $FHDVWDDUFRQVWLWXLXQIDFWRUGHLQVWDELOLWDWH
Q DGHY U GDF UHQXQ
GHWHUPLQ P GLQ QRX D[H FX R VHPQLILFD LH GLIHULW *UDILF LQGLYL]LL FX R
FRQWULEX LH SXWHUQLF VXQW SH IURQWLHUHOH UHSUH]HQW ULL vQWUXFkW FRQWULEX LD
pe axa 1 sunt:
&RDVWDGH)LOGH0DOL6HQHJDO7RJR(WLRSLD6RPDOLD7FKDGSHQWUX
ULOH WLQHUH $FHVWH
L31%
6LWXD LDHVWHGLQFRQWU LQYHUVDW SHQWUX-DSRQLD
ULOHFXRFRQWULEX LHPDUHVXQW
&KLQD 8566 ,QGLD $FHVWH
VXSUDID
$FHVWH
DFHVWH
SURFHQWXOGHPDLPXOWGH
PDUH
ULU PkQLGHQWLFH
(PLUDWHOH $UDEH 8QLWH DX R VLWXD LH LQYHUV VXSUDID
L SRSXOD LH
PLF
XQVSD LXFXGLPHQVLXQL8QLQGLYLGYDILFXDWkWPDLELQHUHSUH]HQWDWFXFkW
SLHUGHUHDGLVWDQ HL HVWH PDL PLF Q HOHJHP SULQ SLHUGHUHDGLVWDQ HLGLIHUHQ D
vQWUH GLVWDQ D LQGLYLGXOXL L OD RULJLQH vQ VSD LXO FX GLPHQVLXQL L GLVWDQ D
DFHOXLDL LQGLYLG OD RULJLQH vQ SULPXO SODQ SULQFLSDO &D R FRQVHFLQ
XQ
LQGLYLG YD IL FX DWkW PDL ELQH UHSUH]HQWDW FX FkW XQJKLXO vQWUH LQGLYLG L
SURLHF LD VD HVWH PDL PLF VDX FX DOWH FXYLQWH FX FkW S WUDWXO FRVLQXVXOXL
Siria.
IndiDQDGHY
UHVWHVLQJXUD DU vQFDUHKLQGXLVPXOHVWHUHOLJLDPDMRULWDU
Q FRQFOX]LH UHSUH]HQW ULOH JUDILFH IDF V DSDU FODU FHOH WUHL JUXSH
GLVWLQFWHGH
UL
DQLPLVW VDXPXVXOPDQ
FHOH E WUkQH
$PHULFD6HSWHQWULRQDO GHUHOLJLHFUHWLQ
LRDWUHLDJUXS LQWHUPHGLDU
Americii de Sud.
ULOHvQFXUVGHGH]YROWDUHDOH$VLHLL
UL
CLASIFICAREA
Clasificarea are ca scop regruparea indivizilor n clase omogene.
([LVW GRX PDUL WLSXUL GH PHWRGH FODVLILFDUHD QRQ LHUDUKLF FDUH
SURGXFH R SDUWL LH vQWU
-un nuP
SURGXFHXQLUGHSDUWL LLvQFXLEDWH
2.1.
&ODVLILFDUHDLHUDUKLF
SULQPHWRGDOXL:DUG
,QGLFLL QLYHOXULORU H[SULP SLHUGHUHD GH LQHU LH LQWHUFODVH OD ILHFDUH
UHJUXSDUH6XPDLQGLFLORUGHQLYHODUWUHEXLV ILHHJDO FXDGLF HJDO FX
LQHU LDWRWDO DQRUXOXLGHSXQFWH
&HL GRL LQGLYL]L FHLPDL DSURSLD L VXQW GHFL L DGLF 3RUWXJDOLD L
1RXD=HHODQG
Q HWDSD XUP WRDUH QX PDL U PkQ GHFkW LQGLYL]L L R FODV GLQ FHOH
GRX
6H UHJUXSHD] GLQ QRX FHL GRL LQGLYL]L I FkQG V VH SLDUG FkW PDL SX LQ
LQHU LHLQWHUFODVHDGLF 5HJDWHOH 8QLWHL'DQHPDUFD
6H UHvQFHSH DSRL DFHDVW LWHUD LH SkQ FkQG WR L LQGLYL]LL YRU IL UHJUXSD L
de
FODVHFHWUHEXLHS VWUDW$FHVWFULWHULXDEVROXWDFRQVHUYDWFODVH
&XORULOH SHUPLW XRU V GLVWLQJHP FHOH FODVH DVWIHO FUHDWH FXORULOH
2.2.
Consolidarea claselor
0HWRGD FHQWUHORU PRELOH FRQYHUJH IRDUWH UDSLG 3ULPD SDUWL LH HIHFWXDW
HUD GHFL VDWLVI F WRDUH 8Q VLQJXU LQGLYLG D VFKLPEDW FODVD HVWH YRUED GH
$UJHQWLQDFDUHWUHFHGLQFODVDOD
ULvQFXUVGHGH]YROWDUH
&RQVWUXLUHDILQDO DFODVHORUHVWHGHFL
Descrierea claselor
Mai multe elemente permit caracterizarea claselor diferite create.
3RW IL SULYLWH FRRUGRQDWHOH L YDORULOH WHVW DOH FHQWUHORU GH JUHXWDWH DOH
FODVHORULQGLYL]LLWLSLFLPRGDOLW
LOHLYDULDELOHOHFHOHPDLFDUDFWHULVWLFH
&ODVHOH L DX FRRUGRQDWH GH YDORUL ULGLFDWH GDU RSXVH SH D[D L FX
TaEHOHOH XUP
FDUDFWHULVWLFLLGLVWDQ DORUODFHQWUXOGHJUHXWDWH
&DUDFWHUL]DUHDSULQYDULDELOHLOXVWUDWLYHLDFWLYH
8UP WRDUHOH GRX WDEHOH GDX SHQWUX ILHFDUH FODV YDULDELOHOH FHOH PDL
PXOWLFHOHPDLSX LQUHSUH]HQWDWLYH
'HILQL LH
([HPSOXGLQWUH
LL
ULOHDVLDWLFHIDFSDUWHGLQFODVD
(10/12=0,8333).
02'&/$6 UDSRUWXO GLQWUH QXP UXO LQGLYL]LORU DSDU LQkQG FODVHL L
PRGDOLW
LLLQXP UXOXLGHLQGLYL]LDLFODVHL
ULOH DVLDWLFH v
n timp ce ea nu
3 au variabile caracterisWLFH GDU vQ RSR]L LH $FHOHD FDUH VXQW VXSHULRDUH
PHGLHLSHQWUXRFODV VXQWLQIHULRDUHSHQWUXFHDODOW LLQYHUV&ODVDHVWH
vQDGHY URFODV LQWHUPHGLDU I U FDUDFWHULVWLFLELQHPDUFDWH
&ODVLILFDUHD
HVWH
GHFL
FRPSOHPHQWDU
$QDOL]HL
vQ
&RPSRQH
nte
Concluzie
0HWRGHOH VWDWLVWLFH GHVFULSWLYH GDWRULW DVSHFWHORU YL]XDOH UHSUH]HQW ULL
JUDILFH L DUERUL GH FODVLILFDUH L LQWXLWLYH LPSRUWDQWH SHUPLW GHVFULHUHD
UHODWLYVLPSOXDPXO LPLLGHGDWHFRPSOH[H
$YDQWDMXO DFHVWRU PHWRGH vQ DIDUD DVSHFWXOXL GHVFULSWLY FRQVW GHFL vQ
IDSWXOF HOHVXQWUHFHSWLELOHGHXQSXEOLFODUJQHVSHFLDOL]DW
Anexe