Clustering

Clustering partitions the set of counties into subsets based on the similarity of their data. Elements within a cluster should be highly similar to one another, while elements in different clusters should have a low degree of similarity. The clustering process rests on three elements: [first item illegible in the source], a criterion for forming the clusters, and an algorithm for building the clusters that satisfies the given criterion.

Clustering can be non-hierarchical or hierarchical. Non-hierarchical clustering produces the clusters directly: it starts from an initial partition of the objects, and the algorithm improves the partition by moving objects from one cluster to another. [The description of hierarchical clustering is cut off in the source at this point.]

> library(e1071)
> library(scales)
> indicatori <- read.csv("indicatori.csv")
> indicatori
             Judet    RM    PA    PO      S   NMS    CS     POP  NREM
1             Alba 208.0 172.1 159.6 12.508  79.0   mic 227.371 mediu
2             Arad 270.6 218.2 212.1  6.071 119.3   mic 291.995 mediu
3            Arges 378.5 259.1 243.8 15.334 131.3 mediu 415.905 mediu
4            Bacau 371.0 219.0 204.2 14.809  97.3   mic 398.556  mare
5            Bihor 358.8 274.8 264.8 10.030 150.3   mic 392.596 mediu
6  Bistrita-Nasaud 176.7 134.8 128.8  6.042  57.6   mic 189.511 mediu
7         Botosani 237.6 152.7 145.1  7.614  48.9   mic 258.043 mediu
8           Braila 190.1 129.2 119.7  9.519  61.2   mic 207.747 mediu
9           Brasov 357.5 251.4 240.5 10.850 157.2 mediu 382.542  mare
10           Buzau 262.2 190.2 171.8 18.348  77.9   mic 287.512   mic
11        Calarasi 181.8 104.4  95.8  8.599  40.7   mic 195.477   mic
12   Caras-Severin 176.4 117.4 112.1  5.277  50.9   mic 195.780 mediu
13            Cluj 466.8 353.6 343.7  9.938 196.2   mic 492.723  mare
14       Constanta 448.8 303.6 291.6 11.978 169.1   mic 477.652  mare
15         Covasna 130.8  88.7  83.0  5.746  45.4   mic 140.227   mic
16       Dambovita 326.3 204.6 188.4 16.228  72.9   mic 349.625 mediu
17            Dolj 410.8 284.1 257.3 26.755 116.3   mic 439.795 mediu
18          Galati 335.5 202.7 183.4 19.253 103.0   mic 360.907  mare
19         Giurgiu 170.9  93.1  86.6  6.510  31.4   mic 181.197   mic
20            Gorj 214.4 142.0 131.4 10.581  68.9 mediu 232.113   mic
21        Harghita 195.6 138.8 130.5  8.256  59.7   mic 209.311   mic
22       Hunedoara 256.4 189.4 176.9 12.467 106.9   mic 279.161 mediu
23        Ialomita 162.3 104.3  95.8  8.481  40.6   mic 175.379   mic
24            Iasi 515.9 294.9 280.3 14.612 138.0 mediu 530.875  mare
25           Ilfov 289.9 173.3 170.7  2.634 108.3  mare 294.888   mic
26       Maramures 299.1 206.5 199.4  7.140  91.6   mic 325.105 mediu
27       Mehedinti 160.0 114.7 103.2 11.464  40.9   mic 173.296   mic
28           Mures 343.8 244.2 230.2 14.039 119.0   mic 365.223 mediu
29           Neamt 272.5 197.5 185.4 12.114  73.9   mic 297.130 mediu
30             Olt 259.2 175.2     ? 14.435  62.4   mic 281.432 mediu
31         Prahova 470.2 301.8 286.0 15.791 162.3 mediu 509.413 mediu
32           Salaj 133.2 106.5 100.3  6.235  43.7   mic 143.952   mic
33       Satu Mare 217.7 156.2 149.4  6.816     ?   mic 235.942 mediu
34           Sibiu 255.2 191.5 182.6      ? 119.7 mediu 275.688  mare
35         Suceava 381.8     ?     ?      ?  91.7   mic 412.483 mediu
36       Teleorman 209.4     ?     ?      ?  52.1   mic 229.097 mediu
37           Timis 473.4 341.6 336.2  5.433 209.2 mediu 501.983  mare
38          Tulcea 129.6  87.3  82.5  4.775  42.0   mic 141.256   mic
39          Valcea 216.2 170.7 161.1  9.608  71.9     ? 240.262   mic
40          Vaslui 230.3     ?     ?      ?     ?   mic 248.800 mediu
41         Vrancea 200.0     ?     ?      ?     ?   mic 217.090 mediu
(? marks a value that is illegible in the source.)

> set.seed(...)  # seed value illegible in the source
> km <- kmeans(indicatori[, 2:5], 3)  # group the counties into 3 clusters
> km
K-means clustering with 3 clusters of sizes 9, 13, 19

Cluster means:
        RM       PA       PO         S
1 433.8889 295.5444 281.5222 14.034556
2 299.9385 204.9000 192.8000 12.095385
3 186.3684 130.5842 121.4737  9.111842

Clustering vector:
 [1] 3 2 1 2 1 3 3 3 2 2 3 3 1 1 3 2 1 2 3 3 3 2
[23] 3 1 2 2 3 2 2 2 1 3 3 2 1 3 1 3 3 3 3

Within cluster sum of squares by cluster:
[1] 44970.66 34583.13 46227.06
 (between_SS / total_SS =  85.0 %)

Available components:
[1] "cluster"      "centers"      "totss"
[4] "withinss"     "tot.withinss" "betweenss"
[7] "size"         "iter"         "ifault"

> plot(indicatori[, 2], indicatori[, 3], col = km$cluster)

Figure 22 Representation of the 3 clusters

> table(km$cluster, indicatori$CS)
    mare mediu mic
[cell counts illegible in the source]

> km$cluster
 [1] 3 2 1 2 1 3 3 3 2 2 3 3 1 1 3 2 1 2 3 3 3 2
[23] 3 1 2 2 3 2 2 2 1 3 3 2 1 3 1 3 3 3 3
> o <- order(km$cluster)
> data.frame(indicatori$Judet[o], km$cluster[o])
   indicatori.Judet.o. km.cluster.o.
1                Arges             1
2                Bihor             1
3                 Cluj             1
4            Constanta             1
5                 Dolj             1
6                 Iasi             1
7              Prahova             1
8              Suceava             1
9                Timis             1
10                Arad             2
11               Bacau             2
12              Brasov             2
13               Buzau             2
14           Dambovita             2
15              Galati             2
16           Hunedoara             2
17               Ilfov             2
18           Maramures             2
19               Mures             2
20               Neamt             2
21                 Olt             2
22               Sibiu             2
23                Alba             3
24     Bistrita-Nasaud             3
25            Botosani             3
26              Braila             3
27            Calarasi             3
28       Caras-Severin             3
29             Covasna             3
30             Giurgiu             3
31                Gorj             3
32            Harghita             3
33            Ialomita             3
34           Mehedinti             3
35               Salaj             3
36           Satu Mare             3
37           Teleorman             3
38              Tulcea             3
39              Valcea             3
40              Vaslui             3
41             Vrancea             3

Figure 23 Graphical representation of the clusters

Cluster 1 comprises the counties Arges, Bihor, Cluj, Constanta, Dolj, Iasi, Prahova, Suceava and Timis.
Cluster 2 comprises the counties Arad, Bacau, Brasov, Buzau, Dambovita, Galati, Hunedoara, Ilfov, Maramures, Mures, Neamt, Olt and Sibiu.
Cluster 3 comprises the counties Alba, Bistrita-Nasaud, Botosani, Braila, Calarasi, Caras-Severin, Covasna, Giurgiu, Gorj, Harghita, Ialomita, Mehedinti, Salaj, Satu Mare, Teleorman, Tulcea, Valcea, Vaslui and Vrancea.
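The quality figures printed by kmeans() (the within-cluster sums of squares and the between_SS / total_SS ratio) can be recomputed by hand. The following is a minimal sketch on synthetic data, not on the county indicators; the matrix toy and the names km_toy, wss_manual and quality are illustrative assumptions and do not appear in the original analysis.

```r
# Hypothetical toy data: three well-separated groups of 20 points each.
set.seed(1)
toy <- rbind(
  matrix(rnorm(40, mean = 0,  sd = 0.5), ncol = 2),
  matrix(rnorm(40, mean = 5,  sd = 0.5), ncol = 2),
  matrix(rnorm(40, mean = 10, sd = 0.5), ncol = 2)
)
colnames(toy) <- c("x1", "x2")

# nstart = 10 repeats the random initialisation and keeps the best run.
km_toy <- kmeans(toy, centers = 3, nstart = 10)

# Within-cluster sum of squares, recomputed from the definition:
# squared Euclidean distance of every member to its own centroid.
wss_manual <- sapply(seq_len(3), function(k) {
  members <- toy[km_toy$cluster == k, , drop = FALSE]
  sum(sweep(members, 2, km_toy$centers[k, ])^2)
})

# Share of the total variability explained by the partition
# (the "between_SS / total_SS" line of the printed output).
quality <- km_toy$betweenss / km_toy$totss

print(wss_manual)
print(km_toy$withinss)
print(quality)
```

A ratio close to 1, like the 85.0 % reported above, indicates that most of the variability lies between the clusters rather than inside them.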
> plot(indicatori$RM, indicatori$PA, xlab = "Resursa de munca", ylab = "Populatia activa", col = km$cluster)
> points(km$centers[, c(1, 2)], col = 1:3, pch = 8, cex = 2)
> text(x = indicatori$RM, y = indicatori$PA, labels = indicatori$Judet, col = km$cluster)

Figure 24 Cluster centroids

OUTLIER DETECTION USING THE C-MEANS METHOD

> indicatori <- read.csv("indicatori.csv")
> indicatori2 <- indicatori[, 2:6]
> indicatori2
      RM    PA    PO      S   NMS
1  208.0 172.1 159.6 12.508  79.0
2  270.6 218.2 212.1  6.071 119.3
3  378.5 259.1 243.8 15.334 131.3
4  371.0 219.0 204.2 14.809  97.3
5  358.8 274.8 264.8 10.030 150.3
6  176.7 134.8 128.8  6.042  57.6
7  237.6 152.7 145.1  7.614  48.9
8  190.1 129.2 119.7  9.519  61.2
9  357.5 251.4 240.5 10.850 157.2
10 262.2 190.2 171.8 18.348  77.9
11 181.8 104.4  95.8  8.599  40.7
12 176.4 117.4 112.1  5.277  50.9
13 466.8 353.6 343.7  9.938 196.2
14 448.8 303.6 291.6 11.978 169.1
15 130.8  88.7  83.0  5.746  45.4
16 326.3 204.6 188.4 16.228  72.9
17 410.8 284.1 257.3 26.755 116.3
18 335.5 202.7 183.4 19.253 103.0
19 170.9  93.1  86.6  6.510  31.4
20 214.4 142.0 131.4 10.581  68.9
21 195.6 138.8 130.5  8.256  59.7
22 256.4 189.4 176.9 12.467 106.9
23 162.3 104.3  95.8  8.481  40.6
24 515.9 294.9 280.3 14.612 138.0
25 289.9 173.3 170.7  2.634 108.3
26 299.1 206.5 199.4  7.140  91.6
27 160.0 114.7 103.2 11.464  40.9
28 343.8 244.2 230.2 14.039 119.0
29 272.5 197.5 185.4 12.114  73.9
30 259.2 175.2     ? 14.435  62.4
31 470.2 301.8 286.0 15.791 162.3
32 133.2 106.5 100.3  6.235  43.7
33 217.7 156.2 149.4  6.816     ?
34 255.2 191.5 182.6      ?     ?
35 381.8     ?     ?      ?     ?
36 209.4     ?     ?      ?     ?
37 473.4 341.6 336.2  5.433 209.2
38 129.6  87.3  82.5  4.775  42.0
39 216.2 170.7 161.1  9.608  71.9
40 230.3     ?     ?      ?     ?
41 200.0     ?     ?      ?     ?
(? marks a value that is illegible in this listing.)

> kmeans.result <- kmeans(indicatori2, centers = 3)
> kmeans.result$centers
        RM       PA       PO        S    NMS
1 186.3684 130.5842 121.4737        ? 53.130
2 431.1889 296.1000        ?        ?      ?
3 301.8077 204.5154        ?        ?      ?
> kmeans.result$cluster
[output illegible in the source]
> centers <- kmeans.result$centers[kmeans.result$cluster]
> centers
 [1] 186.3684 301.8077 431.1889 301.8077 431.1889
 [6] 186.3684 186.3684 186.3684 431.1889        ?
[11] 186.3684 186.3684 431.1889 431.1889        ?
[16] 301.8077 431.1889 301.8077 186.3684        ?
[21] 186.3684 301.8077 186.3684 431.1889        ?
[26] 301.8077 186.3684 301.8077 301.8077        ?
[31] 431.1889 186.3684 186.3684 301.8077 301.8077
[36] 186.3684 431.1889 186.3684 186.3684 186.3684
[41] 186.3684
> distante <- sqrt(rowSums((indicatori2 - centers)^2))
> distante
[the first 15 values are missing in the source]
[16] 396.0619 561.2575 379.9204 274.3288 224.6684
[21] 230.7340 389.8799 261.5521 554.9902 400.9930
[26] 388.0651 253.9710 355.5828 401.4401 421.4584
[31] 533.1039 263.4691 218.7092 384.1052 374.4397
[36] 219.6024 499.3776 278.6472 215.1356 229.8818
[41] 232.8332
> # Euclidean distance from each observation to its centroid
> outliers <- order(distante, decreasing = T)[1:5]
> # sorts the distances in decreasing order and keeps the first 5
> outliers
[1]  3  9 17  5 24
> print(indicatori2[outliers, ])
      RM    PA    PO      S   NMS
3  378.5 259.1 243.8 15.334 131.3
9  357.5 251.4 240.5 10.850 157.2
17 410.8 284.1 257.3 26.755 116.3
5  358.8 274.8 264.8 10.030 150.3
24 515.9 294.9 280.3 14.612 138.0
> plot(indicatori2[, c("RM", "PA")], pch = "o", col = kmeans.result$cluster, cex = 0.3)
> points(kmeans.result$centers[, c("RM", "PA")], col = 1:3, pch = 8, cex = 1.5)  # centroids
> points(indicatori2[outliers, c("RM", "PA")], pch = "+", cex = 1.5)
> # displays the outliers with +

Figure 25 Outliers displayed with +

Note on the transcript above: kmeans.result$centers[kmeans.result$cluster] has no trailing comma, so it indexes the matrix of centers as a plain vector and returns only the RM coordinate of each observation's centroid, as the printed centers vector shows. That vector is then recycled across all five columns of indicatori2, so the values in distante are not Euclidean distances to the full centroids (for example, row 17 gives sqrt of the sum of squared differences between every indicator and 431.1889, which is 561.2575). The intended call is kmeans.result$centers[kmeans.result$cluster, ], which returns one complete centroid row per observation; the outliers listed above should be re-derived with that form.

THE K-MEDOIDS ALGORITHM

This is a variant of the k-means algorithm in which medoids, rather than centroids, are chosen as the cluster prototypes. A medoid of a cluster is the element closest to [the rest of the sentence is cut off in the source].

> library(fpc)
> indicatori <- read.csv("C:/indicatori.csv")
> datele <- subset(indicatori, select = -c(1, 6, 7, 8, 9))
> pamk.rezultat <- pamk(datele)
> pamk.rezultat
$pamobject
[medoids and clustering vector illegible in the source]
Objective function:
   build     swap
76.56172        ?

Available components:
 [1] "medoids"    "id.med"     "clustering" "objective"  "isolation"
 [6] "clusinfo"   "silinfo"    "diss"       "call"       "data"

$nc
[1] 2

$crit
 [1] 0.0000000 0.5785071 0.4973523         ?
 [5] 0.5092449         ? 0.4574886         ?
 [9] 0.4331718 0.4218813

> pamk.rezultat$nc  # displays the number of clusters
[1] 2
> # CS
> table(indicatori$CS, pamk.rezultat$pamobject$clustering)

         1  2
  mare   1  0
  mediu  2  5
  mic   22 11
> plot(pamk.rezultat$pamobject)
Hit <Return> to see next plot: layout(matrix(c(1, 2), 1, 2))
Hit <Return> to see next plot: # we can also build 4 clusters
> pam.result <- pamk(datele[, -5], 4)
> table(indicatori$CS, pam.result$pamobject$clustering)

         1  2  3  4
  mare   1  0  0  0
  mediu  1  2  1  3
  mic   14  5 12  2
> plot(pam.result$pamobject)
Hit <Return> to see next plot: # NREM
Hit <Return> to see next plot: table(indicatori$NREM, pamk.rezultat$pamobject$clustering)
[output illegible in the source]

> library(cluster)
> indicatori <- read.csv("...")  # path truncated in the source
> datele1 <- subset(indicatori, ...)  # arguments truncated in the source
> pam.rezultat <- pam(datele1, 3)
> pam.rezultat
Medoids:
   ID    RM    PA    PO      S
22 22 256.4 189.4 176.9 12.467
17 17 410.8 284.1 257.3 26.755
12 12 176.4 117.4 112.1  5.277
Clustering vector:
[values illegible in the source]
Objective function:
   build     swap
49.60403 48.94952

Available components:
 [1] "medoids"    "id.med"     "clustering" "objective"  "isolation"
 [6] "clusinfo"   "silinfo"    "diss"       "call"       "data"
> par(mfrow = c(2, 1))
> plot(pam.rezultat)

Figure 26 Distance between clusters
[Upper panel: clusplot of pam(x = datele1, k = 3); the two components explain 98.82 % of the point variability. Lower panel: silhouette plot of pam(x = datele1, k = 3) with 3 clusters; legible entries are cluster 2: 12 | 0.47 and cluster 3: 13 | 0.58; average silhouette width: 0.5.]

THE FUZZY C-MEANS ALGORITHM

This type of clustering addresses situations in which the data cannot be divided into regions with sharply defined boundaries. The clusters are therefore no longer crisp subsets of the set of observations but fuzzy subsets of it. The membership of an object in a cluster is expressed by a degree of membership, which requires a fuzzy partition.

The membership component lists the degrees of membership in the 3 clusters for every observation. For example, in the output shown here the first object has a degree of membership of about 0.03 in the first cluster, 0.64 in the second and 0.33 in the third. The first object (county) therefore belongs most strongly to cluster 2 and least strongly to cluster 1.

> # fuzzy c-means algorithm
> library(e1071)
> indicatori3 <- subset(indicatori, select = -c(1, 7, 8, 9))
> result <- cmeans(indicatori3[, -7], 3, 100, m = 2, method = "cmeans")
> result
Fuzzy c-means clustering with 3 clusters

Cluster centers:
        RM       PA       PO         S       NMS
1 448.1132 303.8393 290.3743 13.469312 161.33900
2 182.6546 126.4278 117.6578  8.771036  51.79791
3 296.4308 202.9703 190.9762 11.993525  97.09761

Memberships:
             1            2           3
1  0.033700529 0.6358441295 0.330455341
2  0.036288792 0.0572912613 0.906419947
3  0.542624894 0.0688233449 0.388551761
4  0.179472430 0.0825524187 0.737975151
5  0.599473842 0.0682452546 0.332280903
6  0.001920994 0.9871425631 0.010936443
7  0.030079960 0.6789485533 0.290971487
8  0.001137842 0.9921579862 0.006704171
9  0.435592757 0.0810621484 0.483345094
10 0.026100808 0.1268560192 0.847043173
11 0.006421045 0.9637416417 0.029837314
12 0.001070906 0.9935041534 0.005424941
13 0.896840444 0.0302761883 0.072883368
14 0.998236670 0.0004401705 0.001323160
15 0.023448713 0.8895817908 0.086969496
16 0.032418085 0.0431380588 0.924443856
17 0.795864228 0.0400780668 0.164057705
18 0.040562409 0.0423072204 0.917130370
19 0.013627002 0.9295128153 0.056860183
20 0.013363694 0.8830718427 0.103564463
21 0.004183402 0.9683838072 0.027432791
22 0.027261833 0.1121856353 0.860552532
23 0.008279851 0.9557003510 0.036019797
24 0.900081837 0.0276352713 0.072282891
25 0.023613816 0.0709584460 0.905427738
26 0.003190661 0.0050684745 0.991740864
27 0.005673274 0.9687025807 0.025624145
28 0.213884127 0.0744292167 0.711686656
29 0.017782143 0.0593792177 0.922838639
30 0.036875341 0.2837222198 0.679402439
31 0.987094090 0.0033157905 0.009590120
32 0.015749600 0.9212845043 0.062965896
33 0.026032090 0.7233493224 0.250618588
34 0.032742899 0.1129332997 0.854323801
35 0.364339724 0.0863880211 0.549272255
36 0.025929481 0.7520125963 0.222057923
37 0.901332223 0.0288976206 0.069770157
38 0.024492722 0.8855740240 0.089933254
39 0.033711545 0.6072323304 0.359056125
40 0.023989898 0.7671600949 0.208850007
41 0.008532047 0.9320852694 0.059382684

Closest hard clustering:
[values illegible in the source]

Available components:
[list illegible in the source]
> plot(indicatori3[, 1], indicatori3[, 2], ...)  # remaining arguments truncated in the source
> points(rezultat$centers[, c(1, 2)], ...)  # remaining arguments truncated in the source
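The degrees of membership discussed above follow from the distances between an observation and the cluster centers. The sketch below applies the standard fuzzy c-means membership formula with fuzzifier m to fixed centers, so that each row of the result sums to 1. It is an illustration under stated assumptions, not the internals of cmeans() from e1071: the function fuzzy_memberships and the toy points and centers are hypothetical names and data introduced here.

```r
# Degrees of membership for fixed centers, using the usual rule
#   u[i, k] = d[i, k]^(-2/(m-1)) / sum_j d[i, j]^(-2/(m-1))
# where d[i, k] is the Euclidean distance from point i to center k.
fuzzy_memberships <- function(x, centers, m = 2) {
  x <- as.matrix(x)
  centers <- as.matrix(centers)
  # n x c matrix of distances from every point to every center
  d <- matrix(
    sapply(seq_len(nrow(centers)), function(k) {
      sqrt(rowSums(sweep(x, 2, centers[k, ])^2))
    }),
    nrow = nrow(x)
  )
  # Guard against division by zero when a point coincides with a center
  d <- pmax(d, .Machine$double.eps)
  w <- d^(-2 / (m - 1))
  w / rowSums(w)
}

# Hypothetical example: three points on a line between two centers
pts <- rbind(c(1, 0), c(2, 0), c(3, 0))
ctr <- rbind(c(0, 0), c(4, 0))
u <- fuzzy_memberships(pts, ctr, m = 2)
print(round(u, 3))

# Hard assignment: the cluster with the largest degree of membership
# (the middle point is equidistant, so which.max picks the first cluster)
hard <- apply(u, 1, which.max)
print(hard)
```

With m = 2 the weights are inverse squared distances, so a point three times farther from one center than from the other receives memberships 0.9 and 0.1. A row such as observation 9 in the output above (0.44, 0.08, 0.48) corresponds to a point lying almost midway between two centers.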
[Figure: scatter plot of the fuzzy clusters; x-axis indicatori3[, 1]]

HIERARCHICAL CLUSTERING

Hierarchical clustering creates a sequence of nested partitions, which can be visualised either as a tree or as a set of nested partitions. The clusters vary with the level of the hierarchy: at the lowest level (the leaves) each point forms its own cluster, while at the top level a single cluster contains all the points.

There are two algorithmic approaches to this type of clustering: agglomerative and divisive. The agglomerative strategy works bottom-up: it starts from the n points, each in its own cluster, and merges clusters until all elements belong to the same cluster. The divisive strategy works top-down: it starts with all points in one cluster and splits clusters successively until every point is in a different cluster.

> fit <- hclust(d, method = "ward.D")  # computes the distance between clusters using Ward's method
> plot(fit)
> groups <- cutree(fit, k = 3)  # cuts the dendrogram into 3 clusters
> groups  # displays the 3 clusters
[cluster assignments illegible in the source]

> dendograma = hclust(dist(indicatori4))
> dendograma

Call:
hclust(d = dist(indicatori4))

Cluster method   : complete
Distance         : euclidean
Number of objects: 41

> plot(dendograma)
> plot(as.dendrogram(dendograma), horiz = FALSE)
> rect.hclust(dendograma, k = 3, border = "red")
> cutree(dendograma, k = 3)
[cluster assignments illegible in the source]

> library(cluster)
> esantion <- as.matrix(iris[, -5][1:25, ])
> esantion
[Output: the first 25 rows of iris with the columns Sepal.Length, Sepal.Width, Petal.Length and Petal.Width; the printed values are garbled in the source.]
> plot(hclust(dist(esantion), method = "single"))  # computes the distance between 2 clusters with the single-link method
> plot(as.dendrogram(hclust(dist(esantion), method = "single")), horiz = T, main = "dist(esantion)\n single")

> hc = hclust(dist(indicatori4))
> plot(hc)
> plot(hc, hang = -1)
> hcd = as.dendrogram(hc)
> op = par(mfrow = c(2, 1))
> plot(hcd)
> plot(hcd, type = "triangle")
> par(op)
> op = par(mfrow = c(2, 1))  # par() places several plots on the same page; mfrow = c(nrows, ncols) creates a grid of plots filled row by row
> plot(cut(hcd, h = 75)$upper, main = "Arborele superior la inaltime 75")  # shows the upper part of the dendrogram, above height 75
> plot(cut(hcd, h = 75)$lower[[2]], main = "... ramura 2")  # shows the second branch of the dendrogram below height 75; the start of the title is cut off in the source
> par(op)  # restores the initial settings

> d <- dist(indicatori4, method = "euclidean")
> d
[Output: lower triangle of the 41 x 41 matrix of pairwise Euclidean distances between the counties. The printed values are scrambled in the source and cannot be reliably recovered.]

> d <- dist(indicatori4, method = "manhattan")
> d
[Output: lower triangle of the 41 x 41 matrix of pairwise Manhattan distances between the counties. The printed values are scrambled in the source and cannot be reliably recovered.]

> fit <- hclust(d, method = "ward.D")  # computes the distance between clusters using Ward's method
> plot(fit)

[Figure: Cluster Dendrogram, hclust (*, "ward.D")]

> Groups <- cutree(fit, k = 3)  # cuts the dendrogram into 3 clusters
> Groups  # displays the 3 clusters
[cluster assignments illegible in the source]
> plot(hcd)  # another way to view the dendrogram
> rect.hclust(fit, k = 3, border = "red")  # the 3 clusters are outlined in red
> hcd = as.dendrogram(fit)
> hcd  # describes the dendrogram
'dendrogram' with 2 branches and 41 members total, at height 4911.018
> plot(hcd, type = "triangle")  # dendrogram in triangular form
> op <- par(mfrow = c(2, 1))
> # par() places several plots on the same page; mfrow = c(nrows, ncols) creates a grid of plots filled row by row
> plot(cut(hcd, h = 1000)$upper, main = "Upper tree of cut at h=1000")  # shows the upper part of the dendrogram, above height 1000
> plot(cut(hcd, h = 1000)$lower[[2]], main = "Second branch of lower tree with cut at h=1000")  # shows the second branch of the dendrogram below height 1000

[Figure, two panels: "Upper tree of cut at h=1000" and "Second branch of lower tree with cut at h=1000"]
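The steps used in this section (a distance matrix, a linkage method, then cutting the tree) can be tried on a small built-in data set. The sketch below uses the first 25 rows of iris, the same kind of sample as esantion, because the county file indicatori4 is not available here. All object names (sample_rows, d_euc, d_man, hc_complete, hc_ward and so on) are illustrative assumptions, and the cut height h = 1 is an arbitrary choice for demonstration.

```r
# First 25 iris rows, numeric columns only
sample_rows <- as.matrix(iris[1:25, -5])

# Two of the distance measures used in the text
d_euc <- dist(sample_rows, method = "euclidean")
d_man <- dist(sample_rows, method = "manhattan")

# Agglomerative clustering with two different linkage criteria
hc_complete <- hclust(d_euc)                     # default: complete linkage
hc_ward     <- hclust(d_euc, method = "ward.D")  # Ward's criterion

# Cut each tree into 3 clusters
groups_complete <- cutree(hc_complete, k = 3)
groups_ward     <- cutree(hc_ward, k = 3)

# Cross-tabulate the two partitions to see where they agree
agreement <- table(complete = groups_complete, ward = groups_ward)
print(agreement)

# A tree can also be cut at a height instead of a cluster count
groups_by_height <- cutree(hc_complete, h = 1)
print(table(groups_by_height))

# Dendrogram with the 3 Ward clusters outlined
plot(hc_ward, hang = -1)
rect.hclust(hc_ward, k = 3, border = "red")
```

Because the linkage criterion changes which clusters are merged first, the two partitions need not coincide; the cross-tabulation makes any disagreement visible without relying on the cluster labels, which are arbitrary.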
