Present Data Part II KS141209 Statistics Institut Teknologi Sepuluh Nopember (ITS) 1 Review Central Tendency Dispersion
KS141209 Statistics Institut Teknologi Sepuluh Nopember (ITS) 2 Descriptive Statistics KS141209 Statistics Institut Teknologi Sepuluh Nopember (ITS) 3
Central Tendency Mean Median Modus Dispersion Range Mean Deviation Variance Standard Deviation Form Distribution Skewness Skewness is a measure of the asymmetry of the probability distribution of a real-valued random variable about its mean. The skewness value can be positive or negative, or even undefined KS141209 Statistics Institut Teknologi Sepuluh Nopember (ITS) 4 Shape and Distribution One advantage of knowing all those measurement is explaining how data is distributed.
KS141209 Statistics Institut Teknologi Sepuluh Nopember (ITS) 5 Mean = Median =Mode
Mean < Median < Mode Mode < Median < Mean Right-Skewed Left-Skewed Symmetric Shape and Distribution Box-and-whisker plot Graphical display of data using 5-number summary
KS141209 Statistics Institut Teknologi Sepuluh Nopember (ITS) 6 Median( ) 4 6 8 10 12 X largest X smallest 1 Q 3 Q 2 Q Shape and Distribution KS141209 Statistics Institut Teknologi Sepuluh Nopember (ITS) 7 Right-Skewed Left-Skewed Symmetric 1 Q 1 Q 1 Q 2 Q 2 Q 2 Q 3 Q 3 Q 3 Q Shape and Distribution Berikut adalah data pembayaran (dalam ribuan) yang dilakukan oleh 40 pasien di sebuah puskesmas.
KS141209 Statistics Institut Teknologi Sepuluh Nopember (ITS) 8 Shape and Distribution Mean: 66.28 Median: 65.5 Mode: 55 KS141209 Statistics Institut Teknologi Sepuluh Nopember (ITS) 9 Mode < Median < Mean Right-Skewed Mean < Median < Mode Left-Skewed Mean = Median =Mode
Symmetric Shape and Distribution Minimum: 43 Q1: 59.5 Median (Q2): 65.5 Q3: 72.5 Maximum: 92 KS141209 Statistics Institut Teknologi Sepuluh Nopember (ITS) 10 92 43 65.5 59.5 72.5 43 92 67.5 55.25 79.75 Skewness Beside using visualization recognition, there is a formula to describe skewness of data (Fishers Measure of Skewness).
Skewness 0 (Symmetric) Skewness > 1 (Asymmetric - Right skew) Skewness < 1 (Asymmetric - Left skew) KS141209 Statistics Institut Teknologi Sepuluh Nopember (ITS) 11 Skewness Skewness dari data pembayaran puskesmas adalah 0.221, maka ia termasuk right skew. Sesuai dengan hasil pengenalan shape and distribution. KS141209 Statistics Institut Teknologi Sepuluh Nopember (ITS) 12 2002 Prentice-Hall, Inc.
Chap 2-13 Tabulating and Graphing Categorical Data: Univariate Data Categorical Data Tabulating Data The Summary Table Graphing Data Pie Charts Pareto Diagram Bar Charts 2002 Prentice-Hall, Inc.
Chap 2-14 Summary Table (for an Political Parties) Investment Category Total Percentage (in thousands) (in %)
Republics 46.5 42.27
Labors 32 29.09
Democrats 15.5 14.09
Teachers 16 14.55
Total 110 100
Variables are Categorical 2002 Prentice-Hall, Inc.
Chap 2-15 Graphing Categorical Data: Univariate Data Categorical Data Tabulating Data The Summary Table 0 10 20 30 40 50 S t oc k s B onds S avi ngs CD Graphing Data Pie Charts Pareto Diagram Bar Charts 0 5 10 15 20 25 30 35 40 45 S t oc k s B onds S avi ngs CD 0 20 40 60 80 100 120 2002 Prentice-Hall, Inc.
Chap 2-16 Bar Chart (for Political Parties) 0 10 20 30 40 50 Republics Labors Democrats Teachers Amount in thousands Political Parties 2002 Prentice-Hall, Inc.
Chap 2-17 Pie Chart (for Political Parties) Percentages are rounded to the nearest percent. Amount in Thousands Teachers 15% Democrats 14% Labors 29% Republics 42% 2002 Prentice-Hall, Inc.
Chap 2-18 Pareto Diagram Axis for line graph shows cumulative % chosen Axis for bar chart shows % chosen in each category 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% 0% 5% 10% 15% 20% 25% 30% 35% 40% 45% Stocks Bonds Savings CD Republics Labors Teachers Democrats 2002 Prentice-Hall, Inc.
Chap 2-19 Tabulating and Graphing Bivariate Categorical Data Contingency tables: choosers in thousands Political Parties Perth Sydney Canberra Total Category
Republics 46.5 55 27.5 129
Labors 32 44 19 95
Democrats 15.5 20 13.5 49
Teachers 16 28 7 51
Total 110 147 67 324
2002 Prentice-Hall, Inc.
Chap 2-20 Tabulating and Graphing Bivariate Categorical Data Side by side charts 0 10 20 30 40 50 60 Republics Labors Teachers Democrats Comparing Investors Canberra Sydney Perth 2002 Prentice-Hall, Inc.
Chap 2-21 Principles of Graphical Excellence Presents data in a way that provides substance, statistics and design Communicates complex ideas with clarity, precision and efficiency Gives the largest number of ideas in the most efficient manner Almost always involves several dimensions Tells the truth about the data 2002 Prentice-Hall, Inc.
Chap 2-22 Using chart junk Failing to provide a relative basis in comparing data between groups Compressing the vertical axis Providing no zero point on the vertical axis Errors in Presenting Data 2002 Prentice-Hall, Inc.
Chap 2-23 Chart Junk Good Presentation 1960: $1.00 1970: $1.60 1980: $3.10 1990: $3.80 Minimum Wage Minimum Wage 0 2 4 1960 1970 1980 1990 $ Bad Presentation
2002 Prentice-Hall, Inc.
Chap 2-24 Compressing Vertical Axis Good Presentation Quarterly Sales Quarterly Sales Bad Presentation 0 25 50 Q1 Q2 Q3 Q4 $ 0 100 200 Q1 Q2 Q3 Q4 $
2002 Prentice-Hall, Inc.
Chap 2-25 No Zero Point on Vertical Axis Good Presentation Monthly Sales Monthly Sales Bad Presentation 0 39 42 45 J F M A M J $ 36 39 42 45 J F M A M J $ Graphing the first six months of sales. 36
Chap 2-27 Frequency Distributions, Relative Frequency Distributions and Percentage Distributions
Class Frequency 10 but under 20 3 .15 15 20 but under 30 6 .30 30 30 but under 40 5 .25 25 40 but under 50 4 .20 20 50 but under 60 2 .10 10 Total 20 1 100 Relative Frequency Percentage Data in ordered array: 12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58 2002 Prentice-Hall, Inc.
Chap 2-28 Graphing Numerical Data: The Histogram Histogram 0 3 6 5 4 2 0 0 1 2 3 4 5 6 7 5 15 25 36 45 55 More F r e q u e n c y Data in ordered array: 12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58 No Gaps Between Bars Class Midpoints Class Boundaries 2002 Prentice-Hall, Inc.
Chap 2-29 Graphing Numerical Data: The Frequency Polygon F r e q u e n c y 0 1 2 3 4 5 6 7 5 1 5 2 5 3 6 4 5 5 5 M o r e Class Midpoints Data in ordered array: 12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58 2002 Prentice-Hall, Inc.
Chap 2-30 Tabulating Numerical Data: Cumulative Frequency Cumulative Cumulative Class Frequency % Frequency 10 but under 20 3 15 20 but under 30 9 45 30 but under 40 14 70 40 but under 50 18 90 50 but under 60 20 100 Data in ordered array: 12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58 2002 Prentice-Hall, Inc.
Chap 2-31 Graphing Numerical Data: The Ogive (Cumulative % Polygon) Ogive 0 20 40 60 80 100 10 20 30 40 50 60 Class Boundaries (Not Midpoints) Data in ordered array: 12, 13, 17, 21, 24, 24, 26, 27, 27, 30, 32, 35, 37, 38, 41, 43, 44, 46, 53, 58 2002 Prentice-Hall, Inc.
Chap 2-32 Organizing Numerical Data Numerical Data Ordered Array Stem and Leaf Display Frequency Distributions Cumulative Distributions Histograms Polygons Ogive Tables 2 144677 3 028 4 1 41, 24, 32, 26, 27, 27, 30, 24, 38, 21 21, 24, 24, 26, 27, 27, 30, 32, 38, 41 2002 Prentice-Hall, Inc.
Chap 2-33 Data in raw form (as collected): 126, 224, 321, 27, 127, 230, 341, 32, 138 Data in ordered array from smallest to largest:
Stem-and-leaf display:
Organizing Numerical Data (continued) 2 144677 3 028 4 1 2002 Prentice-Hall, Inc.
Chap 2-34 Graphing Bivariate Numerical Data (Scatter Plot) Mutual Funds Scatter Plot 0 10 20 30 40 0 10 20 30 40 Net Asset Values T o t a l
Y e a r
t o
D a t e
R e t u r n
( % ) Summary KS141209 Statistics Institut Teknologi Sepuluh Nopember (ITS) 35 Univariate Bivariate Numerical Line Stem and Leaf Scatter Categorical Bar Pie Chart Contingency Table Grouped Data Frequency Table Polygon Cumulative Freq. Table Ogive Soal Latihan 1 Seribu warga Minnesota dimintai pendapat tentang musim yang paling mereka sukai. Hasilnya adalah 100 orang menyukai musim dingin, 300 orang penyuka musim semi, 400 orang menyukai musim panas dan 200 orang menyukai musim gugur. Dalam membuat tabel frekuensi, berapa kelas yang harus dibuat untuk data tersebut? Bagaimana frekuensi relatif untuk tiap kelas? Bagaimana frekuensi kumulatif untuk tiap kelas? Soal Latihan 2 Wellstone Inc., ingin membuat dan memasarkan rangka ponsel dalam 5 warna: putih, hitam, hijau, oranye dan merah. Sebelum melakukan produksi masal, perusahaan tersebut membuka gerai mall dan mendapatkan hasil dari survey singkat sbb: Putih 130 Hitam 104 Kuning 325 Hijau 455 Merah 286 Disebut apa tabel disamping? Buatlah diagram batangnya Buatlah diagram pienya. Jika Wellstone Inc. berencana memproduksi 1 juta ponsel, berapa rangka dari tiap warna harus diproduksi? Bingkisan Dikerjakan personal (tiap anak satu kolom saja) Membuat histogram dan polygon frekuensi kumulatif (ogive) dari tabel A2 untuk kolom X4 hingga X12 (9 variasi). Dikerjakan secara komputerisasi. Dicetak berwarna di kertas A4 1 lembar (tabel generetan dan Histogramnya). Nama NRP diletakkan di setiap halaman (apabila lebih dari satu halaman) Dikumpulkan Selasa di Ketua Kelas. Letakkan di Loker Pembagian NRP KS141209 Statistics Institut Teknologi Sepuluh Nopember (ITS) 39 Available on scribd.com/rvinarti X4 X5 X6 X7 X8 X9 X10 X11 X12 Kisi-kisi UTS KS141209 Statistics Institut Teknologi Sepuluh Nopember (ITS) 40 Populasi Sample Skala Data Descriptive Statistics Present Data Applied Statistics -> Survey / Questionnaire SPSS Output