
Data Analysis Using SPSS
Descriptive Statistics

Alexander Ploner
Institute for Mathematics and Statistics
16 April 2002

Example: Maine

Mercury levels in fish
Limits: 0.43 ppm (state), 1.0 ppm (federal)
125 of 1073 accessible lakes sampled (total: 2314 lakes)
Mixed samples from 3 to 5 predatory fish
Variables:
  dependent: Hg content
  independent: lake properties


Statistical Measures

Location
Spread/variation
Shape: symmetry, flatness
Comparison with standard error

Measures of Location

Mode
Mean
Median
Quantiles (quartiles)
Robust estimators: trimmed mean, M-estimators
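
The location measures listed above can be tried out with Python's standard library. This is only a sketch: the sample values are hypothetical (they are not the Maine data), `trimmed_mean` is a helper written for this illustration, and the quartiles use the default method of `statistics.quantiles`, which may differ slightly from the definition SPSS applies.

```python
import statistics

def trimmed_mean(values, proportion=0.1):
    """Mean after dropping the given proportion of values from each end."""
    data = sorted(values)
    k = int(len(data) * proportion)  # number of values cut from each tail
    trimmed = data[k:len(data) - k] if k > 0 else data
    return statistics.mean(trimmed)

# Hypothetical sample with one large outlier (illustration only).
sample = [0.2, 0.3, 0.3, 0.4, 0.5, 0.5, 0.5, 0.6, 0.7, 2.5]

mode = statistics.mode(sample)
mean = statistics.mean(sample)
median = statistics.median(sample)
quartiles = statistics.quantiles(sample, n=4)  # three cut points: Q1, Q2, Q3
robust = trimmed_mean(sample, proportion=0.1)

print(f"mode={mode}, mean={mean:.3f}, median={median}")
print(f"quartiles={quartiles}")
print(f"10% trimmed mean={robust:.3f}")
```

The single outlier pulls the mean above the median, while the trimmed mean stays close to the bulk of the data, which is the reason robust estimators appear on the list.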


Measures of Variation

Variance
Standard deviation
(Range)
Interquartile range

Measures of Shape

Skewness/Symmetry:
  < 0: long tail to the left
  > 0: long tail to the right
Kurtosis (excess kurtosis, so 0 for a normal distribution):
  < 0: flatter than the normal distribution
  > 0: peakier than the normal distribution
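
As a rough illustration of the variation and shape measures, the sketch below computes them for a hypothetical sample. The helpers `central_moment`, `skewness` and `excess_kurtosis` are written for this example and use plain moment-based formulas; statistical packages typically apply small-sample corrections, so their output may differ slightly.

```python
import statistics

def central_moment(values, order):
    """Population central moment of the given order."""
    mean = sum(values) / len(values)
    return sum((x - mean) ** order for x in values) / len(values)

def skewness(values):
    """Moment-based skewness m3 / m2**1.5 (no small-sample correction)."""
    return central_moment(values, 3) / central_moment(values, 2) ** 1.5

def excess_kurtosis(values):
    """Moment-based kurtosis m4 / m2**2 minus 3, so a normal gives 0."""
    return central_moment(values, 4) / central_moment(values, 2) ** 2 - 3.0

# Hypothetical sample with a long right tail (illustration only).
sample = [0.2, 0.3, 0.3, 0.4, 0.5, 0.5, 0.5, 0.6, 0.7, 2.5]

variance = statistics.variance(sample)   # sample variance (divides by n - 1)
std_dev = statistics.stdev(sample)
value_range = max(sample) - min(sample)
q1, _, q3 = statistics.quantiles(sample, n=4)
iqr = q3 - q1

print(f"variance={variance:.3f}, sd={std_dev:.3f}")
print(f"range={value_range:.2f}, IQR={iqr:.3f}")
print(f"skewness={skewness(sample):.3f}, excess kurtosis={excess_kurtosis(sample):.3f}")
```

The positive skewness reflects the long tail to the right; the range reacts strongly to the single extreme value, while the interquartile range does not.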


Standard Errors (s.e.)

Measure of variation for an estimated parameter p
Rule of thumb for approximately normal behaviour:
  p ± se: about 68% confidence interval for p
  p ± 2se: about 95% confidence interval
  p ± 3se: about 99.7% confidence interval

Graphical Representations of Variables

Interval scale: histogram, stem-and-leaf display, boxplot
Nominal/ordinal scale: barchart, (piechart)
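
The standard-error rule of thumb from the Standard Errors slide can be checked numerically. The sketch below assumes the estimated parameter is a mean, so that se = s / sqrt(n), and assumes a normal sampling distribution; `mean_with_se` and `normal_coverage` are names chosen for this example and the data are hypothetical.

```python
import math
import statistics
from statistics import NormalDist

def mean_with_se(values):
    """Return the sample mean and its standard error s / sqrt(n)."""
    mean = statistics.mean(values)
    se = statistics.stdev(values) / math.sqrt(len(values))
    return mean, se

def normal_coverage(k):
    """Probability that a normal estimate falls within k standard errors."""
    standard_normal = NormalDist()
    return standard_normal.cdf(k) - standard_normal.cdf(-k)

# Hypothetical measurements (illustration only).
sample = [0.41, 0.52, 0.38, 0.47, 0.55, 0.44, 0.49, 0.50]
mean, se = mean_with_se(sample)

for k in (1, 2, 3):
    low, high = mean - k * se, mean + k * se
    print(f"mean ± {k} se: [{low:.3f}, {high:.3f}], coverage ≈ {normal_coverage(k):.4f}")
```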


Graphical Comparisons of Variables

Nominal*Interval: boxplots
Nominal*Nominal: stacked or clustered barchart
  Analysieren/Deskriptive Statistik/Kreuztabellen (Analyze/Descriptive Statistics/Crosstabs)
  Grafiken/Balken/Gruppiert or /Gestapelt (Graphs/Bar/Clustered or /Stacked)
Interval*Interval: scatterplot and scatterplot matrix
  Grafiken/Streudiagramm/Einfach or /Matrix or /3D (Graphs/Scatter/Simple or /Matrix or /3D)
Nominal*Nominal*Interval: clustered boxplots
  Grafiken/Boxplot/Gruppiert (Graphs/Boxplot/Clustered)
Grafiken/Galerie (Graphs/Gallery) for overview!

Interactive Graphics

Highly configurable: barchart, boxplots, scatterplot etc.
Specifically: 3D plot, barchart with error bars


Assumptions & Diagnostics

Checking Common Assumptions About Data

Normal or approximately normal distribution: quantile plot
Equal variances between groups of data: boxplots, spread vs. level plot
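
To make the quantile plot concrete, the sketch below computes the coordinates such a plot would display: sorted observations paired with the quantiles of a normal distribution fitted to the same data. The plotting positions (i - 0.5) / n are one common convention and an assumption here; `normal_quantile_pairs` is a name chosen for this example and the data are hypothetical.

```python
from statistics import NormalDist, mean, stdev

def normal_quantile_pairs(values):
    """Return (theoretical quantile, observed value) pairs for a normal quantile plot."""
    data = sorted(values)
    n = len(data)
    reference = NormalDist(mean(data), stdev(data))
    # Plotting positions (i - 0.5) / n keep probabilities strictly inside (0, 1).
    return [
        (reference.inv_cdf((i - 0.5) / n), observed)
        for i, observed in enumerate(data, start=1)
    ]

# Hypothetical measurements (illustration only).
sample = [3.1, 2.4, 2.9, 3.6, 2.7, 3.3]
pairs = normal_quantile_pairs(sample)

for theoretical, observed in pairs:
    print(f"theoretical={theoretical:.3f}  observed={observed:.3f}")
```

If the data are roughly normal, the pairs lie close to the line observed = theoretical; systematic curvature suggests skewness or heavy tails.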


Testing Assumptions

Normality:
  Kolmogorov-Smirnov test
  Shapiro-Wilk test (small sample)
  H0: normal distribution
Equal variances:
  Levene test
  H0: equal variances
Significant result indicates violation of assumption!

Statistical Tests

Model assumptions: e.g. normal distribution
Null and alternative hypothesis
Calculation of test statistic
Probability of a test statistic at least as extreme as the calculated value, assuming H0 is valid = p-value, empirical significance
Accept or reject H0 depending on fixed significance level α, e.g. 0.05
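
The general test procedure (assumptions, hypotheses, test statistic, p-value, decision) can be walked through with a deliberately simple case. The sketch below uses a two-sided one-sample z-test with a known standard deviation, chosen only because its p-value can be computed with Python's standard library; it is not one of the tests named on these slides. The data, the null value `mu0` and `sigma` are hypothetical.

```python
import math
from statistics import NormalDist, mean

def z_test(values, mu0, sigma, alpha=0.05):
    """Two-sided one-sample z-test; returns (z, p_value, reject_h0).

    Assumes normally distributed data with known standard deviation sigma.
    H0: population mean equals mu0. H1: population mean differs from mu0.
    """
    n = len(values)
    z = (mean(values) - mu0) / (sigma / math.sqrt(n))  # test statistic
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))       # two-sided p-value
    return z, p_value, p_value < alpha                 # decision at level alpha

# Hypothetical measurements (illustration only).
sample = [5.2, 4.9, 5.6, 5.8, 5.1, 5.5, 5.7, 5.4, 5.3]
z, p_value, reject_h0 = z_test(sample, mu0=5.0, sigma=0.3)

print(f"z={z:.3f}, p={p_value:.5f}, reject H0: {reject_h0}")
```

The same reading applies to the assumption tests: a p-value below the chosen significance level leads to rejecting H0, which for a normality or equal-variance test means the assumption looks violated.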


Error Types for Tests

                 Reality
Decision         H0              H1
H0               ok              Type II Error
H1               Type I Error    ok

Type I Error: probability α
Type II Error: probability β
Power: 1 − β = probability of rejecting incorrect H0
Statistical tests control only α!

Transforming Data


Transformations

(Nonlinear) change of scale
Purpose: normality, symmetry, variance homogeneity
Problem: interpretation
Useful: power transformations

Power Transformations

Function of data x for all p ∈ R:

$$ y = \begin{cases} x^p & p \neq 0 \\ \ln(x) & p = 0 \end{cases} $$

Ladder of powers:
  p = 1: identity
  p < 1: moves bump to the right
  p > 1: moves bump to the left
For p ∈ {0, 1, 0.25, 0.5, 2 etc.} often interpretation possible
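
The ladder of powers can be explored with a few lines of Python. This is a minimal sketch assuming strictly positive data; `power_transform` and `skewness` are helpers written for this example, and the sample (powers of two) is hypothetical, chosen so that the logarithm makes it exactly symmetric.

```python
import math

def power_transform(values, p):
    """Apply y = x**p for p != 0 and y = ln(x) for p == 0 (requires x > 0)."""
    if any(x <= 0 for x in values):
        raise ValueError("this sketch assumes strictly positive data")
    if p == 0:
        return [math.log(x) for x in values]
    return [x ** p for x in values]

def skewness(values):
    """Moment-based skewness m3 / m2**1.5 (no small-sample correction)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((x - mean) ** 2 for x in values) / n
    m3 = sum((x - mean) ** 3 for x in values) / n
    return m3 / m2 ** 1.5

# Hypothetical right-skewed sample (illustration only).
sample = [1, 2, 4, 8, 16, 32, 64]

skew_raw = skewness(power_transform(sample, 1))     # identity
skew_sqrt = skewness(power_transform(sample, 0.5))  # square root
skew_log = skewness(power_transform(sample, 0))     # natural logarithm

print(f"p=1: {skew_raw:.3f}, p=0.5: {skew_sqrt:.3f}, p=0: {skew_log:.3f}")
```

Stepping down the ladder (p = 1, 0.5, 0) shrinks the long right tail step by step, which is the sense in which p < 1 moves the bump to the right.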


Example: Cars

For 60 types of cars (1990):
  type (small, compact etc.)
  price
  mileage (miles/gallon)
  reliability (1-5)

Relationships?
Buying decision?
Assumptions?
