Documente Academic
Documente Profesional
Documente Cultură
Types of Data
Quantitative
Qualitative
Quantitative Data Analysis
3
Key components of a Data Analysis Plan
Research objectives
Research questions
Tabular
Graphical / Pictures
4
Statistical Applications
Two main groups of statistical applications are:
1) Descriptive statistics
Summarize data
2) Inferential statistics
Generalize from a sample to a population.
Population includes all cases in which the research is interested
Samples include carefully chosen subsets of the population
Tests hypothesis.
5
Descriptive Statistics
Univariate descriptive statistics include:
Percentages, averages, and various charts and graphs.
Example: On the average, participants are of 30.3 years of age.
Bivariate descriptive statistics describe the strength and direction of
the relationship between two variables.
Example: Older students have higher GPAs.
Multivariate descriptive statistics describe the relationships
between three or more variables.
Example: Grades increase with age for females but not for males.
6
Levels of data analysis
Univariate analysis
Bi-variate analysis
Multivariate analysis
7
Univariate analysis
Involves examination across cases of one variable at a time.
Some techniques used are: percentage, central tendency,
dispersion, etc.
8
Bivariate Analysis
It involves the examination across cases of two variables at a
time to see the relation. Some techniques used are:
percentage, central tendency, dispersion, co-relation,
regression, inferential statistics, etc.
11
About SPSS
SPSS stands for
12
Data Entry into SPSS
Click the SPSS icon and open the SPSS
Data View
Variable View
13
SPSS Data Editor
Data View
Rows are cases
Columns are variables
14
SPSS Data Editor
Variable view
15
Data Entry into SPSS
Use the ‘variable view’ window for data entry
16
Data Entry into SPSS – Variable Name
First column is ‘Name’ - Name of variable
It must be unique
It must start with a letter, Example ‘m1age’
Certain characters can not be used. For example,
we can use ‘_’ but not ‘-’; space not accepted
Name will be used to identify the variable later on
analysis and other manipulations
Variable name should be short but easily
recognizable and visible (nowadays no limit of
number of character)
17
Variable Name- Example
18
Data Entry into SPSS – Variable Type
2nd column is ‘Type’ - Type of variable
19
Type of variable - Example
20
Variable Type- Example
21
Data Entry into SPSS - Width
3rd column is ‘Width’ – What is the number of
categories or values
22
Data entry into SPSS – Example of Width
23
Data Entry into SPSS - Decimals
4th column is ‘Decimals’ – How many decimal points
are there in the values?
25
Data Entry into SPSS – Example of Label
26
Data Entry into SPSS - Values
6th column is ‘Values’ – Numerical value of each
category (if category is there)
Not needed for interval scale variables where
quantitative values are entered directly, but if in
category then needed
Put the code against ‘value’ for specific category
and then write the name of that category against
‘label’; then click OK.
For every category we need to complete same
process
We can enter new category anytime
27
Data Entry into SPSS - Example of ‘value’
and ‘value labels’
Example: Define Value and Labels
How is the law and order situation of the country?
Excellent
Good
Bad
Very bad
We can code this answers as follows:
Code Value
1 Excellent
2 Good
3 Bad
4 Very bad
28
Data Entry into SPSS – Example of values
29
Data Entry into SPSS – Example of value labels
33
Data Entry into SPSS - Align
34
Data Entry into SPSS - Measure
10th column is ‘Measure’ – Level of
measurement in which the variable is measured.
It can be changed anytime even during analysis
35
Using SPSS for Results
36
Output in SPSS
37
SPSS Menu: ‘File’
38
SPSS Menu: ‘Edit’
Works same like edit menu of other softwares
Use ‘Insert Variable’ for insertion of new
variable
Use ‘Insert Cases’ for insertion of new cases
Both can also be done by clicking on screen
Use ‘Options’ for choosing different items
(Display, Output Labels, Tables, etc.) as required
39
SPSS Menu: ‘View’
40
SPSS Menu: ‘Data’
Use ‘Sort Cases’ to arrange cases by ascending or descending
order according to the values of a particular variable
Use ‘Sort Variables’ to arrange variables by ascending or
descending order according to a particular property of a
variable
Use ‘Transpose’ to convert variables into cases or cases into
variables
Use ‘Merge Files (add cases or add variables)’ to add cases or
variables of other file(s) into one file
Use ‘Split file’ to have outputs separately by the values of a
particular values
Use ‘Weight Cases’ to weight particular cases
42
SPSS Menu: ‘Analyze’
Use ‘Descriptive Statistics’ Frequency for frequency
distribution
Number of times a particular value occurs in a data series
Frequency table is arranging data by values and their
corresponding frequencies
We can easily have frequency table using data in SPSS
SPSS frequency output has four columns
1) Value
2) Percent
3) Valid percent
4) Cumulative percent
43
SPSS Menu: ‘Analyze’ - Continue
Use ‘Descriptive Statistics’ Descriptive
for some specific statistics
Use ‘Descriptive Statistics’ Crosstabs for
bi- & multi variate tables
Select desired statistics and statistical tests
45
SPSS Menu: ‘Analyze’ - Continue
48
Frequency Table for Multiple Variables
Determine the frequency of a combination of
variables.
Select: Analyze > Descriptive Statistics > Crosstabs
Rows: Preferably independent variable
Columns: Preferably dependent variable
Layer: Preferably control variable
Send desired variable from variable list to rows or columns or
layer
Select desired statistics by clicking ‘Statics’
Select desired count and percentage by clicking ‘Cells’
Select: Analyze > Descriptive Statistics > Custom Tables
49
Percentage and Proportion
50
Use of Percentage and Proportion
Calculated to standardize the frequencies so that
comparison among values / categories and between
data sets becomes valid
Report relative size in the total data set
Compare the number of cases in a specific category to
the number of cases in all categories.
Compare a part (specific category) to a whole (all
categories).
The part is the numerator (f ).
The whole is the denominator (N).
51
Ratios
Compare the relative sizes of categories.
Compare parts to parts.
Ratio = f1 / f2
f1 = number of cases in first category
f2 = number of cases in second category
Example: In case of 23 females and 19 males:
the ratio of males to females is:
19/23 = 0.83, that is, for every female there are 0.83 males.
the ratio of females to males is:
23/19 = 1.21, that is, for every male, there are 1.21 females.
52
Rate
Expresses the number of actual occurrences of an event
(births, deaths, homicides) vs. the number of possible
occurrences per some unit of time.
Birth rate is the number of births divided by the
population size times 1000 per year.
If a village of 2300 had 17 births last year, the birth rate is:
(17/2300) * 1000 = (.00739) * 1000 = 7.39
That is, the village had 7.39 births for every 1000 residents.
53
Measures of Central Tendency
Central tendency is the trend of individual data of
data set towards the central point of the data set.
It summarizes the data allowing:
Description of data briefly and conveniently
Comparison among groups
Base for advanced statistical analysis
54
Calculate Measures of Central Tendency in SPSS
Select: Analyze Descriptive Statistics
Frequencies Statistics Desired type in Central
tendency
or
Select: Analyze Descriptive Statistics
Descriptives Options Mean
or
Select: Analyze Compare Means Options
Transfer Mean from left box to right box
55
Calculating Mean(s) Using SPSS
56
Calculate Measures of Dispersion in SPSS
Select: Analyze Descriptive Statistics
Frequencies Statistics Desired type in
Dispersion
or
Select: Analyze Descriptive Statistics
Descriptives Options Desired type in Dispersion
or
Select: Analyze Compare Means Options
Transfer Desired type of Dispersion from left box to
right box
57
Producing a charts
Select: Analyze Descriptive Statistics
Frequencies Reset Desired variable
Charts Bar/Pie/Histogram
Frequencies/Percentage OK
58
Chart using Excel
Charts can be constructed easily with more options
using tables produced by through SPSS
1. Open excel
2. Calculate desired table in SPSS
3. Copy the table by clicking right button of mouse in the SPSS
output window
4. Paste the table in Excel
5. Edit the table to make it fit for Chart
6. Select the table
7. Click chart
8. Edit chart according your choice
61
Steps of hypothesis testing
Hypothesis testing – 7 steps
1) Construct null hypothesis
2) Construct alternative hypothesis
3) Select a significance level and a critical region
(region of rejection of the null hypothesis).
Consider:
a) Whether both ends (tails) of the distribution should be
included.
b) How the critical region of a certain size will contribute
to Type I or Type II errors.
4) Select a statistical test to use
5) Calculate the test statistic
6) Find out tabulated value
7) Make a decision (accept of reject null hypothesis)
1) Null hypothesis
Null (nullus – latin): “not any” no difference
between groups
A neutral position
Predicts that two groups will not differ
Denotes by H0
H0: µ1 = µ2
The purpose of statistical test is to evaluate the
null hypothesis (H0) at a specified level of
significance
2) Alternative hypothesis
3 (a) Levels of significance
Level of significance determines the probability that
the observed result of a study is due to the influence
of the independent variable rather than by chance.
A result is “statistically significant” at a certain
level. For example, a result might be significant at
p<.05.
“P” represents the probability that the result was due to chance,
and
.05 represents a 5% probability that the result was due to chance.
Therefore, p<.05 means that the observed results have over a
95% probability of being due to the influence of the
independent variable. Or The hypothesis is correct for 95%
samples
3 (b) Critical region
The set of values of the
test statistic for which the null
hypothesis is rejected.
Values of the test statistic for which we
reject the null in favor of the
alternative hypothesis
May be located in one side or in both
side
66
Critical region for one-tailed test
Critical regions for a two-tailed test
4) Test selection
When a researcher is ready to test a
specific hypothesis generated from a
theory or to answer a research question
posed, he or she is faced with the task
of choosing an appropriate statistical
procedure.
Factors in choosing a test
74
75
Degrees of Freedom
One sample t-test or paired t-test = N-1
Independent t-test = N-2
Chi-square test =
(# rows - 1) x (# columns – 1)
ANOVA
df between groups = (# levels or groups – 1)
df within groups = (# subjects - # of levels)
Correlations = N-2
7) Making the decision
If calculated value is lower than
tabulated value the null hypothesis is
accepted. That is, there is no difference
between/among groups
If calculated value is higher than
tabulated value the null hypothesis is
rejected. That is, there is significant
difference between/among groups.
Use of some common
Statistical tests
78
Chi-square test
Application
When data are presented in ‘contingency
tables’; i.e. in categories.
Use
χ2 test is used in testing hypothesis.
Use
It is used to determine whether there is a significant
difference between the means of two groups
Whether the sample mean represents the population mean
82
Thank you
83