Documente Academic
Documente Profesional
Documente Cultură
for
Chapter 9
Marketing Research
Text and Cases
by
Rajendra Nargundkar
Slide 1
Slide 2
Methods
1. A oneindependent variable experiment is called oneway ANOVA. ANOVA stands for Analysis of Variance, the
generic name given to a set of techniques for studying
cause-and-effect of one or more factors on a single
dependent variable.
2. If we hypothesise that there is also a Blocking Variable
(to be explained later in the Randomised Block Design) in
addition to one independent variable, we can use a
randomized block design.
Slide 3
Variables
The Analysis of Variance technique is used when the
independent variables are of nominal scale (categorical) and
the dependent variable is metric (continuous).
Design
The design of the experiment is the most critical in
performing any experiment to be analysed through the
technique of ANOVA.
There are four major types of designs, of which three
frequently used types will be illustrated with a worked out
example each.
These four major types are
Completely Randomised Design in a One-Way ANOVA
(Single Factor)
Randomised Block Design (Single Blocking Factor)
Latin Square Design (Two Blocking Factors)
Factorial Design with 2 or more Factors.
We will discuss in detail the first two, and the fourth.
Slide 4
One-Way ANOVA
This particular design is used when there is only one
categorical independent variable, and one dependent (metric)
variable.
Each category of an independent variable is called a level.
The independent variable may be different levels of prices, or
different pack sizes, or different product colours, and the
effect (dependent variable) could be sales, preferences or
attitudes towards the brand.
In the example that follows, we will look at advertising copy
alternatives as the independent variable, and preference rating
for the advertising copy as the dependent variable.
Worked Example Problem:
In this example, we assume that three different versions of
advertising copy have been created by an advertising agency
for a campaign. Let us call these versions of copy ADCOPY
1, 2 and 3. Now, the ad agency wants to test which of these
three versions of the advertising copy is preferred by its target
population, before they launch the campaign.
A sample of 18 respondents is selected from the target
population in the nearby areas of the city. At random, these
18 respondents are assigned to the 3 versions of ad copy.
Each version of ad copy is thus shown to six of the
respondents.
The respondents are asked to rate their liking for the ad copy
shown to them on a scale of 1 to 10. (1 = Not liked at all, 10
= Liked a lot, and other values in between these two). The
ratings given by the 18 respondents are tabulated.
Slide 5
Input Data
Fig 1. shows the input data for the 18 respondents.
Fig. 1.
Sr.
No.
1
2
3
4
5
6
7
8
9
10
Ad
copy
1
1
1
1
1
1
2
2
2
2
rating
6.00
7.00
5.00
8.00
8.00
8.00
4.00
4.00
5.00
7.00
Slide 5 contd...
Fig. 1. Contd
Sr.
No.
11
12
13
14
15
16
17
18
Ad
copy
2
2
3
3
3
3
3
3
rating
7.00
6.00
5.00
5.00
4.00
7.00
8.00
7.00
Slide 6
The input data in fig 1 is input into a statistical
package for performing a One-Way ANOVA,
because we have only 1 categorical factor (Ad copy)
at 3 levels 1, 2, 3 and 1 dependent variable
Rating.
Output
The output of the computerised One-Way ANOVA
is shown in fig. 2.
Fig. 2
Source of
Variation
Sum of
Squares
DF
Mean
Square
Sig.
of F
Main
Effects
ADCOPY
Explained
Residual
Total
7.000
3.500
1.780
.203
7.000
7.000
29.500
36.500
2
2
15
17
3.500
3.500
1.967
2.147
1.780
1.780
.203
.203
Slide 6 contd.
Slide 7
The ANOVA has thus told us what we may not have been
able to gauge if we had simply looked at the mean ratings for
each ad copy by computing these.
For example, the ratings for the ad copy version 1 are
6,7,5,8,8,8 and the mean rating is (6+7+5+8+8+8) / 6, or 42/6
= 7. Similarly, the mean rating of ad copy version 2 is
(4+4+5+7+7+6) / 6, or 33/6 = 5.5. The mean rating for ad
copy version 3 is (5+5+4+7+8+7) / 6, or 36/6 = 6.
At a glance, the three mean ratings appear to be different 7,
5.5 and 6. But the ANOVA tells us that this difference is not
statistically significant at the 95 percent confidence level.
It does this by performing an F-test. The null hypothesis for
this F-test is that there is no significant difference in the mean
ratings for the three ad copy versions. (H0: M1 = M2 = M3
where M1, M2 and M3 are the mean ratings for the three
versions of ad copy). Thus, in this case, we have accepted the
null hypothesis (or failed to reject the null hypothesis), at the
95 percent confidence level.
If the significance of F in the last column of fig. 2 had been
less than 0.05, we would have rejected the null hypothesis. In
that case, we would have concluded that significant
differences exist between mean ratings given to the three ad
copy versions.
Slide 8
1. Randomised Block Design:
Let us continue with the same input data as in fig. 1,
with one more column added to it. This
dataset is
shown in fig. 3.
Fig. 3
sr. adcopy
no.
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
rating
1
1
1
1
1
1
2
2
2
2
2
2
3
3
3
3
3
3
6.00
7.00
5.00
8.00
8.00
8.00
4.00
4.00
5.00
7.00
7.00
6.00
5.00
5.00
4.00
7.00
8.00
7.00
magazine
1
2
3
4
5
6
1
2
3
4
5
6
1
2
3
4
5
6
Slide 8 contd..
We have made a slightly different assumption in this
case. We assume that the three versions of the adcopy
were each used in 6 different magazines. These six
magazines are coded 1, 2, 3, 4, 5, 6 and appear in the
column titled magazine. Out of the people who saw
these ads, 18 randomly chosen respondents are
picked, one from each magazine who saw a particular
version of ad. Thus, we finally have one respondent
who has seen a given version of the ad in a given
magazine. In other words, we have one respondent
for every combination of magazine and adcopy.
Slide 9
Hypothesis
1. The assignment of our sample of 18 in the above manner
assumes that the magazine in which the version of adcopy
appears may have an impact on the ratings. We can test this
hypothesis - in fact, two hypotheses - by doing an ANOVA
with a randomised block design.
2. For this purpose, we use the variable Rating as the
dependent variable, and Adcopy as the factor, and
Magazine as the block.
3. A block is defined as some variable which could affect the
relationship between the independent factor and the
dependent variable under study in an ANOVA. In our
example, the magazine in which the advertisement appears
could influence the Rating given to Adcopy by the
respondents. We are trying to remove the effect of the
magazine used, by "blocking" its effect, or treating the block
separately.
4. If we do not block on a variable, its effect gets included
with the error (residual) term. This may lead to wrong
conclusions about the relationship between the independent
and dependent variables. In that sense, a randomised block
design is more "powerful" than a simple one-way ANOVA, if
the block effect is significantly influencing the relationship.
Slide 10
Output
The computer output for this problem using a randomised
block design is shown in fig. 4.
Fig. 4
Tests of significance for RATING using UNIQUE sums of
squares.
Source of
Variation
Residual
Adcopy
Magazine
(Model)
(Total)
SS
DF MS
Sig
of F
3.67 10 .37
7.00 2 3.50 9.55 .005
25.83 5 5.17 14.09 .000
32.83 7 4.69 12.79 .000
36.50 17 2.15
Slide 11
1. To test if the null hypotheses are rejected or not, we turn to
the last column of fig. 4, which gives the result of an F-test
for any assumed confidence level. We will assume we wanted
to test these hypotheses at the 95 percent confidence level.
2. We know that the significance level of F in the last column
should be less than 0.05 for the null hypothesis to be rejected.
We see that for both the rows labelled ADCOPY and
MAGAZINE, the significance of F is less than .05. It is .005
for ADCOPY and .000 for MAGAZINE. This means that
both the null hypotheses are rejected.
Slide 12
Slide 13
Worked Example
In this example, we assume that we are testing for a toilet
soap brand, the effect of two Factors (independent variables)
Pack Design and Price - on Sales (dependent variable).
We would like to know (1) if each of the Factors
independently affects Sales (called the Main Effects), and (2)
if there is a combined effect of Pack Design and Price (called
the 2 way Interaction Effect) on Sales.
Incidentally, if there are 3 factors in a study, then we could
test for all 2-way interaction effects and the 3-way
interaction effect, in addition to the Main Effects of the
individual factors.
To continue with our example, the experiment is conducted
in a simulated environment on 18 randomly selected
respondents. There are 3 levels of price Rs. 8, Rs. 11 and
Rs. 14, and 3 levels of Pack Design designated by the main
colours used Blue, Red and Green.
The coding of these variables is 1, 2, 3 respectively for Rs.
8, 11 and 14 and 1, 2, 3 for Blue, Red and Green in the case
of Pack Design.
Slide 14
Input Data
Slide 15
Also note from fig.5 that each combination of Price and Pack
Design appears twice in the dataset. For example, Packdesign =
1 and Price = 1 appears in Row 1 and also Row 10. This is
known as a replication in design of experiments. This is similar
to having a higher sample size in a survey.
Depending on the number of Factors and the number of levels
of each Factor, the minimum sample size required for ANOVA
may go up. In such cases, multiple observations or replications
become necessary. In general, replications reduce chances of
random error affecting the results of ANOVA experiments,
similar to the effects of increasing sample size in surveys.
Output:
The output data for our factorial experiment are presented in
fig. 6.
Fig 6
Source of
Variation
Main
Effects
Packdesn
Price
2-Way
Interactions
Packdesn
Price
Explained
Residual
Total
Sum of
Squares
DF
209305.556
12536.111
196769.444
9838.889
2
4
9838.889
Mean Square
Sig of
F
1.635 .248
.641 .646
219144.444 8 27393.056
34512.500 9
3834.722
253656.944 17 14920.997
7.143 .004
Slide 16
Slide 17
We find that the significance of F values are
Pack Design - .248 (Main Effect 1)
Price - .000 (Main Effect 2)
Pack Design by Price - .646 (Interaction Effect)
Therefore, only the Price effect, one of the two main
effects, is significant statistically, at 95 percent
confidence level. This means that hypothesis no. 2 is
rejected.
Hypothesis 1 and 3 cannot be rejected, as the
significance of F values are greater than .05 in both
cases - .248 and .646 respectively).
Thus, we conclude that Price alone has an impact on
Sales. Neither Pack Design alone nor the combination of
Pack Design with Price have any significant impact on
Sales of the toilet soap.
Slide 18
Additional Comments