Article info

Article history:
Received 9 April 2014
Accepted 28 June 2014
Available online 6 July 2014

Keywords:
No trained assessors
New techniques
Quantitative data
Attribute-by-attribute protocol
Design for sample presentation

Abstract

The optimized descriptive profile (ODP) is a new descriptive methodology that proposes an optimized evaluation of samples by untrained assessors. To allow assessors with a low degree of training to evaluate samples quantitatively and consistently, the method presents reference materials ("weak" and "strong") together with the whole set of samples at the time of evaluation. The attribute-by-attribute protocol is recommended in the ODP to allow comparison among the samples and to avoid fatigue, since only one attribute is evaluated at a time. Simultaneous presentation of the samples together with the reference materials is the "key point" of the ODP, and therefore the number of samples may be a limiting factor in this technique. The purpose of the present study was to determine whether the balanced incomplete block design (BIBD), traditionally used in descriptive techniques, may be an alternative for data collection in the ODP. The BIB design does not allow comparison among all samples together, but only among a fraction of them. Therefore, an assessment of the quality of the data obtained through this design is necessary to increase the applicability of the ODP. In this experiment a specific set of grape juice formulations was used, which varied in pulp and sugar contents. Sensory evaluation was conducted with two independent panels, each of which evaluated the samples using the ODP protocol in a specific design: one panel used the balanced complete block design (BCBD) and the other the balanced incomplete block design (BIBD). The data obtained with the two designs were compared in terms of analysis of variance, power of the F-test and regression models. The data obtained by the ODP conducted in the BIBD were very similar to those of the ODP in the BCBD. In the ANOVA, the formulations differed (p < 0.05) in relation to all attributes, with a test power (1 − β) greater than 0.95. The regression models obtained in the different designs showed overlapping confidence intervals (95% CI). Therefore, the BIBD presents itself as an alternative for data collection in the ODP without loss in the quality of the results.

© 2014 Elsevier Ltd. All rights reserved.
⁎ Corresponding author.
E-mail addresses: rita.navarro@ufv.br (R.C.S.N. Silva), vprm@ufv.br (V.P.R. Minim), alexandre.navarro@ufv.br (A.N. Silva), andrea.simiqueli@ufv.br (A.A. Simiqueli), smdlucia@yahoo.com.br (S.M. Della Lucia), lminim@ufv.br (L.A. Minim).
http://dx.doi.org/10.1016/j.foodres.2014.06.042
0963-9969/© 2014 Elsevier Ltd. All rights reserved.
R.C.S.N. Silva et al. / Food Research International 64 (2014) 289–297

1. Introduction

The optimized descriptive profile (ODP) is a descriptive methodology that was recently proposed (Silva et al., 2012) to reduce the time of sensory testing while still providing quantitative information on the sensory attributes of foods. The ability to obtain quantitative data distinguishes the ODP among the fast methodologies published recently, such as flash profile (Dairou & Sieffermann, 2002; Delaure & Sieffermann, 2004), napping (Pagès, 2005), free sorting task (Cartier et al., 2006), check-all-that-apply (Adams, Williams, Lancaster, & Foley, 2007; Ares, Giménez, Barreiro, & Gámbaro, 2010), pivot profile (Thuillier, 2007), ultra-flash (Perrin et al., 2008), ranking descriptive analysis (Richter, Almeida, Prudencio, & Benassi, 2010) and their respective variations.

In the ODP technique, a panel of untrained assessors evaluates the intensity of sensory stimuli on an unstructured scale anchored at its extremities by the terms "weak" and "strong", which are represented by quantitative reference materials. Products are evaluated via the attribute-by-attribute protocol, which was recommended by Ishii, Chang, and O'Mahony (2007) for untrained assessors.

In the attribute-by-attribute protocol, the assessors receive all samples at once. They are asked to compare the samples with each other in relation to a single attribute and to indicate the stimulus intensity on an unstructured scale. In this technique, re-tasting of the products and re-evaluation of the scores may be performed until the judge is sure that the scores represent the correct "space" between intensities (Ishii, Stampanoni, & O'Mahony, 2008).

In the ODP, the attribute-by-attribute protocol is combined with the presence of reference materials at the time of product evaluation. The quantitative references are presented together with the samples so that assessors can compare the samples with the references, which facilitates allocation of the attribute intensities on the unstructured scale (Silva et al., 2012). This structuring of the test (all samples submitted together, along with the reference materials) is the "key point" of the ODP, because it allows assessors with a low degree of training to assess the intensity of the products consistently, as verified in the studies of Silva et al. (2012, 2013a).

The evaluation of a single attribute per session is recommended in the ODP to avoid sensory fatigue, since the judge must test all samples and reference materials ("weak" and "strong") before allocating them on the stimulus intensity scale. Re-tasting may be necessary in some cases, possibly increasing sensory fatigue when the number of samples is large. Silva et al. (2012) reported that the number of samples is a possible limitation of the ODP methodology, since the simultaneous presentation of a high number of samples and the references may cause sensory fatigue, so that the scores no longer represent the actual intensity of the sensory stimulus.

Therefore, the objective of the present study is to assess whether the balanced incomplete block design may be an alternative design for evaluating a larger number of samples in this specific methodology, the optimized descriptive profile. To this end, the same number of samples was evaluated using two designs (complete and incomplete) to verify agreement between the data.

2. Designs used for presentation of samples in conventional descriptive sensory analysis

2.1. Balanced complete block design (BCBD)

The balanced complete block design is the most widely used design in conventional descriptive tests and is recommended when the number of samples is small and sensory fatigue is not a concern (Meilgaard, Civille, & Carr, 2006). In this design, the assessors are the "blocks", and each assesses all samples in r repetitions. Thus, the number of assessors corresponds to the number of blocks (b), and the assessor effect is partitioned from the total variance in the analysis of variance (ANOVA). Samples are presented to the assessors at random, and the repeated evaluations by each assessor (intra-block repetitions) allow estimation of the interaction effect between assessors and samples, adopting the model described in Eq. (1) (Stone & Sidel, 2012).

Model = Samples + Assessors + Interaction (samples × assessors) + Error    (1)

The presentation of samples to the assessors in the BCBD may follow two different protocols: (i) simultaneous protocol: all samples are presented simultaneously to the assessors in the same session, making it possible to compare them in relation to the complete list of attributes; (ii) monadic protocol: samples are presented monadically, i.e., individually. In the monadic protocol, comparison is not allowed, and the judge evaluates a sample in relation to all attributes listed before another sample is presented. In both protocols, the order of sample presentation, including the repetitions, is randomized and balanced.

In the studies conducted by Silva et al. (2012, 2013a) with the ODP methodology (untrained assessors), the samples were evaluated using the balanced complete block design and the attribute-by-attribute protocol, which is similar to the simultaneous protocol except that only one attribute is evaluated per session. Thus, the number of sessions required corresponds to the number of attributes multiplied by the number of repetitions of the evaluations.

2.2. Balanced incomplete block design (BIBD)

Balanced incomplete block designs are recommended for sample presentation in conventional descriptive methods when the total number of samples under study is greater than the number that can be evaluated in a session before sensory fatigue sets in (Best, Rayner, & Allingham, 2011; Gacula, Singh, Bi, & Altan, 2008). In the BIBD, in a single session the assessors evaluate only a fraction (k) of the total number of samples (t), where k < t, and these k samples are presented simultaneously (Stone & Sidel, 2012). In this design all samples are evaluated the same number of times (denoted by r), and the number of times that a pair of samples is evaluated together is the same for all pairs and is denoted by λ. The fact that r and λ are constant for all samples in the BIBD ensures that each treatment average is estimated with the same accuracy and that all paired comparisons between two treatment averages are equally sensitive (Montgomery, 2001). The number of blocks required to complete one repetition of the basic design is denoted by b, and the number of repetitions of the basic design is denoted by p. A list of BIB designs is presented by Cochran and Cox (1981) and is useful for selecting a specific design for the study.

According to Meilgaard et al. (2006), the BIB design may be administered in two ways, depending on the number of blocks: (i) a high number of blocks: each judge evaluates only one block of samples, and therefore pb assessors are needed to complete the basic design; (ii) a small number of blocks: each judge returns several times until completing all repetitions of the basic design, so that each judge evaluates all b sample blocks and the number of repetitions (p) is equal to the number of assessors.

The way the analysis of variance (ANOVA) is applied to data from the BIBD depends on how the design is administered. If each assessor evaluates only one block of the basic design, as in case (i), the assessors' effect is confounded ("mixed") with the block effect. In case (ii), in which the assessors evaluate all blocks of the basic design, the assessors' effect can be partitioned from the total variability, and the ANOVA may be adopted as described in Eq. (2) (Meilgaard et al., 2006; Stone & Sidel, 2012).

Model = Samples (adjusted for blocks) + Assessors + Blocks (within assessors) + Interaction (samples × assessors) + Error    (2)

The ODP methodology has not yet been evaluated under the BIBD, which raises the question of whether the technique provides accurate and precise results when data are collected via this design. In the ODP, the assessor compares the samples with each other and with the references before allocating the stimulus intensities on the unstructured scale. In the BIB design it is not possible to compare all samples, but only a fraction of them. Therefore, assessing the feasibility of this design for data collection in the ODP is necessary to increase the applicability of the technique.

3. Materials and methods

3.1. Samples

Five grape juice formulations were used for the descriptive analyses. This small number of samples was used purposely to allow comparison of the BCB and BIB designs using the same assessment protocol (ODP) without causing fatigue.
After this procedure, the assessors were divided into two homogeneous panels for product evaluation. One group of seventeen panelists evaluated the juices via the ODP methodology according to the BIB design, and another seventeen assessors used the ODP according to the BCB design, in accordance with the recommended ideal number of assessors for the ODP technique (Silva, Minim, Silva, Peternelli, & Minim, 2014). The ODP/BCBD panel comprised eight women and nine men, aged from twenty-one to thirty-two years. The other panel (ODP/BIBD) comprised ten women and seven men, aged between twenty and thirty years.

[Table: layout of the BIB design. Rows are blocks 1 to 10 and columns are formulations A to E; each block contains three formulations, marked x.]
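
As a hedged illustration of the design parameters defined in Section 2.2, the sketch below builds the unreduced BIB design for five formulations in blocks of three, i.e., every possible 3-subset of A to E. This construction is an assumption, although it is consistent with the ten blocks of the design layout and with the block labels listed in Table 7; the function name `bibd_parameters` is hypothetical. The code counts b, r and λ directly and checks the standard identities bk = tr and λ(t − 1) = r(k − 1).

```python
from collections import Counter
from itertools import combinations


def bibd_parameters(treatments, k):
    """Build the unreduced BIBD (all k-subsets) and count its parameters."""
    blocks = list(combinations(treatments, k))
    t = len(treatments)
    b = len(blocks)
    # r: number of blocks in which each treatment appears
    reps = Counter(sample for block in blocks for sample in block)
    # lambda: number of blocks in which each pair of treatments appears together
    pairs = Counter(pair for block in blocks for pair in combinations(block, 2))
    r_values, lam_values = set(reps.values()), set(pairs.values())
    if len(r_values) != 1 or len(lam_values) != 1:
        raise ValueError("design is not balanced")
    r, lam = r_values.pop(), lam_values.pop()
    # Standard BIBD identities
    assert b * k == t * r
    assert lam * (t - 1) == r * (k - 1)
    return blocks, {"t": t, "k": k, "b": b, "r": r, "lambda": lam}


# Five formulations (A to E) presented three at a time
blocks, params = bibd_parameters("ABCDE", 3)
print(params)
```

With t = 5 and k = 3 this gives b = 10, r = 6 and λ = 3: each formulation is evaluated six times per assessor and each pair of formulations meets in three blocks.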
coded with three random digits. The juices were presented in a random order within each block.

In the evaluation procedure, the assessor was asked to compare the samples among themselves and with the reference materials, and then to allocate the stimulus intensity on the intensity scale. The evaluation form was organized by attribute and contained a 9 cm unstructured (interval) scale associated with each formulation.

3.3. Statistical analysis

Descriptive data were analyzed by analysis of variance (ANOVA) and multiple regression analysis, which were performed using the R software (R Development Core Team, 2005).

In the ANOVA, the models described in Eqs. (1) and (2) were considered, according to the design. In the case of significant interaction, the F-test for samples was calculated using MSinteraction as the denominator, as recommended by Stone and Sidel (2012). The significance level (p-value) of the variation source of interest (sample), obtained by ANOVA (probability of Fsample), was used as a criterion for evaluating the discrimination power of the samples in the different designs. A power analysis of the F-test was also performed to better estimate the discrimination power of samples for each design. The power of the test was calculated as 1 − P(type II error), fixing the significance level (type I error) at 5%, as described in O'Brien and Lohr (1984).

Multiple regression analysis was used to analyze the influence of the quantitative factors (pulp and sugar concentrations, the independent variables) on the dependent variables (sensory attributes of the juice). The regression models were fitted as recommended by Montgomery (2001), assessing the lack of fit of the model and the significance of the parameters at 5% probability. The coefficient of determination was also obtained.

... the PCA were confirmed by the cluster analysis, using the k-means clustering procedure (Johnson & Wichern, 2007).

4. Results

4.1. Comparison of the designs (BIB and BCB)

The mean scores of the sensory attributes obtained with the two designs differed significantly (p < 0.05) by the F-test (Table 3) in only four cases. There was a significant difference between the mean scores obtained in the sensory characterization of juice-A for the attributes grape aroma, sourness and astringent. A difference between the two designs was also observed for the grape flavor attribute in juice-E. For the other attributes and formulations both designs provided similar mean scores (p > 0.05).

The ODP conducted in the balanced incomplete block design therefore presented mean scores similar to those of the same methodology conducted in the balanced complete block design. In the four exceptions, the average scores obtained in the incomplete design (BIBD) were higher than the scores assigned by the panel that evaluated the same juice in the complete design (BCBD).

4.1.1. ANOVA: BIB and BCB designs

A summary of the ANOVA for the different designs is given in Table 4. In the BCB design, only four sources of variation were partitioned from the total variation, and the "block effect" is represented by the variation among assessors. In the BIBD there is an extra source of variation, which estimates the variability in evaluation

Table 3
ANOVA for comparison of designs BIB and BCB.

Attributes    BIB design    BCB design
Table 4
Summary of the ANOVA obtained for each design.

Design  Source of variation            DF   Violet color  Grape aroma  Sweetness  Sourness  Grape flavor  Astringent
BIB     Sample (adjusted for blocks)     4      568.5455     327.7806   371.8679  292.9823      416.4389    314.5753
        Assessors                       16        9.7541      12.6985    10.5825   13.3764       11.7441     17.1937
        Blocks (within assessors)      153        3.1057       3.2091     2.6397    3.4986        2.9241      3.5706
        Interaction                     64        3.2133       6.0704     4.0061    6.5063        5.8122      5.0995
        Error                          272        0.4811       2.0811     1.3557    2.0248        1.0475      2.2097
BCB     Sample                           4      299.1208     134.2522   168.5611  151.1814      191.4267    124.3403
        Assessors (blocks)              16        5.3929       4.3889     5.3590   14.0418        5.6429     15.9219
        Interaction                     64        1.9738       4.2534     4.0730    3.9950        4.0688      3.6597
        Error                          170        0.7912       2.9840     1.6660    2.6401        2.0709      2.8607
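
The degrees of freedom in Table 4 can be cross-checked against the models in Eqs. (1) and (2). The sketch below is a minimal bookkeeping example, not the authors' R analysis: it assumes seventeen assessors per panel, three replications in the complete design, and ten blocks of three samples per assessor in the incomplete design, and it takes the degrees of freedom for blocks within assessors as assessors × (b − 1). It also shows the F ratio for samples with the interaction as denominator, assuming the tabulated values are mean squares. All function names are hypothetical.

```python
def df_bcbd(t, assessors, reps):
    """Degrees of freedom for Eq. (1): complete blocks, assessors as blocks."""
    n_obs = t * assessors * reps
    df = {
        "sample": t - 1,
        "assessors": assessors - 1,
        "interaction": (t - 1) * (assessors - 1),
    }
    # Residual df is what remains of the total (n_obs - 1)
    df["error"] = (n_obs - 1) - sum(df.values())
    return df


def df_bibd(t, k, b, assessors):
    """Degrees of freedom for Eq. (2): each assessor evaluates all b blocks of size k."""
    n_obs = assessors * b * k
    df = {
        "sample": t - 1,
        "assessors": assessors - 1,
        "blocks_within_assessors": assessors * (b - 1),  # assumed partition
        "interaction": (t - 1) * (assessors - 1),
    }
    df["error"] = (n_obs - 1) - sum(df.values())
    return df


def f_sample(ms_sample, ms_interaction):
    """F for samples using the interaction mean square as denominator."""
    return ms_sample / ms_interaction


bcb_df = df_bcbd(t=5, assessors=17, reps=3)
bib_df = df_bibd(t=5, k=3, b=10, assessors=17)
# First attribute column of Table 4, BIB design (values assumed to be mean squares)
f_bib_first_column = f_sample(568.5455, 3.2133)
print(bcb_df, bib_df, round(f_bib_first_column, 1))
```

Under these assumptions the residual degrees of freedom come out as 170 for the complete design and 272 for the incomplete design, matching the Error rows of Table 4.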
of each judge due to repetitions in the different sample blocks. The number of degrees of freedom for the sources of variation was similar in both designs. Only the residual variation had a higher value in the incomplete design (DFerror = 272) than in the complete design (DFerror = 170). This difference between the residual degrees of freedom occurs because in the incomplete design the panel required six repetitions to complete the assessment of the entire set of sample blocks, while three replications of the evaluations were performed in the complete design, as recommended in the literature for descriptive methods (Stone & Sidel, 2012).

Hartley's test was used to compare the magnitude of the variances common to both designs (sample, assessors, interaction and error), as presented in Table 5. The variance of the sample effect was similar between designs for all sensory attributes, indicating that the formulations were similarly discriminated in both designs. On the other hand, there was a significant difference (p < 0.05) in the assessors' effect for grape aroma, for which the estimated variance was higher in the incomplete design (Table 4). The variability of the interaction effect was significantly higher in the incomplete design for violet color and sourness. The residual variance was significantly lower (p < 0.05) in the incomplete design for all attributes except sweetness, for which the variances were similar (p > 0.05).

4.1.2. Discrimination power

The discrimination power of the ODP technique in both designs was evaluated using the significance level (p-value) at which the samples were discriminated by ANOVA (probability of Fsample). The power of the F-test, 1 − P(type II error), was also used as a criterion for discrimination of samples.

In both designs, a significant effect was found for the interaction between assessors and samples (p < 0.05) in all attributes. In this case, Fsample was calculated using MSinteraction as the denominator (Stone & Sidel, 2012). Samples were discriminated at the same level of significance (p < 0.0001) in both designs for all sensory attributes. The power of the F-test was greater than 0.95, i.e., the probability of a type II error was lower than 0.05.

There was no difference in the discrimination of samples between the two designs of sample presentation, showing that the discriminatory power of the ODP technique was not altered by the design.

4.1.3. Influence of the pulp and sugar concentrations on the sensory profile: regression models

The pulp and sugar contents influenced the sensory characteristics of the juices, and their effects were modeled using the linear model (Table 6). The significance and estimates of the regression parameters were similar in both designs (ODP/BIBD and ODP/BCBD), as shown in Fig. 1.

The pulp concentration was significant (p < 0.05) for all sensory attributes, whereas the sugar content only influenced the attributes "sweetness", "grape aroma" and "grape flavor". All adjusted regression models showed high coefficients of determination (greater than 0.85). The highest coefficients of determination were observed for the data obtained from the ODP using the BCBD, suggesting that the data can be better explained with the complete design because all samples are presented at the same time.

4.2. BIBD: comparison among the sub-group samples

In the PCA (Fig. 2), the evaluations of the juice samples were very similar when the samples were assessed in the different blocks. The groupings suggested by the PCA were confirmed by the cluster analysis using the k-means clustering procedure with 1,000,000 iterations, in which the repetitions of each sample were grouped together. This indicates that the average score assigned to a sample in a first evaluation was not altered when the same sample was presented for the second, third or more times. Regardless of the number of times that the sample was presented, the intensity assigned to it was similar in all sessions.

Although the spatial distribution of the samples in the PCA was similar across the repetitions of the evaluations, when the average scores of the repetitions were compared by the Tukey test (Table 7), some significant differences (p < 0.05) were detected between the repetitions due to contrasts among sensory stimuli.

Of the thirty groups formed (blocking), only seven showed a significant difference (p < 0.05) between the repetitions performed in the different groupings. Sensory perception for juice-A
Table 5
Comparison between estimates of variances by Hartley's test (α = 0.05).

Source of     Violet color     Grape aroma      Sweetness        Sourness         Grape flavor     Astringent
variation     F      p         F      p         F      p         F      p         F      p         F      p
Sample        1.90   0.2746    2.44   0.2042    2.21   0.2312    1.94   0.2687    2.18   0.2351    2.53   0.1953
Assessors     1.81   0.1233    2.89   0.0204    1.97   0.0922    1.05   0.4619    2.08   0.0767    1.08   0.4399
Interaction   1.63   0.0267    1.43   0.0787    1.02   0.4737    1.63   0.0266    1.43   0.0782    1.39   0.0936
Error         1.64   0.0001    1.43   0.0041    1.23   0.0655    1.30   0.0259    1.98   0.0001    1.29   0.0292
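
The F values in Table 5 appear to be the ratio of the larger to the smaller variance estimate for each source common to the two designs. The sketch below illustrates this under the assumption that the entries of Table 4 are mean squares; it uses the first attribute column only and does not reproduce the p-values, which would require the F distribution. The helper `variance_ratio` is a hypothetical name.

```python
def variance_ratio(ms_a, ms_b):
    """Ratio of the larger to the smaller variance estimate (two-group F-max)."""
    return max(ms_a, ms_b) / min(ms_a, ms_b)


# First attribute column of Table 4 for the sources common to both designs
bib = {"sample": 568.5455, "assessors": 9.7541, "interaction": 3.2133, "error": 0.4811}
bcb = {"sample": 299.1208, "assessors": 5.3929, "interaction": 1.9738, "error": 0.7912}

ratios = {source: round(variance_ratio(bib[source], bcb[source]), 2) for source in bib}
print(ratios)
```

The rounded ratios (1.90, 1.81, 1.63 and 1.64) agree with the first F column of Table 5.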
Table 6
Multiple regression models adjusted for each attribute on the two designs.
showed a difference (p < 0.05) in the mean scores attributed in the different sessions only in relation to "violet color". The score assigned to the sample in block ABC differed from that assigned when it was compared with samples B and D. Substituting sample D for formulation C in the block altered the perception of the visual stimulus of juice-A. The contrast between A and D, formulations with opposing sensory profiles (Table 3), suppressed the perception of the stimulus, which was then less intense (Lim, 2011; Silva et al., 2013b).

The intensity of juice-B was altered for the attribute sourness. When sample B was tested together with juices C and D, the average perception of acidity was more intense than when the same juice was evaluated with formulations A and D. The change from sample C, with lower intensity of sourness, to juice-A, with the highest stimulus intensity, caused a decrease in sensory perception.

Juices C and D were the formulations most affected by the blocking, each differing in two attributes: violet color and sourness for juice-C, and sourness and astringent feel for juice-D. Similar behavior was observed for juices C and D when tested together with samples E and B. When they were compared with juice-E (more dilute), the perception of acid taste was more intense than when they were tested in combination with the more concentrated sample (juice-B).

Juice-E, to which no sugar was added, had its perceived sweetness altered. When this juice was tested together with the formulations with low sucrose content (juice-A and juice-D), its sweetness was perceived more intensely. However, when the same juice was evaluated together with the juices with higher sucrose concentrations, its sweetness was perceived as weaker.

5. Discussion

The optimized descriptive profile technique conducted according to the BIB design produced a sensory profile of the grape juices very similar to the profile obtained when the same method was conducted in the complete design (BCBD). The average scores obtained for the five formulations of grape juice were very similar when compared by ANOVA. There was a significant difference (p < 0.05) in only four cases (three attributes for juice-A and one attribute for juice-E). In the BIB design, the mean score was higher than in the complete design, showing that the assessors tended to assign values closer to the anchor when a smaller number of samples was presented to them. On the other hand, the magnitude of the difference between the scores obtained by the two designs was small. The smallest significant difference was 0.3 cm, for grape flavor in juice-E, and the largest significant difference was 0.9 cm, for sourness in juice-A.
Fig. 1. Representation of the confidence intervals (CI95%) of the linear model parameters for the two designs.
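
The comparison in Fig. 1 rests on whether the 95% confidence intervals of a regression parameter overlap between the two designs. A minimal sketch of that check follows. The estimates and standard errors are hypothetical placeholders, not values from this study, and a normal approximation is used for the critical value instead of the t distribution that a regression fit would normally use.

```python
from statistics import NormalDist


def confidence_interval(estimate, std_error, level=0.95):
    """Approximate CI using a normal critical value (assumption: large sample)."""
    z = NormalDist().inv_cdf(0.5 + level / 2)
    return (estimate - z * std_error, estimate + z * std_error)


def intervals_overlap(first, second):
    """True when two closed intervals share at least one point."""
    return first[0] <= second[1] and second[0] <= first[1]


# Hypothetical slope estimates for one attribute in each design
ci_bibd = confidence_interval(estimate=0.050, std_error=0.004)
ci_bcbd = confidence_interval(estimate=0.046, std_error=0.005)
print(intervals_overlap(ci_bibd, ci_bcbd))
```

Overlapping intervals are read here only as the absence of evidence for a difference between designs; they are not a formal test of equality of the parameters.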
The BIBD is an alternative for data collection in the ODP technique that does not alter the mean scores assigned to the formulations. This shows that the presence of the reference materials ("weak" and "strong") at the time of evaluating the samples provides essential support for the allocation of stimulus intensity on the evaluation scale. Silva et al. (2013a) also verified the importance of reference materials to represent the anchors of the unstructured scale. The authors observed that in the CP (trained panel and no references at the time of assessment) the rating scale was not used homogeneously, and the assessors tended to "anchor" the samples, more often using the extremes of the intensity scale. In contrast, when the samples were evaluated by the ODP technique (with anchor samples present), the unstructured scale was used more homogeneously.

The ODP technique administered in the incomplete design showed smaller variability of the experimental error than the same method conducted in the complete design for sample presentation. On the other hand, the interaction effect between samples and assessors was larger in the incomplete design. These results suggest that when the technique was conducted following the BIB design, the evaluation procedure became more difficult for the assessors, who were more influenced by the accompanying samples when allocating intensities for the violet color and sourness attributes. At the same time, random variation was also reduced when the evaluations were conducted in the BIBD, showing that the variability between repetitions of the evaluations was lower than when the whole set of samples was presented simultaneously (BCBD). The presentation of only a fraction of the samples may complicate the analysis for untrained assessors, but the presence of reference materials (anchors) can help in score attribution and orient the assessors, enabling a more consistent assessment. The results suggest that even though the assessors did not have access to all the samples to compare them effectively, the smaller number of samples evaluated at a time in the BIB design can allow greater attention to be given to each unit, generating more accurate answers (less variability between repetitions of the evaluations, i.e., a lower MSerror).

In previous studies conducted with the ODP method (Silva et al., 2012, 2013a) the complete design (BCBD) was used. The authors found that the technique gave results very similar to the profile constructed by a trained panel and also reduced the effect of the interaction between assessors and samples. In the present study, it was verified that when the samples are presented in an incomplete design, the interaction effect can appear more acutely. In contrast, in the BIB design the variance of experimental error was

Table 7
Tukey's test (α = 0.05) for the average scores obtained in the different blocks of the BIB design.

               Juice-A        Juice-B        Juice-C        Juice-D        Juice-E
               Block  Mean    Block  Mean    Block  Mean    Block  Mean    Block  Mean
Violet color
               ABC    7.3a    BAC    7.0a    CAB    1.3b    DAE    1.4a    EAB    3.1a
               ABE    6.3ab   BAE    6.3a    CBD    2.7a    DBC    1.9a    EAD    4.0a
               ADE    7.1ab   BCD    6.1a    CDE    2.0ab   DCE    2.3a    ECD    4.6a
               ABD    6.0b    BAD    6.3a    CAD    2.0ab   DAB    1.5a    EAC    4.3a
               ACD    6.7ab   BCE    6.5a    CAE    2.0ab   DAC    1.8a    EBC    4.1a
               ACE    6.5ab   BDE    6.3a    CBE    1.3b    DBE    1.4a    EBD    4.1a
Grape aroma
               ABC    6.3a    BAC    6.1a    CAB    2.7a    DAE    2.6a    EAB    4.4a
               ABE    5.5a    BAE    6.3a    CBD    3.9a    DBC    1.9a    EAD    4.7a
               ADE    6.1a    BCD    6.0a    CDE    2.3a    DCE    2.5a    ECD    5.4a
               ABD    5.1a    BAD    6.5a    CAD    3.3a    DAB    1.7a    EAC    4.7a
               ACD    6.4a    BCE    6.4a    CAE    3.0a    DAC    2.5a    EBC    3.6a
               ACE    5.4a    BDE    6.8a    CBE    2.5a    DBE    1.6a    EBD    3.6a
Sweetness
               ABC    2.9a    BAC    6.6a    CAB    3.9a    DAE    1.4a    EAB    3.5ab
               ABE    2.4a    BAE    6.9a    CBD    4.2a    DBC    1.7a    EAD    4.6a
               ADE    3.8a    BCD    7.0a    CDE    4.6a    DCE    1.1a    ECD    3.9ab
               ABD    3.0a    BAD    7.2a    CAD    4.6a    DAB    1.3a    EAC    3.2ab
               ACD    3.1a    BCE    6.2a    CAE    4.2a    DAC    1.5a    EBC    2.8b
               ACE    2.8a    BDE    5.9a    CBE    2.8a    DBE    1.2a    EBD    3.2ab
Sourness
               ABC    6.4a    BAC    4.8ab   CAB    2.7ab   DAE    2.0ab   EAB    2.7a
               ABE    6.0a    BAE    6.0ab   CBD    2.7ab   DBC    2.5ab   EAD    4.0a
               ADE    6.3a    BCD    6.3a    CDE    3.3a    DCE    3.5a    ECD    4.0a
               ABD    5.5a    BAD    4.1b    CAD    2.2ab   DAB    2.4ab   EAC    3.3a
               ACD    6.2a    BCE    4.8ab   CAE    2.2ab   DAC    2.6ab   EBC    2.7a
               ACE    6.5a    BDE    5.0ab   CBE    1.4b    DBE    1.0b    EBD    3.3a
Grape flavor
               ABC    5.8a    BAC    6.8a    CAB    3.3a    DAE    1.6a    EAB    4.1a
               ABE    5.5a    BAE    6.4a    CBD    4.0a    DBC    1.8a    EAD    4.2a
               ADE    6.6a    BCD    6.7a    CDE    3.1a    DCE    2.0a    ECD    5.1a
               ABD    5.2a    BAD    6.9a    CAD    3.4a    DAB    1.5a    EAC    4.6a
               ACD    6.3a    BCE    6.8a    CAE    3.0a    DAC    1.9a    EBC    3.8a
               ACE    5.8a    BDE    6.5a    CBE    2.5a    DBE    1.1a    EBD    3.9a
Astringent
               ABC    6.3a    BAC    5.2a    CAB    3.3a    DAE    2.0ab   EAB    3.3a
               ABE    6.0a    BAE    6.0a    CBD    3.5a    DBC    2.6ab   EAD    5.0a
               ADE    6.6a    BCD    6.3a    CDE    2.6a    DCE    3.8a    ECD    4.2a
               ABD    6.1a    BAD    5.3a    CAD    2.3a    DAB    2.6ab   EAC    3.3a
               ACD    6.1a    BCE    5.9a    CAE    2.3a    DAC    2.8ab   EBC    3.7a
               ACE    6.9a    BDE    5.7a    CBE    1.6a    DBE    1.3b    EBD    3.2a

Means followed by the same letter within a column, for each attribute, are not significantly different at the 5% probability level.
Table 8
Comparison between the designs in relation to the number of steps, time required and quality of data.
Criteria Design
BIBD BCBD
mathematical model was obtained (p > 0.05) for the two designs utilized.

It is important to note that the BIBD required sixty sessions to complete the evaluation of the samples, as well as a greater number of repetitions of the evaluations to obtain all paired comparisons. The time required to complete the evaluations may have contributed to a “calibration” of the panel's sensory memory, constituting “panel training”. However, when the mean scores assigned to the samples in the different evaluation repetitions were compared, i.e., along the execution time of the experiment, the repetitions were similar (Fig. 2). The difference observed between repetitions in Tukey's means test was due to the contrast effect between stimuli (Lim, 2011; Silva et al., 2013b), which occurred when a given sample was tested together with other formulation(s) of higher stimulus intensity: sensory perception is suppressed and a lower sensory score is assigned. Conversely, when the same formulation was tested against formulations closer in intensity to the sample under evaluation, the stimulus was more noticeable and a higher score was assigned. In the BCBD this effect is minimized, since all stimuli are presented simultaneously.

Table 8 shows a comparison between the designs with respect to the number of sessions and the time required, as well as the quality of the responses obtained by the method when conducted in both designs. The ODP conducted under the BCBD permitted a 37% reduction in the number of sessions in relation to the BIBD, representing a 42% reduction in the time required. The ODP conducted in the BIBD therefore took longer to complete the evaluations and was less economical than the same method conducted in the complete design for sample presentation. On the other hand, the ODP conducted under the BIB design enabled the evaluation of the same number of samples without altering the sensory profile of the product, the discrimination of the samples, or the functional relationship between the independent and dependent variables. Thus, the number of samples is not a limiting factor for the ODP, since the evaluation can be done with an incomplete design.

On the other hand, when the incomplete block design is used, the reduction in time offered by the technique is smaller. The greater the number of samples to be evaluated, the greater the number of blocks required to make comparisons between the treatments and, therefore, the more time required to complete the evaluation. However, in the conventional technique the training of the panel for a large number of samples would also be more intense, and the number of sessions for the evaluation of all samples and repetitions would also increase, since it is not recommended to evaluate more than six samples per session in the conventional technique (Stone & Sidel, 2012).

6. Conclusion

The simultaneous presentation of the whole set of samples in the evaluation session is a requirement of the optimized descriptive profile technique. This is necessary so that untrained assessors can evaluate the samples consistently and give quantitative responses. However, presenting the whole set of samples at the same time, together with the reference materials ("weak" and "strong"), raised a question regarding the evaluation of a larger number of samples: could the number of samples be a limiting factor in the application of the ODP? The answer is no. This study found that evaluating the same number of samples under the conventional design (BCBD) and under the incomplete design (BIBD) for sample presentation provided similar results, showed a high discrimination power among the samples and allowed the functional relationship between the independent and dependent variables to be obtained in both designs. The balanced incomplete block design, tested in this study to evaluate juices using the ODP, gave results very similar to those obtained when the same technique was administered in the complete design. Testing this experimental condition (ODP/BIBD) in other food matrices, such as more complex products (with a greater number of attribute descriptors), may be necessary.

The BIBD is presented as an alternative for the evaluation of a larger number of samples, showing that the number of samples is not a limiting factor for the ODP. For this number of samples there was no loss of information in the BIB design compared to the BCB design. Thus, the use of the BIBD in the ODP protocol may be extrapolated to a larger number of samples.

Acknowledgment

The authors would like to acknowledge the CNPq (Centro Nacional de Desenvolvimento Científico e Tecnológico) and Fapemig (Fundação de Amparo à Pesquisa de Minas Gerais) for their financial support.

References

Adams, J., Williams, A., Lancaster, B., & Foley, M. (2007). Advantages and uses of check-all-that-apply response compared to traditional scaling of attributes for salty snacks. 7th Pangborn Sensory Science Symposium. Minneapolis, USA, 12–16 August 2007.
Ares, G., Giménez, A., Barreiro, C., & Gámbaro, A. (2010). Use of an open-ended question to identify drivers of liking of milk desserts: Comparison with preference mapping techniques. Food Quality and Preference, 21, 286–294.
Best, D. J., Rayner, J. C. W., & Allingham, D. (2011). A statistical test for ranking data from partially balanced incomplete block designs. Journal of Sensory Studies, 26(1), 81–84.
Cartier, R., Ritz, A., Lecomte, A., Poblete, F., Krystlik, J., Belin, E., et al. (2006). Sorting procedure as an alternative to quantitative descriptive analysis to obtain a product sensory map. Food Quality and Preference, 17, 562–571.
Cochran, W., & Cox, G. M. (1981). Diseños experimentales (7th ed.). México: Editorial Trillas.
Dairou, V., & Sieffermann, J. M. (2002). A comparison of 14 jams characterized by conventional profile and a quick original method, the flash profile. Journal of Food Science, 67, 826–834.
Damásio, M. H., & Costell, E. (1991). Análisis sensorial descriptivo: Generación de descriptores y selección de catadores. Revista Agroquimica de Technologia de Alimentos, 31, 165–178.
Delarue, J., & Sieffermann, J. M. (2004). Sensory mapping using flash profile. Comparison with a conventional descriptive method for the evaluation of the flavour of fruit dairy products. Food Quality and Preference, 15, 383–392.
Gacula, M. C., Jr., Singh, J., Bi, J., & Altan, S. (2008). Statistical methods in food and consumer research (2nd ed.). Academic Press.
Ishii, R., Chang, H., & O’Mahony, M. (2007). A comparison of serial monadic and attribute-by-attribute protocols for simple descriptive analysis with untrained judges. Food Quality and Preference, 18, 440–449.
Ishii, R., Stampanoni, C., & O’Mahony, M. (2008). A comparison of serial monadic and attribute-by-attribute descriptive analysis protocols for trained judges. Food Quality and Preference, 19, 277–285.
Johnson, R. A., & Wichern, D. W. (2007). Applied multivariate statistical analysis (6th ed.). London: Pearson Education, Inc., 773.
Lim, J. (2011). Hedonic scaling: A review of methods and theory. Food Quality and Preference, 22, 733–747.
Meilgaard, M. C., Civille, G. V., & Carr, B. T. (2006). Sensory evaluation techniques (4th ed.). Boca Raton: CRC Press.
Montgomery, D. C. (2001). Design and analysis of experiments (5th ed.). New York: John Wiley and Sons, 699.
O’Brien, R., & Lohr, V. (1984). Power analysis for linear models: The time has come. Proceedings of the ninth annual SAS User’s Group international conference (pp. 840–846).
Pagès, J. (2005). Collection and analysis of perceived product inter-distances using multiple factor analysis: Application to the study of 10 white wines from the Loire Valley. Food Quality and Preference, 16, 642–649.
Perrin, L., Symoneaux, R., Maître, I., Asselin, C., Jourjon, F., & Pagès, J. (2008). Comparison of three sensory methods for use with the napping procedure: Case of ten wines from Loire valley. Food Quality and Preference, 19, 1–11.
R Development Core Team (2005). R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. ISBN 3-900051-07-0 (http://www.R-project.org).
Richter, V. B., Almeida, T. C. A., Prudencio, S. H., & Benassi, M. T. (2010). Proposing a ranking descriptive sensory method. Food Quality and Preference, 21, 611–620.
Schenker, N., & Gentleman, J. F. (2001). On judging the significance of differences by examining overlap between confidence intervals. The American Statistician, 55, 182–186.
Silva, A. N., Silva, R. C. S. N., Ferreira, M. A. M., Minim, V. P. R., Costa, T. M. T., & Perez, R. (2013a). Performance of hedonic scales in sensory acceptability of strawberry yogurt. Food Quality and Preference, 30, 9–21.
Silva, R. C. S. N., Minim, V. P. R., Carneiro, J. D. S., Nascimento, M., Della Lucia, S. M., & Minim, L. A. (2013b). Quantitative sensory description using the optimized descriptive profile: Comparison with conventional and alternative methods for evaluation of chocolate. Food Quality and Preference, 30, 169–179.
Silva, R. C. S. N., Minim, V. P. R., Silva, A. N., Peternelli, L. A., & Minim, L. A. (2014). Optimized descriptive profile: How many judges are necessary? Food Quality and Preference, 36, 3–11.
Silva, R. C. S. N., Minim, V. P. R., Simiqueli, A. A., Moraes, L. E. S., Gomide, A. I., & Minim, L. A. (2012). Optimized descriptive profile: A rapid methodology for sensory description. Food Quality and Preference, 24, 190–200.
Stone, H., & Sidel, J. L. (2012). Sensory evaluation practices (4th ed.). New York: Academic.
Thuillier, B. (2007). Rôle du CO2 dans l’Appréciation Organoleptique des Champagnes – Expérimentation et Apports Méthodologiques. Reims, France: Thèse de l’URCA.
Van Belle, G. (2002). Statistical rules of thumb. New York, NY: John Wiley and Sons.
Villanueva, N. D. M., Petenate, A. J., & Da Silva, M. A. A. P. (2005). Performance of the hybrid hedonic scale as compared to the traditional hedonic, self-adjusting and ranking scales. Food Quality and Preference, 16, 691–703.