Documente Academic
Documente Profesional
Documente Cultură
Hui Li1 , Jianfei Cai1 , Thi Nhat Anh Nguyen2 , Jianmin Zheng1
1
ABSTRACT
Though quite a few image segmentation benchmark datasets
have been constructed, there is no suitable benchmark for
semantic image segmentation. In this paper, we construct a
benchmark for such a purpose, where the ground-truths are
generated by leveraging the existing fine granular groundtruths in Berkeley Segmentation Dataset (BSD) as well as
using an interactive segmentation tool for new images. We
also propose a percept-tree-based region merging strategy
for dynamically adapting the ground-truth for evaluating
test segmentation. Moreover, we propose a new evaluation
metric that is easy to understand and compute, and does not
require boundary matching. Experimental results show that,
compared with BSD, the generated ground-truth dataset is
more suitable for evaluating semantic image segmentation,
and the conducted user study demonstrates that the proposed
evaluation metric matches user ranking very well.
Index Terms Benchmark, Evaluation, Semantic Image
Segmentation, Dataset
1. INTRODUCTION
Semantic image segmentation refers to the task of segmenting
an image into a set of non-overlapped meaningful regions
corresponding to objects or parts of the objects which can
deliver semantics or high-level structure information. A
good semantic image segmentation can benefit many other
computer vision tasks and multimedia applications such as
object recognition, content-based image indexing, summary
and retrieval, and image editing.
Although there is no universal answer on what a good
semantic image segmentation should be since the concept
of semantic is subjective and content-dependent, there are
some general criteria. First, a good semantic segmentation
should be able to achieve high similarity within segments
and low association across the segments. Second, the segmentation boundary should match human perception. Third,
semantic segmentation should reflect significant features and
small-scale fluctuation should be ignored according to the
part salience theory [1]. In other words, semantic image
segmentation should decompose an image into a small set
of meaningful regions (different from over-segmentation or
clustering pixels), each of which is of considerable size. Fig. 1
shows a few examples of semantic image segmentation.
(a) Images
(b) BSD
(c) BSD
(d) Ours
with:
(
pi pj =
(
Wij =
jNk (i)
1, R(S1 , pi ) 6= R(S1 , pj )
0, otherwise
P R
+ P
(1 )R
(2)
Wij+
Wij
1,
0 d(i, j) < a
)
,
b d(i, j) < c
0.5(
cb
0,
c d(i, j)
0,
da 2
0.5( ba ) ,
2
= 0.5( cd
cb ) 1,
1,
0,
0 d(i, j) < a
a d(i, j) < b
b d(i, j) < c
c d(i, j) < c + a
c + a d(i, j)
(3)
(4)
(a) Image
(b) Ground-truth
(c) BSD
(d) Test
(e) Test
(f) Test
(g) Test
(h) Test
(a) with c = 10
(c)-(h):
Precision
0.43
0.80
0.59
0.79
0.50
0.38
Recall
0.77
0.78
0.54
0.78
0.48
0.35
F-measure
0.55
0.79
0.57
0.79
0.49
0.36
(b) with a = 2
6. CONCLUSION
7. REFERENCES
[1] D. D. Hoffman and M. Singh, Salience of visual parts,
Cognition, vol. 63, no. 1, pp. 29 78, 1997.
[2] D. Martin, C. Fowlkes, D. Tal, and J. Malik, A database
of human segmented natural images and its application to
evaluating segmentation algorithms and measuring ecological
statistics, in ICCV, July 2001, vol. 2, pp. 416 423.
[3] D. Batra, A. Kowdle, D. Parikh, J. Luo, and T. Chen.,
icoseg: Interactive co-segmentation with intelligent scribble
guidance, in CVPR, 2010, pp. 3169 3176.
[4] S. Alpert, M. Galun, R. Basri, and A. Brandt, Image
segmentation by probabilistic bottom-up aggregation and cue
integration, in CVPR, June 2007, pp. 1 8.
[5] M. Everingham, L. V. Gool, C. Williams, J. Winn, and
A. Zisserman,
The PASCAL Visual Object Classes
Challenge 2009 (VOC2009) Results, http://www.pascalnetwork.org/challenges/VOC/voc2009/workshop/index.html.
[6] E. Borenstein and S. Ullman, Class-specific, top-down
segmentation, in ECCV, 2002, pp. 109 122.
[7] D. Martin, C. Fowlkes, and J. Malik, Learning to detect
natural image boundaries using local brightness, color,and
texture cues, IEEE Trans. PAMI, vol. 26, no. 5, pp. 530
549, May 2004.
[8] X. Jin and C. H. Davis, A genetic image segmentation
algorithm with a fuzzy-based evaluation function, May 2003,
vol. 2, pp. 938 943, ICFS.
[9] A. Blake, C. Rother, M. Brown, P. Perez, and P. Torr, Interactive image segmentation using an adaptive gmmrf model, in
ECCV, 2004, pp. 428 441.
[10] T. Nguyen, J. Cai, J. Zhang, and J. Zheng, Robust interactive
image segmentation using convex active contours, in IEEE
Trans. on Image Processing, 2012, vol. 21, pp. 3734 3743.
[11] X. Bai and G. Sapiro, A geodesic framework for fast
interactive image and video segmentation and matting, in
ICCV, 2007, pp. 1 8.