Sunteți pe pagina 1din 4

RECURSIVE DISPARITY ESTIMATION ALGORITHM FOR REAL TIME

STEREOSCOPIC VIDEO APPLICATIONS


E.A. Hendriks Gy. Marosi
Delft University of Technology Budapest University of Technology
Dept. of EE, Information Theory Group Dept. of Telecommunication and Telematics
Mekelweg 4 H-1111 Sztoczek utca 2.
2628 CD Delft Budapest
The Netherlands Hungary
e-mail hendriks@it.et.tudelft.nl e-mail marosi@bme-tel.ttt.bme.hu

ABSTRACT of view the most promising algorithms are those based


Several implementations of a recursive disparity esti- on block-wise processing. Block-based algorithms are
mator have been investigated in which bi-directional highly structured and can easily be pipelined or paral-
spatial predictions as well as temporal predictions are lelized.
used. The recursive disparity estimator showed to be The problem of disparity estimation is strongly re-
very attractive for real time multi view interpolation lated to motion estimation. Nowadays, block-based
applications because of the use of the predictions and motion estimators are being developed [6, 7, 8] or
small search ranges or update sets. The estimated dis- even commercially available [9]. All implemented al-
parity elds are smooth, causing less visible artifacts gorithms are based on full search and/or hierarchical
in the interpolated intermediate viewpoints. Moreover, block matching. The main application for this class of
the recursive estimator is attractive for hardware im- motion estimators is MPEG coding. For coding pur-
plementation. Only a small number of updates have to poses the aim is to nd good matching values, where
be checked, thereby considerably reducing the number the quality of the estimator is usually judged upon the
of operations to be executed. compression ratio that can be achieved. For interpo-
lation purposes it is not enough to nd local optima,
1. INTRODUCTION but true vectors are necessary to prevent annoying ar-
tifacts.
One of the goals in the development of enhanced 3D- Finding the true vector elds is not obvious. As-
telepresence systems is to provide the viewer an un- suming ideal cameras and illumination, there are still
limited number of viewpoints with a limited number of non-occluded areas where simple estimators fail, for ex-
cameras. This so-called multi viewpoint capability can ample with no texture or with repetitive pattern. Per-
be achieved by interpolating intermediate viewpoints ceptually the most irritating result of not having true
using at least two recorded image sequences (left and vector values is the lack of consistency in the output:
right) and their disparity information. Assuming par- spatial inconsistencies in the interpolated still images
allel camera alignment the di erence between the left and additional temporal inconsistencies in interpolated
and the right images is due to the objects' horizon- image sequences.
tal translation depending on their depth in the scene.
This displacement is called disparity and its estimation 2. BLOCK-BASED RECURSIVE
is required. DISPARITY ESTIMATION
In recent years several disparity estimation algo-
rithms have been developed. One can distinguish be- There have been many simulations of various algo-
tween object-based [1], pixel-based [2] and block-based rithms for disparity estimation - all of them su er from
disparity estimators [3, 4, 5]. From a hardware point some errors in the result eld. Amongst the block-
This work was done in the framework of the European ACTS based algorithms, the simplest full search results in
AC092 'PANORAMA' Project and supported by a mobility annoying discontinuities that is even more irritating
grant of the TEMPUS program for moving sequences. The smoothness constraint of
BBBBB CCCCC
BBBBB
CCCCC
BBBBB CCCCC
BBBBB BBBBB
CCCCC
BBBBB
BBBBB CCCCC
BBBBB
CCCCC
BBBBB BBBBB
CCCCC
BBBBB
BBBBB
BBBBB
BBBBB
Block in question
CCCCC
BBBBB
CCCCC
T (a) T (b) T (c) Spatial Predictor

CCCCC CCCCC CCCCC CCCCC CCCCC CCCCC &&&&&


CCCCC
&&&&&
Temporal Predictor
CCCCC CCCCC
BBBBB !!!!!
!!!!! !!!!!
BBBBB
CCCCC CCCCC
!!!!!
BBBBB !!!!!
!!!!! !!!!!
CCCCC CCCCC
!!!!!
BBBBB *****
&&&&&
CCCCC
CCCCC CCCCC
BBBBB
CCCCC CCCCC
&&&&&
!!!!! CCCCC
!!!!!
BBBBB
CCCCC CCCCC CCCCC
&&&&&
!!!!! CCCCC
!!!!!
BBBBB
CCCCC CCCCC *****
Temporal accelerator
*****
BBBBB &&&&&
!!!!! !!!!!
BBBBB
&&&&& &&&&&
!!!!!
!!!!! !!!!!
BBBBB
!!!!!
&&&&&
BBBBB &&&&& *****
!!!!! *****
!!!!!
&&&&&
*****
!!!!!
***** *****
!!!!!
*****
T (d)
T *****
T *****
T1 (e) T1 (f)

Figure 1: Di erent search strategies: a) full search, b)+c) one-directional spatial prediction, d) bi-directional spatial
prediction, e) spatial-temporal prediction, f) spatial-temporal prediction with temporal accelerators.

disparity paths suggests the usage of the spatial re- date values f? 2 , 0, + 2 g resulted in a further im-
L L

dundancies in real-life images, and also the temporal provement in the smoothness of the disparity eld caus-
redundancy in image sequences, hence a recursive al- ing a consistent interpolated output and a much faster
gorithm seems to be advantageous [10]. Let us consider estimation process. The optimal values of both 1 and
L

the steps that lead from the full search approach to the L 2 for the AQUA and the MANEGE test images with a
spatial-temporal recursive algorithm with convergence block-size of 8-by-8 turned out to be between 2 and 4.
accelerators. To improve the consistency of the interpolated se-
The disparity values for the images used over the quences temporal recursion was also utilized; a third
tests are known to be in the range of -16 and +40, hence search is applied, initialized by the disparity value of
the block-based full search algorithm (Figure 1/a) was the same block in the previous frame (Figure 1/e). Al-
applied in this interval only1; minimum absolute di er- though this step increases the number of candidates,
ence in the luminance domain was the search criterion the annoying temporal jumps of blocks are highly re-
of the block matching. As a 0 order approach towards
th
duced. The spatially bi-directional search still has the
spatial recursion, the estimated disparity value of a pre- weakness of slow convergence at the upper edges of
viously evaluated neighboring block was selected as the objects. Using the estimated results of the previous
relative zero value of the search range (Figure 1/b,c). frame four-directional convergence can be introduced,
To utilize bi-directional convergence, the set of dispar- by adding the results of two more neighboring blocks
ity candidates was extended by a second search ini- (referred to as the convergence accelerators) to the ini-
tialized by the estimated disparity value of another tializing set (Figure 1/f). To avoid the degradation of
evaluated neighboring block (Figure 1/d). Obviously, spatial consistency, the update set of these convergence
overlapping ranges of the search area have to be calcu- accelerators is reduced to f0g, i.e., no search is done
lated only once, and due to the usually high spatial re- around their disparity values, just the values them-
dundancy of blocks, this bi-directional search requires selves are taken into account as possible candidates.
only a little extra computation. A smoother dispar-
ity eld can be achieved by strongly limiting the num- 3. RESULTS
ber of possible candidate disparity values. Keeping the
bi-directional convergence structure the search range All algorithms described above have been implemented
was radically reduced to the range of [? 1 + 1 ] L ::: L and tested on several real-life stereoscopic sequences;
around the disparity values of the initializing blocks for the evaluation with the AQUA sequence see Ta-
(Figure 1/d); decreasing the candidate set to the up- ble 1, for other sequences having the same camera set-
1 Except for the MAN sequence, where due to the larger base- up these algorithms show very similar characteristics
line there is an o set in the disparity values. as in case of AQUA.
See Spatial/Temporal Update Number of Notes
Fig. Recursion Length Cost Function
1/  1 2 Operations [106]
L ;

a) No/No - 23.2 Full Search [-16,+40], annoying temporal


inconsistency; slow estimation process.
b) Range / No 2 3.0 Cannot follow edges, shadows on the in-
terpolated image.
c) Range / No 4 4.9 Good still pictures.
d) Range / No 8 8.5 Dizzy background.
e) Range / Range 2 3.4 Temporal consistency is introduced.
f) Range / Range 2 3.6 Faster convergence at the upper edges of
objects due to temporal accelerators.
f) Set / Set 2 2.5 Fast convergence due to temporal acceler-
ators and faster estimation process; best
results.

Table 1: Results of the recursive disparity estimator for the DISTIMA test sequence AQUA. Since there have been no
reliable and standard quality measurement and validation methods, Notes re ect the subjectively judged perceptual
quality achieved.

The usefulness of the predictors can clearly be seen mance and perceptual quality are demonstrated. These
both on the decreased computational requirement and kinds of disparity estimation algorithms with a sim-
on the more consistent disparity elds they provide. ple majority voting post-processing step provide a dis-
The \convergence accelerator" predictors not only im- parity eld in the order of 1{2 seconds per frame on
prove the estimation at the edges of objects, but they a Sun-4m workstation, and the result of the spatial-
also reduce the convergence delay introduced by the temporal recursive algorithm with convergence accel-
recursions; for the test sequences a delay of 5{6 frames erators is comparable, for example, to the one yielded
(that is 0.2 sec supposing 25 frames/sec) was found. by a hierarchical estimation method [12], while the lat-
The MAN sequence, due to the larger baseline, has ter is reported to take 6 hours of computing per frame
disparity values in the range of [+40 + 100], which
::: (i.e., 4 magnitudes more time) on the same platform.
fact was compensated by shifting both views 25 pixels, It also has to be mentioned that these recursive algo-
thus transforming the disparity range to [?10 +50].
::: rithms are hardware ecient, since they work on the
Because of the large noisy background area of the MAN basis of very simple and regular rules.
sequence the initializers of the recursive disparity esti- From our evaluation it can be concluded that the
mators are not coherent, so they yield noisy output, recursive disparity estimator requires operations about
which results several times in multiple initial values one order of magnitude less than a comparable full
and non-overlapping search ranges for the recursions. search estimator, while having less visible artifacts, es-
(Note that the noisy values do not cause more harm in pecially in image sequences (see Figure 2). Therefore
this sequence since the background has very few visible it is a suitable candidate for hardware implementation
patterns.) Hence, 30% more calculations were required in a real time multi viewpoint stereoscopic video chain.
for MAN than for AQUA and MANEGE. There are still some problems that have to be
It has to be mentioned, that the stereo image se- solved, e.g. recursive disparity estimation without dis-
quences that were used during the tests su er from parity o set compensation failed on the sequences hav-
luminance di erences between the left and the right ing large baselines, which could be avoided by applying
images. In order to avoid it a stereo balancing prepro- a preprocessor to nd and realize the o set-shift. Some
cessing step was applied [11]. further improvement might be achieved by taking into
account some sort of reliability measure during the es-
4. CONCLUSIONS timation by weighting the possibilities for the recursion
initialization blocks, based, for example, on the error
Various block-based recursive disparity estimation al- eld or on the left-to-right versus right-to-left disparity
gorithms are discussed in this article and their perfor- path consistency; a cross checking technique would also
Figure 2: The 13 interpolated frame of the DISTIMA test sequence MAN: (a) with full search, and (b) with
th

spatial-temporal recursive disparity estimation algorithm.

improve results for one-side occluded parts of objects. Transactions on Circuits and Systems for Video
Simple frame correlators might be applied to skip tem- Technology, October 1995.
poral recursion if successive frames di er more than a
certain threshold. [7] H. Yeo and Y.H. Hu. A novel modular systolic
array architecture for full-search block matching
motion estimation. IEEE Transactions on Circuits
5. REFERENCES and Systems for Video Technology, October 1995.
[1] L. Falkenhagen and M. Strintzis. 3d object- [8] A. Pirson et al. A programmable motion estima-
based depth estimation from stereoscopic image tion processor for full search block matching. Pro-
sequences. Deliverable of the DISTIMA project, ceedings ICASSP, Detroit, May 1995.
March 1995.
[9] M. Querol. Sti3220, motion estimation processor
[2] J.N. Driessen, L. Boroczki, and J. Biemond. Pel- codec. Application Note, SGS Thompson, 1995.
recursive motion eld estimation from image se-
quences. Journal on Visual Comm. and Image [10] G. de Haan and H. Huijgen. Motion estimator for
Representation, September 1991. tv-picture enhancement. Proceedings Signal Pro-
cessing of HDTV, III, Turin, September 1991.
[3] S.S. Intille and A.F. Bobick. Disparity space im- [11] R.E.H. Franich. Disparity estimation in stereo-
ages and large occlusion stereo. MIT Media Lab scopic digital images. ph.D Thesis, Dept. of EE,
Vision and Modeling Group, Technical Report No
220, 1993. Delft University of Technology, September 1996.
[4] B. Chupeau. A multiscale approach to the joint [12] P. Doelam. Hierarchical disparity estimation using
computation of motion and disparity: applica- disparity space images. Msc. Thesis, Dept. of EE,
tion to the synthesis of intermediate views. Proc. Delft University of Technology, June 1995.
4th European Workshop on 3DTV, Rome, October
1993.
[5] R.E.H. Franich, R.L. Lagendijk, and J. Biemond.
A disparity space image path. Proceedings of the
International Workshop on Stereoscopic and Three
Dimensional Imaging, Santorini, September 1995.
[6] L. de Vos and M. Schobinger. VLSI architecture
for a exible block matching processor. IEEE

S-ar putea să vă placă și