Sunteți pe pagina 1din 48

P.

Nagabhushan
University of Mysore
pnagabhushan@hotmail.com
11
Entropy
Computations
Histogramming
Regression
Line Scan Signals
Equivalence
Test
12
Generation of Line Scan Signals
Transforming a text document images line scan signals into ENTROPY
Row Transitions
Column
Transitions
The structure of the text components are the foreground overlay on the back
ground of the document image resulting in a 0/1 transition line scan signals.
0 - 1
Text to Background transition (+ve)
1 0
Background to Text transition (-ve)
Entropy Measure of change in energy caused due to transitions.
13
Entropy is defined as :
E = p Log(1/p) + (1-p) Log(1/(1-p))
p
Probable occurrence of transition.
(1-p)
Non probable occurrence of transition.
Veronique Eglin and Stephane Bres, Document Page Similarity Based on
Layout Visual Saliency: Application to Query By Example And Document
Classification, International Conference on Document Analysis and
Recognition, 2003.
For recognizing the unique structure of the components and to comprehend
the content of the components at three hierarchical levels, in this research work,
two new models namely,

Conventional Entropy Quantifier (CEQ)

Spatial Entropy Quantifier (SEQ)

have been developed originating with the entropy formulation
14
Conventional Entropy Quantifier (CEQ)
This is based on the number of times the transition occurs in each row and column of
the image matrix.
)) 1 /( 1 log( ) 1 ( ) / 1 log( ) ( p p p p t E + =
t is the transition from 0-1 and 1-0.
p is calculated based on the number of +ve and -ve transitions in each row
or column divided by the total number of columns or rows respectively.
1-p is the probable non-occurrences of transition.
E(t) is the total entropy.
15
Development of the Computational Model


Depending upon the transition (1/0 or 0/1) as described above E(t) could be E
-
(t) or
E
+
(t).

Let R represent the set of sequence of m rows in a component and C represent the
set of sequence of n columns in a component.

R= {r
o
/ every r
o
is a horizontal row of consecutively placed n pixels one after the
other} and 1s o s m.

C={r
|
/ every r
|
is a vertical column of consecutively placed m pixels one below the
other} and 1s | s n.
Total entropy along an o
th
row is defined as

E(o) = E
+
(o) + E
-
(o)

where E
+
(o) = E
|
E
+
(t) at all packets of transitions scanning over |;
E
-
(o) = E
|
E
-
(t) 1s |s n.

Hence the net horizontal entropy for a component is defined as
E
H
= E
R
E(o)


16
Similarly total entropy along a |
th
column is

E(|) = E
+
(|) + E
-
(|)

Where E
+
(|) = E
o
E
+
(t) at all packets of transitions scanning over o;
and E
-
(|) = E
o
E
-
(t) 1s os m.


Hence the net vertical entropy for a component is
E
V
= E
C
E(|)

Finally the overall entropy of the component is

E
O
=E
H
+E
V
entropy feature is represented as a 4-dimentional feature set F1, F2, F3 and F4. where,
F1 stands for the total row entropy for transition 1-0 (E
-
( o))
F2 stands for the total row entropy for transition 0-1(E
+
( o))
F3 stands for the total column entropy for transition 1-0 (E
-
(|))
F4 stands for the total column entropy for transition 0-1(E
+
(|))
1-dimensional feature F5 where, F5=F1+F2+F3+F4 is the overall entropy E
O

17
Component F1 F2 F3 F4 F5 difference
Hydrobia 5.23 5.23 22.58 22.58 55.63 0.52
Hydrobic 5.22 5.22 22.33 22.33 55.11

Impunity 5.21 5.21 14.58 14.58 39.59 1.11
Impurity 5.22 5.22 14.02 14.02 38.48

Metc 4.77 4.77 46.37 46.37 102.29 2.27
Meta 4.88 4.88 47.97 47.97 105.69
Component F1 F2 F3 F4 F5 difference
Hydrobia 5.08 5.12 26.59 25.60 62.40 1.46
Hydrobic 5.07 5.10 26.00 25.02 61.20

Impunity 5.05 5.08 16.45 16.18 42.76 1.62
Impurity 5.06 5.09 15.78 15.52 41.45

Metc 4.63 4.63 53.38 52.98 115.61 4.52
Meta 4.74 4.73 55.17 54.73 119.38
Distance between the features extracted with tight segmentation
Distance between the features extracted without tight segmentation
18
Experimental Analysis with Highly Resembling Finer Components.
Set Character F1 F2 F3 F4 F5
(0,o)
0 5.33 5.33 2.17 2.17 14.99
O 4.28 4.28 3.72 3.72 16.00
(u,v)
u 5.08 5.08 4.88 4.88 18.32
v 4.88 5.08 4.88 5.08 18.32
(b,p,d,q)
b 5.02 5.32 3.09 2.64 16.07
p 5.23 4.85 3.09 2.98 16.15
d 5.00 4.28 3.39 3.39 16.05
q 4.67 5.42 3.02 3.33 16.43
(6,9)
6 5.14 5.06 2.73 2.90 15.83
9 5.51 5.14 2.79 2.84 16.28
Set of highly resembling finest components
19
Experimental Analysis with Variation in Font Size and Font Style
Font
size
F1 F2 F3 F4 F5
8 3.6902 3.8274 7.3345 7.3216 22.1738
10 4.0664 4.337 8.4822 8.4495 25.3352
12 4.7016 4.8436 8.821 8.7091 27.0754
14 5.0523 5.1278 9.1954 9.1356 28.5111
16 5.3422 5.5107 10.1921 10.1921 31.237
18 5.6999 5.7931 10.9236 10.8178 33.2343
Feature vector for a component (word) in different font sizes.

Font style F1 F2 F3 F4 F5
TNR 5.1054 4.9964 6.9746 6.4661 23.5424
Arial 4.9007 5.7858 6.596 5.7773 23.0598
courier 4.0503 4.2283 9.5296 9.3543 27.1625
courierN 3.8731 4.0407 10.5304 9.8292 28.2734
Feature vector for a component (word) in different font styles.
20
Experimental Analysis with Medium (Line) Components
Line is a group of words. Experiments are conducted on lines with different
combinations of fine level components.
Line 1: Radha went to school in the afternoon.
Line 2: in the afternoon Radha went to school.
Line 3: went to school Radha in the afternoon.
Line 4: went to school in the afternoon Radha.
Line 5: in the afternoon went to school Radha.
sample F1 F2 F3 F4 F5
Line1 5.28 5.28 77.16 77.16 164.86
Line2 5.28 5.28 77.16 77.16 164.86
Line3 5.2753 5.2753 77.1597 77.1597 164.86
Line4 5.2753 5.2753 77.1597 77.1597 164.86
Line5 5.2753 5.2753 77.1597 77.1597 164.86
Feature vector of medium components with tight segmentation
21
Experimental Analysis with Coarse (Paragraph) Components.



paragraph 1 Paragraph 2
Paragraph 3
sample F1 F2 F3 F4 F5
Para1 20.8874 20.9908 38.4377 37.719 118.0349
Para2 36.1343 36.4856 23.7066 23.2941 119.6205
Para3 23.9655 24.0126 33.4495 32.828 114.2556
Feature vector for coarse components with tight segmentation
22
Spatial Entropy Quantifier (SEQ)
The formulation of SEQ essentially emanates from CEQ, which is morphed to take an
improved form.
position of transition is represented by pos.
Where pos is the row number in column transition and column number in row transition.
23
Development of the Computational Model
Consider a component of size m x n, obtained from any hierarchical level.

Let R represent the set of sequence of m rows and C represents the set of sequence of
n columns.

R= {r
o
/ every r
o
is a horizontal row of consecutively placed n pixels one after the other}
and 1s o s m.

C= {c
|
/ every c
|
is a vertical column of consecutively placed m pixels one below the
other} and 1s | s n.

If the transition occurs in a row r

then it contributes to row Entropy at position pos and


it is formulated as:




If the transition occurs in a column c

then it contributes to column entropy at position


pos and it is given by


)))) ) (( log( )) ( (( )) log( ) )((( ( ) ( pos n m n n pos m pos n n pos m r r E + =
o o
)))) ) (( log( )) ( (( )) log( ) )((( ( ) ( pos n m m m pos n pos m m pos n c c E + =
| |
24
Total entropy for each row E(ro) is the summation of entropy at +ve (E+( ro)) and ve
(E-(ro)) transitions in a particular row which is formulated as,

+
+ =
n n
r E r E r E ) ( ) ( ) (
o o o
Similarly for each column,

+
+ =
m m
c E c E c E ) ( ) ( ) (
| | |
Then the total horizontal entropy E
h
(R) is given by,

=
n
h
r E R E ) ( ) (
o
Similarly for vertical entropy E
v
(C) is given by,

=
m
v
c E C E ) ( ) (
|
Total entropy of the component is given
by
) ( ) ( ) ( C E R E t E
v h
+ =
25
Spatial-entropy quantifier ,
Position of occurrence of transition.
Based on the position of the component the entropy also changes.
A Hypothetical
Experiment


Component (EMPTY) Position -1


Position -2 Position 3
26
Position-1

Position-3 Position-2
Features Features Features
F1 F2 F3 F4 F1 F2 F3 F4 F1 F2 F3 F4
-34.34 -34.37 -23.17 -23.28 -137.85 -137.97 -11.31 -11.35 -258.00 -258.23 -16.58 -16.63
-34.13 -34.16 -23.04 -23.15 -136.94 -137.04 -11.26 -11.30 -256.38 -256.57 -16.52 -16.57
-38.61 -38.67 -26.10 -26.16 -142.03 -142.28 -14.15 -14.18 -262.08 -262.53 -19.36 -19.39
-38.40 -38.46 -25.98 -26.01 -141.20 -141.44 -14.10 -14.11 -260.65 -261.04 -19.30 -19.31
-42.86 -42.96 -28.96 -29.03 -146.20 -146.60 -16.97 -17.00 -266.16 -266.84 -22.10 -22.14
-42.73 -42.76 -28.90 -28.93 -145.72 -145.84 -16.94 -16.95 -265.32 -265.52 -22.07 -22.09
-42.67 -42.70 -31.82 -31.90 -145.50 -145.61 -19.78 -19.81 -264.92 -265.12 -24.85 -24.89
-47.22 -47.26 -34.72 -34.76 -150.77 -150.91 -22.60 -22.62 -270.90 -271.14 -27.61 -27.63
-47.11 -47.18 -37.57 -37.70 -150.37 -150.63 -25.41 -25.47 -270.23 -270.67 -30.35 -30.42
-47.04 -47.07 -40.60 -40.64 -150.13 -150.25 -28.30 -28.33 -269.81 -270.02 -33.18 -33.21
-51.31 -51.43 -40.41 -40.50 -154.42 -154.80 -28.20 -28.25 -274.09 -274.74 -33.08 -33.13
-55.81 -55.85 -43.55 -43.60 -159.38 -159.53 -31.16 -31.19 -279.50 -279.75 -35.98 -36.00
-55.72 -55.76 -43.25 -43.35 -159.10 -159.24 -31.00 -31.05 -279.04 -279.27 -35.81 -35.86
-55.55 -55.63 -46.40 -46.56 -158.58 -158.84 -33.96 -34.06 -278.16 -278.59 -38.71 -38.80
-55.47 -55.51 -46.09 -46.29 -158.34 -158.46 -33.78 -33.90 -277.74 -277.95 -38.54 -38.65
-60.05 -60.15 -163.54 -163.84 -283.56 -284.05
-59.73 -59.87 -162.62 -163.00 -282.02 -282.65
-64.39 -64.45 -168.00 -168.15 -288.10 -288.35
-64.00 -64.09 -166.90 -167.15 -286.29 -286.72
-68.69 -68.74 -172.31 -172.47 -292.40 -292.66
-68.27 -68.32 -171.17 -171.31 -290.56 -290.78
Table -3.9: Entropy Values for a Component EMPTY with agent X placed at different position.
27
Experimental Analysis with Highly Resembling Components
CEQ F1 F2 F3 F4 F5 Distance(F5)
U 4.8765 4.8765 2.4371 3.69 15.8801
0
V 4.8765 4.8765 2.4371 3.69 15.8801
SEQ
U 278.18 275.13 66.66 169.14 789.10
00.99
V 277.59 275.10 68.54 168.66 789.88
CEQ F1 F2 F3 F4 F5 Distance(F5)
intended 4.69 4.72 21.91 19.2 50.72
0
indented 4.69 4.72 21.91 19.2 50.72
SEQ
intended 1657.4 1671.3 7707 5546.9 16582.6
0.2
indented 1656.5 1670.4 7793.4 5462.1 16582.4
Entropy values extracted using CEQ and SEQ
28
Sample TNR cap


O
Arial cap


O
TNR small


O
Arial small


o
TNR digit


0
Arial digit


0
TNR cap - O
0 167.4 958.7 645.32 449.2 192
Arial cap - O
167.4 0 1126.1 812.72 616.6 359.4
TNR small l o
958.7 1126.1 0 313.38 509.5 768.7
Arial small -o
645.32 812.72 313.38 0 196.12 453.32
TNR digit - 0
449.2 616.6 509.5 196.12 0 257.2
Arial digit -0
192 359.4 768.7 453.32 257.2 0
Distance using feature F5 extracted using SEQ
Experimental Analysis with Variation in Font Size and Font Style.
29
Experimental Analysis with Fine (word) Components
Sample F1 F2 F3 F4 F5
Distance
using F5
but 680.93 647.51 709.80 472.91 2,511.20
8.24 tub 664.00 617.77 661.07 485.95 2,428.80
hydrobia 2970.1 2972.9 17955 18224 42122
2761 hydrobic 2899.5 2890.1 16651 16920 39361
impunity 2093.5 2121.1 7263.3 6871.7 18350
1489 impurity 1991.1 2018.8 6614.6 6236.2 16861
bombay 1830.9 1819.3 7262.9 7031.6 17945
549 dombay 1802 1849.5 6977.7 6767.1 17396
intended 1657.4 1671.3 7707 5546.9 16583
1 indented 1656.5 1670.4 7793.4 5462.1 16582
korna 1089.4 1091.6 3411 1958.1 7550
185 korne 1026.1 1002 3275.2 2061.8 7365
Distance between words using SEQ
30

Features extracted through CEQ Features extracted through SEQ.


Distance using F5 extracted through CEQ Distance using F5 extracted through SEQ

31
Experimental Analysis with Medium (line) Components
Line 1: Radha went to school in the afternoon.
Line 2: in the afternoon Radha went to school.
Line 3: went to school Radha in the afternoon.
Line 4: went to school in the afternoon Radha.
Line 5: in the afternoon went to school Radha.

Feature vectors for 5 line samples using CEQ Feature vector for 5 line samples using SEQ

Distance using F5 extracted through CEQ Distance using F5 extracted through SEQ
32
Wavesim Transform:
A Novel Perspective Of
Wavelet Transform For Signals


The CWT is essentially a collection of inner products of a signal
f(t) and the translated and dilated wavelet for all a and b
where the values W(a,b) represents the correlation coefficients:


CWT has a cross correlation interpretation as well.

The WaveSim(WS) transform of f(t) with
respect to a wavelet a,b(t) is defined as
(, ) = ((), , ())
where Sim(.,.) is the similarity measure
computed as illustrated in figure with a
Haar Wavelet. The amplitude of is
max(f(t)).

A1 = Area Covered by the positive half of the Haar Wavelet
A2 = Area Covered by the negative half of the Haar Wavelet
A3 = Positive area spanned by f(t) within the positive half of the wavelet.
A4 = Negative area spanned by f(t) within the negative half of the wavelet
A5 = Negative area spanned by f(t) within the postive scale range of the wavelet
A6 = Positive area spanned by f(t) within the negative scale range of the wavelet
(A3/A1) is the similarity measure between the positive half of the wavelet and f(t)
in the positive scale range, (A5/A1) is the dissimilarity measure between the
positive half of the wavelet and f(t) in the positive scale range.
Likewise (A4/A2) is the similarity measure between the negative half of the
wavelet and f(t) in the negative scale range, (A6/A2) is the dissimilarity measure
between the negative half of the wavelet and f(t) in the negative scale range.
The WaveSim coefficient is computed as follows
Sim(f(t), (t)) = ((A3 A5)/A1 + (A4 A6)/A2))
helps in normalizing the coefficient values to lie in the interval [-1,1].
Case 1: Input signal is a set of 200 samples with equal values
(f(t) is a horizontal straight line)
Case 2: Input signal is a set of 200 samples with the first 100
samples holding a value 20 and the second 100 samples holding a
value 80.
Case 3: Input signal is a set of 200 samples linearly increasing from
1 to 200. (f(t) is a straight line)
Case 4: Input signal is a set of 200 samples with a curved structure.
Thank You

S-ar putea să vă placă și