Documente Academic
Documente Profesional
Documente Cultură
Gualter License IC Determination
Gualter License IC Determination
___________________________
____________________2016
____ _________________2016
LICENSE PROJECT
Image Complexity Determination System
Author
Rodideal Gheorghe, student gr. C-41
Project Supervisor
Veaceslav L. Perju, Dr. Hab., conf. univ.
hiinu 2016
MINISTERUL EDUAIEI AL REPUBLIII MOLDOVA
Admis la susinere
Deanul Faultii
Iuri Dubovehi, Dr, conf. univ.
Admis la susinere
ef atedr
___________________________
_____________________2016
___________________________
____ _________________ 2016
TEZA DE LIEN
Sistemul de Determinare a Complexitii Imaginilor
Exeutant
Rodideal Gheorghe, Studentul grupei C-42
ondutorul tezei
Perju Veaeslav, Dr. hab., conf. univ.
hiinu 2016
S AR I N A
pentru teza de lien a studentului grupei C-42
Rodideal Gheorghe
Tema: Sistemul de Determinare a Complexitii Imaginilor
aprobat prin ordinul nr. ________ din ___ _____________ 20...
oninutul notei expliative: 1. Analiza algoritmilor, metodelor i sistemelor existente de
determinare a complexitatii imaginelor; 2. Elaborarea, realizarea i eretarea unui algoritm
noude determinare a pixelilor din imagini; 3. Elaborarea i eretarea sistemului de gestioinare a
complexitatii imaginelor.
Lista materialului graphi: 1. lasifiarea algoritmilor, metodelor i sistemelor existente de
determinare; 2. Strutura algoritmului nou de determinare a complexitatii; 3. Shema-blo a
softului elaborat de determinarea complexitatii; 4. Rezultatele eretrilor algoritmului de
determinare a complexitatii imaginelor; 5. Strutura sistemului de determinare; 6. Rezultatele
eretrilor sistemului de determinare a complexitatii imaginelor.
Data nmnrii sarinii: ___ _____________ 20...
Exeutant
Rodideal Gheorghe,
Studentul grupei C-42
ondutorul tezei
Perju Veaeslav, Dr. hab., prof. univ.
ADNOTARE
ABSTRAT
The thesis ontains the introdution, three hapters, onlusions and reommendations,
bibliography of XX titles. It onsists of XX pages, inluding XX figures and X tables and XX
formule.
Keywords:algoritmi,detetarea,reunoaterea,sistem,multidimensional,marcareadetetion.
Field of study of the thesis is information proessing.
Goals and objetives inlude researhing, multidimensional extraction system accounts..
Novelty and originality of this work is use of drivers to onnet with images.
The theoretial signifiane developing a software that implements several algorithms for
Image Complexity Detection.
Appliative value of the work is that this system an be edge detection in various
ompanies and areas.
Implementation results. The system developed has been onfigured and tested on
multiple omputers and multiple images.
Contents
Introduction.....................................................................................................................................9
1.
1.2.
Comlity[8].................................................................................................................11
1.3.
1.4.
1.5.
1.6.
1.7.
Image partitioning[8]......................................................................................................22
1.8.
Complexity measures[8].................................................................................................23
1.9.
Comlity as m rorty[13]..................................................................................26
1.10.
1.11.
1.12.
1.13.
1.14.
1.15.
1.16.
Conclusion...................................................................................................................44
Bibliografie:...................................................................................................................................45
Introduction
In this paper, i introduce a new framework based on information theory and image segmentation
to study the complexity of an image. Different authors have established a relationship between
aesthetics and complexity. In 1928, G.D. Birkhoff introduced the concept of the aesthetic
measure, dened as the ratio between order and complexity The complexity is roughly the
number of elements that the image consists of and the order is a measure for the number of
regularities found in the image Using information theory, M. Bense transformed Birkhoffs
measure into an informational measure: redundance divided by statistical information. To
compute the complexity, he introduced the assumption that an input pattern can always be
described as a two dimensional grid of discrete symbols from a pre-dened repertoire. On the
other hand, he observed that order corresponds to the possibility of perceiving large structures A.
Moles held that an aesthetic measure is closely related to image complexity, and based his
measure of image complexity on information theory P.MachadoandA. Cardosoestablished that
anaesthetic visual measure depends on two factors: processing complexity and image complexity
. They consider that images that are simultaneously visually complex and easy to process are the
images that have a higher aesthetic value. From the above discussed works, it appears that
complexity is at the core of aesthetics. With the guideline that under standing complexity can
shedlight on a esthetics, we will explore image complexity from an information theoretic
perspective. Image complexity has also been related to entropy of the image intensity histogram.
However, this measure does not take into account the spatial distribution of pixels, neither the
fact that a complexity measure must take into account at what level one wants to describe an
object. For instance, a random sequence requires a long description if all details are to be
described, but a very short one if a rough picture is required . In image processing, an image is
segmented by grouping theimagespixelsintounitsthatarehomogeneousinrespect to one or more
characteristics, or features. Segmentation of nontrivial images is one of the most difcult tasks in
image processing. Image segmentation algorithms are generally based on one of two basic
properties of intensity values: discontinuity and similarity. In the rst category, the approach is to
partition the image based on abrupt changes in intensity, such as edges in an image. The principal
approaches in the second category are based in partitioning an image into regions that are similar
according to a set of predened criteria. Thresholding, region growing, and regionsplitting and
merging are examples of methods in this category . This paper is organized as follows. In this
chapter, we present an algorithm which splits an image in relatively homogeneous regions using
abinary space partition (BSP)ora quad-tree. In nex chappter , complexity is dened by using two
measures which take into account the level at which the image is considered. Finally, in next
chapter , we present our conclusions and future research.
Comlity[8]
In the last two decades, the study of complexity has become a very active research area in many
different fields (automata, information theory, computer science, physics, biology, neuro-science,
etc.) . But, what is complexity? Websters dictionary (1986) defines a complex object to be an
arrangement of parts, so intricate as to be hard to understand or deal with. According to W. Lis,
the meaning of this quantity should be very close to certain measures of difficulty concerning the
object or the system in question: the difficulty in constructing an object, the difficulty in describ ing a system, the difficulty in reaching a goal, the difficulty in performing a task. There are many
definitions of complexity corresponding to the different ways of quantifying these difficulties.
A list of complexity measures provided by Seth Lloyd is grouped under three questions: how
hard is it to describe, how hard is it to create, and what is its degree of organization? In the first
group, entropy is widely applicable for indicating randomness. It also measures uncertainty,
ignorance, surprise, or information. In the second group, the computational complexity quantifies
the amount of computational resources (usually time or space) needed to solve a problem.
Finally, in the third group, mutual information expresses the concept of complexity that
quantifies the degree of structure or correlation of a system or the amount of information shared
between the parts of a system as a result of this organizational structure.
To our knowledge, the only framework existing until now dealing with image complexity is
defined in, which deals with comparing the performance of ATR applications.
In this context, image complexity is defined as a measure of the inherent difficulty of finding a
true target in a given image. Such a metric should predict the performance of a large class of
ATRs on diverse imagery, without advanced knowledge of the targets. A split and merge
segmentation algorithm is first applied that partitions an image into compact regions of uniform
gray-level, no larger than the expected target size. Recursive thresholding determines the splits.
After the segmentation procedure is applied, the target similarity of each region is estimated. The
distribution of this similarity is taken as a basis for complexity measurement. For instance, if
there are many regions with target similarity near the maximum the image is relatively complex.
Three complexity measures are then given. The first is the number of regions whose targetsimilarity exceeds a given threshold, the second measures the distance from the body of the distribution to the most significant outlier, and the third is the weighted average of the distance to
all outliers.[8]
Pic 1.1.1: Input and output distributions for the partitioning of channel.
Img comlity[26]
s mntiond bov, in ordr to trct nd trck th trgt utomticlly it is ncssry for us to
msur th img comlity. Considring th liction nd rsrchs in ctul liction
nd rsrch, w dfin th img comlity s follows: th img comlity is msur of
th inhrnt difficulty of trcting nd trcking trgt.
Bsd on scl, th img comlity dscrition cn b clssifid into two ctgoris: on is
bsd on th globl chrctristics nd th othr on is bsd on rgionl chrctristics. Bsd
on for rticulr trgt or not, th img comlity dscrition cn b clssifid into two
ctgoris: on is in connction with rticulr trgt nd th othr on is nothing to do with th
trgt. Bsd on diffrnt dscritors, th img comlity dscrition cn b clssifid into
thr ctgoris: Thr r gry lvl bsd, dg bsd nd sh bsd.
bout th clssifiction of th dscrition mthods of img comlity, shown in Fig. 1.2.1:
Fig. 1.2.1 Tonomy of img comlity mtrics
Trgt
Indndnt
Trgt
Dndnt
sh
Rgionl
(1.4.2)
(1.4.3)
From th gry lvl co-occurrnc mtri, w cn gt th scond-ordr momnts,
contrst, corrltion, ntroy nd sris of ttur dscritions. Considring th rl-tim nd
dimnsionl consistncy, w just us gry lvl co-occurrnc mtri to clcult th ttur
ntroy to dscrib th rndomnss of th img ttur. Th formul of ttur ntroy
clcultion is s follows:
(1.4.4)
Fig. 1.4.1 Tow imgs with th sm informtion ntroy nd dgs, but diffrnt in ntroy
(1.5.1)
where n = |X|, pt = Pr[X = xt] for i {1,..n}. As logpt represents the information
associated with the result xt, the entropy gives us the average information or uncertainty of a
random variable. The logarithms are taken in entropy is expressed in bits. We use the convention
that 0log0 = 0. We can use interchangeably the notation H (X) or H(p) for the entropy, where p =
{p\, p
Pij = Pr[X = xi, Y = yj] is the joint probability. Mutual information represents the amount
of information that one random variable, the output of the channel, contains about a second
random variable, the input of the channel, and vice versa. I(X, Y) is a measure of the shared
information or dependence between X and Y.[8]
Method based on Entropy
Given an image with N pixels and an intensity histogram with n( pixels in bin i, we
define a discrete information channel where input X represents the bins of the histogram, with
probability distribution {pt} = { N }, output Y the pixel-to- pixel image partition, with uniform
distribution {q j} = { N }, and conditional probability {pj|t} of the channel is the transition
probability from bin i of the histogram to pixel j of the image. This information channel can be
represented by
(1.5.4)
Pic 1.5.1: Lena image with luminance Y709 for differents values of Pe (i) and MIR (ii). The (RMSE, PSNR) values for (i)
are (45.47, 14.66), (30.02, 18.27), (14.03, 24.88), and (8.21, 29.54), respectively. For (ii) we have (32.98, 17.45), (16.23,
23.61), (9.71, 28.07), and (6.25, 31.90), respectively.
Pic 1.5.2: Lena image in RGB system with Pe = 0.4. We obtain (a) 1835 (r = 0.70), (b) 3692 (r = 1.41), and(c) 4179 (r =
1.57) regions. The merging image (d) has RMSE=13.20 and PSNR=25.32.
ic 1.5.3 (a) BSP (r = 0.010, MIR = 42.34) (b) Quad-tree (r = 0.010, MIR = 39.10) (c) Quad-tree (r = 0.014, MIR = 42.34)
tsk: first, soft comuting roch using fuzzy vlus sms rorit whn w r trying
to dscrib sitution whr binry logic is too strict. s for th ntroic distnc function, w
cn rformult our min qustion from How coml is this img? to Wht is th distnc of this
img from th simlst ossibl img in th dfind ftur sc? .
o(n) = Tog(27 (nlog(n) + (1 - n)log(1 - n). [7] z Gi(n) = i (1- - nn1), G2(n) = 4n(i - n)
Fig. 1.4.1. Rsults from filtring nd lying ntroy functions to ll imgs in our ool, nd thn tking mn vlus
for ch ctgory
Fig. 1.5.1 mls of tst imgs, clssifid by intuitiv comlity: high comlity (to); mdium comlity (middl);
low comlity (bottom)
Th rimnts wr hld in dim light room to rduc visul distrction, giving tim to
th rticint for drknss dttion. ll th usul rgonomic rcutions, such s using
qusi-soundroof room, wr tkn, nd th subjct ws llowd to choos thir own rfrrd
osition nd visul ngl. Th imgs wr rsntd full scrn. Th softwr usd ws hommd using th multimdi rogrmming nvironmnt Mcromdi Drmwvr 2004 M on
n l Mcintosh comutr with TFT LCD monitor. Th chosn imgs wr comutr
scns of intings, dividd in thr ctgoris rrsnting diffrnt lvls of visul comlity,
bsd on th rsnc or bsnc of crtin clsss of fturs nd cu oints. Figur 1 shows
mls of inting usd in this study.
ch img wr rsntd for fid riod of tim (90 scs.), with no tmorl clus;
th rimnt lso hd controlld dsign in ordr to minimiz sid ffcts: lights dimmd nd
uniform, subjct lon in soundroof room. Th subjct ws lrtd to focus thir ttntion on
th contnts of th dislyd imgs. Th imgs usd for th rimnts wr chosn
ccording to th intuitiv hyothsis tht th comlity of scn incrss with th numbr of
objcts nd thir rltiv osition, nd with its ovrll structur [8]. Th chosn imgs wr
intings, dividd in thr ctgoris rrsnting diffrnt lvls of visul comlity. Hr th
stimt tim rcivd by ch subjct is rortd. W considr it s subjctiv msurs of
comlity for th thr ctgoris of imgs introducd bov. In th following it will b
dnotd s
Th sml mn vlu nd th vrinc of th rcivd tim (jT, uT )
r rortd in Tbl 1.5.2. It is vidnt tht th mn rcivd tim dcrss with th
comlity of th img. In ordr to comr th rcivd tim with th objctiv msur of
comlity, nd to highlight our intrst in rltiv diffrncs btwn stimtions md by th
sm subjct whn wtching diffrnt imgs, th following normlizd tim msurs r
introducd:
Fig. 1.6.1 nti-corrltion btwn th stimt of th mntl clock nd th msur of comlity vi ntroy functions
(1.7.1)
The channel varies at each partition step because the number of regions is increased and,
consequently, the marginal probabilities of YY and the conditional probabilities of YY over X also
change. This process can be interpreted in the following way: the choice of the partition which
maximizes the MI increases the chances of guessing the intensity of a pixel chosen randomly
from the knowledge of the region it pertains to.
The algorithm proposed here generates a partitioning tree for a given probability of error Pe by
maximizing the mutual information gain at each partitioning step. This algorithm is based on
Fano's inequality and was introduced by Sethi and Sarvarayudu in the context of pattern
recognition. Similar algorithms with different split criteria have been used in learning and DNA
segmentation.
Given the error probability Pe allowed in partitioning, Fano's inequality provides us with
a lower bound for the gain of mutual information. Taking the equality, we obtain the minimum
value of MI needed in the partitioning algorithm for a given probability of error:
Imin(X, Y) = H(X) - H(Pe) - Pe log(B - 1),
(1.7.2)
where B is the number of bins of the histogram. Note that Imin(X, Y) is calculated from the initial
channel.
The partitioning process can then be seen as follows. At each partitioning step, the tree
acquires information from the original image. The total I ( X , Y ) captured by the tree can be
obtained adding up the mutual information available at the non-terminal nodes of the tree
weighted by the relative area of the region, i.e., the relative number of pixels, corresponding to
each node. The mutual information Ii of an interior node i is only the information gained with its
corresponding splitting. Thus, the total mutual information acquired in the process is given by
(1.7.3)
where T is the number of non-terminal nodes and n is the number of pixels corresponding to
node i . It is important to stress that this process of extracting information enables us to decide
locally which is the best partition. Partitioning stops when I ( X , Y ) > I min( X , Y ) . Alternatively,
a predefined ratio of mutual information ( M I R ) can be given as a stopping criterion. Note that
I ( X , Y ) is the MI of the channel obtained at the end of the process.
This process can also be visualized from equation
H ( X ) = I ( X , Y) + H ( X | Y) ,
(1.7.4)
For a random image and the finest resolution it would be maximum and equal to H (X).
Thus, given an image partition, we can express the heterogeneity of an image using the JSdivergence applied to the probability distribution of each region.
We can also ask which partition maximizes the compositional complexity, for a given number of
regions. This partition should extract the maximum information of the image and create the
maximum heterogeneity between the generated parts. Finding this partition is an NP-complete
problem. We have approached the solution to this problem in using a greedy algorithm.
Our second measure is the number of needed regions in the partitioned image to extract a given
ratio of information. It is related to the complexity in describing an image, and depends on the
accuracy level given by Pe or MIR. The justification for our measure is that the number of
regions is the number of leaves of the tree created in the partitioning process. The coding of this
tree (or equivalently the description of the image) will be clearly dependent on this number. This
is further justified by taking into account that our algorithm tries to create homogeneous regions
with the minimum splitting. In this case, the error probability of the channel is interpreted as the
compression error and thus the number of regions is also related to the difficulty of compression.
[8]
Results
We use a uniform partition to test the compositional complexity on the images in Fig. 1.1.1. The
results obtained are shown in Fig. 1.7.2 for the number of partitions running from 2 x 2 to the
number of pixels in the respective images. We observe that the relative ordering of the
complexities depends on the resolution level (number of partitions). For instance, the earth rise
image appears to be the most complex at resolution 4 x 4 while the wild flowers appears as the
least one. However, this behavior is reversed at high resolution.
In Figure 1.7.1 we can analyze the behavior of the second proposed complexity measure. While
the lines in the graph in Fig 1.7.2 cross themselves, the ones in Figure 1.7.1 keep a regular
ordering. Observe their exponential growing with MIR that is due to the increasing cost of the MI
extraction. It is important to note that for MIR = 0.5 we obtain a good qualitywith a few number
of regions. With respect to the number of regions, the most complex image appears to be the
Baboon and the least one is the Earth rise.
It can also be shown (Figure 1.7.3) that while blurring an image will cause a loss of complexity,
increasing the contrast causes the opposite effect. For instance, for a MIR = 1 and the luminance
channel Y709, the contrasted Lena image of Figure 1.7.3.b (r = 91.7) needs more regions than
the original Lena image (r = 89.4) and the blurred image of Figure 1.7.3.a (r = 48.3) needs less
regions.[8]
Figure 1.7.1 : Ratio of the number of regions r with respect to MIR for the images of Fig. 1 with luminance Y709.
Figure1.7.2: Compositional complexity ICC over the number of regions R of the partitioned images of Fig. 1 with
luminance Y709. The number of partitions goes from 2 x 2 to the number of pixels N in the respective images.
Figure 1.7.3: Lena image: (a) Out of focus and (b) more contrasted than its original.
tht th visul systm might us stil-frquncy nlysis to rocss visul imgs. This ld to
th ssumtion tht th mor coml n objct, th mor high frquncis rquird in its
sctrum for rcognition. This roch is widly usd by nginrs who mloy stilfrquncy or wvlt dscrition of imgs. In ths trms comlity cn thn b dfind s th
numbr of ctiv stil frquncis or s th numbr of ctiv wvlts.
similr, but mor formlizd dfinition of img comlity cn b found in Nsnn
t l (1993). Ths rsrchrs suggst stimting th comlity of imgs s th roduct of th
squrd mdin of th distribution of stil frquncis nd th img r. Th incrsing
numbr of rltiv frquncis, or hrmonics, in th sctrum rsults in lrgr mdins. With n
incrsing numbr of non-riodic lins nd stroks in th objct, th numbr of high hrmonics
lso incrss. In this cs th mdin of such n img sctrum lso incrss nd lds to
highr stimt of img comlity, rovidd th r is th sm. ccording to this dfinition,
riodic ttrns with lrgr r r mor coml. For sinusoidl grtings, th img
comlity, thus dfind, is roortionl to th numbr of brs in th grting. Thus Nsnn t
ls (1993) dfinition incororts trditionl concts of img comlity introducd rlir in
th nintnth cntury.[7]
(1.13.1)
ssuming tht th joint comlity is symmtric, i.. C( 0|1) = C(1|0). lso on wnts to
nsur tht th distnc is normlizd roritly.
s it ws notd bov th idl comlity msur dos not ist nd Kolmogorov comlity
is not comutbl. On cn roimt th idl comlity msur in diffrnt mnnrs,
howvr. Shnnons informtion thory introducd th conct of ntroy, which is sily
stimtd from dt. ntroy cn b sn lso s sttisticl msur of comlity. vn
though Kolmogorov comlity is not comutbl it cn b roimtd using comrssion
bsd mthods. Comlity cn lso b stimtd from modl tht roimts th log-df of
dt s w do in this r.
Rltiv ntroy s distnc msur
Givn discrt robbility distribution Shnnons ntroy H() is dfind s
H() =
() log ().
(1.13.2)
(1.13.3)
(1.13.4)
This is not tru mtric but it cn b usd dirctly s msur of distnc or dissimilrity
btwn distributions.
Using th symmtric vrsion of KL-divrgnc (q. 1.13.4) s th ir-wis distnc btwn
two imgs is stright forwrd. It is not quit th idl distnc msur in q. 1.13.1, but it
cturs th id of stimting th comlity in th contt of nothr img.
lgorithmic comlity
Kolmogorov comlity K() of string is th lngth of shortst rogrm using givn
dscrition lngug L on univrsl Turing mchin U tht roducs th string .
K() = min{|| : U() = },
(1.13.5)
(1.13.6)
(1.13.7)
(1.13.8)
To us th NCD for msuring ir-wis distncs btwn imgs on just comrsss imgs
srtly nd conctntd nd obsrvs th diffrnc btwn th comrssion rsults.
Method that Using IC s n roimtion for ntroy
rcticl roimtion of ntroy cn b ttind by fiing som modl which
roimts th log-df. W roos hr to us this roch, in connction with th modl of
indndnt comonnt nlysis (IC), or quivlntly srs coding. Ths modls r widly
usd in sttisticl img modlling. In IC, th df is roimtd s
(1.13.9)
whr n is th dimnsion of th sc, th w r linr fturs, collctd togthr in th mtri
i
W.
(1.13.10)
1.12.11
whr C() is th covrinc mtri of th dt. This formul shows tht th scond trm in
(1.12.10) is siml function of th dt covrinc mtri. In fct, log | dt W| is mimum if
th dt covrinc hs minimum dtrminnt. minimum dtrminnt for covrinc
mtri is obtind if th vrincs r smll in gnrl, or, wht is mor intrsting for our
uross, if som of th rojctions of th dt hv vry smll vrincs. Sinc in IC, w
constrin th vrincs of th comonnts to b qul to on, only th lttr cs is ossibl.
Thus, our ntroy msur bcoms smll if th dt is concntrtd in subsc of limitd
dimnsion.
Thus, this msur of ntroy (comlity) is smll if th comonnts r vry srs, or if th
dt is concntrtd in subsc of limitd dimnsion, both of which r in lin with our
intuition of structur of multivrit dt.
rcticlitis Rmmbring
th us of IC modl.
ssuming tht w wnt to stimt th distnc btwn two imgs, w stimt th IC modl
from both imgs srtly nd combind.
Th comlity vlu tht w gt using q. 10 is normlizd in similr mnnr s th NCD in
q. 8.
In rctic th IC modl for imgs is stimtd from dt tht contins lrg numbr of
rndomly smld img tchs.[3]
0 0
+ 0
+ +
0 0
0 0
0 0
The design temperature T measures the degree of internal contrast; the density of
differentiations; the smallness of subdivisions. This is the raw amount of information contained
in each design. The least complex pattern is a uniform placement of the same symbol over the
array, so a measure of temperature is based on the number of symbols in a region the basic
criterion is that a uniform surface, which has no differentiations, should have T = 0. We define T
as the number of different elements (smallest units) minus one. The smallest units in this
example are the 0 or + symbols. T for the four 2X2 arrays takes values 0 or 1 .
T(1) = 0, T(2) = T(3) = T(4) = 1
(1.14.2)
The design harmony H is the symmetry content of each square, measured according to
the following criteria. If any individual symmetry is present, it rates a 1, otherwise a 0.
H is the sum total of all the individual contributions hi . Each hi takes values of 0 or 1, so
H in this example can take values from 0 to 6. For the four arrays 1 through 4 in Figure 1, we
have:
(1.14.3)
The life and complexity are computed from equation (1) using T from Equation (2) and H
from Equation (1.16.3).
array
(1)
(2)
(3)
Life L
Complexity C
Fig 1.14.2 Values of L and C for the four 2X2 arrays in Figure 1.
The relative numbers have meaning for comparing between different arrays. Square (4)
has the greatest degree of life L - three times that of array (2) or (3). The reader will agree that
this is our intuitive feeling for the relative degree of interest among all four arrays. Array (4) is
more interesting to most people than either (2) or (3). On the other hand, array (1) generates no
interest at all because of its uniformity, and that is reflected in its null L value.
The complexity C distinguishes only between array (1), which has no complexity, and the
other three, which have about the same degree of disorganized complexity.
These calculations reveal L to be an important quantity in visualization: it measures the
difference between organized and disorganized complexity. To the best of the author's
knowledge, this is the first time that this is done. Our measurements show that the model clearly
links what we perceive as the visual interest or "life" of a design with a numerical value that can
be computed. This is the key to our model, and underlines why we consider it so important. The
complexity C does not distinguish between what is interesting and what merely has substructure.
These points become strikingly evident in more complex examples, which we consider next.
The two types of problem reduction must interact in any computation of complexity,
since some subblocks may have the full alphabet of symbols while near groupings of cells with a
single symbol might be present. These subblocks permit a recursive formulation based on the
symbols: the entire pattern is a 3x3 array whose elements are 2x2 subblocks. A measure of the
total pattern complexity is based on this subblock reduction.
The design temperature T equals the number of different symbols minus one. These
measurements have to be done hierarchically on three different scales. First on the 6X6 level,
then on each of four 3X3 subblocks, then on each of nine 2X2 subblocks. It will be useful to
label these subblocks in terms of letters and numbers. Let the index n take values a, b, c, d,
and m run from 1 to 9. The regular 3X3 and 2X2 subdivisions of a 6X6 matrix will be denoted as
follows:
Fig. 1.14.3. Subdivisions of a 6X6 array into 3X3 and 2X2 subblocks.
For simplicity, we will ignore all other subblocks. There exist 4X4 subblocks also, and
other 3X3 and 2X2 subblocks that do not coincide with the above subdivisions. (A more accurate
count should include all possible subarrays, which are indeed picked up by the human eye when
the mind computes L and C).
For each different size, the T or H value is the sum of all the T or H values for the
different subblocks.
T(3X3) = SIGMA n = 14 Tn(3X3), T(2X2) = SIGMA m = 19 Tm(2X2)
(1.14.4)
The same goes for H. The three contributions are then combined as a weighted sum of each
different scale.
T = T(6X6) + T(3X3)/2 + T(2X2)/3
(1.14.5)
The weights are 1 for the 6X6 array, 1/2 for the 3X3 subblocks, and 1/3 for the 2X2
subblocks. These give the proper equipartition when we count the subdivisions in terms of their
width. (The significance of this weighting for 1/f scaling will be discussed in a separate paper).
Note that we are not finding the average over the number of matrices. Thus, even though there
are four 3X3 matrices, their width is 1/2 of the original 6X6 matrix, so we divide by 2 instead of
4. Similarly, the width of the 2X2 matrices is 1/3 of the original 6X6 matrix. We will use the
above combination for computing both T and H totals.
One could ignore weighting altogether, and simply add all contributions from all sizes of
submatrices. However, that would skew the numbers so that the smaller elements contribute
much more than the larger elements. Elements of different size contribute simultaneously to our
perception of the whole, so it is necessary to count them in the proper balance.
The harmony H is generalized from the previous example by including measures of
similarity at a distance. Different subblocks may interact with each other. This makes it
necessary to count translational symmetry, which did not apply when dealing with isolated 2X2
arrays. In addition to the six symmetry measures h i , i = 1,...,6 given in the previous section, we
introduce three measures of translational symmetry:
h8 = relation to another element by a translation, plus a reflection about either the x-axis
or the y-axis (a "glide reflection").
justified mathematically. Two different subblocks may be similar as they are oriented, and also
be similar after a reflection or a rotation. A subblock may be related by glide reflection to another
subblock, and by glide rotation to yet another, which counts as 2. (We do not consider each glide
rotation by different multiples of 90 separately, because that would lead to more complication
than we want in this model. Also, empirical experiments show that glide reflections about the
two diagonal axes do not provide a strong visual connection, and for that reason they are not
counted here).
The design harmony is defined as the sum of the hi, i = 1,..., 9. Each hi takes values 0 or 1,
so H for a given array (of any size) ranges from 0 to 9. As in the case of T, these computations
have to be done on three different levels, 6X6, 3X3, and 2X2 (Equation (1.16.4)), then combined
with the appropriate weights in Equation (1.16.5).
++++++
++00**
+#*#*#
++++++
++00**
#0#*#*
++++++
######
*#++*#
++++++
######
#*00#*
++++++
**0#++
*#*#+#
++++++
**+*++
#*#*#0
+0#*0#
++++++
+#**#+
*#0+**
+0000+
#0##0#
#0*#0+
+0*#0+
*#++#*
0#+0+#
+0#*0+
*#++#*
+*+#**
+0000+
#0##0#
#0+*0+
++++++
+#**#+
The computations for T and H are straightforward, and all details are given in the
Appendix. Unlike the simple 2X2 case treated earlier, this is not an exhaustive classification of
all possible 6X6 arrays with four different entries. We just pick a sample of arrays to show how
the method works in practice. The matrices chosen have very different internal structure that
illustrates various possibilities.
Before reading further, we suggest that the reader study the above matrices and rankorder them in terms of decreasing C and L. Remember that C measures the intensity of design
complications. In art, C measures the level of visual excitement, which often arises from chaotic
aspects of a design. The life L measures the degree of organized complexity in a design; the
visual interest comes from the degree to which elements interact coherently. The name "life" is
chosen because continuing to increase L mathematically brings one closer to the structure of
living organisms. This comparison is useful for the test that we propose. The reader can decide
which of the above six arrays most resembles something that could be organic, then rank-order
them in L based on this impression.
This example requires the following amended definition of C and L, instead of Equation
(1.16.1):
L = T H , C = T ( 50 - H )
(1.14.6)
array
(1)
(2)
(3)
(4)
(5
Temperature T
8.5
11.7
16.3
Harmony H
51
23
15.7
3.3
Life L
196
184
54
Complexity C
230
401
761
Table 1.14.5. Computation of T, H, L, and C for the six arrays in Figure 1.16.3.
These numbers are larger than the earlier 2X2 case, and that reflects the existence of
substructure. The amended definition of C = T(50 - H) instead of Equation (1.16.1) is necessary
so that our numbers are all positive. The number 50 is arbitrary, and is needed only as an upper
scale. (To be more precise, we should have chosen 51, the maximum H from array (1) in Table
1.16.2, but that makes little difference for the comparison).
The six arrays can be rank-ordered in terms of decreasing L and C.
Ranking in decreasing L: (6), (2), (3), (5), (4), (1).
Ranking in decreasing C: (4), (3), (5), (6), (2), (1).
These measures are in accord with our perception of pattern complexity. Most readers
will agree that these rankings correspond to what they have already concluded from direct
observation. The six examples are decreasing in complexity in the same order as our feelings. We
demonstrate here a strong correlation between the subconscious process of perception and a
simple quantitative model. Our model can be refined by incorporating more and more input, but
even at this stage, it is remarkably accurate in predicting our emotional response to a design.
Even though the design temperature of array (6) is high, the number of internal
symmetries organizes the complexity so that C is lowered and L is raised. Contrast this to the
high-T, low-H array (4) - it has very little internal organization, which raises C and lowers L.
Array (6) shows how C essentially differs from T. Could one not skip the additional
complications of measuring symmetries in this model and simply compute T as the complexity
of a design? The answer is no, because the ranking in decreasing T is 4, 6, 3, 5, 2, 1, which is
entirely distinct from the ranking in decreasing C. Without example (6), however, the two
rankings would coincide.
Superficially, arrays (3), (4) and (6) look pretty similar. Their complex substructure
emerges after looking at them analytically for some time. The more we look, the more hidden
substructure is discovered as the mind moves the symmetry links from its subconscious to its
conscious cognitive state. The point is that we perceive all of this information in an instant. By
asking ourselves which array most resembles organic forms we can make an immediate
assessment without having to analyze whether the substructure is organized or disorganized. The
human mind is evidently programmed and equipped with tools to make this judgment.[4]
1.14.
Method
1. Method based on
Entropy
2. rnc of
gry lvl
3. rnc of
dgs
Description
Advantages
comlity.
Disadvantages
functions.
Th rnc of gry lvl cn rflct th Th mor informtion in th img, th
gry lvl.
determination.
Th quntity nd th comlity of th With this method w cn tk dvntg of
trgt cn b rssd with dgs.
4. Rndomnss of
img tturs
rss th sc distribution of
gry lvl.
5. 2x2 matrices
We apply the model for complexity to the These calculations give us an important This method is used more on the
with binary
simplest example: the set of all 2X2 quantity in visualization: it measures the temperature estimation measures
entries
6. Method that
Using IC s n
roimtion
for ntroy
between
organized
smallness of subdivisions.
Ths modls r usd onli in
sttisticl img modlling.
7. Method that
Using hierarchy
to generalize to
higher
dimensions
9. Image
partitioning
This process can be interpreted in the following way: the choice of the partition
which maximizes the MI increases the
chances of guessing the intensity of a pixel
chosen randomly from the knowledge of the
region it pertains to.
1.15.
Conclusion
The many advantages of our approach include that: it shows sensitivity to habitat features
at the community level; it is inexpensive, simple and accessible to all; it allows for the
monitoring of forests at multiple scene scales in both space and time; it can provide additional
information of ecological relevance to sensor networks; it can be added to the bag of sampling
devices of most field protocols; and the use of structural complexity as an EO is practically and
theoretically attractive. The list of disadvantages includes that: it is a methodological approach in
its infancy that will require confirmation from other systems; photographic settings will have to
be fully standardized and their calibration addressed (e.g., image resolution versus extent); and
MIG estimates will need to be correlated to other measures of plant architecturesuch as canopy
closure, canopy cover and vertical structureto further interpret themechanisms beyond our
present definition of structuralcomplexity in an image. Monitoring forest dynamics at a high
resolution in spaceand time offers the possibility of discerning the ecologicalsignature of these
systems. Signature variations could provideinformation on the integrity and stability of
ecologicalprocesses, both globally and locally. The detection of localdisturbances assessed by a
change in structural complexitycould help alert ecologists and guide their actions to siteswhere
the integrity is threatened. By revisiting the same sitesweek after week one quickly realizes how
dynamic anecosystem may be. Phototropism, flooding events, springand fall phenology, growth
and senescence, flowering time,grazing and disease perturbations, falling trees, and gapdynamics
are some of the many processes that structure theforest habitat on a relatively short temporal
window. A holisticapproach capable of integrating these processes in time and space would
certainly benefit scientists and decision makers.
Bibliografie:
1. Honghai Yu, Stefan Winkler, IMAGE COMPLEXITY AND SPATIAL
INFORMATION ( ECE Department, University of Illinois at Urbana-Champaign, USA
Advanced Digital Sciences Center (ADSC), University of Illinois at Urbana-Champaign,
Singapore 2011).
http://vintage.winklerbros.net/Publications/qomex2013si.pdf
5. Richard Alan Peters , Robin N. Strickland Image Complexity Metrics for Automatic
Target Recognizers(Department of Electrical and Computer Engineering Vanderbilt
University Nashville, TN 37235, Department of Electrical and Computer Engineering
Vanderbilt University University of Arizona Tucson, AZ 85721 2008).
http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.39.5833&rep=rep1&type=pdf
6. Maurizio Cardaci Vito Di Ges`u, Maria Petrou, and Marco Elio On the Evaluation of
Images Complexity: A Fuzzy Approach(Tabacchi Universit`a di Palermo, C.I.T.C., Italy
2010).
http://www.math.unipa.it/~digesu/Time_work.pdf
9. Mario Ignacio Chacn Murgua, Alma Delia Corral Senz and Rafael Sandoval
Rodrguez Chihuahua A Fuzzy Approach on Image Complexity Measure(Institute of
Technology,
DSP
&
Vision
Laboratory
2009).
http://www.scielo.org.mx/pdf/cys/v10n3/v10n3a6.pdf
11. Roque Marin, Eva Onaindia, Alberto Bugarin Current Topics in Artificial Intelligence(
11th Conference of the Spanish Association for Artificial Intelligence 2005).
https://books.google.md/books?
id=F2rLaFtIrd0C&pg=PA347&lpg=PA347&dq=image+complexity+determination&sour
ce=bl&ots=WccGEEW40C&sig=zZVBXQmOhMBc2hNHKffgKVMk02s&hl=ru&sa=X
&ved=0ahUKEwiGqfbt4MfLAhUljXIKHYJBBPIQ6AEISTAJ#v=onepage&q=image
%20complexity%20determination&f=false
13. Agata Ciokosz-Styk1, Adam Styk Advanced image processing for maps graphical
complexity estimationInstitute of Geodesy and Cartography, Poland
http://icaci.org/files/documents/ICC_proceedings/ICC2013/_extendedAbstract/260_proce
eding.pdf
17. K C
https://ru.wikipedia.org/wiki/%D0%9A%D0%BE
%D0%BB%D0%BC%D0%BE%D0%B3%D0%BE%D1%80%D0%BE
%D0%B2%D1%81%D0%BA%D0%B0%D1%8F_%D1%81%D0%BB%D0%BE
%D0%B6%D0%BD%D0%BE%D1%81%D1%82%D1%8C
18. .. , ..
.
http://izvestia.asu.ru/2013/1-1/info-comp/TheNewsOfASU-2013-1-1-info-comp-09.pdf
19. . . , . .
( . . .
, , 15.11.2012).
http://www.jip.ru/2012/362-371-2012.pdf
20. . . , . .
( , .
2013).
http://jre.cplire.ru/iso/feb13/9/text.html
http://www.fesmu.ru/elib/Article.aspx?id=204188
22.
https://habrahabr.ru/post/229757/
23. ., . . CGM
Journal, 2003.
24. Gupta D. Computer Gesture Recognition: Using the Constellation method.// Caltech
undergraduate Research Journal, 2001, vol.1, 1. pp. 26-31.
25. Vezhnevets V. Face and facial feature tracking for natural Human-Computer Interface.//
GraphiCon 2002.
27. Song Qinghuan, Chen Zhongbia Sun Shaohuac, Hu Jinlonga, Cheng Huaa, Institute
of Optics and Electronics, Chinese Academy of Sciences, Chengdu, Sichuan A scene
recognition method based on image complexity China University of Chinese Academy
of Sciences, Beijing , China
http://spie.org/Publications/Proceedings/Paper/10.1117/12.2069491