Documente Academic
Documente Profesional
Documente Cultură
Keywords— DSS query, Query Optimization, I/O costs, Communication Costs etc.
I. INTRODUCTION optimal results in minimum time. The growth of the
chromosome is restricted to generate better generations. The
A DSS query is convoluted and data intensive
major objective of this study is to determine the effect of
query. The execution of DSS query demands momentous
varying I/O to communication costs over Total Costs and
amount of I/O, processing and communication resources.
Communication Costs of a distributed DSS query.
The sum of I/O, processing and communication costs
Design and statistics of 3-join DSS query is
represent Total Costs of the query. Total Costs represents the
represented in second section. Third and fourth section
amount of different resources required to execute the query.
explained the experimental setup and design of entropy and
Due to convoluted nature and significant requirement of
restricted chromosome based DSS query optimizer
different resources, DSS query should be optimized before
(EGA_QO). Fifth and sixth section elucidated the
its execution [1][2]. Query optimization is a method of
assumption and the effect of varying input output costs
selecting the preeminent query execution plot as per
respectively. Conclusion is framed in seventh section.
optimization function.The query can be optimized by using
Finally, references are mentioned in eight section of this
diverse deterministic and stochastic techniques. Moreover, a
manuscript.
query can be optimized by changing the order of sub
operations, or by altering the location where sub operations
II. DESIGN AND STATISTICS OF 3-JOIN DSS QUERY
would be executed. In addition to, one can optimize the
query by executing it with different algorithmic approaches Initially, a 3-Join DSS query has been considered for
[3][4][5]. The distributed queries can be optimized by analysis. The statistics of the query are given below [8][9]:
abating either the Total Costs or Response Time of a query.
Total Costs are optimized for increasing the throughput of Total Number of Operations : 11
the system and Response Time is optimized to speed up the Total Number of Intermediate Fragments : 14
execution process of a query. In this research work, the focus Number of Selection Operations : 04
is given on increasing the throughput of DSS query optimizer Number of Projection Operations : 04
[6][7]. Number of Joins Operations : 03
To optimize and analyse a 3-Join DSS query, a Number of Base Relations : 04
hybrid model called entropy and restricted chromosome Number of Sites : 04
based DSS query optimizer has been used [8][9][10]. The Figure 1 represents the tree diagram for the 3-Join
query optimizer has developed using the features of DSS query. Each node and edge represents the sub operation
information theory and GA. The optimizer is designed to and the fragment of the query respectively. Sub operations,
abate the resource ingestion. The use of GA assists in finding
fragments and base relations are represented as On, Fn and O5, O6, O7, O8 : projection operations
Bn respectively. Here, O9, O10, O11 : Join operations
O1, O2, O3, O4: Selection operations
drastic change in the ‗Communication Costs‘ of a query. The ratio on the ‗Communication Costs‘ of the ‗3-Join DSS‘
‗Communication Costs‘ is reduced to almost half of its value query is presented in Figure 3. When the ratio was varied
when the ratio was varied from 1:1.6 to 1:1. The effect of from 1:1.6 to 1:1, the ‗Communication Costs‘ of the query is
varying input output to communication costs coefficients reduced by 90%.
Communication Costs
5000
0
1:1.6 1:1.5 1:1.4 1:1.2 1:1
The following Figure 4 represents the values of the 1:1.6 to 1:1. Consequently, Total Costs of the query was
Total Costs of the ‗3-Join DSS‘ query when the ratio of input reduced by 2%.
output to communication costs coefficients is varied from
1655000
1650000
1645000
1639410 1638810
1640000 1637010
1635410
Total Cost
1635000
1630000
1625000
1:1.6 1:1.5 1:1.4 1:1.2 1:1
Input Output Costs to Communication Costs Ratio