Documente Academic
Documente Profesional
Documente Cultură
RESEARCH ARTICLE
OPEN ACCESS
----------------------------------------************************----------------------------------
Abstract:
The Virtual Design Advisor (VDA) has addressed the problem of optimizing the performance of Database
Management System (DBMS) instances running on virtual machines that share a common physical
machine pool. In this work, the search algorithm in the optimization module of the VDA is improved. An
Exhaustive Greedy algorithm (EG) studies the effectiveness of tuning the allocation of the shared
resources (the share values); and presents a mathematical analysis of the effect of the share values on
reaching an optimal solution. Also, it studies the effect of the share values of resources on the feasibility
and speed of reaching an optimal solution. On the other hand, the particle swarm optimization (PSO)
heuristic is used as a controller of the greedy heuristic algorithm to reduce trapping into local optima. Our
proposed algorithm, called Greedy Particle Swarm Optimization (GPSO), was evaluated using prototype
experiments on TPC-H benchmark queries against PostgreSQL instances in Xen virtualization
environment. Our results show that the GPSO algorithm required more computation but in many test cases
succeeded to escape local optima and reduce the cost as compared to the greedy algorithm alone. Also, the
EG search algorithm was faster than the GPSO algorithm when the search space of the share values grows.
Keywords: - Virtualization, Resource Allocation, Particle Swarm Optimization, Greedy Search,
Query Optimizer.
----------------------------------------************************---------------------------------I. INTRODUCTION
Cloud computing allows users to use
computational resources and services of data
centers (i.e., machines, network, storage,
operating systems, application development
environments, application programs) over the
network to deploy and develop their applications
[1]. The main feature of cloud computing is
providing self-service provisioning, which allows
the users to deploy their own sets of computing
resources [2].
Cloud computing relies on virtualization to
partition and multiplex physical machine
infrastructure
[4].
A
virtual
machine
configuration or resource allocation controls the
sharing of physical resources allocated to VMs.
The problem of optimizing the performance of
the virtualized applications (i.e., the applications
that run on VMs) is critical to the success of
ISSN :2394-2231
http://www.ijctjournal.org
Page 128
ISSN :2394-2231
http://www.ijctjournal.org
Page 129
ISSN :2394-2231
http://www.ijctjournal.org
Page 130
(5)
(2)
(3)
(4)
ISSN :2394-2231
3) Calibration
In VDA, calibration is a process to map each
resource allocation into a corresponding set of
values of the query optimizer's tuning parameters.
This process uses a calibration model that is
constructed as a set of calibration equations [7, 8].
The query optimizer becomes aware of the
virtualized environment it runs in, and chooses an
optimal execution plan by estimating and
comparing the costs of a set of plans based on the
given resource allocation [8].
C. Optimization Problem in VDA
The search algorithm in the Virtual Design
Advisor uses the calibration process to enumerate
configurations for the VMs. The search
algorithms use the what-if mode of the query
http://www.ijctjournal.org
Page 131
ISSN :2394-2231
10
1.020E+14
1.019E+14
1.018E+14
1.017E+14
Time in Seconds
Estimated Cost
1.016E+14
10
11
Share Values
Estimated Cost
Time
http://www.ijctjournal.org
Page 132
+_
' # _!
')
& = GCD % , % , . . , %0
(6)
62
(7)
58
+_
CPU Allocations %
% = ) %&'_!
54
50
46
42
38
0
% #
+_
ISSN :2394-2231
5 6 7
Share Values
-Exhaustive-Config1VM
-Optimal-Config1VM
10 11 12
-Exhaustive-Config2VM
-Optimal-Config2VM
Default-Config
http://www.ijctjournal.org
Page 133
20
1.0172475E+14
16
Estimated Cost
1.0172477E+14
1.0172473E+14
12
1.0172471E+14
1.0172469E+14
1.0172467E+14
0
0
200
400
600
Search Space
800
1000
1200
Fig. 4 Different Search Spaces for Exhaustive Greedy for two workloads
V.
ISSN :2394-2231
Time in minutes
C @
+1 =C@
+?@
+1
(9)
http://www.ijctjournal.org
Page 134
value
Estimated cost
Greedy Heuristic
ISSN :2394-2231
&>0
& < 100
http://www.ijctjournal.org
Page 135
ISSN :2394-2231
OPQRSTU VOPQRWXY
OPQRSTU
+ BJ 0
(10)
?& &
^_`Pabccde V^_`Pafgh
^_`Pabccde
(11)
http://www.ijctjournal.org
Page 136
JjR_kQJQ0P
= Kkl
Rm0P JQ
(12)
oafgh Voabccde
oafgh
(13)
8.62E+13
ISSN :2394-2231
Estimated Cost
( Sequential Page Fetches)
8.57E+13
8.52E+13
8.47E+13
8.42E+13
8.37E+13
8.32E+13
8.27E+13
swarms 10
swarms 20
swarms 30
swarms 40
swarms 50
swarms 100
Fig. 6 Effect of swarm size on total estimated cost for two VMs.
Normalized cost improvement
+ i&%_
0.012
0.010
0.008
0.006
0.004
0.002
0.000
10
30
50
Swarm Size
70
90
http://www.ijctjournal.org
Page 137
.8E+10
.7E+10
1.0E+16
Estimated Cost
Estimated Cost
.6E+10
8.0E+15
.5E+10
6.0E+15
.4E+10
.3E+10
4.0E+15
.2E+10
2.0E+15
.1E+10
.0E+00
0.0E+00
2
10
2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
Number of Workloads
Default
Number of Workloads
(5%Greedy (
GPSO
(5%Greedy(
Default
GPSO
ISSN :2394-2231
%16
%14
1.0E+016
%12
Estimated Cost
8.0E+015
%10
6.0E+015
%8
%6
4.0E+015
%4
2.0E+015
%2
0.0E+000
%0
2
GPSO
9 10 11 12 13 14 15 16 17 18 19
Number of Workloads
( 5%Greedy(
http://www.ijctjournal.org
Page 138
%60
3E+15
%50
CPU Allocation
2E+15
%40
2E+15
%30
1E+15
CPU-intensive (W8)
%20
5E+14
%10
%70
0E+00
2
10
5E+15
1w
4w
7w
10w
2w
5w
8w
Estimated Cost
Estimated Cost
Number of Workloads
3w
6w
9w
4E+15
3E+15
2E+15
1E+15
%100.0
7000
%99.9
Runtime (seconds)
8000
0E+00
2
6000
%99.8
5000
%99.7
4000
%99.6
3000
%99.5
2000
%99.4
1000
%99.3
7E+15
%99.2
6E+15
0
3
10
11
12
13
14
15 16 17 18 19
GPSO Runtime
15
Exhaustive Greedy
GPSO Cost
5E+15
Number of Workloads
Greedy Runtime
(5%Greedy Cost (
10
No. of Workloads
overhead
Estimated Cost
Default
4E+15
3E+15
2E+15
0E+00
2
Default
(5%Greedy(
10
No of Workloads
15
Exhaustive Greedy
GPSO
1) Estimated Cost
In this experiment, random TPC-H workloads
were used. Fifteen queries were generated using
the same method described in 3). The EG
algorithm was compared to the default equal
ISSN :2394-2231
1E+15
2) Running Time
Figure 15 shows the runtime overhead of the
compared algorithms. It shows that the EG
algorithm was faster than the GPSO algorithm.
http://www.ijctjournal.org
Page 139
Time in minute
60
50
40
30
20
10
0
0
10
15
20
NO of Workloads
(5%Greedy(
Exhaustive Greedy
GPSO
20
1.0172475E+14
16
Estimated Cost
1.0172477E+14
1.0172473E+14
12
1.0172471E+14
1.0172469E+14
Time in minutes
[1]
1.0172467E+14
0
0
200
400
600
800
1000
1200
[2]
Search Space
Exhaustive Greedy Cost
GPSO Cost
GPSO Time
Fig. 16 The runnimg time of the exhaustive greedy algorithm grows much
faster than GPSO with increasing search space size.
ISSN :2394-2231
[3]
[4]
[5]
L. C. Qi Zhang, Raouf Boutaba, "Cloud computing: state-of-theart and research challenges " Journal of Internet Services and
Applications, vol. 1, No. 1, pp. 7-18, May 2010.
A. A. Soror, U. F. Minhas, A. Aboulnaga, K. Salem, P.
Kokosielis, and S. Kamath, "Deploying Database Appliances in
the Cloud.," IEEE Data Eng. Bull., vol. 32, No. 1, pp. 13-20,
2009.
B. He, W. Fang, Q. Luo, N. K. Govindaraju, and T. Wang,
"Mars: a MapReduce framework on graphics processors," in
Proceedings of the 17th international conference on Parallel
architectures and compilation techniques, 2008, pp. 260-269.
P. Barham, B. Dragovic, K. Fraser, S. Hand, T. Harris, A. Ho, R.
Neugebauer, I. Pratt, and A. Warfield, "Xen and the art of
virtualization," in Proceedings of the nineteenth ACM symposium
on Operating systems principles, Bolton Landing, NY, USA,
2003, pp. 164-177.
C. Computing. (2010). Handbook of Cloud Computing.
Available: http://www.springerlink.com/index/10.1007/978-14419-6524-0
http://www.ijctjournal.org
Page 140
[7]
[8]
[9]
[10]
[11]
[12]
[13]
[14]
[15]
[16]
[17]
[18]
[19]
[20]
[21]
[22]
[23]
[24]
ISSN :2394-2231
[25]
[26]
[27]
[28]
[29]
[30]
[31]
[32]
[33]
[34]
[35]
[36]
[37]
http://www.ijctjournal.org
Page 141