Sunteți pe pagina 1din 4

Proceedings of the 2013 International Conference on

Pattern Recognition, Informatics and Mobile Engineering, February 21-22

Instance and Value (IVH) Algorithm and Dodging


Dependency for Scheduling Multiple Instances in
Hybrid Cloud Computing
B. Arun Kumar T. Ravichandran
Karpagam University Hindusthan Institute of Technology
Coimbatore, India Coimbatore, India
arunkumar.oct06@gmail.com dr.t.ravichandran@gmail.com

Abstract- Cloud computing is designed such a way to avoid over- service without the procurement of new permanent resources.
provisioning when used with utility pricing. It also removes the Thus, Cloud Computing is simply IT services sold and
need to over-provision in order to meet the demands of users. It delivered over the Internet.
involves multitenancy and multitasks, i.e., many customers can Cloud offers three types of services SaaS (Software as a
perform different tasks, accessing a single or multiple application
Service) provides all the functions of a sophisticated
instances. Sharing resources among a large pool of users assists in
reducing infrastructure costs and peak load capacity. Due to the traditional application [6][7][10] to many customers, often
raise in convention of many applications currently, there is thousands of users, but through a Web browser, not a “locally-
necessitating for high processing and storage capacity along with installed” application. It eliminates customer suspicions about
the consideration of cost and instance use. To provide proficient application servers, storage, application development and
resources, Cloud computing is been pioneered. Many related, common concerns of IT. Highest-profile examples are
organizations have their own private cloud, but when there is Yahoo and Google, and VoIP from Vonage and Skype.
need for extra resources they go for public cloud where they have PaaS (Platform as a Service) delivers virtualized servers
been outlaid for their use. In such “pay-per-use”, workflow on which customers can run existing applications or develop
execution cost must be considered during scheduling based on
new ones without having to worry about maintaining the
users’ QoS constraints. It is an algorithm that calculates an
optimal value and instance for it. Using simulation, we have operating systems, server hardware, load balancing or
compared the performance of our proposed approach with the computing capacity. These vendors provide APIs or
existing scheduling strategies for different type and size of development platforms to create and run applications in the
workflows. The IVH algorithm comes to the decision of desiring cloud – e.g. using the Internet. IaaS (Infrastructure as a
which resource should be chartered from public providers. Service) delivers utility computing capability, typically as raw
virtual servers, on demand that customers configure and
Keywords - Multiple workflows, IVH algorithm, Dependency, manage. IaaS is designed to replace the functions of an entire
Hybrid cloud. data center. This saves cost (time and expense) of capital
equipment deployment but does not reduce the cost involved
I. INTRODUCTION in configuration, integration or management and these tasks
must be performed remotely.
Virtual machines [4] have become the prevalent Apart from these we have the following Cloud computing
abstraction and unit of deployment because they are the least- infrastructure models: The On-demand services are managed
common denominator interface between service providers and by third party resources provider, and the applications
developers. Using virtual machines as object tool, it is requested from different customers are liable to be mixed
adequate for 80 percent of application usage, and it helps to together on the cloud’s servers, storage systems, and networks.
satisfy the user need to rapidly deploy and scale applications. If a public cloud is employed with performance, security, and
[5] Virtual appliances, virtual machines that include software data locality in mind, the existence of other applications
that is moderately or fully configured to perform a specific running in the cloud should be transparent to both cloud
task such as a Web or database server, further develop the architects and end users.
ability to create and deploy applications in vertical domains. The organization permanent infrastructure can also be
The combination of virtual machines along with meant as private cloud, it was built for the limited use of one
appliances as standard deployment objects is one of the key client, providing the utmost control over data, security, and
features of cloud computing [9]. Essentially, IT resources are Quality of Service. The company owns the infrastructure and
rented and shared among multiple leaseholders much as office has control over how applications are deployed on it. These
space, apartments, or storage spaces used by leaseholder. types of clouds can be built and managed by a company’s own
Distributed over an Internet connection, the “cloud” replaces IT organization or by a cloud provider. Hybrid cloud combines
the company data center or server by providing the same both the public and private cloud models to attain elasticity at

978-1-4673-5845-3/13/$31.00©2013 IEEE
2013 International Conference on Pattern Recognition, Informatics and Mobile Engineering (PRIME) 409

the user level. They can help to provide on-demand services, presented. But this existing algorithm does not provide an
externally provisioned scale. The ability to augment a private optimal solution to the scheme.
cloud with the resources of a public cloud can be used to Although cloud computing is playing a major role
maintain service levels in the face of rapid workload currently many problem exists one among them is, which
fluctuations. Sometimes called “surge computing,” a public resource to select based on Instance and Value. So the
cloud can be used to perform periodic tasks that can be proposed IVH algorithm is designed such a way that it
installed easily on a public cloud. A hybrid cloud is designed provides an optimized scheduling to the public cloud.
by carefully determining the best split between public and
private cloud components. One of the problems to face is to III. IVH OPTIMIZATION ALGORITHM
determine when and how to split a workflow, which is
composed of dependent tasks, to execute in private re- sources The scheduler’s intention is clear-cut: minimize the
and in public resources. In this paper, we propose a case study makespan without creating too much communication overhead
in scheduling multiple workflows to the Cloud Computing between multiple workflows. We have developed an approach
paradigm by verifying that only the task that has a dependency to dynamically schedule multiple workflows, verifying the
value within the range specified. dependency range and finally trying to evaluate a minimized
instance and value.
II. RELATED WORK The proposed algorithm makes an initial schedule [2] that
is based on the communication cost, priority and scheduled
The stratagem aims at determining which task should use cost. This initial schedule considers only the private resources
paid resources and what kind of resource should be requested and check if they satisfy the desired deadline. If the deadline is
to the cloud in order to minimize costs and execution time and not satisfied, the algorithm starts the process of deciding which
meet deadlines. Both heuristic and meta-heuristic strategies are resources it will request to the public cloud. This decision is
analyzed among various scheduling algorithms. Linear based on performance, value, and the number of tasks to be
programming is a general modus operandi to tackle such an scheduled in the public cloud.
optimization problem. Bossche et.al therefore analyze and The two main steps of the algorithm are the selection of
propose a binary integer program formulation of the tasks to reschedule and the selection of re-sources from the
scheduling problem and evaluate the computational costs of public cloud to compose the hybrid cloud. While the former
this technique with respect to the problem's key parameters. decides which tasks can have their execution time reduced by
He found out that this approach results in a tractable solution using more powerful resources from the public cloud, the latter
for scheduling applications in the public cloud Van den determines the performance and value involved in the new
Bossche (2010), but that the same method becomes much less schedule.
feasible in a hybrid cloud setting due to very high solve time
variances. Workflow IF
[1] developed an Ad-Hoc method that helps in the job Dependency De=0 to 5
assignment and reassignment [2][3]. Path Clustering Heuristic, Deadline Z
and introduces the concept of rounds, which take turns sending Resource SH
tasks to execution and evaluating the performance of the Stipulated Start Time SST
resources. Stipulated Finish Time SFT
Yu-Kwong Kwok has made a pair-wise comparison Public resource pool M
among seven scheduling algorithm under various conditions. Private Resource Pool H
But the drawback of this algorithm is that it has a set of several Priority Pi
procedures that takes too much time to compile. Pending task PT
A hybrid heuristic scheduling algorithm was implemented Node Ni
on heterogeneous system that comprised of three phases. The Task Time Remaining TTR
key idea of the hybrid heuristic is to use a standard list Node set ST
scheduling approach to rank the nodes of the DAG and then Instance and value IV
use this ranking to assign tasks to groups of tasks that can be
subsequently scheduled independently. [10]. Haluk Topcuoglu 1. IF=Workflows==set of tasks TS==single task T
has provided two performance-effective and low complexity 2. Perform initial schedule
task scheduling algorithms namely HEFT and CPOP 3. Dependency De=0-5
algorithms for heterogeneous system. 4. For each IF in TW
Edwin.S.H.Hou has developed a genetic algorithm for For each T in TS do
multiprocessor scheduling Hou, [6]. The algorithm is based on If T < De Do
the precedence relations between the tasks in the task graph. 5. If SH Є H then
He has compared the genetic algorithm with the list scheduling 6. Schedule T in H
and optimal schedule using random task graphs and a robot 7. While (M > Z && iteration =T) do
inverse dynamics computational task graphs for various are 8. Select node from ST with ↑Pi
2013 International Conference on Pattern Recognition, Informatics and Mobile Engineering (PRIME) 410

9. If T Э ST then For each T in TS do


10. Add T to ST If T < De Do
11. Iteration=iteration+1 21. Select H Є S then
12. End while 22. Calculate CT for new H
13. else select next task from TS 23. If CT < ( H Є J ) then
14. else select next workflow from PT 24. Add SH to M
15. Schedule the SH with ↓ SFT 25. else select next task from TS
16. Else 26. else select next workflow from WT
17. While Request for SH in M 27. Schedule SH with ↓ SFT
18. If SFT > ATR then 28. Else
19. Queue WT to execute 29. Continue Request in H
20. For each W in TW 30. End while

TABLE 1. COMPARISON TABLE

NUMBER CSH IVH


OF
TASKS TIME COST DEPENDENCY EFFICIENCY TIME COST DEPENDENCY EFFICIENCY
RATIO RATIO
50 70% 73% 78% 70% 92% 85% 89% 89%
75 67% 63% 54% 60% 93% 88% 79% 88%
100 59% 53% 45% 53% 92% 88% 80% 87%
150 47% 53% 33% 45% 96% 89% 82% 89%
200 37% 30% 28% 32% 99% 96% 88% 95%

Workflow consists of set of task; in this paper the algorithm verifies whether the predetermined finish time SFT
algorithm consider a set of workflows that comprises of a is greater than TTR, then queue the tasks to execute. In line 22
number of instances, which are dependent on one another. The calculate the cost and time value for the new resource to
main task is to reduce the dependency. We have assigned a allocate. Finally allocate the resource with the lowest SFT.
range for dependency for instance: dependency De value is Finally schedule the resource with the lowest SST. Once
between 0 - 5. the resources are not enough in the private cloud the request is
The First line of the algorithm initializes the set of done to the public cloud. If the evaluated SFT is greater than
workflows to a variable IF. The third line performs initial the TTR then the waiting tasks are put in the queue to execute.
schedule which considers only the Private resource pool and Next select the resource form the public cloud and evaluate the
schedule these workflow in the Private resource pool itself new IV for new resource allocation. Once the value of IV is
based on some attributes like communication cost, Priority and less than the available resource in the private cloud then only
instance. Initial scheduling involves evaluating the cost and the public cloud is requested. Since the IV value is considered
time value for each workflow based on some parameters like to be less than the old IV the resource is added to the set S.
communication cost, Priority; cost for each resource allocation Now schedule the resource with the lowest SFT, suppose the
is done. IV value is larger than verify inside the private cloud itself.
Basically every workflow has a set of tasks which are less Finally allocate the resource with the lowest SFT.
or more dependent on each other. The main task is to verify
that each task must have the defined dependency range only. IV. SIMULATION RESULTS
The fourth line checks the range and once if the dependency
value is less than the range, the allocation or request to the We have made a case study and provided a comparison
resource is done else it is not. Next the algorithm checks table between the existing CSH algorithm and IVH algorithm.
whether the available resources are enough or not. If it is We have made a comparison that involves a number of tasks.
sufficient enough to finish the job, the workflow is requested As shown in Table no: 1 as though the number of tasks
in the private cloud itself else it is requested in the public increases we get a clear idea that CSH algorithm’s efficiency
cloud. is decreasing, but even though the task is increased the
Once scheduled the workflows in the private cloud, until efficiency value is increased in a manner compared to CSH.
the deadline is met the task is running inside the private cloud. The IVH algorithm is far better evolved to have a better way
The iteration is repeated until the deadline Z is met, where the of using this algorithm in order to reduce the instance and
algorithm continues by selecting a node Ni from the node set value by reducing the dependency among the number of tasks.
ST with the highest priority. Once the iteration is over it
checks in the public resource pool H. The line18 in the
2013 International Conference on Pattern Recognition, Informatics and Mobile Engineering (PRIME) 411

V. CONCLUSION REFERENCES

Hybrid clouds are being used to execute different kinds of [1] Amir, Y., Awerbuch, B., Barak, A., Borgstrom, R.S.,Keren, “An
applications. Among them, workflows have an important role opportunity cost approach for job assignment in a scalable
computing cluster,” IEEE Transaction on Parallel and
in processes of many fundamental science fields, such as Distributed Systems 11(7), pp. 760–768, 2000.
Physics, Chemistry, Biology, and Computer Science. To [2] Bittencourt, L.F., Madeira, E.R.M., “A performance oriented
speedup science advancements, it is important to provide adaptive scheduler for dependent tasks on grids,” Concurrency
efficient resource utilization as well as to make application and Computation: Practice and Experience 20(9), pp. 1029–
executions affordable. In this paper we present IVH: The 1049, 2008.
Instance and Value optimized scheduling algorithm for Hybrid [3] Bittencourt, L.F., Madeira, E.R.M, “Towards the scheduling of
multiple workflows on computational grids,” Journal of Grid
Cloud. IVH is an algorithm to speed up the execution of Computing 8, pp. 419–441, 2010.
multiple workflows obeying a desired execution time, but also [4] M. Cardosa, M.R. Korupolu, and A. Singh (2009), “Shares and
reducing costs when compared to the CSH approach. Utilities Based Power Consolidation in Virtualized Server
The extensive evaluation carried out in this work Environments,” Proc. IFIP/IEEE 11th Int’l Conf. Symp.
provides sufficient data to support the conclusion that the IVH Integrated Network Management (IM ’09).
algorithm can provide efficient scheduling in a hybrid cloud [5] L. Grit, D. Irwin, A. Yumerefendi, and J. Chase, “Virtual
Machine Hosting for Networked Clusters: Building the
scenario with a low dependency ration between the tasks. Its Foundations for Autonomic Orchestration,” Proc. IEEE Int’l
multicore awareness, along with the cost knowledge, can Workshop Virtualization Technology in Distributed Computing,
provide makespans as low as the user needs. As a 2006.
consequence, the user is able to control costs by adjusting the [6] Hou, E.S.H., Ansari, N., Ren, H, A genetic algorithm for
multiprocessor scheduling. IEEE Transactions on Parallel and
desired workflow execution time Z, if the dependency ratio Distributed Systems 5(2), pp. 113–120, 1994.
among the workflow is less. In general, the proposed [7] F. Hermenier, X. Lorca, and J.-M. Menaud, “Entropy: A
algorithm has the ability of reducing the execution costs and Consolidation Manager for Clusters,” Proc. ACM SIGPLAN/
time in the public cloud with the increase of the workflow SIGOPS Int’l Conf. Virtual Execution Environments (VEE ’09),
desired execution time. Besides that, in some cases where the 2009.
desired execution time is too low, IVH finds better schedules [8] Kwok, Y.K., Ahmad, I, “Dynamic critical-path scheduling: An
effective technique for allocating task graphs to
than the CSH approach by taking advantage of multicore multiprocessors,” IEEE Transactions on Parallel and Distributed
resources, reducing the number of violations of Z. Systems 7(5), pp. 506–521, 1996.
[9] “The NIST definition of cloud computing 15”,National Institute
of Standards and Technology (NIST), Tech. Rep., July 2009.
[Online]. Available: http://csrc.nist.gov/groups/SNS/cloud-
computing/cloud-def-v15.doc.
[10] Rahman, M., Venugopal, S., Buyya, R, A dynamic critical path
algorithm for scheduling scientific workflow applications on
global grids. In: Third IEEE International Conference on e-
Science and Grid Computing, pp. 35–42. 2007.

S-ar putea să vă placă și