Documente Academic
Documente Profesional
Documente Cultură
LOCAL GOVERNMENT
PLANNING AND
DEVELOPMENT
Edited by
00 Muhamud Wambede Nabalegwa
Jamil Serwanga, Haroonah Nsubuga,
and Abbas Mohammed Omar
DATA MANAGEMENT FOR
LOCAL GOVERNMENT
PLANNING AND
DEVELOPMENT
Edited by
Jamil Serwanga, Haroonah Nsubuga, Muhamud Wambede Nabalegwa
and Abbas Mohammed Omar
ISBN: 9789970937257
© Copyright 2018 by Jamil Serwanga, Haroonah Nsubuga, Muhamud
Wambede Nabalegwa and Abbas Mohammed Omar
First Print 2018
All rights reserved. No part of this publication may be reproduced in
any language, stored in any retrieval system or transmitted in any form
or by any means, electronic, mechanical, photocopying, recording, or
otherwise, without prior written permission of the copyright owners.
Published by Kyeyune Publishers
Kampala, Uganda
Reg. No. 212001
Email: kyeyupu@gmail.com
Printed in Uganda
i
PREFACE
On several occasions, the world has been wondering and henceforth
asking why Data Management for Local Government and Development?
This book endeavours as much as possible to provide sound and
concrete responses to these queries that are floated frequently the
world over. The queries may be justified simply because a good many
people are utilising and employing data in their day today work but have
never bothered to find out reasons for using and protecting data. What
is surprising is that, whether they know or they do not, they execute
their routine work and at the end of the day, they register some success
and failures.
However, due to the need to execute work professionally and with a lot
of accuracy and acumen, it is no longer acceptable that people should
continue guessing what they are doing and why they are doing what
they are doing. Time for guess work is over in this century. People at
all levels; (local government authorities inclusive) must use and protect
data professionally and effectively to forge the pace and the accuracy
desired for excellent planning for the growth and development of their
communities and organisations.
Whether the organisations they are working for are big or small,
international or domestic, public or private, they ought to address the
ever growing need for general information at all levels of management
and administration. This is so because they would be enmeshed in a
situation they may not be able to control if they went wrong in regard to
data usage, retrieval for better and informed decision making especially
at local government levels.
All the chapters of this book have been so much simplified that the
reader is effectively guided on how to enhance his/her understanding
and competencies required of him/her for effective management of
data for purposes of realising development at local government levels.
Our interest in publishing this book stemmed from practical experiences
of local government authorities who are really trying to devise means
and ways of enabling their local government systems graduate from
the vicious cycle of both poverty and poor management as they do not
have well aligned data on critical issues like adolescent reproductive
health, population density in their areas of jurisdiction, school going
children and school drop outs, quality of health care in place, quality and
quantity of existing infrastructure desired to support their development
goals, plans, efforts and agenda. Availability of this critical data enables
them to move to a level of utilising data on key socio-economic aspects
in their areas of jurisdiction.
We are therefore optimistic that this book will be found handy for
iii
local government authorities at Sub County, parish, village, district,
Population Secretariat, regional and also national levels on the one
hand, Central Government and Development Partners like Rockefeller
Foundation, DANIDA, UNFPA, UNICEF, UNDP, World Bank, African
Development Bank and many others on the other hand.
Jamil Serwanga
Haroonah Nsubuga
Muhamud Wambede Nabalegwa
Abbas Mohammed Omar
iv
CONTENTS
PREFACE ............................................................................. ii
ACKNOWLEDGMENT .......................................................... vi
v
ACKNOWLEDGMENT
We, the Editors of this book are strongly and fully convinced that putting
together ideas that culminate into a production of this nature is the effort
of many people and organisations. It would therefore bear numerous
loopholes that may be so glaring that its stature and value may be lost
without the genuine support and input of a third party. This piece of
work therefore is not unique because it has been beefed up by several
people and organisations that willingly and zealously accepted to share
their views and experiences with us either directly or indirectly. To all
of them, we register our sincere thanks and appreciation. Indeed the
list is long but the following people and organisations are specifically
mentioned here due to their unique and immense contribution to the
production of this book.
The World Bank; the donors of the Research Fund we utilised. The
management of Makerere University (the host of I@mak.com);
the organisers and managers of this project, the Islamic University
in Uganda and Kyambogo University; the employers of the main
researchers, for the continued advise, encouragement and support
to initiatives and efforts geared toward knowledge creation through
research and publication.
Despite the fact that this book is of immense value to many people who
have become stakeholders in its production, we are fully aware of its
weaknesses that are often typical of editors and authors. We bear the
vii
full responsibility thereof alone.
Editors:
Jamil Serwanga
Haroonah Nsubuga
Muhamud Wambede Nabalegwa
Abbas Mohammed Omar
viii
CHAPTER ONE
BY
Jamil Serwanga1
Haroonah Nsubuga2
Muhamud Nabalegwa3
It should be noted that, even though there are divergent views about
the meaning of the term planning, it is agreed upon by most economists
who have agreed that, the term would imply a deliberate control and
direction of the economy by the central authority for the purpose of
achieving definite targets and objectives within a specified period of
time (Jhingan, 2011).
With reference to the above observation, it has been argued that the
agenda for planning for development is necessary if poverty is to be
minimized in nations, if national and per capita income is to be raised, if
inequalities in income and wealth are to be minimized and if employment
opportunities are to be generated (Jhingan, 2011). In order to achieve
these targets and objectives, planning is highly emphasized and correct
data must be available to achieve the same. It has been argued that
in the process of development planning, the local community has to
3
be fully involved. This is derived from the observation that “Not only
is local; needs-based, this is to say, demand driven development
completely accepted as permitting the most effective, efficient use to
be made of scarce resources to satisfy actual needs, but it has come
to be given a role in its own right. It is now recognized that it is only
at the local level that mobilization of the common sense of purpose,
confidence, creativity and energies necessary to embark successfully
on the development can take place” (Kanyhama, 1998:3). A similar idea
is maintained by Blakely and Bradshaw (2003).
It stands out clearly that for sound economic planning and management
of resources and projects for the development of a country, a survey of
existing potentials of a country should be done. Through the surveys,
the deficiencies of the economy would also be identified. Also, from the
surveys, statistical data and information would be obtained regarding
the total available human and non-human resources of the country.
Data regarding the economic variables of the country for example,
natural resources, along with their degree of utilization, agricultural
and industrial output, and transport facilities among others are crucial
4
for fixing national development targets and priorities in the national
development plans.
Governments at all levels are usually confronted with the need to collect
and analyse data from the various sectors of the economy. The data
collected would be used by various economic development agents,
local governments inclusive, to guide local and national development
policy decisions (Amankwah-Amoah, 2015). Therefore, it can be argued
here that with reliable data, development plans can be formulated and
managed in the most effective way for the realization of the set targets
and objectives. Similarly, where unreliable data is available, unrealistic
4 See Josh (2016)
5
development targets are to be set and the attainment of their targets
and objectives would be in vain. This scenario is prevalent in least
developed countries of the world.
7 David F. Mpanga (2013), “Numbers and Statistics can be very deceitful”, posted
on Sunday, July 28, 2013; The New Vision News Paper, Kampala Uganda
7
The reason identified for this occurrence is the absence of institutional
capacity to verify to validate the data collected and presented. This is
a pointer to the quality of data against which development investment
decisions should be based upon. It is true that for the success of
development planning quality – meaningful data is highly needed for
the validation of development hypotheses.
Given the challenges related to the quality of data, there are a numbers
of ways through which quality data can be generated for planning
purposes at the lower government levels. These methods have been
tried in some parts of the world and include Participatory Rural Appraisal
(PRA) and Geographic Information System (GIS) (Turyatunga, 2004).
(ii) To enhance the use of reliable and accurate data among communities
and households in the production processes for efficient utilization of
resources;
During the Financial year 2010/2011, ten (10) out of the 28 districts were
subdivided into new districts creating a total of 16 new districts. Also in
this FY, five (5) new districts namely; Kanungu, Arua, Abim, Kayunga
and Manafwa were brought on board and baseline data collection was
done in Abim. The baseline data for Kanungu and Kayunga districts
was generated during the FY 2011/2012.”10
Currently, there are roll out strategies which have enabled the CIS
programme to extend to 49 districts. It is also expected that the CIS will
be rolled out to all the districts of the country (UBOS, 2014).
12
of CIS. If this happens, then, accessibility and quality of data will
improve –adding value to the decision making process at the lower
local governments in Uganda.
14
References
Brownson, R.C., Smith, C.A., Jorge, N.E., Deprima, L.T., Dean, C.G
& Cates R.W. (ND). The Role of Data-driven Planning and Coalition
Development in Preventing Cardiovascular Disease.
(http://www.ncbi.nlm.nil.gov/articles/PMC accessed
on 14/10/2013)
16
Valeda F. D (2007), Local Economic Development in Uganda and the
Connections to Rural Community Libraries and
Literacy, New Library World, Vol. 108 No. 5/6, 2007,
PP. 2003-217, Emerald Group Publishing Limited,
New York.
17
CHAPTER TWO
Chemisto Haruna12
This Chapter is intended to equip the reader especially the lower local
government managers/planners with knowledge and skills in data
presentation and analysis for policy formulation and decision making
process at sub-county level.
Continuous Variables: weight, height and the like. This may relate to
people in a sub-county. These are very good examples of continuous
variables because they take on any value within a given range.
21
2.5 Importance of Data Presentation and
Analysis
Data presentation and analysis is important in policy formulation,
decision making and planning process because of the following
reasons:
Thus, plans, policies and decisions that are based on accurate data
presentation and analysis increase the credibility of the decision makers
and ensure that, their policies, plans and decisions taken are realistic
and representative of the needs and priorities of the communities.
1 1 50 10 6 3 1
2 2 58 8 6 2 1
3 1 50 4 4 1 2
4 2 58 3 5 2 1
5 1 25 3 5 2 1
6 2 40 6 6 3 1
7 3 55 6 10 3 2
8 1 22 2 3 2 1
9 2 25 2 5 2 1
10 3 40 9 12 3 2
11 1 25 9 6 2 1
12 1 55 5 7 3 1
13 2 40 4 5 2 2
14 1 50 12 15 4 1
23
15 1 55 9 10 3 1
16 2 58 10 18 4 1
17 1 68 16 20 5 1
18 4 68 12 16 6 1
19 2 40 8 8 3 2
20 2 58 6 9 2 2
21 1 25 5 7 3 1
22 1 40 10 22 5 1
23 4 35 4 7 3 2
24 1 35 6 11 4 1
25 3 55 18 25 6 1
26 2 68 24 3 2 2
27 1 75 18 15 8 1
28 4 40 8 9 4 1
29 3 55 5 8 3 1
30 1 58 15 18 4 1
2 2 3 3 4 4 4 5 5 6
6 6 6 8 8 8 8 9 9 9
10 10 10 12 12 15 16 18 18 24
Certainly, this is not the best way of arranging this particular set of data,
but it is better than the data given in Table 3.1. A table can be formed
showing the number of times a particular number of children that are
24
possessed by the individuals appear. This is called frequency which is
usually denoted by symbol f.
2 2
3 2
4 3
5 2
6 4
8 4
9 3
8 3
12 2
15 1
16 1
18 2
24 1
∑ 30
26
Table 2.4 Frequency Distribution Table showing the Ages of the
Individuals in a given Sub-county
It should be noted that the numbers for each class interval indicate the
limits. For example 20 and 29 are lower and upper limits of that class
interval. Once the groups are not given, then it is advisable to use the
following guidelines in constructing the class intervals:
These guidelines only help when one is constructing the class groups.
Otherwise, one may be given a grouped data that may not obey the
above guidelines.
27
2.10 Graphical Presentation of Data
Other than presenting data in table form, it can also be presented
graphically. The most commonly used graphs to present data are:
(i) Pie-Charts
(ii) Bar Graphs
(iii) Histograms
(iv) Frequency Polygons
(v) Frequency Curves (Ogives)
There are three most common values or measures that indicate the
average namely; the Mean, Median and Mode. The mean is defined as
the arithmetic average of the distribution and the median is the middle
value in the distribution while the mode is the most frequently occurring
value or that value with the highest frequency.
28
Table 2.5 Frequency Distribution Table showing the Age of the
Individuals in the Sub-county
Mean
Considering Table 3.5, one may be interested in determining the
mean age of the individuals in the sub-county. The procedure is to
have the corresponding elements in the age column multiplied by the
corresponding elements in the frequency column. The sum of the
column marked fx is divided by the total number of individuals.
Thus the mean (average) age µ = ∑fx = 1401 = 46.7
∑f 30
Hence, the average or the mean age of the community in the sub-
county (using the above example) is approximately 47 years.
However, in the case of grouped data in Table 3.4 the mean age can
be estimated assuming that all the observations in each class are
concentrated in the class mid-point.
29
Thus the mean (average) age µ = ∑fx = 1435 = 47.83
∑f 30
In the case of grouped data, the average or mean age of the community
in the sub-county is estimated at approximately 48 years.
Median
The median is the middle number or term in the distribution. For
ungrouped data, the median depends on the number of observations
and is n/2, where n is the number of observations, while for an odd
number of observations, the median is the (n+1)/2. If the number of the
observations is even, then the median is the average of the two middle
terms. The data, however, has to be arranged in either ascending or
descending order.
For grouped data, the median has the following simple formula which is
used to obtain an estimate of the exact median within the median class,
where Lb is the lower class boundary of the median class, cfm-1 is the
cumulative frequency of the class before the median class, i is the class
width of the median class and fm is the frequency of the median class.
Mode
The mode is the most common observation. In the case of ungrouped
data, the mode is therefore the age with the highest frequency, which
in this case, is 40 when the age distribution is not grouped. However,
when the age distribution is grouped, then the following simple formula
is used to obtain an estimate of the mode, where Lb is the lower class
30
boundary of the modal class, d1 is the difference between the frequency
of the modal class and that of the class before the modal class, d2 is
the difference between the frequency of the modal class and that of the
class after the modal class and i is the width of the modal class.
2.12 Conclusion
No doubt, the values mean, mode and median among others will have
profound implication, indication and direction of policy designs and
directions to the planners at the lower local government levels. These
values should be taken seriously for any development decisions to be
meaningful.
31
References
32
CHAPTER THREE
Kiboma Gimui13
Masokoyi Waswa14
Kawalya Isa15
13 Former Lecturer at the Islamic University in Uganda, Faculty of Arts and Social
Sciences, Department of Geography
14 Former Lecturer at the Islamic University in Uganda, Faculty of Management
Studies, Department of Public Administration
15 Lecturer at the Islamic University in Uganda, Faculty of Arts and Social Sciences,
Department of Political Science
33
3.1 Data Collection Methods for Social
Economic Activities
34
3.2 Data Collection Methods
Both qualitative and quantitative data may be collected using the
following methods and techniques:
On the other hand however, there are some challenges that are likely
to be encountered when using surveys and questionnaires during the
process of data collection. Some of the challenges are identified as:
All in all, surveys and questionnaires are the most commonly used
methods for collecting data by programmes involving social economic
activities.
39
Challenges of Tests and Assessment Method
• Can be costly and time intensive
• May require scoring by an external source
40
3.3 Conclusion
More methods of data collection are available for use depending on the
nature of the data needed by the local government authority including
the planners. However, selection of one or more methods will depend
on a number of factors from the perspective of the local government
unit interested in the data. The data collection method selection criteria;
may include time, funds availability or generally the budget, human
resource and nature of respondents among others.
41
References
Axinn, W.G. and Pearce, L.D. (2006). Mixed Method Data Collection
Strategies. Cambridge University Press, London
42
CHAPTER FOUR
Ssesanga Idris16
Kiyonga Hamid17
• It should be adequate,
• It should be accurate,
• It should be relevant,
• It should be timely delivered to Local authorities, if it is to be of
value to them.
The main purpose of this chapter is to equip the reader and more
especially data managers at lower local government with skills and
knowledge on how to store data or information and be able to avail it in
time when required to aid the planning, decision making and resource
control among others at Sub-county or village levels.
45
4.3 Meaning of Data
There is no universally accepted definition of the term, but generally
data refers to un processed information. Martin (2005) defined data
as raw facts, ideas and it is only after such scattered raw facts, ideas,
suggestions have been organized, interpreted and made meaningful
that we get information. Data in the context of sub-counties constitutes
say, raw facts about some things such as the forest reserves existing
in the sub-county, the swamps existing in the sub-counties, the women
existing in the sub county, the rain fall patterns of a certain sub-county,
20% elderly women among others.
It should be noted that data storage has a strong link to the type/ format
of data to be stored because it determines the documentation of data
as well as the storage equipment to be used and even the retention
schedules of such data.
47
4.6 Meaning and Scope of Data
Documentation
Data documentation refers to the medium or format in which data
(information) is captured from the diverse sources during data collection
and generation methods and how it is reported to the intended or
potential users.
Therefore, data is captured and documented differently according to
the type and format of data. It can be documented through:
49
Table 4.2: Typical sample of how statistical data is documented
SN Item Size/ Qty Ownership Productivity Village(s) Parish Status Market
1.0 Natural resources
Forest coverage
Land acreage
Rivers
Lakes
Quarries;
1. sand
2. stones
Wetlands
Rainfall patterns
2.0 Agricultural Production
Animals:
1. Cows
50
2. Goats
3. Pigs
4. Poultry
Crops:
1. Coffee
2. Cotton
3. Tea
4. Banana
3.0 Social Services
3.1 Schools:
1. Primary
2. Secondary
3. Tertiary
3.2 Health services:
1. Clinics
2. Dispensaries
3. Hospitals
3.3 CSOs:
1. NGOs
2. CBOs
3. Mosques
4. Churches
5. Associations
51
4.9 Conclusion
Given the objectives of this chapter as mentioned earlier on, data
management, storage and preservation provide the rationale behind
data management with particular emphasis to data storage and
preservation in the context of planning and development and service
delivery at the lower local government levels. Lower local government
managers are expected to pay much attention to the aspects of data
management, storage and preservation. This will add value and
meaning to the decision making processes by lower local governments
regarding development processes and programmes.
52
4.10 References
Briney, K. (2015) Data Management for Researchers: Organize,
maintain and share your data for research success. Pelagic
Publishing, New York
Borghoff, U.M., Rödig, P., Scheffczyk, J. and Schmitz, L. (2007)
Long-Term Preservation of Digital Data: Principles and
Practices. Springer, Berlin Heidelberg
Martin, M. (2005) Local and Regional Government Information: How to
Find It, how to Use it. Greenwood Press, London
Berman, F. (2008). Got data?: a guide to data preservation in the
information age. Communications of the ACM, 51(12), 50-56.
Altman, M., Adams, M., Crabtree, J., Donakowski, D., Maynard, M.,
Pienta, A., & Young, C. (2009). Digital preservation through
archival collaboration: The data preservation alliance for the
social sciences. The American Archivist, 72(1), 170-184.
Berman, F. (2008). Got data?: a guide to data preservation in the
information age. Communications of the ACM, 51(12), 50-56.
Piramanayagam, S.N. and Chong, T.C. (2011) Developments in Data
Storage: Materials Perspective. Wiley, London
53
CHAPTER FIVE
Kalanzi Ismail18
A computer
This is a system composed of hardware and software that can be used
to accomplish tasks as speedy and accurately as possible.
Software
These are the programs that can enable a computer to perform various
tasks.
Hardware
This comprises the physical devices (Equipments) of a computer.
Desktop
The desktop is considered to be the working area of the screen of the
computer.
Icons
These are the graphical representations of various programmes.
55
Folders
These are containers in which files can be stored on a computer.
File
This is a collection of related information.
Clicking
This is the pressing of the mouse buttons.
Double clicking
Double clicking involves the rapid pressing and releasing of the mouse’s
left button.
Right clicking
This is the pressing of the right button of the mouse.
56
5.2 The Primary storage device
A primary storage location holds data for short period of time. The data
stored in a primary storage gets lost immediately when a computer is
switched off.
Floppy Disks
Floppy disks are one of the oldest storage devices which are rarely
used today. A floppy disk would store less data of about (1.5MB) and
can easily be damaged due to poor storage.
The main advantage of floppy disks was that they were very cheap to
buy and very common. However, floppy disks were associated with the
following demerits:
They were easily physically damaged if unprotected and magnetic
58
fields could damage the data.
• They were relatively slow to access because floppy discs
rotate far more slowly than hard discs, at only six revolutions
per second.
A Flash Disk
This is a portable storage device that can store a large amount of data
compared to floppy disks. It can be inserted (connected) to a USB
(universal serial bus) port of a computer in order to transfer data.
Merits of PDAs
• They don’t depend on computers to store data because they
59
have most of the features of a computer.
• Speedy when accessing data
• Can store huge data
Demerits of PDAs
• They are very expensive to buy.
• They need serious maintenance to operate normally and
effectivelly
• They are complicated even though it depends on the model.
Disk partitioning
This is the logical dividing of a storage device into more than one part.
It is mostly performed on new hard disks and damaged ones.
Steps:
• Right click on my computer and click on manage.
• From the menu which appears, click on disk management.
• Right click on the disk for partitioning and select ‘new partition’.
Then follow the wizard.
Steps
• Open my computer, right click on the partition you want to free,
select properties and click on Disk Cleanup.
• Wait for the computer to scan the space and check in the box
for the files to be removed and click ok for the process to start.
Disk checking
This is the repairing of damaged storage devices. Many computers get
disk damage and continue to work but can let you down when you do
not expect. And mostly on crucial moments when you are badly in need
of your data. So, in order to avoid sudden break down of our storages,
we need to carry out the following steps.
Steps
• Open my computer, right click on the disk part to check and
select properties.
61
• Select tools from the menu which appears.
• Click on ‘check now’ and check the combo box options and
then click ‘start’.
Defragmenting
This is the restoring of lost program files on disks which may cause halt
in smooth use of the storage device.
Steps
• Open my computer, right click on the disk part to check and
select properties.
• Select tools from the menu which appears.
• Click on ‘defragment now’.
• Click on defragment from the menu which appears.
Formatting
This is the preparing of a storage device for storing data. The most
common file systems used when formatting are FAT and NTFS.
Steps
• Right click on the device to be formatted.
• Select ‘format’ from the dialogue box.
• Choose the file system you want to use from file system pop
down menu and click start button.
Points to remember
• You need to choose an appropriate storage device to use. (For
instance PDA for data collection and scheduling, flash or floppy
for quick data mobility and transfer and CD and Hard disks for
data backup)
• Perform the six storage device management routines
appropriately.
62
References
Balasa, F. (2010). Data Storage, In-Tech, Intellectwebb.org Accessed
on 6/7/2018 at 3:27pm)
Camden, R. (2015). Client-Side Data Storage: Keeping It Local.
O’Reilly Media, New York
Clark, T. (2005). Storage Virtualization: Technologies for Simplifying
Data Storage and Management. Addison-Wesley
Khurshudov, A. (2001). The Essential Guide to Computer Data
Storage: From Floppy to DVD. Prentice Hall, London
Morris, R.J.T and Truskowski, B.J. (2003). The Evolution of Storage
System, IBM System Journal Vol 42, No 2.
Piramanayagam, S.N. and A1 - Chong, T.C. (2011). Developments in
Data Storage: Materials Perspective. Wiley and Facet, London.
Schroader, A. and A1 - Cohen, T. (2011). Alternate Data Storage
Forensics. Elsevier Science. London.
Services, E.M.C.E. (2010). Information Storage and Management:
Storing, Managing, and Protecting Digital Information. Wiley.
Roebuck, K. (2011). Cloud Data Storage: High-impact Strategies
- What You Need to Know: Definitions, Adoptions, Impact,
Benefits, Maturity, Vendors. Lightning Source.
63
CHAPTER SIX
64
6.1 Introduction
We live in big data world. As an information society, we have a growing
reliance on creating and consuming data which must be available
whenever and wherever it may be desired. Data and related information
services are enabled or provided via information technology services
combining applications, facilities, networks, servers, storage hardware
and software resources. More data can be stored in the same or
smaller physical footprint than in the past, thus requiring less power
and cooling per gigabyte, terabyte, petabyte, or exabyte. As a result
of this increasing reliance on information, both for home and personal
use along with business and professional needs, more data is being
generated, processed, moved, stored and retained in multiple copies
for longer periods of time.
The rapidly expanding generation of Internet-based services such
as email, blogging, social networking, search and e-commerce have
substantially re-defined the behavior and trends of web users when
it comes to creating, communicating, accessing content, sharing
information and storing data (Wei, 2013). Recently, there has been a
great deal of publicity about cloud data storage.
65
6.2 Definition of cloud data storage
Cloud data storage refers to the practice of storing electronic data with
a third party service accessed via the Internet. It is an alternative to
traditional local storage or physical data storage, where recorded data
is stored on a hard disk or portable storage, or, alternately, a server
or device connected to a local network. It can also be called “hosted
storage,” “Internet storage” or “online storage” (Beal, 2015). Zhao et.al.,
(2014), defined online data storage as a virtual storage approach that
allows users to use the Internet to store recorded data in a remote
network. This data storage method may be either a cloud service
component or used with other options not requiring on-site data backup.
Carlson (2013), looks at cloud storage as a model of data storage in
which the digital data is stored in logical pools, the physical storage spans
multiple servers (and often locations) and the physical environment
is typically owned and managed by a hosting company. These cloud
storage providers/companies are responsible for keeping the data
available and accessible and the physical environment protected and
running. People and organizations buy or lease storage capacity from
the providers to store user, organizational, or application data.
66
6.3 Types of cloud data storage
As identified by Schulz (2011), there are three main types of data cloud-
based storage: public, private and hybrid.
What is the earliest form of data storage used by human beings? When
this question is raised, answers such as stone, paper, tape and so on
often come up before someone suggests “the brain.” Although the brain
is the data storage system that the Creator has provided us with, it is
not sufficient for all purposes. Even though the brain can be used for
storage of certain kinds of information, how reliably one can retrieve
the information depends on the individual and the circumstances.
Moreover, information stored in a person’s brain cannot be transferred
to others after the life of that person. Thus, we need data or information
storage systems for at least two main purposes notably:
IDrive: allows you 5GB of storage and the use of unlimited machines –
a generous start. You won’t get a free trial and it can only be used with
Windows and Mac. It offer a ‘one stop shop’ solution that enables you
to back up everything – PC, Mac, devices and even Facebook pictures
into the same account.
70
Source: adopted from http://cloudnewsdaily.com/cloud-storage/
72
6.7 Challenges of cloud data storage
Given the obvious benefits of Cloud Storage, raises the question as to
why many more organizations have not already adopted cloud storage
for on-premises applications and alleviate present expensive storage.
According to IMEXResearch.com, there are traditional IT barriers which
must be overcome when considering integration of cloud storage into
today’s production environments.
Performance and data transfer rates become key issues as the
distance between the data and the user increases - which is what
happens in cloud computing.
Even unlimited bandwidth without solving the latency problem will not
improve the performance because it is the latency - or the chattiness
- of the protocols, plus the speed of light limitations that cause the end
user experience to be very poor.
Not all data access patterns are well suited to the cloud, particularly
if there are large distances to cover. In such cases, bandwidth becomes
not only a challenge but a financial consideration.
Cloud storage isn’t about to replace the storage network in the data
center any time soon, at least not for data-intensive, high performance,
low-response time, transactional applications and for mission-
critical data. But there is pretty many chances where companies
and organizations of all sizes will augment their on-premise storage
with cloud storage potentially from various vendors in a hybrid model
deployment. However, hybrid models tend to bring interoperability
issues and the need to deal with different tools, API’s, management
infrastructure and the like.
The popular storage use cases tend to be infrequently accessed
data scenarios including archiving, backup and offsite data protection.
Performance and data transfer rates become key issues as the distance
between the data and the user increases - which is what happens in
cloud computing. It will be a hybrid cloud storage world-wide for many
years to come with a lot of that storage living in private clouds. Many
large enterprises deal with petabytes of data and their processing.
73
Apart from the above mentioned traditional IT barriers, a survey
conducted by IMEX Research reveals that both small and large
enterprises today are concerned about:
Security & Data Availability - Concerned about the security of their
data, once it is in the hands of the cloud provider whose multi-tenancy
infrastructure is shared by others. Numerous questions arise when
considering cloud storage, including “is my data secure?”, “Am I in
compliance?”, “what happens if a provider loses a disk drive?”, “is my
data protected?”, “how do I know if my data is truly unusable when
I delete it?”, and many others. StorSimple allows you to provide an
encryption key. They are rightfully worried about the availability of their
data and how that impacts their day-to-day operations. What happens
if a cloud storage service is offline for a period of time?
Performance - Legitimately concerns about application performance
if the application storage is in the cloud. Will the cloud storage service
satisfy my workloads?
Bandwidth limitations – Bandwidth is a limiting factor when accessing
a public storage cloud, as they are accessed over the Internet.
Primary storage deduplication and compression, minimizes bandwidth
consumption dramatically while also improving performance.
Latency constraints – Latency is the silent killer of application
performance, both in terms of response time and throughput.
StorSimple takes advantage of parallelization, persistent connections,
and TCP optimizations to overcome latency and improve performance.
Manageability - Are concerned about being locked into their proprietary
cloud storage infrastructure and applications services. They do not
have vendor independent tools or industry standards to evaluate the
applicability or measure the effectiveness of cloud storage for their
environment.
Costs - Many cloud storage services charge their customers based
on the amount of storage capacity consumed and the amount of
bandwidth consumed. Much as reducing cloud storage costs through
deduplication and optimization and taking advantage of pay-as-you-
grow schemes helps but choosing a cloud storage vendor which meets
74
all other criteria remains a challenge.
6.9 Conclusion
Today, the term cloud storage has become increasingly common.
Although many people use the terms “cloud storage” and “online
storage” interchangeably, technically, cloud storage is a particular
kind of online data storage. In order to be considered cloud storage, a
service must be sold on demand, provide elasticity (the user can have
as much or as little as desired) and offer self-service capabilities. Cloud
storage comes in all shapes and forms. Direct comparison between
providers is often difficult because they focus on different aspects of
the service. Often people will base their decision on the amount of free
storage available.
However, this is only one element that you might want to consider. Most
solutions will include a degree of free storage, though if you are backing
up photos and media, this can run out quickly. It is therefore worth
looking ahead to see how much you will have to pay and how much
space it will get you. Aside from that, you should look at what operating
75
systems are supported and whether you will be able to back up and use
files from more than one machine – if you are working on documents at
home as well as in the office, for example, this may be important.
76
6.10 References
Agrawal, D., Das, S. and Abbadi, A.E. (2013) Data Management
in the Cloud: Challenges and Opportunities. Morgan &
Claypool, California.
Balasa Florine (2010) Data Storage. In-teh, Delhi
Zhao, L., Sakr, S., Liu, A., Bouguettaya, A. (2014) Cloud Data
Management. Springer International Publishing. London
77
ABOUT THE EDITORS
Jamil Serwanga
Acting Vice Chancellor Equator University of Science and
Technology -Project - Uganda
Haroonah Nsubuga
Dean Faculty of Arts and Social Sciences, Zanzibar Univer-
sity