Documente Academic
Documente Profesional
Documente Cultură
A Database is a collection of related data organised in a way that data can be easily accessed,
managed and updated. Any piece of information can be a data, for example name of your school.
Database is actualy a place where related piece of information is stored and various operations
can be performed on it.
DBMS
A DBMS is a software that allows creation, definition and manipulation of database. Dbms is
actualy a tool used to perform any kind of operation on data in database. Dbms also provides
protection and security to database. It maintains data consistency in case of multiple users. Here
are some examples of popular dbms, MySql, Oracle, Sybase, Microsoft Access and IBM DB2
etc.
Users : Users may be of various type such as DB administrator, System developer and
End users.
DBMS : Software that allow users to define, create and manages database access, Ex:
MySql, Oracle etc.
Functions of DBMS
Concurrency Control
Provides a clear and logical view of the process that manipulates data.
Advantages of DBMS
Disadvantages of DBMS
Complexity
Costly
Large in size
Database Architecture
Database architecture is logically divided into two types.
1.
2.
Two-tier Client / Server architecture is used for User Interface program and Application
Programs that runs on client side. An interface called ODBC(Open Database
Connectivity) provides an API that allow client side program to call the dbms. Most
DBMS vendors provide ODBC drivers. A client program may connect to several
DBMS's. In this architecture some variation of client is also possible for example in
some DBMS's more functionality is transferred to the client including data dictionary,
optimization etc. Such clients are called Data server.
Three-tier Client / Server database architecture is commonly used architecture for web
applications. Intermediate layer called Application server or Web Server stores the
web connectivty software and the business logic(constraints) part of application used to
access the right amount of data from the database server. This layer acts like medium
for sending partially processed data between the database server and the client.
Database Model
A Database model defines the logical design of data. The model describes the
relationships between different parts of the data. In history of database design, three
models have been in use.
Hierarchical Model
Network Model
Relational Model
Hierarchical Model
In this model each entity has only one parent but can have several children . At the top
of hierarchy there is only one entity which is called Root.
Network Model
In the network model, entities are organised in a graph,in which some entities can be
accessed through sveral path
Relational Model
In this model, data is organised in two-dimesional tables called relations. The tables or
relation are related to each other.
Database Keys
Keys are very important part of Relational database. They are used to establish and
identify relation between tables. They also ensure that each record within a table can be
uniquely identified by combination of one or more fields within a table.
Super Key
Super Key is defined as a set of attributes within a table that uniquely identifies each
record within a table. Super Key is a superset of Candidate key.
Candidate Key
Candidate keys are defined as the set of fields from which primary key can be selected.
It is an attribute or set of attribute that can act as a primary key for a table to uniquely
identify each record in that table.
Primary Key
Primary key is a candidate key that is most appropriate to become main key of the
table. It is a key that uniquely identify each record in a table.
Composite Key
Key that consist of two or more attributes that uniquely identify an entity occurance is
called Composite key. But any attribute that makes up the Composite key is not a
simple key in its own.
Non-key Attribute
Non-key attributes are attributes other than candidate key attributes in a table.
Non-prime Attribute
Non-prime Attributes are attributes other than Primary attribute.
dBMS:Centralised vs Distributed
Contents
[hide]
2.1 Examples
2.2 Advantages
2.3 Disadvantages
3 Distributed database
3.1 Examples
3.2 Advantages
3.3 Disadvantages
4 Review
Centralised - where the database is physically in one location and users typically use an
Internet connection to access it. Banks (such as ANZ) tend to use centralised databases.
Distributed - Where the database is in many locations often where you have a national or
international company and customers tend to regularly interact with a local branch. For example:
Google uses Big-Table a distributed DBMS as searching tends to be by users in a particular
region of the world.
Centralised
Centralised
LAN
WAN
Examples
Some major banks do all their processing on a mainframe, in some cases in a different
country.
Clients may use several branches, and online banking for transactions.
Advantages
Faster Response
Disadvantages
Processing overheads
Data integrity
Distributed database
Note that users think they are working with a single corporate database
Examples
Google: Use a DBMS called Bigtable. (Note it is not a Relational Database).("What database
does Google use?", 2010[1]; Chang,F., et al.,2006[2])
Advantages
Minimise communications
Costs
Local control
Disadvantages
Processing overheads
Data Integrity
1. Centralised IT Infrastructure:
In a centralised IT infrastructure, a central computing facility comprising one or more large computers is located and all the applications are mounted on it, wherein the entire data, irrespective of its
source, origin and type, are located and processed.
A typical centralised IT infrastructure consists of a large central
computer system with-a variety of highly configured peripheral
devices concentrated at that location. A battery of dump terminals,
not necessarily physically close to the central computer system, are
connected to it with the help of communication links to enable users
2. Decentralised IT Infrastructure:
However, with the advancements in data communication technologies and availability of reliable data communication facilities at
declining costs, business enterprises are switching over from centralised data processing to varying degrees of decentralised data
processing (DDP). The DDP facility would generally consist of relatively smaller computers located at different places in an enterprise
with or without a central computing facility.
Advantages:
The rationale behind switching over to varying types of DDP
can be traced from the following advantages that are associated with DDP:
(a) The data processing facility in DDP at each location is oriented
to satisfy the specific needs and is developed in the light of the local
constraints.
(b) DDP also covers, to some extent, the risk of putting all eggs in
one basket. It also ensures that any failure (of hardware, software or
personnel) has minimum possible impact on the overall functioning
of the system.
(c) DDP helps in efficient use of computing facilities by personnel
located at different places by the resource sharing.
(d) Obviously, DDP ensures increased users involvement which is
one of the important success factors for any system. It also permits
local development of small applications for local use.
(e) DDP offers the necessary flexibility for gradual growth in hardware and software, and ease in their replacement.
(f) It provides quicker response to users, more particularly when a
local portion of the facility is to be used. This, in turn, ensures higher
end-user productivity.
(g) As individual responsibilities can be assigned easily for security
and privacy of information in case of DDP, security system is likely
to be more effective.
(h) Vendor independence at each location also provides the flexibility of adapting systems and application software to the changing
needs at each location without adversely affecting the work at other
locations.
Disadvantages:
However, DDP also has its own limitations that warrant
selective use of the DDP approach. Some of these limitations
include:
(a) Data generated by one application may not be useful for another
application due to lack of standardization of data structures resulting
in problems of incompatibility and duplication of data. Thus, it is
necessary to draw a central plan for generation of information.
(b) Duplication may also occur in software effort as similar applications may be developed by different technical personnel.
(c) DDP depends for its day-to-day operations on data communication facilities that may not be operational all the time. Maintenance
problems in communication facilities may adversely affect the
functioning.
(d) As the data, in DDP, may be dispersed and stored in diverse
forms, it may become difficult to update data and exercise control
over it. As it can be observed, the impact of these limitations can be
minimised, if not eliminated, by adopting suitable managerial
policies.
It may be noted that the choice is not exclusive between centralisation and decentralisation. In fact, depending upon the application,
volume of data, nature of processing, communication facilities, information needs and other critical factors, a choice is made for various
applications with regard to the scope and degree of
decentralisation.