
DWDM

Components (Building Blocks) of Data Warehouse

References :
Data Warehousing Fundamentals by Paulraj Ponniah Chapter 2.
Data Mining Concepts and Techniques by Han and Kamber Pg: 105 - 110

5/18/2013

Overview
Building an operational system requires the following components:
o Front-end component: a GUI through which the user inputs data.
o Data storage component: includes the DBMS.
o Display component: the set of screens and reports for the user.
o Connectivity component: the network software.

The components are arranged in the way that best suits the information requirements and the framework of the organization.

Architecture: Proper arrangement of components.
Components: H/W, S/W components (Building Blocks).

Basic building blocks of DW:
1. Source Data
2. Data Staging
3. Data Storage
4. Information Delivery

Other building blocks:
1. Management & Control
2. Metadata

Every DW has the same basic building blocks, whether it is a DW for a grocery store or a DW for a global banking institution. The difference lies in how some of the blocks are made stronger than others in the architecture.

1. Source Data Component


Source data coming into the data warehouse may be grouped into four broad categories:
Production Data: This category of data comes from the various operational systems of the enterprise. Based on the information requirements in the DW, you choose segments of data from different operational systems.
Internal Data: In every organization, users keep their private spreadsheets, documents, customer profiles and sometimes even departmental databases. This data is known as internal data, and it is also useful in a data warehouse.
Archived Data: In operational systems, we periodically take the old data and store it in archived files. The data in these archived files is referred to as archived data.
External Data: This category includes data from external sources, for example market share data of competitors.
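
The four categories above can be recorded as a simple tag on each data source. The following is a minimal sketch under stated assumptions: the class names, the source names and the grocery-store setting are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from enum import Enum


class SourceCategory(Enum):
    PRODUCTION = "production"   # operational systems of the enterprise
    INTERNAL = "internal"       # private spreadsheets, documents, departmental databases
    ARCHIVED = "archived"       # old operational data moved to archived files
    EXTERNAL = "external"       # data from outside the enterprise


@dataclass(frozen=True)
class DataSource:
    name: str                   # hypothetical source name
    category: SourceCategory


# Hypothetical sources for a grocery-store DW.
SOURCES = [
    DataSource("point_of_sale_system", SourceCategory.PRODUCTION),
    DataSource("store_manager_spreadsheets", SourceCategory.INTERNAL),
    DataSource("sales_archive_2010", SourceCategory.ARCHIVED),
    DataSource("competitor_market_share", SourceCategory.EXTERNAL),
]


def sources_by_category(sources):
    """Group source names under the category each one belongs to."""
    grouped = {category: [] for category in SourceCategory}
    for source in sources:
        grouped[source.category].append(source.name)
    return grouped


grouped = sources_by_category(SOURCES)
```

Tagging each source this way makes it easy to see which of the four categories a given DW actually draws on.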

2. Data Staging Component:


After extracting data from the various operational systems and from external sources, we have to prepare it for storage in the data warehouse. Data staging provides a place and a set of functions to clean, change, combine, convert and prepare source data for storage and use in the DW. Three major functions must be performed to get the data ready: extraction, transformation and loading.
Data Extraction: We have to employ the appropriate technique to pick the data suitable for the data warehouse out of the large amount of data received from the operational systems.
Data Transformation: Data transformation involves many forms of combining pieces of data from the different sources. This function ends when we have a collection of integrated data that is cleaned, standardized and summarized. We are then ready to load the data into the data warehouse.
Data Loading: In this phase a large volume of data is loaded into the DW in order to make it live. This takes a substantial amount of time.
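
The three staging functions can be sketched as a small pipeline. This is only an illustration under stated assumptions: the two source systems, their field names and the `sales` table are hypothetical, an in-memory SQLite database stands in for the warehouse storage, and the summarization step of transformation is left out for brevity.

```python
import sqlite3

# Hypothetical rows as they might arrive from two operational systems.
ORDERS_SYSTEM = [
    {"cust": " alice ", "amount": "120.50", "date": "2013-05-01"},
    {"cust": "BOB", "amount": "80", "date": "2013-05-01"},
    {"cust": "bob", "amount": None, "date": "2013-05-02"},  # unusable record
]
WEB_SHOP = [
    {"customer_name": "Alice", "total": 30.0, "day": "2013-05-02"},
]


def extract():
    """Extraction: pick only the fields the warehouse needs from each source."""
    for row in ORDERS_SYSTEM:
        yield row["cust"], row["amount"], row["date"]
    for row in WEB_SHOP:
        yield row["customer_name"], row["total"], row["day"]


def transform(rows):
    """Transformation: clean, standardize and combine into one format."""
    for customer, amount, day in rows:
        if amount is None:          # cleaning: drop records with no amount
            continue
        # standardizing: uniform name spelling, numeric amount
        yield customer.strip().title(), float(amount), day


def load(conn, rows):
    """Loading: bulk-insert the prepared rows into warehouse storage."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sales (customer TEXT, amount REAL, day TEXT)"
    )
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)
    conn.commit()


conn = sqlite3.connect(":memory:")
load(conn, transform(extract()))
loaded = conn.execute(
    "SELECT customer, amount, day FROM sales ORDER BY day, customer"
).fetchall()
```

Keeping the three functions separate mirrors the staging description: each one can be changed (a new source, a new cleaning rule, a different target store) without touching the other two.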

3. Data Storage Component:


The storage for the data warehouse is a separate repository. Since this data is used for analysis, DWs are read-only repositories.
Data warehouses employ:
o Relational database management tools.
o Multidimensional database management tools.
Data extracted for data warehouse storage is aggregated in many ways, and the summary data is kept in the multidimensional databases.
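
To illustrate what "aggregated in many ways" can mean, the sketch below totals the same detailed facts along different combinations of dimensions. It is a simplified stand-in, not how a multidimensional database is actually implemented; the facts, the dimension names and the `summarize` helper are hypothetical.

```python
from collections import defaultdict

# Hypothetical detailed sales facts: (product, region, month, amount).
FACTS = [
    ("milk", "north", "2013-04", 100.0),
    ("milk", "south", "2013-04", 60.0),
    ("bread", "north", "2013-04", 40.0),
    ("milk", "north", "2013-05", 120.0),
]
DIMENSIONS = ("product", "region", "month")


def summarize(facts, by):
    """Total the amount over every combination of the chosen dimensions."""
    positions = [DIMENSIONS.index(name) for name in by]
    totals = defaultdict(float)
    for fact in facts:
        key = tuple(fact[i] for i in positions)
        totals[key] += fact[-1]     # the amount is the last element of each fact
    return dict(totals)


# The same facts summarized in two different ways.
by_product = summarize(FACTS, by=("product",))
by_product_month = summarize(FACTS, by=("product", "month"))
```

Each call produces one summary; storing several such summaries ahead of time is what lets analysis queries be answered without rescanning the detailed data.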

4. Information Delivery Component:


Users who need information from the DW:
1. Novice users: Have no training, and thus require prefabricated reports and preset queries.
2. Casual users: Need information once in a while. These users also need prepackaged information.
3. Business analysts: Look for the ability to do complex analysis using the information in the DW.
4. Power users: Need the ability to navigate throughout the DW, pick interesting data and format their own queries. They need the ability to drill through the data layers and create custom reports.

Ad hoc reports: Predefined reports, primarily meant for novice and casual users.
Complex queries, multidimensional (MD) analysis and statistical analysis: Cater to the needs of business analysts and power users.
Executive Information System (EIS): Meant for senior executives and high-level managers.
Data mining: Some DWs also provide data to data-mining applications.
A DW may include several information delivery mechanisms. Delivery may be through e-mail, the web or an intranet.

5. Metadata Component:


Metadata in a data warehouse is similar to the data dictionary or the data catalog in a database management system. Metadata in a DW falls into three major categories:
Operational Metadata: Contains all the information about the operational data sources. It helps tie the information in the DW back to the original data source.
Extraction and Transformation Metadata: Contains data about the extraction of data from the source systems, namely extraction frequencies, extraction methods and business rules for data extraction. It also contains information about all the transformations that took place during data staging.
End-User Metadata: The navigational map of the data warehouse. It enables end-users to find information in the data warehouse.
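
The three metadata categories can be pictured as three kinds of records that all describe the same warehouse column. The following is a rough sketch under stated assumptions: every class, field and value is hypothetical and only illustrates the roles described above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class OperationalMetadata:
    warehouse_column: str    # column as it appears in the DW
    source_system: str       # operational system it came from
    source_field: str        # field name in that system


@dataclass(frozen=True)
class ExtractionMetadata:
    warehouse_column: str
    frequency: str           # extraction frequency
    method: str              # extraction method
    transformation: str      # transformation applied during data staging


@dataclass(frozen=True)
class EndUserMetadata:
    warehouse_column: str
    business_name: str       # the term an end-user would search for
    description: str


# Hypothetical entries describing one warehouse column.
OPERATIONAL = [
    OperationalMetadata("sales.amount", "point_of_sale_system", "TXN_AMT"),
]
EXTRACTION = [
    ExtractionMetadata("sales.amount", "daily", "incremental", "convert cents to dollars"),
]
END_USER = [
    EndUserMetadata("sales.amount", "Sale amount", "Value of one sale in dollars"),
]


def trace_to_source(column):
    """Operational metadata: tie a DW column back to its original source."""
    return [
        (m.source_system, m.source_field)
        for m in OPERATIONAL
        if m.warehouse_column == column
    ]


def find_columns(search_term):
    """End-user metadata: act as a navigational map from business terms to columns."""
    term = search_term.lower()
    return [m.warehouse_column for m in END_USER if term in m.business_name.lower()]


lineage = trace_to_source("sales.amount")
matches = find_columns("sale")
```

Here `trace_to_source` plays the role of operational metadata and `find_columns` the role of end-user metadata, while the `EXTRACTION` entries record how and how often the column is populated.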

6. Management and Control Component:


This component of the data warehouse architecture sits on top of all the other components. The management and control component coordinates the services and activities within the data warehouse.
o It controls data transformation and data transfer into DW storage.
o It moderates information delivery to the users.
o It works with the database management systems and enables data to be properly stored in the repositories.
o It also monitors the movement of data into the staging area and from there into the data warehouse storage itself.

This component interacts with Metadata component to perform its functions.


Conclusion
The data warehouse is an informational environment that:
o Provides an integrated and total view of the enterprise.
o Makes the enterprise's current and historical information easily available for decision making.
o Makes decision-support transactions possible without hindering operational systems.
o Renders the organization's information consistent.
o Presents a flexible and interactive source of strategic information.
