
In which approach of data warehousing is the transaction data partitioned into facts?

1. Normalized approach
2. Operational approach
3. Dimensional approach
4. None of these

One of the techniques of implementing the OLAP engine is a Specialized SQL server. This server
exhibits the following properties:
1. It assumes that the data warehouse is in a multidimensional model and is implemented in a
relational DBMS
2. It facilitates OLAP operations in SQL when the data warehouse is organized as relational tables
3. It facilitates OLAP operations in SQL
4. It facilitates SQL queries for the data warehouse that is physically organized as a multidimensional
model

B-Tree is used as an index to provide access to records


1. Without scanning the entire table
2. By scanning the entire table
3. By scanning the entire meta data
4. None of these
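
To make the contrast between indexed access and a full scan concrete, here is a minimal sketch. It is not a real B-tree: a sorted list searched by bisection is used as a simplified stand-in for the sorted, balanced structure a B-tree maintains, and the table contents and function names are hypothetical.

```python
import bisect

# Hypothetical table: rows stored in arrival order, keyed by customer id.
rows = [(42, "Ali"), (7, "Sara"), (19, "Omar"), (88, "Zain"), (3, "Hina")]

# The "index": sorted (key, row position) pairs. A real B-tree keeps keys
# sorted in a balanced multi-way tree; a sorted list searched by bisection
# is a simplified stand-in with the same O(log n) lookup idea.
index = sorted((key, pos) for pos, (key, _) in enumerate(rows))
index_keys = [key for key, _ in index]

def lookup_with_index(key):
    """Find a row by key via binary search, without scanning `rows`."""
    i = bisect.bisect_left(index_keys, key)
    if i < len(index_keys) and index_keys[i] == key:
        return rows[index[i][1]]
    return None

def lookup_with_scan(key):
    """Find a row by key by checking every row in turn (full table scan)."""
    for row in rows:
        if row[0] == key:
            return row
    return None
```

Both functions return the same row; the indexed version only inspects a handful of keys instead of every record.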

An operational system is which of the following?


1. A system that is used to run the business in real time and is based on current data
2. A system that is used to support decision making and is based on current data.
3. A system that is used to support decision making and is based on historical data.
4. A system that is used to run the business in real time and is based on historical data.

In a traditional MIS system, there is an almost _______ sequence of queries.


1. exponential
2. quadratic
3. none of these
4. linear

Data warehouse bus matrix is a combination of


1. Dimensions and facts
2. Facts and data marts
3. Dimensions and data marts
4. Dimensions and detailed facts

Naturally Evolving architecture occurred when an organization had a _______ approach to handling
the whole process of hardware and software architecture.
1. None of these
2. Good
3. Relaxed
4. Not Relaxed

The most common source of change data in refreshing a data warehouse is:
1. Queryable change data
2. Logged change data
3. Cooperative change data
4. Snapshot change data

The performance in a MOLAP cube comes from the __________ look-up time for the array data structure.
1. O(3)
2. O(0)
3. O(1)
4. O(2)
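
As a rough illustration of why array-based cube storage gives fast look-ups, the sketch below stores a small cube in one flat array and computes each cell's position arithmetically. The dimension members, the row-major layout, and the helper names are assumptions made for the example, not the layout of any particular MOLAP product.

```python
# Hypothetical 3-dimensional cube: product x region x month.
products = ["P1", "P2"]
regions = ["North", "South", "East"]
months = ["Jan", "Feb"]

# Position of each member within its dimension.
p_pos = {name: i for i, name in enumerate(products)}
r_pos = {name: i for i, name in enumerate(regions)}
m_pos = {name: i for i, name in enumerate(months)}

# The cube is one flat array with a slot for every combination of members.
cells = [0.0] * (len(products) * len(regions) * len(months))

def offset(product, region, month):
    """Row-major offset of a cell: fixed arithmetic, no searching."""
    return (p_pos[product] * len(regions) + r_pos[region]) * len(months) + m_pos[month]

def put(product, region, month, value):
    """Store a measure value in its cell."""
    cells[offset(product, region, month)] = value

def get(product, region, month):
    """Read a measure value from its cell."""
    return cells[offset(product, region, month)]

put("P2", "South", "Feb", 125.0)
```

The cost of `get` does not depend on how many cells the cube holds, because the address is computed rather than searched for.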

The active data warehouse architecture includes which of the following?


1. Data that can be extracted from numerous internal and external sources
2. Near real-time updates
3. All of these
4. At least one data mart

A snowflake schema is which of the following types of tables?


1. Dimension
2. All of these
3. Fact
4. Helper

A data warehouse is said to contain a subject-oriented collection of data because


1. It is a generalization of object-oriented approach
2. It is built for a specific application
3. It cannot support multiple objects
4. Its contents have a common theme

Which of the following is not the managing issue in the modeling process?
1. Ensure that the transaction edit flat is used for analysis
2. Document each candidate data source
3. Walk through business scenarios
4. Do regions report to zones

The Pivot is an OLAP operation which:


1. Is a visualization operation, rotating the axes for alternative presentation
2. Integrates several dimensions
3. Is not a visualization operation
4. Two consecutive slice operations in two different dimensions

Data transformation includes which of the following?


1. A process to change data from a detailed level to a summary level
2. A process to change data from a summary level to a detailed level
3. Joining data from one source into various sources of data
4. Separating data from one source into various sources of data

Taken jointly, the extract programs or naturally evolving systems formed a spider web, also known as
1. Online Systems Architecture
2. Distributed Systems Architecture
3. Legacy Systems Architecture
4. Intranet Systems Architecture

What is true of the multidimensional model?


1. It typically requires less disk storage
2. Increasing the size of a dimension is difficult
3. Typical business queries requiring aggregate functions take more time
4. It typically requires more disk storage

The following technology is not well-suited for data mining:


1. Expert system technology
2. Data visualization
3. Technology limited to specific data types such as numeric data types
4. Parallel architecture

Which of the following statements is true?

1. The operational data are used as a source for the data warehouse
2. The data warehouse consists of data marts and operational data
3. All of these
4. The data warehouse is used as a source for the operational data

Fact tables are


1. Partially denormalized
2. Completely normalized
3. Partially normalized
4. Completely denormalized

_______ is an application of information and data.


1. Education
2. Power
3. Intelligence
4. Knowledge

Why are aggregates used in the dimensional model of a data warehouse?

1. To store data in one operational system
2. To take the dimension and change its granularity
3. All of these
4. To retrieve data from an operational system

The extract process is which of the following?


1. Capturing all of the data contained in various operational systems
2. Capturing a subset of the data contained in various decision support systems
3. Capturing all of the data contained in various decision support systems
4. Capturing a subset of the data contained in various operational systems

The growth of master files and magnetic tapes exploded around the mid- _______.
1. 1980s
2. 1970s
3. 1950s
4. 1960s

The content of a data warehouse is said to be non-volatile, because


1. Its life-span is very long
2. It is a read-only data
3. It disappears when the system is switched off
4. It remains the same even after the system crashes

A goal of data mining includes which of the following?


1. To create a new data warehouse
2. To confirm that data exists
3. To analyze data for expected relationships
4. To explain some observed event or condition

Which of the following is the most important when deciding on the data structure of a data mart?
1. XML data exchange standards
2. Data access tools to be used
3. Extract, Transform, and Load (ETL) tool to be used
4. Metadata naming conventions

ROLAP is preferred over MOLAP when


1. The multidimensional model does not support query optimization.
2. The data warehouse is in relational tables, but no slice and dice operations are required.
3. A data warehouse contains many fact tables and many dimension tables
4. A data warehouse and relational database are separable

Which of the following processes includes data cleaning, data integration, data selection, data transformation, data mining, pattern evaluation and knowledge presentation?
1. ETL process
2. KDD process
3. MDX process
4. KTL process

Roll-up is an OLAP operation:

Which switches from the details to the aggregate level along many dimensions
Which switches from the details to the aggregate level within a classification hierarchy


At which level can we create dimensional models?

1. Detailed models level
2. Implementation level
3. Architecture models level
4. Business requirements level

We want to add the following capability to Table2: for each type of account in each region, also show the dollar
amount besides the number of customers. This adds to Table2:
1. Other row(s)
2. Another measure for each cell
3. Other column(s)
4. Another dimension

Which one manages both current and historic transactions?


1. OLTP
2. Spread sheet
3. OLAP
4. XML

Transient data is which of the following?


1. Data that are never deleted once they have been added
2. Data that are never altered or deleted once they have been added
3. Data in which changes to existing records do not cause the previous version of the records to be
eliminated
4. Data in which changes to existing records cause the previous version of the records to be
eliminated
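
The difference between overwriting a record and keeping its earlier versions can be sketched as follows. The two stores, the function names, and the account key are hypothetical; the history-keeping variant is the behaviour some textbooks call periodic data.

```python
# Transient store: an update overwrites the record, so the old version is gone.
transient = {}

def update_transient(key, value):
    """Replace whatever was stored under `key`."""
    transient[key] = value

# History-keeping store: an update appends a new version and keeps the old ones.
periodic = {}

def update_periodic(key, value):
    """Add a new version of the record without discarding earlier ones."""
    periodic.setdefault(key, []).append(value)

# Apply the same two balance updates to both stores.
for balance in (100, 250):
    update_transient("acct-1", balance)
    update_periodic("acct-1", balance)
```

After both updates the transient store holds only the latest balance, while the history-keeping store still holds both.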

Dimension data within a warehouse exhibits one of the following properties


1. Dimension data is the information that is used to analyze the elemental transactions
2. It contains historical data
3. Dimension data consists of the minor part of the warehouse
4. The aggregated information is actually dimension

A star schema has what type of relationship between a dimension and fact table?
1. Many-to-many
2. One-to-many
3. One-to-one
4. All of these
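
For readers who want to see how a dimension table and a fact table relate in a star schema, here is a small sketch under stated assumptions: the table contents, the `product_id` key, and the variable names are invented for illustration.

```python
# Hypothetical dimension table: one row per product, keyed by product_id.
product_dim = {
    1: {"name": "Savings account", "category": "Deposit"},
    2: {"name": "Car loan", "category": "Credit"},
}

# Hypothetical fact table: each row carries a foreign key into the dimension.
sales_fact = [
    {"product_id": 1, "amount": 500},
    {"product_id": 1, "amount": 300},
    {"product_id": 2, "amount": 900},
]

# Count how many fact rows reference each dimension row.
facts_per_product = {pid: 0 for pid in product_dim}
for fact in sales_fact:
    facts_per_product[fact["product_id"]] += 1

# Join facts to the dimension and total the measure by category.
amount_by_category = {}
for fact in sales_fact:
    category = product_dim[fact["product_id"]]["category"]
    amount_by_category[category] = amount_by_category.get(category, 0) + fact["amount"]
```

Each fact row points at exactly one dimension row, while a single dimension row may be referenced by many fact rows.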

A data warehouse is an integrated collection of data because


1. It is a relational database
2. It is a collection of data derived from multiple sources
3. It contains summarized data
4. It is a collection of data of different types

The users of a data warehouse are knowledge workers; in other words, they are _________ in the organization.
1. Database Administrator
2. Decision maker
3. DWH Analyst
4. Manager

We want to add the following capabilities to Table2: show the data for 3 age groups (20-39, 40-60, over 60), 3
revenue groups (less than $10,000, $10,000-$30,000, over $30,000) and add a new type of account: Money
market. The total number of measures will be:
1. 4
2. Between 10 and 30 (boundaries included)
3. Between 40 and 60 (boundaries included)
4. More than 100

A data warehouse is said to contain a time-varying collection of data because


1. It contains historical data
2. Its contents vary automatically with time
3. Its life-span is very limited
4. Its content has explicit time-stamp

Analytical processing uses ____________ , instead of record level access.


1. Single-level hierarchy
2. None of the Given
3. Single-level aggregates
4. multi-level aggregates

Multi-dimensional databases (MDDs) typically use ___________ formats to store pre-summarized cube structures.
1. SQL
2. Non-proprietary file
3. Object oriented
4. Proprietary file

Among the following, which is NOT the objective of clustering?


1. To initiate hypothesis about the data
2. It facilitates OLAP operations
3. To cover natural groupings
4. To find consistent and valid organizations of the data

A data warehouse is which of the following?


1. Can be updated by end users
2. Organized around important subject areas
3. Contains numerous naming conventions and formats
4. Contains only current data

The generic two-level data warehouse architecture includes which of the following?
1. At least one data mart
2. Both at least one data mart and near real-time updates
3. None of these
4. Near real-time updates

____________ is changing the view of the data.

1. Dice only
2. Slice and Dice both
3. None of these
4. Slice only

What is MOLAP?
1. MOLAP is a ROLAP with a supporting multidimensional model.
2. MOLAP is an OLAP engine for (i) relational models and (ii) multidimensional OLAP operations
3. MOLAP is an OLAP engine for (i) Multidimensional models and (ii) supports multidimensional
OLAP operations
4. MOLAP is an OLAP engine for (i) multidimensional models and (ii) SQL based OLAP operations

What is reconciled data?


1. Data that has been selected and formatted for end-user support applications.
2. Data stored in the various operational systems throughout the organization.
3. Data stored in one operational system in the organization.
4. Current data intended to be the single source for all decision support systems

The need to synchronize data upon update is called


1. Data Imitation
2. Data Coherency
3. Data Manipulation
4. Data Replication

The Dice operation is concerned with :


1. Multiple runs of slice
2. Two consecutive slice operations in two different dimensions
3. Selecting certain cells of more than one dimension
4. Slice on more than one

The Slice operation deals with :


1. Selecting the cells of any one dimension of data cube
2. Selecting all but one dimension of the data cube.
3. Merging the cells of all but one dimension
4. Merging the cells along one dimension
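
The slice and dice operations asked about above can be tried out on a toy cube. This is only a sketch under one common textbook reading of the two operations; the cube contents and the helper names `slice_cube` and `dice_cube` are hypothetical.

```python
# Hypothetical cube stored as {(region, account_type, year): customers}.
cube = {
    ("Asia", "Savings", 2020): 10,
    ("Asia", "Current", 2020): 4,
    ("Asia", "Savings", 2021): 12,
    ("Europe", "Savings", 2020): 7,
    ("Europe", "Current", 2021): 5,
}

def slice_cube(cube, axis, member):
    """Fix one dimension to a single member; the result has one dimension fewer."""
    return {
        key[:axis] + key[axis + 1:]: value
        for key, value in cube.items()
        if key[axis] == member
    }

def dice_cube(cube, allowed):
    """Keep cells whose members fall in the allowed sets on several dimensions."""
    return {
        key: value
        for key, value in cube.items()
        if all(key[axis] in members for axis, members in allowed.items())
    }

# Slice on the year dimension (axis 2), then dice on region and account type.
year_2020 = slice_cube(cube, axis=2, member=2020)
asia_savings = dice_cube(cube, {0: {"Asia"}, 1: {"Savings"}})
```

Here `year_2020` is a two-dimensional view for one year, and `asia_savings` is a sub-cube restricted on two dimensions at once.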

Suppose the amount of data recorded in an organization is doubled every year. This increase is __________.
1. Quadratic
2. Linear
3. Logarithmic
4. Exponential
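
A quick calculation shows what yearly doubling looks like in numbers. The starting volume of 1 unit and the function name are arbitrary choices for this sketch.

```python
def yearly_volumes(initial, years):
    """Data volume per year when the amount doubles every year."""
    return [initial * 2 ** year for year in range(years + 1)]

volumes = yearly_volumes(initial=1, years=5)

# Differences between consecutive years also double, unlike linear growth
# where the yearly increase would stay constant.
increases = [b - a for a, b in zip(volumes, volumes[1:])]
```

After n years the volume is the initial amount multiplied by 2 to the power n, so the yearly increase itself keeps growing.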

Which of the following statements is not true about refreshing a data warehouse:
1. Updates to dimension tables may occur at different times than the fact table
2. None of these
3. It is a process of managing timing differences between the updating of data sources and the related data
warehouse objects
4. The data warehouse administrator has more control over the load time lag than the valid time lag

Data warehousing and on-line analytical processing (OLAP) are _______ elements of decision support system.
1. None of the given
2. Unusual
3. Essential
4. Optional

What is ROLAP?
1. ROLAP is an OLAP engine for (i) Multidimensional models and (ii) SQL queries, but does not support
slice and dice operations
2. ROLAP is an OLAP engine for (i) relational model and (ii) SQL based OLAP operations
3. ROLAP is a set of relational operations equivalent to OLAP operations
4. ROLAP is an OLAP engine for (i) multidimensional models and (ii) SQL based OLAP operations

Which of the following is the collection of data objects that are similar to one another within the same group?
1. Data source
2. Cluster
3. Grid
4. Table

Which of the following is not related to dimension table attributes?


1. Verbose
2. Descriptive
3. Complete
4. Equally unavailable

In a 3-tier architecture of the data warehouse, the OLAP engine is related to:
1. The Second layer of the architecture
2. The fourth layer of the architecture
3. The first layer of the architecture
4. The third layer of the architecture

Which of the following employs data mining techniques to analyze the intent of a user query, providing additional generalized or associated information relevant to the query?
1. Iceberg query method
2. Data analyzer
3. Query parser
4. Intelligent query answering

Choose the correct answer:


1. Data mining and data warehousing are the same
2. Data warehousing is the application of data mining.
3. Data mining is the application of data warehousing
4. Data mining and data warehousing are totally unrelated

The process of removing the deficiencies and loopholes in the data is called
1. Extracting of data
2. Loading of data
3. Compression of data
4. Cleaning up of data

In order to populate the data warehouse, which of the following sets of operations is appropriate?
1. Query and update
2. Refresh and load
3. Insert and delete
4. Create and edit.

The following is true of three-tier data warehouses:


1. Once created, the data marts will keep on being updated from the data warehouse at periodic
times
2. The data marts are different groups of tables in the data warehouse
3. A data mart becomes a data warehouse when it reaches a critical size
4. Once created, the data marts will directly receive their new data from the operational databases

The divide-and-conquer cube partitioning approach helps alleviate the __________ limitations of MOLAP implementation.
1. Maintainability
2. Scalability
3. Security
4. Flexibility

What is a data mart?


1. It is the access layer of the data warehouse environment that is used to get data out to the users
2. It is the process of extracting data
3. It is the collection of data in data warehouse
4. It is the process of extracting patterns from large data sets

Attempt to find a function which models the data with the least error is known as
1. None of these
2. Association rule
3. Clustering
4. Regression
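
Finding a function that models the data with the least error can be illustrated with an ordinary least-squares line fit. This is one simple instance of the idea, written in plain Python; the sample points are made up and the function names are hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept

def squared_error(xs, ys, slope, intercept):
    """Sum of squared residuals for a candidate line."""
    return sum((y - (slope * x + intercept)) ** 2 for x, y in zip(xs, ys))

xs = [1, 2, 3, 4]
ys = [3, 5, 7, 9]  # made-up points lying on y = 2x + 1
slope, intercept = fit_line(xs, ys)
```

Any other line, for example one with slope 2.5, gives a larger value of `squared_error` on the same points than the fitted line does.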

The value at the intersection of the row labeled "India" and the column "Savings" in Table2 should be:
1. 800,000
2. 200,000
3. 500,000
4. 300,000

A Drill-down operation is concerned with:


1. Which splits cells of any one dimension based on the characteristics of the dimension
2. Which merges cells of any one dimension based on the characteristics of the dimension
3. Which splits cells of two dimensions
4. Which merges cells of two dimensions
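
Drill-down and its inverse, roll-up, can be sketched on a small time hierarchy. The year, quarter, month hierarchy, the sales figures, and the helper names are assumptions made for this example only.

```python
# Hypothetical sales at the most detailed level: (year, quarter, month) -> amount.
monthly = {
    (2023, "Q1", "Jan"): 10,
    (2023, "Q1", "Feb"): 20,
    (2023, "Q1", "Mar"): 30,
    (2023, "Q2", "Apr"): 40,
}

def roll_up(cells, depth):
    """Aggregate to the first `depth` levels of the time hierarchy."""
    totals = {}
    for key, amount in cells.items():
        parent = key[:depth]
        totals[parent] = totals.get(parent, 0) + amount
    return totals

def drill_down(cells, parent):
    """Split one aggregate cell back into the detailed cells beneath it."""
    return {key: amount for key, amount in cells.items() if key[:len(parent)] == parent}

by_quarter = roll_up(monthly, depth=2)
q1_detail = drill_down(monthly, (2023, "Q1"))
```

Rolling up merges the month cells into quarter totals, and drilling down on one quarter splits its total back into the months that make it up.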

A Data mart:
1. Is a stand-alone data warehouse, by itself, of a department of the enterprise
2. Is a place where data are dumped without any restrictions.
3. Is the analysis unit of the overall architecture of a warehouse
4. Is an essential component in the sense that every data warehouse necessarily has several data marts

The load and index is which of the following?


1. A process to load the data in the data warehouse and to create the necessary indexes
2. A process to reject data from the data warehouse and to create the necessary indexes
3. A process to upgrade the quality of data before it is moved into a data warehouse
4. A process to upgrade the quality of data after it is moved into a data warehouse

A virtual warehouse is essentially:

1. It is the operational database but provides warehouse facilities through a middleware
2. It is the conceptual view of the relational and transactional database
3. A traditional relational database providing a multidimensional view through a middleware and it is different from the enterprise's operational database
4. It is the materialized view of the operational and transactional database

Choose the most correct answer:

1. Fact Constellation schema contains denormalized Dimension table
2. Snowflake schema contains normalized Dimension table
3. Star schema contains normalized Dimension table
4. Snowflake schema contains denormalized Dimension table.

What is data scrubbing?


1. A process to reject data from the data warehouse and to create the necessary indexes
2. A process to load the data in the data warehouse and to create the necessary indexes
3. A process to upgrade the quality of data after it is moved into a data warehouse
4. A process to upgrade the quality of data before it is moved into a data warehouse

Choose the most correct answer:


1. Snowflake schema contains two Fact tables
2. Fact Constellation schema contains two Fact tables
3. Star and snowflake schema contains two Fact tables.
4. Star schema contains two Fact tables

A data warehouse is built as a separate repository of data, different from the operational data of an enterprise
because
1. It contains data derived from multiple sources
2. A data warehouse cannot afford to allow corrupted data within it
3. It is necessary to keep the operational data free of any warehouse operations
4. A data warehouse contains summarized data whereas the operational database contains transactional data

________ gives total view of an organization.


1. Database
2. OLAP
3. Data Warehouse
4. OLTP

Data Warehouse provides the best support for analysis while OLAP carries out the _________ task.
1. Analysis
2. Whole
3. Mandatory
4. Prediction

Relational databases allow you to navigate the data in ____________ that is appropriate using the primary,
foreign key structure within the data model.
1. None of these
2. Any Direction
3. Only One Direction
4. Two Direction

In _________ system, the contents change with time.


1. ATM
2. DSS
3. OLTP
4. OLAP
