NAME: IDNO:
BIRLA INSTITUTE OF TECHNOLOGY & SCIENCE, PILANI
II SEMESTER 2004-2005
SS G515 DATA WAREHOUSING
Comprehensive Examination
Date: 06 May 2005
Time: 3 Hours
Weightage: 35% [Part A (closed book) – 19 & Part B (open book) – 16]
Part A – Closed Book
Points to note:
Answer multiple choice questions in the Question paper itself
Some questions may have more than one correct option. You will get credit only if you
mark all the correct options
There is NO NEGATIVE MARKING
PUT A TICK on the correct option(s)
Short answer questions are to be solved in the supplementary answer sheet provided
Multiple-Choice Questions (20*0.5=10)
Comprehensive Examination SS G515 – Data Warehousing
1. Design a star schema for the data warehouse clearly identifying the fact
table(s), dimensional table(s), their attributes and measures along with the
primary key and foreign key relationships.
2. Write an SQL query by which you can display region-wise, bank-wise, year-
wise total amount of loans disbursed from your schema.
3. Draw a cuboid that would display the result of the query specified in Q. 2
above.
4. From the cuboid of Q. 3 above, if we want to see the amount of loan disbursed
during the year 2000 for the state of Maharashtra, which sequence of OLAP
operations would you need to perform?
5. Show the lattice of cuboids for the multi-dimensional data considering all the
dimensions in your schema using a single level of hierarchy for each dimension.
6. Draw possible schema hierarchies for each dimension.
7. Based on the schema hierarchies drawn in Q. 6 above, determine the total
number of cuboids, considering all the aggregation levels.
8. Draw a set of aggregated fact tables and their corresponding shrunken
dimensions for all the levels of hierarchies along the branch dimension. What are
the implications of doing this on the ETL process?
9. Once your data warehouse is ready and operational, there is a new
requirement to maintain the amount of loan repaid at the same level of
granularity. Extend your star schema to a fact constellation schema to take care
of the new requirement.
10. What is the additivity of the fact(s) in your fact table(s)?
[2+1+1+1+1+1+1+1+2+1]
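The counting rule behind Q. 5 and Q. 7 can be sketched as follows. With n dimensions and a single level per dimension, the lattice contains 2^n cuboids (every subset of the dimensions). With L_i hierarchy levels on dimension i, not counting the virtual top level "all", the total is the product of (L_i + 1). The dimension names and level counts below are hypothetical placeholders, not the expected answer; substitute those of your own schema.

```python
from itertools import combinations
from math import prod


def lattice(dimensions):
    """Group every cuboid (subset of the dimensions) by how many dimensions it keeps."""
    return {k: list(combinations(dimensions, k)) for k in range(len(dimensions) + 1)}


def total_cuboids(levels_per_dimension):
    """Product of (L_i + 1); the +1 accounts for the virtual top level 'all'."""
    return prod(levels + 1 for levels in levels_per_dimension.values())


# Hypothetical dimensions, single level each: 2^3 = 8 cuboids.
dims = ["region", "bank", "year"]
cuboids = lattice(dims)
n_single_level = sum(len(group) for group in cuboids.values())

# Hypothetical hierarchy depths: (3 + 1) * (2 + 1) * (1 + 1) = 24 cuboids.
levels = {"branch": 3, "time": 2, "loan_type": 1}
n_all_levels = total_cuboids(levels)

print(n_single_level, n_all_levels)
```

The key `0` of `cuboids` holds the apex cuboid (no dimensions, a single grand total) and the largest key holds the base cuboid.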
Problem 2
Consider the attendance fact table in the BITS data warehouse. The dimensions
for this fact table are student, course, faculty, time, room, and campus and there
is a dummy fact (4 bytes). Assume finest granularity. The warehouse contains
data for the last 5 academic years for all campuses. It is found that the
attendance is 70%. If there are 10000 students in the student dimension,
estimate the size of the fact table (in GB) given that there are 200 courses and
each course has 40 lectures per semester.
Create an aggregated schema which gives the course-wise total attendance for
the whole semester. (Draw the schema clearly.)
If we need to calculate the percentage attendance for any course, do we need
information from any other fact table?
Did sparsity failure occur? Justify your answer.
[2+1+1+1]
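The arithmetic for the size estimate can be sketched as below. The question does not state the number of semesters per year, the number of courses each student registers for, or the width of a dimension key, so the values marked "assumed" are illustrative only. The sketch also assumes a row is stored only when a student is present, which is one possible reading of the 70% figure. The total of 200 courses is not used under these assumptions.

```python
# Figures given in the question.
STUDENTS = 10_000
LECTURES_PER_COURSE = 40      # per semester
YEARS = 5
ATTENDANCE_PERCENT = 70
NUM_DIMENSIONS = 6            # student, course, faculty, time, room, campus
FACT_BYTES = 4                # dummy fact

# Assumed values, not given in the question.
SEMESTERS_PER_YEAR = 2        # assumed
COURSES_PER_STUDENT = 5       # assumed registrations per student per semester
KEY_BYTES = 4                 # assumed width of each surrogate key


def attendance_fact_size():
    """Return (row count, bytes per row, size in GB) under the assumptions above."""
    possible = (STUDENTS * COURSES_PER_STUDENT * LECTURES_PER_COURSE
                * SEMESTERS_PER_YEAR * YEARS)
    rows = possible * ATTENDANCE_PERCENT // 100   # rows only for students present
    row_bytes = NUM_DIMENSIONS * KEY_BYTES + FACT_BYTES
    size_gb = rows * row_bytes / 1e9              # decimal gigabytes
    return rows, row_bytes, size_gb


rows, row_bytes, size_gb = attendance_fact_size()
print(rows, row_bytes, size_gb)
```

Changing any assumed constant scales the result linearly, so the same function can be reused once the intended values are fixed.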