Course Objectives
No Objective
CO1 This field covers all aspects of computing and information access across multiple processing elements connected by any form of communication network, either local area or wide area.
CO2 There has been a steady growth in the development of contemporary applications that demonstrate their efficacy by connecting millions of users/applications/machines across the globe without relying on a traditional client-server approach.
CO3 The general computing trend is to leverage shared resources and massive amounts of data over the Internet. This course aims to provide an understanding of theory and systems aspects of distributed data
Text Book(s)
T1 M. Tamer Özsu and Patrick Valduriez, Principles of Distributed Database Systems, Third Edition
T2 Pradeep K. Sinha, Distributed Operating Systems: Concepts and Design
Reference Book(s)
HBase
https://hbase.apache.org
http://www.tutorialspoint.com/hbase/
MapReduce
https://www-01.ibm.com/software/data/infosphere/hadoop/mapreduce/
http://www.tutorialspoint.com/hadoop/hadoop_mapreduce.htm
SAN
http://searchstorage.techtarget.com/definition/storage-area-network-SAN
http://www.snia.org/education/storage_networking_primer/san/what_san
NAS
http://searchstorage.techtarget.com/definition/network-attached-storage
http://www.webopedia.com/TERM/N/network-attached_storage.html
Content Structure
Learning Outcomes:
No Learning Outcomes
LO2 Understanding of Distributed Storage systems and the technologies used to implement
Course No SS ZG554
Lead Instructor
Glossary of Terms
1. Contact Hour (CH) stands for an hour-long live session with students, conducted either in a physical
classroom or enabled through technology. In this model of instruction, instructor-led sessions will be
for 22 CH.
a. Pre CH = Self Learning done prior to a given contact hour
b. During CH = Content to be discussed during the contact hour by the course instructor
c. Post CH = Self Learning done post the contact hour
2. Contact Session (CS) stands for a two-hour-long live session with students, conducted either in a
physical classroom or enabled through technology. In this model of instruction, instructor-led
sessions will be for 11 CS.
a. Pre CS = Self Learning done prior to a given contact session
b. During CS = Content to be discussed during the contact session by the course instructor
c. Post CS = Self Learning done post the contact session
3. RL stands for Recorded Lecture or Recorded Lesson. It is presented to the student through an online
portal. A given RL unfolds as a sequence of video segments interleaved with exercises.
4. SS stands for Self-Study to be done as a study of relevant sections from textbooks and reference
books. It could also include study of external resources.
5. LE stands for Lab Exercises.
6. HW stands for Homework.
7. M stands for Module. A module is a standalone quantum of designed content. A typical course is
delivered using a string of modules. M2 means module 2.
Detailed Structure
Introductory Video/Document: << Introducing the faculty, overview of the course, structure and
organization of topics, guidance for navigating the content, and expectations from students>>
Each of the sub-modules of Recorded Lectures (RLx.y) shall be delivered via 30–60 minute videos,
followed by:
Contact sessions (CSx.y) of 2 hours each for illustrating the concepts discussed in the videos with
exercises, tutorials and discussion of case problems (wherever appropriate); a contact session (CS)
may cover more than one recorded-lecture (RL) video.
Course Contents
Contact Hour 1: Distributed Data Storage Technology
T1 – 1
RL 1.1 T1 - 2
Pre CH RL
RL 1.2 R1 – 1
R1 – 2
Lab Reference
RL 1.3 T1 - 2
Pre CH RL
RL 1.4 R1 – 2
Lab Reference
R1 – 2
Pre CH RL RL 1.5
RL - 1
Lab Reference
T2 - 9.1
Pre CH RL Features of Distributed File system
T2 - 9.2
Lab Reference
Pre CH RL - -
Lab Reference
Contact Hour 6: Distributed Databases
Time Type Description Reference
RL 2.1 T1 – 1
Pre CH RL
RL 2.2 RL - 2
During CH CH Distributed DBMS Systems; Architectural Models for DDBS T1 – 1.7.1, 1.7.2, 1.7.3, 1.7.4, 1.7.5, 1.7.6
Post CH SS - -
Lab Reference
RL 2.3
T1 – 1
Pre CH RL RL 2.4
RL - 2
RL 2.5
During CH CH Distributed DBMS Architecture; Distributed Data Sources T1 – 1.7.8, 1.7.9, 1.7.10
Lab Reference
RL 3.1
RL 3.2
T1 – 3
Pre CH RL RL 3.3
RL - 3
RL 3.4
RL 3.5
Framework of Distribution
T1 – 3.1
Distributed Design Issues
T1 – 3.2
During CH CH Top-Down Design Process
T1 – 3.3
Fragmentation
T1 – 3.4
Allocation
Lab Reference
Contact Hour 9: Distributed Database Design Issues & Integration
Time Type Description Reference
T1 – 4
Pre CH RL RL 3.6
RL - 3
Lab Reference
RL 4.1
RL 4.2 T1 – 5
Pre CH RL
RL 4.3 RL - 4
RL 4.4
During CH CH Database Security: Discretionary Access Control, Multilevel Access Control, Distributed Access Control T1 – 5.2
Lab Reference
Pre CH RL - -
During CH CH View Management: Views in Centralized DBMSs, Views in Distributed DBMSs, Maintenance of Materialized Views T1 – 5.1
Lab Reference
Pre CH RL CH 1 to 11 -
During CH CH Mid-Semester Review CH 1 to 11
Post CH SS CH 1 to 11 -
Lab Reference
RL 5.1
RL 5.2 T1 – 13
Pre CH RL
RL 5.3 RL – 5
RL 5.4
Post CH SS
Lab Reference
RL 5.5 T1 – 13
Pre CH RL
RL 5.6 RL – 5
Lab Reference
RL 6.1
RL 6.2
T1 - 14
Pre CH RL RL 6.3
RL - 6
RL 6.4
RL 6.5
Lab Reference
Lab Reference
RL 7.1
RL 7.2 T1 - 17
Pre CH RL
RL 7.3 RL - 7
RL 7.4
Lab Reference
RL 7.5
T1 - 17
Pre CH RL RL 7.6
RL - 7
RL 7.7
During CH CH Web Crawling; Indexing; Ranking and Link Analysis; Keyword Search T1 – 17.2
Lab Reference
Pre CH RL - -
During CH CH Web Querying: Semi-structured Data Approach, Web Query Language Approach, Question Answering, Searching and Querying the Hidden Web T1 – 17.3
RL 8.1
Pre CH RL RL 8.2 Online References
RL 8.3
Lab Reference
Pre CH RL RL 8.4 RL - 8
Lab Reference
Pre CH RL CH 1 TO 21 -
Post CH SS CH 1 TO 21 -
Lab Reference
Evaluation Scheme:
Legend: EC = Evaluation Component; AN = After Noon Session; FN = Fore Noon Session
No    Name                    Type         Duration  Weight  Day, Date, Session, Time
EC-1  Quiz-I/Assignment-I     Online       -         5%      September 10 to 20, 2018
      Quiz-II                 Online       -         5%      October 20 to 30, 2018
      Quiz-III/Assignment-II  Online       -         5%      November 10 to 20, 2018
EC-2  Mid-Semester Test       Closed Book  2 hours   35%     30/09/2018 (AN), 2 PM – 4 PM
EC-3  Comprehensive Exam      Open Book    3 hours   50%     25/11/2018 (AN), 2 PM – 5 PM
Note - Evaluation components can be tailored depending on the proposed model.
Important Information:
Syllabus for Mid-Semester Test (Closed Book): Topics in CS 1-5.
Syllabus for Comprehensive Exam (Open Book): All topics given in plan of study
Evaluation Guidelines:
1. For Closed Book tests: No books or reference material of any kind will be permitted.
Laptops/Mobiles of any kind are not allowed. Exchange of any material is not allowed.
2. For Open Book exams: Use of prescribed and reference textbooks, in original (not photocopies), is
permitted. Class notes/slides as reference material in filed or bound form are permitted. However,
loose sheets of paper will not be allowed. Use of calculators is permitted in all exams.
Laptops/Mobiles of any kind are not allowed. Exchange of any material is not allowed.
3. If a student is unable to appear for the Regular Test/Exam due to genuine exigencies, the student
should follow the procedure to apply for the Make-Up Test/Exam. The genuineness of the reason for
absence in the Regular Exam shall be assessed prior to giving permission to appear for the Make-up
Exam. Make-Up Test/Exam will be conducted only at selected exam centres on the dates to be
announced later.
It shall be the responsibility of the individual student to maintain the self-study schedule regularly as
given in the course handout, attend the lectures, and take all the prescribed evaluation components such as
Assignment/Quiz, Mid-Semester Test and Comprehensive Exam according to the evaluation scheme
provided in the handout.