Unit-1
Estimated Time: 5 hrs.
Syllabus (Unit-1)
• Concepts of Data Warehouse and Data Mining,
including their functionalities
• Application of Data Warehouse and Data
Mining
• Issues in Data Warehouse and Data Mining
• Stages of Knowledge Discovery in
Databases (KDD)
• Setting up a KDD environment
What is Data?
• A representation of facts, concepts, or
instructions in a formal manner suitable for
communication, interpretation, or processing
by human beings or by computers.
[Figure: pyramid with levels, top to bottom: ??, Wisdom, Knowledge, Information, Data]
Problems with Data
• Explosive growth of data: from terabytes
to petabytes
• High-dimensionality of data
• High complexity of data
• New and sophisticated applications
• Fast-developing computer science and
engineering generate new demands
Evolution of Database Technology
• 1960s: Data collection, database creation, IMS
and network DBMS
• 1970s: Relational data model, relational DBMS
implementation
• 1980s: RDBMS, advanced data models
(extended-relational, OO, deductive, etc.) and
application-oriented DBMS (spatial, scientific,
engineering, etc.)
• 1990s—2000s: Data mining and data
warehousing, multimedia databases, and Web
databases
Size of Databases
• Terabytes -- 10^12 bytes: Walmart -- 24 Terabytes
• Petabytes -- 10^15 bytes: Geographic Information
Systems
• Exabytes -- 10^18 bytes: National Medical Records
• Zettabytes -- 10^21 bytes: Weather images
• Yottabytes -- 10^24 bytes: Intelligence Agency Videos
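The unit sizes above are plain powers of ten, so conversions between them are simple arithmetic. The following is a minimal Python sketch of that arithmetic, not part of the original slides; the names `UNITS`, `to_bytes`, and `largest_unit` are hypothetical helpers chosen for illustration, and decimal (SI) units are assumed throughout.

```python
# Decimal (SI) byte units from the list above, as powers of ten.
# UNITS, to_bytes and largest_unit are illustrative names, not a standard API.
UNITS = [
    ("terabyte", 10**12),
    ("petabyte", 10**15),
    ("exabyte", 10**18),
    ("zettabyte", 10**21),
    ("yottabyte", 10**24),
]


def to_bytes(value, unit):
    """Convert a quantity expressed in `unit` to a number of bytes."""
    return value * dict(UNITS)[unit]


def largest_unit(num_bytes):
    """Express num_bytes in the largest listed unit that does not exceed it."""
    for name, size in reversed(UNITS):
        if num_bytes >= size:
            return num_bytes / size, name
    # Smaller than one terabyte: leave the value in bytes.
    return num_bytes, "byte"


# The 24-terabyte example from the list, written out in bytes.
print(to_bytes(24, "terabyte"))   # 24000000000000
print(largest_unit(5 * 10**18))   # (5.0, 'exabyte')
```

Each step up the list multiplies the size by 1,000, so one yottabyte is 10^12 terabytes.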