Sunteți pe pagina 1din 2

Silicon Institute of Technology

COURSE HANDOUT (Even Sem 2011-12) Subject Code & Name : PECS 5409 : DATA & WEB MINING : 2008-2012 Branch/Sem/Batch : 8th Sem: B. Tech (CSE) Name of Faculty: Manoj Kumar Pandia Course Objective: To aware the students about the different techniques of data mining and how these data mining techniques can be applied to www. Pre-requisites: None Lecture Schedule: Sl. No. 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 Topics to be covered
Intro to Data Mining, Data Mining functionalities, patterns in data mining, Type of patterns, Classification of Data Mining Systems, Major issues in Data Mining Association Rule Mining Mining Single-Dimensional Boolean Association Rules from Transactional Databases Mining Multilevel Association Rules from Transaction Databases Mining Multidimensional Association Rules from Relational Databases and Data Warehouses From Association Mining to Correlation Analysis. Constraint-Based Association Mining Issues Regarding Classification and Prediction Classification by Decision Tree Induction Bayesian Classification Classification by Back propagation Classification Based on Concepts from Association Rule Mining Other Classification Methods. Prediction and Classifier Accuracy Types of Data in Cluster Analysis A Categorization of Major Clustering Methods, Partitioning Methods

Book / Chapter HK Ch-1 HK Ch-5 HK Ch-5 HK Ch-5.3 HK Ch-5.3.2 HK Ch-5.4 HK Ch-5.5 HK Ch-6.2 HK Ch-6.3 HK Ch-6.4 HK Ch-6.6 HK Ch-6.8 HK Ch-6.10,11,12 HK Ch-7.2 HK Ch-7.3, 7.4 HK Ch-7.5 HK Ch-7.6 HK Ch-7.7 HK Ch-7.8 HK Ch-7.11 Bing Ch-6.1 Bing Ch-6.2 Bing Ch-6.3 Bing Ch-6.4 Bing Ch-6.5 1

No. of Classes 1 1 1 1 1 1 1 1 1 1 2 1 2 1 1 1 1 1 1 1 1 1 1 1 1

CLASS TEST I
Hierarchical methods Density-Based Methods Grid-Based Methods Model-Based Clustering Methods Outlier Analysis Basic Concepts of IR IR models Relevance Feedback Evaluation Measures Text and Web Page Pre-Processing

26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42

Graph Mining

Bing Ch-6.1 CLASS TEST II Bing Ch-7.1 Bing Ch-7.2 Bing Ch-7.3 Bing Ch-7.4 Bing Ch-7.5 Bing Ch-8.1, 8.3 Bing Ch-9.2 Bing Ch-9.4 Bing Ch-9.5 Bing Ch-10.2 Bing Ch-10.4

1 1 1 2 2 1 1 1 1 1 1 1 1 1 1 1 1

Social Network Analysis Co-Citation and Bibliographic Coupling Page Rank HITS Community Discovery Basic and Universal Crawlers Wrapper Generation: Wrapper Induction Automatic Wrapper Generation: Problems String Matching and Tree Matching Pre-Processing for Schema Matching Domain and Instance-Level Matching

CLASS TEST III1


Sentiment Classification Bing Ch-11.1 Feature-Based Opinion Mining and Summarization Bing Ch-11.2 Opinion Search, Opinion Spam Bing Ch-11.4 11.5 Data Collection and Pre-Processing Bing Ch-12.1 Data Modeling for Web Usage Mining, Discovery and Analysis of Web Bing Ch-12.2, 12.3 Usage Patterns

Total No of Classes: 42

Text Books:

nd

1. J. Han & M. Kamber, Data Mining: Concepts and Techniques, Morgan Kaufmann, 2 ed, 2006. (Module 1) (HK) 2. Bing Liu. Web Data Mining, Exploring Hyperlinks, Contents and Usage Data, Springer Publishers (Module 2 and Module 3) (Bing) References:
th

1. Margret H Dunham,Data Mining Introductory and advanced topics, Pearson Education, 6 ed,2009, 2. Shawkat Ali and Saleh Wasimi,Data Mining: Methods and Techniques, Cengage Learning, Indian Edition,2009,

*****

S-ar putea să vă placă și