Table of Contents

Declaration............................................................................................................................................. i
Certificate ............................................................................................................................................. ii
Acknowledgement ...............................................................................................................................iii
Abstract................................................................................................................................................ iv
List of figures ........................................................................................................................................ v
List of tables......................................................................................................................................... vi

1. INTRODUCTION ............................................................................................................................. 1
1.1 Introduction ...................................................................................................................................... 1

1.2 Data Preprocessing........................................................................................................................... 4

1.2.1 Data Discretization................................................................................................................ 4

1.2.2 Feature Selection ................................................................................................................... 4

1.3 Data Cleaning Steps ......................................................................................................................... 4

1.3.1 Handle missing data ............................................................................................................... 4

1.3.2 Outlier Analysis .................................................................................................................... 5

1.3.3 Remove Duplication .............................................................................................................. 5

1.3.4 Data Aggregation ................................................................................................................... 5

1.3.5 Data Normalization ................................................................................................................ 5

1.4 Machine learning ............................................................................................................................. 6

1.4.1 Logistic Regression................................................................................................................ 6

1.4.2 Decision Tree Classifier ......................................................................................................... 6

1.4.3 Naïve Bayes ........................................................................................................................... 6

1.4.4 Support Vector Machine ........................................................................................................ 7

1.4.5 K Nearest Neighbors Classifier ............................................................................................. 7

1.4.6 Linear Discriminant Analysis ................................................................................................ 7

1.5 Objective .......................................................................................................................................... 8





1.6 Proposed System .............................................................................................................................. 8

1.7 Limitations ....................................................................................................................................... 9

2. LITERATURE REVIEW ................................................................................................................ 10

2.1 Programming Language used ....................................................................................................... 11

2.2 Data Visualization ......................................................................................................................... 11

2.2.1 Matplotlib............................................................................................................................ 11

2.2.2 Seaborn ............................................................................................................................... 12

2.3 Existing Predictive Models ............................................................................................................ 13

2.4 Predictive Analytics ....................................................................................................................... 13

2.5 Steps to develop predictive model ................................................................................................. 14

2.6 Selecting model according to the situation .................................................................................... 14

2.7 Deploying the predictive model ..................................................................................................... 14

2.8 Assessment of predictive model .................................................................................................... 14

2.9 Applications of predictive analytics............................................................................................... 15

2.10 Different prediction models used for diabetes ............................................................................. 15

2.11 Clinical model .............................................................................................................................. 17

3. EXPERIMENTAL ANALYSIS ...................................................................................................... 18

3.1 Experimental setup and procedure ................................................................................................. 18

3.2 Flow diagram to solve prediction problem .................................................................................... 19

3.3 Assumptions for creating decision tree ....................................................................................... 21

3.4 Data distribution............................................................................................................................. 22

3.5 Search strategy ............................................................................................................................... 25

3.6 Quantitative data extraction and analysis ...................................................................................... 26

3.7 Qualitative data extraction and analysis ....................................................................................... 26

4. RESULT AND DISCUSSION ........................................................................................................ 27

4.1 Comparison on default data .......................................................................................................... 27





4.2 Comparison of calibration of classifiers ........................................................................................ 27

4.3 Data Preprocessing......................................................................................................................... 31

4.4 Accuracy graph for KNN Algorithm ............................................................................................. 32

4.5 Outcome ......................................................................................................................................... 32

5. CONCLUSION ................................................................................................................................ 33

6. FUTURE WORK ............................................................................................................................. 34

REFERENCES .................................................................................................................................... 35



