Documente Academic
Documente Profesional
Documente Cultură
Disease Patients
18-11-2019 2
Base Paper:
Anam Mustaqeema, Syed Muhammad
Anwar, Abdul Rashid Khanb,Muhammad
Majid, A statistical analysis based
recommender model for heart disease
patients , International Journal of Medical
Informatics 108(2017) 134-145.
18-11-2019 3
Data Mining Introduction:
Data mining is a process of extracting interesting
patterns from large sets of data that can be used for
predictions.
In today’s world data is everything , it plays a vital
role in business analysis but it has it’s own
prominence in medical field.
To extract features from huge sets of data, as per
the given attributes we use data mining techniques.
18-11-2019 4
Heart disease Introduction:
18-11-2019 5
Abstract:
• This project provides a prediction model that
enhances disease prediction to cardiac patients.
• The prediction model identifies the disease of a
patient and classifies it into two classes i.e.,
Heart Disease , No Heart Disease
18-11-2019 6
Architecture:
18-11-2019 7
18-11-2019 8
Algorithm (HD_PM) Model
Input: Patient's clinical dataset D
Output: Disease prediction P=0,1
18-11-2019 9
Dataset description
collected from UCI machine learning repository
14 attributes
The num field refers to the presence of heart disease in
the heart patient integer valued from 0 (no presence)
to 1
Dataset
18-11-2019 10
Attribute description
1. age - age in years
2. sex-sex (1= male; 0 = female)
3. cp-chest pain type
4. trestbps-resting blood pressure (in mm Hg on admission to
the hospital)
5.chol-serum cholestoral in mg/dl
6. fbs-fasting blood sugar > 120 mg/dl(T,F)
7. restecg-resting electrocardiographic results
8.thalach- maximum heart rate achieved
9. exang-exercise induced angina (1 = yes; 0 = no)
10. oldpeak-ST depression induced by exercise relative to rest
11. slope-the slope of the peak exercise ST segment
12. ca-number of major vessels (0-3) colored by flourosopy
13. thal-3 = normal; 6 = fixed defect; 7 = reversable defect
14. num (the predicted attribute)
18-11-2019 11
The Classifiers used are:
SVM
provides a hyper plane to classify the labelled data.
a binary classifier
For M number of classes, this classification gives (M -1)(M)/
2 number of binary classifiers
Random Forest
a large number of individual decision trees that operate as an
ensemble
It is similar to bagging, but when the classification tree is
growing, random forest considers only the binary split points
18-11-2019 12
Contd:
MLP
It has more than one perceptron and has two phases.
First is forward pass, where the classified outputs for the
given input data are evaluated by
18-11-2019 13
Contd:
Gradient Boosting
a loss function to be optimized
a weak learner to make predictions
an additive model to add weak learners to minimize the
loss function
Decision Trees
most widely used classifier for supervised learning
used for both classification and regression
generate understandable rules inferred from the data
features
18-11-2019 14
Contd:
Logistic Regression
classification algorithm used to assign observations to a
discrete set of classes
Unlike linear regression which outputs continuous number
values, logistic regression transforms its output using the
logistic sigmoid function
returns a probability value which can then be mapped to two
or more discrete classes
18-11-2019 15
contd:
RANDOM FOREST
18-11-2019 16
Code:
SVM
18-11-2019 17
Decision Tree
18-11-2019 18
Gradient Boosting
18-11-2019 19
LOGISTIC REGRESSION
18-11-2019 20
contd:
MLP
18-11-2019 21
GUI
18-11-2019 22
GUI
18-11-2019 23
GUI
18-11-2019 24
GUI
18-11-2019 25
GUI
18-11-2019 26
GUI
18-11-2019 27
GUI
18-11-2019 28
Thank You
18-11-2019 29