
Q-4

Problem Statement: Implement the Naïve Bayes Classifier algorithm.

Theory: Naive Bayes is a simple technique for constructing classifiers: models that
assign class labels to problem instances, represented as vectors of feature values,
where the class labels are drawn from some finite set. There is not a
single algorithm for training such classifiers, but a family of algorithms based on a
common principle: all naive Bayes classifiers assume that the value of a particular
feature is independent of the value of any other feature, given the class variable.
For example, a fruit may be considered to be an apple if it is red, round, and about
10 cm in diameter. A naive Bayes classifier considers each of these features to
contribute independently to the probability that this fruit is an apple, regardless of
any possible correlations between the color, roundness, and diameter features.
For some types of probability models, naive Bayes classifiers can be trained very
efficiently in a supervised learning setting. In many practical applications,
parameter estimation for naive Bayes models uses the method of maximum
likelihood; in other words, one can work with the naive Bayes model without
accepting Bayesian probability or using any Bayesian methods.
Despite their naive design and apparently oversimplified assumptions, naive Bayes
classifiers have worked quite well in many complex real-world situations. In 2004,
an analysis of the Bayesian classification problem showed that there are sound
theoretical reasons for the apparently implausible efficacy of naive Bayes
classifiers.[7] Still, a comprehensive comparison with other classification
algorithms in 2006 showed that Bayes classification is outperformed by other
approaches, such as boosted trees or random forests.[8]
An advantage of naive Bayes is that it requires only a small amount of training
data to estimate the parameters necessary for classification.
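
To make the independence assumption concrete, the apple example factorizes the
posterior into one term per feature (a symbolic illustration, not a calculation
from real data):

P(apple | red, round, 10 cm) ∝ P(apple) · P(red | apple) · P(round | apple) · P(10 cm | apple)

Each factor on the right is estimated separately from the training data, which is
what makes these models so cheap to train.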

Procedure/Algorithm:

Initialization: Each data sample is represented by an n-dimensional feature vector
X = {x1, x2, …, xn}, whose values correspond to attributes A1, A2, …, An. Suppose
there are m classes, namely C1, C2, …, Cm.

Step 1: Given an unknown sample X, it belongs to the class label having the highest
posterior probability conditioned on X. That is, the Naïve Bayes Classifier assigns
an unknown data sample X to a class label Ci if and only if P(Ci|X) > P(Cj|X)
for all 1 <= j <= m, j != i.

Step 2: By Bayes' theorem, P(Ci|X) = P(X|Ci) · P(Ci) / P(X)

Step 3: Since P(X) is the same for every class, only P(X|Ci) · P(Ci) needs to be
maximized. If the class priors are unknown, they are commonly assumed equal,
P(C1) = P(C2) = … = P(Cm); otherwise each prior is estimated from the training data
as P(Ci) = si/S, where si is the number of training samples of class Ci and S is
the total number of training samples.

Step 4: Hence it suffices to show that:-

P(X|Ci) · P(Ci) > P(X|Cj) · P(Cj)

Step 5: Under the naive independence assumption, P(X|Ci) = Π (k=1 to n) P(xk|Ci)

Step 6: P(xk|Ci) is calculated using one of the following options:-

1) If Ak is categorical, then P(xk|Ci) = sik/si, where sik is the number of training
samples of class Ci having value xk for attribute Ak, and si is the number of
training samples of class Ci.

2) If Ak is continuous, the attribute is typically assumed to follow a Gaussian
distribution, and P(xk|Ci) is obtained from the Gaussian density function:

P(xk|Ci) = (1 / (√(2π) · σCi)) · e^(−(xk − μCi)² / (2σCi²))

where μCi is the mean and σCi is the standard deviation of attribute Ak over the
training samples of class Ci.

Step 7: Similarly, calculate P(X|Cj) · P(Cj) for every other class Cj, exactly as
done for Ci, and assign X to the class with the largest value; a sketch of the
whole procedure follows.
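
A minimal sketch of Steps 1-7 in Python (the function names, the likelihood
callback, and the gaussian_density helper are illustrative assumptions, not part
of the assignment program):

import math

def gaussian_density(xk, mu, sigma):
    # Step 6, option 2: Gaussian density for a continuous attribute, with
    # mu and sigma estimated from the training samples of class Ci
    return (1.0 / (math.sqrt(2 * math.pi) * sigma)) * \
        math.exp(-((xk - mu) ** 2) / (2 * sigma ** 2))

def classify(sample, classes, prior, likelihood):
    # Steps 2-7: pick the class Ci maximizing P(X|Ci) * P(Ci);
    # P(X) is dropped because it is the same for every class (Step 3)
    best_class, best_score = None, -1.0
    for ci in classes:
        score = prior[ci]                   # P(Ci) = si / S
        for k, xk in enumerate(sample):
            score *= likelihood(k, xk, ci)  # Step 5: product of P(xk|Ci)
        if score > best_score:
            best_class, best_score = ci, score
    return best_class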

Demonstration:

An example of a feature vector and corresponding class variable is (refer to the
1st row of the dataset):
X = (Rainy, Hot, High, False)

y = No

So basically, P(y|X) here means the probability of "not playing golf" given that
the weather conditions are a "rainy outlook", "hot temperature", "high humidity"
and "no wind".
So, we calculate the following probabilities using the Naïve Bayes Classifier.

The conditional probabilities P(xi | yj) are calculated manually for each xi in X
and each yj in y from per-attribute frequency tables of the dataset. For example,
the probability of playing golf given that the temperature is cool, i.e.
P(temp. = cool | play golf = Yes), is 3/9.
We also need the class probabilities P(y), obtained from a fifth frequency table.
For example, P(play golf = Yes) = 9/14.
P(Yes|today) = 0.67
P(No|today) = 0.33
Therefore, P(Yes|today) > P(No|today), so the prediction is that golf would be
played: 'Yes'.
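
The 0.67 and 0.33 above are the two unnormalized class scores P(X|y) · P(y)
rescaled so that they sum to 1. A minimal sketch of that last step (the two score
values below are placeholders in a 2:1 ratio, not the actual table products):

def posterior(score_yes, score_no):
    # Normalize the two class scores so they sum to 1;
    # this replaces dividing by P(X) in Bayes' theorem
    total = score_yes + score_no
    return score_yes / total, score_no / total

print(posterior(0.02, 0.01))   # (0.666..., 0.333...) -> rounds to 0.67 / 0.33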
Program Description:-
import numpy as np

class Classification(object):
    def __init__(self):
        self.classlabel = 2
        # Training data: the classic 14-row buys_computer dataset,
        # one array per attribute; buys_computer is the class label
        self.age = np.array(['<=30', '<=30', '31..40', '>40', '>40', '>40', '31..40',
                             '<=30', '<=30', '>40', '<=30', '31..40', '31..40', '>40'],
                            dtype=str)
        self.income = np.array(['high', 'high', 'high', 'medium', 'low', 'low', 'low',
                                'medium', 'low', 'medium', 'medium', 'medium', 'high',
                                'medium'], dtype=str)
        self.student = np.array(['no', 'no', 'no', 'no', 'yes', 'yes', 'yes', 'no',
                                 'yes', 'yes', 'yes', 'no', 'yes', 'no'], dtype=str)
        self.credit_rating = np.array(['fair', 'excellent', 'fair', 'fair', 'fair',
                                       'excellent', 'excellent', 'fair', 'fair', 'fair',
                                       'excellent', 'excellent', 'fair', 'excellent'],
                                      dtype=str)
        self.buys_computer = np.array(['no', 'no', 'yes', 'yes', 'yes', 'no', 'yes',
                                       'no', 'yes', 'yes', 'yes', 'yes', 'yes', 'no'],
                                      dtype=str)
        print(self.age)
        print(self.income)
        print(self.student)
        print(self.credit_rating)
        print(self.buys_computer)
        print("Enter age, income, student and credit_rating values of the new sample:")
        self.a = input()
        self.i = input()
        self.s = input()
        self.c = input()

    def naive_bayes(self):
        # Count training samples in each class (Step 3: P(Ci) = si / S)
        c = 0    # samples with buys_computer == 'yes'
        c1 = 0   # samples with buys_computer == 'no'
        for i in range(len(self.buys_computer)):
            if self.buys_computer[i] == 'yes':
                c = c + 1
            else:
                c1 = c1 + 1
        prior_yes = c / len(self.buys_computer)   # P(yes)
        prior_no = c1 / len(self.buys_computer)   # P(no)

        prob1 = np.zeros(4, dtype=float)   # P(xk|yes) for each attribute
        prob2 = np.zeros(4, dtype=float)   # P(xk|no)  for each attribute
        p = p1 = p2 = p3 = 0   # attribute-match counts within class 'yes'
        q = q1 = q2 = q3 = 0   # attribute-match counts within class 'no'

        # Count, per class, how many samples match each entered value
        for j in range(len(self.age)):
            if self.buys_computer[j] == 'yes':
                if self.a == self.age[j]:
                    p = p + 1
                if self.i == self.income[j]:
                    p1 = p1 + 1
                if self.s == self.student[j]:
                    p2 = p2 + 1
                if self.c == self.credit_rating[j]:
                    p3 = p3 + 1
            else:
                if self.a == self.age[j]:
                    q = q + 1
                if self.i == self.income[j]:
                    q1 = q1 + 1
                if self.s == self.student[j]:
                    q2 = q2 + 1
                if self.c == self.credit_rating[j]:
                    q3 = q3 + 1

        # Step 6, option 1: P(xk|Ci) = sik / si for categorical attributes
        prob1[0], prob1[1], prob1[2], prob1[3] = p / c, p1 / c, p2 / c, p3 / c
        prob2[0], prob2[1], prob2[2], prob2[3] = q / c1, q1 / c1, q2 / c1, q3 / c1
        print(prob1[0], prob1[1], prob1[2], prob1[3])
        print(prob2[0], prob2[1], prob2[2], prob2[3])

        # Steps 4-5: compare P(X|Ci) * P(Ci) for the two classes
        pro1 = prob1[0] * prob1[1] * prob1[2] * prob1[3] * prior_yes
        pro2 = prob2[0] * prob2[1] * prob2[2] * prob2[3] * prior_no
        print(pro1, "\t", pro2)
        print("Class label of buys_computer of the new data sample is:")
        if pro1 > pro2:
            print("yes")
        else:
            print("no")

C = Classification()
C.naive_bayes()
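
For example, assuming the 14-row training arrays above, entering '<=30', 'medium',
'yes' and 'fair' for the new sample (the classic textbook query for this dataset)
gives P(X|yes) · P(yes) = (2/9)(4/9)(6/9)(6/9)(9/14) ≈ 0.028 and
P(X|no) · P(no) = (3/5)(2/5)(1/5)(2/5)(5/14) ≈ 0.007, so the program prints "yes".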

Output:-

Report :
1) Naive Bayes can be modeled in several different ways, including with normal,
lognormal, gamma and Poisson density functions.
2) Applying the Laplace correction to handle attribute values with zero counts in
the X variables improves performance (see the sketch after this list).
3) It is easy and fast to predict the class of a test data set. It also performs
well in multi-class prediction.
4) When the assumption of independence holds, a Naive Bayes classifier performs
better compared to other models like logistic regression, and it needs less
training data.
5) It performs well for categorical input variables compared to numerical
variable(s). For a numerical variable, a normal distribution is assumed (bell
curve, which is a strong assumption).
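
A minimal sketch of the Laplace correction mentioned in point 2 (the function name
and arguments are illustrative): each matching count is incremented by 1 and the
class count by the number of distinct attribute values, so no conditional
probability is ever exactly zero:

def laplace_estimate(s_ik, s_i, n_values):
    # P(xk|Ci) with Laplace (add-one) smoothing instead of s_ik / s_i
    return (s_ik + 1) / (s_i + n_values)

print(laplace_estimate(0, 9, 3))   # 0.0833... instead of 0.0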
