Documente Academic
Documente Profesional
Documente Cultură
net/publication/220693385
CITATIONS READS
261 25,868
2 authors, including:
Drew Mcdermott
Yale University
135 PUBLICATIONS 10,040 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Drew Mcdermott on 10 January 2016.
Machine Learning
• Introduction
• Supervised Machine Learning
• Naïve Bayes
• Markov-Chains (learning the parameters of the
model)
• Decision Trees
Occam’s Razor:
Everything else being
equal choose the less
complex Hypothesis
words
Email contains Email contains Email contains Email contains
“free” “money” “ubc” “midterm”
Number of parameters?
Easy to acquire?
If you have a large collection of emails for which
you know if they are spam or not…… Slide 15
CPSC 502, Lecture 14
Learn the Probabilities
• You have 100,000 emails of which 10,000 are spam
Machine Learning
• Introduction
• Supervised Machine Learning
• Naïve Bayes
• Markov-Chains
(learning the parameters of the model)
• Decision Trees
simplifications
P( w1,.., wn) P(w ) P( w1) k 2 P( wk ) unigram
n n
1
C ( wn 1, wn)
P( wn 1, wn) N pairs C ( wn 1, wn)
P( wn | wn 1)
P( wn 1) C ( wn 1) C ( wn 1)
Nwords
C (big , red )
P(big , red ) N pairs C (big , red )
P(red | big )
P(big ) C (big ) C (big )
Nwords
wn 1
C ( wn 1wn)
Excerpt of a P( wn | wn 1)
1616x1616 table C ( wn 1)
CPSC 502, Lecture 14 26
BERP Table: P( wn | wn 1)
wn
wn 1
Prob.
C ( wn 1wn) 1?
C ( wn 1)
wn to
wn 1
unigram C ( w) C ( w) 1
P( w) P ( w)
*
N N V
*
C ( w) N
P ( w)
*
C ( w) (C ( w) 1)
*
N N V
Pseudo-counts
CPSC 502, Lecture 14 32
CPSC 502, Lecture 14 Slide 33
wn
Add-One: Bigram
wn 1
Counts
C ( wn 1, wn) C ( wn 1, wn) 1
P( wn | wn 1) P ( wn | wn 1)
*
C ( wn 1) C ( wn 1) V
N
…… C ( wn 1, wn) (C ( wn 1, wn) 1)
*
C ( wn 1) V
CPSC 502, Lecture 14 34
BERP Original vs. Add-one
wn smoothed Counts
wn 1
5
6
2 2
19
15
Machine Learning
• Introduction
• Supervised Machine Learning
• Naïve Bayes
• Markov Chains
• Decision Trees
Inductive inference
length
thread
short long
new old
reads skips
unknown known
length
thread
long
short
skips new
reads old
e3
e2
length skips
e4 e6
long
short
skips
reads
e1 e5
42
CPSC 502, Lecture 14
Learning Decision Trees
20 attributes:
●
◆age
◆years with current employer
customers
CPSC 502, Lecture 14 48
Screening images
Given: radar satellite images of coastal waters
●
personnel
Attributes:
●
◆size of region
◆shape, area
◆intensity
Constraints:
●
◆effect of holidays
Attributes:
●
◆temperature
◆humidity
◆wind speed
of expert systems
Given: Fourier analysis of vibrations
●
Preventative maintenance of
●
◆ Customer loyalty:
identifying customers that are likely to defect by
detecting changes in their behavior
(e.g. banks/phone companies)
◆Special offers: