Documente Academic
Documente Profesional
Documente Cultură
Andrew Ng
Andrew Ng
Image
Text detec8on
Character segmenta8on
Character recogni8on
Sliding
windows
Machine
Learning
Text detec8on
Pedestrian detec8on
Andrew Ng
Posi've examples
Nega've examples
Andrew Ng
Andrew Ng
Andrew Ng
Andrew Ng
Andrew Ng
Text detec8on
Andrew Ng
Text detec8on
Posi've examples
Nega've examples
Andrew Ng
Text detec8on
[David Wu]
Andrew Ng
Posi've examples
Nega've examples
Andrew Ng
Andrew Ng
Machine Learning
Character recogni8on
A I
N Q
T A
Andrew Ng
Abcdefg Abcdefg
Abcdefg
Abcdefg Abcdefg
Real
data
[Adam
Coates
and
Tao
Wang]
Andrew Ng
Real
data
[Adam
Coates
and
Tao
Wang]
Synthe'c data
Andrew Ng
Andrew Ng
Synthesizing
data
by
introducing
distor8ons:
Speech
recogni8on
Original
audio:
Audio
on
bad
cellphone
connec'on
Noisy
background:
Crowd
Noisy
background:
Machinery
[www.pdsounds.org]
Andrew Ng
Synthesizing
data
by
introducing
distor8ons
Distor'on
introduced
should
be
representa'on
of
the
type
of
noise/distor'ons
in
the
test
set.
Audio:
Background
noise,
bad
cellphone
connec'on
Usually
does
not
help
to
add
purely
random/meaningless
noise
to
your
data.
intensity
(brightness)
of
pixel
random
noise
[Adam
Coates
and
Tao
Wang]
Andrew Ng
Discussion on geJng more data 1. Make sure you have a low bias classier before expending the eort. (Plot learning curves). E.g. keep increasing the number of features/number of hidden units in neural network un'l you have a low bias classier. 2. How much work would it be to get 10x as much data as we currently have? - Ar'cial data synthesis - Collect/label it yourself - Crowd source (E.g. Amazon Mechanical Turk)
Andrew Ng
Discussion on geJng more data 1. Make sure you have a low bias classier before expending the eort. (Plot learning curves). E.g. keep increasing the number of features/number of hidden units in neural network un'l you have a low bias classier. 2. How much work would it be to get 10x as much data as we currently have? - Ar'cial data synthesis - Collect/label it yourself - Crowd source (E.g. Amazon Mechanical Turk)
Andrew Ng
What
part
of
the
pipeline
should
you
spend
the
most
'me
trying
to
improve?
Component
Overall
system
Text
detec'on
Character
segmenta'on
Character
recogni'on
Accuracy
72%
89%
90%
100%
Andrew Ng
Another
ceiling
analysis
example
Face
recogni'on
from
images
(Ar'cial
example)
Camera! image! Preprocess! (remove background)!
Face detection!
Andrew Ng
Label"
Accuracy
85%
85.1%
91%
95%
96%
97%
100%
Andrew Ng