Documente Academic
Documente Profesional
Documente Cultură
History
Applications
Handwriting recognition
Recognizing spoken words
Face recognition
ALVINN
TD-BACKGAMMON
ALVINN
TD-GAMMON
Plays backgammon
Created by Gerry Tesauro in the early 90s
Uses variation of Q-learning (similar to what we
might use)
Basic Idea
Properties
Perceptrons
N inputs, x1 ... xn
Weights for each input, w1 ... wn
A bias input x0 (constant) and associated weight w0
Weighted sum of inputs, y = w0x0 + w1x1 + ... + wnxn
A threshold function, i.e 1 if y > 0, -1 if y <= 0
Diagram
w1
x1
w0
x2
w2
.
.
.
xn
x0
Thres
hold
y = wixi
wn
1 if y >0
-1 otherwise
Linear Separator
This...
x2
+
+
x2
x1
x1
-
Boolean Functions
x1
x2
x1
x2
x0=-1
w0 = 1.5
w1=1
x1 AND x2
w2=1
x1
x0=-1
w0 = 0.5
w1=1
w2=1
x1 OR x2
x0=-1
w0 = -0.5
w1=1
NOT x1
Gradient Descent
Gradient Descent
1
2
Error function : E x = t d od
2 d D
w i =w i w i
E
w i =
wi
w i = t d o d x id
d D
Sigmoid Function
Output : o= w
x
1
y=
y
1e
y
= y 1 y
y
Variable Definitions
Backpropagation Rule
1
2
E d w
=
t
o
2 k outputs k k
Ed
w ij =
w ij
For output units:
w ij = t j o j o j 1o j x ij
For internal units:
w ij = j x ij
=o j 1o j
k Downstream j
k w jk
Backpropagation Algorithm
Momentum
Data from your training set that is not trained on, but
instead used to check the function
Stop when the performance seems to be decreasing on
this, while saving the best network seen so far.
There may be local minimums, so watch out!
Representational Capabilities
Input Encoding
Output Encoding
Network structure
Inputs
x1
3 Hidden Units
x2
.
.
.
x960
Outputs
Other Parameters
Try it yourself!
/afs/acm.uiuc.edu/user/jcander1/Public/NeuralNetwork