Documente Academic
Documente Profesional
Documente Cultură
Pawan Goyal
Labeled graphs:
I Nodes in V are labeled with word forms (and annotation).
I Arcs in A are labeled with dependency types.
Labeled graphs:
I Nodes in V are labeled with word forms (and annotation).
I Arcs in A are labeled with dependency types.
Notational convention:
I Arc (wi , d, wj ) links head wi to dependent wj with label d
d
I wj (wi , d, wj ) A
wi
I i j (i, j) A
I i j i = j k : i k, k j
G is connected:
I For every node i there is a node j such that i j or j i.
G is acyclic:
I if i j then not j i.
G obeys the single head constraint:
I if i j then not k j, for any k , i.
G is projective:
I if i j then j k, for any k such that both j and k lie on the same side of i.
G is connected:
I For every node i there is a node j such that i j or j i.
G is acyclic:
I if i j then not j i.
G obeys the single head constraint:
I if i j then not k j, for any k , i.
G is projective:
I if i j then j k, for any k such that both j and k lie on the same side of i.
Dependency Parsing
Input: Sentence x = w1 , . . . , wn
Output: Dependency graph G
Parsing Methods
Deterministic Parsing
Maximum Spanning Tree Based
Constraint Propagation Based
Basic idea
Derive a single syntactic representation (dependency graph) through a
deterministic sequence of elementary parsing actions
Basic idea
Derive a single syntactic representation (dependency graph) through a
deterministic sequence of elementary parsing actions
Configurations
A parser configuration is a triple c = (S, B, A), where
S : a stack [. . . , wi ]S of partially processed words,
B : a buffer [wj , . . .]B of remaining input words,
A : a set of labeled arcs (wi , d, wj ).
Learning Problem
Approximate a function from configurations, represented by feature vectors
to transitions, given a training set of gold standard transition sequences.
Learning Problem
Approximate a function from configurations, represented by feature vectors
to transitions, given a training set of gold standard transition sequences.
Three issues
How to represent configurations by feature vectors?
How to derive training data from treebanks?
How to learn classifiers?
o(c, T) =
Left-Arc if top(Sc ) first(Bc ) in T
Right-Arc if top(Sc ) first(Bc ) in T
Reduce if w < top(Sc ) : w first(Bc ) in T
Shift otherwise
LEARN({T1 , . . . , TN })
1 w 0.0
2 for i in 1..K
3 for j in 1..N
4 c ([]S , [w1 , . . . , wnj ]B , {})
5 while Bc , []
6 t arg maxt w.f (c, t)
7 to o(c, Ti )
8 if t , to
9 w w + f (c, to ) f (c, t )
10 c to (c)
11 return w