Documente Academic
Documente Profesional
Documente Cultură
GO
REGISTER
Advanced Search
Table of Contents
Preface
Acknow ledgments
Author's Notes
1 The Speaker as Information Processor
2 The Speaker as Interlocutor
3 The Structure of Messages
4 The Generation of Messages
5 Surface Structure
6 Lexical Entries and Accessing Lemmas
7 The Generation of Surface Structure
8 Phonetic Plans for Words and Connected Speech
9 Generating Phonetic Plans for Words
10 Generating Phonetic Plans for Connected Speech
11 Articulating
12 Self-Monitoring and Self-Repair
Appendix: Symbols From the International Phonetic Alphabet, With Examples
Bibliography
Author Index
Subject Index
proxy.library.upenn.edu:2580/…/view?is… 1/1
) 1- 4. North-Hollanri
Lingua92 ( 1994
Preface
For many years, the topic of lexical acquisition was a stepchild in linguistic
inquiry . While" the acquisition of syntax was acknowledged to be organized
according to a set of deep and highly structured innate principles, word
meanings were assumed to be acquired by a simple and particularistic
associativeprocedure that mapped perceptual experienceonto phonetic entities
. We believe this stance was anomalous from the start, as had been
pointed out eloquently by Plato and, in modem times, by Quine. Not all
words are perceptually based; even those that are map quite abstractly from
' '
perceptual categories; and neither perception nor learning is simple . During
the past decade, it has becomeclear from linguistic inquiry that the lexicon is
more highly structured than heretofore thopght ; moreover, that much of
grammar turns on critical - and universal - links between syntactic and
lexical-semanticphenomena.
Hence the present volume. The papers grew out of a workshop held at the
University of Pennsylvania. Its aim was to bring together psychologists,
computer scientists, and linguists whosejoint concern was the lexicon and its
acquisition; these researchershave typically worked quite separately and on
problems thought to be disparate. The volume is organized in six sections:
1. Nature of the mental lexicon.. The two essaysthat open the collection
are from linguists gleaning general perspectiveson lexical learning from the
linguistic facts themselves. The first , by Edwin Williams , arguesfrom the real
complexity and idiosyncratic nature of the mental lexicon (including lexical
phrases) that innate linguistic and categorization principles are not enough,
that real children require a learning theory more sophisticated than usually
has been supposed. A detailed analysis of English causativealternation verbs
by Levin and Hovav eloquently supports Williams on the complexity of
lexical structure.
2. Discovering the word units.. To acquire a vocabulary, the learner requires
some proceduresfor segmentingthe continuously varying sound wave
into word -sizedpieces. Cutler discusses this problem, presentingevidencethat
a bias toward rhythmic alternation servesas a powerful bootstrap for infants
(and adults , whatever their specific language background) solving this
problem. Kelly and Martin then show that , armed with such blases, learners
word ' s meaning; Pinker showed further that such a procedure would be
'
marvelously useful for the construction of phrase structure ( semantic boots-
' . Landau and Gleitman 1985
trapping ) ( ) emphasized that learning might
sometimes, even usually, be the other way round ; specifically, that syntactic
structure plays a necessary role in narrowing the search space for verb
' '
meanings ( syntactic bootstrapping ) . The three articles in this section hotly
debate how the semantics-syntax links (specifically, the relation betweenverb
argument structures and subcategorization structures) are implicated in the
learning processfor verbs. Fisher, Hall , Rakowitz , and Gleitman emphasize
the role of syntax in narrowing the range of interpretations made available by
experience; Pinker argues that logically the child must work from experience
to determine the syntax in the first place; and Grimshaw stalwartly proposes
a reconciliation of these two views.
6. Proceduresfor verb learning: The positions just described, even if
correct, can suggestonly some boundary conditions on a learning theory for
verbs. Fisher et al., asjust described, assertthat some rudiments of syntax are
necessaryfor learning the verb meanings. But as Pinker points out , the
question then would be: How did the child acquire the syntax if not by
exploiting the word meanings themselves? Brent, in the best tradition of
computer science, offers a discovery procedure for verb subcategorization
that usesstring-local surface cues only , and does surprisingly well. The final
article in the volume, from Steedman, was designedas a specific commentary
on Brent' s work . But this short article is much more general than that. It
strikes a number of sane cautionary notes about current theories of verb
learning, of which the most important is that real acquisition is bound to be a
messybusiness, with syntactic, semantic, and prosodic cues recruited by the
child more or less catch-as-catch-can. Steedmanalso points out , in what we
find an appropriately laudatory comment, that it is only in the presenceof
'
explicit computational models such as Brent s that the bootstrapping theories
can ever be developedand seriously evaluated.
It is the hope of the editors that this compendium of topics and views on
lexical learning will be of particular use to the linguists who constitute the
Lingua audience, in two ways. First , we suppose it will be useful to see the
kinds of methodological and substantive contributions that scientific psychology
can make to the question of languagelearning - and henceto the theory
of language. Second- as we have emphasizedearlier - becausesyntactic and
lexical structure are so closely entwined, we expect that the various articles
will be informative as to how theselinkages enter into the learning procedure
for vocabulary.
Preface
, June1993
Philadelphia Lila Gleitman
Barbara Landau
Acknowledgments
This book emergedin the stimulating circle of the Max -Planck-lnstitut fiir
'
Psycholinguistik and Nijmegen University . Numerous colleagues can t
give up educating me in matters linguistic and psycholinguistic, and I
would be at a losswithout them. Also , many distinguishedvisitors from all
over the world have given me help and advice. I am sincerelygrateful to all
of them.
Some, from within and without this circle, have taken the effort to read
drafts of chapters and to formulate comments, either orally or in writing .
Before mentioning them, I must say that each of theseefforts has deeply
touched me. Time and again I experiencedit asa great privilege to be taken
so seriously and with so much friendship. In alphabetical order, these
angelswere Manfred Bierwisch, Melissa Bowerman, Herbert Clark , Anne
Cutler , Jane Edwards, Lyn Frazier, Merrill Garrett , Gerard Kempen,
Wolfgang Klein , Aditi Lahiri , and John Marshall . I would also like to
thank Gerard van Galen, who gave me a specialtutorial on psychomotor
theory.
Though I typed the manuscript myself with two index fingers on a
terminal , finishing toucheshave beenmade by Uschi de Pagter and Edith
Sjoerdsma, especiallyin completing the bibliography . Our librarian , Karin
Kastens, also helped me on countlessoccasions. Many thanks to all three
of them.
The graphic work was done by Wil Maas during three months of ideal
cooperation.
NiJmegen
. February 29, 1988
Author ' s Notes
On Pronominalization
On Tr ~ ription
EdwinWilliams
CognitiveScienceLaboratory, Princeton University, 221 NassauStreet, Princeton, NJ08540, USA
' '
The abstractnessof lexical knowledge, and its independencefrom the words of a language, are
generally underestimated . Abstract idioms demonstrate that the lexicon is the repository not only
of fixed expressions, but of abstract language-particular phrase- and word-patterns as well.
-
Paradigmsare language-particular patternings dimensionedand structured in languageparticular
ways, with - of
languageparticular patterns syncretism . The learning burden of these two abstract
but learned features of lexical knowledge falls ouside of what current thinking about language
learning would allow.
1. Introduction
I will survey two aspects of lexical knowledge which I feel pose special
problems for learning. In both casesthe remarks will tend to emphasizethe
extensivenessabstractness , and at the sametime the language-particularity of
lexical knowledge, and will consequentlymagnify the learning problem. One
conclusion that could be drawn from these observations is that intricate
structures can be learned, and this learning is not adequately modelled by
.
either parameter setting or list learning.
I t is useful to distinguish two notions of lexicon, one Bloomfieldian and the
other grammatical. The Bloomfieldian lexicon is the repository of all of a
'
languages idiosyncracies. The grammatical 'lexicon is the linguist' s theory of
'
the category of linguistic object we call a word . These are quite different
conceptions, but have come to be identified in modern times. The result is a
picture of grammar in which we have a clean streamlined syntax, where
syntax is the theory of phrase, and a notion of word that entails that words
are idiosyncratic and irregular at heart, with partial regularities expressedas
' '
redundancy rules whose name implies low level generalizationswhere exception
is the rule.
I think that this picture is wrong. A more correct picture is I think given in
Disciullo and Williams ( 1986) . In that view, both the word formation and the
' '
syntactic system are clean streamlined systems, independent of the lexicon.
The lexicon is the repository of forms about which something specialmust be
learned. This certainly includes, for example, all the monomorphemic words,
but it also includes composed units of both syntax and word formation ,
composed units whose properties do not all follow from the rules which
compose them. This would include a great deal of words, but also a great
deal of phrases. In fact, I now think there are more lexical phrasesthan there
are lexical words, but this re~ ains a speculation.
In addition , I think that lexical knowledge includes knowledge of complex
abstract.structures that cannot be arrived at through parameter setting, and
which must be learned from the data in a strong sense. I will discusstwo of
these: ' abstract idioms' in section 2, and paradigms in section 3. If my view of
theseis correct, the rich structure each exhibits results not from a rich innate
predefined linguistic structure, but rather from a richly structured learning
strategy.
If nothing else, then, the remarks that follow draw attention to the learning
problems that might arise if first , more ~f linguistic knowledge is lexical than
has been thought, and second, if acquired lexical knowledge is more abstract
and structured in language-particular ways than has been thought . I think
that the sort of structure that is found in each case reveals the hand of the
learning strategy.
2. Idioms
The phrasesin the lexicon are called ' idioms' . We generally think of idioms
as ' frozen' expressions. I will use the term idiom to refer to any defined unit
whose definition does not predict all of its properties. This will include the
phrase kick the bucket, a phrase, whose idiomatic meaning is die, but also
transmission, which unpredictably means ' such and such car part ' . We don' t
think of transmissionas an idiom , but it will be useful and I believecorrect to
include it .
It has been commonly assumedthat idioms are well-formed structures, but
I think lately this view has been questioned. To say that an idiom is well
formed is to say simply that it conforms to the rules of formation for the type
that it is. Of course, the idiom does not conform to all of the rules, else it
would not be an idiom . What rules do idioms obey? It is useful to consider
the rules of ' form ' apart from the rules of interpretation . We find idioms
violating both types of rules.
E. Williams I Remarks on lexical knowledge
The idioms that do not obey the rules of interpretation are deviant in
meaning. The rules of argument structure are not obeyed in some cases, and
in others, the rules of referenceare not obeyed. We might refer to both these
' '
types of idioms as semantic idioms.
It is perhaps misleading for me to say that there are idioms which do not
obey the rules of form , for it is an overwhelming fact that idioms obey the
basic rules of fonn it) a language. For example, all idioms in English obey the
' head-initial '
setting of the head position parameter. On the other hand, there
are unpredictable, language-particular exploitations of the formal possibilities
'
in a language; we might call theseidioms ' formal idioms.
A widely shared set of assumptionsabout idioms is the following :
( 1) usual assumptions:
idioms are listed
idioms are well-formed phrases
idioms have empty parts :
The cat has Xs tongue
relation of idioms to syntax:
insert the idiom , then fill in the parts
'
So, for example, The cat has got X s tongue is an idiomatic phrase, fully
consistent with the laws of English syntax, which is inserted in a phrase
marker for an S position ; further substitution of a referential NP for the
position of X will yield an English sentence.
These assumptions, though common, lead to some surprising conclusions,
or at least some surprising conclusions, or at least some surprising questions.
.
In particular , they lead to the view that a great deal of what we have called
' rule' in ' '
syntax might really be idiom . To give an example, consider the fact
that embeddedquestions in English must begin with a WH word. It is well
known that this is not to be describedby making WH movement obligatory
in English. What kind of information is it then that [ Wh-phraseS ] is a
(potential) embeddedquestion in English? It COULD be that this form is an
idiom :
' '
(2) [[ Wh-phraseS ]. , : embeddedquestion
Under this conception, (2) could be listed in the lexicon, awaiting insertion
into some embedded S position, where it is an embedded question (as the
-
slogan after the colon above indicates) . Further substitution of some wh
E. Williams I Remarkson lexical knowledge
phrase for the wh-phrase position in (2) and of an S for the S position in (2)
will complete the sentence, exactly parallel to The cat has got X 's tongue.
One does not ordinarily think of this as idiomatic information , even
though it is languageparticular ; rather, one thinks of this feature of English
as one of the ' parametric' possibilities. (2) is not the ordinary way to encode
this information .
Example (2) may seem sufficiently different from The cat has got your
tongue that it would never be misidentified as an idiom in the same sense.
However, I intend to supply enough examplesof casesintermediate between
the two ' that I cannot see how it cannot be questioned whether (2) is the
correct description or not.
I believe that neither of these is a preposition, and that the notion that
English is head initial can be maintained in a strong form .
Notwithstandingcan be assimilated to the following constructions:
' '
]sc S]s: evenwith NP, S
(6) [[NP notwithstanding
It appears that time prepositions in general can take some sort of extent
specification; however, this specification in general precedesthe time preposition
, as we might expect specifiers to. (8c, d) shows clearly that the extent
specification is not a part of the complement structure, as the prepositions in
and before have complements to their right . Now , ago differs from these in
two ways. First , it cannot have a complement to the right . This however
shows nothing except that ago is intransitive. Secondly, in the caseof ago, the
extent specification is obligatory . Here, if we think that extent specification is
ordinarily outside of the subcategorizational reach of a head, we might
appeal to an idiom to capture this exceptional feature of ago:
E. Williams/ Remarkson lexicalknowledge
In (b), ' bridges' refers to ' problems' , and (b) says that we are talking about
more than one problem. In (a) on the other hand, it is impossible to refer to
more than one death.
Is this an arbitrary difference? I do not think so; I think it follows from the
fact that the meaning of ' kick the bucket' is intransitive. This intransitivity of
meaning meansthat there can not be a correspondencebetweenthe syntactic
and semanticargument structures, sincesyntax is transitive here. In (b) on the
other hand, the verb cross can be assigneda transitive theta structure, and
one of its theta roles can be assignedto bridges. So what is wrong with the
meaning of (b) is simply the reference, not the theta structure: cross refers to
. solve and bridges refers. to problems. But in (a), it is not just reference, but
theta structure itself, which is idiosyncratic.
If this is so, then (b) has a transitive theta structure, and its passivizability
is no surprise, if passiveis an operation on theta structures, as I believe it is.
E. Wi/Iiams / Remarkson lexical knowledge
And the fact (a) is not passivizableis no surprise either, sincethere is no theta
relation betweenthe direct object and the rest of the sentence.
If this account is correct, then theta structure is not necessarilyrespectedin
idioms, though it may be. A child learning an idiom will aggressivelyassignit
as much structure as possible according to its rules. If it can assign it a
semanticargument structure basedon its syntactic argument structure, it will ,
but if not , then the idiom will lack argument structure.
2.2. Abstractidioms
New, made-up terms for lower trunk wear must conform as well. One
exception, bathing -suit , is an exception only in that it does not specifically
refer to lower - trunk wear, but rather means whatever one wears to bathe
in ; it is an accidental fact of current fashion that this refers to lower -trunk
wear.
Now , what sort of information is this? It is information about a general
restriction on form that follows from meaning; specifically, if a noun is going
to have such and such a meaning, then it must be plural ;
' '
( 12) Ns +- : lower trunk wear
I .have drawn the arrow from right to left to mean, if an item is going to have
the meaning on the right , then it MUST have the form on the left. This is a
different sort of idiomatic information from knowing that kick the bucket
CAN mean die, and so the different notation.
E. Williams / Remarkson lexical knowledgE
Another example like pants is fish . Fish names , with some exceptions , are
all unmarked plurals or have that as an option : trout , bass, perch , bream ,
yellow tail , mahimahi . The exceptions are not really fish , by and large :
whale , guppy , minnow . Other animal families are untouched by this idio -
syncracy : bee, wasp , ant . As far as I can tell , this is an unpredictable , but
' '
very general fact about English , and counts as idiomatic information
about the language .
A further case of the same sort is the language - particular patterns of
lexicalization identified by Talmy ( 1985) . He found that languages systematically
differed in the kinds of verb meanings they allowed . For example ,
English allows verbs of motion to indicate a means of motion . Float can be
used as a directional verb , but at the same time , it indicates the manner of
motion : John floated under the bridge can mean that John moved under the
bridge by Boating . Spanish and French lack entirely verbs of this kind . F/otter
in French and flotar in Spanish ( float ) can mean only to float stationarily ,
and the restriction is apparently a hard and fast one. Similarly , verbs of
posture ( sit , kneel , lie , etc.) differ systematically from language to language ,
in whether the stative , inchoative , or causative is the basic or underived form ;
English , Japanese, and Spanish systematically differ in this choice - see Talmy
( 1985) for details .
Again , we have language -particular variation of a quite general sort .
' '
Again , the information is idiomatic , but the question remains , how to
' '
represent it . We might again represent it as an idiom with a hole in it :
'
-
wipers
' '
VP > > V PP > > V P N > > laisse - - conte left for count
pour ,
' '
abandoned one
Here, the double carat means ' has as an instance' . So, compound terms in
French are instancesof syntactic constructions. As word level items they have
their own limitations (e.g., no referential material may occur in them, and so,
for example, no determiners are allowed) , but they are neverthelesswell-
formed syntactic objects.
English , on the other hand , exploits a different system to form its
compound terms :
The system exploited here is the affixation system in the lexicon, which is
right -headed
'
. Ordinarily , Y is a suffix, forming the head of a word. English
lets Y be a full noun, giving us compounds.
Importantly , both languageshave both resources- English has the same
(left -headed) syntax as French, and French has the same right -headedaffixation
systemas English; however, they each exploit a different one of thesefor
their compound terms. I assumethat this is ' idiomatic ' - that is, language-
' '
particular , but perhaps not parametric .
2.3. Syntacticidioms
(18a
) [wh-phraseS]
'
> >
[what N S] : Little (amount relatives)
18b
( ) gave
I him what food I had.
19
( ) give
I him what I had.
E. Williams I Remarks on lexical knowledge
( 18b) has an implication that ( 19) does not have, namely, that there was little
food in question.
Now , where does this implication come from ? It does not come from what,
which does not have this implication in general, not even in free relatives,
exceptin the context in ( 18b) . Furthermore, it does not inhere in free relatives
in general. In fact, it occurs only in the structure in ( 18), it is idiosyncratic to
that structure. Assuming that there is no parameter to set here, then this is a
learned fact about this structure.
What is interesting is how formally similar ( 18) is to ( 17) . The only
difference is that the semantics of ( 18) is very particular, and therefore
plausibly idiomatic , whereas the semantics of ( 17) is very general. But the
formal means may be the same in the two cases: a feature of meaning and
form are connected in an idiomatic entry in the lexicon. If a learner can
induce ( 18), it would seem that ( 19) would be accessible to the same
mechanism.
A related sort of case arises from Subject Aux Inversion in English; the
following is an idiom of English:
This is comparable to ( 17) - an obligatory idiom (that is, of the pants variety)
has the effect of forcing syntactic rules to apply. More interesting are the
casesof inversion which receivea conditional interpretation :
The rule of inversion gives a large number of forms which are ungrammatical
in this context :
*
(22) Could I write poetry, I would not be a linguist .
In fact, inversion in the conditional context works only for the auxiliaries had
a~d were. What sort of infonnation is this? Importantly , the casesallowed in
the construction are a subset of the casesallowed in general; hence, what is
learned is that not all the formally allowed possibilities are realized. We will
adopt the following convention for representingthis situation :
E. Williams I Remarks on lexical knowledge
(23) Instanceprinciple :
If a fonn to which a meaning is assignedhas listed subinstances, then
those subinstancesare exhaustive.
(24) NyPNy
by side
(26) N to.N
> > hour minute second year
window door station
head hand cheek
NbyN
> > minute, hour , etc. ;
layer house cave
N for N
> > dollar for dollar
N from N
> > limb from limb
Non N
> > layer on layer
the other hand, both prepositions must be present: depart en part (' limb from
limb ' ) ( V. Deprez, p .c. ., T . Hoekstra has pointed out to me the existenceof
heurepar heure and cote a cote) .
The various subcasesof (24) do not have a common element of meaning.
So, for example, cheekto cheekrefers to the pressingtogether of two cheeks,
as in dancing; but minute to minute and day by day refers to a seriesof days
in sequence . Even the instancessharing a common proposition do not have a
completely shared element of meaning, as the two instanceswith to just cited
show. Therefore, these forms are not compositional, despite clear patterns in
the meaning.
On the other hand, from the fact that they are so prevalent in one
language, and absent altogether in another, we know that they are present as
a group in some sense. It seemsunlikely that there is a parameter for this
property alone; perhaps it follows from some other parameters, though it is
hard to seehow.
An alternative is that the structure in (25)-( 27) is induced from the data of
the language. How could this happen? Supposethat at a certain point in the
.
course of acquisition, some number of forms with the shape NxPNx have
been learned. As a class, they conform to the shapeof left headedcompound
prepositional phrases, and so do not fall outside of the language altogether.
On the other hand, their properties are not entirely explicable in terms of the
general principles of the grammar; in particular, the use of bare singular
count Ns as the objects of prepositions is not a general feature of prepositional
phrases in English. So these remain idiomatic ; however, they are
idiomatic as a class, not as individuals.
Another family of idioms is illustrated in the following :
What is special about this case is the use of the bare singular as object.
Normally , this is not allowed for English count nouns, but is allowed here.
So, we have an abstract idiom , of the following form :
E. Williams I Remarks on lexical knowledge
(30) [V N] vp
+ HUNT + ANIMAL
The form in (30) is a special case of the general form of VP (and so, for
' ' ' '
example, is V N and not N V ), and thus conforms to our overall claim that
abstract idioms are always instancesof more generalpatterns of the language.
The limits of this idiom are somewhat roughly indicated by (b) and (c) - the
verb must be a verb of hunting, and the object must be an animald ) shows
a further restriction - not only are determiners excluded, but adjectives as
'
well (unless"sleepy is a kind of elephant) .
One might conclude from this the observation that this construction was
' '
in fact a lexically compound verb - to hunt monkey then would be
syntactically intransitive . I doubt this, since English in general disallows
compound verbs, and particularly disallows left -headed compound verbs;
but even if the conjecture were correct , then the problem posed by these
examples is not solved, but simply delivered to the lexicon, with fee still
unpaid.
I . assume that it is not at all predictable that English would have this
pattern ; and in fact, inspection of (b) and (c) might lead one to not expect
this pattern . I conclude therefore, that learning ( 30) entails generalizing over
exampleslike those in (29), and the limits of the generalization must follow
in some way from the actual mechanism of generalization.
The existence of these idioms of intermediate abstractness argues that
learning languagedoes not reduce to (a) learning parameter settings, and (b)
learning the properties of particular lexical items. Rather, there are structures
between these two extremes, what I have called abstract idioms , which
.can
only be learned as language-particular generalizations of particular
forms.
3. Paradigms
(31) Latin :
(a) + - finite + - indicative (b) amo amamus
+ - passive + - perfective amas amatis
presjimperfjfut amat amant
+ - plural Ij2j3person
( 32) - 0 -mus
- as - atis
-t - ant
3.1. Extensiveness
Here the forms are all 1st singular, present-perfective crossed with active-
passive. One comer of this square contains a phrase, amatus sum, while the
other three comers contain words. This shows that phrases form an inextricable
part of paradigmatic information . If we removed the perfective passive
form , we would have destroyed the symmetry of the paradigm, which is
otherwise perfectly symmetrical.
We can see the same thing in an English paradigm, the comparative
paradigm:
The rule is, if an adjective is mono- (or nearly) syllabic, then form the
comparative with -er ; if not , then the comparative and superlative are formed
'
phrasally. The existenceof this paradigm is what pennits us to speak of the
' of an
comparative adjective, even though there are two ways of forming
.
comparatives Many languageslack any way at all to fonn the comparative;
English has two ways, one morphological, the other syntactic.
Paradigms include not just inflectional dimensions, but what have been
called ' derivational' processes as well. I am sure that there is no distinction
be~ween derivational and inflectional morphology , but if there is, then
paradigms are found in both morphologies.
E. Williams I Remarkson lexical knowledge
.
( I ) Hierarchalize the dimensionsof the paradigm.
Assign forms to nodes in the hierarchy.
(35) V'
.
I
I I
finite infinitive
I I
I I I I
sg I
m
m
2 3*
rfl
I 2 3
rfl
rfl
1 2 3
rfl
123
rfl
rfl
pi I 2 3 I 2 3 1 2 3 I 2 3
V * = run *
past = ran 3 * = runs perf * = run
The tenninal nodes are the actual cells of the paradigm. The starred nodes
are the nodes to which actual forms are assigned. By convention, a cell is
filled by the nearestspecifiedform above it . The identity of the forms and the
points in the tree at which they are mapped are given in (35) . The assignment
'
shown is the most economical, as each form is assignedonce.
We might call the starred nodes entry points' - these are the points at
'
which concrete forms are specified. The tree along with the starred nodes we
E. Williams / Remarkson lexical knowledge
' '
might call the pattern of syncretism . This is a pattern which is independent
of the rules for creating the forms in the pattern. It is a part of the formal
structure of the paradigm. Strikingly , the pattern of syncretism holds generally
in a category, not just for particular verbs, as we will see.
If we look at the pattern of syncretism for a variety of English verbs, a
startling pattern emerges:
(36) V$
I
I I
finite perf * infinitive
~
I I r +- , r +- ,
pres pastS I 2 3 I 2 3
r +- , r +- , ~ m
* IA 2 JA I 2 3 1 2 3
sg IA 2 3
r +- , r +- ,
.
pI I 2 3 I 2 J
Entry points :
$ modals
$* regular; go-went
$*" be
have, says, does - irregular in 3prsg: * haves
Here, marked on the same tree, are the entry points for severalcategoriesof
verbs, including irregular verbs. As can be seen, the sets of entry points form
a nested set; the verb be shows the most distinctions, and consequently has
the most entry points, but all other verbs, including all irregulars, have some
subset of the entry points of be. It is far from obvious that such a relation
should exist - if a verb is going to be irregular, why should it not be irregular
in having a different pattern of syncretism, a different set of entry points ? But
this does not happen, even irregular verbs respectthe pattern of the language
as a whole. In fact, even suppletive verbs, the limiting case of irregularity ,
respectsthe pattern of syncretism; the verb go has went as its past tenseform .
Things could have been different: went could have been the third past plural
form , with goed (or something else) for all the other forms ; but then, go-went
would have violated the language-wide pattern of syncretism.
E. Williams / Remarks on lexical knowledge
< 1>
1 .
" s
~
I
-;Il 1
I p -a -aeas
N
direct indirect direct indirect . Ac - am
r- l -, -
r - L- , r--l --w r~ D - ae -IS
-a .
- IS
nom acc abl dat nom acc abl dat Ab
I -pi
I I
< 2> s
~
, I N -us -i
direct indirect . direct indirect . Ac -urn - os
, . J--, r-- - L- , r=
:
L - ,
-
r - L- , D -0 - is
nom acc abl dat nom acc abl dat Ab -0 - is
< 2n >
~ ____~ - -- ,
-;I1 I
I p etc.
direct indirect . direct . indirect .
r -- L- , r- L- , -
r - L- , -
r - L- ,
nom acc abl dat nom acc abl dat
1 -pi
I 1
< 3> s
~
. I
direct indirect direct indirect .
r-- L- , r-- L- ,
-
r - L- , r - L- ,
nom acc abl dat nom acc abl dat
< 4>
I
s
~
I
-;Il I
I p
direct indirect direct indirect .
r-- L, r - Li r ~ -
r - L- ,
nom acc abl dat nom acc abl dat
E. WilliamsI Remarkson lexicalknowledge
principle syncretisms:
1st decl: pl . indirect (-is)
2nd decl:pl . indirect (is), sg. indirect (-0)
neuter: above + direct sg. (-urn). pl . direct (-a)
3rd decl. :pl . indirect(-ibus)
4th decl. :pl . direct (-es) pl . indirect(-ibus)
neuter: sg. (direct = indirect) (-0), pl . indirect, pl . direct
Vx
I
I I I
. .
perf * partic
I
I I I
A
pres imperfect future .
I
I I
sg plur
------ ---- - - - ------"
, +- , r 1 ,
1 Z' 3
'
I 2' 3
The asterisked positions represent the entry points for the regular verbs of
all four conjugations. This alone is striking , for again, there are different
rules in the different conjugations for yielding the forms at these entry
' '
points . The first and second conjugation form the future by suffixing -b- ,
whereas the third suffixes nothing , but switches the conjugational class of
the basic stem. Neverthess, that the future is an entry point is common to all
the conjugations.
The verb essehas the most entry points, and every other verb uses some
subset of those entry points. Thus the conjugation of esse is the basic
conjugation, to which all the others are related.
'
As a final example, we consider Anderson s ( 1984) description of Georgian
'
verb conjugation. The system is quite complex; Anderson s account uses
blocks of rules, both conjunctively and disjunctively ordered. It is my
contention that such a systemwill fail to capture the most abstract patterning
of a paradigm, which, as we have seen, is generally independent of affixes or
E. Williams / Remarkson lexical knowledge
We ask at this point , why are these patte .rns of syncretism in language ?
Why is a pattern of syncretism replicated accross different modes of realizing
paradigm cells?
I speculate that it is the acquisition of paradigm structurethat is respon -
sible for this arrangement.
Pinker has demonstrated how the blocking principle will give rise to the
developmentof paradigm structure in the languagelearner. The basic idea is
that wheneverthe languagelearner has been forced to posit two items to fill a
single cell, he is then motivated to split the paradigm (really, to double it ) so
as to avoid violating the blocking principle :
(39)
The paradigm now has a new dimension, and a whole new set of cells; the
' '
language learner now must learn the significance of the dimension, and fill
in the rest of the cells.
I think that this algorithm for building paradigms, combined with the
notion of paradigm structure I have just outlined, will predict some of the
patterns we have observed.
E. WilliamsI Remarkson lexicalknow/edgE
There are two sorts of splitting than can take place - intraand interparadigm
' '
splitting . Given a present tense paradigm for some verb, see, for
' ' ' '
example, the recognition that saw , like see, can be used for the 1st person
' '
singular, leads to a postulation of a past tenseplane, in which to locate saw .
This is intraparadigm splitting . The other kind of splitting might be called
' declensional'
splitting : given a set of endings, say the endings for the 1st
declensionin Latin (-a, -am, -De, -a, etc.), the recognition that the ending -us
can signify nominative singular, just as -a does, triggers the splitting of the
nouns into (at least) two declensions. The differencebetweenthis and the first
caseis that in the first case, a given word will have forms in every cell of the
new and old plane, but in the second, a word will have forms in only a single
declension. We may neverthelessconsider the splitting to be formally the
same in the two cases, and this is supported by the observation that
syncretism patterns the samein the two cases.
An example of a ' declensional' split is the comparative paradigm in
English; the paradigm is a linear three-point paradigm: adjective, comparative
, superlative. But there are two modes of forming members, as we have
. seen- one for monosyllabics and Simple disyllabics, and another for everything
.else. A telling point which showsthat we are dealing with a declensional
split here is that the same criterion that is used to determine whether the
' '
comparative is A -er or more A is used to determine whether the superlative
' '
is A -est or most A . This criterion is not therefore a part of the rules
themselves , but is rather a general criterion of membership in the two
declensions; much as ' feminine' is a criterion for membershipin the Latin 1st
declension.
Now , supposethat a learner has learned a piece of paradigm structure, and
has learned not only the labels for the dimensions, but has also learned the
positioning of the entry points, which I have called the pattern of syncretism.
Supposefurther that when the paradigm is split, that this abstract pattern of
syncretismis replicated along with the cells themselves:
(40)
0
If this is done, then we will expect to seepatterns of syncretismrecurring. The
following prediction is made: whatever paradigm is learned first will embody
E. Williams / Remarkson lexical knowledge
4. Learning .words
Many adjectives do not end in -y , and many words ending in -yare not
adjectives, but the probability that a word is an adjective increasesonce one
knows that it ends in -yo
Note that this is true for a large class of adjectives where the -y does not
serve as a suffix (the second group) . Even excluding the caseswhere -y is an
affix, there is a correlation.
A battery of such correlations could serve the next step: to identify
m~rphemes, and assign them properties. In the case of -y - + A , there are
sufficient casesto warrant postulating a suffix with the category adjective,
and assign it wherever possible - that is, wherever an independent stem
exists.
E. Williams I Remarkson lexical knowledge
Even where the analysis into morphemesdoes not hold , the information of
the correlation has been noted; it seemsunlikely that this information is
forgotten once the morpheme has been established.
Another example in English of a no- go morpheme is the suffixtude ; most
of the 50 or so membersdo not admit an analysis into morphemes:
(42) altitudeattitudeplatitude,
And yet, this suffix so strongly marks nounhood that there is not a single
verb or -adjective with this ending.
The number of examples it takes to establish a correlation between two
properties is quite small. For example, the English noun-deriving suffix -af
has fewer than 20 instances; and yet, the correlation between suffix and
category seemsfirmly established for all speakers, as well as the restriction
that the stem must be stressfinal , a separatelearned correlation :
4.2. Learningsubsystems
The strength.of this paradigm is great enough that when a coined word meets
the criterion for membership, it immediatelyparticipatesin the entire paradigm:
How do we know that we are dealing with a paradigm here, and not simply
with several distinguishable morphological rules? Becausethe correlations
amongst the rules striking : for example, 80%' of the words that take -ive to
form adjectivestake -ion to form nouns; this is as compared to the baserate
of 6% of nouns that take. -ion. Surely speakersnotice this sort of correlation ,
and use it to construct a paradigm like (44), which can then be exercisedon
new forms.
The criteria for membership in this paradigm are somewhat rough, but
include at least the following :
The force of thesecriteria can be seenin the fact that 26% of the nouns that
begin with trans- fonn nominalizations in -ion, against the base rate of 6%
for nouns in general.
Incidentally, this sort of thing is not limited to the English Latinate
vocabulary. 39% of the verbs beginning with the causative prefix enencroach
, endear, encase, ensnare) form their nominalization in -ment, as
opposed to the background rate of lessthan 5% .
' '
We ordinarily think of -ion as an unproductive suffix, compared, say, to
the suffixness . However, as we have seen, in the domain of certain classesof
stems, -ion approachescomplete productivity , and even becomesapplicable to
new forms. The other side of the coin is that -ness is actually not so
E. Williams I Remarkson lexical knowledge
productive over the entire set of stemsin English - it is of course only in the
adjectives that it shines. So -nessand -ion are equally productive, we might
say, over their own classes- the class for -nessis adjectives, and the class for
-ion is Latinate verbs. Given this, we might wonder, why isn' t every affix
completely productive within some arbitrarily drawn subclassof the lexicon,
say the class of things to which it does attach? I think the answer is that
subclasses cannot be arbitrary . The Latinate class is productive for -ion
precisely becauseit can be identified independently of the occurencesof -ion:
it can be identified as the claSsto which -ive attaches, or perhaps it can be
identified in some way along the lines of (46) . In either case, we must
attribute to the language learner the ability and the inclination to compare
subclasses, and look for high matches. When a high match is found , then a
dimension of a paradigm has been identified.
References
This paper investigates the phenomena that come under the label ' causative alternation ' in
English, as illustrated in the transitive and intransitive sentencepair Antonia broke the vase/ The
vasebroke. Central to our analysis is a distinction between verbs which are inherently monadic
and verbs which are inherently dyadic. Given this distinction , much of the relevant data is
explained by distinguishing two processes that give rise.to causative alternation verbs. The first ,
and .by far more pervasiveprocess, forms lexical detransitive verbs from certain transitive verbs
with a causativemeaning. The secondprocess, which is more restricted in its scope, results in the
existence of causative transitive verbs related to some intransitive verbs. Finally , this study
provides further insight into the semantic underpinnings of the Unaccusativity Hypothesis
(Perlmutter 1978).
I . Introduction
We refer to this alternation as the causative alternation and to verbs with both
uses as causative alternation verbs. 1
Such transitive/ intransitive pairs have receivedconsiderableattention from
linguists . working in a variety of linguistic frameworks: analyses of this
phenomenon have ranged from the primarily syntactic (e.g., Burzio 1986) to
the primarily semantic (e.g., Fillmore 1968) and from the wholly lexical (e.g.,
Wasow 1977, Keyser and Roeper 1984) to the partly lexical (e.g., Borer
1991). 2 There is a sensein which, at least descriptively, the phenomenon is
taken to be well-understood, and the same handful of acceptedfacts regarding
this phenomenon are frequently cited. 3 Although the analysis of this
phenomenon has been the focus of many studies (see, for example, the
referencescited in Levin 1993), relatively little has been said about the
phenomenonexcept in relation to the multifarious theoretical concernsit has
been used to shed light on.
It turns out , however, that many important questions about the phenomenon
itself remain unanswered. And as long as the phenomenon itself is still
not well-understood, a complete analysis of the alternation cannot be devel-
1 Besidesthe causativealternation. English also has a ' periphrastic ' causative which is
.
expressed with the verbsmakeor have. asillustratedin Antoniamadethevasebreak. It hasoften
beenremarkedthat the notion of 'cause ' that entersinto the relationbetweenthe transitiveand
intransitiveusesof the alternatingverbsallowsfor a morerestrictedrangeof interpretations than
that found in English periphrasticcausatives . The type of causationassociatedwith the
alternatingverbswhicharethesubjectof our studyis termeddirect(or. sometimes .
. manipulative
contact. or immediate ) causation . while Englishperiphrasticcausatives allow indirectas well as
directcausation(Comrie1981 . Cruse1972 . NedjalkovandSilnitsky1973 . Shibatani1976. among
others). As we shallsee. the type of causativeexpressed with alternatingverbsin Englishis not
availableto all verbs. contrastingwith the type of causativeexpressedby the periphrastic
causative constructionin English.whichis generallyavailable . In somelanguages both directand
indirectcausationaremorphologicallyencoded . but in suchlanguages . the two typicallyinvolve
distinctmorphologicaldevices . We referto the kind of causativewearefocusingon in this paper
as the lexicalcausative . sinceit is usuallyformedusingthe lexicalresources of a languageand
showsthe hallmarksof a lexicalprocess( Wasow1977 ).
2 The causativealternationhas also attractedconsiderableattention outsidethe theoretical
linguisticsliterature. It is the subjectof a numberof studiesin psycholinguistics and child
languageacquisition; seePinker( 1989 ) for a reviewof this literature.
3 Two notableexceptions are Pinker( 1989 ) and Haspelmath( 1993 ).
verbsin English
B. Levin, M. RappaportHovavI Causative
oped. For this reason, before presenting our analysis of the causativealternation
, we provide a survey of certain properties of the causative alternation
aimed at providing a contribution towards filling this gap in our understanding
. This facet of our investigation focuseson two related questions: (i )
Is it possibleto delimit semantically the classof verbs which participate in the
alternation? and (ii ) Do all examplesof the causative alternation as defined
above represent instances of a single phenomenon? Answers to these questions
will not only help us understand the causativealternation itself, but they
should also deepenour understanding of the nature of lexical representation
and its relation to syntactic structure.
In this paper, we hope to show that the phenomenathat fall under the label
4causativealternation' are on the one hand lessidiosyncratic and on the other
hand less uniform than is typically believed. We suggestthat much of the
data we investigate is explained once we distinguish two processes that give
rise to transitive and intransitive verb pairs.4 The first , and by far more
'
pervasive process, is the one which forms lexical 4detransitive verbs from
some transitive causative verbs. The second, which is more restricted in its
sCQpeforms causative verbs from some intransitive verbs. With respect to
intransitivity , we hope to provide further insight into the semantic underpin-
nings of the Unaccusativity Hypothesis, the hypothesis proposed by Perlmut-
ter ( 1978) that the class of intransitive verbs consists of two subclasses, each
associatedwith a distinct syntactic configuration . Finally , as in our previous
work , we hope to show that if the relevant aspectsof meaning of a verb (or
class of verbs) are properly identified, many of the apparent idiosyncratic
properties of that verb (or verb class) fall into place.
This section sets out the properties of the causative alternation that need to
be accounted for . We begin by repeating the often - made observation that
4 That is, disregarding those verbs which participate in the unspecifiedobject alternation (e.g.,
the verb eat as in Terry ate her lunch/ Terry ate). In this paper, we also do not discussthe middle
construction (e.g., Bread cuts easily) ; this construction involves intransitive uses of transitive
verbs which at least on the surface show some similarity to the intransitive variant of the
causativealternation. We consider the middle construction to be a distinct phenomenon since it
differs from the intransitive variant of the causative alternation in interpretation and other
properties (Keyser and Roeper 1984, Ruwet 1972, among others) ; however, seeHale and Keyser
( 1987) for an analysis that treats the intransitive variant of the causative alternation as a special
caseof the middle construction.
B. Levin. M . Rappaport Hovav I Causativeverbsin English
there are many verbs in English which occur in the transitive/ intransitive
pairs characteristic of this alternation. A preliminary list of such verbs is
given below.
( 3) bake , bounce , blacken , break , close, cook , cool , dry , freeze, melt , move ,
open, roll, rotate, shatter , spin , thaw , thicken , whiten , widen , ...
5 In
Basque the change jn transitivity is accompanied by a change in the auxiliary accompanying
the verb. Simplifying somewhat, the transitive use selectsthe transitive auxiliary ukan
' have' while the intransitive use
, selectsthe intransitive auxiliary izan ' be' . Thus the difference in
auxiliary reflects general properties of Basqueand not properties of the alternation. The labels
' NO R' and ' NO R K ' are the
traditional namesfor the casesassociatedwith the noun phrasesin
.
the examples. SeeLevin ( 1989) for more discussion.
6 For more on the
morphological relationships between the verb forms in the transitive and
intransitive variants of the causative alternation, see the discussion of Nedjalkov ( 1969) and
Haspelmath ( 1993) at the end of section 4.
B. Levin. M . Rappaport Hovav / Causativeverbsin English
These examples might suggestthat agentivity is the crucial factor and that
agentive verbs do not participate in the alternation , while non-agentive verbs
do. As it happens, both suggestionsare wrong. There are agentive verbs
which do show the causativealternation, as in (9) and ( 10), and non-agentive
verbs which do not , as in ( 11)-( 14) .
'7 Some
English intransitive verbs without transitive causativecounterparts are used transitively
in the resultative construction, but in this construction such verbs do not have the transitive
causative meaning which the alternating verbs have. Consider the verb laugh in the resultative
construction The crowd laughedthe actor off the stage. This construction does not mean that the
crowd made the actor laugh, which would be the interpretation that would parallel the intended
interpretation of (8b), but rather that the crowd laughed.
B. Levin
, M. Rappaport
Hovav/ Causative in English
verbs
.The behavior of the agentive verbs of manner of motion contrasts with that
of non-agentive verbs of manner of motion , which, as shown in ( 17), do not
require a directional phrase in either their transitive or intransitive use.
8 There
may be some disagreement about whether the directional phrases are absolutely
necessaryin the transitive causative usesof these verbs, particularly with a verb like jump . But
even if thesephrasesneed not be expressedin certain circumstances, they are always understood
in the transitive causative use. A speaker who accepts ( 16c) stin cannot give this sentencethe
interpretation that the rider made the horse jump in place; rather this sentencereceives the
interpretation involving the directional phrase: the rider made the horse jump over something.
We look at this issuein more detail in section 8, where we also discusssome verbs of manner of
motion that do not have causativeforms even in the presenceof directional phrases.
Verbs of manner of motion are not unique in imposing the directional phraserequirement. The
behavior of agentiveverbs of position parallels that of agentiveverbs of manner of motion in that
they can have a causativevariant only in the presenceof a directional phrase, which givesthem an
' assume ' .
position reading: MDUdestood the baby versusMDUdestood the baby on the table. We do
not discussthis data here becausethis classof verbspresentsa number of complications. SeeLevin
and Rappaport Hovav (to appear) for more discussionof verbs of position, as well as a discussion
of a directional phrase requirementthat surfacesin certain circumstanceswith verbs of emission.
B. Levin. M. Rappaport Hovav/ Causative
verbsin English
from that shown by verbs such as break (Cruse 1972, Hale and Keyser 1987,
among others), we include this alternation among the data that needsto be
accounted for since the general form of the alternation is the same: the
transitive and intransitive usesof theseverbs differ with respectto the notion
of ' cause'. Aside from Pinker ( 1989), previous researchershave taken the
central property of these verbs to be that when intransitive they require
agentive subjects, noting that this property appears to be carried over to the
object of their transitive causative use. This work disregards the change in
status of the. directional phrase. In contrast, we believe that the directional
phrase is the key to explaining why theseverbs show the alternation. On the
other hand, the contrast between ( 15)-( 16) and ( 17) suggeststhat , although
there are agentive verbs which participate in the alternation as we have
initially defined it , this alternation may be an instance of a different phenomenon
, as we propose in section 8.
Jespersen ( 1927) calls the classof causativealternation verbs the ' move and
'
change verbs, becauseit includes a variety of verbs of motion and verbs of
change of state. The list of alternating verbs presented in (3) can easily be
divided into two subclasses along theselines:
( 18a) bake, blacken, break, close, cook, cool, dry, freeze, melt, open,
shatter, thaw, thicken, whiten, widen.
(ISb) bounce, move, roll, rotate, spin, ...
To the extent that verbs of motion involve a change of position (though not
' '
necessarilya translation through space), the set of move and change verbs
' '
might be given the unified characterization verbs of change.
. This semanticcharacterization,
although on the right track , is nevertheless
inadequate. As we will see, change of state verbs do constitute the core of the
class of intransitive verbs which alternate. However, to the extent that verbs
of manner of motion like run are verbs of motion , it remains to be explained
why they cannot appear in this alternation without directional phrases (in
contrast to non-agentive manner of motion verbs like ro//). There are also
verbs manifestingthe causativealternations which cannot be readily character-
ized as verbs of change. These include verbs of sound and light emission and
verbs of position .
include the verb cut, which Hale and Keyser ( 1987) define as in (25), or kill ,
which has been defined - albeit controversially - as ' cause to die' (Lakoff
1970, McCawley 1968, among others).
Verbs close in meaning to cut such as slice or carve do not show the
alternation ; neither do verbs related to kill , such as murder and assassinate.
9 This
property of -;ze is also noted by Keyser and Roeper ( 1984).
B. Levin, M . Rappaport Bovav I Causativeverbsin English
The behavior of -ify and -ize verbs contrasts strikingly with that of English
verbs formed with the suffix -en. The suffix -en is also arguably a causative
suffix, but verbs with this suffix appear to show the causative alternation
rather more freely.
(37a) Antonia broke the vase/ the glass/ the dish/ the radio.
(37b) The vase/the glass/ the dish/ the radio broke.
* Antonia broke the cloth the
(38a) / paper/ the innocence.
* The cloth the
(38b) / paper/ the innocence broke.
Assuming that selectional restrictions reflect the meaning of a verb, then this
pattern of selectional restrictions reflects the fact that both variants share a
common core of meaning.
However, the extent to which selectionalrestrictions are shared acrosssuch
pairs is not as great as is often thought . Smith ( 1970), whose study of the
factors that determine participation in this alternation we come back to in
section 3, points out that some intransitive verbs that typically do not enter
into such alternations may enter into them for certain specific choices of
subjectsof the intransitive use, as shown in the following examples.
The exampleswith the verbs burp and buzz show that selectional restrictions
need not be identical for the corresponding arguments in the transitive and
intransitive uses. In theseexamples, the set of possibleobjects of the transitive
use are a subsetof the set of possible subjectsof the intransitive use.
B. Levin. M . Rappaport Hovav I Causativeverbsin English
A similar example involves the verb peel. This verb does not alternate at all in
its most literal sense' remove peel from a fruit or a vegetable', although it can
be used intransitively to describethe removal of skin - a ' peel' -like covering -
from a body part . The intransitive use ~f peel seemseven to be preferred in
.the use in (46) . 11
(45a
') 1 peeledthe orange.
* The
(45b) orange peeled.
(46a) 11peeledmy nose.
(46b) My nose was peeling.
The examplesin (43)-(46) show that for some causativealternation verbs the
selectional restrictions on the object of the transitive and the subject of the
intransitive do not always coincide exactly. 12 The transitive object or the
intransitive subject may show narrower restrictions. Presumably, for those
choicesof arguments where these do not have transitive or intransitive uses,
they lack them for the same reason that some verbs never have them.
To summarize, an account of the causative alternation as defined in the
broadest sensemust explain why someverbs show this alternation freely, why
The phenomenon we are dealing with falls under the general rubric of
diathesis alternations, alternations which involve changes in the syntactic
expression of the arguments of a predicator as well as in its adicity (the
number of arguments it requires) . Therefore, our account of the alternation
should ideally be embeddedin a generaltheory which accounts for the adicity
of predicatesand the expressionof their arguments. We follow much current
researchon the lexicon (Jackendoff 1990, Rappaport et al. 1988, Rappaport
and Levin 1988, Hale and Keyser 1986, 1987; Pinker 1989, among others) in
assuming that the properties of diathesis alternations in general can be
predicted from the formulation of appropriate lexical semantic representations
for the alternating verbs together with a set of linking rules (Carter
1988), rules which determine the syntactic expression of arguments of a
predicate. As assumedin much of this literature, we postulate two levels of
lexical representation. The first is a lexical semantic representation, a representation
of the syntactically-relevant aspectsof verb meaning, which probably
takes the form of a predicate decomposition. The second is a lexical
syntactic representation or argument structure, which encodesthe syntactic
expressionof the arguments of a verb. We assumethat the lexical syntactic
. representationis derived from the lexical semantic
representationby a set of
linking rules.
Since the causative alternation verbs can be found with either one or two
arguments, a question which arises in the context of determining the lexical
semantic representation of these verbs is whether they are basically one
argument or two argument verbs. That is, are the causativeusesformed from
the non-causative ones or vice versa? We assumethat the basic use of the
verb will impose less stringent restrictions on its arguments than other uses
since the conditions associatedwith a derived use might impose additional
co~straints on the arguments of the verb. In those instanceswhere there are
different selectionalrestrictions on the transitive and intransitive uses, the use
with the looser selectional restrictions, if there is one, is the basic one. This
means that given the data discussed above, for the verb buzz it is the
B. Levin, M. RappaportHovav / Causativeverbsin English
intransitive use that is basic, but for the verb peel it is the transitive use that is
basic. The question to be asked in such instances is what aspect of verb
meaning determines that peel is basically transitive, while buzz is basically
intransitive.
The selectionalrestriction criterion still leavesopen the issueof those verbs
that appear to have similar selectionalrestrictions for both the transitive and
intransitive uses, such as break or open. (Although given the comment in
footnote 12, it is possible that for all verbs the selectional restrictions in one
variant are looser than those in the other.) In order to isolate the meaning
components which determine the (in )transitivity of a verb, we compare verbs
like break that permit transitive and intransitive uses, to verbs such.as laugh,
cry, or glitter that permit only intransitive uses (except perhaps under very
special circumstances). (In section 6 we will address the issue of what
distinguishes the break verbs from transitive verbs like cut and write, which
have only transitive, but not intransitive , uses.) The question is what makes
verbs like break on their intransitive use different from these other verbs?
Here we draw on Smith' s ( 1970) insightful discussionof the semantic factors
that playa part in determining which verbs that can be used intransitively
have transitive causativeuses.
Smith characterizesthe difference between those intransitive verbs which
'
do and do not have transitive causativeusesby meansof a notion of external
'
control . Verbs like break, Smith proposes, denote eventualitiesthat are under
the control of some external causewhich typically brings such an eventuality
about. Such intransitive verbs have transitive usesin which the external cause
is expressedas subject. Verbs like laugh and cry do not have this property :
' '
the eventualities each one denotes ' cannot be externally controlled but can
' '
be control led only by the person engaging in it ; that is, control cannot be
' '
relinquished ( 1970: 107) . Smith takes the lack of a causativetransitive use for
these verbs and other verbs such as shudder, blush, tremble, malinger, and
hesitate, to be a reflection of the presenceof internal control ; we return in
section 4 to the question of why verbs of internal control should have this
property .
Similar distinctions have been recognized in other work on English (e.g ., Hale
and Keyser 1987) and other languages (e.g ., Guerssel1986 on Berber ) .
B. Levin, M . Rappaport Hovav / Causativeverbsin English
For reasons which we explain below, we will not use Smith' s notion of
'control' for
distinguishingamong intransitive verbs which do and do not have
causativeuses. Rather, we use a related notion, distinguishing between' inter-
' ' '
nally and externally caused eventualities. With an intransitive verb denoting
an internally causedeventuality, someproperty inherent to the argument of the
verb is ' responsible' for bringing about the eventuality. On this approach, the
concept of internal cause subsumesagency. For agentive verbs such as play ,
speak, or work, the inherent property responsiblefor the eventuality is the will
or volition of the agent who performs the activity . However, an internally
caused eventuality need not be agentive. For example, the verbs blush and
tremble are not agentive, but they, nevertheless , can be consideredto denote
internally causedeventualities, becausethese eventualities arise from internal
properties of the arguments, typically an emotional reaction. 13
Verbs with an inanimate, clearly non-agentive subject, may also denote
internally causedeventualitiesin the sensethat theseeventualitiesare possible
becauseof inherent properties of their subjects. In particular , the notion of
internal cause can be straightforwardly e~tended to encompass verbs of
emission. It is an internal physical property of the argument of such a verb
which brings about the eventuality denoted by the verb. This property is
reflected in the strong restrictions that these verbs impose on possible
subjects. For example, only very few things have the properties that are
necessary to sparkle, and the same holds for other verbs of emission.
Consistent with the classification of theseverbs as internally causedis the fact
that , as mentioned in section 2, verbs of emission generally do not have
causativecounterparts, as illustrated in (48) . (We return in section 7 to cases
in which they do.)
*
(48a) The jeweller sparkled the diamond .
* '
(48b ) Max glowed Jenny s face with excitement .
*
(48c) We buzzed the bee when we frightened it .
*
(48d ) The cook bubbled the stew.
13 The verbsshudder andshake , whichat first glanceappearto havethe samemeaning , present
an interestingminimalpair. Only shake , and not shudder
, showsa transitivecausativeuse. Our
accountwouldsuggestthat shakingis externallycausedandshudderingis internallycaused . This
proposalreceivessupportfrom an examinationof the thingsthat can shakeand shudder . The
tw~ setsarenot co-extensive ; thesetof thingsthat shudderis to a largeextenta subsetof the set
of thingsthat shake. Thingsthat shudderusuallycan be thoughtof as havinga ' self-controlled'
;
body they include people, animals , and, perhapsby forcedextension , the earth or a car. In
contrast, leaves , or furniture can only shake. This difference
, teacups , like the internal versus
externalcausedistinction, reflectsthe way weconceptualize the world.
B. Levin, M . Rappaport Bovav I Causativeverbsin English
Since verbs of emission pattern with other verbs without causative counterparts
, we use the notion internal versusexternal causerather than the notion
of control . It seems inappropriate to attribute control to the inanimate
emitter argument of a verb of emission.
In contrast to internally caused verbs, verbs which are externally caused
inherently imply the existenceof an external cause with immediate control
over bringing about the eventuality denoted by the verb: an agent, an
instrument, a natural force, or a circumstance. Thus something breaks
becauseof the existenceof some external cause; something does not break
solely because of its own properties. Some of these verbs can be used
intransitively without the expressionof an external cause, but , even when no
cause is specified, our knowledge of the world tells us that the eventuality
theseverbs denote could not have happenedwithout an external cause.
.We thus assume that the intransitive verbs which have transitive uses are
externally caused, while those intransitive verbs which do not are internally
caused. A closer look at the class of alternating verbs will bear out this
suggestion.
The change of state verbs that figure prominently among the alternating
verbs describe changes in the physical shape or appearanceof some entity
that can be brought about by an external cause, be it an agent, a natural
force, or an instrument. Many of theseverbs are deadjectival; they are based
on stage-level adjectives which describe properties of entities that can be
caused to change, such as their physical characteristics, color , and temperature
(Dixon 1982) . Some examples of such deadjectival verbs taken from
Levin ( 1993) are given below in (SO); these verbs fall into two major groups,
one in which the verbs are zero-related to adjectives, as in (a), and the second
in which the verbs are formed from adjectivesthrough the use of the affix -en,
as in (b) .
( 50a) brown , clear, clean, cool, crisp, dim , dirty , dry, dull , empty, even,
finn , level, loose, mellow, muddy, narrow , open, pale, quiet, round ,
shut, slack, slim, slow, smooth, sober, sour, steady, tame, tan, tense,
thin , warm, yellow, ...
(SOb ) awaken, blacken, brighten, broaden, cheapen, coarsen, dampen,
darken, deepen, fatten, flatten, freshen, gladden, harden, hasten,
B. Levin. M. RappaportHovav I Causativeverbsin English
14
Betsy Ritter has pointed out to us the expressionSmarten up! Here the verb is related to the
' '
adjectival sense intelligent , but interestingly the verb is related to a stage-level use of the
. It
adjective appears that this adjective, like rnany other basically individual -level adjectives, can
sornetirnesbe used as a stage-level predicate.
Dowty ( 1979: 129, fn. 4) discusses other instancesin which deadjectival verbs lose sorneof the
sensesof their baseadjective. For exarnple, he notes that although the adjective tough can rnean
' '
either ' difficult or ' resistant to tearing , the verb toughencannot rnean ' rnake difficult ' . We think
that the stage-level versusindividual -level distinction could be responsiblefor at least sorneof the
differencesin available sensesthat Dowty cites including the toughenexarnple.
15 There seernsto be a
gap in the English verb inventory : there appear to be no agentive verbs
of changeof state. We do not have an explanation for their absence. In fact, we are aware of very
few internally causedverbs of changeof state at all , and those we have found , such asflower and
blo.r.rom, and, in sorne languages, blush are non-agentive. We discuss this type of verb in Levin
and Rappaport Hovav (to appear) .
B. Levin, M. RappaportBovavI Causative
verbsin English
intransitive verb. For example, while transitive break means' causeto become
broken' , intransitive break means ' become broken' . We suggest that this is
not the case. A scrutiny of the range of verb classesin Levin ( 1993) reveals
that there are no externally causedverbs without a transitive variant. That is,
all externally causedverbs have a transitive causative use, but not all of them
need have an intransitive use in which the external causeis unspecified(e.g.,
write or murder) . Given this generalization, we offer the following analysis:
internally caused verbs are inherently monadic predicates, and externally
causedverbs are inherently dyadic predicates, taking as arguments both the
external cause and the passive participant , which is often referred to as the
patient or theme. The adicity of the predicate is then a direct reflection of a
semantic property of the verb. Externally caused verbs only detransitivize
under specific circumstances; we discuss the circumstances that license the
non-expressionof the causeargument of externally causedverbs in section 6.
But it is important to stress that on our analysis externally causedverbs do
not undergo a processof causativization - they are inherently causative- but
rather a processof detransitivization. Since the majority of causativealternation
verbs are externally caused, it is the processof detransitivization that is
most pervasive in English.
The following lexical semantic representationsfor the two types of verbs
reflect the type of distinction we suggest.
there are certain classesof verbs which denote eventualities which can be
construed on cognitive grounds to be either internally or externally caused. It
is precisely with respect to these kinds of verbs that cross-linguistic variation
is expected. In fact, appropriately formulated linking rules should predict
which kinds of verbs are most likely to exhibit cross-linguistic variation . The
distinction betweeninternal and external causation seemsto do just this, and
we take it to corroborate our approach.
19 Pinker ( 1989
) points out that internally causedverbs are not expectedto have causativeuses
becausethe eventuality they denote cannot have an external causewhich is at the same time an
immediate cause; that is, such eventualities cannot be construed as being directly caused.
Although this property is probably implicated in the non- causativizability of such verbs, the
existenceof ii1;ternally causedverbs which do causativizeunder certain syntactic conditions, such
as those discussedin section 8, suggeststhat syntactic factors enter into the explanation as well.
verbsin English
B. Levin. M. RappaportBovav/ Causative
associate two arguments from a single argument structure with the same
argument structure position . General principles will determine that in languages
with causativeaffixes or verbs the introduced causewill be first in line
for being chosen as the external argument in its clause.
The Directed Change Linking Rule is similar in spirit to familiar linking
rules which associatea patient or a theme (or an equivalent notion ) with the
direct object grammatical function (Anderson 1977, Fillmore 1968, Marantz
1984, among others) . Our formulation is meant to give specific semantic
content to the notions ' patient' and ' theme' . The Directed Change Linking
Rule is meant to apply to verbs of change of state and verbs of change of
location. This second class includes verbs of directed motion such as come,
go, rise, andfa // but NOTverbs of manner of motion such as roll , run, jog , and
bounce. This difference follows because, although the action denoted by a
verb of manner of motion inherently involves a kind of change, it is not a
directed change. Tenny suggeststhat there are certain kinds of changeswhich
can be characterized ' ... as a change in a single parameter or a change on a
scale' ( 1987: 189) . We call such changes' directed changes'. Tenny. arguesthat
an argument denoting an entity which is specifiedto undergo such a changeis
realized in the syntax as a direct object. This property distinguishes a change
of state verb like dry from both agentiveand non-agentive verbs of manner of
motion like walk and roll . The verb dry specifiesa change characterizable in
tenDS of a single parameter, dryness, whereas walk and roll do not specify
such a change. In contrast, for verbs of directed motion there IS a directed
2o
change: a movement in a particular direction. The argument of a nonagentive
manner of motion verb such as roll will be a direct internal
argument, as we will see, but this linking will be effected by another linking
rule. The justification for this will be given in section 7.
The linking rules we have formulated also ensure that when a verb like
break is used transitively, the external cause will be the external argument,
and the patient, since it undergoes a specified change, will be the direct
internal argument. When a verb like break is used intransitively with only the
patient argument, the Directed Change Linking Rule will apply, and this
.
argument will be the direct internal argument. Since these verbs have s-
structure subjects when intransitive , this argument must assumethe subject
grammatical relation at s-structure, presumably as a consequenceof independent
syntactic principles. The typical GB -framework account of the expression
of the arguments of such verbs makes reference to the Case Filter ,
Durzio' s Generalization, and the Extended Projection Principle (e.g., Burzio
1986); we do not go into details here. 21
Together the Immediate Cause and Directed Change Linking Rules can be
used to predict whether the membersof the verb classesthat we discussedin
section2 will have causativeusesor not. Verbs of changeof state are inherently
dyadic verbs, so they will always have causativeuses, although not as a result
of causativization; in section 6 we elaborate on the circumstancesin which
these verbs can have monadic ' detransitive' uses. Internally caused verbs are
not expectedto have causativeuses, explaining the behavior we observedfor
verbs of emission; we discuss in section 7 why some verbs of emission
neverthelessdo have causatives. Agentive verbs of manner of motion , as
internally causedverbs, are also not expected~o have causativeuses. As seenin
section 2 these verbs do not typically have causative uses in isolation; we
discussin section 8 why theseverbs may have causativeusesin the presenceof
a directional phrase. We attribute the mixed behavior of verbs of position to a
split in the class: some of theseverbs are internally causedand others are not ,
and the internally causedverbs are not expectedto have a causativeuse.
These linking rules leave open the question of what happens with an
argument that falls under neither of the linking rules introduced in this
section. Here we make the assumption, which we justify in Levin and
Rappaport Hovav (to appear), that an argument that is not linked by one of
these two linking rules will be a direct internal argument rather than an
external argument.22
The Default Linking Rule will apply to the theme (located) argument of
transitive sit , stand and other externally caused verbs of position, since this
argument neither causes the eventuality denoted by the verb nor does it
23
undergo a specified change. We return to the Default Linking Rule in
sections7 and 8, where we illustrate its applicability more fully .
5. The Un8ccaativeHypothesis
-intransitive
(60a) break : - [vpV NP]
60b : NP
( ) laugh [vpV]
Given the definitions of unaccusativeverbs as verbs taking a single direct
internal argument and unergative verbs as verbs taking a single external
argument, the linking rules proposed in section 4 will receivesupport if there
is evidencethat internally caused verbs are unergative and externally caused
verbs, when monadic, are una(x:usa . tive. We review two uoa(x:usa. tive diagnostics
;
that can be used to support this claim ; for further discussion see Levin and
Rappaport Hovav (to appear) .
23 Wedo notdiscuss of theapplication
theseverbsfurtherin thispapersincea full account of
the linkingrulesto theseverbswouldrequireus to introducecertaincomplications in their
behavior. Wediscuss in LevinandRappaport
thesecomplexities Hovav(to appear ). However,
wewouldliketo pointout thatouraccount thattheexternally
suggests caused verbsof position
shouldbebasically .
transitive
B. Levin, M . Rappaport Hovav / Causativeverbsin English
Thus the different patterns of the resultative construction correlate with the
status of a verb as unaccusativeor unergative: a monadic verb which allows a
resultative phrase to be predicated directly of its subject is unaccusative, while
a monadic verb which allows such a phrase to be predicated of an object
- either a ' fake' reflexive or a non-subcategorizedobject - is unergative.
The closely related X ' s way construction is also an unaccusativediagnostic.
This construction, in which a resultative phrase is predicated of the subject of
a verb through the use of the phrase ' X ' s way' in object position, is found
with unergative. verbs, but not with unaccusative verbs (Jackendoff 1990,
Marantz 1992) .
24 SeeLevin and
Rappaport Hovav (to appear) for an explanation of the differential behavior
of the two classesof verbs in the resultative construction, and Hoekstra ( 1992) f~r an alternative
account.
B. Levin, M . Rappaport Hovav / Causativeverbsin English
25 Given their
unergative classification. we would not expect these verbs to pattern as
unaccusativeverbs with respectto the resultative construction. In actual fact. some of theseverbs
are found in the unaccusativeresultative construction. but as we discussin Levin and Rappaport
Hovav (to appear) their unaccusative behavior correlates with a shift in meaning. with the
additional meaning being one that is typically associatedwith an unaccusativeclassification.
B. Levin, M . Rappaport Hovav I Causativeverbsin English
section 4. In that work , we also show that there are some verbs which are
compatible with both internal and external causation. Theseverbs include the
non-agentive verbs of manner of motion such as roll and bounce and the
verbs of position . As we show in that work , with such verbs external
causation is correlated with unaccusative status, while internal causation is
correlated with unergative status.
Smith proposesthat the verbs of change that may be used intransitively are
'
precisely those in which the change can come about independently in the
'
sensethat it can occur without an external agent ( 1970: 102) . She identifies
independenceand external control - the notion which we have subsumed
under our notion external cause - as the two features which characterize
verbs of change. Independence allows for the possibility of intransitive
counterparts, and external control or causation allows for the possibility of a
'
transitive causativeuse. Smith s observation can also be recast as follows : the
transitive verbs that detransitivize are those in which the eventuality can
happen spontaneously without the volitional intervention of an agent. We
believe that this property is reflected in the ability of such verbs to allow
natural forces or causes, as well as agents or instruments, as external. causes,
and, hence, as subjects, as illustrated with the alternating verb break.
B. Levin, M. RappaportHovav/ Causative
verbsin English
( 70) The vandals / the rocks / the stonn broke the windows .
Verbs such as break contrast with verbs such as murder, assassinate , write.
and build. These four verbs, as well as any other verbs which, like them,
denote eventualities that require the participation of a volitional agent and do
not admit natural force subjects, will not detransitivize, despite the fact that
their meaningsinvolve a notion of ' cause'.
In fact , these four verbs are among those that require an agent in the
strongest sense: they do not even allow an instrument as subject .
(72a) *Theknifeassassinated
/murdered
thecandidate
.
(72b) *Thepenwrotetheletter.
(72c) 11The
cranebuilt thehouse.
A verb like cut shows that the set of verbs that do not detransitivize is not
limited to verbs which restrict their subjects to volitional agents. Although
this .verb does not typically allow natural force subjects, it does allow
instruments in addition to agents as subjects. 26
The behavior of a verb like cut can receive an explanation. Its meaning
includes a specification of the means involved in bringing the action it
denotes about, which in turn implies the existence of a volitional agent.
Specifically, the very meaning of the verb cut implies the existenceof a sharp
instrument that must be used by a volitional agent to bring about the change
26 SeeBrousseauand Ritter ( 1991) for further discussionof the circumstancesthat allow verbs
to take both instruments and agents as subjects.
B. Levin, M . Rappaport Bovav / Causativeverbs"in English
of state denoted by the verb. If the samechange of state were to come about
without the use of a sharp instrument, then it could not be said to have come
about through cutting , showing that the choice of instrument makes cutting
cutting .
Perhaps the same considerations can explain the behavior of the verb
remove, which does not have an intransitive form . Its non-existence might
seemsomewhat surprising since at a first approximation this verb' s meaning
' '
might be paraphrased as cause to become not at some location . A closer
look at the verb remove's meaning reveals that the eventuality it denotes is
brought about by a volitional agent, as shown by the oddnessof the examples
in (75), which have inanimate non-volitional subjects.
Again , the -iff and -ize verbs that do and do not permit intransitive uses
contrast ,with respectto the range of subjectsthey permit when transitive. The
verbs that resist detransitivization show a narrower range of subjects when
transitive ; specifically, they appear to exclude natural force subjects.
' Hollall I CausativelIerbsin English
B. Levin, M . Rappapor
Other alternating -iff and -ize verbs are denominal; their meaning may be
'
paraphrasedroughly as causeto turn into the substancenamed by the noun
that the verb is based on' : caramel for caramelize, powder for pulverize, gas
for gasify, and so on.
The non-alternating -iff and -ize verbs also include some denominal verbs
whose stems are nouns that name substances: zincify , carbonize, and lodize.
But -what is interesting is that the meaning of these non-alternating verbs is
different from that of the alternating verbs: it could be paraphrased as
' ' '
process or treat using the substance rather than cause to turn into the
'
substance. We suggest that due to this difference in meaning, these verbs
require an agent and hence do not detransitivize. In fact, if zincify meant
' turn to zinc' rather than ' '
processwith zinc , we would predict that the verb
could alternate, and our own intuitions , as well as those of others we have
consulted, is that it would. A preliminary examination of a wider range of
non-alternating -iff and -ize verbs suggeststhat many describe changesthat
involve a particular type of processing or treatment, as with the previously
cited verbs homogenize and pasteurize or as with the verbs sterilize or
vulcanize. Other non-alternating verbs involve changesof state that only come
about through the active intervention of an agent, such as legalizeor sanctify.
The constraint on detransitivization also explains why some verbs have
intransitive usesonly for certain choicesof patient : it is only for thesechoices
of patient that the change can come about without the intervention of an
agent. For instance, in section 2 we noted the following contrasts involving
the verb clear:
Hovav
, M. Rappaport
B. Levin I Causative inEnglish
verbs
Our knowledge of the world tell us that tables and sidewalks are things that
are cleared (of dishes and snow, respectively) through the intervention of an
animate agent. The sky, however, can clear through the intervention of
natural forces, such as the wind . Thus the difference in the possibility of
intransitive counterparts.
-
Similarly, peeling - causing an entity to lose an outer layer is typically
brought about through the actions of a volitional agent, particularly if a fruit
or vegetable is involved. However, there are certain entities that lose their
outer layers due to natural causesrather than through the action of an agent,
and in these instancesthe verb peel can be used intransitively , as in the case
of the loss of skin from a person, as illustrated in (84) .
7. Why can some intern aBy caused verbs have a causative use?
the verbs burp and buzz, which we have seen are internally caused, can be
used transitively for certain types of arguments, as in the examplesbelow.
with transitive causative usesthan there are verbs of sound emission since in
most instances the entities of which verbs of light emission are predicated
emit light without the intervention of an external cause, unless theseentities
are devices. More verbs of sound emission than verbs of light emission are
predicated of entities which emit a sound only under manipulation by an
external cause. Some verbs of emission, such as sparkle and burble cited in
section 2, never have causative transitive uses. It is unclear to us at this point
whether some verbs of emission lack causative uses because they denote
eventualities in which causation simply cannot be assumed by an external
cause - that is, they are necessarily internally caused - or because, even
though external causation may be possible, the set of verbs denoting eventu-
alities compatible with both internal and external causation is explicitly
learned from examples.
We can now propose an explanation for why burp is apparently the only
verb denoting a bodily processwith a transitive causative use. One of the few
feasible instances of external causation of a bodily process is burping as it
applies to babies. Babies are incapable of burping by themselves, so that the
person caring for the baby must assumecontrol of the burping . Thus the verb
burp can be used transitively only when babies are involved.
We propose then that the eventualities denoted by a small number of
English verbs are compatible with either internal or external causation, giving
rise to both an intransitive use and a transitive causative use of these verbs.
Sincethe causativeuse, when available, is associatedwith direct manipulation
of the emitter by an external cause, we assume that in such instances the
emitter is no longer viewed as the causeof the eventuality, and that the only
cause is the external cause which manipulates the emitter. The Immediate
Cause Linking Rule will apply to the external cause, so that it will be the
external argument. The Default Linking Rule will apply to the emitter, since
it does not meet the conditions on the other linking rules, and it will be the
direct internal argument.
As mentioned earlier, certain verbs of manner of motion have meanings
compatible with either internal or external causation. These verbs include the
set of verbs of manner of motion which are not necessarilyagentive, such as
swing, bounce, or roll . In Levin and Rappaport Hovav (to appear) we provide
evidence that a verb like roll is in fact unaccusativewhen predicated of an
inanimate entity, as in The ball rolled ( on the floor ) , but unergative when
used agentively, as in The dog rolled ( on thefloor ) . This behavior is just what
our analysis predicts. When internal causation is involved, the Immediate
Cause Linking Rule will ensure that the single argument, as the internal
verbsin English
B. Levin. M. RappaportHovavI Causative
cause, will be the external argument, and the verb will be unergative. When
external causation by an agent or a force, such as a push or gravity, is
involved but no overt cause is expressed, the single argument will be the
direct internal argument due to the Default Linking Rule, and the verb will
be unaccusative. (The Directed Change Linking Rule does not apply since
there is no directed change; the verb roll is atelic in the absence of a
directional phrase.)
These verbs are internally caused monadic predicates. By the linking rules,
their single argument should be an external argument; therefore, contrary to
fact, these verbs are not expected on our analysis to have the transitive
causativeuseswhich some of them do manifest. In this section we provide an
account of why some members of this set of internally caused verbs have
causativeuses.
In the process, we will also provide an answer to another question that is
posed by the linking rules that figure in our account of causatives. We have
formulated two linking rules which associatearguments with the notion of
direct internal argument, and one which associatesarguments with the notion
of external argument. Since one of the rules linking arguments to direct
internal argument is a default rule, a natural question to ask is why we need
the other rule that links arguments to direct internal argument, the Directed
'
Change Linking Rule, at all. Couldn t the Default Linking Rule alone yield
the same results? For example, if we dispensed with the Directed Change
Linking Rule, the Default Linking Rule could be applied to the verb break
with the desired result. This section explains why both linking rules are
needed. We illustrate the necessityof the Directed Change Linking Rule using
the behavior of agentive verbs of manner of motion with respectto causativi-
zation.
B. Levin, M . Rappaport Hovav / Causativeverbsin English
29 We do not
repeat this evidence here; see Hoekstra
" ( 1984), L. Levin ( 1986), Levin and
Rappaport Hovav ( 1992), C. Rosen ( 1984), among others.
B. Levin, M. RappaportHovav/ Causative
verbsin English
(95a) ... severalstrong Teamsters... shuffled Kit out of the room ... (L .
Matera, A Radical Departure, 1988; Ballatine, New York , 1991, p . 79)
"
(95b) ... I promised Ms. Cain I would ride her around the ranch
. . ." (N . Pickard, Bum Steer, Pocket Books, New York , 1990, p . 92)
* The trainer
(96a) logged the runners to the finish line.
* The leader climbed the team to the summit.
(96b)
30 This account leavesunexplained the fact, noted also in Pinker ( 1989), that verbs of directed
motion which are not verbs of manner of motion do not have causative counterparts: . She
arrived the package ( at the store) . We believethat the lack of causativeswith theseverbs may not
be a problem for our accou~t of causativesof verbs of motion . We suspectthat theseverbs are
best not characterized as verbs of motion for several reasons, but rather they should be
considered verbs of appearance. Interestingly, verbs of appearance, for reasons that we do not
.
fully understand, do not permit causative uses: The magician appeareda dove (from his sleeve) .
Pinker suggeststhat the semantic conditions we formulate here are only necessaryconditions for
participation in the alternations. He proposes that membership in lexically specified semantic
subclasses of verbs determinesthe sufficient conditions for participation indiathesis alternations
in general. These subclasses are implicated in what Pinker caDs narrow-range lexical rules. It
remains to be seenwhether the lack of causative usesfor certain classesneeds to be stipulated
lexically as Pinker suggestsor can be shown to follow from more general principles.
31 It is clear from the context that in (9Sb) the riders are actually on separatehorses; that is, the
exampledoes not have the accompanimentinterpretation found in sentencessuch as I walkedmy
dog, which might be argued to instantiate a distinct phenomenon from the phenomenon being
discussedhere.
B. Levin. M. RappaportHovavI Causative
verbsin English
(98) Julie Smith will stroll you through the Garden District , in New
OrleansM Durning... (NewYorkTimes
)
On this account, agentive verbs of manner of motion enter into a real
' '
processof causativization , in the sensethat the causativeform is the derived
form. The account of the causativeforms of theseverbs contrasts with that of
the causative forms of verbs like break, which we have argued are basically
dyadic and enter into a process of detransitivization. This analysis, as we
mentioned in section 3, is corroborated by the fact, noted in Hale and Keyser
( 1987), that cross-linguistically it is the causative form of such verbs which
tends to be morphologically marked. 32
32 Another fact which suggests that the pr~ involvedwith verbsof mannerof motion is
differentfrom theoneinvolvedwith verbsof changeofstatc is pointedout by Reinhart( 1991). She
notesthat the introducedsubjectin a transitiveuseof a verb of mannerof motion must be an
agent, not an instrumentor naturalfon:e. CompareTheriderjumpedthehorseoverthefencewith
. Thewhip/thelightningjumpedthehorseoverthe
fence. Thispropertyis alsonotedin Cruse( 1972)
verbsin English
B. Levin, M. RappaportBovav/ Causative
When its argument is inanimate, the eventuality denoted by the verb would
be externally caused. The argument would be an internal argument by the
Default Linking Rule, since neither of the other two linking rules would be
applicable, and the verb would be expected to show unaccusativebahavior.
In fact, the verb cannot be found in an unergative type resultative construction
with an inanimate subject, as shown in ( I OOa), though it can be found in
an unaccusativetype resultative construction, as shown in ( l00b ) .
B. Levin
, M. Rappaport
HovavI Causative
verbsin English
( I OOa) * The bowling balls rolled the markings off the floor .
(cr. The basketball players dribbled the markings off the floor .)
I
( OOb) The door rolled open/ shut.
The contrasting behavior of the verbs roll and run supports our account of
thesetwo verbs.
9. Conclusion
33 We
argue in Levin and Rappaport Hovav (to appear) that verbs of inherently directed
motion such as arrive are unaccusativeand monadic.
B. Levin, M . Rappaport Hovav I Causativeverbsin English
References
Anderson , S.R., 1977 . Commentson the paper by Wasow. In: P. Culicover, A. Akmajian,
T. Wasow(eds.), 1977 , 361- 377.
Baker, M., 1988 . Incorporation: A theory of grammaticalfunction changing . Chicago , IL :
Universityof ChicagoPress .
Borer, H., 1991 . TheCausative -inchoativealternation: A casestudyin parallelmorphology . The
LinguisticReview8, 119- 158.
Bresnan . J. and A. Zaenen , 1990 . Deep unaccusativityin LFG. In: K. Dziwirek, P. Farrell,
E. Mejias-Bikandi (eds.), Grammaticalrelations: A cross-theoreticalperspective , 45- 57.
Stanford, CA: CSLI, StanfordUniversity.
Brousseau , A.-M. and E. Ritter, 1991 . A non-unifiedanalysisofagentiveverbs. WCCFL 10.
Burzio. L., 1986 . Italian syntax: A Government -Bindingapproach . Dordrecht: Reidel.
Carter, R., 1988 . On linking: Papersby RichardCarter. (LexiconProjectWorking Papers25,
editedby B. Levinand C. Tenny.) Cambridge , MA : Centerfor CognitiveScience , MIT .
Comrie, B., 1981 . Languageuniversalsand linguistic typology. Chicago , IL : University of
ChicagoPress .'
Cruse. D.A., 1972 . A noteon Englishcausatives . LinguisticInquiry 3, 522- 528.
Culicover . P., A. Akmajianand T. Wasow(eds.), 1977 . Formal syntax. New York: Academic
Press .
Dixon, R.M.W., 1982 . Wherehaveall the adjectivesgone? In : R.M.W. Dixon, Wherehaveall
the adjectives ?
gone and otheressaysin semantics and syntax, 1- 62. Berlin: Mouton.
Dowty, D.R., 1979 . Word meaningand Montaguegrammar. Dordrecht: Reidel.
Dowty, D.R., 1991 . Thematicproto-rolesand argumentselection . Language67, 547- 619.
Fillmore, C.J., 1967 . The grammarof hilling and breaking . In: R. Jacobs , P. Rosenbaum (eds.),
Readingsin Englishtransformationalgrammar, 120- 133. Waltham, MA : Ginn.
Fillmore, C.J.. 1968 . The casefor case. In: E. Bach, R.T. Harms(eds.), Universalsin linguistic
theory , 1 - 88 . New York: Holt, Rinehartand Winston.
Fontenelle , T. and J. Vanandroye , 1989. Retrievingergativeverbs from a lexical data base.
DictionariesII , 11- 39.
Guerssel , M., 1986 . On Berberverbsof change: A study of transitivity alternations . (Lexicon
ProjectWorkingPapers9.) Cambridge , MA : Centerfor CognitiveScience , MIT .
Hale. K.L. and S.J. Keyser , 1986 . Sometransitivity alternationsin English. (LexiconProject
7.
WorkingPapers CambridgeMA : Centerfor CognitiveScience
) , , MIT .
Hale, k .L. and S.J. Keyser, 1987 . A viewfrom the middle. (LexiconProjectWorkingPapers10.)
Cambridge , MA : Centerfor CognitiveScience , MIT .
Haspelmath , M., 1993 . More on the typology of inchoative /causativeverb alternations . In:
B. Comrie(ed.), Causatives and transitivity. Amsterdam : Benjamins .
Hoekstra , T., 1984 . Transitivity. Dordrecht: Foris.
Hoekstra , T., 1988 . Smallclauseresults. Lingua74, 101- 139.
Hoekstra . T., 1992 . Aspectand thetatheory. In: I.M. Roca(ed.), 1992 , 145- 174.
Hornby, A.S. (ed.), 1974 . Oxford advancedlearner's dictionary of current English. Oxford:
Oxford UniversityPress .
Jackendoff , R.S., 1990 . Semanticstructures . Cambridge , MA : MIT Press .
Jesperse n, 0 ., 1927 . A modemEnglishgrammaron historicalprinciples , Volume3. Heidelberg :
Carl Winters.
verbsin English
B. Levin, M. Rappaport Hovav/ Causative
thewordunits
Discovering
) 81- 104. North-Holland
Lingua92 ( 1994
Anne Cutler . .
MedicalResearch
Council.AppliedpsychologyUnit. 15 ChaucerRoad. Cambridge
CB22EF. UK
assumethat complete utterancesare not what lexicons consist of. The number
of utterances is potentially infinite , so that to store all the utterances we
might ever hear our lexicon would also have to have infinite capacity. Even if
we set an arbitrary length limit , the number of possible utterances is enormously
large; for instance, Miller ( 1967) calculated that there must be at least
1020possible English sentencesof twenty words or less - a total which, he
drily added, would take considerably longer than the estimated age of the
earth to speak.
Instead, we assume that the contents of a lexicon consist of sound-to-
meaning mappings in discrete chunks. ( Wecan refer to lexical entries by the
shorthand term ' words' , although of course not aU lexical entries necessarily
correspond to what would be written as a single separate word. Some subword
forms such as affixes or stem morphemes may well have lexical
representation, as may particles which are conjoined with other words in
writing ; likewise, multi -word idiomatic expressionsand frequently occurring
phrasesmay be representedby a single entry .) Thus using a lexicon requires
the separation of utterances into the lexically relevant chunks of which they
are made up - producing speechrequires the languageuser to string together
lexical entries to make a whole utterance, and recognising speech requires
division of an utterance into units which can be looked up in the lexicon.
Likewise, acquiring a lexicon eventually involves acquiring the ability to use it
in theseways.
The present contribution focusseson the very start of lexicon-building :
how the infant might find out what words in the input languageare like , and
might assemblean initial stock of known words. The initial task is perceptual.
What exactly does it involve? For instance, does it involve (as mature useof a
lexicon in speechrecognition involves) division of multiword utterancesinto
lexically relevant chunks? And if so, how difficult is this task? To answer
thesequestions we need to consider the nature of the speechinput with which
the infant is most likely to be confronted. Comparative studies of various
types of speechare consideredin the next section.
more varied and less likely to expressdefault accenting than those in read
utterances, and the acoustic differences between strong and weak syllables
were greater in spontaneousthan in read speech, there was greater opportunity
for processing effects of both sentence accent and syllable stress to
appear in the spontaneousthan in the read speech; this would account for the
finding of significant facilitation due to sentenceaccent and syllable stressin
the fonner but not in the latter. The results of the gating studies described
above provide similar evidenceof the perceptual importance of syllable stress
in spontaneousspeech.
These findings speak to the majority case for speech processing. Most
speech that adult listeners hear is spontaneously produced. Such speech is
characterised by a fairly slow overall rate of speech, short prosodic units,
frequent pausesand, in English, a clear opposition betweenstrong and weak
syllables. These factors affect the way the speechis processed.
The one exception is that infant -directed speechis reported to have higher
pitch and a wider fundamental frequency range ( Fernald and Simon 1984,
Garnica 1977) . In contrast, the fundamental frequency range of spontaneous
speechhas been reported in at least some studies to be relatively narrow , at
least in intimate conversation (Johns-Lewis 1986, Blaauw 1991) . Pitch is a
particularly important dimension of infant -directed speech, since the fact that
infants prefer to listen to this style of speech(Fernald 1985) has been found
to be principally due to its pitch characteristics (Fernald and Kuhl 1987,
Sullivan and Horowitz 1983) .
Ohala ( 1983, 1984) has argued that raised pitch is an ethologically universal
signal of smallness, ingratiation and non-threatening attitude. From such a
perspective it would be possible to argue that raised pitch might not be a
phonologically relevant manipulation in speechto infants, but might simply
arise from universal expressionof affection or nurturance on the part of an
adult to an infant . Against this conclusion, on the other hand, might be cited
the more recent findings that the pitch manipulations found in infant -directed
speech in American English and related languagesare apparently not universal
. Although rising contours predominate in infant -directed speechin the
stresslanguagesEnglish (Sullivan and Horowitz 1983) and German (Fernald
and Simon 1984), falling contours are more prevalent in the tone languages
Mandarin (Grieser and Kuhl 1988) and Thai (Tuaycharoen 1978) . In a
comprehensive review of the literature on pitch in infant -directed speech,
Shute ( 1987) concluded that pitch modifications are not only clearly not
universal across languages, but may also differ within one language as a
function of sex of the speaker, age of the child addressee , frequency of the
' s interaction with children and other factors.
speaker
In fact a recognisablestyle of infant -directed speechis itself not universal,
contrary to the confident expectations of researchersin the 70s that it would
prove not only to be universal (Ferguson 1977) but absolutely necessaryfor
successfulacquisition (R. Brown 1977) . It is now clear that there are cultures
where infants are exposedto much normal adult speechbut no speechin any
special infant -directed mode (Heath 1983, Schieffelin 1985, Schieffelin and
Ochs 1983) . Even where infant -directed speech appears to conform to the
pattern obserVedin English and like languages, this may not constitute a
specialisedmode; thus infant -directed speechin Quiche Mayan has relatively
high pitch , but so, in this language, does adult -directed speechfrom the same
informants (Bernstein-Ratner and Pye 1984) .
Thus it is reasonable to conclude that infants in the earliest stages of
- and for some
language acquisition receive at least some of their input
A. Cutler/ Segmentation
problems
infants, perhaps all of their input - in a fonn that at least closely resembles
nonnal spontaneous speech between adults. One of the characteristics of
spontaneousspeech, it will be recalled, is the high frequency of phonological
elisions and assimilations (G. Brown 1977, Labov 1972, Milroy 1980) . Some
studies have reported that child -directed speech, too , is replete with such
distorting processes (Bard and Anderson 1983, 1991; Shockey and Bond
1980), which is consistent with the view that this style of speechlies on a
generalcontinuum with adult-directed casual speech. Other studies, however,
have reported lower frequency of distorting phonological transformations in
speechto infants than in speechto adults (Bernstein- Ratner 1984a,b) . In an
attempt to resolve this apparent contradiction , Stoel-Gammon ( 1984) transcribed
five hours of speechto one-year-olds; her results strongly support the
view of a continuum , since she effectively discovereda continuum within her
own data, from very clear articulation (e.g. releaseof word -final stop bursts,
clear articulation of unstressedvowels) to very casual forms (frequent vowel
reduction, omissions of whole syllables such as [sko] as a pronunciation of
'
let s go) . Stoel-Gammon concluded that the phonological characteristics of
speech to children depend on such factors as contextual redundancy, the
function of the individual utterance, and the situational context - the same
factors that determine the phonologicalforms of adult spontaneous speech
(Liebennan 1963, Cheshire 1982) .
There is to my knowledge no evidence, from any culture, of a greater
incidence of isolated words in speech to children than in other forms of
speech. Even though phrases may be short, they are still phrases. Thus a
speechsegmentationproblem, as describedin the introductory section to this
paper, seemsto exist for the infant as for the adult languageuser. The speech
'
that the infant hears is continuous; much of the speech of the infant s
environment will be speech among mature language users; in perhaps a
majority of cultures speechaddressedspecifically to the infant would fonn
only a small proportion of the input ; even then, such speech may not
necessarilybe clearly articulated. The problem is compounded for the infant
by the necessity of compiling a lexicon, and this added difficulty does not
trade off against reduced segmentation difficulty in the input . In fact, the
scaleof the segmentationproblem in the structure of the input is remarkably
similar for the infant and for the adult.
A. CutlerI Segmentation
problems
than two-thirds of all weak syllables w~re the sole or initial syllables of
grammatical words.
This means that a listener encountering a strong syllable in spontaneous
English conversation would seem to have about a three to one chance of
finding that strong syllable to be the onset of a new lexical word. A weak
syllable, on the other hand, would be most likely to be a grammatical word.
English speech should therefore lend itself to a segmentation procedure
whereby strong syllablesare assumedto be the onsetsof lexical words. Cutler
and Norris interpreted results of an experiment they ran as evidencefor such
a procedure. They used a task which they called word -spotting, in which
listeners were asked to detect real words embeddedin nonsensebisyllables;
detection times were slower to the embeddedword in , say, mintayf (in which
the second vowel is strong) than in mintef (in which the second vowel is
schwa) . Cutler and Norris interpreted this as evidence that listeners were
segmenting mintayf prior to the second syllable, so that detection of mint
therefore required combining speechmaterial from parts of the signal which
had been segmentedfrom one another. No such difficulty would arise for the
detection of mint in mintef, since the weak second syllable would not be
divided from the preceding material.
Further evidence for such a procedure was produced by Cutler and
Butterfield ( 1992), who investigated the way in which word boundaries tend
to be misperceived. In both spontaneousand experimentally elicited misperceptions
they found that erroneous insertions of a word boundary before a
strong syllable (e.g. achievebeing heard as a cheap) and deletions of a word
boundary before a weak syllable (e.g. bird in being heard as burgling) were far
more common than erroneous insertions of a boundary before a weak
syllable (e.g. effective being heard as effect oJ> or deletions of a boundary
before a strong syllable (e.g. were waiting being heard as awaken) . This is
exactly what would be expected if listeners deal with the segmentation
-
problem by assuming that strong syllables are likely to be word initial , but
weak syllables are not.
As Cutler and Norris point out , the strong syllable is defined by the quality
of its vowel (full , in comparison to the reduced vowels of weak syllables) ;
thus spotting strong syllables cannot provide a complete solution to the
segmentation problem since word boundaries actually occur prior to the
onset of syllables. A strong syllable spotter must be supplementedby some
means of estimating actual syllable onset; Cutler and Norris suggest that
more than one alternative realisation of such a device would be feasible.
-
Assuming that a rhythmically based segmentation procedure is indeed prac
A. Cutler I Segmentationproblems
tation algorithm still performed effectively (in fact it correctly detected more
word boundaries in uncertain input tharl the phoneme sequenceconstraints
had detectedin completely specifiedinput) .
The efficiency and robustness of rhythmic segmentation therefore suggest
that listeners profit from employing an explicit segmentation procedure of
this kind . A striking fact about this procedure, however, is its language-
specificity: as described for English, the procedure is based on stressrhythm ,
i.e. the opposition of strong and weak syllables. Clearly, it therefore cannot
be a universal strategy, becausemany (indeed most) languagesof the world
do not have stress rhythm . However, all languages have rhythm - speech
rhythm need not be stress-based. In the next section alternative forms of
rhythmic segmentation are described, supported by the results from experiments
in languageswhich do not have stressrhythm .
Mehler (e.g. 1981) and his colleagues(e.g. Segui 1984) have used a variety
of psycholinguistic tasks to demonstrate processing advantagesfor syllables
in speech comprehension. In one experiment, which launched a series of
cross-linguistic comparisons, Mehler et al. ( 1981) had French subjects listen
to lists of unrelated words and press a responsekey as fast as possible when
they heard a specifiedword -initial sequenceof sounds. This target was either
a consonant-vowel (CV) sequencesuch as ba- or a consonant-vowel-consonant
(CYC) sequencesuch as hal- . The words which began with the specified
sound sequencehad one of two syllabic structures: the initial syllable was
either open (CY ), as in balance, or closed (CYC ), as in balcon. Mehler et al.
found that responsetime was significantly faster when the target sequence
corresponded exactly to the initial syllable of the target-bearing word than
when the target sequenceconstituted more or less than the initial syllable.
Thus responsesto ba- were faster in balancethan in balcon, whereasresponses
to bal- were faster in balcon than in balance. Mehler et al. claimed that this
result supported a syllabically basedsegmentationstrategy in speechrecognition
in French. Similarly, Segui et al. ( 1981) found that listeners are faster to
detect syllable targets than to detect targets corresponding to the individual
phonemeswhich make up those samesyllables. Further evidencefrom French
that polysyllabic words, whether they are heard in isolation or in connected
speech, are analysed syllable by syllable came from studies by Segui ( 1984)
and by Dupoux and Mehler ( 1990) .
A. Cutler / Segmentationproblems
7. Limits to rhythmicsegmentation
al. 1986) ; neither English nor French listeners use moraic segmentationwhen
performing the same task with Japanese(Otake et al. 1993). In other words,
syllabic segmentationseemsto be specific to French listeners, moraic segmentation
to Japanese listeners. (In fact the French listeners segmented both
English and Japanesespeechby syllables, just as they segmentFrench!)
Moreover, under appropriate conditions listeners can be seen to abandon
the rhythmic segmentation procedures characteristic of their language
community . When responding very fast, French listeners can base their
responseson subsyllabic units (Dupoux 1993) . CVV sequencesare apparently
less conducive to application of moraic segmentation by Japaneselisteners
than the (more common) CVCV and CVN sequences(Otake 1992). The
failure to find processing disadvantages for English words beginning with
weak syllables when the words are carefully read, reported in the second
section of this paper, may reflect a similar case: if the input is very clear,
stress-basedsegmentationmay not need to be called into play. Thus it is quite
clear that none of the rhythmic segmentation procedures constitutes an
'
absolutely necessarycomponent of adult listeners speechprocessing.
The strongestevidencethat this is so comes, however, from studiesof bilingual
processing . Cutler et al. ( 1992) testedFrench-English bilinguals with the techniques
which had demonstratedsyllabic respondingin French listeners(Mehler et
al. 1981) and stress-based responding in English listeners(Cutler and Norris
1988). Their subjectswereas bilingual as they could find - eachhad learnedboth
languagesfrom the earlieststagesof acquisition, spokeboth languagesdaily, and
was acceptedas a native speakerby monolingualsin eachlanguage.
Yet these bilinguals did not necesarilyproduce the pattern of results which
monolinguals had shown on each previous experiment. Instead, their response
patterns could be predicted from a measureof what Cutler et al. called
' '
language dominance , which amounted in essenceto a decision as to which
of their two languagesthe bilinguals would be most sorry to lose. On Mehler
et alis target detection task with French materials, only those bilinguals who
'
chose French as their ' dominant language showed a syllabic pattern of
responding; the English-dominant bilinguals showed no trace of syllabic
'
effects. On Cutler and Norris word -spotting task, in contrast, a stress-based
responsepattern appeared only with those bilinguals who chose English; the
responsesof the French-dominant bilinguals were unaffected by the rhythmic
pattern of the embedding nonsense word. Apparently , these maximally
competent bilinguals had available to them in these tasks only one rhythmic
segmentationprocedure - either that which was characteristic of one of their
native languages, or that which was characteristic of the other, but not both.
A. Cutler/ Segmentation
problems
arguably the casethat they may not result automatically from experiencewith
the statistical properties of the native language. A possible answer to this
question, proposed by Cutler et al. ( 1992) and by Cutler and Mehler ( 1993), is
discussedin the next section.
9. Prosodyandthe preHnguisticinfant
( 1993) found that nine-month -old infants acquiring English preferred to listen
to lists of bisyllabic words with initial stress (crossing, former , cab/e) than
bisyllables with final stress (across, before, decay), although no such preferences
appeared with six-month-olds. Even when the lists were low-pass
filtered to remove most of the segmental information , nine-month -olds still
preferred the initial -stress lists, suggestingthat their preferenceswere based
on prosodic structure. Jusczyk et ale argued that during the second half of
their first year, infants exercisetheir ability to segmentspeechwith the result
that they acquire knowledge of the typical prosodic structure of words in the
input language.
At later ages, language-specific exploitation of rhythmic structure by
children is established: children learning English use stressrhythm in segmentation
(Gerken 1991, Gerken et ale 1990, Peters 1985) ; children learning
French and other languageswith syllable rhythm use syllables (Alegria et ale
1982, Content et ale 1986) ; children learning Japanese use morae (Mann
1986). The hypothesis proposed here is that language rhythm is also what
allows infants to accomplish their very first segmentation of speech. An
ability to processrhythm is inborn . By using this ability , infants are enabled
to overcomethe segmentationproblem and hencetake their first step towards
compilation of their very own lexicon.
References
Dupoux, E., 1993 . The time courseof lexicalprocessing : The syllabichypothesisrevisited . In:
G.T.M. Altmann, R.C. Shillcock(eds.), Cognitivemodelsof speechprocessing : The Sperlonga
MeetingII , 81- 114. CambridgeMA : MIT Press .
Dupoux, E. and J. Mehler, 1990 . Monitoring the lexiconwith normal and compressed speech :
Frequencyeffectsand the prelexicalcode. Journalof Memoryand Language29, 316- 335.
Ferguson , C., 1977 . Baby talk as a simplifiedregister . In : C.E. Snow, C.A. Ferguson(eds.),
Talkingto children: Language input andacquisition , 209-235. Cambridge : CambridgeUniversity
Press.
Fernald, A., 1985 . Four-month-old infants prefer to listen to motherese . Infant Behaviorand
Development 8, 181- 195.
Fernald, A. and P. Kuhl, 1987 . Acousticdetenninantsof infant preference for motheresespeech .
Infant Behaviorand Development10, 279- 293.
Fernald, A. andT. Simon, 1984 . Expandedintonationcontoursin mothers '
speechto newborns .
Developmental Psychology 20, 104- 113.
Fernald, A., T. Taeschner , J. Dunn, M. Papousek , B. de Boysson -Bardiesand I. Furui, 1989 .A
cross-languagestudy of prosodicmodificationsin mothers ' and fathers'
speechto preverbal
infants. Journalof Child Language16, 477- 501.
Garnica, 0 ., 1977 . Someprosodicand paralinguisticfeaturesof speechto youngchildren. In:
C.E. Snow, C.A. Ferguson(eds.), Talkingto children: Languageinput and acquisition , 63- 88.
:
CambridgeCambridgeUniversity Press.
Gerken, L., 1991 . The metricalbasisfor children's subjectless sentences . Journalof Memoryand
Language30, 431- 451.
Gerken, L., B. Landauand R.E. Remez , 1990. Functionmorphemes in youngchildren's speech
and .
perception production Developmental Psychology 26 , 204 - 216
.
Gleitman, L.R. and E. Wanner, 1982.Languageacquisition:The stateof the stateof the art. In:
E. Wanner, L.R. Gleitman(eds.), Language acquisition:Thestateof theart, 3- 48. Cambridge :
CambridgeUniversityPress .
Gleitman, L.R., H. Gleitman, B. Landauand E. Wanner, 1987 . Wherelearningbegins: Initial
representations for languagelearning.In: F. Newmeyer(ed.), TheCambridgelinguisticsurvey,
150- 193. Cambridge : CambridgeUniversityPress .
Glenn, S.M., C.C. Cunningham and P.F. Joyce, 1981 . A study of auditory preferences in
nonhandicap '
ped infants and infants with Downs Syndrome . Child Development52, 1303 -
1307 .
Grieser, D.L. and P.K. Kuhl, 1988 . Maternalspeechto infantsin a tonal language : Supportfor
universalprosodicfeaturesin motherese . Developmental Psychology24, 14- 20.
Grosjean , F., 1985 . The recognitionof wordsafter their acousticoffset: Evidenceand implications
. Perceptionand Psychophysics 38, 299- 310.
Halle, P., B. de Boysson -Bardiesand M.M. Vihman, 1991 . Beginnings of prosodicorganisation :
Intonation and duration patternsof disyllablesproducedby Japanese and Frenchinfants.
Languageand Speech34, 299- 318.
Harrington, J.M., I. Johnsonand M. Cooper, 1987 . The applicationof phonemesequence
constraintsto word boundaryidentificationin automatic , continuousspeechrecognition .
Proceedings of the First EuropeanConference on Speech Technology , Edinburgh,Vol. I , 163-
167.
Harrington, J.M., G. Watsonand M. Cooper, 1989.Word boundarydetectionin broadclassand
phonemestrings. ComputerSpeechand Language3, 367- 382.
A. CutlerI Segmentation
problems
Hayes, J.R. and H.H. Clark, 1970 . Experimentson the segmentation of an artificial speech
analogue . In : J.R. Hayes(Ed.), Cognitionand the developmentof language , 221- 234. New
York: Wiley.
Heath, S.B., 1983 . Wayswith words: Language , life and work in communitiesand classrooms .
Cambridge : CambridgeUnivenity Press .
Hirsch-Pasek , K., D.G. KemlerNelson, P.W. Jusczyk , K.W. Cassidy , B. Drussand L. Kennedy ,
1987 . Clausesare perceptualunits for younginfants. Cognition26, 269- 286.
Johns-Lewis, C., 1986 . Prosodicdifferentiationof discoursemodes . In: C. Johns-Lewis (ed.),
Intonationand discourse , 199-219. London: CroomHelm.
Johns-Lewis, C., 1987 . The perceptionof discorusemodes . In: M. Coulthard(ed.), Discussing
discourse . Universityof Binningham: DiscourseAnalysisResearch MonographNo. 14, 249-
271.
Jusczyk , P.W., 1993 . How word recognitionmay evolvefrom infant speechrecognitioncapaci-
tien. In: G.TiM. Altmann, R.C. Shillcock(eds.), Cognitivemodelsof speechprocessing : The
SperlongaMeetingII , 27- 55. CambridgeMA : MIT Press .
Jusczyk , P.W. and E. Thompson , 1978 . Perceptionof a phoneticcontrast in multisyllabic
utterances by,2-month-old infants. Perceptionand Psychophysics 23, 105- 109.
Jusczyk , P.W., A. Cutler and N.J. Redanz , 1993 . Infants' preference for the predominantstress
of
patterns English words . Child Development 64 , 675 - 687 .
Jusczyk , P.W., DiG. KemlerNelson, K. Hirsch-Pasek , L. Kennedy , A. Woodwardand J. Piwoz,
1992 . Perceptionof acousticcorrelatesof major phrasalunits by young infants. Cognitive
Psychology 24, 252- 293.
Karzon, R.G., 1985 . Discriminationof polysyllabicsequences by one- to four-month-old infants.
Journalof ExperimentalChild Psychology39, 326- 342 .
Klatt, D.H., 1989 . Reviewof selectedmodelsof speechperception . In: W.D. Manlen-Wilson
(ed.), Lexicalrepresentation and process , 169-226. Cambridge , MA : MIT Press .
Kowal, S., D. O' Connell, E. A. O' Brienand ET . Bryant, 1975 . Temporalaspects of readingaloud
and speaking : Threeexperiments . AmericanJournalof Psychology 88, 549- 569.
Kuhl, P.K., K.A. Williams, F. Lacerda , K.N. Stevensand B. Lindblom, 1992 . Linguistic
experience alten phoneticperceptionin infantsby six monthsof age. Science 255, 606- 608.
Labov, W., 1972 . Sociolinguistic patterns.Philadelphia , PA : UDiversityof Pennsylvania Press.
Lamel, L. andV.W. Zue, 1984 . Propertiesof consonantsequences within wordsand acrossword
boundaries . Proceedings of the 1984InternationalConference on Acoustics , Speech and Signal
Processing 42.3.1 - 42.3.4.
Levin, H., Schaffer , C.A. andC. Snow, 1982 . Theprosodicand paralinguisticfeaturesof reading
and tellingstories. Languageand Speech25, 43- 54.
Levitt, A.G. and JiG.A. Utman, 1992 . From babblingtowardsthe soundsystems of Englishand
French: A longitudinaltwo-casestudy. Journalof Child Language19, 19- 49.
Levitt, A.G. and Q. Wang, 1991 . Evidencefor language -specificrhythmic influencesin the
reduplicative babbling of French - and English - learning infants . Languageand Speech34, 235-
249.
Lieberman , P., 1963 . Someeffectsof semanticand grannnaticalcontexton the productionand
of
perception speech . Languageand Speech6, 172- 187.
Luce, P.A., 1986 . A computationalanalysisof uniqueness pointsin auditoryword recognition .
Perceptionand Psychophysics 39, 155- 158.
Mann, V.A., 1986 . Phonological awareness : Therole of readingexperience . Cognition24, 65-92.
A. Cutler I Segmentationproblems
Shipman , D.W. and V.W. Zue, 1982 . Propertiesof large lexicons: Implicationsfor advanced
isolatedword recognitionsystems . Proceedings of the 1982InternationalConferenceon
Acoustics , Speechand SignalProcessing , Paris, 546- 549.
Shockey , L. and Z.S. Bond, 1980 . Phonologicalprocess es in speechaddressedto children.
Phonetica37, 267- 274.
Shute, H.B., 1987 . Vocalpitch in motherese . EducationalPsychology 7, 187-205.
SpringD .R. and P.S. Dale, 1977 . Discriminationof linguisticstressin earlyinfancy. Journalof
Speechand HearingResearch 20, 224- 232.
Stem, D.N. S. Spieker , R.K. Barnettand K. MacKain, 1983 . The prosodyof maternalspeech :
Infant ageand context-relatedchanges . Journalof Child Language10, 1- 15.
Stoel-Gammon,C., 1984 . Phonological variabilityin mother-childspeech . Phonetica41, 208- 214.
Sullivan,J.W. and F.D. Horowitz, 1983 . Theeffectsof intonationon infant attention: Theroleof
the risingintonationcontour. Journalof Child Language10, 521- 534.
Svartvik, J. and R. Quirk, 1980 . A corpusof Englishconversation . Lund: Gleerup.
Tuaycharoen , P., 1978 . The babblingof a Thai baby: Echoesand responses to the soundsmade
. by adults. In : N. Waterson , C. Snow(eds.), The developmentof communication , 111- 125.
Chichester : ~ iley.
Wakefield , J.A., E.B. Doughtie and B.-H.L. Yom, 1974 . The identificationof structural
components in an unknownlanguage . Journalof Psycholinguistic Research 3, 261- 269.
Werker, J.F. and L. Polka, 1993 . Developmental changesin speechperception : New challenges
and newdirections . Journalof Phonetics21, 83- 101.
Wexier, K. and R. ManziniParameters and learnabilityin bindingtheory. In: T. Roeper ,
E. Williams(eds.), Parametersetting, 41- 76. Dordrecht: Reidel.
Whalen, D.H., A.G. Levitt and Q. Wang, 1991 . Intonationaldifferences betweenthe reduplicative
babblingof French- and English-learninginfants. Journalof Child Language18, 501- 516.
Wilso~, M.D., 1988 . MRC psycholinguistic database: Machine-usabledictionary, Version2.0.
BehaviorResearch Methods, Instrumentationand Computers20, 6- 10.
) 105- 140. North-Holland
Lingua92 ( 1994
Perceptual and cognitive abilities that are species- and domain-specific may nonethelesshave
componentsthat are widespreadacrossspeciesand apply to numerousdomains. For example, all
theories of sentenceparsing are constrained by the operations of a limited -capacity memory that
is a generalcharacteristic of cognition . This paper discussesanother ability that is general across
speciesand, within a species, across numerous cognitive and perceptual domains. We review
evidencefrom the animalleaming and human cognitive literature that animals (a) possessfine-
grained sensitivity to probabilistic patterns in their environment and (b) usemultiple probabilistic
cues to solye particular problems. Such sensitivity is advantageousbecausethe structure of the
environment itself can often be characterized as probabilistic . The chancesof successat solving
various problems, from foraging to depth perception, would therefore increaseif animals were
sensitiveto probabilistic cues and could determine whether multiple cuesconvergeon a solution.
We discussthe implications of theseclaims for languageprocessing, and argue that the domain-
general ability to detect and exploit probabilistic information is brought to bear on numerous
language-specific problems.
1. Introduction
structures that assist their aquatic lifestyle. Thus, waterproof fur helps to
insulate the platypus from the cold waters in which it normally swims and its
famous bill is actually a highly-developedsenseorgan that is used in foraging.
However, the history of these two adaptations is quite different. Whereas the
bill seems to be a specific adaptation of the platypus and monotreme
relatives, the fur , of course, is a general mammalian trait that the platypus
inherits (seeGould , 1991, for further discussion). This trait is general in two
ways. First , many speciesshare it and, second, it is relatively general purpose.
Thus hair is not only involved in insulation, but also in sensationsof touch
and the transduction of sound in the inner ear.
As Darwin emphasized in many of his works, the same evolutionary
concepts that apply to anatomical structures can also extend to thought and
behavior. Thus, some of our own cognitive and perceptual abilities are
inherited from our ancestors(and henceit is no coincidencethat we can learn
something about, for instance, our visual system by studying the visual
systemsof other animals - they are branches from the same tree) . Still other
structures might be unique to the evolutionary history of human beings.
Language is, of course, the most obvious human duck-bill , and much ink has
been spilled over the years in arguments over whether this obvious intuition
should really be believed. That controversy will be avoided here. Instead, we
will focus on the following general theme: Whatever the evolutionary history
of the language faculty , language processing must involve cognitive and
perceptual abilities that are domain-general in nature. For an example,
consider the parsing literature. Both linguists (e.g., Frazier and Fodor 1978)
and psychologists (e.g., Just and Carpenter 1992) have devoted considerable
attention to how listeners identify sentenceconstituents and their grammatical
relations. Radically different viewpoints are often proposed about how
these tasks are accomplished. However, throughout all the controversy, one
unifying point stands clear: The parser is fundamentally a slave to a limited
capacity short-term (or working ) memory system. Thus, parsing preferences
from Kimball ' s ( 1973) ' fast phrase closure' to Frazier and Fpdor ' s ( 1978)
' minimal attachment' are all
justified by reference to constraints on short-
term memory capacity. Indeed, potential individual differences in parsing
blaseshave recently been related to individual differencesin memory capacity
(Just and Carpenter 1992) .
In sum, then, a system that is shared by other animals, namely a limited
capacity memory, is critically involved in a systemprobably unique to human
beings, namely language. Furthermore, an understanding of this ancestral
memory system is crucial to a full understanding of how linguistic compe-
M .H . Kelly , S. Martin / Domain-general abilities
tence gets put to use in perfonnance. Finally , since this ancestral system is
shared with our non-human relatives, we could gain insights into the human
systemthrough memory experimentswith animals as well as experimentswith
human beings. We will extend these claims here to another domain potentially
shared with non-human animals. This domain is reflected in the old
adage that death and taxes are the only certainties in life. In less depressing
tenDs, much of the information that animals learn during their lives, and
which must be used to guide behavior, is probabilistic in nature. Should the
human being bet on the Flyers to win the NHL championship now that they
have signed the Michael Jordan of hockey in Eric Lindros ? Is the breeding
seasonpromising enough for a bird to raise two clutches of young rather than
just one? Will the squirrel find more nuts foraging by this tree or that one?
Should a listener who hears ' The fruit flies ...' assumethat ' fruit ' is a noun
and ' flies' is a verb?
Though these questions vary in their subject matter and importance, they
all nonethelesscontain some level of uncertainty, and the types of information
that could be used to decide a course of action are probabilistic in
nature. Historically , attention to probabilities and their use has been either
ignored or looked upon with aversion bordering on plain disgust. Thus,
mathematical treatments of probability are relatively recent developments,
and were fonnulated in the dubious area of gambling. Even physics, the
'
paragon of scientific certainty, has not escapedfrom the clutches of probability
, and the probabilistic component inherent to quantum mechanicsdrove
Einstein to exclaim that God could simply NOT be a closet gambler.
One possible reason for negative views of probability could be an implication
that the world is full of mass confusion that cannot lead to anything
'
beyond ignorance. However, an animal s environment is not a homogeneous
' '
soup, although it rarely provides sure things . The basic fact of the matter is
that the world is awash with stuff best described as ' tendencies', ' maybes',
' estimates' and ' '
, generally-speakings. Furthermore, thesetenDs characterizea
wide array of domains, from foraging to parsing, and must be confronted by
creatures from aardvarks to zebus. Finally , animals must often make rapid
decisions while minimizing the likelihood and severity of errors. These
considerations lead to the following conclusion: Animals that have the
capacity to detect probabilistic patterns and exploit them will have an
advantageover those that do not. Furthermore, confidencein the solution to
a problem should increaseif multiple cues converge on that solution. Hence,
one might expect to see widespread sensitivity to multiple probabilistic
infonnation sources in the animal kingdom . Since many aspects of the
-generalabilities
M.H. Kelly, S. Marlin I Domain
Since the cognitive revolution , the once dominant area of animal learning
has become of peripheral interest to many experimental psychologists. This
major shift of both theoretical and experimental emphasiscould be attributed
to a variety of factors, ranging from the development of symbolic computing
devices to Chomsky' s critique of stimulus- response accounts of language.
Hence, in the cognitive sciencesin general, and languagestudies in particular ,
traditional investigations of animal learning and their possible implications
for human cognition have been virtually , if not completely, ignored. (For
some recent exceptions, see Alloy and Tabachnik 1984, Estes 1985, Gluck
and Bower 1988.) We believe that this abandonment of animal learning is a
sad state of affairs. Certain basic processes of learning are clearly homolo-
gous across speciesranging from insects to human beings, and suciJ universality
should hardly be of peripheral interest to cognitive scientists. Our
M .H . Kelly , S. Martin I Domain-general abilities
dent. That is, given the overall base rate of shocks, one would expect a
certain number of tone-shock pairs JUST BY CHANCE . Given that such
encountersare mere coincidences, one should make no conclusion about the
relevanceof the tone to the shock, and hence the tone should be ignored in
the final test phase. Rescorla's results strongly supported the contingency
interpretation of learning. As the predictive power of the CS dropped, so did
the effectsof the tone on the rats' bar pressing. When the tone and the shock
reached statistical independence , the presenceof the tone had no effect on
behavior.
In sum, Rescorla' s experiments refute the traditional belief, and one still
generally held outside of the animal learning field, that the number of CS-US
pairings is the crucial variable in learning. Rather, animals are sensitiveto the
statistical relationship between the variables. If no statistical relationship
exists, then even the presenceof numerous CS-US pairings will not causethe
animal to assume that the CS predicts the US. Furthermore, statistical
relationships can exist and be learned even in the absence of any CS-US
'
pairings. For instance, supposeRescorla s experiment is repeated, but with all
of the scheduled tone-shock pairs removed. Thus, the rat would receive a
number of shocks, but none accompaniedby the tone. Since no CS-US pairs
are presented, the stereotyped view of Pavlovian conditioning would predict
that the animal would learn no relation betweenthesevariables. However, the
rats do in fact learn something useful in this situation , namely that the
presenceof the tone predicts the absenceof the shock. Hence, the rats treat
the tone as a safety signal and actually show lessfear reactions when the tone
is present than when it is absent (Rescoria 1969) .
So far , we have discussedsome evidence that animals are sensitive to the
probabilistic relationship betweena single CS and a US. However, numerous
experimentsdemonstrate that animals can attend to multiple predictors of an
event. For example, suppose that an animal has learned that a tone and a
light both signal shock. If the stimuli are now presentedtogether, then, up to
a certain point , the animal will exhibit a stronger fear reaction to the joint
'
than to the separatepresentation of the stimuli. 2 Thus, an animal s behavior
in a particular situation is best predicted by what it has learned about the set
of informative cues present in its environment rather than any single cue
examined in isolation. Indeed, what an animal learns about a novel stimulus
will depend on what it has learned about other stimuli present at the same
time. Consider, for example, the following situation. An animal learns that a
tone is associatedwith a mild shock. After this relationship is established, the
animal now is presentedwith two stimuli - the original tone and a novel light .
No shocks are experiencedduring this phrase of compound stimulus presentation
. Later, the light can be presented alone, and the animal' s behavior
monitored to determine what , if anything, has been learned about the light . A
number of possibilities are conceivable. Since the light was never experienced
with the shock, one might expect that the former would later be perceivedas
a neutral stimulus. Alternatively , some of the fear induced by the tone might
spread to the light . However, the animal actually treats the light as a safety
signal, and the strength of the safety reaction varies with the strength of the
fear associatedwith the tone (seeRescoria and Wagner, 1972, for summary) .
In learning about the impact of the light , the animal apparently takes into
consideration its knowledge about the tone. Given that the latter normally
signals shock, the absenceof the latter becomes associated with the novel
'
light . As Rescoria and Wagner ( 1972: 73) state, The effect of a reinforcement
or nonreinforcement in changing the associative strength of a stimulus
depends-upon the existing associativestrength, not only of that stimulus, but
also of other stimuli concurrently present' .
In sum, then, numerous experiments in animal learning indicate (a) widespread
sensitivity to probabilistic relations betweenevents and (b) the use of
multiple cues in identifying the probable course of imminent events, and the
types of behaviors neededto deal with them. Nonetheless, we must emphasize
that even though animals behave in accordance with statistical patterns in
their environment, this behavior does not lead to strong conclusions about
the underlying mechanismsthat support it . Animals might reveal sensitivity
to statistical patterns without actually representingor manipulating probabilistic
data. For example, each learning trial could lead to the adjustment of
the association strength between a CS and a US. This association strength
would be increasedwhen the US appearswith the CS and decreasedwhen the
US appears without the CS. Over time, the probability of a CS-US pairing
would become more and more correlated with the association strength,
though the probability itself would not be representedby the animal. Thus
variations in the probabilities of different eventscould produce behavior that
is attuned to those probabilities without entailing that these values are
M .H . Kelly , S. Martin / Domain-general abilities
2.3. Someimplications
for languagelearning
2.4. Objections
'
2.4.1. How to define what s missing?
For a language learner to determine that a verb has not appeared in a
structure, and that this absenceis statistically meaningful, the nature of the
expected structure must be defined. As Pinker ( 1989) has pointed out , the
structures cannot be individual sentencetokens, becausethere are an infinite
number of such structures, and the odds that one of them would be
encounteredin the corpus of speechheard by the learner is vanishingly small.
'
Thus, the items that enter into the learner s statistical tabulations must be
more abstract than individual sentences . Hence reference to the statistical
structure of the environment will not be sufficient to characterize learning
since the nature of the environment is partly defined by how the learner
categorizes items within it . Furthermore, these categories may in fact be
intrinsically linguistic in nature. Hence we are back to a basic problem in
language learning : What notions about the nature of language do children
-generalabilities
M.H. Kelly, S. Martin / Domain
3 Here is an
easy way to dernonstrate this statistical principle : Instruct your cornputer to
generate sarnples of randorn nurnbers between 0 and 100. Start off with two sarnples. each
containing 5 cases. The rneans of these sarnples will probably differ greatly. Now continue
generating sarnple pairs. but increasethe nurnber of casesin each pair . As the nurnber of cases
-general
M.H. Kelly, S. Martin/ Domain abilities 117
that such an account would predict that the greatest variability in linguistic
judgments should occur for relatively rare constructions, where the idiosyncrasies
of the sample will have a much greater impact on learning.
getslarger, the meansof the sampleswill moveclosertogether, convergingon a valueof 50. Yet
the sampleswill obviouslydiffer in their exactsequence
of numbers.
-generalahiliti P.~
M.H. Kelly, S. Marlin I Domain
issues, one optical and the other psychological. First , what types of optical
patterns exist in the two-dimensional image that are correlated with the three-
dimensional world that reflected the image to the eye? Second, which of these
information sourcesare actually used in visual perception, and how are they
weighed? The second question becomes particularly important given that
numerous potential cues to depth exist, such as those listed in table I .
TableI
Somevariablesthat arecorrelatedwith distance
Variable Description
Binocular disparity Differencesin the image of an objected projected to the two eyes
Motion parallax The relative velocity of imagesacross the retina
Accommodation Changesin the shape of the lens
Relative size The size of an image on the retina
Height in picture plane The location of an image along the vertical axis
Texture gradients The e , shape, and densityof textureelementslike pebbles
, tiles on a floor
Occlusion An object blocking the view of a more distant object
Of course, these cues vary in their range and predictive power. Thus
accommodation is not effective beyond 10 feet, and height in the projection
'
place can easily be violated (e.g., a person s head will be higher in the
projection place than the feet, yet is not further away from an observer) .
Nonetheless, many cuesare simultaneouslyavailable, and hencean important
issueconcernshow the various cuesare weighedand whether an available cue
might even be used. The situation could be complicated further becausethe
weights given to a set of cuesmight vary with context. Consider, for example,
the caseof textural cuesto depth. Many surfacesconsist of textured elements,
such as pebbles along a path or tiles in a room , that have roughly the same
size and shapeand are distributed randomly . However, becausethe elements
can systematically vary in their distance from an observer, their projected
imageswill vary also. As distance from the observer increases, the images of
the texture elements will decreasein size, compress in shape, and become
more densely packed. Investigations of these variables have found that
human observers(a) are sensitive to all three dimensions, but (b) respond to
some dimensions more strongly than others, and (c) change their weightings
with context (Cutting and Millard 1984). In particular , for flat horizontal
surfaces, the size gradient had the greatest effect in creating impressions of
depth, followed by the density and shape gradients. However, for curved
surfaces, the shapegradient dominated over the other variables.
M .H . Kelly , S. Martin / Domain-general abilities
Given that cues vary in their effectiveness, one might consider the extreme
case, and inquire whether some cues , though present, might receivea weight
of zero, and hence be totally ignored. Such total neglect might appear most
clearly when another more powerful cue is present. In such a situation , visual
processing might engage in what Bruno and Cutting ( 1988: 162) call cue
' selection' in which ' observers ... use the
, single most effective available
information source and the others' . For instance a static
[ ] disregard , , pictorial
cue to depth, like height in the image plane, might become ignored when
motion parallax is present, given that the former is lessconsistently predictive
of distance. However, using a variety of experimental methods, Bruno and
Cutting examined the impact of motion parallax, height in the image plane,
occlusion, and relative size on distancejudgments, and found that all of these
variables contributed significantly to those judgments. Such results indicate
that human beings use a variety of cuesin three-dimensional perception, even
cuesthat are not completely reliable.
3.3. Categorization
' '
Suppose subjects view a clearly printed letter k either by itself or in a
'
word , and they must identify whether a ' k or some other letter was
presented. Numerous experiments using variations on this task have repeat-
edly found that subjectsidentify the target letter more accurately if it appears
in a word than if it appears by itself (e.g., Reicher 1969, Johnston 1978) .
' '
Thus, ' k ' would be identified more accurately in the context of work than
alone (or in a string of letters that do not form a word ) . Why is the word
context advantageouseven though the letter in isolation is clearly printed ?
One possibility concerns the high speedswith which skilled readers process
orthographic material. Given that the context in which a letter appears is
partly predictive of its identity , observers could bypass complete perceptual
'
encoding of a letter, and use partial information about a letter s appearance
plus partial information about the identity of surrounding letters to speed
-generalabilities
M.H. Kelly, S. Martin I Domain
overall processing while minimizing error rates. Thus, observers could circumvent
the usual tradeoff between speed and accuracy by using multiple
sources of information about letter identity . Of course, this additional
information is probabilistic in nature, given that words other than ' work '
' ' ' ' ' '
begin with wor , such as word and worm . Nonetheless, human beings are
apparently sensitive to this probabilistic information , and use it to increase
processingspeed.
4. Probabilisticinformationandlanguage process
ina
4.2.1. Phonemesequences
Languagesoften have restrictions on phonemesand/ or phoneme sequences
at syllable, morpheme, and word boundaries. Some of these restrictions are
strong constraints, such as the impossibility of obstruent + nasal sequences
within English syllables. However, large-scale corpora analyses might also
reveal probabilistic relationships between the distributions of phonemesand
various boundary types. Speakersdo appear to have some knowledge of such
probabilistic relations. Thus, Cutler et al. ( 1987) have shown that English
-
speakers have learned that English consonant vowel sequencesare more
likely to follow a CVCV than a CVCC pattern . Experiments on human
sensitivity to frequency information have found that people are not only
sensitiveto the frequenciesof individual letters (Attneave 1953) and syllables
(Rubin 1974), but also to letter combinations (Underwood 1983) . Given that
letter combinations will be correlated with phoneme combinations, such data
indicate that speakers also have knowledge of the relative frequencies of
phoneme sequences . However, experiments need to be performed to see if
speakers are sensitivenot just to the overall frequency of phoneme sequences,
but also to significant interactions between those frequencies and syllable,
morpheme, and word boundaries.
-generalabilities
M.H. Kelly, S. Martin / Domain
Table2
Mean reactiontimes(in msec ) and error rates(pereight items) to identify a disyllabicpattern
' '
(e.g., the mind) as two wordsdependingon the initial word in the pattern
-generalabilities
M.H. Kelly, S. Martin / Domain
6 Of course, the mothers are not deliberately making their speech more predictive of word
boundaries so as to assist their children in acquiring English. Rather, parental speechcontains a
strikingly high percentageof words that refer to concrete objects and easily perceivableevents, for
the good reason that children are more interested in these entities rather than more abstract
themes like religion or politics . Given that the vocabulary of everyday objects contains a high
-
proportion of monosyllabic words and is drawn from a core Germanic lexicon that is character
ized by polysyllabic words beginning with strong syllables, a greater correspondencebetween
strong syllables and word boundaries will appear in parental speech than in speech between
adults.
-generalabilities
M.H. Kelly, S. Martin / Domain
4.3. Parsing
bear on the ambiguity , and when is that information available? Over the past
two decades, the dominant answershave claimed that only a restricted range
of information contributes to initial parsing decisions, with other classesof
information only coming into play later to rectify errors in the first parse
(e.g., Frazier and Fodor 1978, Frazier and Rayner 1982, Ferreira and Clifton
1986, Ferreira and Henderson 1990) . In particular , preferencesfor certain
phrase structure geometries are used to guide initial parsing, such as a
preference to create the simplest phrase structure tree consistent with the
preceding input . These preferences are reputedly blind to other types of
information , such as the identity of the verb in the sentence.
The impenetrability of certain information classesto parsing mechanisms
createsvery interesting predictions. Consider, for example, the sentences' The
student forgot the solution was in the back of the book ' and ' The student
' '
hoped the solution was in the back of the book . In both sentences , the
' '
solution is the subject of a complement clause. However, a complement
clauseinterpretation would create a more complex phrase structure tree than
an interpretation that initially categorized' the solution ' as the direct object of
the respectiveverbs ' forgot ' and ' hoped' . Hence, upon reaching ' the solution ' ,
a reader might opt initially for the simpler direct object parse, which would
have to be rejectedwhen later disconfirming information is encountered. Still ,
one might entertain the possibility that the identity of the verbs themselves
could affect theseinitial parsing blases. Although ' forgot ' can be followed by
an NP object or an object clause, ' hoped' cannot appear with the former
structure. If such facts about verb argument types can influence the earliest
stages of parsing, then one would predict that readers would not posit a
direct object interpretation for ' hoped' , and so would not need to reject such
a hypothesis when subsequentportions of the sentenceare read. One would
therefore predict that reading times for the disambiguating areas of the
sentencewith ' hoped' would be faster than those for the same areas of the
sentence with ' forgot ' . This prediction and analogous ones using other
dependentmeasuresand experimental procedureswere supported in a set of
studies by Trueswell et al. (in press b) . The results could not be attributed to
' ' ' '
generally faster processingof forgot than hoped becausethe reading time
differencesbetween the verbs were significantly reduced in control sentences
' '
containing the complementizer that , which made the sentencesunambiguous
. Although some studies have been reported that did not find effects of
verb subcategorization preferenceson parsing (e.g., Ferreira and Henderson
1990), Trueswell et al. argue that thesestudies have a number of methodolo-
gical flaws. For instance, the verbs used by Ferreira and Henderson ( 1990)
-generalabilities
M.H. Kelly, S. Martin I Domain
did not differ very much in their preferencefor noun phrase versus sentential
objects. Consequently, their methods were not sensitive enough to detect
differences.
In sum, then, the experiments conducted by Trueswell et al. (in press b)
indicate that subcategorization information about a verb has immediate
effects on parsing (see also Shapiro et al. 1993), contrary to models that
severely restrict the class of information available to parsing mechanisms.
Indeed, Trueswell et al. argue that a wide range of information sources
contribute to parsing decisions. As one final illustration , Trueswell et al. (in
press a) have recently provided evidence that semantic properties of the
sentencesubject can influence parsing. They contrasted sentenceslike ' The
defendant examined by the lawyer turned out to be unreliable' with ' The
evidenceexamined by the lawyer turned out to be unreliable' . Parsing models
that rely solely on syntactic preferenceswould claim that readerswould have
'
difficulty with these sentencesbecausethey would initially treat defendant/
'
evidenceexamined to be the subject and verb of the main clause. Although
such syntactic preferencesmight exist, Trueswell et al. explore the possibility
that they could be modified by the relative prototypicality of the preceding
nouns as the subject for the verb. Although ' defendant' is a reasonable
' ' ' '
subject of the verb examined , evidence is not , and Trueswell et al. provide
evidence that readers weigh such information in their initial parsing proces-
ses. Thus, like visual depth perception, parsing appears to proceed by
considering multiple sourcesof information , which may range from general
syntactic preferencesto properties of the specific lexical items composing a
sentence.
' The
- stole the base' whereas verbs, but not nouns, can appear in the
'
sentence The runner - the base' .
While not disputing the important role of semantic and syntactic information
for grammatical class, we would like to explore a relatively neglected
domain of information . In particular , a large number of phonological cuesto
grammatical classexist, and experimentshave repeatedly shown that speakers
are sensitiveto them. For example, nouns and verbs in English differ in stress
pattern, syllable number, duration , vowel characteristics, and other phonolo-
'
gical dimensions (see Kelly , 1992, for a review) . English speakers have
revealed their sensitivity to these and other correlations in a wide range of
tasks. For instance, if listeners hear disyllabic pseudowords that differ in
stressand are asked to use each of these words in a sentence, they will use
words with first syllable stress more often as nouns and words with second
syllable stressmore often as verbs (Kelly 1988b) . Another set of experiments
took advantage of the fact that English words often develop uses in " other
' '
grammatical categories. Thus, police originated in English as a noun, but
' '
subsequentlydevelopeda verb use, whereas fumble originated as a verb, but
later developed a noun use. Despite the frequency with which these lexical
extensionsoccur, blasesagainst certain types of extensions exist (Clark and
Clark 1979). Although the cited blases tend to be based on semantic and
pragmatic factors, Kelly ( 1988b) speculatedthat the phonological characteristics
of nouns and verbs could influence the easewith which they develop uses
in the other category. In particular , if English speakers have learned the
correlation between stressand grammatical class in English, then they might
' '
use this knowledge as a measureof fit betweena current noun and possible
verb use and vice versa. Thus, they might consider nouns to be better verb
candidates to the extent that they have the prototypical verb stress pattern.
This possibility was tested by presenting one group of subjectswith pairs of
disyllabic nouns that differed in stress but were control led for other factors ,
7 Although thesecorrelationsare well- documented , their originsmay seemmysterious . However
, explanationsfor someof thesepatternshavebeenproposedand experimentally evaluated.
For example ) hasarguedthat threefactorsmight be involvedin the evolutionof
, Kelly ( 1992
phonologicalpredictorsof grammaticalclass: ( I ) Wordsfrom differentgrammaticalclasses differ
. (2) Thesedistributionaldifferencesmay havephonological
in their distributionsin sentences
reflexesassociated with them, whichcreatecontextualdifferences in the waywordsfrom various
classesare pronouncedand/or perceived . (3) Overtime, listenersviewthesecontextualeffectsas
permissible context-freepronunciations. This accounthasbeenappliedto the Englishnoun-verb
stressdifference(Kelly I 988a, 1989) and duration difference( Daviset al. 1992 ), and might be
usedas a heuristicto predictthe existence of other, currentlyunknownphonologicalcorrelates
with grammaticalclass.
-generalabilities
M.H. Kelly, S. Martin / Domain
such as word frequency (e.g., llama and gazelle) . Another group of subjects
were presented with pairs of disyllabic verbs that differed in stress (e.g.,
grovel and beseech ) . None of the words had uses in the other category as
determined by the most recently available Webster's Collegiate Dictionary .
Subjects in the two groups were asked to select one noun (verb) from each
pair and use it as a verb (noun) in a sentence. The choices were significantly
affected by the stress patterns of the words, as nouns with second syllable
stress were used as verbs more often than nouns with first syllable stress
whereas verbs with first syllable stress were used as nouns more often than
verbs with second syllable stress. These patterns are not simply a laboratory
artifact . Analyses of the history of English also found the same patterns in
actual grammatical category extensions down through the centuries (see
Kelly , 1988b, for details) .
Although such experiments demonstrate that English speakers possess
implicit knowledge of the noun- verb stressdifference, and that such knowledge
may even have affectedthe history of English, they do not entail that the
initial identification of a word as a noun or verb is at all affected by stressor
other phonological variables. That is, the subject might have recognizedthat
' ' ' '
gazelle is a noun just as easily as they recognized llama . Stressthen came
into play only later as a metalinguistic variable, but did not affect initial
categorization of the words. In order to determine whether stress actually
affects grammatical category assignmentsearly in speech processing, Kelly
and Martin ( 1993) recently conducted a noun- verb categorization task that
we will summarize here.
The task was straightforward : On each trial , the subjectsheard a word that
they had to classify as a noun or a verb as quickly as possible. The words
were disyllabic and differed in stress, with half of the nouns and half of the
verbs having first syllable stress and the other half having second syllable
stress. In addition to varying a phonological correlate to grammatical class,
we also manipulated a semantic cue. Given that nouns generally denote
concrete objects and verbs readily perceived actions, we selectedour words
such that half of the nouns and half the verbs had the prototypical meanings
of their class, whereas the other half denoted abstractions. It is conceivable
that the phonological cue to grammatical classwould only influence grammatical
category assignmentswhen the semantic cue is absent. Given that the
semanticcue is universal whereasthe phonological cuesare languagespecific,
one might expect the former to be more basic and perhaps completely
eliminate effects of the phonological cue. The results indicated that both
semantic and phonological factors affected the subjects' judgments. Nouns
M .H . Kelly , S. Martin / Domain-general abilities
and verbs were classified faster if they had meanings typical of their classes.
In addition , the phonological variable of stressinteracted with grammatical
classsuch that nouns were classifiedfaster if they had first syllable stressbut
verbs were classified faster if they had second syllable stress. Most importantly
, the phonological effects were not eliminated for words that had
concrete meanings. Indeed, these effects were significantly magnified by the
presenceof semantic features that convergedon the samecategorization. The
effect of concretenesswas likewise magnified by the presenceof appropriate
phonological features. For example, concrete nouns with first or second
syllable stress were classified faster than abstract nouns with either stress
pattern. However, the differencebetweenthe concrete and abstract nouns was
larger for items with first syllable stress. For verbs, on the other hand, the
concrete/ abstract difference was larger for items with second syllable stress.
Thus, listenersappear to use a conspiracy of cues to identify the grammatical
category of a word . These cues can either be language-universal, like the
semanticcue to grammatical class, or language-specific, like stressin English.
Finally , the language-specific cues are not overwhelmed or even damped by
the language-universal factors. Instead, they mutually reinforce one another.
5. Conclusions
References
Alloy. L.B. and N. Tabachnik . 1984. Assessmentof covariationby humansand animals: The
joint influenceof prior expectations
andcurrentsituationalinformation. Psychological
Review
91. 112- 149.
ArmstrongS.. L. GleitmanandH. Gleitman. 1983 . Whatsomeconceptsmightnot be. Cognition
13. 263- 308.
M .H . Kelly , S. Martin I Domain-general abilities
Saegert, S., W. Swapand RiB. Zajonc, 1973 . Exposure , context, and interpersonalattraction.
Journalof personalityand SocialPsychology 25, 234- 242.
Seligman , M.E.P., 1970 . On the generalityof the laws of learning. PsychologicalReview77,
406- 418.
Shannon , B., 1976 . Aristotelianism , Newtonianism , and the physicsof the layman. Perception5,
241- 243.
Shapiro, B.J., 1969 . The subjectiveestimationof relativeword frequency . Journal of Verbal
Learningand VerbalBehavior13, 638- 643.
Shapiro, L.P., H.N. Nageland B.A. Levine, 1993 . Preferences for a verb's complements andtheir
usein sentence processing . Journalof Memoryand Language32, 96- 114.
Shedier, J.K., J. Jonidesand M. Manis 1985 . Availability: Plausiblebut questionable . Paper
presented at the 26thannualmeetingof the Psychonomic Society , Boston, MA.
Smith, E.E. and D. Medin, 1981 . Categories andconcepts . Cambridge , MA : HarvardUniversity
Press .
Trueswell , J.C., M.K. Tanenhaus and S.M. Garnsey , in pressa. Semanticinfluences on parsing:
Useof thematicrole informationin syntacticambiguityresolution . Journalof Experimental
Psychology : Learning, Memory, and Cognition.
Trueswell , J.C., M.K. Tanenhaus and C. Kello, in pressb. Verb-specificconstraintsin sentence
processing : Separating effectsof lexicalpreference from garden -paths. Journalof Experimental
:
PsychologyLearning Memory , , and Cognition .
Underwood , B.J., 1983 . Attributesof memory. Glenview , IL : Scott, Foresman .
'
Walley, A.C. and TiD. Carrell, 19.83. Onsetspectraand formant transitionsin the adult s and
child' s perceptionof placeof articulationin stopconsonants . Journalof the AcousticalSociety
of America73, 1011 - 1022 .
Wasserman , E.A. and H. Shaklee , 1984. Judging response -outcomerelations: The role of
respense -outcome contingency , outcome probability , and method of informationpresentation.
Memoryand Cognition12, 276- 286.
Zajonc, R.B., 1968 . Attitudinal effectsof mere exposure . Journal of Personalityand Social
PsychologyMonographSupplement 2
9( ), Part 2 , 1- 28.
Section 3
the
Categorizing world
92(1994
Lingua ) 143
- 167 -Holland
. North
Susan Carey
Departmentof Brain and CognitiveScience
, Massachusetts
Instituteof Technology
, 77 Massachusetts
Ave., 25-406, Cambridge. MA 02139 , USA
1. Intr04uctio D
true that if the contrast between the two types of stuff is basedon the shape
differencesof individual pieces, then some representationof those individual
pieces must enter into the representation of shape. But our concepts of
spaghetti and macaroni (and the words spaghetti, macaronI) do not quantify
over those individuals. Similarly , we can represent the shape of a scattered
portion of sand, arranged, for example, into an S, and when we refer to it as
'a ' ' '
portion or an S we are quantifying over that individual . But when we
' '
think of it as sand, we are not. Quine s proposal is that the child s conceptual/
linguistic system has only the capacity to represent the world in terms of
'
concepts like furniture , sand, bottlehood. Of course the child s perceptual
system must pick out individuals in order to represent shape, to determine
what to grasp, and so on. This is part of what Quine meant when he claimed
' '
that the child is inherently body minded (Quine 1974) .
Piaget, like Quine, believed that that baby must construct the concept of
enduring objects, although he differed from Quine as to the' mechanismshe
envisioned underlying this construction. Quine saw the child s mastery of the
linguistic devices of noun quantification , the machinery by which natural
languagessuch as English manage divided reference, as the' processthrough
'
which the child s ontology comes to match his or her elders . Piaget held that
the baby constructs the concept object during the course of sensori -motor
development by the age of IS-months or so, and that this construction is the
'
basis for the child s mastery of natural language. Since Piaget did not frame
his discussion in terms of an analysis of the logic of sortals, it is not clear
when he would attribute full sortals to the child. 1
'
The Quine/Piaget conjecture about the baby s representational resourcesis
a serious empirical claim, and as I will show, it is difficult to bring data to
'
bear on it . In what follows , I first consider Quine s views, contrasting his
hypothesis that children come to represent sortals only upon 'learning the'
linguistic devicesof noun quantification with what I will call the Sortal First
hypothesis. The Sortal First hypothesis is that babies represent sortal
concepts, that toddler lexicons include words that expresssortals, and that
these representations underly the capacity for learning quantifiers rather
than resulting from learning them. I then turn to early infancy, and explore
the contrast betweenthe Quine/Piaget hypothesis and the Sortal First hypothesis
as regards the earliest phases of word learning. A preview of my
conclusions: whereas the Sortal First hypothesis is ultimately favored, evi-
1 For example, Piaget thought that the logical prerequisitesfor representingthe adult concepts
all and someare not acquired until after age S.
S. CareyI Reconceptualizing
theworld 147
's mastery
Thetoddler of syntax
'
Quine s hypothesis is that the child masters the logic of sortals through a
processof adjusting the meanings of nouns and of natural language quanti-
fiers to each other (scrambling up an intellectual chimney, the walls of which
are the child ' s currently worked out representationsof the quantifiers he/ she
knows). To address Quine' s conjecture experimentally, we must first know
when in the child ' s life the putative scrambling is going on. Even by age 3 the
child is not producing all the quantifiers that constitute the sides of Quine' s
chimney. The very beginnings of the English count/ mass distinction are
masteredin the months leading up to age 21/. 2. Many children age 2:0 produce
nouns with no determiners or plurals, but some have begun to produce
plurals and a few determiners and quantifiers (usually possessivessuch as
' ' ' ' ' '
my , plus a and the ) . Many 2-year-olds beginning to use determiners do
not distribute them differently according to the noun' s count/ mass status in
the adult lexicon. They still omit many determiners, and use others like ' the'
and ' my' that do not differentiate count nouns and mass nouns. By 21.J2 '
virtually all children distinguish in some ways the syntactic contexts in which
words like ' table' and ' dog' appear from those in which words like ' water'
and ' playdoh' appear (Gordon 1985, Sola et al. 1991) . Gordon ( 1982) showed
that between 21/. 2 and 3 years of age the distinction becomes marked in
'
syntax, as the child s speechabruptly comes to reflect the arbitrary rule that
detenniners are obligatory for singular count nouns, but not for mass nouns
' ' .' '
(that is, one can say I like juice , but not 1 like dog ) .
The developmentalfacts summarizedabove determine the relevant agesfor
an empirical test of Quine' s speculations. Data bearing against Quine' s claims
could be of several types: e.g., data showing children age 2 or under take
proper nouns to refer to individuals of a kind or that they take count nouns
to refer to kinds of individuals. But, as already mentioned, the trick is
' ' '
figuring out how we can know whether toddlers Mama refers to entities
' '
they conceptualize as individuals or whether their bottle divides reference,
referring to each individual of a certain kind , as opposed to bott/ehood.
Another type of evidencecould be relevant. If it can be shown that upon
first learning ' a' or the plural ' -s' , toddlers interpret them correctly, as
signalling an individuated entity of a kind or a plurality of individuals of a
148 theworld
S. CareyI Reconceptualizing
kind , respectively, this would tell against Quine. This is becausetheseare the
first relevant quantifiers the child learns. If he or she interprets them correctly
from the beginning, the interpretation could not have been acquired through
an adjustment process involving the entire set of quantificational devices of
noun syntax. This last point is important . In the beginnings of language
'
learning, on Quine s view, children will not interpret those few quantifiers in
their lexicons as adults do. The scramble will have just begun. Data showing
'
that children use ' a' and plurals will not be itself relevant to Quine s
hypothesis; it must be shown that such quantificational devicesare doing the
samework as they do in the adult language.
'
A large proportion of the baby s first words are words for middle-sized
' ' ' ' ' ' ' ' ' '
physical objects, such as bottle , book , dog , cup , and banana . But that
babies have words in their lexicons that refer to object kinds in the adult
lexicon tells us nothing of what these words mean to the babies. Many have
argued that the earliest words are often complexive (e.g., Bowerman 1978,
Dromi 1987, Vygotsky 1962). That is, children appear to extend words to new
referentson the basisof any of the salient perceptualproperties of the original
experiencesin which the word was heard. These complexive uses often cut
' '
across what are for adults distinct ontological categories, as when paper
apparently refers to the act of cutting, the act of drawing, to pens and pencils
and to paper (Dromi 1987). If such complexive uses reflect unconstrained
(from the point of view of adult lexical categories) projection of word
'
meanings, Quine s views receivesupport . But it is important to seethat such
'
complexive usesare not necessaryfor Quine s conjecture to be correct.
Indeed, others deny that toddlers construct complexive meanings; Hutten-
locher and Smiley, 1987, for example, presentevidencethat from the beginning
' '
babiesuseeach word for middle-sizedobjects appropriately: bottle to refer to
bottles, ' book' to books, and so on. But even if Huttenlocher and Smiley are
'
right , this fact .does not disconfirm Quine s conjecture. In fact, Quine presupposes
that the baby usesthe words in contexts adults would. His point is that,
'
evenso, the baby might not be individuating the words referentsas we do. The
baby could refer only to what we conceptualize as bottles when she uses
' bottle' but could be
, referring to bottlehoods. She could be using the word to
refer to a scattered portion of what goes on, determined by perceptual
' '
similarity to the portions of her experiencewhen adults use bottle .
S. Carey I Reconceptua/izing the world
Children as young as 17 months (at least girls that young) are sensitive to
the syntactic context in which a new noun is heard in their projection of noun
meaning (Katz et al. 1974, Macnamara 1982) . Specifically, if ostensively
taught a new word in a count noun context, referring to an unfamiliar doll
.
( See this. This is a dax. Can you touch the dax? Can you put the dax on
'
your head ... ), they assumethat other dolls of the sametype are also daxes.
But if taught in a proper noun context (.Seethis. This is Dax. Can you touch
Dax. Can you put Dax on your head ...' ), they assumethat other dolls of the
sametype are not Dax , reserving ' Dax ' for the original doll only .
Do thesedata establish that young children distinguish kinds from individuals
, and use count nouns to achieve divided reference? Certainly not. They
do establish that toddlers are sensitive to the syntactic distinction between
nouns following determiners and those not following determiners, but this
distinction could be signalling a different semantic distinction than that
between individuals and kinds. For a sample Quinian interpretation : babies
' ' ' '
could take nouns without determiners such as ' Dax , ' Rover , and Joan , to
refer to portions of experiencedefined by a stricter similarity metric than that
referred to by nouns with determiners. Supposea Quinian baby, Alice , has a
' ' ' '
brother whom she hears called both Rupert and a boy . Supposealso that
she relies on shape to determine Rupertness and boyness. She could have
' '
learned from others' usage of the words that to be called Rupert , a given
portion of experiencemust be very similar in shapeto the original portions of
' '
experienceto which the term was heard to refer, whereasto be called a boy ,
.the '
boy , something need look only somewhat like the original referent. A
' ' ' '
generalization of this pattern of distinction , across Alice and a baby ,
.Rover' and ' a ' and so on could
, dog , , underly the patterns of projection
found by Katz et al., 1974, and subsequentreplications.
This interpretation of the Katz et al. data attributes to the baby a different
' ' ' ' ' '
meaning for a from the adult as well as different meaningsfor bottle , boy ,
. ' '
Rupert . This is, of course, Quine s position . On his view, it is only in the
course of learning other quantifiers, plural markers, and so on, and adjusting
to all the contrasts in usage they mark (the process of scrambling up the
' '
intellectual chimney cited above) that the baby works out the meaning of a ,
' . ' '
.the another some more all ' ' ' ' ' ' ' ' '
, , , , , many , same,2 etc.
2 Bloom.
personal communication, provides the following argument against the Quinian
'
interpretation of the Katz et al. data. Among children s very first words are some pronouns (e.g.,
In several studies, my colleaguesand I have attempted to addressQuine' s
'
proposal by comparing children s representations of solid physical objects,
such as cups, with their representationsof non-solid substances , such as sand
.
or gels or creams Our idea that is since adults conceptualize the former as
kinds of individuals (i .e., in terms of sortals that divide reference), but do not
conceptualize the latter in this way, we might be able to find evidence that
infants and toddlers respect the quantificational distinction between the two
as well.
In the first studies, Sola et al. ( 1991) compared 2-year-olds' projection of
newly learned :words ostensively defined by reference either to novel solid
physical objects (e.g., a brass plumbing T) or novel non-solid substances(e.g.,
a hair-settin$ gel with grapenuts embeddedin it ) . The objects were made of
unfamiliar materials and the non-solid substanceswere presentedformed.into
distinctive novel shapes. The child was introduced to the novel entity and
' ' ' '
provided a word for it (e.g. blicket for a novel object; stad for a novel nonsolid
substance). The child was then presented two new sets of stimuli and
asked to give the experimenter the blicket or the stad. For each object trial ,
the choicesconsistedof another object of the same shapemade of a different
material (e.g., a plastic plumbing T) or three small pieces of the original
material (brass) . For each substance trial , the choices consisted of a new
substanceformed into the original shape, or three small piecesof the original
substance. Figure I shows the design for one trial of each type. There were
four object trials and four non-solid substancetrials. Of course, which words
were assigned to which entities varied across subjects, but for expository
' ' ' '
clarity I will use blicket as my sample object name and stad as my sample
non-solid substancename.
RAJ8D
Sft MULU8:
T S
~ . my Dbt . . " . . ..., . ~ .
TEST
LI :
8' 11MU
~
..dad4
JUh @8
~ . o. Mlabtf . ~ . o. . . . ?
'
Sola et al. carried out two analysesto assesswhether children s representations
of the referents of the words were influenced by the status of their
knowledge of count/ mass syntax. First , they collected production data and
assignedeach child a value corresponding to the degreeto which count nouns
and mass nouns appeared in selective syntactic frames (e.g., ' a NO U N ' ,
' NO UNs ! ' ' too much NO U N ' . Scores ran from 0 to near 1.0. Second
) , they
introduced the new words in two different ways. In the neutral syntax
condition , no syntactic information as to the count/ mass status of the word
was provided ; the words were introduced as ' my blicket , my stad' and
' '
subsequentlyappearedin the context the blicket , the stad . In the informative
' '
syntax condition , the words were introduced as a blicket , some stad , and
further differentiated syntactically, e.g. another blicket , some more stad' .
'
As figure 2 shows, children at age 2:0 and 2:6 used different basesfor their
projection or words for the two different types of entities. They projected
' biicket' to the other whole
object the sameshapeas the original referent and
' stad' to the scattered
they projected portion of substancethe same texture
and color as the original referent. For object trials , children were sensitive to
matchesin shapeand number; for non-solid substancetrials , children ignored
matchesin shapeand number. Performancewas more adult-like on the object
trials, but performance on both types of trials was better than chanceat both
ages. Also apparent on figure 2, the syntactic context made no difference. The
children were no more likely to interpret ' blicket' as the word for a kind of
individual when it was heard in a count noun context. Similarly , hearing
S. Carey / Reconceptualizingthe world
% choicesmatchingreferentin shapeandnumber
Ir.a rigid
I [] nonsolid
.
"
& .
60
chance
! .
0 :
- =
e .
40
.
! .
"a" "some
" "a" " "
"some "my" " . "
my" my" my"
2:0 2:6 2:0 2:6
SYNTAX
INFORMATIVE NEUTRAL SYNTAX
condition
Fig. 2. % trials in which test stimulus chosen matched original referent in shape and number
(Sola et al. 1991).
thus be a salient feature of the blicket , but not of the stad, for non-solid
substancesdo not maintain their shapeswhen manipulated. For non-solid
substances , properties such as texture and color might be salient, for these
stay constant over experienceswith substances . In other words, the two -year-
old could be using ' blicket' to refer to blicketness, and recognize blicketness
by shape. The differential patterns of projection do not establish that the
toddler is using ' blicket ' to refer to any individual whole object of a certain
kind , that the toddler divides the referenceof ' blicket ' .
One detail of the data from figure 2 favors the Sortal First over the
Quinian interpretation, and that is that toddlers performed more like adults
on the object trials than on the substancetrials. Quine' s interpretation of this
would have to be ad hoc, perhaps that the baby has had more object
experiencethan substanceexperience. But the Sortal First hypothesispredicts
this asymmetry. To seethis, supposethe Sortal First hypothesis is true, and
' '
suppose that upon first hearing the word blicket the child assumesthat it
refers to each individual object of a certain kind . The choicesfor testing how
the child projects ' blicket ' included another si.ngle object, and 3 small objects.
Even if the child isn' t exactly sure of which features of the blicket establish its
kind , the child can rule out that the 3 small objects are a blicket , for under no
interpretation can they be an individual object of the same kind as the
original referent. Children should then be at ceiling on the object trials , which
' '
they are. .The substancetrials are another story. If upon first hearing stad ,
the child takes it to refer to the kind of substanceof the original referent,
then scatteredportions have no different status from unitary portions . There
is no clue from number of piles which of the choices on the test trials is the
stad. If children are not certain what properties of the original sample of stad
determine the kind stad, they might do worse on the stad trials. And indeed,
they do.
The key issue here is the role of number in determining how to project
' blicket' . If the
Quinian interpretation of the data is correct, the baby should
' blicket' on the basis of
project shapesimilarity , no matter whether the choice
that does not match in shapeconsists of one object or three objects. That is,
the baby should succeedon an object trial as on figure 3 as well as on an
object trial as in figure 1. The Sortal First interpretation predicts that
performance on the object trials will fall to the level of performance on the
substancetrials if the cue from number is removed (figure 3) . In an object
trial such as that on figure 3, ' blicket' is ostensivelydefined as before, but the
choicesfor projection are changed: another blicket of a different material (as
before) and another whole object of . a different kind made of the same
S. Carey / Reconceptualizingthe world
NAMBO
B' nMULV8:
1DT
.i~
rt ~
rnMULI :
~ b. . tM MlDbl ?
).
trialin Sola(1987
Fig. 3. Object
material as the original referent (instead of the three small objects) . Now the
child has no clues from number of objects as to which is the correct choice.
Performanceshould fall to the level of the substancetrials , and indeed, this is
what happens(Sola 1987) .
Apparently , the child uses the information provided by number on the
object trials , but not on the substancetrials. We take this as evidencethat the
child conceptualizessomeentities as individuals (such as kinds of objects) and
conceptualizesother entities as non-individuated (such as kinds of substances
) . These distinct ways of conceptualizing objects and substancespredates
' '
mastery of count/ masssyntax. Toddlers do not merely project blicketness on
the basis of shapeof individual piecesof blicketness, as we determine whether
somepasta is spaghetti on the basis of the shapeof individual pieces. Instead,
' '
the pattern of projection suggeststoddlers divide referenceof blicket , and
take it to refer to any individual of a certain kind .
I take the data reviewed in the previous section to show that by age 2:0
children take ' blicket ' to refer to individual objects of a certain kind and
' stad' to refer to non-solid substancesof a kind and that the toddlers'
,
representationsof blickets and stads have the same quantificational structure
' '
as would adults . ' Blicket is a sortal term. These data disconfirm Quine only
on the assumption that the baby did not acquire these representationsfrom
S. Carey / Reconceplualizinglhe world
m
rigid
1 1
10 nonsolidi
.
.
160
! .
chant
40
.
II
Do
" "
some
some
. ,
1
2 : 6
2 : 0
YGungerinfants
'
Altogether the data support the Sortal First hypothesis over Quine s
conjecture, but they do not establish when the child first begins to
represent sortal concepts. As noted earlier , it is not clear when Piaget
would attribute sortal concepts to children , but it is certain that he would
deny them to young infants . The argument I have developed so far does
not bear on Piaget' s claims about the representational capacities of
infants , as it concerns children age 24 months and older . Of course, a
demonstration that young infants represent sortal concepts would defeat
' ' '
Quine s conjecture as well as Piaget s characterization of the infants
conceptual resources.
Studies by Cohen and his colleagues(e.g., Cohen and Younger 1983) show
that quite young babies will habituate when shown, for example, a seriesof
distinct stuffed dogs, and that they generalize habituation to a new stuffed
dog and will dishabituate when shown a stuffed elephant. Similarly , when
shown a seriesof distinct stuffed animals, babies of 8 or 9 months habituate,
generalize habituation to a new stuffed animal, but dishabituate to a toy
truck. Do thesedata not show that babies of that age representconceptssuch
as ' dog' and ' animal?'
Certainly not. Babies may be sensitive to dog shapes or animal shapes;
babies may be habituating to doghood or animalhood. To credit the baby
with sortals such as ' dog' , or ' animal' , we must show that such concepts
provide the baby with criteria for individuation and identity .
My discussion of this question has two steps. First , I argue that babies
represent at least one sortal , object. Second, I present some recent data
from my lab that suggest that as late as 10 months of age, the baby may
have no more specific sortal concepts - not cup, bottle, truck , dog, animal
.... Thus, a Quinian interpretation of the above habituation data may well
be correct.
S. Carey I Reconceptualizingthe world
' '
Piaget s characterization of infants cognitive capacitieswas basedon tasks
in which the baby must solve some problem, often involving means- end
'
analysis, and often involving planning some action. For example, Piaget s
conclusions that babies do not represent objects as continuing to exist when
out of view were basedon the robust finding that babies under 8 or 9 months
'
cannot remove a cover to get a hidden object. The babies failure might be
due to their failure to realize the object still exists, as Piaget thought , or
equally might be due to their inability to carry out one action (remove a
cover) to achievesome other goal (obtain the object). What is neededis some
reflection of the baby' s conceptualization of the world that relies on behaviors
well within the repertoires even of neonates. Over the past 15 years or
so, such a method has been developed and is now very widely used. It relies
on babies' ability to control what they attend to.
The basic idea is simple. Under most circumstancesbabies will look longer
at what is unfamiliar or unexpectedcompared to what is familiar or expected.
Researchersuse this fact to diagnosehow the baby representssome situation ,
especiallywhat the baby considers surprising given his or her current state of
physical knowledge. The selectivelooking paradigm has been usedextensively
to probe babies' representations of objects, and the data from a subset of
thesestudies can be recruited to bear on the question at hand. They establish
that by four months of age the baby representsat least one sortal concept -
the concept of a physical object. The baby has criteria for individuation and
for numerical identity of objects.
Spelke and her colleagueshave shown that babies establish representations
of objects on the basis of criteria which individuate them - an object is a
coherent, bounded, entity that maintains its coherenceand boundaries as it
moves through space(seeSpelke, 1990, for a review) . The baby predicts the
motion of objects according to principles such as that one object cannot pass
through the spaceoccupied by another (Spelke et al. 1992, BaillargeoD 1990) .
Most relevant to the present discussionare studies showing that babies count
objects.
These are of two types. In the first , babies are simply presentedwith arrays
containing a fixed number of objects, say 2 of them, one after another. For
example, two cups, followed by two shoes, two bottles, two hats, two pens,
and so on. The pairs of objects are never repeated, so the arrays have nothing
'
in common but twoness. The baby s looking is monitored , and after a while,
' s attention to each new
the baby array decreases , relative to his or her
S. Carey I Reconceptualizingthe world
original looking time. The baby is getting bored. After looking time has
decreased to Y2 its original level, the baby is presented with an array
containing one object, or three objects. In both cases, looking time recovers
to its original level. The baby notices the difference between two objects, on
the one hand, and a single object or three objects, on the other. This result, or
one very like it , has been obtained with neonates(Antell and Keating 1983) .
In fact, the baby' s capacity to detect similarity in number across distinct
arrays serves a methodological wedge into the problem of how babies
individuate objects. The baby can be habituated as described above, to two
objects, and then presented with an array as in figure 5, consisting of two
distinct objects sharing a common boundary. Babies dishabituate to this
array, showing that they perceiveit as one object, rather than two. Thesedata
support the conclusion, derived from other types of data as well, that babies
are not sensitive to shape or texture regularity in individuating objects; they
needpositive evidenceof distinct boundaries, such as one object moving with
.111
,.:~..:..~
;1
'
~
respectto the other, or the objects being separatedin space.
: 111111
.
~
Fig. 5. Test~timulu ~ of two adjacentblocksof different size, texture and color .
A second source of evidence that babies count objects derives from data
showing that babies can add and subtract. Wynn ( 1992) showed four -month -
olds events in which a secondobject was added to an array already containing
one object. An object was placed on an empty stage while the baby
watched and then a screen was raised that covered the object. A hand
carrying a second object was shown going behind the screen and returning
empty. The screenwas then lowered, revealing either one object (unexpected
outcome, even though that was what the baby had last seen) or two objects
(expectedoutcome, if the baby knows I + I = 2) . Babies looked longer at the
unexpectedoutcome.
A further experiment showed that babies expected exactly two objects,
rather than simply more than one object. In this study, the expectedoutcome
was two objects, as before, but the unexpected outcome was three objects.
Again , babies were bored at seeing two there, and looked longer at the
S. Carey / Reconceptualizingthe world
That babies individuate and count objects does not show that they trace
identity of objectsthrough time, that they have the representationalcapacity to
distinguishone object seenon different occasionsfrom two numerically distinct
but physically similar objects. However, there are now two demonstrationsof
'
this capacity in infants age 4 months or younger. Spelke( 1988) showedbabies
objectsmoving behind and r~ erging from two separatedscreens , screenA to
the left of screenB (figure 6). An object emergedto the left of screenA and
returned behind it, and then an object emergedto the right of screenBand
returned behind it. At any given time, at most one object was visible, and no
object ever appeared in the space between screensA and B. Under these
conditions, 4-month- olds inferred there must be two objects, as shown by the
fact .that when the screenswere removed, revealing two objects (expected
outcome), they looked less than when the screenswere removed revealingone
object (unexpectedoutcome). Baillargeon ( 1990) showedinfants two objects at
once, one on either side of a screen. The babiesthen used the existenceof two
numericallydistinct objectsto make senseof what would be an impossibleevent
if only one object were involved. Togetherthesestudiesshow that babiesusetwo
spatiotemporalprinciples to individuate and trace identity of objects: one object
cannot be in two placesat the same time, and one object cannot go from one
p}a(' e to another without tracing a spatiotemporallycontinuous path.
In sum, infants have a concept physical object that functions as a sortal ;
they have at least one concept that divides reference, that provides criteria for
individuation and numerical identity . These criteria are spatiotemporal.
differencebetweenyounginfantsandadults
10. A majorconceptual
DiscontinuousCondition
I I Screens introduced
. . - - +
~
Object 1
brought out
. . +- -
Object 1
returned
8 +- -
I I Object 2
brought out
- - +
.
Steps 2 - 5 repeated
. Object2
returned
Screens removed
~ ~ revealing
Expected outcome
~ or
Unexpectedoutcome
Different Condition
UI ~~
III - - +
~
Object1
broughtout
~~~IIII +- -
Objecl1
relurnep
~ 4- -
1~
llii~~i Object 2
brought out
-...
~llilllll Object 2
returned
do not notice the difference between the two objects. In one version of the
study, babies were allowed to handle each object (one at a time of course, for
we didn ' t want to provide spatial information that there were two ) before
beginning the events. This made no difference to the results. In another, we
compared looking time to the familiarization events when the objects are of
different kinds (e.g., a cup and an elephant) to looking times during familiari -
zation in a condition where the objects emerging from each side of the screen
are of the same kind (e.g., two elephants) . Babies habituated much faster in
the latter condition . That is, they noticed that the elephant and the cup are
different from each other. After habituation, we removed the screen, revealing
either one object or two objects. Babies in both conditions (cup/elephant;
elephant/ elephant) looked longer at the outcomes of two objects (unexpected
in the elephant/ elephant condition ; expected in the elephant/ cup condition ) .
The preference for two objects was equal in the two conditions. Thus,
although babies notice the difference between the elephant and the cup, they
S. Carey I Reconceptualizingthe world
simply do not use this information to drive the inference that there must be
two numerically distinct objects behind the screen.
In appears, then, that in one senseQuine was right . Very young infants
have not yet constructed concepts that serve as adult -like meanings of
works like ' bottle ' , ' ball ' , and ' dog' . How are the babies representing these
events? We can think of two possibilities. First , the babies may actually
establish a representation of a single individual object (O BJECTi ) moving
back and forth behind the screen, attributing to this object the properties of
being yellow and duck-shaped at some times and white and spherical at
other times. The basis for such a representation could be spatiotemporal :
the infants may take the oscillating motion as a single, continuous , path .
A second possibility is that the baby is making no commitment at all
concerning whether the objects emerging to the left and right of the screen
are the same or different. That is, the baby is representing the event as
OBJECT emerging from the left of the screen, followed by OBJECT
emerging from the right of the screen, and represents these neither as a
single object (O BJECTi ) nor as distinct objects (O BJECTi , O B JECTj ) .
Supposeyou seea leaf on the sidewalk as you walk to class, and you seea
leaf on roughly the same place on the sidewalk as you return from class.
That may be the same leaf or it may not ; your conceptual system is capable
of drawing that distinction , but you leave the question open. If the infant is
leaving the issue open in this case, then why does he/ she appear surprised
when the screens are removed and two objects are revealed? On this
hypothesis, the longer looking time at two objects is a familiarity effect; the
infant has been familiarized with instances of single objects, and thus seeing
' '
two objects is different . After all , babies can be habituated to oneness by
being shown a series of objects, one at a time. Even if you were not sure
whether that leaf was the same as the one you had seen earlier , if you
returned to the classroom later in the day and encountered two leaves on
the sidewalk, you would see this state of affairs as different from ones in
which you encountered casesof single leaves on the sidewalk.
We do not know which possibility is correct. The baby actually may be
representing the events as if a duck-shaped object is turning into a ball -
shaped object (possibility one) or simply may be failing to establish representation
of two distinct objects (possibility two ) . The take-home message
is the samewhichever possibility is correct ; IO-month -old infants do not use
the property/ kind differences between a red metal truck and a gray rubber
elephant to infer that there must be two numerically distinct objects
involved in the event.
S. Carey/ Reconceptualizing
the world
II . A few eRne
' odin! remarks
Where does this leave us vis-a-vis the continuity assumption? The major
discontinuity posited by Quine and Piaget does not receive support; there is
no reason to think that babies lack the logical resourcesto representsortals,
and indeed, object functions as a sortal at least from 4 months on. But if the
interpretation of the Xu and Carey data suggestedabove is correct, then an
important Quinian discontinuity is supported. Babies may be setting up a
representation of an object which sometimesis round , white, and styrofoam
and at other times red, metal, and truck -shaped. This is a representational
systemvery different from yours and mine.
My story is not complete. I do not know if Xu and I are interpreting our
data correctly. Also , we have as yet no account of the mechanism by which
babies might begin to acquire specific sortals at around 11 months. But as I
am convinced important conceptual changes occur later in life (cf. Carey
1991, Carey and Spelke, in press), I would not be shocked to find interesting
discontinuities in the conceptual histories of infants, even in arenasso closely
implicated in languageas the conceptual underpinnings of count nouns.
References
Antell , S. and DiP. Keating, 1983. Perception of numerical invariance in neonates. Child
Development 54, 695- 701.
'
Baillargeon, R., 1990. Young infants physical knowledge. Paper presented at the American
PsychologicalAssociation Convention, Boston.
Bloom, P., 1990. Syntactic distinctions. Child Language 17, 343- 355.
Bowerman, M ., 1978. The acquisition of word meaning: An investigation into some current
conflicts. In: N. Waterson . C. Snow (eds.). Developmentof communication . New York:
Wiley.
Carey. S.. 1991 . Knowledgeacquisition: Enrichmentor conceptualchange ? In: S. Carey.
R. Gelman(eds.). The epigenesis of mind: Essaysin biology and cognition. Hillsdale. NJ:
Erlbaum.
Carey. S. and E. Spelke. in press
. Domain specificknowledgeand conceptualchange . In: L.
Hirschfeld. S. Gelman(eds.). Cultural knowledgeand domain specificity . Cambridge . UK :
CambridgeUniversityPress .
Cohen. L.B. and B.A. Younger. 1983. Perceptual categorization in theinfant. In: E.K. Scholnick
(ed.). New trendsin conceptualrepresentation . 197-200. Hillsdale. NJ: Erlbaum.
ua/izint. the world
S. CareyI Reconcept
Explanation , association ,
and the acquisition of word meaning *
FrankC. Keil
, 223 Uris Hall, Ithaca, NY /4853-760/ , USA
, CornellUniversity
Departmentof Psychology
A newly emerging view of concept structure, the concepts-in -theories view, suggeststhat adult
conceptsare intrinsic mixes of two different sorts of relations: (a) those involving domain-general
tabulations of frequenciesand correlations and (b) those involving domain-specific patterns of
explanation. Empirical results from early cognitive development suggestthat , by the time first
words are acquired, most concepts have this intrinsic mix even though changesin the nature of
the mix can produce marked developmental changesin apparent concepts, word meanings, and
their use.
. The concepts-in -theories view suggests that the sorts of constraints needed to model the
representation and acquisition of concepts cannot be based solely on either perceptual or
grammatical bases; they must also arise from blases given by specific patterns of explanation,
patterns that may depart from standard notions of intuitive theories. These in turn suggest
different views of possible constraints on the acquisition of word meaning.
I . Introduction
3. Hybrid vigor
perhaps some general hunches about birds. Those hunches insure that even
the associatively organized features will not include whether a bird was first
seenon a tree with an odd vs. even number of branches, or whether it was
facing left or right , or whether it was old or young. But , within those general
constraints, many other regularities must indeed be stored purely in terms of
correlations and frequencies. In this way, associative structures in concepts
can persevere. This perseveranceis inevitable, simply becausewe can never
have explanations for all regularities that we observe.
It is tempting to try to exclude frequency and correlation information from
the concept proper and say that they are only part of heuristics and
identification procedures having to do with concept use; but such a move
fails to explain why our beliefs are often directed specifically towards
explaining those correlations; the correlations form an integral part of the
system. Only in special caseswhere we have conventionally defined concepts
with accidentally correlated features, such as the concept odd number, do the
two aspectsbecomemore fully separable. There must be not only explanatory
structures that narrow down an indefinitely large number of features and
feature relations to a manageablenumber but also mechanismsfor storing
information that is outside the ken of current explanation. We could never
acquire new explanations if there were not some way of storing information
external to current explanations.
"
These issueshave often led to proposals of a core- periphery distinction in
concept structure, wherein typicality information is relegatedto the periphery
(see discussions in Arm strong et al. 1983 and Rey 1983) . Although this
distinction may imply that the periphery is less important , or even optional ,
the relation between the two facets of concepts may be more symbiotic.
Explanations do not amount to much if they do not have anything to explain,
and raw tabulations quickly overwhelm any informati ~n gathering system if
it does not partially order that information in terms of explanatory useful-
ness. A parallel casecan be made for scientific theories. Scientific theories are
sometimes presented as a set of tightly connected laws linked through
deductive chains (e.g. Hempel and Oppenheim 1948), but in practice, they
start with sets of regularities that they seek to explain, regularities that are
noticed and remembered often before the fledgling theory has any way of
incorporating them.
The atheoretical component of concepts should therefore not be relegated
to a ' periphery' that is not part of the concept proper . There is no general
way to justify such a move despite those cases of a few formal and
conventional terms where a clean distinction is possible. However, even as the
F. C. Keill Word meaning
The idea that most adult conceptshave a hybrid structure has not led to a
consensuson how this relation comes about in development. Two sharply
contrasting views predominate:
( 1) The earliest concepts are independent of theories and are therefore pure
casesof the probabilistic models. The earliest concepts are constructed
solely by tabulations of feature frequenciesand correlations presumably
with the help of low -level, domain-general perceptual and cognitive
blases. Early word meaningsat this time would be similarly basedon such
tabulations.
(2) Even the earliest conceptsmay be subject to organizational principles that
go beyond tabulations of features. Primitive systems of explanation
embed concepts even in infancy and certainly by the time of the first
words ; or at least young children' s concept-related behaviors are only
understandable by imputing to them mental structures like our own
explanatory systems.
This debate has proved difficult to resolve. Recent studies would seem to
favor the explanations-from -the-start alternative as younger and younger
children' s judgements are shown to apparently be governed by such explanations
. Yet , advocates for the opposite view need merely claim that the
acquisition of such explanatory systems occurs very early on in infancy
(McClelland , forthcoming), a reply that becomesincreasingly difficult to test
as ever younger infants are needed. But the tension between these views has
resulted in an important empirical discovery. There has been a rapid downward
march of the ages at which concepts are endowed with explanation,
from five-year-olds postulated as purely associative beasts ( Vygotsky 1962,
F.C. Kei/ l Wordmeaning
Moralterms
-
",~-~-"+ ~
+d
- - -c/ +d
Mealterms
~ -
/,-~-~-"-+
_ C
-
c/
-
/
+.
I
u
d
Tool terms
,K c/
~
+d
Kinshipterms
. ~
-..-".'~ ~-
,:;.-<-.
+
c
-
c/
~
+
/d
Cookingterms
K '
"
( '
+
c/
~
-to- definingshift in the acquisitionof word
Fig. I . Illustration of an apparentcharacteristic
meaning . The y-axesrepresentthe extent to which eachdescribedentity wasjudged to be a
legitimatememberof thecategory . Thex-axesrepresent threegradeschoolclasslevelsandadults.
The graphsshowthat thereis a developmental shift in whetherthe + characteristic / - defining
descriptionsor the - characteristicl + defining ones are taken to indicate membersof the
category . In addition, the shifts occur at different times on a domain by domain basis.
(Adaptedfrom Keil 1989 .)
with age such that explanations play ever more extensive roles in constraining
what features and correlations are noticed . Thus , as explanation - based knowledge
becomes more and more elaborated , default tabulations of information in
associative terms are less common . ( 2) There are shifts in understanding which
explanatory system is most relevant to a class of phenomena . A child might
realize that ' uncle ' is better understood in terms of set of biological relations
that comprise kinship and not in terms of social relations that govern
F. C. Keill Wordmeaning
also present that shrinks with age in each domain in the face of more
elaborated explanation. Although the characteristic:-to -defining change in
development is not strictly correct, the changesthat do occur support both
aspectsof the hybrid .
eating. Yet the same child will also judge that eating applies to all and only
animals if they are primed to think about terms like eating in biological/
functional terms ( Vera and Keil 1988) . Those studies are relevant because
they illustrate how a property can alternately be embedded in two very
different systems of explanation. Comparable switches in embedding for
kinship terms could cause the sorts of referentially overlapping ambiguities
proposed here.
Apparent developmentalchangesin word meaning may therefore reflect an
awarenessthat a different set of explanatory relations can be superimposed
on roughly the same set of instances and properties so as to afford new
insights. The different explanatory system may have been present for some
time, but is now decided relevant to a class of lexical items in a certain
domain.
Shifts betweenmodes of explanation occur in adults as well. For example,
most of the time we understand the personality traits and behaviors of others
in terms of belief/ desire patterns of causation. John is afraid of snakes
becausehe had some bad experienceswith them that led him to .believe that
snakes are dangerous to him , even if those beliefs are false about most
snakes. But we can understand the same traits in biological/ functional terms
as well. One would then explain John' s fear of snakes not in terms of his
beliefs and desires, but in terms of the ecological niches occupied by humans
and their evolutionary predecessorsand of a need to develop an adaptation
that causesa built -in fear of snakes. Although explanations of behaviors in
biological/ functional terms as opposed to belief/ desire ones may not be the
first option taken, once adopted, they lead to different inferences and
understandings.
If shifting modes of explanation over roughly the samereferents provides a
senseof ambiguity, there should be other signs of that ambiguity as well. One
example might be a tendency not to switch betweenmeaningswhen a word is
shared in a clause. In the sentence' The pilot banked at the intersection of
Pine and Locust and the lawyer at Oak and Walnut ' , it is bizarre to have
' bank'
being used first as an airplane maneuver and second as fiscal action.
' '
Similarly, in the sentence The rock hit the house and John did too , it is
'
awkward to have ' hit be in the mechanical/ billiard -ball sensefor the rock
and in the intentional sensefor John. One is more inclined to seeJohn hitting
the house in an inanimate senseas a hurtling body not intending the impact.
"
As a second example, consider the sentence The daffodil neededwater and
' ' '
John did too , where need has the senseof a physiological need in both
cases, not as physiological need and explicit desire or wanting in the other. If
F.C. Keill Word meaning 185
this analysis is correct, a child hearing an unfamiliar tenD can safely assume
that when a word is shared in usage, the mode of explanation is likely to
remain constant.
The awkwardness of mixing different sensesseemsless marked for other
casesthat are not true ambiguities. For example, with metaphor, one can say
' Phil embraced her ' ' '
body, and Bill her ideas , where embrace clearly has
different senses , but little awkwardness results. Similarly , with syncategore-
' '
matic terms like good , different sensescan be mixed in the sameclause, as in
'The meal was '
good and the fire was too . If thesesorts of intuitions prove to
be more general, they suggestthat shifts in explanatory frameworks are not
to be confused with a host of other possible context effects that cause more
subtle shifts in meaning (Barsalou 1987, Lakoff 1987) . There are dozens of
' '
different nuancesof the word instrument depending on context, as there are
'
of the verb ' cut ; but these do not seemto reflect deep shifts in explanatory
frameworks.
all categoriesand to check that in fact they would be lessinfonnative for such
kinds.
The importance of property types varies strongly as a function of the kind
of thing querled. Overall shape and size changeshad a profound impact on
functional roles of artifacts as well as on their lexical labels. By contrast
changes in color and surface patternings were irrelevant for comparable
ratings of artifacts. For non-living natural kinds, such as minerals, the effects
were roughly reversed, with color and patterning being seenas essentialto the
judgements and shape and size as largely irrelevant. For both plants and
animals the results were consistently similar , suggestinga set of expectations
about the category of living things in general. Shapeand color were generally
seenas most relevant across conditions and size as least relevant, a pattern
that was clearly distinct from both artifacts and non-living natural kinds. An
example of these different patterns of judgement for naming is shown in
figure 2.
Above and beyond the differences just described for the data set as a
whole, judgements also varied as a function of the kind of question asked.
Questionsabout what the altered entity should be called had different profiles
across the three category types than did questions about its function or
unusualness . This finding illustrates that judgements about preservation of
function and typicality are not the sameasjudgements of what a thing should
be called.- Such differencesmight seemobvious, but they are often conftated
in experimental researchwhere results collected with one type of judgement
are compared to those collected with another. Moreover, as seenshortly , the
precise nature of their interactions can be informative about the roles of
specific properties in particular categories.
One final measurement taken in these studies provided an important
finding . After all initial ratings were completed, subjects were asked to
consider ordinary exemplars of each of the judged categories and rate how
much those items normally vary with respect to overall shape, size color , of
patterning. Thus, a subject might be asked to judge how much ordinary
chairs, or lumps of gold, vary in color. These questions were designed to
addressan important issuein the literature : how much the natural variability
of features in a category could cause the discounting or emphasis of the
importance of featureswhen querled in one of the three ways. A 3-inch disk is
more likely to be considered a pizza than a quarter, even though it is more
similar to the quarter in physical size, because the minimal variability of
currency disallows even modest size changeswhereasthe larger variability of
size for pizzas allows for an albeit implausible 3-inch pizza (Rips 1989).
F.C. Keill Word meaning
. Shape
8
D Size
6 ~ Surface
Pattern
5.5
.E 5
c. 4.5
e
Ic 4
Ai
3.5
8
3
2.5
2
1.5
1
Artifact Non - Plant Animal
Living
NK
Perhaps more generally, tabulations of variability are the basis for judgements
of the effectsof changing properties: the lower the variability , the more
the judged impact of the change.
In thesestudies, variability alone did not predict the significant effectsseen
for any of the three judgement types. For example, if a subject said that color
typically varied more for chairs than for whales, those judgements did not
predict other judgements about category membership when the color was
changedfor novel instances. However, when the data analysis was performed
F.C. Kei/ I Word meaning
concepts into the lexicon. (For other arguments about the special influences
of lexical items seeMarkman 1989and Woodward 1992.)
It is one thing to demonstrate that adults seemto be strongly influenced by
abstract explanatory belief systemsin how they interpret the importance of
properties for different categoriesof things. It is quite another to demonstrate
that such systems might be implicated early in life when the lexicon is
growing most rapidly . We have begun a seriesof studies addressingthis issue
as well.
These studies use triads of pictures in which an exemplar with a novel
name is shown followed by an object that varies in the shapesof the most
salient parts but which preservescolor and surface patterning and by another
object that preservesall shapesbut varies in colors and surface patterning. In
each case, novel items and names are clearly depicted as being from one of
four categories: artifact , plant , animal, or non-living natural kind . The
original object is referred to in a manner that is indeterminate betweencount
' '
and mass(e.g. ' this is an animal that is in the hyrax group and this is a rock
' ' '
that is of the malachite group or this is my hyrax ... it is a kind of animal
' '
and This is my malachite ... it is a kind of rock ) .
In one preliminary study 16 three- and four -year-olds ignored shape and
emphasizedcolor when deciding which of two novel non-living natural kinds
shared the same label as an originally labeled one. For the living kinds and
the artifacts, shape differencesas depicted by different parts are vastly more
important . Color and texture also playa role for living things but not at all
for artifacts where, with few exceptions, they may apparently be understood
as purely conventional. This insight may get stronger as the children get older
and enter elementary school, a pattern that is now under active study. One
aspectof the results is shown in figure 3.
A secondstudy now under way suggeststhat shapechangesnot implicating
changes in salient parts are seen as less important for living kinds than
artifacts because of their generally smaller roles in changing functional
properties for living things. Different parts compel a senseof category change
for li .ying things. We have constructed stimuli sets where a class of novel
.
things is distorted two ways: ( I ) a computer graphics transformation displaces
the image pixels such that the object becomestwisted around its central
point and thus the overall shape is dramatically changed whereaslocal parts
stay pretty much the sameand in roughly the samerelations to each other ; or
(2) the overall shape is not changed, but small properties are substituted for
other ones, such as one kind of beak for another on a bird , or one kind of a
dial for another on a machine. When preschoolers are shown these images
F. C. Keitt Word meaning
' V
C
o
!
0 . 9
!
. !
0 . 8
. .
3
: S
0 . 7
- !
. ~
0 . 6
:
' V
c
0 . 5
i
8 .
. ~
0 . 4
! . 2
u
0 . 3
' 0
0 . 2
0 ~
0 . 1
B .
8 .
0
eo !
plants
Fig . 3. Results of preliminary preschool study showing that shape is de- emphasizedrelative to
color and surface pattern for non-living natural kinds whereas the opposite holds for living
kinds and artifacts.
and told that a target image has a certain .novel label and are then asked
which of the other two images (the overall shape distortion vs. local parts
substitution) should also take the same label, they give different patterns of
responsesfor living kinds vs. artifacts. For plants and animals, local part
differences from the target are much more important than overall shape
differenceswhereasthe opposite is true for the artifacts, as long as the local
part changesare not seenas disrupting function . These sorts of expectations
could certainly constrain how they learn the meanings of new words.
Follow -up studies have used ambiguous displays where the triads are
labeled in one caseas a kind of animal, and in other casesas a kind of nonliving
natural kind or a machine. In these cases, with precisely the same
display, generalizationsare different depending on how the depicted entity is
construed. Preliminary studies with preschoolers suggest that when, for
example, an entity is labeled a frog , the shapechangeis critical ; but when it is
labeled a rock , color and texture becomescritical ; hence expectations of the
kind of things observed can guide how properties and similarities are interpreted
for exactly the samedisplays.
The examples offered here raise the possibility that many constraints on
early concept learning and on word learning may flow from our beliefs about
the sorts of properties and relations most explanatorily central to fundamental
kinds of things. Shape, like any other property , may betheory -
dependent for its perceived causal powers and central organizing role in a
category. Very young children seemto have setsof strong expectationsabout
192 F.C. Keill Wordmeaning
7. Conclusions
argued that there are good reasons to suspectat least three and speculated
that as many as half a dozen more seemplausible.
These patterns concerning the acquisition and representation of concepts
matter directly for models of lexical representationand acquisition. Both the
strategiesused to acquire word meaningsand their underlying representations
themselvesare deeply affected by such views about concepts. The hybrid
structure suggeststhat word meanings are linked both to frequency-based
information and to causal/ explanatory beliefs and that different sorts of
lexical tasks may highlight one or the other of these two facets. Empirical
studies of the lexicon must therefore keep in mind these two facets and their
interactions with specific tasks.
Becausethe hybrid structure appears to be already present in the prelinguistic
child , both facets influence the child ' s first attempts to learn the
meanings of words, hence the claim that there are explanation-based constraints
on the acquisition of word meaning. If early representations were
only based on tabulations of feature frequencies, then strategiesthat key on
some of these features, such as shape, might be the sole means of narrowing
down the hypothesis space of what novel words refer to. Instead, even the
first guessesabout word meanings may be biased not only by bottom -up
counts of feature frequenciesand correlations, but also by top- down abstract
expectationsabout what sorts of featureswill be causally central to a domain.
Apparent developmental shifts in the internal representations of word
meaningsrarely, if ever, are signs of a change in basic underlying representational
format . Instead they may representincreasing elaboration of explanatory
beliefs that are able to interpret a larger and larger percent of the
tabulated information and shifts in which explanatory systemis deemedmost
relevant to understanding membersof a category. Younger children may not
be showing an inability to represent word meanings in certain formats as
much as they are exercisingdifferent default options and having lessdetail in
some of their explanatory systems.
The hybrid structure and its early appearancehighlights a different kind of
ambiguity in word meaning, wherein largely overlapping sets of instances
may have the same label applied to them but have different meanings, with
different inductions, different categorizations of critical test cases, and a clear
senseof ambiguity as opposed to vaguenessor synonymy. It appearsthat the
shifts of meaning here are not caused by realizing that a label points to a
'
wholly different part of one s conceptual system with different instancesand
feature clusters, but rather by realizing that the label may point to a class of
things that can be interpreted in terms of more than one explanatory schema.
F.C. Keill Wordmeaning
References
Baron-Cohen, S., A .M . Leslie and U . Frith , 1985. Does the autistic child have a ' theory of
'
mind ? Cognition 21, 37- 46.
Barsalou, L . W., 1987. The instability of graded structure: Implications for the nature of
concepts. In : U . Neisser (ed.) Concepts and conceptual development: Ecological and intellectual
factors in categorization, 101- 140. Cambridge: Cambridge University Press.
Bloom, P., 1994. Possible names: The role of syntax-semanticsmappings in the acquisition of
nominals . Lingua92, 297- 329(this volume).
Carey, ,S. 1985 . Conceptualchangein childhood. Cambridge , MA : MIT Press .
Chapman , L. J. and J.P. Chapman , 1969 . Illusory correlation as an obstacle to the useof valid
psychodiagnostic signs. Journalof AbnonnalPsychology 74, 272- 280.
Cheng,P.W. and Y. Lien, in press . The roleof coherence in differentiatinggenuinefrom spurious
causes . In: D. Sperber , A. Premack(eds.), Causalunderstandings acrosscultureand development
. Cambridge : CambridgeUniversityPress .
Chierchia , G. and S. McConnell -Ginet, 1990 . Meaningand grammar: An introduction to
semantics . Cambridge , MA : MIT Press .
Gelman, S. and G. Gottfried, 1993 . The child' s theoryof living things. Presentation at the 1993
Meeting of the Society for Research in Child Development , NewOrleans .
Hatano, G., R.S. Siegier , K. Inagaki, R. Stavyand N. Wax, 1993 . Thedevelopment of biological
knowledge : A multinational study. CognitiveDevelopment 8, 47- 62.
Hempel , C.G., 1965 . Aspectsof scientificexplanationand other essaysin the philosophyof
science . New York: FreePress .
Hempel , C.G. andP. Oppenheim , 1948 . Studiesin thelogicof explanation . Philosophyof Science
15, 135- 175.
Inagaki, K. andG. Hatano, 1993 . Youngchildren's understanding of the mind- bodydistinction.
Child Development (in press ).
Keil, F.C., 1986 . The acquisitionof natural kind and artifact tenDs. In: W. Demopoulos ,
A. Marras(eds.), Languagelearningand conceptacquisition , 133- 153. Norwood, NJ: Ablex.
Keil, F.C., 1989 . Concepts , kindsandcognitivedevelopment . Cambridge , MA : BradfordBooks.
Keil, F.C., 1992 . The originsof an autonomousbiology. In: M. A. Gunnar, M. Maratsos(eds.),
The MinnesotaSymposium on Child Psychology , Vol. 25: 103- 137. Hillsdale, NJ: Erlbaum.
Keil, F.C. and N. Battennan, 1984 . A characteristic -to-definingshift in the development of word
meaning . Journal of Verbal Learning and Verbal Behavior , 23, 221- 236.
Lakoff, G., 1972 . Hedges : A studyin meaningcriteriaandthe logicof fuzzyconcepts . Journalof
PhilosophicalLogic 2, 458- 508.
Lakoff, G., 1987 . Women, fire, and dangerousthings: What categoriesrevealabout the mind.
Chicago , IL : Universityof ChicagoPress .
Landau, B., 1994 . Where's what and what's where: The languageof objectsin space . Lingua92,
259- 296(this volume).
Leech , G., 1974 . Semantics . Hannondsworth , England: Penguin .
Leslie, A., in press . TOBY and TOM. In: D. Sperber , A. Premack(eds.), Causalunderstandings
acrosscultureand development . Cambridge : CambridgeUniversityPress .
Markman, E., 1989 . Categorization and namingin children: Problemsof induction. Cambridge ,
MA : BradfordBooks/MIT Press .
Mclelland, J., forthcoming . Paralleldistributedprocessing : Implicationsfor cognitivedevelopment .
Medin, D. L. and E.J. Shoben , 1988 . Contextandstructurein conceptual combination . Cognitive
Psychology 20, 158- 190.
F. C. Keil / Word meaning
Murphy, G.L. and D. Medin, 1985 . The role of theoriesin conceptualcoherence . Psychological
Review92, 289- 316.
Putnam, H., 1975 . The meaningof meaning . In: H. Putnam(ed.), Mind, languageand reality,
215- 271. London: CambridgeUniversityPress .
Quine, W.V.O., 1977 . Natural kinds. In: S.P. Schwartz(ed.), Naming, necessity , and natural
kinds, 155- 175. Ithaca, NY : CornellUniversityPress .
Rey, G., 1983 . Conceptsand stereotypes . Cognition15, 237- 262.
Rips, L.J., 1989 . Similarity, typicality, and categorization . In: S. Vosnaidu , A. Ortony (eds.),
Similarityand analogicalreasoning , 21- 59. New York: CambridgeUniversityPress .
Rosch, E. and C.B. Mervis, 1975 . Family resemblances : Studiesin the internal structureof
categories . CognitivePsychology 7, 573- 605.
Russell , B., 1924 . Logicalatomism. In : J.H. Muirhead(ed.), ContemporaryBritish philosophy ,
1stseries , 357- 383. London: Allen and Unwin.
Salmon , W.C., 1989 . Four decadesof scientificexplanation . Minneapolis , MN : Universityof
MinnesotaPress .
Slater, A., 1993 . Commentson symposiumon infant cognition. Presentation at the 1993Meeting
of the Society . for Research in Child Development , New Orleans .
Smith, E.E. and Medin, D.L., 1981 . Categorizationand concepts . Cambridge , MA : Harvard
UniversityPress .
Spelke , E., A. Woodwardand A. Phillips, 1993 . Kind of objectsand causalrelationsin infancy
(tentativetitle). In: D. Sperber , A. Premack(eds.), Causalunderstandings acrosscultureand
development . Cambridge : CambridgeUniversityPress .
Springer , K. and F.C. Keil, 1991 . Early differentiationof causalmechanisms appropriateto
biologicaland nonbiologicalkinds. Child Development 62, 767- 781.
Vera, A. and F.C. Keil, 1988 . The development of inductionsabout biologicalkinds. Presentation
. at the Annual Meetingof the Psychonomic Society , Chicago , IL.
Vygotsky, L.S., 1962 . Thoughtandlanguage(E. Hartmann, G. Vakar, TransCambridge, MA :
MIT press .
Werner, H., 1948 . Comparativepsychologyof mentaldevelopment (2nd edition). New York:
InternationalUniversitiesPress .
Woodfield, A., 1976 . Teleology . Cambridge : CambridgeUniversityPress .
Woodward , A., 1992 . The role of the whole objectassumptionin early word learning. PhiD.
dissertation , StanfordUniversity.
Wright, L., 1976 . Teleologicalexplanations : An etiologicalanalysisof goals and functions.
Berkeley , CA: Universityof CaliforniaPress .
Section 4
Ellen M . Markman
, Stanford, CA 94305
, StanfordUniversity
of Psychology
Department , USA
Very young children success fully acquire the vocabulary of their native languagedespite their
'
limited information processing abilities. One partial explanation for children s successat the
inductive problem word learning presents is that children are constrained in the kinds of
hypotheses they consider as potential meanings of novel words. Three such constraints are
discussed: ( I ) the whole-object assumptionwhich leadschildren to infer that terms refer to objects
as a whole rather than to their parts, substance, color , or other properties; (2) the taxonomic
assumptionwhich leads children to extend words to objects or entities of like kind ; and (3) the
mutual exclusivity assumptionwhich leadschildren to avoid two labels for the sameobject. Recent
evidenceis reviewedsuggestingthat all three constraints are available to babiesby the time of the
naming explosion. Given the importance of word learning, children might be expectedto recruit
'
whatever sources of information they can to narrow down a word s meaning, including
information provided by grammatical form class and the pragmatics of the situation. Word -
learning cOnstraintsinteract with theseother sourcesof information but are also argued to be an
especiallyuseful source of infonnation for children who have not yet masteredgrammatical form
class in that constraints should function as an entering wedge into languageacquisition.
1. Introduction
. This work was supported in part by NSF Grant # BNS-9109236. I am very grateful to Andrea
Backscheiderfor allowing me to publish the results of our study here. I thank Barbara Landau,
Lila Gleitman, and two anonymous reviewersfor their helpful comments.
possibilities about what a novel word might mean, the speed with which
young children acquire word meanings requires explanation.
Given the importance of word learning for language acquisition, children
might be expected to recruit whatever sources of information they can to
narrow down a word ' s meaning. One powerful one, for children old enough
to benefit from it , is grammatical form class. As one example, Fisher et ale
(this volume) provides dramatic demonstrations of the way in which syntactic
information helps constrain the meaning of a novel verb (seealso Fisher et ale
1991, Naigles et ale 1992) . Being able to infer aspectsof the communicative
intent of a speaker should provide another source of information about the
referent of a novel term. A conscientiousparent or other tutor might further
help by arranging the environment in ways to exaggeratethe salienceof the
aspect of the situation being labeled in the hope that their child will find it
salient too. ,Another potentially powerful source of information young
children can use to figure out the meaning of a new word comes from word-
learning constraints. Constraints on word meaning may be particularly
critical for babies who have not yet learned enough syntax to rely on
'
grammatical form class to limit their hypothesesas to a word s meaning. I
will argue that by the time they are ready to acquire vocabulary, children
place constraints on possible word meanings, thereby greatly reducing the
hypothesis space that needs to be considered. Children would not need to
formulate a long list of potential meanings and painstakingly assessthe
evidence in support of each. Rather, they could quickly zoom in on some
hypothesesthat they are predisposedto prefer.
In this paper, I will consider evidence in support of three word -learning
constraints: the whole-object, taxonomic, and mutual exclusivity assumptions
. I will argue that the most recent evidence supports the claim that
children' s early word learning is guided by such constraints. For some
different formulations of possible constraints seeBloom (this volume) ; Clark
1991; Golinkoff et al. 1992a).
To claim that children' s early word learning is a constrained form of
learning is nqt to claim that no other source of information matters. On the
contrary, the recent evidence reveals some of the complex and subtle ways
that word learning blasesinteract with other sour~ s of information . There is
evidence, for example, that word -learning constraints can affect each other.
When two or more constraints converge on the same hypothesis the learning
will be efficient compared to cases where the constraints conflict and one
must override another. Similarly word -learning constraints interact with
grammatical form class. In some casesboth thesesourcesof information lead
E.M. MarkmanI Early language
acquisition
' '
early words from genuine words that are referential; and it has been termed
' formative'
per (Snyder et al. 1981) to suggest that the early words are
responses used to perform some instrumental act rather than functioning as
words that make reference. Another characteristic of this first phase of word
learning is that words are added to the productive lexicon very slowly. A slow
accumulation of new words might proceed by some relatively unconstrained
associativemechanism. Dozens or even hundreds of trials may be neededfor
a baby to learn, for example, to say ' bye-bye' on command. This is in marked
contrast to the very fast acquisition seenaround 18 months.
This developmental shift at the time of the naming explosion suggestsan
alternative formulation of the hypothesis that word -learning constraints are
necessaryfor word learning. The more preciseclaim is that some constrained
form of learning is necessaryto account for the rapid acquisition of words
seen at the time of the naming explosion. Babies could not be acquiring
words at the rate of 30 or so a week if they were open-mindedly considering
all possible hypotheseseach time they encountered a novel word. Thus the
main hypothesis to be evaluated here is that word -learning constraints are
available to babies by the time they enter the naming explosion at roughly 18
months of age.
is that children are able to make progress in word learning by use of such
default assumptions (see Merriman and Bowman 1989 and Woodward and
Markman 1991) . These constraints on learning provide good first guesses
about the likely meanings of terms. Words that conform to the constraints
should therefore be easier to learn than ones that violate them. Some of the
controversy surrounding such claims stems from confusion over whether
constraints must be absolute or whether they function as default assumptions
(Gathercole 1989, Nelson 1988, Tomasello 1992) .
could refer to objects of the same kind , but it could in principle refer to the
object and its spatial location , or the object and its owner, or the object and a
salient part , etc. This is not , moreover, just a theoretical possibility . Many
studies of classification have found that not only do children attend to such
' thematic' relations between
objects, they often find them more salient or
interesting than taxonomic relations per se (seeMarkman 1989). When shown
a car, for example, and told to find another one, children might pick a man
becausehe drives a car rather than a truck , another vehicle. Given such
'
findings from children s classification, the question arises as to how children
avoid such thematic interpretations of a novel word. Hutchinson and I
proposed that children expect novel terms to refer to objects of like kind
(Markman and Hutchinson 1984) . Hearing a novel label, then, should cause
children to seek out taxonomic relations even in cases where thematic
relations would otherwise be more salient. There is now a substantial body of
evidencethat children from about 21J2or 3 expect labels to refer to things of
like kind (Baldwin 1989, in press; Hutchinson 1984, Markman and Hutchin -
son 1984, Waxman and Gelman 1986, Waxman and Kosowski 1990) . I turn
now to consider whether the taxonomic assumption is available to babies by
the time of the naming explosion.
Bauer and Mandler ( 1989) first addressedthis issue in their study of the
categorization abilities in very young children. As part of their study
'
, they
asked whether labeling would increase 16- 31-month -old children s tendency
to sort taxonomically . Unexpectedly, however, even the youngest children in
their study were sorting taxonomically from the start. That is, even with no
labels children were sorting taxonomically about 75% of the time. Labeling
did not increasethis already high level of performance. Bauer and Mandler
( 1989) have thus convincingly demonstrated that quite young children are
capable of sorting taxonomically . They also argue that there may not be any
general thematic preference. Because of the already high rate of sorting
taxonomically, however, they were unable to test whether children of this age
adhere to the taxonomic assumption. That is, it is still important to know
whether there are situations in which very young children show thematic
preferencesand, if so, whether hearing a label will cause them to shift to
taxonomic sorting. Backscheiderand I addressedthis by changing aspectsof
'
Bauer and Mandler s procedure that resulted in children responding taxo-
nomically from the start.
E.M. Markman
I Earlylanguage
acquisition 205
One reason why Bauer and Mandler ( 1989) achieved such a high rate of
taxonomic responding in their young children is that they used areinforcement
procedure whereby they briefly pretrained children to selecttaxonomic-
ally and maintained this selectivereinforcementof taxonomic choicesthroughout
the testing procedure. The selective reinforcement clearly mattered
becausein a control study Bauer and Mandler achievedan equally high rate
of thematic responding by selectively reinforcing thematic rather than taxo-
nomic responses . Since they demonstrated that selective reinforcement is a
powerful way to influence children' s responses
, Backscheiderand I avoided
selectivereinforcement (as did all earlier studies with older children). Other
differencesbetween our procedure and Bauer and Mandler ' s ( 1989) are that
we used pictures instead of objects, we used items whose thematic relations
we thought would be better known to 18-month -olds, and we used an up/
down placemen,t of thematic and taxonomic choices instead of left/right .
Finally, Bauer and Mandler counterbalanced the position of thematic and
taxonomic choices by alternating the left /right placement from trial to trial .
This alternation is unfortunate especially when coupled with a reinforcement
procedure. Children may have simply learned the correct answer changes
sidesevery trial . We counterbalancedside but not by alternating from trial to
trial . The major difference between the studies, however, is that we did not
differentially reinforce taxonomic responding.
Thirty -three children participated in our study. They ranged in age from 18
to 25 months with a mean age of 21.5 months. The experimental materials
consisted of ten triads, each containing one target picture, one picture
thematically related to the target, and one picture taxonomically related to
the target picture. We selectedthematic relations that we thought would be
highly familiar to even very young children. The taxonomic match belonged
to the same basic level category as the target but , where possible, came from
distinctive subordinate categories. The ten triads are listed in table 1.
The experimental questions were preceded by a set of four warm-up
questions designed to clarify the instructions and procedure but not to
differentially reinforce either taxonomic or thematic responding. The warmup
questions and the experimental questions were oddity tasks, where children
viewed a target picture and then selectedone of two choice pictures. To
begin, children were introduced to a frog hand-puppet who manipulated the
pictures. They were asked to help the frog find the pictures it wanted. The
children were told that the frog had just gotten a new house, and wanted
things to put in its house. In order to ensure that children gave unambiguous
responses , they were shown how to place the pictures they selectedin the frog
206 E.M . Markman I Early languageacquisition
Table I
's
Experimentalitemsfrom Markmanand Backscheider study
Choices
Target
Taxonomic Thematic
'
puppet s mouth . There were two experimental conditions , the No Label
Conditionand the Novel Labelcondition.
Table2
'
Warm-up itemsfrom Markmanand Backscheiders study
Target Choices .
. In a single trial the child saw either the taxonomic match or the thematic match , not both . The
distractor was always present .
choice pictures were placed on a magnetic board, one above the other. (Pilot
work suggestedthat a vertical alignment of the pictures produced fewer
responseblasesbased on position than did a horizontal alignment.) The top/
bottom position of the random distractor and correct picture was counterbalanced
. The target picture was held next to the top picture while the
' '
experimenter asked Is this another one? and then held next to the bottom
'
picture while the experimenter asked or is this another one?' When children
selectedthe wrong picture they were told that that was not the one that the
frog wanted and were encouragedto make another selection. For the warmup
questions, the frog would not accept the wrong picture - it would keep its
mouth closed and shake its head, etc. When children selected the correct
choice the frog opened its mouth to take the picture and enthusiastically
thanked them, saying ' Yeah! Yeah! That is just the one I wanted. Thank
'
you , kissed the child , etc. The experimenter then explained why that was the
picture the frog wanted, describing either the taxonomic relation betweenthe
target and the picture or the thematic relation as appropriate. For example
for the thematic pair hat/ head, the frog said ' That ' s great! A hat goes on a
head. Do you ever put a hat on your head? A hat goes on a head, they go
' ' '
together . For the taxonomic pair hat/ hat, the frog said That s great! Now I
'
have two hats. Look , these are the same, two hats . Thus, the warm-up
questions encouraged, explained, and reinforced taxonomic and thematic
responsesequally.
The procedure for the two experimental conditions was similar to that of
the warm-up questions with two exceptions. First , in the procedure proper
there was no unrelated distractor. One of the choice pictures was thematically
related to the target and the other was taxonomically related to it , as shown
in table I . Second, the frog acceptedany choice the child made, that is, there
was no selectivereinforcement during the experiment proper.
this another one?' The frog then held the target between the two pictures
while the child selectedone of them and placed it in the frog puppet' s mouth.
The experimenter took the target out of the frog ' s mouth as the child moved
his or her choice over to the frog . Children were enthusiastically thanked on
every trial , regardlessof which picture they chose.
well over what would be expected by chance, 1( 16) = 6.75, p < O.OOOI . Thus,
these young children do adhere to the taxonomic assumption. They extend
object labels to other objects of like kind rather than to objects that are
thematically related.
Wasow and I have conducted subsequent studies to detennine if these
results would replicate with babies 16 to 18 months old. Several changesin
procedure were made to accommodate to the younger children. We used
objects instead of pictures and reduced the number of questions that children
were asked. The experimental triads that were used are presentedin table 3.
The procedure was otherwise quite similar to that of Backscheider and
Markman , including use of wanD-up questions. Thirty -two 16-month -olds
and 32 18-month-olds participated in this study, half of each age group in
eachcondition . As before 18-month -olds honored the taxonomic assumption.
First , when in the No Label condition , 18-month -olds showed a clear
thematic bias selecting the taxonomic choice only 28% of the time, which is
below chance, 1( 15) = 10.97, p < O.OOOI. When the target object was given a
novel label, the 18-month -olds significantly increased their taxonomic selections
, now picking a member of the same category 54% of the time,
1(30) = 3.34, p < 0.005.
Table3
Itemsfrom Markmanand Wasow's studywith familiar objects
Target Choices
Taxonomic Thematic
Felt doll hat Plasticdoll hat Doll head
Whitedoll-sizesock Yellow toddler-sizesock Doll-sizefoot
" Smallerdoll (approx. 3" )
Babydoll (approx. 5 ) Doll-sizebottle
Red& whiteadult-sizetoothbrushPink child-sizetoothbrush Setof plasticteeth
Pink plasticdollhousetable White woodendollhousetable Pink plasticdollhousechair
Yellowplasticpail Redplasticpail Yellow& white~plasticshovel
We did not find this labeling effect with 16-month -olds, however. The 16-
month-olds did find the thematic relations salient, picking taxonomically only
34% of the time in the No Label condition , which was less than chance,
1( 15) = 3.76, p < O.OO2
. But labeling had no effect on their selections, with
babies selecting taxonomically only 39% of the time in the Novel Label
condition .
One possible reason for our failure to replicate the labeling effect with 16-
month-olds is that in this procedure we provide a novel label for a familiar
210 E.M. MarkmanI Early language
acquisition
object and then look to seehow the child interpreted the label. As I will argue
later, children find it more difficult to learn second labels for objects because
this violates mutual exclusivity, another word -learning constraint. Other
studies mimimized the conflict betweenmutual exclusivity and the taxonomic
assumptions by telling children that the novel word was a word in puppet
language or a foreign language (Markman and Hutchinson 1984, Waxman
and Gelman 1986) . Although older children have been shown to treat mutual
exclusivity as applying within a single language, not acrosslanguages(Au and
Glusman 1990), 16-month -olds would not have understood a discussion of
puppet language. To test the possibility that the conflict between mutual
exclusivity and the taxonomic assumption was confusing the 16-month -olds,
Wasow and I ran another version of the study, this time using novel objects.
Thirty -two 16-month-old babies were introduced to novel triads of toys
where two of the toys were similar in appearanceand two were related by
some thematic relation we demonstrated. For example, two honey dippers of
different colors were novel taxonomic pairs and one honey dipper was
scraped against a plastic grid to demonstrate the thematic relation. Babies
were then shown a target, e.g., a honey dipper, heard it labeled or not , and
were then depending on the condition , asked to find another one or another
X (where X was the novel label) . The full set of novel items used is described
in table 4.
Table 4
' Tern~ frnm Markmal1 and Wasow' s study with novel objects
Target Choices
Taxonomic Thematic
Red ' person' peg Yellow ' person' peg Wooden car with round hole
Largered propellor Smallyellowpropellor Red helicopter
(with post to hold propellor)
' '
object during the familiarization phrase saying, e.g., Look at the car . For
some babies the experimenterdrew attention to the object without labeling it
' '
saying, e.g., Look at this . As predicted, labeling the objects improved
babies' ability to categorizeas measuredboth by rate of habituation to within
category exemplarsand by dishabituation to a novel category (Waxman and
Helm 1991, Markow and Waxman 1992). Moreover, Markow and Waxman
'
discoveredthat at 12 months labeling objects improved babies categorization
even when adjectivesrather than nouns were used to refer to the objects. This
finding is of interest becauseit supports the idea that babies first treat words
as referring to objects of like kind rather than treating only nouns that way.
One-year-olds, presumably with minimal knowledge of grammatical form
class, were better able to categorize objects if they were labeled regardlessof
whether a noun or adjective was used as the label.
'
At this point it is of interest to consider briefly Bloom s (this volume)
alternative framework for constraints on word meaning. Bloom views the
postulation of particular constraints such as the taxonomic and whole-object
assumptions as unmotivated and corrects this in his model with an elegant
systemthat generatesconstraints to conform to universal grammar-cognition
'
mappings. Bloom s systematicanalysis provides a clear statement of what the
endpoint of development should be. Where we disagreeis in what to attribute
to the very early phasesof language learning - the time before children have
mastered grammatical form class. On the view I have proposed very young
children will break into the systemby assumingthat a word refers to a whole
object (the whole object assumption) and is extended to things of like kind
'
(the taxonomic assumption). Bloom s account, which is bolstered by counterexample
to my formulation , is that children map count nouns on to kinds of
individuals rather than kinds of objects. Yet , this may describe adevelop -
mental achievement rather than the initial state. Babies may treat terms as
referring to objects but with time note the regularity with which count nouns
refer to objects. Once this correlation is establishedchildren may then be able
' '
to metaphorically extend object-like or individual status to non-objects that
' ' ' '
are referred to with count nouns such as a forest or a lecture . Second, on
'
my account as well as Waxman s, young children rely on word -learning
constraints before they have mastered grammatical form class. Very young
children should err by treating adjectives, for example, as referring to kinds of
objects. The work just described (Markow and Waxman 1992) is one source
of evidencethat before children have masteredthe noun/adjective contrast in
'
English, adjectivesas well as nouns enhanceone-year-old s attention to kinds
of objects.
E.M. Markman
I Earlylanguage
acquisition 213
'
Labeling has been found to enhance even younger babies attention to
objects. Baldwin and I found that babies from 10- 14 months old attended
more to toys that had been labeled compared to ones that had not ( Baldwin
and Markman 1989) . In a second study, we compared labeling to another
'
powerful means of directing babies attention , namely pointing . Although
labeling did not increasebabies attention over pointing at the time pointing
occurred, during a subsequentplay period babies attended more to toys that
had been labeled than to those that had not . Thus, labeling may sustain
babies' interest in objects beyond the time the labeling occurs. By helping
babies sustain their attention to relevant objects, this labeling effect may
help babies notice and remember the word -object correspondences. It is not
yet known whether this labeling effect is caused by labels per se or whether
other nonlinguistic sounds would have the same effect (see Baldwin and
Markman 1989 for other possible explanations as well) . So far , the existing
data are inconsistent. Waxman and Balaban ( 1992) have found that words
but not tones increased attention to object categories in 9-month -olds. On
the other hand, Roberts and Jacob ( 1991) found that music as well as words
facilitated object categorization in infants . Whether the effect is specific to
words or not , words do heighten babies' attention to objects and object
categories.
Taken together, then, the experimental findings document that babies from
12- 18 m"Onths(and maybe even younger) use the taxonomic assumption to
help them determine the appropriate referents of a word. This conclusion
from the experimental data is supported by the results of Huttenlocher and
'
Smiley s ( 1987) study of naturalistic data. They generateda set of criteria to
distinguish thematic (complexive) extensionsof words from taxonomic extensions
, arguing that not every use of a word by a young child should be taken
as a simple label of an object. They followed severalchildren from the time of
their first word (around 13 months for most of the children) and periodically
recorded both the words children produced and details of the situation and
context in which utterances occurred. For example, a child who reaches
towards a cookie jar saying ' cookie, cookie' with an insistent request intonation
, should not be interpreted as labeling the cookie jar as a cookie. Instead,
the child is most likely requesting a cookie. Using coding categories that
'
capture reasonable interpretations of children s utterances, they found that
from the start children use words to refer to objects of like kind . Thus,
evidence from experimental and naturalistic studies alike indicate that the
taxonomic assumption is available to babies who have not yet undergone the
naming explosion.
214 E.M. Markman/ Early language
acquisition
Although the taxonornic assurnption goesa long way in reducing the kinds
of hypothesesbabies need to consider in figuring out the rneaning of a new
word , it by no rneans solves the problern. Babies rnay be led to treat novel
words (or nouns) as referring to kinds of objects but that still leavesopen the
question of which kind . This problern has been extensively addressed by
Waxrnan and her colleaguesand is reviewed in Waxrnan (this volurne) . To
'
briefly surnrnarize, labels facilitate young children s attention to objects at
sorne hierarchical levels but rnight actually interfere with their ability to
categorizeat other levels. Young children hearing a noun expect it to refer to
categories either at a basic or superordinate level of categorization, but do
not expect it to refer to the distinctions rnade at subordinate levels of
categorization. Rather, they expect this level of detail to be indicated by
'
adjectives. Waxrnan s work reveals the interaction between use of word-
learning constraints, grarnrnatical form class, and conceptual structure.
Not only does the taxonornic assurnption leave open the question of which
kind of object is being referred to , it even leavesopen which object is being
referred to. As Baldwin ( 1991) pointed out , in normal environrnents that
babies find thernselvesin , there are often rnany candidate objects around that
could serveas potential referents for a new word. Even if babies are prepared
to treat a novel word as referring to , say, a basic level category, they rnust
figure out which one. One possibility is that babies rnay treat a novel word as
refelTing to whatever novel object they are attending to. Whenever babiesand
adults are focussing on the sarne object, this would lead babies to correctly
identify the referent of a novel term. On those occasions, however, when the
adult was in fact labeling an object other than the one the baby was focused
on, the baby would wrongly interpret the new word and would rnake a
rnapping error. Such rnapping errors appear to be very rare, although there is
evidence that babies learn words rnore readily when parents tend to label
what their baby is attending to (Tornasello and Farrar 1986) . Baldwin ( 1991)
suggested that babies could avoid rnapping errors if they rnonitored the
'
speakers focus of attention. If babies recognize that the object they are
interested in is not the one the speakeris attending to , then they would know
not to treat the word as a label for their object. Baldwin tested whether
'
babies use information about a speakers focus of attention in inferring the
referent of a novel word or whether they rnapped a novel word to whatever
novel object they were interested in. To test this, she provided novel labels to
' '
16- 19-rnonth-old babies in one of two conditions. In the discrepant label
condition , babies were given a toy to play with . Once the experirnenter was
assured that the baby was exarnining the toy the experirnenter provided a
E.M . Markman I Early languageacquisition 215
novel label saying, e.g., ' It ' s a toma' . Instead of looking at the baby or the
' '
toy , however, the experimentergazedinto an opaque bucket as she said It s a
'
toma . Thus, the baby heard the novel label while looking at a novel toy but
while the speaker looked into a bucket that contained a second toy . In the
' follow -in '
labeling condition , the experimenter looked at the visible toy while
she provided the novel label.
In the follow -in labeling condition , babies of both ages treated the novel
word as referring to the visible toy . Of more interest is what happenedin the
discrepant conditions. The results were quite clear: at neither age did babies
treat the novel word as referring to the visible toy even though they were
looking at the toy at the time they heard the label.
By monitoring the eye-gaze, posture, direction of voice, or some other cues
to the speaker's focus of attention babies avoided making a mapping error.
The 16- 17-month -olds avoided errors by simply failing to learn the new
word. The 18- 19-month -olds not only avoided errors, but were able to infer
that the new word referred to the object that was hidden in the bucket at the
time of labeling. Baldwin' s results reveal how babies' use of word-learning
constraints is coordinated with their construal of the communicative intent of
the speaker.
The whole-object assumption states that babies should treat a novel label
as referring to an object rather than a part , substance, color , movement, etc.
There is somecontroversy as to whether young children treat a novel label as
referring to the shapeof an object rather than the object per se. Some studies
have found that young children will extend an object label to things of the
sameshaperather than those of the samecolor , or texture (Au 1989, Au and
Markman 1987, Baldwin 1989, Landau et al. 1988, Ward et al. 1989) . One
interpretation of these findings, however, is that children are treating the
terms as referring to whole objects of like kind , and that shape is a
'
particularly reliable clue to an object s taxonomic category. Sola et al. ( 1991)
reinforce the conclusion that it is the object, not the shape, that children are
responding to in that they do not treat words as referring to the shape of
non-solid substances even when different substances were arranged into
common shapes. Baldwin (in press) has found that , on the one hand, children
will extend a label on the basis of shapewhen there is no obvious taxonomic
category, but , on the other hand, children will extend a label on the basis of
E.M. MarkmanI Early language
acquisition
taxonomic category even when the objects in question do not have the same
shape. Moreover, Landau (this volume) along with Becker and Ward ( 1991)
have found that preschoolers treat novel terms as labels for objects rather
than shape in that they treat a term for a worm -like animal as referring to
another worm twisted into a different configuration . (Landau, however,
argues for a different interpretation of these results. In particular she argues
that children are still responding on the basis of shapeibut shape now is
defined as the possible shape transformations an object of a given shape can
undergo.) In addition , there are developmental differences in the way this
whole-object bias interacts with grammatical form class (Landau et al., in
press; Landau, this volume) . Older children and adults will treat a novel noun
as referring to the kind of object but treat an adjective as referring to a
property such as texture. But the youngest children treated even novel
adjectives as referring to kinds of objects. Here again we see that when
knowledge of grammatical form classis weak, word -learning constraints such
'
as the whole-object assumption dominate the child s interpretation of a novel
term.
Taken together, these findings suggestthat children attempt to honor the
whole object and taxonomic assumptions per se, rather than treating words
as shapeterms. Further support for the whole-object assumption comes from
studies that have documented that children interpret a novel term as a label
for an object and not its part (Markman and Wachtel 1988, Mervis and Long
1987) and for an object over its substance(Markman and Wachtel 1988, Sola
et al. 1991) .
other screenthe babies viewed a static novel object. When the screenswere
turned on and babies allowed to watch freely, they clearly preferred the
swirling substancesto the static objects. Thus, Woodward created a situation
where the object was lesssalient than the substancein motion . On some trials
babies heard a label that could be interpreted as referring either to an object
or substance. The prediction from the whole-object assumption is that
hearing a label should cause babies to shift attention more to the whole
object. This hypothesis was confirmed for 18-month-olds (though lessclearly
for the 24-month -olds) . Around the time of the naming explosion, then,
babies treat novel words as referring to objects per se, rather than to
.
whatever is most salient.
Testing a somewhat different hypothesis, Echois also has evidence suggesting
that well before the time of the naming explosion babies honor the
whole-object assumption, but that younger babies (8- to 10-month -olds) may
not. Echois ( 1990, 1991) asked whether young babies might map labels to
whatever in the environment is consistent rather than to objects per se. She
used an habituationjdishabituation procedure where babies heard labels
either in the presenceof consistent objects with varying motions or consistent
motions with varying objects. Trends in the pattern of habituation and
dishabituation led Echois to speculatethat there is a developmentalshift from
8- 10 months to 13- 15 months in how babies are affected by labeling. The
younger babies appeared to focus on what was consistent when they heard a
label while the older ones focused on objects per se. The older babies were
only 13 to 15 months old , however.
Woodward ' s visual preferencestudy and Echols' habituation studies both
suggest that the whole-object assumption is in place by the time of the
'
naming explosion. By focussing children s attention on objects as the most
likely referent of a novel word , the whole-object assumption greatly reduces
the number of hypotheseschildren need to consider for the meaning of a
novel term. The whole-object assumption thus promotes the rapid learning of
object labels. While the whole-object assumption can account for the speed
with which children acquire names for objects, it poses an obstacle for
learning terms for parts, substances , colors, texture, and other properties of
objects. With only the whole-object assumption to guide their interpretation
of novel words, babies would be limited to learning only object labels. One of
the functions of the mutual exclusivity assumption discussednext is that it
can override the whole-object assumption, freeing children to learn a greater
variety of terms.
E.M. MarkmanI Early language
acquisition
The mutual exclusivity assumption leads children to prefer only one label
for an object. This assumption is stronger than a related one of contrast
(Clark 1987, 1990, 1991), which is that any two words contrast in meaning.
' ' ' '
Although words such as car and vehicle contrast in meaning with one
being a superordinate of the other, they violate mutual exclusivity in that one
' ' ' '
object can be called both car and vehicle . In some cases, mutual exclusivity
comesinto conflict with the whole-object assumption. Supposea child hears a
' '
novel word , say finial in the presenceof a novel object, a pagoda. On the
' '
whole-object assumption, the child should (mistakenly) treat finial as referring
to the pagoda. Supposeinstead that another child who has learned that a
' ' ' '
pagoda is called pagoda also hears the novel word finial in the presenceof
the pagoda. Here the whole-object and mutual exclusivity assumptions are
brought into conflict. As before, the whole-object assumption would lead
' '
children to interpret finial as referring to the pagoda. In contrast, children
' ' ' '
who know a pagoda is called a pagoda are led to avoid treating finial as
' '
yet another word for the same object. By rejecting finial as a second label
children would then be motivated to find some other referent for the term.
Thus, mutual exclusivity can motivate children to find salient parts, substances ,
and properties of objects as referents for novel terms. As predicted by mutual
exclusivity, preschool children are in fact better able to .learn novel terms for
parts and substancesif taught on objects with known labels (Markman and
Wachtel 1988) . More generally children should be better able to learn a
variety of property terms when taught on objects with known labels. Mutual
-
exclusivity can then help override the whole object assumption allowing
children to interpret the novel word as something other than an object label.
Mutual exclusivity may also enable children to override the taxonomic
assumption to enable them to better learn proper names (Hall 1991) . While
the taxonomic assumption leads children to interpret new terms as referring
to objects of like kind , proper names refer to particular individuals , not
kinds. Hall ( 1991) predicted that children should be better able to learn
proper namesfor familiar objects with known labels than for objects without
known labels. To test this Hall ( 1991) relied on the logic first put forward by
Katz et ale( 1974) . Katz et ale taught children a new label for an object. They
then looked to see whether children restricted the new term to the labeled
object or whether they extendedit to a highly similar secondobject. Choosing
the original object above chance was taken as evidencechildren treated the
term as a proper name; indifference between the two objects was taken as
E.M . Markman / Early languageacquisition 219
evidence they treated the term as a common name. Katz et al. ( 1974)
manipulated two sourcesof information about whether something might be a
proper name or not. The first was grammatical form class, in particular ,
'
whether the novel term took an article or not , e.g., ' a dax' vs. ' Dax . The
second was the type of object that was labeled, in particular , animate-like
things which are appropriate referents of proper names, in this case dolls,
versus inanimate objects unlikely to be treated as unique individuals , in this
caseblocks. Their results for girls suggestedthat when both the grammatical
form class and the conceptual domain were appropriate, girls treated the
novel term as a proper name. That is, when a doll was called ' Dax ' , girls
interpreted Dax as a name for that doll and not the other one. When the doll
was called ' a dax' , girls treated the term as applying to both dolls. No matter
whether the block was called by common or proper name, girls treated the
new word as referring to both blocks. A partial replication of this stud)'
suggestedthat 18-month -old girls would also distinguish betweenproper and
common names for dolls. These results did not hold up for the boys,
however. Two -year-old boys were at chance for all object selections, even
when hearing a proper name for a doll .
Gelman and Taylor ( 1984) replicated the Katz et al. ( 1974) study with some
improvements and modifications. They addressedthe concern that evidence
for treating a term as a common noun was undifferentiated from chance
performance. By adding other distractors to the responseset, they avoided
this problem. Gelman and Taylor ( 1984) also used unfamiliar rather than the
familiar objects that Katz et al. had used. Testing somewhat older children
(2- 3-year-olds), Gelman and Taylor found that boys as well as girls treated
terms as common nouns except in the casewhere proper nameswere used to
refer to animate-like objects (stuffed animals) .
To return to Hall ( 1991), then, the question is whether, along with
grammatical form class and conceptual domain, mutual exclusivity is used to
help children learn proper names. By overcoming the tendency to treat a
novel label as a second term referring to things of like kind , mutual
exclusivity could promote the interpretation of a new term as a proper name.
Second labels for objects should be more likely than first labels to be
construed as proper names. To test this Hall ran a partial replication of
Gelman and Taylor ( 1984), running only the condition that resulted in proper
name interpretations in the prior work . That is, two - and three-year olds were
taught proper names for animate objects. They were taught in one of two
conditions: either the animate objects had known names or they did not.
Overall, Hall ( 1991) again replicated Gelman and Taylor ( 1984) and Katz et
E.M. MarkmanI Early language
acquisition
al. ( 1974) with children treating the novel term as referring to the individual
object labeled at above chance levels. Moreover, as predicted, children were
more likely to treat the novel term as a proper name when they already knew
another common name for the object. Combining the results of Hall ( 1991)
with those of Gelman and Taylor ( 1991), we can conclude that young
children recruit information from three sourcesto determine how to interpret
a novel term: When the conceptual domain is appropriate, when the syntactic
information is consistent with a proper name interpretation , and when
mutual exclusivity can help override the taxonomic assumption, young children
are most likely to treat the term as a proper name.
These studies with parts and substancesand proper and common names
lead to the general conclusion that some word -learning constraints can be
used to moderate or override others. Mutual exclusivity can override the
whole-object assumption leading children to learn terms for parts, substances,
and other attributes of objects. It can also override the taxonomic assumption
, thereby helping children interpret a proper name as a term for a unique
individual .
'
5.1. Mutual exclusivity as a guide to a word s referent
object with a known label which in turn prevented the children from
accepting another novel label for that object.
Two -year-olds' successat using mutual exclusivity to infer the referent of a
novel term belies the inferential requirements of this task. The logic of this
'
problem is of the form of a disjunctive syllogism: The novel word must refer
to either object A or to object B. It is not A (by mutual exclusivity),
therefore, it must be B' . Hutchinson ( 1986) has found , in fact, that this logic
of the problem may pose a problem for developmentally delayed children
even when they are matched in mental age to normal children. In her task,
normal and developmentally delayed children were given pairs of objects
where one object had a known name and the other did not. The normal
children were 2, 2Y2 and 3 years of age and, as expected, at all age groups
they chose the unfamiliar object at above chance levels as the referent of
a novel label. The performance for the developmentally delayed children
showed a marked developmental difference with children matched in mental
age to the 2Y2- and 3-year-olds selecting the novel object at above change
levels, but with children matched in mental age to the 20- 24- month-olds
failing to do so. Thesechildren did not select the novel object as the referent
of the novel word even though normally developing children of 20- 24 months
did. It was not until retarded children reacheda mental age of 28 months that
they seemed capable of using mutual exclusivity to infer the appropriate
referent of a novel term. One possible reason for this delay is that the logic of
the problem is particularly slow to develop in the delayed children.
There are two concerns that have been raised about whether children
around the age of the naming explosion can use mutual exclusivity to infer
the meaning of a novel term. The first is that this use of mutual exclusivity
has been documented only in children from two years of age and up .
Studies with younger children are needed. Second, an alternative explanation
for these results has been proposed by Merriman and Bowman
( 1989 .
) They argued that children might map a novel word to a novel
object in these studies becausethey are predisposed to fill lexical gaps - to
find first labels for objects. Thus , when children encounter an object for
which they do not know a label , they should seek out its name. Children
might be mapping the novel label to the novel object becausethey desire a
name for the novel object, not because they reject a second label for the
familiar object .
E.M. Markman/ Early language
acquisition
month-olds success fully learn a new first label after brief training , but they
learned a second label just as well. These young children clearly violated
mutual exclusivity to acquire this second label. We then ran a second study
with 24-month-olds to see what would happen if the task were made
somewhat more demanding. Now children were required to learn two new
labels rather than just one - either two new first labels or two new second
labels. In contrast to the earlier results, the two -year-olds now showed
evidenceof using mutual exclusivity. They succeededat learning two new first
labels but failed to learn the second labels for objects. This suggestsanother
way in which word -learning constraints function as default assumptions:
children may rely on the default assumptionsmore heavily when the demands
of the task increase.
6. Conclusions
References
Au, T.K. and M. Glusman , 1990 . Theprincipleof mutualexclusivityin word learning: To honor
or not to honor? Child Development 61, 1474 - 1490.
Au, T.K. and EM . Markman, 1987 . Acquiringword meaningsvia linguisticcontrast. Cognitive
Development 2, 217- 236.
Baldwin, D.A., 1989 . Prioritiesin children's expectations aboutobjectlabelreference : Form over
color. Child Development 60, 1291 - 1306 .
Baldwin, D.A., 1991 . Infants' contributionto the achievement of joint reference . Child Development
62, 875-890.
Baldwin, D.A., in press . Clarifyingthe role of shapein children's taxonomicassumption . Journal
of ExperimentalChild Psychology .
Baldwin, D.A. andE.M. Markman, 1989 . Mappingout word-objectrelations: A first step. Child
Development , 60 381 - 398 .
Banigan , R. L. andC. B. Mervis, 1988 . Roleof adult input in youngchildren's categoryevolution,
II : An experimental study. Child Language15, 493- 505.
Bauer, P.J. andJiM . Mandler, 1989 . Taxonomies and triads: Conceptualorganizationin one-to-
two-yearolds. CognitivePsychology 21, 156- 184.
Becker , A.H. and B.T. Ward, 1991 . Children's useof shapein extendingnovellabelsto animate
:
objects Identity versus posturalchange . CognitiveDevelopment 6, 3- 16.
Bloom, B., 1994 . Possiblenames: The role of syntax-semanticsmappingsin the acquisitionof
nominals . Lingua92, 297- 329(this volume).
Bloom, L., K. Lifter andJ. Broughton, 1985 . Theconvergence of earlycognitionand languagein
the secondyearof life: Problemsin conceptualization and measurement . In: M. Barrett(ed.),
Children's single -word speech , 149- 180. New York: Wiley.
Clark, E.V., 1987 . The principle of contrast: A constraint on languageacquisition . In :
B. MacWhinney(ed.), The 20th Annual CarnegieSymposiumon Cognition, 1- 33. Hillsdale,
NJ; Erlbaum.
Clark, E.V., 1990 . On the pragmaticsof contrast. Journalof Child Language17, 417- 431.
Clark, E.V., 1991.Acquisitionalprinciplesin lexicaldevelopment . In: S.A. Gelman, J.P. Byrnes
(eds.), Perspectives on languageand thought: Interrelationsin development , 31- 71. Cambridge
University Press .
Corrigan, R., 1983 . The development of representational skills. In: K.W. Fischer(ed.), Levels
and transitionsin children's development , 51- 64. SanFrancisco , CA: Jossey -Bass .
Dockrell, J. and R. Campbell , 1986 . Lexical acquisitionstrategies in the preschoolchild. In:
S. Kuczaj, M. Barrett(eds.), The development of word meaning , 121- 154. Berlin: Springer .
Dromi, E., 1987 . Early lexicaldevelopment . New York: CambridgeUniversityPress .
Echois, C.H., 1990 . An influenceof labelingon infant' s attentionto objectsand consistency :
Implicationsfor word-referentmappings . Paperpresented at the InternationalConference on
Infant Studies , Montreal, Quebec , April 1990 .
Echois, C.H., 1991 . Infant' s attentionto objectsand consistency in linguisticand non-linguistic
context. Paper presentedat the Biennial Meeting of the Societyfor Researchin Child
Development , Seattle , WA , April 1991 .
Fisher, C., L. R. Gleitmanand H. Gleitman, H. 1991 . On the semanticcontentof subcategorization
frames.CognitivePsychology23, 331- 392.
Fisher, C., D.G. Hall, S. Rakowitzand L. Gleitman, 1994 . Whenit is betterto receivethan to
give: Syntacticand conceptualconstraintson vocabularygrowth. Lingua 92, 333- 375 (this
volume).
E.M . Markman I Early languageacquisition 225
Gathercole, V.C., 1989 . Contrast: A semantic constraint ? Journalof Child Language16, 685- 702.
Gelman, S.A. and M. Taylor, 1984 . How two-year-old childreninterpretproperand common
namesfor unfamiliarobjects . Child Development 55, 1535 - 1540.
Golinkoft', R.M., C.B. Mervis and K. Hirsh-Pasek , 1992a . Early object labels: The casefor
lexicalprinciples . Unpublishedmanuscript .
Golinkoft', R.M., K. Hirsh-Pasek , L.M. Baileyand N.R. Wenger , 1992b . Young childrenand
adultsuselexicalprinciplesto learnnewnouns. Developmental Psychology28, 84- 89.
Gould, J.L. and P. Marler, 1984 . Ethologyand the natural history of learning. In: P. Marler,
H.S. Terrace(eds.), The biologyof learning,47- 74. Berlin: Springer .
Hall, D.G., 1991 . Acquiring proper nounsfor familiar and unfamiliaranimateobjects: Two-
'
year-olds word-learningblases . Child Development 62, 1142 - 1154.
Halliday, M.A.K., 1975 . Learninghow to mean. In : E.H. Lenneberg , E. Lenneberg(eds.),
Foundationsof languagedevelopment : A multidisciplinaryapproach , Vol. 1, 239- 265. New
York: AcademicPress .
Hutchinson , J.E., 1984 . Constraintson children's implicit hypotheses about word meanings .
Unpublisheddoctoraldissertation , StanfordUniversity, Stanford, CA.
Hutchinson , J.E., 1986 . Children's sensitivityto the contrastiveuseof objectcategoryterms.
'
Papersand Reportson Child LanguageDevelopment 25, 49- 56.
Huttenlocher , J. andP. Smiley, 1987 . Earlyword meanings : Thecasefor objectnames . Cognitive
Psychology19, 63- 89.
Katz. N., E. BakerandJ. Macnamara , 1974 . What' s a name? On thechild' s acquisitionof proper
andcommonnouns. Child Development 45, 469- 473.
Landau, B., 1994 . Where's what and what's where: The languageof objectsin space . Lingua92,
259- 296(this volume).
Landau, K.B., LiB. Smith and SiS. Jones , 1988 . The importanceof shapein early lexical
learning.CognitiveDevelopment 3, 299- 321.
Landau, B., L. B. Smithand S. Jones , in press . Syntacticcontextand the shapebiasin children's
andadultslexicallearning. Journalof Memoryand Language .
Liittschwager , J.C. and E.M. Markman, 1991 . Mutual exclusivityas a default assumptionin
secondlabellearning. Paperpresented at the biennialmeetingsof the Societyfor Research in
Child Development , Seattle , April 1991 .
Lock, A., 1980 . The guidedreinventionof language . London: AcademicPress .
Markman, EM., 1989 . Categorizationand namingin children: Problemsof induction. Cambridge
, MA : MIT Press , BradfordBooks.
Markman, E.M., 1992 . Constraintson word learning: Speculations about their nature, origins,
and domainspecificity . In : M.R. Gunnar, M.P. Maratsos(eds.), Modularity and constraints
in languageand cognition: The MinnesotaSymposiaon Child Psychology , 59- 101. Hillsdale,
NJ: Erlbaum.
Markman, E.M. and J.E. Hutchinson , 1984 . Children's sensitivityto constraintson word
meaning : Taxonomic vs. thematic relations . CognitivePsychology16, 1- 27.
Markman, E.M. and GiF. Wachtel, 1988 . Children's useof mutualexclusivityto constrainthe
meanings of words. CognitivePsychology 20, 121- 157.
Markman, E.M. and J. Wasow, in preparation . Very youngchilren's useof mutualexclusivity .
Markow, D.B. and SiR. Waxman , 1992 . The influenceof labels on 12-month-olds' object
categoryformation. Paperpresented at the InternationalConference on Infant Studies , Miami,
Florida, May 1992 .
226 E.M . Markman / Early /anRUQY.E~acquisition
Marler, P. and HiS. Terrace(eds.), 1984 . The biologyof learning. Berlin: Springer .
McShane , J., 1979 . The development of naming. Linguistics17, 879-905.
Merriman, W.E. and L.L. Bowman , 1989 . The mutual exclusivitybias in children's word
.
learning Monographs of the Society for Researchin Child Development54( 3/4), Serial
No. 220.
Mervis, C.B., and L.M. Long, 1987 . Words refer to wholeobjects: Young children's interpretation
of the referentof a novelword. Paperpresented at the biennialmeetingof the Society
for Research in Child Development , Baltimore, MD.
Mervis, C.B., RiM . Golinkoff and J. Bertrand, 1991 . A refutationof the principleof mutual
exclusivity . Paperpresented at the Societyfor Research in Child Development , WA.
, Seattle
Naigles , L.G., H. Gleitmanand L.R. Gleitman, 1992 . Childrenacquireword meaningcomponents
from syntacticevidence . In: E. Dromi (ed.), Languageand cognition: A developmental
perspective . Norwood, NJ: Ablex.
Nelson, K., 1973 . Structureand strategyin learningto talk. Monographsof the Societyfor
Research in Child Development 38( 1/ 2), SerialNo. 149.
Nelson, K., 1988 . Constraintson word learning? CognitiveDevelopment 3, 221- 246.
Nelson, K. and J. Lucariello, 1985 . The development of meaningin first words. In: M. Barrett
'
(ed.), Childrens single - word speech , 59- 86. Chichester : Wiley.
Quine, W. V.O. , 1960 . Word and object. Cambridge , MA : MIT Press .
Roberts , K. and M. Jacob, 1991 . Linguistic versusattentional influenceson nonlinguistic
categorization in 15-month-old infants. CognitiveDevelopment 6, 355- 375.
Rozin, P. and J. Schull, 1988 . The adaptive -evolutionarypoint of view in experimentalpsychology
, Vol. I. In: R.C. Atkinson, R.J. Herrnstein , G. Lindsey, R.D. Luce(eds.), Perception and
motivation(2nd ed.), 503- 546. New York: Wiley.
Shettleworth , S.J., 1984 . Naturalhistoryandevolutionof learningin nonhumanmammals . In: P.
Marler, 8 .S. Terrace(eds.), The biologyof learning, 41~ 33. New York: Springer .
Snyder , L.S., F. Batesand I. Bretherton , 1981 . Contentandcontextin earlylexicaldevelopment .
Journalof Child Language8, 565- 582.
Sola, N.N., S. Carey and E.S. Spelke , 1991 . Ontological categoriesguide young children's
inductionsof word meaning:Obj~ t termsand substance terms. Cognition38, 179-211.
Taylor, M. and S.A. Gelman, 1989 . Incorporatingnew words into the lexicon: Preliminary
evidencefor languagehierarchiesin two-year- old children. Child Development 59, 411- 419.
Tomasello , M., 1992 . The socialbasesof languageacquisition . SocialDevelopment1, 67- 87.
Tomasello , M. and M.J. Farrar, 1986 . Object permanence and relational words: A lexical
trainingstudy. Journalof Child Language13, 495- 505.
Tomasello , M., S. Mannieand L. Werdenschlag , 1988. Theeft'~ t of previouslylearnedwordson
the child' s acquisitionof wordsfor similarreferents . Journalof Child Language15, 505- 515.
Ward, T.BE . Vela, M.L. Peery,S. Lewis, N.K. Bauerand K. Klint , 1989.Whatmakesa vibble
a vibble: A developmental studyof categorygeneralization . Child Development 60, 214- 224.
Waxman , S.R., 1994 . The development of an appreciationof specificlinkagesbetweenlinguistic
andconceptualorganization . Lingua92, 229- 257(this volume).
Waxman , S.R. and M.T. Balaban , 1992 . The influenceof words vs. tones on 9-month-old
infants' object categorization . Paper presentedat the InternationalConferenceon Infant
Studies , Miami, Florida, May 1992 .
. Preschooler 's useof relationsin classifica -
Waxman , S.R. and R. Gelman, 1986 superordinate
tion. CognitiveDevelopmentI , 139- 156.
E.M . Markman I Early languageacquisition 227
Thedevelopment of an appreciation of
specificlinkagesbetween linguisticand
conceptual *
organization
Sandra R. Waxman
Department . Northwestern
of Psychology . 2029Sheridan
University Road. Evanston
. 1L 60208
, USA
.
'WhenI makea word do a lot of work like that I '
, alwayspay it extra
(LewisCarroll 1895 )
I . Introduction
Humans are uniquely endowed with the capacity to build complex , flexible ,
' '
and creative linguistic and conceptual systems. Infants and toddlers remarkable
achievements in each of these arenas have engaged researchers for
decades. Yet in recent years , it is the relation between linguistic and conceptual
development that has come to occupy center stage. Some of the most
exciting current work has been designed to explore the relation between early
'
linguistic and conceptual development in the young child s acquisition of the
lexicon .
This new , integrative approach has brought into sharp focus a fascinating
puzzle . We know that infants acquire their native language naturally at a
. This work wassupportedin part by grant # HD30410from NIH . Thanksto Marie Balaban
,
D. GeoffreyHall, ElizabethShipley, and an anonymousreviewerfor their commentson a
previousversion.
remarkable pace (Carey 1978, Dromi 1987, Gopnik and Meltzoff 1987,
Nelson 1983) . We also know that even before learning the words to express
them, infants appreciate many different kinds of conceptual relations among
objects, including category relations, thematic relations, causal relations, and
event-related or associative groupings (Bornstein 1984, Leslie and Keeble
1987, Mandler et al. 1987, Younger and Cohen 1986) . These early linguistic
and conceptual achievementsset the stagefor what has beendescribedas ' the
inductive problem of word learning' (Goodman 1983, Quine 1960, Carey
1990). The problem is that , in principle, the richnessand flexibility of infants'
conceptual abilities should complicate the task of mapping new words to their
meanings.
To understand why this is the case, consider a typical word learning
scenario, in which an adult introduces a child to a novel object (say, a
' '
flamingo) and offers a novel label ( a flamingo ) . Let us assumethat both the
child and adult are focusing attention on the same object or scene( Baldwin
and Markman 1989, Tomasello 1988) . If children have the conceptual ability
to appreciateso many different kinds of relations involving that object, and if
'
each of these is a potential candidate for the new word s meaning, then
how do infants select from among these many possible meanings when
determining what the new word is intended to convey? How do infants so
rapidly learn that a given word (e.g., flamingo) may apply to a particular
whole object and may be extended to other members of that object category
(e.g., other flamingos)~but not to salient properties of the object (e.g., its long
neck or unusual color), to salient actions in which it is engaged(e.g., feeding
its young), or to salient thematic relations (e.g., a flamingo and sand) ? If
children had to rule out theseand countlessother logically possiblecandidate
meanings, word learning would be a formidable task indeed.
Yet despite the logical difficulty of the task, young children rapidly and
successfully map novel words and meanings. This observation has led several
researchers , working from several different paradigms, to suggestthat children
come to the task of word learning equipped with certain implicit blases
or expectationswhich lead them to favor some types of conceptual relations
over others when ascribing meaning to a new word (Chomsky 1986, Landau
and Gleitman 1985, Pinker 1984, Markman 1989, Waxman 1990, 1991) . The
claim is that these expectations reduce the logical difficulty of word learning
S.R. Waxman I Linguistic and conceptualorganization
by narrowing the range of candidate meanings a child will consider for any
given new word.
Several such implicit blaseshave been proposed. For example, researchin
several different laboratories has converged on the finding that children
expect that the first word applied to a novel object will refer to the whole
object and other members of its basic-level kind , rather than to its parts or
other salient aspects(Markman and Wachtel 1988, Taylor and Gelman 1988,
Hall et al. 1993, Markman 1989) . Further evidencerevealsthat children, like
adults, expect that different words will contrast in meaning (Clark 1987,
Golinkoff et al. 1992, Markman 1984, 1989; Merriman and Bowman 1989,
Waxman and Senghas1992).
A third type of bias or predisposition will serve as the focal point of this
article. There is now considerableevidencethat children use the grammatical
form of a novel word (e.g., count noun, proper noun, adjective) as a guide to
determining its meaning (Brown 1957, Katz et al. 1974, Landau and Gleit -
man 1985, Naigles 1990, Gleitman et al. 1987, Hall and Waxman 1993) . For
example, by two to three years of age, English-speaking children expect
objects and object categories(e.g., flamingos, birds, animals) to be marked by
count nouns (Markman and Hutchinson 1984, Waxman and Gelman 1986,
Waxman and Kosowski 1990, Waxman and Hall 1993) ; they expect substances
(e.g., wood ; gel) to be marked by mass nouns (Dickinson 1988, Sola
et al. 1991); and they expect object properties (e.g., size, color ) to be marked
by modifiers (Gelman and Markman 1985, Hall et al. 1993, Taylor and
Gelman 1988, Waxman 1990) .
Notice that any linkage between grammatical form class and meaning
requires that the word learner has (a) the linguistic capacity to distinguish
among the relevant syntactic categories (e.g., count noun vs. mass noun vs.
adjective) in her language, and (b) the conceptual or perceptual ability to
appreciate the various kinds of relations among objects.
In sum, recent work has establishedthat young children appreciatelinkages
between particular types of words (e.g., count nouns vs.. adjectives) and
particular types of conceptual relations. These linkages help to explain how
children so rapidly map novel words appropriately to their meanings. For ,
although children appreciatemyriad kinds of conceptual relations, only some
of these relations become lexicalized. Children do not sample randomly
among thesemany possible relations when determining the meaning of a new
word. Instead, particular kinds of conceptual relations are favored in the
context of learning particular kinds of words.
232 S.R. WaxmanI Linguisticandconceptual
organization
In this article, I take as a starting point the general hypothesis that (a)
young children are sensitiveto preciserelations linking linguistic with conceptual
development, and that (b) these linkages promote rapid lexical acquisition
and foster the establishmentof powerful systemsof conceptual organization
. I begin by reviewing the evidence in support of this position with
preschool-aged children, focusing primarily on the role of linguistic information
'
(e.g., count nouns vs. adjectives) in the young child s ability to form
categories of object kinds at various hierarchical levels (e.g. flamingo, bird ,
animal) . Seefigure I .
Next , I turn to theoretical questions concerning the origin or developmental
status of these linkages between word learning and conceptual
organization, asking how and when they emerge in the developing child ,
This discussion highlights some limitations in the existing literature and
underscores the importance of two distinct , but complementary research
approaches.
First , becauseso much of the existing literature is devoted to primarily
preschool children who have already made significant linguistic advances,
we are left with a very limited understanding of how the child acquires these
important linkages early in development. Therefore, the goal of the first
approach is to chart the emergenceof these links in preverbal infants and in
toddlers by examining the influence of language on their categorization
abilities.
Second, becausethe existing researchhas been basedalmost exclusively on
English-speaking subjects, it is unclear whether these linkages are universal
features of human development or specific to English. Therefore, the goal of
the secondapproach is to seekevidencefor theselinkages in children learning
languagesother than English.
Although these developmental and cross-linguistic research programs are
still very much in progress, our initial results converge to suggest that the
linkage between count nouns and object categories is evident even at the
onset of language acquisition and may be a universal phenomenon. In
contrast, the data suggesta very different developent course for the linkage
between adjectives and properties of objects. This linkage appears to emerge
later in development and to vary across languages.
S.R. WaxmanI Linguisticandconceptual
organization 233
[
e
0 ~
~
=
A
.J
[
e
O
8J~
'~ s '
i
=[
~
1s
~
SQq1
-
ob
~
R
=0
";.0 0 e
-e doWE u
~ ~
Q
.9 .ce
0B
CE ~
~
:=
8
~
~ . ..
~ ~
~
~ ~
..
~ .. ~
~
~ ~
~ ~
~
~ ~ ~
234 S.R. Waxman / Linguisticand conceptualorganization
conjunction with the photographs from each class (e.g., These are the akas;
these are the dobus) . Children in the Novel Adjective condition also heard
novel words, but the words were presented within an adjectival syntactic
context (e.g., ' These are the ok-ish ones, theseare the ones that are doh-ish' ) .
The children in this experiment were very sensitiveto the linguistic context
in which the novel words were introduced. Novel nouns facilitated object
1
categorization at the superordinate, but not the subordinate level. In the
Novel Adjective condition , this pattern was completely reversed. Unlike
nouns, novel adjectives supported the formation of subordinate level object
categories, but exerted no demonstrableeffect at either the basic or superordinate
levels. Thus, each of these different linguistic forms facilitated object
categorization at particular hierarchical levels.
An interestingparallel to this phenomenonin children has beendocumented
acrossa wide variety of adult languages , both spokenand signed. According to
ethnobiological data, count nouns typically mark objectsand object categoriesat
the basicand superordinatelevelswhile adjectivestend to mark subordinatelevel
distinctions (Berlin et aI. 1973, Newport and Bellugi 1978). Although these
correlationsbetweenlinguistic form and object categoriesat particular hierarchi-
cal levelsare not perfect, they do suggestthat a relation betweennaming and
object categorizationmay exist throughout the lifespan. (SeeWaxman, 1991, for
a more thorough discussionof this literature and its relevanceto acquisition.)
The developmental finding that novel nouns and adjectiveseach produced
systematic, but distinct , patterns of results at distinct hierarchical levels
revealsthat three-year-olds are not only sensitive to the distinctions between
these two linguistic forms, but also consider linguistic form as relevant to
establishing meaning. This important finding constitutes strong support for
the hypothesis that by three years of age, children appreciate powerful and
preciselinkages betweenword learning and conceptual organization.
Notice , however, that the data from preschool-aged children cannot
address crucial questions concerning the development of these linkages in
infants and toddlers. (See Nelson, 1988, for an extended discussion of this
point .) Neither does the existing evidence address questions concerning the
universality of such linkages across languages. These questions become
1 In fact,
although novel nouns facilitated classification at the superordinate level, they made
classification at the subordinate level more difficult . Children in the Novel Noun condition
classified less successfully at the subordinate level than did their agemates in the No Word
condition . This very interesting result has spumed a whole independent line of research (see
'
Waxman et al. 1991) which suggeststhat children s interpretations of novel words are mediated
by their existing lexical and conceptual infonnation .
S.R. Waxman I Linguistic and conceptualorganization 237
of Unkages
3. Three alternative accounts of the development of aD appreciation
between Hnauisti (: form and conceptual organization
The first alternative account posits that these linkages are learned entirely
on the basis of infants ' experience with human language. On this account,
infants embark upon the process of lexical development with no a priori
expectations concerning linkages between word meaning and conceptual
organization . Instead, they learn their first words in an unconstrained
fashion, slowly establishing the mappings betweenwords and their meanings.
Later, once they have made a sufficient number of word -to -meaning map-
pings, infants may come to notice a correlation between particular linguistic
forms (e.g., nouns, adjectives) and particular types of meaning (e.g., object
categories, object properties) . They may then exploit this correlation in future
word learning (seeNelson, 1988, for a fuller discussionof this account) .
This first account is plausible becauseprior to the naming explosion, lexical
acquisition is indeed comparatively slow; later, perhaps once infants come to
notice the linkages betweenform class and meaning, their rate of acquisition
increasesexponentially. It is possible that it is only at the onset of the naming
explosion that infants have accumulated enough word -to -meaning mappings
238 S. R. Waxman I Linguistic and conceptualorganization
categories. This pattern would suggestthat infants embark upon the process
of lexical acquisition with a rudimentary linkage between words and object
categories that will become increasingly specific as a function of their
experiencewith the particular syntactic distinctions drawn in their language.
To adjudicate among these broad alternative accounts, I have initiated a
detailed examination of the influence of words of various linguistic forms on
infants' and toddler ' s object categorization. I have also begun to examine
young children learning languagesother than English.
To foreshadow, the results of thesetwo complementary setsof experiments
converge to provide initial support for the third alternative account. Infants
at 12 months begin the processof lexical acquisition with a general expectation
that words (independent of their linguistic form class) will refer to
objects and object categories. This initial , rudimentary linkage becomes
increasingly specific in the second year of life , perhaps as a function of their
own languageexperience.
completed this procedure four times, with four different sets of objects: 2
basic level sets(cows vs. horses; cars vs. planes) and 2 superordinate level sets
(animals vs. vehicles; tools vs. animals) .
Infants were assignedto one of three conditions, which differed only in the
'
experimenters comments during the Familiarization phase. See figure 2. In
the Novel Noun condition , the experimenter labeled objects during the fami -
'
liarization phase (e.g., ' See the auto ) . In the Novel Adjective condition , she
' '
introduced the novel word in an adjectival context (e.g., Seethe aut-ish one ) .
In the No Word condition , she drew attention to each object but offered no
'
label (e.g., ' See this ) . The test phase was identical for infants in all three
conditions. The experimenter introduced the test pair (e.g., cow vs. horse),
' '
saying, Seewhat I have . No object labels were introduced in the test phase.
Becauseinfants at this age do not yet distinguish among linguistic form
classes such as noun and adjective in their own language production or
comprehension(Bloom 1990, Gordon 1985, McPherson 1991, Prasada 1993,
Valian 1986), it is unlikely that they would consider linguistic form as
relevant to establishing meaning. We therefore hypothesized that for infants
at this developmental moment, object categories would be highlighted in
word learning in general, not by nouns in particular . We predicted that
infants in both the Novel Noun and Novel Adjective conditions would
categorize more readily than would infants in the No Word condition . More
specifically, we predicted that infants hearing novel words (be they nouns or
adjectives) would show ( 1) a greater decreasein attention to the objects over
the familiarization phase, and (2) a stronger preferencefor the novel object in
the test phase than should infants in the No Word condition .
The results of the experiment were consistent with these predictions.
Consider first the data from the familiarization phase, depicted in figure 3.
We calculated individual contrast scores to test the prediction that infants
would show a linear decreasein attention across the four familiarization
trials. At the basic level (figure 3a), infants in all three conditions showed this
linear trend. This is consistent with arguments concerning the primacy of the
basic level. However, on the superordinate level trials (figure 3b), only infants
hearing novel words (in the Novel Noun and Novel Adjective conditions)
showed a decreasein attention.
During the test trials , the effects of novel nouns and adjectives were also
quite comparable. Figure 4 displays the proportion of attention the infants
devoted to the novel test object. At the basic level, infants in both the Novel
Noun and Novel Adjective conditions showed a reliable novelty preference;
those in the No Word condition showed no such preference. At the super-
S. R. Waxman / Linguistic and conceptualorganization
FamlUarizatioa Phase
Pm
Dili8iz8ticm Tri8l1 P8DIi1i1ri8ti- Trial7.
(ADiI D8lVI. Fruit) (ADimalVI. Fruit)
"S:1118
NOUN
:I1VB ,
-
ee?"..mkh"S:V
:
NOUN 1118
,
-
ee,".u-ni8h
ADJFL
"
NO -
S 1118
:"S-ee
WORD r
ee -
'" NOrAD
JB
Cn
WORD-
S
B '" ,-
:"S-eer
1118
ee
Pamililriz8timl
Trial3 F8I Dili8rizali
- Trial4
(AnimllVI. Fruit) (ADim8 I VI. Pruit)
Test Phase
"Seewb8tI haver
T. Trial
-WO
NOU
:NO
S
ADJS r"-i7
:dI8
'_
-dI8
7 ee
-8h
ee
(ADim8
I VI. Pnit)
-SeewM I baver
Fig. 2.
S.R. Waxman I Linguistic and conceptualorganization
- BASICLEVEL
FAMlLI ARI ZATl O~ PHASE FA~ ILIARIZATIO:\: PHASE- SUPERORDINATE
LEVEL
30
oo
'U
1 :!:::.25
1
~
(
II
~
~
~
-~
;IIz:20
- Q - NOUN
~
< -
-
0
0
-
-
NOUN
ADJ~
- 0 - - ADJK
" nVE . NOWORD
NOWORD
IS@
- . -
1 4 4
FAMILIARIZATIO~ TRIAL ~ TRIAL
FAMILIARIZATIO
Fig. 3.
ordinate level, only infants in the Novel Noun condition showed this
preference.
These are very striking results, for they reveal a nascent appreciation of a
linkage between words and object catagories in infants who have yet to
commencethe naming explosion. This finding weakensconsiderably the first
alternative account - that prior to the naming explosion, infants fail to
appreciate any linkages betweenword learning and conceptual organization.
'
Clearly, novel words do focus infants attention on object categories. These
results also weaken the second alternative account - that infants embark
upon the processof word learning with a fully developedappreciation of the
specificlinkages betweentypes of words (e.g., nouns and adjectives) and types
of meaning. Instead, these data support the third view - that 12- and 13-
month -old infants begin the processof word learning with a general expectation
that words, be they nouns or adjectives, will refer to object categories.
How do the piecesof evidencefrom the infants fit together with the data
from the 2-year-olds? Together, the data suggestthat from the earliest stages
'
of lexical acquisition, count nouns focus infants attention on object categories
. Indeed, we have also obtained converging evidenceon this point with
16- and 2O-month -old subjects, using an entirely different method (Waxman
S. R. Waxman I Linguistic and conceptualorganization 245
Tf:ST TRIALS
0 70 .
.
0 65 .
. .
. ~
. .
~
ADJB: ' nvE
0 60 .
0
NOWORD
0 . 5
* *
<
O .
OOS
p
O OS.
p <
0 . 5
BASIC SUPERORDINATE
LEVEL
Fig.4.
and Hall 1993). However, initially , this focus is not specific to count nouns.
At 12 and 13 months, both nouns and adjectives focus infants' attention on
'
object categories. The infants ability to distinguish between linguistic form
classesand to use these distinctions as a guide to establishing word meaning
must undergo important developmental change during the second year. By
the time they are approximately two years of age, infants begin to teaseapart
the syntactic form classes; by two and a half years, we begin to get evidence
that they treat nouns and adjectivesdifferently with respect to object categorization
(Waxman and Kosowski 1990, Taylor and Gelman 1988) .
Put differently , the data suggestthat .the affinity betweencount nouns and
object categoriesis evident even in preverbal infants, but the specificity of this
246 S. R. Waxman / Linguistic and conceptualorganization
affinity increasesover development. This pattern fits nicely with some anecdotal
evidence concerning early word learning. One interesting observation
has been made by severalresearchers: Prior to the naming explosion, infants
seemto interpret most words, independent of syntactic form , as referring to
objects and categoriesof objects. This is illustrated by the oft -cited anecdote
'
regarding infants initial interpretation of adjectives like hot. In the earliest
' ' ' '
stagesof lexical acquisition, when children hear, Don t touch that. It s hot ,
they often interpret hot as referring to an object (e.g. a stove), rather than to
a salient property of that object. Indeed, all of the data documenting that
children use syntactic form to affix meaning to a new word comes from
children who have at least embarked upon the naming explosion (Hall 1992,
Hall et ale 1993, Katz et ale 1974, Markman and Wachtel 1988, Sola et ale
1991, Waxman 1990, Waxman and Kosowski 1990) .
Basedon the data reviewed thus far , I have suggestedthat the appreciation
of a linkage between count nouns and object categories undergoes no
developmental change: it appears to emerge early, requiring little , if any,
experiencewith the language. In contrast, an appreciation of specific linkage
between other grammatical categories (e.g., adjectives, mass nouns, verbs)
and meaning emergeslater in development and may depend upon language
experience.
Notice , however, that this suggestion is based almost exclusively on
English-speaking subjects. This is a serious limitation , for it is important to
determine whether the patterns observedin our English-speaking samplesare
universal to human development. (SeeSlobin, 1985, for excellentdiscussionsof
the necessityof cross-linguistic work in establishingtheories of acquisition.)
S. Cross studies:FrenchandSpanish
- linguisticdevelopmental
infants expect that words (independent of their grammatical form ) will refer
to objects and categories of objects. Later , this general linkage gives way to
more specific pairings between particular grammatical forms and particular
types of meaning. The affinity between count nouns and object categories
emergesearly and is evident in all three languageswe have examined to date.
In contrast, the more specific linkages for adjectives emerge later and may
vary, depending upon the language being acquired.
This account of the child ' s emerging appreciation of linkages between
linguistic and conceptual organization is consistent with other major milestones
in lexical acquisition. It gains further plausibility by virtue of the fact
that it is also consistentwith cross-linguistic evidenceconcerning the linguistic
categoriesnoun and predicate (including, e.g., adjectives, verbs) .
The cross-linguistic consistency of the gra~ matical category noun and the
variability of the category adjective, which has been documented by linguists
(c.f., Dixon 1982), is quite relevant to issues of acquisition. The syntactic
category noun has a stable and uniform function across human languages.
Count nouns refer primarily to objects and classesof objects. Furthermore,
unlike words from other form classes, nouns supply principles of individuation
and identity for their referents. (See Macnamara, 1986, for thorough
discussionsof these principles; see Hall 1992, Hall and Waxman, 1993, for
evidencethat preschool children, like adults, expect that count nouns supply
these principles.) Additionally , early lexical acquisition consists predominantly
of nouns (Gentner 1982, Nelson 1973) .
In contrast to the cross-linguistic stability of the classnoun, membersof the
predicate system (e.g., adjectives, verbs) have a more fluid status. There is
considerablecross-linguistic variation as to what information is conveyed as
part of one predicate and what is conveyedas part of another (Gentner 1982,
Talmy 1985). Furthermore, there is cross-linguistic variability in the evolution
of particular predicates. Dixon ' s ( 1982: 1- 3) discussion of the distinction
betweennouns and adjectivesis quite revealing.
' It is an
empirical fact that there is always a major class that is aptly termed Noun ; there is
never any doubt as to the applicability of this traditional label, and never any question as to
which class should be called Noun ... However, not all languageshave the major word class
Adjective. Either they have no Adjective class at all , or else there is a small non-productive
minor class that can be called Adjective. In either of thesecasesit is interesting to ask how the
250 S.R. Waxman I Linguistic and conceptualorganization
tone (Waxman and Balaban 1992). Thus, by 9 months of age, labels facilitate
categorization of objects. Moreover, this labeling effect appears to be tied to
language, rather than to auditory stimulation , in general. (But see Roberts
and Jacob 1991, for a different view.)
However, at this point in development, the data do not support the claim
that infants make systematic distinctions among words from various form
classes. The data from our laboratory reveal that at 12 months, infants tend
to interpret most words, independent of their syntactic status, as referring to
objects or categoriesof objects. Therefore, prior to the onset of the naming
explosion, there appears to be a general (and possibly universal) linkage
betweenwords (not specifically count nouns) and object categories.
As the naming explosion draws to a close, and as infants begin to
distinguish among the linguistic form classes(e.g., count nouns, mass nouns,
adjectives, verbs) in their own language production and comprehension
( Bloom 1990, Gordon 1985, McPherson 1991, Prasada 1993, Valian 1986),
infants probably begin to consider syntactic form class as relevant to determining
a novel word ' s meaning. By two to three years of age, children begin
to reveal an appreciation of specific linkages between particular linguistic
forms and particular types of meaning.
For example, English-speakingchildren expect that object categorieswill be
marked linguistically by count nouns (Brown 1957, Markman and Wachtel
1988, Wa~man 1990, Waxman and Kosowski 1990, Taylor and Gelman 1989,
Waxman and Senghas1991), that substanceswill be marked by mass nouns
(Dickins.on 1988, Sola et ale 1991), that individuals will be marked by proper
nouns (Katz et ale 1974, Gelman and Taylor 1984, Hall 1992), and that
various properties (e.g., size, color , temperament) will be marked by modifiers
(Hall et ale 1993, Markman and Wachtel 1988, Waxman 1990, Taylor and
Gelman 1988).
I t is interesting to note that even at this point , when children are clearly
capable of using syntactic information as a cue to meaning, they do not do so
invariably . Instead, the tendency to use syntactic information is modified
considerably by the child ' s existing lexical and conceptual knowledge (Au
1990, Banigan and Mervis 1988, Callanan 1985, Chi 1983, Mervis 1984,
Mervis and Mervis 1988, Waxman et. ale 1991; Hall et ale 1993) . Children' s
interpretation of a novel word depends, at least in part, upon whether or not
they already have an existing label for the referent object. If the object is
familiar (that is, if children have already acquired a count noun label for the
object), then they use syntactic information as a guide in interpreting the
meaning of subsequentwords applied to that object. For example, if a child is
S.R. Waxman / linguistic and conceptualorganization
taught a new noun for a familiar object (e.g., a dog), the child exhibits a
strong tendencyto interpret the word as referring to an object category that is
subordinate to (e.g., collie), superordinate to (e.g., mammal), or overlapping
with (e.g., household pet) the familiar basic level category (Taylor and
Gelman 1989, Waxman and Senghas 1992) . For a new adjective, children
tend to interpret the word as referring to a salient property , substanceor part
of the object (Hall et al. 1993, Markman and Wachtel 1988) .
However, if the object is unfamiliar (that is, if children have not yet
acquired a count noun for the object), they tend to rely upon an earlier
pattern of behavior; they tend to interpret any word applied to that object
(be it a count noun, proper noun, or adjective), as referring to an object
category, typically at the basic level (Hall 1992; Hall et al. 1993, Markman
and Wachtel 1988) . Thus, children are attentive to syntactic form in ascribing
meaning only after a count noun has beenassignedto that object and to other
membersor its kind .
8. Summaryandconclusion
ment. Of course, linkages like the ones described here cannot tell the entire
developmental story, for children do not learn meaning on the basis of
syntactic context alone. Additional researchwith preverbal infants and with
children learning diverse languages will further clarify when these various
linkages betweenlinguistic and conceptual developmentemerge, how they are
modified by linguistic input , and how they are modulated within the context
of the child ' s existing fund of knowledge.
References
Anglin, J.M., 1977. Word, object, and conceptualdevelopment . New York : Norton.
Au, T.K., 1990.Children's useof informationin word learning. Journalof Child Language17,
393- 416.
Baldwin, D.A. and E.M. Markman, 1989. Establishingword-object relations: A first step.
Child Development60, 381- 398.
'
Banigan , R.L. and C.B. Mervis, 1988. Role of adult input in young childrens category
evolution. II : An experimentalstudy. Journalof Child LanguageIS, 493 504. -
Bauer, P.J. and J. M. Mandler, 1989.Taxonomiesand triads: Conceptualorganizationin one-
and two-year-olds. CognitivePsychology4, 100- 110.
Benedict , H., 1979. Early lexical development : Comprehensionand production. Journal of
Child Language6, 183- 200.
Berlin, B.. .D. Breedloveand P. Raven, 1973. Generalprinciplesof classificationand nomenclature
in folk biology. AmericanAnthropologist75, 214- 242.
Bloom, P., 1990. Syntacticdistinctionsin child language . Journalof Child Language17, 343-
355.
Bornstein . M.H.. 1984 . A descriptivetaxonomyof psychological categoriesusedby infants. In:
C. Sophian(ed.), Originsof cognitiveskills, 303- 338. Hillsdale, NJ: Erlbaum.
Brown. R., 1957 . Linguisticdeterminismand the part of speech . Journalof Abnormaland Social
Psychology 55, 1- 5.
Brown. R., 1958 . Wordsand things. Glencoe , IL : The FreePress .
Bruner, J.S., J.J. Goodnowand G.A. Austin, 1956 . A studyof thinking, New York: Wiley.
Callanan , M.A., 1985 . How parentslabelobjectsfor youngchildren: The role of input in the
acquisitionof categoryhierarchies . Child Development 56, 508- 523.
Carey , S., 1978 . The child as word learner . In: M. Halle , J. Bresnan, G.A. Miller (eds.),
Linguistictheoryand psychological reality. Cambridge , MA : MIT Press .
Carey, S., 1990 . On somerelationsbetweenthedescriptionand the explanationof developmental
change . In: G. Butterworth, P. Bryant (eds.), Causesof development : Interdisciplinary
perspectives , 135- 157. New York: HarvesterWheatsheaf .
Carey, S. and E. Bartlett, 1978 . Acquiring a singlenew word. Papersand Reportson Child
LanguageDevelopment (Department of Linguistics , StanfordUniversity) IS, 17- 29.
Chi, M.T.H.. 1983 . Knowledge -derivedcategorizationin youngchildren. In: D.R. Rogers . J.A.
Sloboda(eds.), The acquisitionof symbolicskill, 327- 332. New York: PlenumPress .
Chomsky , N., 1986 . Knowledgeof language : Its nature, origin, and use. Westport, CT: Praeger .
254 S. R. Waxman I Linguistic and conceptualorK,anizatioll
Barbara Landau
of CognitiveScience
Department , Universityof California, Irvine, CA 92717
, USA
what they perceive and know. Of chief importance is their perception and
knowledge of objects, the motions they undergo, and the locations they
occupy.
But how does perceiving or knowing about objects help in learning
language? As Quine ( 1960) argued, the novice analyst hearing an utterance
might be free to conjecture any of an infinite number of hypothesesabout its
meaning, each compatible with what he observes. Such rampant indetenni-
nacy would , of course, prevent learners from converging on the correct
meaning of a word. However, human infants do not seemto find themselves
mired in such an impossible learning situation ; quite the contrary, they
manage to map many words onto coherent meanings in a relatively short
period of time. Accounting for this phenomenon in tenDS of constraints on
learning - both linguistic and non-linguistic - is currently a major challenge
for both linguists and psychologists.
One sourCeof constraints is the human spatial- cognitive system. Recent
non-linguistic investigations have shown that the spatial systemcontains two
'
separate components: one dedicated to representing objects (the what sys-
' '
tem' ) and the other to representinglocations (the where system ; Ungerleider
and Mishkin 1982) . The two components exhibit quite different properties,
perhaps not surprising if one evolved to represent objects, the other, places.
What is intriguing about each component, however, is that broad properties
of each appear to converge with the differences in the ways that languages
expressobjects and locations (Landau and Jackendoff 1993) .
Theseconvergencescould impose significant constraints on learning. For if
the design of the ' what' and ' where' system engagesonly certain geometric
representations, then only thesewill be available for mapping onto language.
Learners possessingsuch a representational system would not be subject to
'
Quine s radical indeterminacy; rather, they would approach language learning
equipped with strong blases to consider only certain properties of the
world as ' relevant' to naming objects vs. places.
In arguing for such constraints, severallines of evidenceare required. First ,
there must be differences in the way that languagesuniversally encode the
two onto logical categories of object and place. Second, these linguistic
universals should show properties that are similar to those found in nonlinguistic
' ' ' '
representations of what and where . Finally , there must be evidence
that children learning names for objects and places generalizeon the
basis of different geometric properties in the two cases- that , essentially, the
problem of learning names for objects and places is bifurcated, with each
domain drawing on quite different kinds of representations.
B. Landau 's whatandwhat's where 261
I Where
object name) together with the sight of an object (Gillette and Gleitman,
forthcoming) . Given only a small group of nouns, the remaining categoriesin
the syntactic configuration may be deduciblefrom internal linguistic principles
(Grimshaw, this volume).
In either case, it seemsreasonable to assumethat children also approach
language learning with certain expectations about the canonical mappings
betweenthe major onto logical categoriesand the syntactic categories(Grim -
shaw 1981) . For example, OBJECT is mapped canonically onto N , and
PLACE - more properly , spatial-locative-functions - onto Preposition, PP, or
other argument-taking categories.!
Once the learner has come to know how objects and placesare syntactically
expressed in his or her language (in English, by count nouns and PPs
respectively), he or shecan infer that a novel lexical item falls into the domain
of either object or place. Now , from the different structural properties of the
' what' and ' where'
system, the relevant geometric generalizations should
automatically fall out.
Given this view, the syntactic properties of a novel word can provide the
learner with quite coarse semantic distinctions (e.g. the major onto logical
' '
categoriesand broad semantic classes ) . The remainder of the learning of a
word must be discoveredfrom sourcesoutside of syntax2 (seeGrimshaw, this
volume; Fisher et al., this volume; Pinker, this volume) . Our interest is in
how the spatial-cognitive system might further pare down the hypothesis
spacefollowing the broad division into object vs. place.
Given that words for objects and places are among the earliest acquired,
one might wonder whether a general, all -purpose representationunderlies the
earliest learning in both domains. Perhaps human learners are designed to
detectjust the samesetsof properties whether attending to an object (and its
name) or the object' s location (and its name) . This kind of representation
would keep track of a compound - an object' s identity together with its
location. Such a relatively unbiased device could definitely learn to represent
objects and places separately (see Rueckl et al. 1988 for a demonstration
using connectionist modelling) . The question, however, is whether this is the
way that young learners do represent objects and their locations under
different form-classinstructions, i.e., when they are being told the name of a
novel object vs. a novel pla~ .
In a seriesof experiments, we asked whether young children could draw on
qualitatively different geometric representations acorss the two domains of
object and place (Landau and Stecker 1990) . In order to induce these two
kinds of representation, we showed 3-year-olds and adults a novel object
' '
being placed on top of a box (the standard location), and simultaneously
presentedthem with a novel word occurring in one of two different syntactic
contexts. A count noun context was meant to induce a representation of the
object itself, while a preposition context was meant to induce a representation
of the place where the object was located.
In the count noun context, subjectsheard ' Seethis? This is a corp' . In the
' '
preposition context, subjects heard See this? This is acorp my box . The
entire display then was set aside, and subjects saw each of three different
shapedobjects - one identical to the standard plus two shapechanges- being
placed in each of five different locations on and around a second box (see
264 B. Landau I Where's what and what's where
figure I for objects and standard location) . Each time subjects observed an
' '
object being placed on the secondbox, they were asked either Is this a corp?
' '
or Is this acorp your box? (matching their initial instructions) . We asked
whether subjectswould attend to different properties of the array in the two
contexts.
uu .r
Thisis a corp. No No
~ I I
' '
Fig . 1. Objects and array used in spatial preposition experiment. In this experiment, a standard
' '
object was placed in a standard position (as shown) on the upper surfaceof a box. Subjectswere
"
told either " This is a corp" (count noun condition ) or "This is acorp my box ( preposition
condition .) Then each of the objects was placed in each of several positions on and around the
" "
box, and subjects were asked either " Is this a corp?" or Is this acorp your box? (with the test
question matching the initial instruction ) . The responses showed that in the count noun
condition , subjects acceptedonly the standard shape and rejected shapesthat differed from the
standard. In contrast, in the preposition condition , subjects accepted all shapes, apparently
considering object shapeirrelevant to decisionsabout location.
'
The results showed that both children and adults attended to the object s
shapein the count noun condition , accepting the original shape but rejecting
all shape changes. In the preposition condition , however, subjects ignored
shape, accepting all three shapesequally. In complementary fashion, subjects
'
ignored the object s position in the count noun context, but attended to it
closely in the preposition condition . Children accepted all positions on the
upper surface of the box, while adults were more conservative, primarily
accepting the standard position .
'
B. Landau I Wheres what and what's where
-
.
Thisis a corp.
Fig . 2. Objects and array used in ~ nd spatial preposition experiment. This experiment was
' '
identical to the first , except for the shapesof the standard and test objects, and the positions
that were tested (see text.) As in the first experiment, subjects in the count noun condition
' '
attended to shape, aa: epting only the standard (the straight rod.) In the preposition condition ,
' linear' - -
subjecg a<x:epted the two objects objects whose principal axis could span the box and
rejected the block. This suggests that subjects in the preposition condition schematizedthe test
objects, focussingonly on the existenceof a principal axis of a pa~ cular extent. The results of the
two experimentsshow that detailed object shapematters for the extension of object count nouns,
but that schematicobject shapematters for the extension of spatial prepositions.
In contrast, when asked to generalizea new place term, they either ignored
' ' '
the object s shape altogether - representing it essentially as a blob - or did
something in between, representing just one component of shape as if the
' '
target object were a linear thing .
Why did young children generalizeso differently in the context of a count
noun vs. a preposition? Clearly, we cannot appeal to an 1;inbiased Quinean
' '
similarity spacewhich dictateswhat will be salient in any scene, for thesewere
identical in the two situations. The only differencebetweenthe conditions was
the syntactic context in which the novel word was introduced. Children must
have assumedthat the novel count noun referred to the object being placed
on the box - in this case, an object of a specific shape (to be explored in
section 3) . And they must have assumedthat the novel preposition referred to
the place where the object was located. Syntactic context thus directed
' '
B. Landau I Wheres what and what s where 267
Count nouns such as dog , apple , chair name categories of objects - sets of
objects that cohere under some criteria that can be used to generalize the
word to new instances. The nature of these criteria is as yet unclear , but often
has been assumed to include some set of perceptual or conceptual features
united by an underlying theory (Carey 1982, Arm strong et al . 1983, Murphy
and Medin 1985) . Such theories would specify not only which object properties
are critical for different named categories , but why the same property
can assume different weighting for different categories - for example , why
feathers are more critical to the description of birds than to the description of
hats.3
3 Therecentemphasis on the importanceof suchunderlyingtheoriesin conceptacquisitionand
word learningoftenhasbeenviewedasan alternativeto the moretraditional' perception -driven'
approach . For example , it has been argued that young children use their knowledgeof
ontological categoriesto constraindecisionsabout word meaning , therebyallowing them to
' '
overrideperceptualinformation(Kei11989 , Solaet al. 1991). This either- or approachto word
learningseemsmisguided , however , for althoughsuchknowledgecan be important in certain
kindsof decisionsaboutcategorymembership , perceptualinfonnationalsois likely to play an
importantrole, especiallyamongyoungchildren(Landauet aI. 1988 , Landauet aI. 1992 , Smith
1989 .)
For example , youngchildrenmay not possess the samekinds of rich theoreticalsystemsas
adults(Carey 1985 , Keil 1989 .) Careyhas arguedthat children's theoriesof ' living thing' are
shallowerand qualitativelydifferentfrom thosecontainedin maturebiologicaltheories . Evenif
childrendo havetheoriesof objectkind that enterinto their classification (henceobjectnaming),
perceptualappearance of objectsis boundto be a critical factor in initial objectsorts, as it is for
adults in many contexts(Bosterand Johnson1989 , Landau 1982 , Murphy and Wright 1984 ,
268 B. Landau I Where's what and what's where
L .B.Smith 1989, Smith and Medin 1982.) Most important , the existence of theories does not
logically preclude the use of perceptual information in word learning. Presumably, our theories
have evolved in part to make senseof what we observe.
4 Jackendoff 1983, 1992 has '
( ) suggestedthat an object s name can be extendedto many possible
referentsX , including ' depiction of X ' . This would account for the naturalnessof referring to the
' ' '
sculpture as clothespin even though we all know it isn t really one. Jackendoff suggeststhat the
distinction betweenreal referentsand depictions may be critical for binding theory. I thank Paul
Bloom for pointing out the relevanceof Jackendoff' s treatment, and Robert Remezfor providing
an extensivelist of Oldenburg' s work that exemplifies this problem.
' '
B. Landau I Wheres what and what s where 269
When a child hears an object labelled with a novel count noun, he facesat
least a circumscribed version of Quine' s problem : He must decide which of
the properties he perceivesare relevant to generalizing the new word. Which
' '
properties of the whole object (Markman , this volume) will the learner select
as the basis for generalization?
In a first seriesof experiments, Landau et al. ( 1988) investigated 2- and 3-
' '
year-olds and adults representation of concrete 3-dimensional objects. We
askedwhether children and adults would tend to generalizea new count noun
'
on the basis of an object s shape, its size, or its texture, and whether any such
blaseswere due to generalizedperceptual preferencesrather than the task of
learning a new word.
We showed subjects a single object - for example, a 2" blue wooden Ushaped
' '
thing - and labelled it by saying, Seethis? This is a dax . This object -
the standard - was placed aside but still in view and subjects were shown a
series of objects, one at a time, which were either size, texture, or shape
changesfrom the standard. These changes ranged from relatively minor or
moderate changestested in a first experiment (e.g., ~2" larger for the minor
sizechange or a slightly bent leg for the shapechange) to either moderate or
"
quite extremechangestested in a secondexperiment (e.g., 24 for the extreme
sizechangeand a completely rotated leg for the shapechange; seefigure 3 for
! ' '
examples ) . For each object, subjectswere asked Is this a dax?
The results are shown schematically in figure 3. Subjects of all ages
acceptedthe size and texture changesat ceiling levels, but rejected the shape
changes. For the minor and moderate magnitudes of change in the first
experiment, adults acceptedall size and texture changesbut rejectedall shape
changes. Two -and three-year-olds showed the same pattern but probabilisti -
cally rather than categorically.
The
(Landa
~ Bias
Shape
,and
Smith ,1988
)
Jones
YES ~ ! !
[1:0
St8Dd8Id
(2" woodeD)
~.x.:.
CaIh Us,.. U
ClIck
- Wile
~
nn
N 0
[ b ?
L . ~ . : ~ : J
Fig . 3. The shapebias in early lexical learning. When children or adults bear a novel object (the
' standard' named with a novel count
) noun, they tend to generalizethat noun on the basis of the
's and not on its size or texture.
object shape ,
the ' shape bias' - appears by age 2 in a relatively fragile form , is stable by
age 3, and reachesadult strength by age 5 ( Landau et ale 1992a) .
Does the shapebias reflect a general, indiscriminateperceptualpreferencefor
grouping same-shapedobjectstogether? One might supposethat children generalize
a count noun on the basisof shapejust becausethe objectswith the same
shapeas the standardlook more like the standardthan objectswith the samesize
or the sametexture. Evidencefor this interpretation would be a shapebias across
a variety of object matching tasks, whether they involve an object' s nameor not.
To assessthis possibility, Landau et ale ( 1988) presented2- and 3-year-olds
with the same standard as before, but did not name the object. Children' s
attention was drawn to the standard by pointing and saying ' Seethis?' Then
the object was placed aside as before, and the test objects were brought out in
' '
pairs. Children were now asked Which one goes with this? (requesting a
' '
match to the standard) . The expression goes with is, of course, ambiguous;
that is, one object could ' go with ' another on many different bases. Previous
experiments have shown that children who are asked this question might
group on a number of different bases, including various dimensions of
perceptual similarity , magnitude of similarity , or thematic relationship with
another object (Smith 1989, Markman and Hutchinson 1984, Inhelder and
Piaget 1964). The question was whether children would still group on the
basis of shape given that there are many possible bases for similarity
grouping.
Although children had shown a shape bias in the count noun condition ,
this time they showed either a weak shape bias or none at all. Two -year-olds
were as likely to pick the most extreme size change (which had the same
shapeand texture as the standard) as the most extreme shapechange (which
was identical to the standard in size and texture) . Three-year-olds showed a
weak shapebias, nowhere near as strong as in the count noun condition .
Thus, by age 3, the shapebias is elicited reliably and strongly in the context
of object naming. During the early years of languagelearning, the perceptual
' '
space has already become biased - stretched towards emphasizing different
dimensions in different contexts. The context of object naming draws on a
perceptual spacewhere similarity of object shapeis critical .
Further researchhas confirmed that the shape bias is extremely robust in
the context of object naming, that is, under count noun instructions. It
appears in 3-year-olds and adults when the objects are three-dimensional
(Landau et ale 1988, Au and Markman 1987), or two-dimensional ( Landau et
ale 1992b, 1993; Baldwin 1993) ; when the object' s texture or coloration is so
salient that children are drawn to those dimensions if asked for non-linguistic
's whatandwhat's where
272 I Where
B. Landau
similarity judgments (Smith et al. 1992); and when cues to animacy (eyes)
'
also pull children s attention to the texture or substanceof objects (Jones et
al. 1991) . A shapebias has been replicated in other labs and with school age
children (Becker and Ward 1991).
In contrast, the shape bias is weakenedor completely suppressedin other
syntactic contexts, even though the domain in question still concerns objects
(Landau et al. 1992a, Smith et al. 1992) . For example, we found that when
'
objects were introduced as membersof a more inclusive category (i.e., This is
'
a kind of dax ), young children and adults showed only a weak preferencefor
same shape, and no strong preferencefor either size or texture. This makes
sense, as membersof more inclusive object categoriesare often lesssimilar in
shape (and other dimensions) than members of their subsets. (For example,
members of the animal category are less similar to each other than members
of the dog category; and dogs are lesssimilar to each other than Dalmatians) .
We also found that when objects were introduced to learners using an
' '
adjectival context ( This is a daxy one), they showed no preferenceat all for
same shape, but rather, a strong preferencefor samesurface texture - which
is encoded by adjectivesin English and other languages(Schachter 1985; see
also Waxman, this volume).
The results of both syntactic manipulations confirm the notion that the
shape bias is confined to the count noun context. This provides further '
evidencefor the importance of syntactic context in directing young children s
interpretations of novel words to different parts of the object representation
system.
The existence of a shape bias should certainly aid the child in learning
new object names and generalizing them to new instances. However , if the
bias were too rigid , children would fail to generalize in cases where they
should. Although many artifacts are rigid , others regularly undergo transformation
of shape - changes in their configuration - because they are
jointed (e.g. tools , vehicles) . Still other objects, especially natural kinds ,
6
undergo changes of configuration as they move, grow , are squashed, etc.
6 I omit from this discussionthe importanceof perspective transformations , to which young
language learners are surely .
sensitive Evidence shows that infants are sensitive to shape
constancy (see Banks and Salapatek 1983). I that
assume 2-year- olds will generalizea novelname
' '
to different perspectivetransformationsof an object (although not necessarilyaccidental
perspectives, i.e. thosethat obtain from unusualviewingperspectives ).
B. Landau / Where's what and what's where 273
Does a bias for same shape mean that shape is preserved in a static
. '
snapshot -like fashion , corresponding to the configuration shown by an
object in just one pose? Or is the bias for same shape responsive to the
different kinds of shape transformations a given object might normally
undergo?
The objects used by Landau et al. ( 1988) all were made of wood and
most had hard straight edges and sharp angles, suggesting the rigidity
characteristic of artifacts . Thus the strong shape bias among children
might have been due to their understanding that objects possessingthese
properties could not undergo shape transformations . Such an understanding
could stem from perceptually -based or knowledge-based causes.
Perceptual processes might specify the kinds of transformations normally
expected from objects possessing properties such as straight edges and
sharp angles. Knowledge about different object kinds might also affect
judgments ; for example, knowing that artifacts normally do not undergo
massive shape transformations might lead the child to reject shape
changes. And these perception -driven and knowledge-based causes might
well interact .
To determine whether perceptual causescould modulate children' s judgments
about named objects, Landau et al. ( 1992b, 1993) first sought to vary
suggestedmalleability by purely perceptual means. Using the same overall
method .as in previous studies, we first created outline drawings of the
original shapes, including the standard plus the three shapechangesand three
new size changes(seefigure 4A for the shape changes). Three-year-olds and
adults who were tested using thesestimuli showed a strong shapebias, just as
before.
In a second series, we tried to suggestmalleability by drawing the same
shapes but with curved contours (figure 48 ) . Contour curvature generally
correlates with malleability in the physical world , and the question was
whether this change would suggesta greater potential for transformation to
the children. In a third series, we added another cue to malleability by
imposing wrinkles on the curved contours (figure 4C) .
In the fourth and final series, we added a single perceptual property that
powerfully suggestsa particular onto logical kind . Following Jones et al.
( 1991), we drew eyeson the wrinkled curved drawings (figure 4D ) . Although
eyes are a perceptual property , they clearly are powerful in suggesting
animate beings. In fact, Jones et al. found that when eyeswere placed on the
original wooden objects, 3-year-olds generalized on the basis of same shape
' '
274 B. Landau I Wheres what and what s where
Staadard !
Shape Sbape2 S
Shape
C1: O~ M
QD ~ ~
Q:D~ ~
aD~ ~
Fig. 4A-D. Drawingsusedin studiesof rigidity, malleability, objectkind, and naming. The first
the ' standard'
objectin eachrow represents objectthat wasnamedwith a countnoun. Subjectsin
eachof four experiments saweitherA , B, C, or D. Testitemsincludedthe corresponding shape
changesin eachrow. Objectsin the first row possess straightedgesand sharpcomers. Thosein
the second , third, and fourth rows vary the basicshapeso as to suggestmalleabilityby curves ,
'
wrinkles, andeyes . Subjects
, respectively judgmentsof whichshapechanges belongin the named
categoryvariedconsider ablyin ~ rd with suggested malleability.
Subjectsshowed a strong shape bias for the drawn versions of the original
stimuli (4A ) - the ones with straight edges and sharp comers - but this
restriction to the original shape was progressively weakened with the other
manipulations. With the curved contour st subjects accepted some shape
changesand with the wrinkled and curved contour st they accepted slightly
more. The most powerful change occurred with the combination of wrinkled
and curved contours topped by eyes; in this casetsubjectsacceptedall of the
shape changes. There were no age differencesin the overall pattern : 3-year-
olds showed the samedistribution of choices as adults.
Thesefindings make two points. Firstt they show that the shapebias is not a
bias for accepting only objects having precisely the same static configuration.
Children as young as 3 yearsold appreciatethat an objecttsshapeis definedwith
respectto the range of shapetransformationsit can undergo- over transforma-
'
tiont the same shapetwill undergo changesin its configuration. Although the
rangeof acceptableconfigurationsbroadenswith suggestedmalleabilityt it still is
constrainedwithin a clearly delimited range of possibilities. We think it unlikely
that children would have accepteda radical topological transformation (e.g. an
irregulart roundish shape) as an instanceof a dax.
The secondpoint is that perceptualinformation (suchas curvinessor wrinkles
or eyes) can playa rich role in engagingchildrents and adultst knowledge of
different object kinds and their potential for shapetransformation. This information
may concern the objecttsshapeitself (i.e.t its contours) or it may concern
other properties that signal certain categories(e.g.t eyes). Although 3-year-olds
do not have mature biological theoriesof conceptssuch as ' animaltt ' mammaltt
or ' primate' (Johnsonet al. 1992)t perceptualinformation can engageknowledge
of different object kindst allowing young children to make predictions about
what an object will look like over spaceand time. Thesepredictionsserveas an
important foundation for the extensionof object names.
8 To be '
specific, Spelke ( 1990) proposes that infants object perception is guided by four
principles: cohesion, boundedness, rigidity , and no action at a distance. Together, theseprinciples
specify which surfacesbelong to the same object and which belong to separateobjects. Spelke
further proposes that theseprinciples of object perception are deeply related to basic constraints
on the motions of physical bodies: we perceiveobjects according to certain perceptual principles
such as rigidity becausebodies move as they do , i.e. they do not deform as they move.
9 We obviously can construe what we seein terms of shape, but then we would describewhat we
' '
see as a ' splotch of paint or a ' pile of sand , drawing attention to the individuated entity .
' ' or ' ' seeP. Bloom this volume for discussion of related issues.
splotch pile ( , , )
B. Landau I Where's what and what's where 277
child to link two pairs of relationships: object name and object kind on the
one hand, object name and object shape on the other hand. Such a linking
could occur on just one or two exposures(in effect, a kind of ' triggering' in
Fodor ' s, 1981, sense), although the evidencereviewedabove does suggestthat
the shape bias becomes sharper over time. In any case, once this link is
established, children hearing an object' s name would generalizethat name on
the basis of object shape.
The reliance on object shape in extending object names does not entail
that shape is definitional for object kind , nor that it exhausts the possible
cues to kind . Obviously , objects sharing the same shape are sometimes
members of different kinds (e.g. whales and fish), which means that
sometimes a shape bias will lead to incorrect extension of an object name.
However , it is significant that in these cases, children ' s errors may be longlived
and require specific tutoring before they are corrected. Shape may
"
remain at the top of a preference hierarchy even for adults , although
additional computational resources and brute - force knowledge of the
world will allow adults to use other sources of information more easily
than young children , thus leading to more complex modulations of any
shape bias.
The strong reliance on object shape in object naming points to an intricate
' '
system of representation which is specializedfor the whats : object recognition
, object naming. This specialization does not appear to be engagedin the
systemunderlying the languageof places.
Like names for objects, names for places emerge quite early in language
learning: Among the earliest words acquired by children learning English are
up and down, in and out. By the age of two and a half , children have a
respectablestock of spatial prepositions and by age four , they have learned
even some of the most complex English prepositions, such as in front of and
behind (Johnston and Slobin 1978, Tanz 1980). The early acquisition of
spatial terms equivalent to English prepositions has been documented among
children learning languages other than English (Gentner 1982, Bowerman
1991, Johnston and Slobin 1978, Choi and Bowerman 1991), as well as
among children whose learning experienceis clearly different from normal -
those born deaf (Supalla 1988, Feldman et al. 1978) or blind ( Landau and
Gleitman 1985, Mulford 1985) .
' '
278 B. Landau I Wheres what and what s where
Given that spatial locational terms are acquired early and with little
difficulty , the question arises how learners come to know just what spatial
properties of the world are relevant to these terms. Having just seen that
object shape is a privileged property for learning object names, one might
supposethat learnersshow a shapebias even when learning namesfor places.
For example, lerners might assume that something can be in only those
objects that are shapedlike a box or bottle. Yet cross-linguistic study of place
expressionsrevealsthat locationallanguage does not , in general, preservethe
details of object shape. What , then, is the structure of spatiallocational terms
that renders them so easy to learn? And how precisely do young learners
acquire this structure?
can easily be suitable arguments for in. This fact has led many to emphasize
that spatial language draws on ' schematic' representations - that what
matters is one' s mental representation of an object, and not necessarilyits
physical properties (Talmy 1983, Herskovits 1986).
Given the geometric restrictions on prepositions, violations (usually, grossly
unsuitable arguments) result in anomaly. For example, since in requires an
enclosingareaor volume, a sentencesuchas ( la ) is permissiblewhile ( 1b) is not.
As a more complicated case, the English words across, along, and around
seem to require that the figure object be construable as having a principal
linear axis . For example ,
is fine , because the snake has a clear principal axis which can either intersect
( for across) or lie parallel to (for along) the road . However ,
is anomalous, except on the reading that the ball lies on some path leading
from the observer ' s viewpoint to some point on the other side of the road .
at most preserving quite restricted aspectsof object shape such as the three
orthogonal sets of axescorresponding to up/ down, front / back, and right/ left
(Talmy 1983, Landau and Jackendoff 1993) . Objects are not representedby
precise metric descriptions of shape, even though shape is critical to the
representation of objects as category members (but see section 4.4 for some
possible exceptions) .
For example, as we noted above, the preposition in must be used with a
' '
referenceobject that can be potentially construed as a container , e.g. in a
bowl , in a carton, or even in a tree, where the tree is an enclosing volume.
Although what is selectedby the preposition is a geometric volumecommon
to all three acceptable objects, there clearly are enormous shape differences
among bowls, cartons, and trees which are critical in assigning these objects
to different named categories. Similarly, on selectsfor a referenceobject that
can be thought of as a surface, e.g. on my nose, on the field, on the earth, but
not . on the hole. Again , each of thesecan function as a surface, though their
individual shapeparametersdiffer greatly.
Other than volume and surface, the main elements of shape that are
'
preservedconcern the object s three setsof axes. Some terms require that the
reference object be construed as having at least one directed axis, either
vertical or horizontal (up, down, above, below) . Others call for the reference
object to be construed as having two orthogonal axes in the horizontal plane
(in front of, behind, beside). Both of these sets of terms leave the geometry of
the figure object completely unspecified, while the referenceobject can vary
indefinitely in shape as long as its axes are preserved. Terms like acrossand
along are among the most geometrically complex prepositions in English, as
they require a linear axis for both figure and referenceobjects.
Drawing on thesegeometries, only a relatively small set of region types are
expressed. At the most general level, all prepositions appear to describe
regions that preservenon-metric properties; for example, there is no spatial
preposition that describesregions extendingjust 48 degreesto the right of an
"
object or 2 around its border. Rather, prepositions representproperties such
as containment (in, with the referenceobject a volume), support or attachment
(on, with the reference object a surface), intersection (across, with
reference object an axis), co-linearity (along, the reference object again an
axis), and relative proximity (near, in front of, the reference object either a
point or a set of axes) .
Landau and Jackendoff ( 1993) suggest that these properties may factor
into several levels of direction and distance. The direction component relies
on analysis of the reference object into the three sets of orthogonal axes
B. Landau I Where's what and what's where
describedabove (up/ down, front / behind, right/ left ) . The distance component
chunks continuous space into a small set of discrete categories which vary
somewhat from language to language. In English there are four such levels:
(a) internal to the reference object (e.g. in, inside), (b) in contact with the
object (e.g., on), (c) proximal to it (e.g. near, in front oj) , (d) distal from it
(e.g. far and beyond) .
Thus, the geometric system for representingfigure and referenceobjects is
rather tight and rather ' sparse'. What do these restrictive geometric descriptions
portend for the languagelearner? On the one hand, strict limitations in
the range of object geometriesmight be a positive force in learning: If the
representations are limited , there is less to learn. On the other hand, the
object geometriesthat are relevant to prepositional meanings might create a
serious problem for the learner, for these geometries are qualitatively different
from those used when naming objects. Although detailed object shapeis
relevant when learning an object' s name, most or all of an object' s shape is
irrelevant when learning the meaning of a spatial preposition.
Given the salience of object shape in object naming, one might suppose
that children would require considerable time to learn to ignore object shape
in the context of prepositions. For example, children might find it easiestto
learn terms drawing on richer shape representations (like across or along)
than sparser ones. Or , children' s earliest use of terms with sparse shape
descriptions might occur only with referenceobjects of a certain shape: The
word in might be used only with objects that are canonical ' containers' - fully
closing, opaque-sided objects such as boxes and drawers.
To my knowledge, there are no reports describing such patterns among
young children learning spatial prepositions. What evidence does exist on
early usage suggeststhe opposite. For example, experimental studies on the
cross-linguistic order of acquisition of spatial prepositions suggestthat terms
requiring objects with very little geometric restriction (e.g. near, on) are
acquired before those requiring objects with more complex restrictions (e.g. in
front of, across, seeJohnston and Slobin 1978) .
Similarly, evidence from spontaneous speechsuggeststhat children represent
objects schematically- that is, preserving little shapeinformation - when
using them as the arguments of prepositions. Landau et al. ( 1990) analyzed
'
young children s spontaneoususes of prepositional phrases headed by in or
on, asking what kinds of objects served as the reference objects for these
' '
282 B. Landau / Wheres what and what s where
If children ignore object shape when learning place tenDs, when and how
do they come to representan object in tenDs of its axes, as would be critical
for learning tenDs such as up, down, top, bottom, in front of, behind, and
across? Much of the literature on acquisition order for spatial prepositions
(and comparable tenDs across languages) suggeststhat the axial system is
acquired rather late. For example, Johnston and globin ( 1985) reported that
terms such as in front of and behind are mastered substantially later than
terms such as in or on. However, the conditions for application of in front of
and behindare rather complex, involving more than an axial representationof
the referenceobject. For example, one must understand the particular pattern
of deictic usageof one' s language- for instance, which portion of the object' s
axis is mapped onto each tenD (see, e.g. H . Clark 1973, Herskovits 1986) .
If we confine ourselves to the question of when children are able to
representobjects in tenDS of their axes - regardlessof whether the particular
mapping for individual spatial tenDS is correct - we find that children from
age 2 on can represent the axial system. One piece of evidence comes from
Tanz ( 1980), who asked children from 2Y2 to 5 years of age to place objects
' '
284 B. Landau I Wheres what and what s where
' in front of ' ' in back of ' and ' at the side of ' reference
, objects. Tanz found a
great many errors through age 4. However , despite the errors, 96% of the
' '
placementswere made at the cardinal directions: in line with the two axes
that divide the referenceobject, front from back and side from side.
Additional evidencecomes from some current work in our laboratory on
'
the nature of young children s regions mapped to in front of In three studies,
we have shown 3-year-olds, 5-year-olds, and adults an array with a single
'
reference object, and have asked them to judge when a small object is in
front of ' it . In one study, the reference object is V -shaped - essentially, a
' '
largish version of the dax we have used in our object naming studies. This
object has a clear vertical axis, and when it is placed horizontally on a table,
it has a clear principal axis that runs from front to back, at least for adults. In
another study, the referenceobject is round , lacking clear axes. In the third
study, the referenceobject is still round , but now is adorned with plastic eyes
'
and a small tail , rendering clear the location of this object s principal axis.
' '
Adults , not surprisingly, tend to judge one object in front of the reference
object when it falls within a quadrant surrounding the half-axis closest to the
viewer. Five-year-olds do the same. Three-year-olds - who might be least
'
likely to understand an object s axial system - perform randomly when
judging what is in front of the round unmarked object. However, they follow
the samepattern as older children and adults when judging what is in front of
either the V -shapedobject or the round object with eyesand tail . When they
'
perform differently , they follow the reference object s projected axis more
closely than either five-year-olds or adults. For example, some of them judge
that only objects falling exactly on the relevant half axis are in front of the
referenceobject.
A last piece of evidencefor early representation of object axescomes from
an experiment by Landau ( 1991) on the understanding of spatial part terms
(top, bottom, front , back, and side) in 3- to 4- year-olds, one of whom was
congenitally blind . Children were shown novel objects, and were specifically
told that one region (indicated by the experimenter) was either the top ,
bottom , front , back, or side of the object. Children now had to identify the
regions corresponding to the remaining four terms. Correct responding would
have required the children to identify the three major axes, and anchor them
with the single term provided by the experimenter.
In order to determine whether the terms corresponded to only the canoni-
cal position of an object, children were shown regions in each of two
conditions : In the Canonical condition , the region was designated as if the
object was upright - for example, top was the uppermost region, bottom the
B. Landau I Where's what and what's where
'
depend on analysis of the object s shape: The butt of the waterpot is where it
'
is in virtue of the object s axial system, not its rotund shape.
The second system uses locational predicates that appear to describe the
location of objects with particular shapes. For example, according to Brown,
' '
pachal describesa wide-mouthed container canonically sitting , waxal describes
' '
a tall oblong-shapedobject canonically standing , lechel describesa wide flat
object lying flat , pakal describesa blob-type object with a clear flat surface
' '
lying face-down . Again , English does share some of these restrictions in its
verb system: Stand usually is restricted to objects in their vertical orientation ,
lie to objects in their horizontal orientation , and numerous denominal verbs
(to bottle, to house, to jar ) incorporate specific object categories.
Is the Tzeltal system an exception to the sparserepresentation of shape in
place naming? It is difficult to tell , since the items in question are verbs,
lexical items that can incorporate all kinds of information . But even Tzeltal
locational verbs do not appear to incorporate object-shapedistinctions as fine
as the vocabulary of object nouns in that language or any other. In fact, the
Tzeltal shape distinctions seem to fit with the kinds of distinctions made by
classifier systems in other languages (Allan 1973.) Shape classifiers group
' ' ' ' ' - - '
objects in terms of such parametersas lump , flat , and long thin thing . But
classifiersdo not group objects in terms of specific basic-level (shape-based)
object categories- this is what object namesare for . It remains an interesting
empirical question whether the children learning Tzeltal readily incorporate
the so-called richer elementsof shape in their place expressions.
10 We find it
interesting that children can be resistant to labelling the more unusual looking of
thesewith the word dog. The daughter of a friend insisted for a long time that a Pekinesewas a
cal, presumably in virtue of its flat face.
' '
B. Landau / Wheres what and what s where
This is unlikely to be the whole story. The fundamental question is not why
spatial prepositions filter out detail at all , for all linguistic categoriesfilter as
they must, given the fact that their memberships are finite. Rather, the
fundamental question is why spatial prepositionsfilter as they do, preserving
only certain shape-based qualities. If the requirement were simply to filter -
without further specification - spatial prepositions could representobjects by
gross variation in size, brightness, texture, or any of the other infinite possible
properties.
A telling comparison involves the noun classifiersfound in many languages
such as Japanese, Chinese, and American Sign Language. Noun classifiersare
' '
morphemesthat classify or describethe noun they accompany, and in many
languagesthey are obligatory in certain contexts (such as counting) . They are
closed class, are relatively few in number, and expressjust a few distinctions
among objects. Universally, these distinctions include properties such as
' '
shapeand size of objects (Allan 1973), for example, long and straight thing ,
' small round ' ' flat ' . These
thing , thing properties appear quite similar to the
kinds of geometric properties described in conjunction with figure and
referenceobject and might seemto suggestthat it is the closed class nature of
the prepositions - and not their pertinence to location - that forces them to
draw on the properties they do.
However, there are a number of differencesbetweenthe properties engaged
by noun classifiers vs. spatial prepositions. First , classifiers do not describe
only object geometries. They also often mark animacy, texture or substance,
physical integrity (e.g., broken), arrangement (e.g. pleats, loops), and other
properties that do not seem to appear in expressionsof pure location (see
Allan 1973) . Second, they can be much more specific than preposition
geometries, for example, marking certain function -related categories (e.g. in
Mandarin Chinese, ' handled things' ; in ASL , ' tools' ), basic level or subordinate
' ' ' '
categories(e.g. in Mandarin , book , rifle ), and even unique"honorific object
' '
categoriessuch as horse (Erbaugh 1986). This suggeststhat there is actually
a fair amount of latitude in how closed classelementscan abstract away from
object representations. The particular geometric object properties preservedin
the spatial preposition meaningsdo not follow necessarily(or solely) from the
'
prepositions status as closed class elements.
So, although languagesmust filter out detail from spatial and perceptual
representations, such filtering cannot by itself account for why objects being
named are representedin terms of their detailed shapeswhile objects playing
the role of figure or referenceobject are representedin terms of geometries
such as point , blob , volume, or axes.
B. Landau I Where's what and what's where
shows that damage to the inferior temporal cortex often impairs an animal' s
ability to identify an object (often by its shape) in order to obtain food. In
contrast, damage to the posterior parietal cortex appears to impair the
animal' s ability to identify the location an object occupies, again, in order to
obtain food. In the former case, object recognition is impaired while object
location is spared; in the latter case, the ability to locate an object is impaired
while object recognition is spared.
This general dissociation of ' what ' and ' where' has also received support
from human psychophysical studies and from studies of brain damage in
humans. The psychophysical studies indicate that two separate streams of
visual processing exist from lower visual levels up to higher levels. These
streams appear to segregatethose properties relevant to object identification
(primarily shapeand color ) from those relevant to locating objects (primarily
motion , depth, and location ; see Living stone and Hubel 1989) . The human
clinical evidencealso showsdissociationof object and place through individual
casesin which brain damagecan selectivelyimpair either object recognitionwhile
sparing object localization or the converse(Farah et al. 1988) .
What do these findings imply for our studies of spatial language? We
conjecture that the two kinds of object representation we have uncovered in
studies of language learning may be rooted in the larger design of the spatial
representational system. The reason that detailed shape matters for object
naming is that the count nouns naming objects draw on representationsin the
' what'
system. And the reason that sparse shape - only point , line, volume,
-
etc. matter for object location is that spatial prepositions draw on representations
in the ' where' system, a systemthat does not representobject shapein
detail. We speculatethat the relatively simple shapespecificationsobservedin
the preposition system reveal the extent of detail possible in object descriptions
within the ' where' system. That is, the degreeof filtering seenfor objects
when they play the role of figure or ground is rooted in the fact that the
spatiallocational systemsimply does not , by itself, representobject shapesin
detailed fashion.
This hypothesis is compatible with the view put forth by Talmy ( 1983) in
his explanation of why prepositions - and other closed class elements- have
the sparse structure that they do. Talmy proposes that the structure of the
closed class reflects much of the fundamental structure of cognition ; for
example, that the cognitive system itself represents location in relatively
qualitative (non-metric) terms. The present proposal further points to a
reason why certain members of the closed class - spatial prepositions -
abstract away from shape as they do. We propose that , in part , they do
B. Landau / Where's what and what's where
This lengthy foray into the representation and acquisition of object and
place terms seemsto have taken us far from the question set out in the
beginning: Given the potentially massiveproblems of indeterminacy derived
from learning words ' from observation' , how do children manage to map
many words onto meanings in a relatively short period of time? The work
summarized in this paper suggests that the spatial-cognitive system may
impose significant limitations on the kinds of hypotheseschildren will entertain
about the meaning of a new word for an object vs. a place.
The limitations that I have emphasizedare domain -specific constraints on
the kinds of geometric representations that can be engaged. When learning
a novel word representing an object, learners will view shape as important ,
but not location . When learning a novel word representing an object' s place,
learners will view location as important , and critically , they will selectively
view only certain geometric object properties as r~levant. Some of these
limitations will be set in place during the course of early language learning -
for children learning Korean must focus on somewhat different properties
than children learning English. However, on a more general level, the
fundamental limitations may precede language learning and serve as a
critical boost into the system. Such early predispositions are probably
universal, perhaps reflecting the fact that languagesare designed to be set in
correspondencewith our spatial representations. Presumably, rich specifications
of object shape will be critical in object naming for children learning
all languages. Similarly , schematic specifications of object shape will be
critical in learning locational expressions, regardless of the language to be
learned.
' '
292 B. Landau / Wheres what and what s where
References
Clark. E.V.. 1973 . What's in a word? On thechild' s acquisitionof semantics in his first language.
In: T. E. Moore (ed.), Cognitivedevelopmentand the acquisitionof language , 65- 110. New
York: AcademicPress .
Clark. H., 1973 . Space , time, semantics , and the child. In: T.E. Moore(ed.), Cognitivedevelopment
and the acquisitionof language , 27- 64. New York: AcademicPress .
Choi, S. and M. Bowerman , 1991 . Learningto express motioneventsin Englishand Korean: The
influenceof language -specificlexicalizationpatterns.Cognition41, 83- 122.
Erbaugh , M., 1986 . Takingstock: Thedevelopment of Chinesenounclassifiers historicallyandin
youngchildren. In: C. Craig (ed.), Noun classesand categorization , 399- 436. Amsterdam :
Benjamins .
Farah. M., 1988 . Is visualmemoryreally visual? Overlookedevidencefrom neuropsychology .
Psychological Review95(3), 307- 317.
Farah. M.. K. Hammond , D. Levineand R. Calvanio, 1988 . Visualand spatialmentalimagery:
Dissociable systemsof representation . CognitivePsychology 20, 439- 462.
Feldman , H.. S. Goldin-Meadowand L.R. Gleitman, 1978 . BeyondHerodotus: The creationof
languageby linguisticallydepriveddeaf children. In: A. Locke (ed.), Action, symbol, and
gesture : The emergence of language . New York: AcademicPress .
Fisher, C., H. Gleitmanand L.R. Gleitman, 1991 . On the semanticcontentofsubcategorization
frames . CognitivePsychology 23(3), 331- 392.
Fisher. C.. D.G. Hall, S. Rakowitzand L. Gleitman, 1994 . Whenit is betterto receivethan to
give: Syntacticand conceptualconstraintson vocabularygrowth. Lingua 92, 333- 375 (this
volume).
Fodor. J., 1981 . The presentstatusof the innateness controversy . Representations . Cambridge ,
MA : MIT Press .
Gentner. D.. 1982 . Why nouns are learnedbefore verbs: Linguistic relativity vs. natural
partitioRing. In: S. Kuczaj (ed.), Languagedevelopment : Language , culture, and cognition
301- 334. Hillsdale, NJ: Erlbaum.
Gibson. E and A. Walker, 1984 . Developmentof knowledgeof visual-tactual affordancesof
substance . Child Development 55, 453- 460.
Gilette. J. and L.R. Gleitman, forthcoming . Observationand nounverb learning.
Gleitman, L.R., 1990 . The structuralsourcesof verbmeanings . LanguageAcquisitionI , 3- 55.
Gleitman, L.R., H. Gleitmanand E. Wanner, 1988 . Wherelearningbegins: Initial representations
for languagelearning. In: F. Newmeyer(ed.), Linguistics:The CambridgeSurvey. New
York: CambridgeUniversityPress .
Grimshaw, J.. 1981. Form, function, and the languageacquisition device. In : C. Baker,
J. McCarthy(eds.), The logicalproblemof languageacquisition , 183-210. Cambridge , MA :
MIT Press .
Grimshaw . J., 1994 . Lexicalreconciliation . Lingua92, 411- 429(this volume).
Herskovits , A.. 1986 . Languageand spatialcognition: An interdisciplinarystudyof the prepositions
in English. Cambridge : CambridgeUniversityPress .
Inhelder,B. andJ. Piaget, 1964 . Theearlygrowthof logicin thechild. NewYork: Norton, 1969 .
Jackendoff . R., 1983 . Semantics and Cognition. Cambridge , MA : MIT Press .
Jackendoff , R., 1987 . On beyondzebra: The relation of linguistic and visual information.
Cognition26, 89- 114.
Jackendoff . R., 1992 . Mme. Tussaudmeetsbinding theory. Natural Languageand Linguistic
Theory 10, 1- 31.
' '
294 B. Landau I Wheres what and what s where
McKenzie , B.E., R.H. Day and E. Ihsen, 1984 . Localizationof eventsin space : Young infants
are not alwaysegocentric . BritishJournalof Developmental Psychology 2. 1- 9.
Miller, G. and P. Johnson -Laird. 1976 . Languageand perception . Cambridge . MA : Harvard
University Press .
Mulford, R.. 1985 . First words of the blind child. In: M.D. Smith. J.L. Locke (eds.). The
'
emergentlexicon: The child s developmentof linguisticvocabulary . New York: Academic
Press .
Murphy, G. and D. Medin. 1985 . The role of theoriesin conceptualcoherence . Psychological
Review92, 289- 316.
Murphy. G.L. and J.C. Wright, 1984 . Changesin conceptualstructurewith expertise : Differences
betweenreal world expertsand novices . Journalof ExperimentalPsychology : Learning.
Memory. and Cognition10. 144- 155.
Pinker, S., 1989 . Learnabilityand cognition: The acquisitionof argumentstructure.Cambridge .
MA : MIT Press .
Pinker, S., 1994 . How coulda child useverbsyntaxto learnverbsemantics ? Lingua92. 377- 410
(this volume).
Quine, W., 1960 . Word and object. Cambridge . MA : MIT Press .
Rieser, J.J.. 1979 . Spatialorientationof six-month-old infants. Child Development50. 1078 -
1087 .
Rosch , E., C. B. Mervis. W.O. Gray, D.M. Johnsonand P. Boyes -Braem. 1976 . Basicobjectsin
naturalcategories . CognitivePsychology8. 382- 439.
Rueckl, J., K. Caveand S. Kosselyn . 1988 . Why are 'what' and ' where' processed by separate
corticalvisualsystems ? A computationalinvestigation . Journalof CognitiveNeuroscience I,
171- 186.
Schachter . P.. 1985 . Parts of speech . In: T. Shopen(ed.). Languagetypology and syntactic
description . Vol. I : Clausestructure. 3- 61. Cambridge . MA : CambridgeUniversityPress .
Smith, L.B., 1989 . A model of perceptualclassificationin childrenand adults. Psychological
Review96. 125- 144.
Smith, E. and D. Medin. 1981 . Categoriesand concepts . Cambridge . MA : Harvard University
Press .
Smith. L. B., S. Jonesand B. Landau. 1992 . Countnouns. adjectives . andperceptualpropertiesin
novelword interpretations . Developmental Psychology 28, 273- 286.
'
Sola, N., S. Careyand E. Spelke , 1991 . Ontologicalcategories guideyoungchildrens inductions
of word meaning:Objecttermsand substance terms. Cognition38(2). 179-211.
Spelke, E., 1990 . Principlesof objectperception . CognitiveScience14. 29- 56.
Supulla, E., 1988 . Structureand acquisitionof verbsof motion in AmericanSign Language .
Cambridge . MA : BradfordBooks.
Talmy, L.. 1983 . How languagestructuresspace . In: H. Pick. L. Acredolo (eds.). Spatial
orientation: Theory, research , and application. New York: PlenumPress .
Talmy, L., 1985 . Lexicalizationpatterns: Semanticstructurein lexicalforms. In: T. Shopen(ed.).
Languagetypologyand syntacticdescription . Vol. 3: Grammaticalcategories and the lexicon.
57- 149. Cambridge : CambridgeUniversityPress .
Tanz, C., 1980 . Studiesin the acquisitionof deictic terms. Cambridge : CambridgeUniversity
Press .
Ungerleider , L.G. andM. Mishkin. 1982 . Two corti~ l visualsystems . In: D.J. Ingle. M.A. Goodale .
R.J.W. Mansfield(eds.). Analysisof visualbehavior . Cambridge . MA: MIT Press .
's whatandwhat's where
B. LandauI Where
296
Possible names:
The role of syntax - semantics mappings
in the acquisition of nominals *
Paul Bloom
Departmentof Psychology, University of Arizona, Tucson, AZ 85721, USA
Many scholars have posited constraints on how children construe the meaningsof new words.
These include the restriction that new words refer to kinds of whole objects ( Markman and
Hutchinson 1984), that words describing solid objects refer to individuated objects while words
describing non-solid substancesrefer to portions of substance(Sola et al. 1991), and that count
nouns that name objects are generalizedon the basis of shape (Landau et aI. 1988). There are
theoretical and empirical problems with theseproposals, however. Most importantly , they fail to
explain the fact that children rapidly acquire words that violate these constraints, such as
pronouns and proper names, names. for substances , and names for non-material entities. The
theory defended here is that children and adults possessmappings from grammatical categories
' ' ' ' ' '
( count noun , mass noun , and noun phrase ) to abstract semantic categories; thesemappings
serveto CQnstraininferencesabout word meaning. Evidencefrom developmental psychology and
linguistic theory is presentedthat suggeststhat even very young children possesssuch mappings
and use them in the course of lexical development. Further issues- such as the possibility of
developmental change, the precise nature of these semantic categories, and how children learn
words prior to the acquisition of syntax - are also discussed. It is concluded that although these
'
syntax-semanticsmappings are not by themselvessufficient to explain children s successat word
learning, they playa crucial role in lexical .
development As such a
, only theory that posits a deep
relationship betweensyntax and semanticscan explain the acquisition and representationof word
meanings.
'
Language ... must have its perfectly exclusivepigeon-holes
and will tolerate no flying vagrants. Any concept that asks
for expressionmust submit to the classificatory rules of the
game ... It is almost as though at some period in the past
Introduction
Taxonomic constraint:
' ... labels refer to
objects of the same kind rather than to objects that
are thematically related' ( Markman 1990: 59) . Thematically related
entities include those that fall into ' spatial , causal, temporal , or other
relations ' such as a dog and its bone, a dog and the tree that it is under ,
a dog and the person who is petting it , and so on. Although children are
sensitive to these sorts of relations in non -linguistic tasks (for instance,
they will put a dog and a bone together when asked to sort objects into
different piles), this constraint forces them to attend to taxonomies
' '
(such as the kind dog ) when faced with the task of inferring the
meaning of a new word .
Another proposal, advanced by Sola et ale ( 1991: 182- 183), is that the
following two proceduresapply in the processof word learning :
Procedure1:
Step I : Test to seeif the speakercould be talking about a solid object; if yes,
Step 2: Conclude that the word refers to individual whole objects of the
sametype as the referent.
300 P. Bloom / Possiblenames
Procedure 2 ..
-
Step I : Test to see if the speaker could be talking about a non solid
substance ; if yes,
Step 2: Conclude that the word refers to ,portions of substance of the same
type as the referent.
Shapebias;
' ... the biasto of a novelcountnoun
group objectsby shapein thepresence
...' (Landauet al. 1992: 87)
As Markman ( 1990) notes, the whole object constraint is false at least for
the lexicons of older children and adults. There exist words that refer to
' ' ' ' ' '
properties ( happy ), to actions ( hit ), to spatial relations ( under ), to substances
' water' and
( ) so on. In general, adjectives, verbs, prepositions, and
massnouns do not refer to whole objects.
One apparent solution would be to redescribe the constraint as only
applying to count nouns, and thus exclude examplesfrom other grammatical
categories. But the generalization still does not hold. Some count nouns
describewhole objects (' dog' ), but most do not. Within the domain of count
nouns that describe material entities, some refer to collections of objects
' ' ' ' ' ' ' ' ' '
( forest , bikini , flock ), while others refer to parts of objects ( finger , foot ,
1 Otherconstraintson word
learningthat havebeenproposedincludemutualexclusivity , which
is that wordscannothaveoverlappingextensions (MarkmanandWachtel1988 ), andtheprinciple
of contrast(Clark 1987 ), whichstatesthat thereare no synonymous fonDSin language . As with
theconstraintsdiscussed above,mutualexclusivityis clearlyfalseof adult language(asit excludes
' ' ' '
pairs of words suchas dog and animal) and thereis considerable debateover how well it
.
appliesto the languageof youngchildren(seeAu andGlusman1990 , Gathercole1987 , Merviset
al. 1991 , Nelson1988 ). The principleof contrastmay be correctfor adult languagebut it is not
by itself sufficientto constrain . the child' s inferencesand must work in aa: ord with other
principles . Neitherof theseconstraintscan becapturedin tenDS of the sortsof syntax-semantics
mappingsdiscussed here, but it is possible , as suggested
by both Clark ( 1990) and Gathercole
( 1987), that theseconstraintsmightalsoreduceto deeperpropertiesof language andcognition. In
particular, a prohibition on certainfonDS of lexicaloverlapmay be derivablefrom children's
knowledgeof pragmaticprinciples , e.g., of Griceanmaximsof communication .
302 P. Bloom I Possiblenames
Constraint hypothesis:
In the lexicons of young children, count nouns refer to kinds of whole
objects
The rapid acquisition of such words is clearly problematic for the whole
object constraint and the taxonomic constraint. It also poses a difficulty for
the shape bias. As noted by Sola et al. ( 1991, 1992), it would be surprising
' ' ' '
indeed if children extended words like uncle and clock on the basis of
shape; if they did , they would be unable to use thesewords in any way similar
to adults.
Finally consider the Sola et al. ( 1991) procedures, which state that words
describing objects refer to kinds of individual whole objects and words
describing substancesrefer to portions of substance. Revising their proposal
so that it applies only to nouns, this predicts that children should be unable
'
to acquire solid substancenames(' wood ) and namesfor bounded substances
' ' . But in fact even 2-
( puddle ) year-olds can learn solid substancesnames
(Prasada 1993), and there is evidence from Sola ( 1992) suggestingthat they
can also acquire namesfor bounded entities (seesection 4.2 for discussion) . It
'
is worth noting, however, that these are not children s first guessesas to a
'
word s meaning. For instance, when a word is used to describe a novel
'
bounded entity , children s first interpretation of its meaning is that it refers to
the kind of object, not the stuff that the object is made out of , and this holds
regardlessof the syntax in which the word is presented (Au and Markman
1987, Dickinson 1988, Markman and Wachtel 1988, Sola 1987) . Nevertheless,
'
the fact that words like ' wood are acquired at all militates against the claim
that children possessthe proceduresposited by Sola et al. ( 1991).
None of the counterexamplesdiscussedabove necessarilyrefutes the hypothesis
that these constraints exist. One possibility, discussed in detail by
Markman ( 1989), is that although they are present at the start of lexical
' '
development, the constraints can be abandoned or overridden in the course
of language development (possibly as the result of the application of other
constraints; see footnote 1) . More generally, they can be viewed as default
conditions which only apply in the absenceof certain countervailing circumstances
, and such circumstances may be present at any stage of lexical
development. .
Under this interpretation of what constraints are, however, it is unclear
whether any degree of understanding on the part of young children could
refute theseproposals. For instance, pronouns and proper namesare acquired
very early, often before children have learned any other noun that describes
' ' ' ' ' '
people (such as person , man , and woman ) . As such, constraints such as'
'
Mutual Exclusivity cannot block the child from interpreting words like Fred
' '
and she incorrectly, as namesfor kinds of whole objects. In fact, thesesorts
of errors do not occur (see Macnamara 1982) - but a reasonablereply by a
P. Bloom / Possiblenames 305
constraint -theorist is that the special status of what pronouns and proper
names refer to (i .e., people) causes children to override the taxonomic
assumption in such cases. In general, the fact that these constraints are
posited as default conditions makes it important to provide some theory of
what other rules and constraints can override them. Without such a
theory , the constraint proposal runs the risk of begging all of the difficult
questions.
In sum, the theories of Markman , Landau et al., and Sola et al. cannot
account for most of the words that children acquire. But the very same
induction problem that exists for the acquisition of a word such as ' dog' also
exists for words such as ' Fred' , ' water' , and ' forest' , and thus the very same
arguments for the necessityof constraints also apply. The goal of a theory of
lexical development is to account for the acquisition of all words, not just
names for kinds of objects, and this motivates an effort to explain the
acquisition of object names in the context of a more general theory of word
learning.
fact partition the world according to the shapes of the objects in it ...
young children very quickly realize this , abstracting a rule from their early
word learning experience that says shape is the critical factor in decisions
about the extensions of these nouns. Then they use this rule when
encountering new nouns and new classes, thus immensely simplifying and
'
speeding up the mapping of the one onto the other . While possible, this
assumes that children are able to learn the correct meanings of count
nouns (and can thus infer that they tend to refer to shape) prior to the
onset of the shape bias. Thus although this bias might facilitate word
learning later on , we are still left with the problem of what constrains
children ' s inferences in the first place.
If the constraints are unlearned, what is their precise nature? Do they
constitute a subpart of a distinct language acquisition mechanism that exists
solely to facilitate the learning of words? This is certainly conceivable, but it
would be preferable to motivate these constraints on word learning in terms
'
of more general properties of children s linguistic and cognitive competence.
One specific proposal, advanced below, is that theseconstraints emergefrom
' '
other properties of children s knowledge; in particular , from children s grasp
of syntax-semanticsmappings.
in order to account for words like ' Fred' and ' she' which refer to individuals
and are not ' generic' in the sensediscussedby Di Sciullo and Williams .
An alternative is that the relevant distinction is grammatical, and related to
how languages use syntactic categories to express meaningful notions and
relations ( Bloom 1990a, Jackendoff 1983) . In particular , it is not words that
have generic reference- it is categoriessuch as nouns and verbs. Nouns like
' ' ' '
dog and water are generic in the sensethat they can be extended to an
indefinite number of novel instances(an infinity of different dogs, an infinity
of different portions of water). Put differently, they refer to kinds, not to
particular individuals or entities.
In contrast , noun phrases (NPs) like ' the big dog' can be conceptualized
as referring to individuals , and not to kinds. The standard examples of this
are when nouns combine with quantifiers to become NPs. Thus ' a dog' can
' '
pick out a particular individual that happens to be a dog, those big dogs
picks out those dogs that have the property of being big , and so on. (See
Parsons, 1990, for a discussion of how a similar analysis can apply to verbs
and V Ps.)
The distinction betweenwords and nouns is crucial here, since some words
are NPs, not nouns. This allows us to explain the peculiar status of pronouns
and proper names. From the standpoint of grammar, they are lexical NPs
(seeBloom 1990b). With regard to their role in syntactic structure, words like
"Fred' and "he' behave like ' '
phrasal NPs such as the dog and thus cannot
appear with adjectivesand determiners. Under the hypothesis that pronouns
and proper namesare NPs, we can posit the following mappings:
It is likely , however that Mapping I is too strong; there are NPs that
appear not to refer at all. In a language like English, these include expletive
' ' ' ' ' ' '
pronouns, such as it as in it is raining or there as in there is trouble
' , One
brewing theory of why such semantically empty NPs exist is becauseof
a requirement in English that all tensed sentencesmust have overt subjects
(Chomsky 1981), even if thesesubjectsplay no meaningful role. In languages
like Italian , where overt subjectsare not necessary , one can say the equivalent
of ' is raining' , while in English, it is necessaryto add the meaninglessNP ' it '
in order to satisfy the grammatical requirement (but see Bolinger, 1973, for
P. Bloom / Possiblenames
evidence that even expletives have some semantic content) . This class of
counterexamples(and many others; see Bloom, under review) suggeststhat
the mapping from NPs to individuals does not hold in all cases. These
exceptions must somehow be learnable by children through something other
than the mapping posited above.
One possibility, first advanced by Nishigauchi and Roeper ( 1987), is that
children must initially acquirea given NP by using the syntax-semanticsmapping
' '
(e.g., referential it ), and only after having done so, can they understand the
sameword or string of words in a semanticallyempty context (e.g., expletive' it ' ).
This makesthe prediction that acrossdifferent languages , all words and phrases
that are NP expletivesmust also be NPs of the semanticallywell-behavedtype
( NPsthat refer), becauseotherwisechildren would not be able to acquire them.
There is someevidencethat this is the case( Nishigauchiand Roeper 1987), and
one could make the further prediction that children can only categorizea string
of words within an idiom as an NP (e.g., ' the bucket' as in ' kick the bucket') if
they are already capable of construing that string of words as having some
referentialmeaningwhen it is outside of the idiom. A theory along theselines is
discussedin Bloom (under review); for the purposeshere, it should be noted that
a completeaccountof lexical acquisitionmust explain how children acquirethese
sorts of non-referentialNPs.2
If we restrict their domain to count nouns, we can collapse the whole object
constraint, the taxonomic constraint , and the shapebias as follows :
2 A different sort of
puzzle concernsNPs that apparently refer to kinds, as with the subject NPs
' ' ' ' ' ' ' '
dogs and water in the sentencesDogs are friendly animals and Water is good to drink . These
NPs can be construed as referring not to actual dogs or actual portions of water, but to abstract
kinds - to the speciesDOG and the substanceWATER . Under one analysis, they serveas proper
namesfor these kinds, and thus refer to kinds in a very different way than do nouns (Carlson
1977). In any case, the acquisition and representationof theseNPs constitute a further domain of
study from the standpoint of syntax-semanticsmappings.
P. Bloom / Possiblenames 309
Bach 1986, Bloom I 990a, I 994a; Bloomfield 1933, Gordon 1985, 1988;
Jackendoff 1991, Langacker 1987, Levy 1988, McCawley 1975, McPherson
1991, Mufwene 1984, Quine 1960, Ware 1979, Weinrich 1966, Whorf 1956,
Wierzbicka 1985, seeGathercole 1986for a review) . In general, it is clear that
' ' ' '
objects tend to be described by count nouns ( a dog , five tables ) and nonsolid
substancestend to be described by mass nouns ( much water' , ' less
'
'
sand ). This holds for languagesother than English; acrossdifferent language
families, names for entities like dogs and tables are always count nouns, and
names for entities like water and sand are always mass nouns (Markman
1985, M ufwene 1984, Wierzbicka 1985). This pattern is unlikely to be a
coincidence, and it might lead one to the hypothesis that entities describedby
count nouns have perceptually salient boundaries, and thus are countable,
while mass nouns describeeverything else.
Nevertheless, the sameobjections discussedearlier against the whole object
constraint have also beenmade against this theory of count/ masssyntax (e.g.,
by Ware 1979) . One objection is that many count nouns do not describe
' '
whole objects; there exist abstract words like ' nap and 'joke , as well as
collective nouns like ' forest' and ' anny ' . If one is to retain the notion that the
grammatical count/ mass contrast maps systematically onto some cognitive
division , the cognition side of the mapping must be considerably more
' '
abstract than ' whole object and ' non-solid substance.
As a result of theseconsiderations, many scholars have proposed that the
grammatical count/ massdistinction maps onto the semanticcontrast between
nouns that refer to kinds of individuals vs. those that refer to kinds of nonindividuated
'
entities, which we can view as ' portions (seeBach 1986, Bloom
1990a, 1994a; Bloom and Keleman, under review; Gordon 1982, 1985, 1988;
Jackendoff 1983, 1990, 1991; Langacker 1987, 1990; Macnamara 1986,
Macnamara and Reyes 1990). The cognitive notion of inviduals is related to
properties such as countability , indivisibility , and boundedness , and is roughly
to ' discretebounded ' . Within the domain of material entities
equivalent entity ,
this usually corresponds to whole objects, though it can also correspond to
entities such as a forest, a puddle, and a pile. Outside of the domain of material
' ' ' '
entities, an event that takes a bounded interval of time ( a race , a conference)
' ' '
can be construed as an individual , as can a mental state ( a headache, a
' ' ' ' '
nightmare ) or a period of time ( a day , an hour ). (Some speculationsabout
' '
the precisenotion of individual will be discussedin section 6.2.)
What support is there for the claim that count/ mass syntax actually
corresponds to these aspectsof abstract cognition ? One source of support is
from linguistic analyses (e.g., Barwise and Cooper 1981, Jackendoff 1991,
310 P. Bloom I Possiblenames
semanticbasis of this distinction until a relatively late age. For instance, Levy
( 1988: 186) reviews the work of Gordon and Gathercole and concludes as
follows :
' ... Gordon 1985 and Gathercole 1985 have shown that the count- massdistinction is
( ) ( )
acquiredthroughformal cluesrather than via the semanticobject-substance distinction. The
reasonseems correlationbetweenthesetwo
to be that, in English,thereis not a veryconsistent
.'
distinctions
'
These findings have been taken as strong evidence for a distributional
' of
theory language development. Levy ( 1988) argues, following Karmiloff -
Smith ( 1979), that children view the acquisition of grammar as a formal
' '
puzzle, a problem space per se , and semantics is irr .elevant. This is also
'
Gathercole s ( 1985) conclusion, but Gordon ( 1985, 1988) proposes a quite
different view, maintaining that count/ masssyntax is basedon quantificational
semantics from the very start. What young children lack, according to
Gordon , is knowledge of how this semantic contrast maps onto perception.
That is, they understand that the contrast between count nouns and mass
nouns corresponds to the distinction between words that refer to kinds of
individuals vs. words that refer to kinds of portions - but they lack an
understanding that physical objects are canonical individuals and non-solid
substancesare canonical portions . If this were correct, then these mappings
would be uselessas a source of constraint in word learning.
The specific studies of Gathercole and Gordon are critically reviewed in
considerabledetail in Bloom ( 1990a) ; it will suffice here to raise a conceptual
' s understanding
point . All of the experiments purportedly showing that children '
of count/ mass is not semantic involve studying children s sensitivity
to linguistic cues. Thus one finding is that if 3- and 4- year-olds hear (e.g.,)
' This is a blicket ' ' '
they tend to grammatically categorize blicket as a count
' '
noun, while if they hear This is some blicket , they tend to grammatically
' '
categorize blicket as a mass noun. Further , children will give these '
syntactic
" '
cues priority over referential cues. If they hear This is a blicket they are
312 P. Bloom I Possiblenames
likely to interpret the word as a count noun regardlessof whether they are
being shown an object or a substance(Gordon 1985) .
One interpretatio~ of this result is that children' s understanding of count/
mass is not semantic. Instead children possesssome generalization of the
form : ' Everything following the word " a" is a count noun' , and this is
distinct from any semantic understanding, which has to be learned at some
later point . This assumesa dichotomy between' linguistic cues' and ' semantic
cues', where the latter is restricted to information that children receive
through perception of the external world . An alternative, however, is to reject
this dichotomy altogether. Semantic information can be conveyed through
' ' ' '
language; when children hear a blicket and categorize blicket as a count
noun, they may be drawing a semanticinference. Specifically, children might
encodethe determiner ' a' as having the semanticpotential of interacting with
a noun that 'refers to a kind of individual to pick out a single individual -
becausethis is what it means- and it follows from this that any noun that
follows ' a' must refer to a kind of individual and thus must be a count noun.
In fact, linguistic cues are a more reliable cue to the semantic' status of a
novel word than perceptual cues are. This is becausea given percept can be
construed in different ways; if a solid object is describedas ' blicket' , it is quite
' '
possiblethat blicket is actually a massnoun, becauseit could refer to the stuff
that the object is made out of. But linguistic cuesare flawless; every noun that
can co-occur with a quantifier that has the semanticrole of individuation has
to be a count noun. Given this, the child' s early sensitivity to linguistic
information actually supports a semantictheory; it does not refute it .
recorder, presented one after the other at a very fast rate, which could be
construed either as a set of discrete bells or as an undifferentiated noise and
therefore could also be describedwith either a count noun or a mass noun.
' ' ' '
All children were presentedwith both the food and the bell stimuli. One
' - '
group was told : These are feps there really are a lot of feps here (count
'
noun condition ); the other group was told : this is fep - there really is a lot of
' were taught the word
fep here (massnoun condition ) . Then the children who '
'
as a count noun were told to give the puppet a fep for the food condition
'
and, in the sound condition , were given a stick and a bell and asked to make
'
a fep . The children who were taught the word as a mass noun were told to
' '
give the puppet fep in the food condition or , in the sound condition , to
' make '
fep with the stick and the bell. If children are sensitiveto the semantic
properties of count/ mass syntax, they should act differently in the count
' '
condition than in the mass condition . When asked for a fep , they should
' '
tend to give one object or make one sound, and when asked for fep , they
should tend to give a handful of objects or make a lot of sounds.
These were the results obtained: Both 3- and 4-year-olds performed
significantly above chance on both the food and sound conditions. This
finding provides further support for the hypothesis that there is a semantic
basisto count/ masssyntax evenfor non-material entities, and indicates that an
-
understanding of mappings betweensyntax and semanticsis present in 3 and
-
4- year olds.
Sensitivity to the semanticsof the noun/ NP contrast is evident at an even
earlier age than is an understanding of the semantic basis of count/ mass. The
mapping hypothesis is that young children should understand that the
grammatical contrast betweenwords that are nouns and words that are NPs
correspondsto the contrast betweenwords that refer to kinds and words that
refer to individuals. In a classic study by Katz et al. ( 1974), the experimenter
taught young children new words by pointing to an object and saying either
' This is a ' count noun context or ' This is '
wug ( ) wug ( NP context). Even some
- -
17 months olds were sensitive to this grammatical difference; when the word
was presentedas a noun they tended to construe the word as the name for a
kind , but when it was presented as an NP , they .tended to construe it as a
name for a particular individual (see Gelman and Taylor , 1984, for a
replication with slightly older children) . The findings that children younger
than two treat nouns and NPs differently with regard to how they interact
with determiners and adjectives, and that they categorize pronouns and
proper namesas NPs (Bloom 1990b) constitute further evidencethat children
possesssome grasp of syntax-semanticsmappings.
P. Bloom / Possiblenames 315
~
6. Openquestio
' '
6.2. What is the nature of kind of individual ?
Hypothesisabout possibleindividuals:
Something (e.g., an object or set of objects) can be encoded as an
' individual ' if we can construe it as
playing an independent causal role in
some conceptual domain.
objects, and asked to describe these using the new word . Their responses
indicate whether they think the word refers to a collection or to an individual
' '
object. For instance, if they interpret fendle as a collective noun, they should
describe eight objects as either one fendle' or ' two fendles' (depending on
'
how crucial numerosity is in their collective interpretation) and should
describeone object as ' part of a fendle' . If they interpret it as an object name,
' ' '
they should describe eight objects as eight fendles and one object as one
fendle .'
In a pilot study with a group of 36 adults, we tested the effects of syntax
' '
(singular count vs. neutral) and intentional integrity on their interpretation
of novel words. This second manipulation went as follows : For half of the
trials, the objects were placed in front of the subject slowly and carefully ; for
the other half , the objects were casually dumped in front of the subject. The
prediction was that this simple manipulation would have an effect on whether
the subjects construe the novel word as a collective noun. The mere act of
purposefully and intentionally setting out the stimuli in a given configuration
should be sufficient to emphasizeto the subject that the set itself is relevant as
a single individual .
This prediction was confirmed: When the novel word was presentedas a
singular count noun, there was a bias towards interpreting it as referring to
the entire collection. The bias increased(though not significantly so) when the
objects were placed in front of the subject, rather than dumped. When the
novel word was presentedwithout syntactic support, however, there was an
actual switch in the favored interpretation : In the dumping condition , almost
all subjectsconstrued the word as a name for a single object (like ' tree' ), but
for the placing condition , the collection interpretation (like ' forest' ) was
strongly favored.
Further research will explore whether young children can also acquire a
collective noun through this sort of intentional cue, and will also focus on the
' '
precisenature of adults and children s construal of the new word. The hope
is that by studying the acquisition and understanding of nominals referring to
sets of discrete objects - including collective nouns and names for artwork -
we will gain some insight into the nature and development of the notion
' individual ' and how it relates to
cognition and perception.
The proposal here is that the constraints children use when acquiring
words are the result of their understanding of the mappings from syntactic
322 P. Bloom I Possiblenames
overt grammatical cues. This may be one explanation for the well-known
finding that 3-year-olds show a greater sensitivity to the constraints than 2-
year-olds (e.g., Landau et al. 1988, Markman 1989, Nelson 1988) - children
can use the constraints to their fullest only once they have some facility with
the grammatical and morphological structure of their language.
The view that thesemappings provide constraints on children' s word meanings
evenprior to the acquisition of surfacesyntax may be counterintuitive, but
there is some support for it . In an extended analysis of the production and
comprehensionabilities of children in the one-word stage, Huttenlocher and
'
Smiley ( 1987: 84) state, Taken as a group, the object words in the single-word
period form a broad semanticclasswhich contrasts with other semanticclasses
emerging at the same time. That is, the pattern of usage of object words
contrasts with that of words for events ..., words for persons ..., words for
'
temporary states, greetings, and negation, and so on . They go on to suggest
that this early demarcation of words into these classesprovides a semantic
foundation for the later acquisition of syntactic categories. A more radical
interpretation is that thesechildren have already classifiedthesewords into the
relevant grammatical classes , and all they have left to do is learn how (or if )
their languageexpresses theseclassesin the grammar and morphology. (Does
their languagemark the contrast betweencount nouns and massnouns? Is the
morphology different for verbs than for adjectives?) Once they have acquired
these surface expressionsof the linguistic categories, children can use the
mappings to further facilitate the acquisition of word meanings.
7. Limitationsof syntax-semantics
mappings
One might be tempted to view the constraints that emerge from these
syntax-semanticsmappings as constituting a solution to the problem of word
' '
learning. For instance, once the child knows that dog is a count noun, she
can infer that the word refers to a kind of individual , and this excludes
innumerable other possible interpretations. If knowing the grammatical class
that a word belongs to is tantamount to knowing what it means, then these
mappings could effectively solve the induction problem. Unfortunately , there
are two main reasonswhy this optimism is misguided.
First , we have been assuming throughout that the child is able to infer
what the novel word is being used to describe, leaving only the question of
how sheencodesthe meaning and appropriately extendsit to novel instances.
As Gleitman ( 1990) has argued, however, there are many caseswhere children
324 P. Bloom I Possiblenames
are exposed to words in the absenceof the entities that they describe. An
adult , for example, might point to a bowl of cereal and say ' Do you want
milk with that ?' even when no milk is present. Consider also words for nonmaterial
entities, like ' nap' and 'joke ' , where the notion of inferring what an
adult is pointing to or looking at does not apply. One could also note the
successof blind children at learning words ( Landau and Gleitman 1985) in
order to appreciate the mystery here. A complete theory must explain how
children grasp the adult ' s intention to refer - how they somehow make the
correct guessas to what adults are talking about when they use novel names
(for researchalong these lines, seeBaldwin 1991) . More generally, no theory
of the acquisition of words can be complete without a prior theory of how
children can pick up the intended reference of language-users (Macnamara
1982) .
Second, even with the aid of grammar-cognition mappings, there is still an
infinity of possible meanings of the new word and children are stuck with
' '
sorting them out. The count noun dog could refer to dogs, but it could also
refer to dogs and pencils, to dogs until the year 2000 and then to cats, and so
on. Knowing that a given word refers to a kind of individual is only a small
part of the word learning puzzle; children must also determine which kind of
individual the word refers to , and it is here that the induction problem runs
' '
deep, particularly given how the notion of kind of individual interacts with
conceptual systems such as social cognition (see section 6.2) . Crucially , an
explanation of how children learn words involves a theory of psychologically
' ' ' ' '
possible kinds (one that includes dogs and tails , but excludes dogs and
'
pencils ) . In sum, while syntax-semantics mappings may be part of the
solution to how children learn new words, they are not sufficient. Not only
does a complete account of the acquisition of word meaning require an
explanation of how people understand the intended referenceof others, it also
requires a theory of conceptual representation.
8. Concludingcomments
If one were to follow the standard course in the study of word learning,
' ' ' '
and only consider the acquisition of words like dog and cup , it would be
hard to empirically distinguish the claim that children possessspecial word
learning constraints from the alternative that they apply syntax-semantics
-
mappings. Both theories avoid the same hard questions how do children
determine what a new word is meant to describe, and what constitutes a
P. Bloom I Possiblenames 325
psychologically possible kind - and both can capture the same simple
phenomenon; if a child hears a single object describedwith a word , he or she
will tend to take the word as referring to that kind of object. Within this
domain, the advantage of the syntax-semantics mapping theory is solely
theoretical; it only posits aspectsof children' s psychology (a mapping from
count nouns to kinds of individuals and a bias to view discrete physical
objects as individuals) that have independent empirical support and have
been previously proposed in adults for reasonsthat have nothing to do with
lexical acquisition. This is preferable to having to posit special unlearned
constraints that exist solely to help children acquire words and which have no
other motivation or support.
The empirical differences between the two theories become more obvious
when we consider words that do not describe whole objects. The constraints
advancedby Ma ,rkman ( 1990), Sola et al. ( 1991), and Landau et al. ( 1988) do
not apply to words like ' Fred' , ' she' , ' map' , ' foot ' , and ' forest' - and these
sorts of words are present in the speech of 1- and 2-year-old children. By
shifting the focus to mappings between grammar and abstract cognition, we
have a framework in which to deal with the acquisition of pronouns and
proper names, words for substances , words for material entities that are not
whole objects (like parts and collections), and words for abstract entities.
With the notable exception of research on the development of verb
"
meaning (e.g., Gleitman 1990, Pinker 1989) , most scholars have viewed word
learning as an independent issue from the nature and development of
grammatical knowledge. It is also often assumed that the theoretically
interesting casesof word learning are limited to the acquisition of words for
material entities, usually names for whole objects. ~ is article has presented
reasons for abandoning both of these assumptions, and for exploring how
' '
mappingsbetweensyntactic categorieslike count noUJ) and abstract semantic
' '
categorieslike kind of individual facilitate the acquisition of word meaning.
References
Au, T.K. and M. Glusman , 1990 . Theprincipleof mutualexclusivityin word learning: To honor
or not to honor? Child Development 61, 1474- 1490
.
Au, T.K. and E.M. Markman, 1987 . Acquiringword meaningsvia linguisticcontrast. Cognitive
Development 2, 217- 236.
Bach, E., 1986. The algebraof events . Linguisticsand Philosophy9, 5- 16.
Baldwin, D.A., 1989 . Prioritiesin children's expectationsaboutobjectlabelreference: Form over
color. Child Development 60, 1291 - 1306.
326 P. Bloom I Possiblenames
. .. C and Lila
Cynthia Fisher ' , D . Geoffrey Hallb , Susan Rakowitz
Gleitman d
. Department , University
of Psychology , 603E. DanielStreet
of Illinois , Champaign, /L 61820
, USA
b Medical
ResearchCouncil
, CognitiveDevelopment Unit
, 4 Taviton
Street
, LondonWC1 H OBT, UK
CDepartmentof Psychology, FairfieldUniversity ,
, Fairfield CT06430, USA
d Department
of Psychology , University of Pennsylvania , 3815WalnutStreet , PA
, Philadelphia
19104, USA
. Thanksare due to Stevenand Marcia Roth for funding that supportedthe experimental
work, througha grant to Lila Gleitman. Preparationof this paperwasalsosupportedby NSF
grant DBC 9113580to C. Fisher. We wishto thank ReneeBaillargeon , Judy DeLoache , Henry
Gleitman, JaneGrimshaw , AravindJoshi, MichaelKelly, BarbaraLandau, Anne Lederer , Ellen
Markman, Kevin Miller, Letty Naigies. ElissaNewport, Brian Ross, and SandyWaxmanfor
helpfulcommentson earlierdrafts. NoamChomskyis alsothankedfor continuingcommentary
on this line of work. Finally, we wishto thank the children, staff, and parentsof the SaintFaith
NurserySchoolin Philadelphiafor their cooperation .
. . Corresponding author.
1. Introduction
2 Of course it is easy to think of nouns that are similarly ' abstract ' , such as liberty , so
relative ease of learning via extralinguistic observation is not theoretically identifiable with
the noun / verb distinction . But it is as a practical matter : Abstract verbs are common in usage
to children ( 5 of the most frequent verbs in maternal use to children under two years refer to
mental states and acts, want, like , think , know, and see) but all the most frequent nouns in our
corpus refer to visible object classesor names, e.g., Mommy . The more important point is that
subjects cannot reconstruct even the maternal verbs that refer to observable actions (go, eat,
catch, etc.) by watching the scenein the presenceof evidence (the beep) of just when they were
uttered.
C. Fisher et a/. I Syntactic constraints
Locke proposed , they would fail to acquirelanguagefor quite differentreasons ; namely, that
almostno wordsreferto sensorypropertiesor combinationsof these(Fodor 1981 , Armstronget
at. 1983).
5 For all theseperspective - changingverbpairs, distinguishing environmental conditionsarenot
reallynonexistent , but arevery rare. For instance, it is reasonable to say Thepeoplefled thecity
but not so reasonableto say The city chasedthe people(examplefrom Pinker, personal
communication ). Soin principleonecanfleewithout beingchased . Thequestionis whetherthese
'
raredissociatingenvironments playa role in the child s differentiationof thesepairedverbs. We
know that closeto a third of verb usesto youngchildrenare in the absence of their referents-
not aboutthe here-and-now (Beckwithet al. 1989 as in " "
), Granny coming visit nextweek ,
is to
whichoccursin the absence of visiblecoming(or visibleGranny). This meansthat the learning
devicemustbe quite tolerantin evaluatingscene -to-world conjectures . /Come/ mustbe mapped
onto ' come' thoughit is oftensaidwhennothingis coming, and oftennot saidwhensomethingis
coming. No learningprocedurewilling to discountthe large percentageof scene /usagemis-
matchesfor comecould treat the vanishinglyrare mismatch es for give/receiveor chaselftee as
anythingbut noise.
C. Fisheret alI Syntacticconstraints 339
( I ) Look, biffing!
(2) The rabbit is biffing the ball to the elephant.
(3) The elephantis biffing the ball from the rabbit.
while watching a rabbit give and an elephant receivea ball. As we will show,
if the listener has no accessto the syntactic framework, as in ( I ), she will
probably interpret / biff / as related in meaning to English give. Hearing
sentence(2) bolsters this choice. But a learner who inspects sentence (3)
favors receive.
There are two clues to this choice in sentences(2) and (3) . First is the to/
from distinction , which indicated which entity is source and which is goal of
the moving ball. Second is the placement of rabbit and elephant within the
structure, for whatever entity showed up as the subject of the sentencehas
been selected, in the utterance, as the one that the sentenceis ' about' - the
entity of whom the act is predicated. The notional interpretation of / biff /
must be one that still fits the sceneobservedbut castsit in a different light : If
the subject of the predication was " rabbit " , then the act was giving; if it was
'. "
elephant , then the act was receiving. In essence , for a listener sensitive to
the full sentence, the interpretation of the observed scene will have been
affected by the linguistic observation that accompaniesit .6
The difficulty of the mapping problem is not restricted to the perspective-
changing verbs that we have just discussed. Consider a learner observing a
scenein which a rabbit pushesa duck, who falls ; and hearing one of these
three sentences:
6
Presaging later discussion, note that a discovery procedure that implicates semantic
deductions from surface structure must confront the fact that the relation between surface
syntax and argument structure , even within a single language, is complex at best and can be
misleading in some cases. Consider the case of gel, a near relative of the two verbs ( give and
receive) just discussed. Gel is subject to two interpretations . When we say " Emmanuel got a
book from the library " , the subject (Emmanuel) is certainly the causal agent in the book ' s
" "
moving out of the library . But when we say Emmanuel got the flu from Aaron , Aaron was
the intended causal agent, assuming that Emmanuel wanted no part of the flu . Thus surface
position of the nominals does not uniformly reflect distinctions in their thematic roles.
"
Moreover, gel can appear in two -argument sentencessuch as " Emmanuel got the flu in which
its transfer-of -possessionsenseis masked (if intended) and may not be intended in the first
place. We will return to these issues.
340 C. Fisheret al. / Syntacticconstraints
(3) the elephant is agent of receiving. But there may be a preference between
them all the same. In most give/ receive scenarios, the giver seems more
volitional and thus is the plausible candidate for the agent role - the cause,
first -mover, or instigator (Dowty 1991) .
Our experiment will examine the joint effects of preferences in event
representation (the agency bias) and syntactic deduction. Sometimes these
'
factors work together to reveal the verb s meaning. For example, causal
agent is linked to subject position in the sentencein all known languages
(Clark and Begun 1971, Grimshaw 1981, Bates and MacWhinney 1982,
Pinker 1984, Givon 1986, Schlesinger 1988, Dowty 1991) . Thus in sentence
'
(5) the listener s event bias will mesh with syntactic deduction. In contrast ,
sentence (6) pits the two evidentiary sources against each other , for the
' '
preferred causal interpretation ( push ) is in this case incompatible with the
intransitive syntax. 7
Experiment
4.1. Method
4.1.1. Subjects
The child subjects were twenty-four 3-year-old children (mean age 3;8,
range 3; 1- 4;0), and thirty 4- year-old children (mean age 4;8, range 4;3- 5;0) .
Nine children (five 3-year-olds and four 4- year-olds) were replaced in the
C. Fisheret at. / Syntacticconstraints 343
design for failure to respond (see the Procedure section below) . Eighteen
adults were included to provide a baselinemeasureof competent performance
in this task. A third of the subjectsin each age group were randomly assigned
to each of three introducing context conditions (seeProcedure, below) .
4.1.2. Stimuli
Six brief motion sceneswith puppet actors were video-taped. The scenes
were designedto be naturally describablewith two English verbs that differed
in their semantic and syntactic properties. One of the sentencecontexts that
could accompany each scenewas arbitrarily called the ' X ' context, and the
other was called the ' V ' context. Descriptions of the scenesand thesesentence
contexts are shown in table I .
Table I
Scenes
/sentence
pairs
Scene Sentences
I : A rabbit is feedingan elephant X: The elephantis -- ing. (eat)
with a spoon. Y: The bunnyis - -ing the elephant . (feed)
2: A rabbit comesup and pushes X: The bunnyis - -ing the monkey. (push)
a monkeyoff a box. Y: The monkeyis - -ing. (fall)
3: A rabbit runsacrossthe screen , X: The bunnyis ---ing the skunk. (flee)
followedby a skunk. Y: The skunkis ---ing the bunny. (chase )
4: A monk :ey is riding piggy-back X: The monkeyis - -ing the bunny. (ride)
on a rabbit. Y: The bunnyis ---ing the monkey. (carry)
An elephanthandsa ball to a rabbit. X: The elephant is -- ing the ball to the
bunny. (give)
Y : The bunny is ---ing the ball from the elephant
. (take)
6: A rabbit putsa blanketovera monkey. X : The bunny is ---ing the blanketonto the
monkey. (put)
Y : The bunny is -- ing the monkeywith the
blanket. (cover)
For the first two scenesin table 1 (feed/eat and push/ fal/), the syntax of the
two sentencesdiffers in the number of noun phrases, i.e., transitive feed
expresses the causal relationship w~ile intransitive .eat does not. For the next
two scenes(chaselfleeand .carry/ ride), the number of noun phrases is equal
but the order of the nouns encodes two perspectives on the event and,
8
consequently, who is the agent.
8 Notice tha~ride/carry and choselfleedon' t differ syntactically
, i.c., in their subcatcgorlzation
frames; both appearin simpletransitivesentences . Moreover, both membersof thesepairshave
344 C. Fisher et alI Syntactic constraints
In the last two scenes(give/ receiveand put/ cover), the sentencesfor the two
standard choicesdiffer in the order of NP argumentsin the sentenceas well as
in the preposition used to mark the indirect object (to vs. from and onto vs.
with) . These last two casescan be subdivided into one pair relevant to the
choice of agent (give/ receive), thus subject (elephant vs. rabbit ), and one pair
relevant to the choice of goal and located object (put/ cover) , thus direct object
(blanket vs. monkey) . Thus the stimuli overall can provide some indication of
the kinds of syntactic-semantic linkages that young children can recruit for
verb mapping.
Each sentence structure was randomly paired with one of six nonsense
"
syllables for each subject (zike, blick , pilk , dack, moke, node) . All sentences
were presentedwith the verb in the progressiveform (blicking) to maximiz@
intelligibility and pragmatic felicity as descriptions of ongoing actions.
4.1.3. Procedure
Two experimenters tested each child individually ; one showed the videotapes
and uttered the stimulus sentences , while the other recorded the
' were also audio -taped, to allow later
subjects responses. The sessions
checking of the accuracy of the recording experimenter.
"
A puppet (Mac) was introduced to the child who was then told Mac
'
doesnt speak English very well, so sometimeshe usespuppet words. Can you
"
help us figure out what the puppet words mean? . Assent received, the child
was then given a practice trial in which Mac said " Look ! The elephant is
"
zorping ! as an experimenter made a hand-held elephant puppet laugh. The
"
child was then asked, " What does zorping mean? and prompted by asking
" What is the "
elephant doing ? This latter prompt was used only in the
practice trial , to help the child understand the task. For the adult subjects,
Mac the puppet was omitted for obvious reasons. They were simply informed
that their task would be to guessthe meanings of nonsensewords.
On each test trial , the subject first heard the stimulus sentenceand then
saw the video-taped scene. The scenewas repeated for up to one minute as
the subject watched. The experimenter repeatedthe stimulus sentenceat least
the same thematic-role assignmentto syntactic position : the subject is agent, the object is theme
' '
(or patient) . What we mean by attention to the syntax in solving the mapping problem for the
' ' '
membersof thesepairs is that the observers conjecture about whether the verb means chase or
' flee' is -
consequenton noticing which observedentity receivesthe subject agent slot. As we have
emphasizedearlier, this is one reason why a priorly acquired nominal vocabulary is prerequisite
to acquiring verb meanings.
C. Fisher et al. / Syntactic constraints 34S
once while the videotape played. The subject was asked " What does gorping
rnean?" . Additional prompts were " What ' s happening?" or " What ' s going
on?" . Subjects were encouraged to guess. Subjects who said nothing during
the practice trial and during the first two experimental trials were dropped
frorn the study. Sceneswere presented in two orders, chosen to allow all
subjects to begin with less difficult items (as revealed during pilot testing) .
The first order was feed/ eat, give/ receive, chaselfiee, cover/put , carry/ ride,
push/fall . The secondorder was its reverse.
The nonsenseverbs were presentedto the subjectsin one of three linguistic
contexts:
' '
Neutral syntax ( No sentence) trials. A third of the subjects in each age
group heard the nonsensewords in the syntactically uninformative context
" Look ! "
Ziking ! . These subjects were cued (by the -inK suffix, Brown 1957)
that the novel word was a verb but receivedno infonnation about its specific
syntactic behavior. Even if young children use syntactic evidencein real-life
verb learning, this experimental condition withheld such evidence, allowing us
to assessany blases subjects might show .in interpreting the scenes and
providing a baselineagainst which to compare their perfonnance when given
syntactic information .
Sentencetrials. The remaining two -thirds of the subjects saw the scenes
accompaniedby one of two introducing sentential contexts ( X or Y in table
1, for each scene). Each subject heard only one of the two sentencesfor a
' '
single scene. One group of subjectsheard the sentencesdesignated X in table
' '
1, and the other group heard those designated V . We reiterate that the
assignment of sentencesto the categories X and Y was entirely arbitrary .
Thus, e.g., transitive feed was assignedto the Y category while transitive push
was assignedto the X category. This arbitrary assignmentassuredthat each
subject group heard some sentencesof both kinds.
4.2. Resu
/ts
Table
2
each
Proportion foreach
response scene
, byageandintroducing
context
Adults( n= J8) :
eat/ feed 0.00 0.17 0.83 1.00 0.00 0.00 0.00 1.. 0.00
push/fall 0.50 0.00 0.50 1.00 0.00 0.00 0.00 1.. 0.00
flee/chase 0.00 0.83 0.17 1.00 0.00 0.00 0.00 1.. 0.00
ride/carry 0.17 0.33 0.50 0.83 0.17 0.00 0.00 1.. 0.00
give/take 0.50 0.00 0.50 1.00 0.00 0.00 0.00 1.. 0.00
put/cover 0.00 0.83 0.17 0.50 0.17 0.33 0.00 1.. 0.00
Mean 0. 19 0.36 0.44 0.89 0.06 0.06 0.. 1.. 0.00
9 Preliminary analyses indicated that there was no effect of order (which of the stimuli the
subjects saw first ) on the probability of sentence-congruent responses , so the two order groups
were combined in this and all further analyses. The effect of syntax on verb paraphraseswas
shown in a series of planned I -tailed I-testS. In each case the dependent variable was the
proportion (arcsine transformed) of each response( X or V ), examined across sentencecontexts.
Separateanalyses for the 3- and 4- year-olds yielded the same results in each case; to simplify
presentation of the results, the two groups of children are pooled in the analysespresentedhere.
C. Fisheret al. / Syntactic
constraints 349
ducing context who produced A (e.g., feed, carry) and NA (e.g., eat, ride)
responses, as defined above. These values are shown only for the five relevant
scenarios(that is, excluding the put/ cover scenewhich cast the same participant
as subject in both context sentences ) . Across all presentation conditions
and all age groups, A 187
responses( ) outnumber NA responses(91) two to
one. There is an agency bias.
Four-year-olds(n = 30) :
. 0.80 0.10 0.20 0.60
feed eat 0.10 0.10
push fall 0.10 0.00 1 0.00 0.50 0.40
chase flee 0.10 0.00 0.90 0.00 0.10 0.40
carry ride 0.80 0.00 0.70 0.20 0.20 0.40
give take 0.40 0.10 0.50 0.00 0.30 0.60
Adults(n = 18) :
feed eat 0.17 0.00 1.00 0.00 0.00 1.00
push fall 0.50 0.00 1.00 0.00 0.00 1.00
chase flee 0.83 0.00 1.00 0.00 0.00 1.00
carry ride 0.33 0.17 1.00 0.00 0.17 0.83
give take 0.50 0.00 1.00 0.00 0.00 1.00
. Since ' Other ' within each scene/ context cell may
responses are left out of this table , proportions
not sum to I .
Table4
Proportion. agentive(A) and non-agentive( NA) responses
, by ageand introducingcontext
A NA A NA A
. Since ' Other ' within each scene/ context cell may
responses are left out of this table , proportions
not sum to 1.
meaningsto new words. They do not expect there to be exact synonyms. (For
this effect with nouns, seee.g., Clark 1987, and Markman , this volume ; and
for the analogous effects with verbs, Golinkoff et al., in prep. ; Kako , in
prep.)
We now ask whether the samepattern of syntactic effectson verb interpretation
occurs for the phrasal responsestaken alone. If so, it is unlikely that
'
the children were merely ' filling in the blank in the stimulus sentenceswith a
known verb that fit , but were engaging in something more like the syntax-
guided mapping procedure proposed here.
Table 5 shows the obtained responsepatterns for the phrasal responses ,
. Three 3- - olds and one 4- - old contributed
omitting all single-verb glosses year year
no data to this analysis. Given the reduction in the data entailed by
'
recoding the responses in this way, the 3- and 4-year-olds data were
combined. Inspection of table 5 shows that the same pattern of structural
effects on verb interpretation remains. This pattern was reliable in independent
I -tests on the (arcsine transformed) proportion of X and Y responses .
ResponseX was more likely in the context of sentence X than sentenceY
' '
(1(30) = 2.94, p < 0.01) and the No sentence context (1(32) = 4.11, p
< 0.001) . Similarly , responseY was more frequent given sentenceY than
either of the other two introducing contexts (sentenceX : 1(30) = 6.59, p
< 0.001; no sentence: 1(32) = 3.61, p < 0.001) . In sum, even when the
children evidently did not interpret the nonsensewords as puppet synonyms
for verbs in their known lexicon, there are strong and reliable effects of
sentencestructure on construal.
5. DiseI RiOD
Table5
Proportion of X and Y responsesby introducing context ; phrasal responsesonly (3- and 4- year-
aids together, n = 50)
By default , the effect should be that youngest children can acquire only
object tenDs (nouns, in the adult language) . This is because, as we
described in introductory remarks, only the nouns occur in maternal
speech in a tight time -lock with the situational contexts, and in ostensive
contexts. And indeed one of the most striking findings in the language-
learning literature is that first words are nouns despite the fact that from
the beginning the learner is exposed to words from every lexical category .
We assume that these first words are assigned to the formal category noun
on a semantic basis, as conjectured by Grimshaw ( 1981) and Pinker
( 1984) .
'
labelled by the novel verb (" Point to the one who s blicking the other one
"
around" or " Point to the one who' s blicking around ) . The intent here was to
put the children into the position of much younger learners who have access
only to the PSR: They knew how many arguments were supplied to the new
verb, but not which was which. Those who had heard the transitive frame
confidently chose the causal agent as the blicker , while those shown the
intransitive frame were willing to select the patient as blicker. Thus without
being told which event participant has been cast as sentencesubject, preschooler
interpreted a one-argument structure as incompatible with a causal
interpretation .
The PSR taken together with the scene observed will allow learners to
acquire a crucial aspect of the phrase structure itself. Supposea child hears
" kick " for the first time in the frame " The "
bunny kicked the monkey . The
agency bias,- as constrained by the minimal structure given by the PSR
-
(namely, a 2-argument structure), will lead the learner to seek an agent
patient interpretation of the scene. Provided that she knows the nouns bunny
and monkey, she can annotate the phrase structure as shown in figure 1,
marking bunny, the first noun in the structure, as the agent (Joshi and
Rambow, in prep.) . This representation now matches two of the quasiuniversa
-
properties of the category subject of transitive sentence there is
one noun which is both the agent and the leftmost noun in a transitive
14
structure. There is a strong tendency for languagesto place subjectsbefore
objects (Keenan 1976, Kayne 1992) . Further phrase-structure options can
then be set based on this initial assignment.
///""
""""
KICK
BUNNY
(agent)
MONKE
Fig. 1
14 Notice that this claim differs from the subject -agentlink which has beeninvokedin the
literatureto supportan initially asyntacticverb-learningprocedure(e.g., Grimshaw1981 , Draine
andHardy 1982 , Pinker1984 . It is
, andothers). In fact, subjectsareoftenpatientsor experiencers
the transitivesentence whoseagentjust about universallysurfacesas subject . To recognizethis
distinction requires, at minimum, PSR knowledgethat will revealthe numberof argument
positions.
C. Fisher et aI. / Syntactic constraints 359
-structure
Fig. 2. (a) anSVOphrase ; (b) a VOSphrase
template -structure .
template
Severalother kinds of cue to the phrase-structure of the exposurelanguage
;
have been suggested , and have plausibility . For instance, Mazuka ( 1993)
suggests that branching direction in Japanesecan be deduced by observing
intonational markings of moved relative clauses. Further , subject and object
=zz>~~ ~0ZZ
order and the way this maps onto thematic roles. If they heard a voice saying
" "
Big Bird tickles Cookie Monster they looked primarily at a video screen
" "
depicting this event, but if the voice said Cookie Monster tickles Big Bird ,
they looked at another video screen which depicted this latter event. These
toddlers evidently were sensitiveto (something like) the idea that the doer of
the action is the subject of the transitive sentence; and that in English the
serially first noun is that subject.
language. Once the full phrase structure is acquired, the learner can approach
the perspective-changing verbs that we have studied. Disentanglement of the
membersof thesepairs requires more than counting NP positions (which are
the same for both interpretations) and fitting these to the logic of the
observedsituation (which suits either choice) .
Once the phrase structure has been bootstrapped from PSR, the learner
can make this decision by inspecting the geometry of the tree to determine
which noun is sentencesubject. If the plausible agent appears as subject with
the give/ receive scene, then the situational and syntactic cues converge on
give. But if the plausible agent appears in nonsubject position then it is not
the agent, despite appearances. In the present experiment, we showed child
responsivenessto these implications of structure for verb interpretation even
in caseswhere they had to overcome a semantic bias in event interpretation
to use it .
cookie to the child ), subjects now come up with yet another physical term
(e.g., take) . They are mulishly resistant to conjecturing any mental term.
While successiveobservations force them to change their minds about which
physical-action term is the right one, they never seemto get the idea that this
bias should be overridden altogether. This effect was shown by Lederer et al.
( 1991) with adults, and by Gillette ( 1992) with children.
Examination of the further syntactic privileges of eat and want can
resolve this problem . Eat occurs intransitively and in the progressive form
while want does not . Want also occurs with tenseless sentence complements
(Do you want to eat the apple?) . These distinctions are sufficient to
disentangle the two verb construals , for only mental activity verbs license
these constructions . (Note that force verbs, which also accept sentence
complements, require an additional nominal position , e.g., Make him eat
the apple! , but not . Make eat the apple.' ). As we will discuss presently,
subjects seize upon this disambiguating structural information to find the
right construal .
A second example of residual problems unresolved by single frames, even
'
though theseare paired with differing events, concernsa blind child s learning
of the distinction between touch and see. Blind learners receiveobservational
evidenceabout the meanings of both terms though it is perforce haptic and
not visual. The blind child ' s first uses of see at age two were in the sense
' touch' e. . she commanded " Don ' t see that !"
, g, , while pushing her brother
away from her record- player. The confusion arose, doubtless, becauseevery
scenein which the blind child can see(' ascertain by perceptual inspection' ) is
a scenein which she can touch. And both verbs occur most often in maternal
" ' '
speechas simple transitives. Further syntactic experience( Let s seeif there s
"
cheesein the refrigerator ) can account for how the blind child could, as she
did , come to distinguish between the two construals by age three (Landau
and Gleitman 1985).
As a more general example of the convergencethat frame ranges make
available for verb mapping, consider the four verbs give, explain, go, and
think. These verbs are cross-classified both conceptually and syntactically.
Give and explain, different as they are in many regards, both describe the
transfer of entities between two parties. Accordingly, they can appear in
structures with three noun-phrase positions :
In (7), a physical object (the horse) is transferred from Ed' s to Sally' s hand
and in (8) abstract objects (the facts) are transferred from Ed' s to Sally' s
mind. A noun phrase is required for each of the entities involved: the giver,
the receiver, and that which is transferred betweenthem. It is this similarity in
their meanings that accounts for the similarity in the structures that they
accept. Verbs that describe no such transfer are odd in theseconstructions:
.
(9) Philip went the horse to Libby .
.
( 10) Philip thought the facts to Libby .
But there is another semantic dimension for these four verbs in which the
facts line up differently . Explain and think concern mental events while give
and go concern physical events. There is a typical surface reflex of this
distinction also, namely, mental verbs accept sentencecomplements(expressa
relation betweenan actor and a proposition) :
(x CAUSE (y OPEN
I I
AGENT THEME
' '
decomposing the meaning only to the level required to state the argument
structure and thus to predict the surface structures. The semantic distinction
between opening and closing is then derived from examining situational
factors, though only God and little children know just how. There is no
opening-vs.-closing syntactic reflex to aid them.
This schemecertainly does not imply that open is interpreted as meaning
' an event which is
causal and noncausal at the same time' . If that were true,
then the larger a verb' s syntactic range the less it would mean. Rather, the
lexical description predicts that open is causal when transitive, noncausal
when intransitive. The interpretive choice among those made available by the
lexical entry is (on any single use of the verb) derived computationally from
the truth value of the sentencestructure.
A difficulty with interpreting these results onto the child learning situation
is that thesesubjects(when correct) were identifying old verbs that they knew,
by definition : Perhaps they just looked up the frame ranges for these known
verbs in their mental lexicons rather than using the frames to make semantic
deductions. Becauseof this possibility , the pertinence of the findings is much
more easily interpreted by inspecting the 48% of cases where the subjects
failed to identify the maternal verb, guessingsomething else.16 The finding is
that false guessesgiven in responseto frame-range information are semantically
close to the actual verb the mother said (as assessedby the Fisher et al.
semantic-similarity procedure) while false guessesin responseto sceneswere
semantically unrelated to the verb the mother actually uttered. As syntactic
'
bootstrapping predicts, the frame range put the subjects into the semantic
' even when
neighborhood they did not allow convergenceto a unique verb
construal.
Note that 52% percent correct identification , while a significant improvement
over 7% or 28% , is not good enough if we want to model the fact that
verb learning by three-year-olds is a snap. They do not make 48% errors so
far as we know , even errors close to the semantic mark. But as we have
repeatedly stressed, syntactic bootstrapping is not a procedure in which the
child is assumedto forget about the scene, or the co-occurring nominals, and
attend to syntax alone (as Lederer et al. forced their subjects to do in this
manipulation by withholding all other evidence). It is a sentence -to-world
pairing procedure. Indeed, adding the real nouns to the frames without video
in this experiment led to over 80% correct verb identification ; adding back
the sceneyielded almost perfect performance. So if the child has available (as
she does, in real life ) multiple paired scenesand sentences , we can at last
understand why verb learning is easy.
Collateral evidence from children for the use of multiple frames is at
present thin , largely becauseit is difficult to get young children to cooperate
while a lengthy set of structures/ scenesis introduced. (For this reason, the
experiment presented in this article settled for studying the effect of single
structures on the interpretation of single scenes , though we interpret the
findings as a snapshot of an iterative process.) Supportive evidence comes
from Naigles et al. ( 1993), who found that young children will alter their
interpretation of known verbs in responseto hearing them in novel syntactic
, while
environments older
children
and
adults willnot(forareplication
usually
and extension
, see ). Evidently
etale1992
Naigles ,expansionoftheframe
istaken
range asevidence oftheconstrual
foralteration early
' inthelearning
for
process that word, but
after
extensive the
experience word s meaningisset
thesyntax
and losesitspotency
tochange .
theconstrual
5.3. Whatsemanticcluesresidein thesyntax?
one of the meanings and others the consequenceof the other. Putting them
together as a single frame-range should lead to chaos. The degree to which
polysemy reducesthe plausibility of the use of multiple frames is unknown in
detail (seeGrimshaw, this volume, for a pessimisticview) . However, there is
some suggestiveevidence that the frame-ranges of verbs are well-correlated
with their meaningsin the general case, despite this problem.
The manipulations of interest in this regard were carried out in English by
Fisher et al. ( 1991) and in Hebrew by Geyer et al. (forthcoming) . One group
of subjects provided the frame ranges for a set of common verbs (they gave
judgments of grammaticality of all the verbs in various syntactic environments
) . A second group of subjectsprovided semantic-relatednessjudgments
for these verbs presented in isolation (with no syntactic context) . The
question was whether the overlap in frame-ranges predicted the semantic
relatednessamong the verbs. The answer is yes, massively- and in materially
'
the same way for English and for Hebrew. The more any two verbs
overlapped in their syntactic privileges, the closer they were judged to be in
their meanings. Evidently , overlap in frame range provides a guide to
semanticrelatednessthat (though probabilistic) is stable enough to contribute
to the verb-learning feat.
The three problems just described - variability of the mapping relations,
alternate parsesfor input sentences , and polysemy - limit or at least complicate
the. potential effectivenessof the procedure we have called syntactic
bootstrapping. Thus any linguist or psychologist worth his or her salt can
find counterexamples to the claim that frame-information , or frame-range
information , always and perfectly predicts the relevant (argument-taking)
properties of the verbs - just as counterexamples to the usefulness of
situational information are easy to find.
We must suppose, in consequence , that the learner draws on convergent
cuesfrom prosody, syntax, and situation , as available, jiggling them all across
instances to achieve the best fit to a lexical entry . That is, the internal
structure of the child ' s learning procedure is likely to be quite mixed in the
information recruited and probabilistic in how such information is exploited,
sad as this seems. In the work presented, we could show only that syntactic
evidence is on theoretical grounds crucial for working out certain mapping
problems (those that involve perspective-taking verbs) and indeed is used by
youngsters solving for these under some exquisitely constrained laboratory
conditions. We take the outcomesto lend plausibility to the overall approach.
C. Fisher et al. / Syntactic constraints
6. Concl _ io-
References
ArmstrongS ., L.R. Gleitman and H. Gleitman, 1983 . What someconceptsmight not be.
Cognition ,13 263 - 208.
Bates, E. and B. MacWhinney . Functionalistapproach
, 1982 esto grammar.In: E. Wanner, L.R.
Gleitman(eds.), Languageacquisition: The stateof the art, 173- 218. New York: Cambridge
UniversityPress .
Beckwith , R., E. Tinker and L. Bloom, 1989 . The acquisitionof non-basicsentences. Paper
presented at the BostonUniversityConference on LanguageDevelopment , Boston.
Bowerman , M., 1976 . Semanticfactorsin the acquisitionof rules for word useand sentence
construction . In: D.M. Morehead , A.E. Morehead(eds.), Normal and deficientchild language
. Baltimore, MD : UniversityPark Press .
Bowerman , M., 1982 . Reorganizationalprocesses in lexical and syntacticdevelopment. In :
E. Wanner, L.R. Gleitman(eds.), Languageacquisition: The stateof the art, 319- 346. New
York: CambridgeUniversityPress .
Braine, M. and J. Hardy, 1982.On what casecategoriesthereare, why they are, and how they
develop: An amalgamof a priori considerations , speculation , and evidencefrom children. In :
E. Wanner, L.R. Gleitman(eds.), Languageacquisition: The stateof the art, 219- 239. New
York: CambridgeUniversityPress .
Brown, R., 1957 . Linguisticdeterminism and the part of speech . Journalof Abnormaland Social
Psychology 55, 1- 5.
Bruner, J.S., 1975 . From communication to language : A psychological perspective . Cognition3,
255- 287.
Bruner, J.S., 1983 . Child' s talk. New York: Norton.
Chomsky N.
, , 1957 . Syntacticstructures . New York: Mouton Publishers .
Chomsky , N., 1981 . Lectureson governmentand binding. Dordrecht: Foris.
Clark, E.V., 1987 . The principle of contrast: A constraint on languageacquisition . In:
B. MacWhinney(ed.), Mechanisms of languageacquisition , 264- 293. Hillsdale, NJ: Erlbaum.
Clark, H.H. andJ.S. Begun, 1971 . Thesemantics of sentence subjects . Languageand Speech , 14,
34- 46.
Cooper, W. and J. Paccia -Cooper, 1980 . Syntaxand speech . Cambridge , MA : HarvardUniversity
Press .
Dowty, D., 1991 . Thematicproto-rolesand argumentselection . Language67(3), 547- 619.
Dromi, E., 1987 . Early lexicaldevelopment . New York: CambridgeUniversityPress .
Fernald, A. andT. Simon, 1984 . Expandedintonationcontoursin mothers '
speechto newborns .
Developmental Psychology 20( 1), 104- 113.
Fillmore, C.J., 1968 . Lexicalentriesfor verbs. Foundationsof language4, 373- 393.
Fillmore, C.J., 1970 . The grammarof hitting and breaking. In: R. Jacobs , P. Rosenbaum (eds.),
Readingsin Englishtransformationalgrammar, 120- 133. Waltham, MA : Ginn.
Fisher, C., in press. Structureand meaningin the verb lexicon: Input for a syntax-aidedverb
learningprocedure . Languageand CognitiveProcess es.
Fisher, C., 1993 . Preschoolers ' useof structuralcuesto verb . Paperpresented at the 60th
meaning
Annual meetingof the Societyfor Research in Child Development , NewOrleans , LA.
Fisher, C. and H. Tokura, in press . Prosodyin speechto infants: Direct and indirectcuesto
syntacticstructure.In: J. Morgan, C. Demuth(eds.), Signalto syntax. Hillsdale, NJ: Erlbaum.
Fisher, C., H. Gleitmanand L. R. Gleitman, 1991 . On the semanticcontentof subcategorization
frames . CognitivePsychology23, 331- 392.
Fodor, J.A., 1981 . The presentstatus of the innatenesscontroversy . In: J.A. Fodor (Ed.),
Representations . Cambridge , MA : MIT Press .
Fritz, J.J. and G. Suci, 1981 . Facilitationof semanticcomprehension at the one-word stageof
languagedevelopment . Journalof Child Language9, 31 39. -
Gentner, D., 1978 . On relationalmeaning : The acquisitionof verbmeaning . Child Development
49, 988-998.
uentner, D., 1982 . Why nounsare learnedbeforeverbs: Linguistic relativity versusnatural
partitioning. In: S.A. Kuczaj(ed.), Languagedevelopment , Volume2: Language , thoughtand
culture, 301- 334. Hillsdale, NJ: Erlbaum.
Gerken, L., P.W. JusczykandD.R. Mandel, 1993 . Whenprosodyfailsto cuesyntacticstructure:
Nine-month-olds' sensitivityto phonologicalvs. syntacticphrases . Ms., State Universityof
New York at Buffalo.
Geyer, J., L.R. Gleitmanand H. Gleitman, forthcoming.Subcategorization asa predictorof verb
meaning : Evidence from modem Hebrew . Ms., of
University Pennsylvania .
C. Fisheret al. / Syntacticconstraints 373
Gillette, J.A., 1992 . Discoveringthat /think/ means' think' : Theacquisitionof mentalverbs. Ms.,
Universityof Pennsylvania .
Gillette, J.A. and L. R. Gleitman, forthcoming . Effectsof situationalcueson the identificationof
nounsand verbs.
Givon, T., 1986 . The pragmaticsof word order: Predictability , importance , and attention.
Amsterdam : Benjamins .
Gleitman, L.R., 1990 . The structuralsourcesof verb meanings . LanguageAcquisitionI , I- 55.
Gleitman, L.R., H. Gleitman, B. Landauand E. Wanner, 1988 . Wherelearningbegins: Initial
representations for languagelearning. In: E. Newmayer(ed.), The CambridgeLinguistic
Survey , Vol. III : 150- 193. New York: CambridgeUniversityPress .
Goldin-MeadowS., M. Seligmanand R. Gelman, 1976 . Languagein the two-year-old. Cognition
4, 189-202.
Golinkoff, R.MR . C. Jacquetand K. Hirsh-Pasek , in progress . Lexicalprinciplesunderlythe
learningof verbs. Ms., Universityof Delaware .
Gordon, P., 1985 . Evaluatingthe semanticcategorieshypothesis : The caseof the count/mass
distinction. Cognition20, 209- 242.
Grimshaw , J., 1981 . Form, function and the languageacquisitiondevice. In : Ct . Baker, J.J.
McCarthy(eds.), Thelogicalproblemof languageacquisition , 165- 182. Cambridge , MA : MIT
Press .
Grimshaw , J., 1990 . Argumentstructure. LinguisticInquiry Monograph18. Cambridge , MA :
MIT Press .
Gropen, J., S. Pinker, M. Hollanderand R. Goldberg, 1991. Affectedness and direct objects:
The role of lexical semanticsin the acquisitionof verb argumentstructure. Cognition 41,
153- 195.
Grosjean , F. and J.P. Gee, 1987 . Prosodicstructureand spokenword recognition . Cognition25,
135- 155..
Gruber, J.S., 1965 . Studiesin lexicalrelations. PhiD. dissertation , MIT .
Hall, D.G., 1993 . Basic-levelindividuals. Cognition48, 199-221.
Hall, D.G. and S. Waxman , 1993 . Assumptionsabout word meaning: Individuationand basic
levelkinds. Child Development 64, 1550 - 1570 .
Hirsh-Pasek , K. and RiM . Golinkoff, 1991 . Languagecomprehension : A newlook at someold
themes . In: N. Krasnegor , D. Rumbaugh , M. Studderty -Kennedy , R. Schiefelbusch (eds.),
Biological and behavioral of
aspects languageacquisition , 301 - 320 . Hillsdale
, NJ: Erlbaum .
Hirsh-Pasek , K., H. Gleitman, L. Gleitman, R. Golinkoff and L. Naigies, 1988 . Syntactic
bootstrap ping: Evidencefrom comprehension . Paper presentedat the Boston University
Conference on LanguageDevelopment , Boston.
Hirsh-Pasek , K., D. Kemler-Nelson, P. Jusczyk , K. Cassidy , B. Drussand L. Kennedy , 1987 .
Clausesare perceptualunits for younginfants. Cognition26, 269- 286.
Jackendoff , R., 1972 . Semanticinterpretationin generativegrammar. Cambridge , MA : MIT
Press .
Jackendoff , R., 1978 . Grammarasevidence for conceptualstructure.In: M. Halle, J. Bresnan , G.
Miller (eds.), Linguistictheory and psychologicalreality, 201- 228. Cambridge , MA : MIT
Press .
Jackendoff , R., 1990 . Semanticstructures . Cambridge , MA : MIT Press .
JoshiA . and O. Rambow , in progress . Dependency parsingfor phrasestructuregrammars . Ms.,
Universityof Pennsylvania .
374 C. Fisher et alI Syntactic constraints
Naigles , L., A. Fowler and A. Helm, 1992 . Developmental shifts in the constructionof verb
.
meaningsCognitiveDevelopment 7 , 403 - 427 .
Naigles , L., H. Gleitmanand L. R. Gleitman, 1993 . Childrenacquireword meaningcomponents
from syntacticevidence . In: E. Dromi (ed.) Languageand development , 104- 140. Norwood,
NJ: Ablex.
Nelson, K., 1973 . Structureand strategyin learningto talk. Monographsof the Societyfor
Research in Child Development 38.
Nelson, K., 1974 . Concept , word, and sentence : Interrelationsin acquisitionand development .
Psychological Review81, 267- 285.
Newport, E.L., 1977 . Motherese : The speechof mothersto youngchildren. In: N. Castellan ,
D. Pisoni, G. Potts(cds.), CognitiveTheory, Volume2. Hillsdale, NJ: Erlbaum.
Ninio, A., 1980 . Ostensivedefinitionin vocabularyacquisition . Journalof Child Language , 7,
565- 573.
Pinker, S., 1984 . Languagelearnabilityand languagedevelopment . Cambridge , MA : Harvard
UniversityPress .
Pinker, S., 1989 . Learnabilityand cognition. Cambridge , MA : MIT Press .
Quine, W. V. , 1960 . Word and .
object Cambridge , MA : MIT Press .
Rappaport , M., B. Levin and M. Laughren , 1987 . Levelsof lexical representation . Lexicon
ProjectWorkingPapers# 20. Cambridge , MA : MIT Centerfor CognitiveScience .
Resnick , P., 1993 . Selectionalpreferenceand implicit objects . Paperpresentedat the CUNY
Sentence Processing Conference , Amherst, MA.
Schlesinger , I., 1988 . The origin of relationalcategories . In: Y. Levy, I. Schlesinger , M. Braine
eds.
( ), Categories process and es in language acquisition , 121- 178 . Hillsdale , NJ: Erlbaum .
Slobin, D.I., 1975 . On the natureof talk to children. In: E.H. Lenneberg , E. Lenneberg(cds.),
Foundationsof languagedevelopment . New York: Holt, Rinehartand Winston.
Talmy , L., 1985 . Lexicalization patterns Semanticstructurein lexicalforms. In: T. Shopen(ed.),
:
Languagetypology and syntacticdescription , Volume 3: Grammaticalcategoriesand the
lexicon. New York: CambridgeUniversityPress .
Tomasello , M. and A.C. Kruger, 1992 . Joint attentionon actions: Acquiringverbsin ostensive
and non-ostensive contexts. Journalof Child Language19, 311- 333.
Vendier,Z., 1972Rescogitans:An essayin rationalpsychology
. . Ithaca, NY : CornellUniversity
Press .
Zwicky, A., 1971 . In a mannerof speaking . LinguisticInquiry II (2), 223-233.
) 377- 410. North-Holland
Lingua92 ( 1994
Steven Pinker
Departmentof Brain and CognitiveSciences
, Massachusetts
Institute of Technology
, 10-016,
" MA02139
Cambridge , USA
I examine Gleitman' s ( 1990) arguments that children rely on a verb' s syntactic subcategorization
frames to learn its meaning (e.g., they learn that seemeans ' perceivevisually' becauseit can
appear with a direct object, a clausal complement, or a directional phrase). First , Gleitman argues
that the verbs cannot be learned by observing the situations in which they are used, becausemany
verbs refer to overlapping situations, and becauseparents do not invariably use a verb when its
perceptual correlates are present. I suggestthat these arguments speak only against a narrow
associationist view in which the child is sensitive to the temporal contiguity of sensory features
and spoken verb. If the child can hypothesizestructured semantic representationscorresponding
to what parents are likely to be referring to , and can refine such representationsacross multiple
situations, the objections are blunted ; indeed, Gleitman ' s theory requires such a learning process
despite her objections to it . Second, Gleitman suggeststhat there is enough information in a
'
verb s subcategorization frames to predict its meaning ' quite closely' . Evaluating this argument
'
requires distinguishing a verb s root plus its semantic content (what She boiled the water shares
with The water boiled and does not share with She broke the glass), and a verb frame plus its
semanticperspective(what Sheboiled the water shareswith Shebroke the glassand does not share
with The water boiled) . I show that hearing a verb in a single frame only gives a learner coarse
information about its semantic perspective in that frame (e.g., number of arguments, type of
'
arguments); it tells the learner nothing about the verb root s content across frames (e.g., hot
bubbling liquid ) . Moreover , hearing a verb across all its frames also revealslittle about the verb
root ' s content. Finally , I show that Gleitman ' s empirical argumentsall involve experimentswhere
children are exposed to a single verb frame, and therefore all involve learning the frame' s
'
perspectivemeaning, not the root s content meaning, which in all the experiments was acquired
'
by observing the accompanying scene. I conclude that attention to a verb s syntactic frame can
'
help narrow down the child s interpretation of the perspectivemeaning of the verb in that frame,
but disagreewith the claim that there is some in-principle limitation in learning a verb' s content
. Preparationof this paperwassupportedby NIH Grant HD 18381and NSF Grant BNS 91-
09766
. The ideasand organizationof this paper wereworked out in collaborationwith Jane
Grimshaw , and werepresented jointly at the 1990BostonUniversityConference
on Language
Development . I thank Paul Bloom , JessGropen, Gary Marcus, an anonymousreviewer
, and
especiallyLila Gleitmanfor helpfuldiscussions and comments
.
'
from its ~ituatinn ~ of use that could only be resolved by using the verb s set of subcategorization
frames.
'
1. Introduction: The problem of learning words meanings
When children learn what a word means, clearly they must take note of the
circumstancesin which other speakersuse the word . That is, children must
learn rabbit becausetheir parents use rabbit in circumstancesin which the
child can infer that they are referring to rabbits. Equally obviously, learning
word meaningsfrom circumstancesis not a simple problem. As Quine ( 1960),
among others, has noted, there are an infinite set of meanings compatible
with any situation , so the child has an infinite number of perceptually
indistinguishable hypothesesabout meaning to choose among. For example,
all situations in which a rabbit is present are also situations in which an
animal is present, an object is present, a furry thing is present, a set of
undetachedrabbit parts are 'present, a something-that -is-either-a-rabbit -ora -
Buick is present, and so on. So how does the child figure out that rabbit
'
means ' rabbit ' , not ' undetachedrabbit part ?
Word learning is a good example of an induction problem, where a finite
set of data is consistent with an infinite number of hypotheses, only one of
them correct, and a learner or perceiver must guess which it is. The usual
explanation for how people do so well at the induction problems they face is
that their hypotheses are inherently constrained: not all logically possible
hypothesesare psychologically possible. For example, Chomsky ( 1965) noted
that children must solve an induction problem in learning a language: there
are an infinite number of grammars compatible with any finite set of parental
sentences . They succeed, he suggested , because their language acquisition
circuitry constrains them to hypothesize only certain kinds of grammatical
rules and structures, those actually found in human languages, and because
the kinds of sentenceschildren hear are sufficient to discriminate among this
small set of possibilities.
In the case of learning word meanings, too , .not all logically possible
construals of a situation can be psychologically possible candidates for the
'
meaning of a word . Instead, the hypotheses that a child s word learning
mechanismsmake available are constrained in two ways. The first constraint
comes from the representational machinery available to build the semantic
'
structures that constitute mental representations of a word s meaning: a
'
Universal Lexical Semantics, analogous to Chomsky s Universal Grammar
S. Pinker / Verb syntax and verb semantics 379
(see, e.g., Moravscik 1981, Markman 1989, 1990; Jackendoff 1990) . For
' '
example, this representational system would allow object with shapeX and
' ' '
object with function X as possibleword meanings, but not all the undetached
' '
parts of an object with shapeX , object with shapeX or a Buick' , and ' object
'
and the surfacesit contacts . The second constraint comes from the way in
which a child ' s entire lexicon may be built up ; on how one word ' s meaning
'
may be related to another word s meaning (see Miller 1991, Miller and
Fellbaum 1992). For example, the lexicons of the world ' s languages freely
allow meronyms (words whose meanings stand in a part- whole relationship,
like body- arm) and hyponyms (words that stand in a subset- superset relationship
, like animal- mamma/), but do not easily admit true synonyms
Bo
( linger 1977, Clark 1987, Miller and Fellbaum 1991) . A child would
therefore not posit a particular meaning for a new word if it was identical to
some existing word '
' s meaning. Finally , the child would have to be equipped
with a procedure for testing the possible hypotheses about word meaning
against the situations in which adults use the words. For example, if a child
' '
thought that pet meant dog , he or she will be disabusedof the error the first
time the word is used to refer to a fish.
Although the problem of learning word meaningsis usually discussedwith
regard to learning nouns, identical problems arise with verbs (Landau and
Gleitman 1985, Pinker 1988, 1989; Gleitman 1990) . When a parent comments
on a dog chasing a cat by using the word chase, how is the child to know that
it means ' chase' as opposed to ' flee' , ' move' , ' go' , ' run ' , ' be a dog chasing' ,
' chaseon a warm '
day , and so on?
As in the case of learning noun meanings (indeed, learning in general),
there must be constraints on the child ' s possible hypotheses. For example,
manner-of -motion should be considered a possible component of a verb' s
mental dictionary entry, but temperature-during -motion should not be. (See
Talmy 1985, 1988; Pinker 1989, Jackendoff 1990, and Dowty 1991, for
inventories of the semantic elements and their configurations that may
constitute a verb' s semantic representation.) Moreover, there appear to be
constraints on lexical organization (Miller 1991, Miller and Fellbaum 1991) .
For example, verb lexicons often admit of co-troponyms (words that describe
different manners of perfonning a similar act or motion , such as walk-skip-
jog) but , like noun lexicons, rarely admit of exact synonyms ( Bolinger 1977,
Clark 1987, Pinker 1989, Miller and Fellbaum 1991) . Finally , the child must
be equipped with a learning mechanism that constructs, tests, and modifies
semantic representationsby comparing infonnation about the uses of verbs
by other speakersacrossspeechevents (Pinker 1989) .
380 S. Pinker/ Verbsyntaxandverbsemantics
In recent years Lila Gleitman and her collaborators have presenteda series
of thorough and insightful discussionsof the inherent problems of learning
verbs' meanings (Landau and Gleitman 1985, Hirsh -Pasek et ale 1988,
Gleitman 1990, Naigles 1990, Lederer et ale 1989, Fisher et ale 1991and this
volume) . Interestingly, Gleitman and her collaborators depart from the usual
'
solution to induction problems, namely, seeking constraints on the learner s
'
hypotheses and their relation to the learner s input data as the primary
explanation. Rather, they argue that the learner succeedsat learning verb
semanticsby using a channel of information that is not directly semantic at
'
all. Specifically, they suggestthat the child infers a verb s meaning by using
the kinds of syntactic arguments (direct object, clause, prepositional phrase)
that appear with the verb when it is used in a sentence. Such syntactic
properties (e.g., whether a verb is transitive or intransitive ) are referred to in
' ' ' ' '
various literatures as the verb s argument structure , argument frame ,
' ' ' '
syntactic format , and subcategorization frame . 'Indeed, Gleitman and her
collaborators argue that information about a verb s semantics, gleaned from
observing the circumstances in which other speakers use the verb (e.g.,
' '
learning that open means opening becauseparents use the verb to refer to
opening things) is in principle inadequate to support the acquisition of the
verb' s semantics; cues from the syntactic properties of the verb phrase are
essential.
This position has its roots in Brown ( 1957) and Katz et ale ( 1974), who
showed empirically how children use grammatical information to help learn
certain aspectsof word meanings. But it was given a stronger form in Landau
'
and Gleitman s ( 1985) book Language and Experience: Evidencefrom the
Blind Child. Landau and Gleitman point out that a blind child they studied
acquired verbs, even perceptual verbs like look and see, rapidly and with few
'
errors, despite the child s severeimpairment in being able to witnessdetails of
the scenesin which the verbs are used. Moreover, they noted that a sighted
'
child ' s task in learning verbs is different from the blind child s task only in
degree, not in kind . Since the learning of verbs like see and know cannot
critically rely on information from vision, Landau and Gleitman presented
the following hypothesis:
'In essence formatsfor a verbprovides
ourpositionwill bethatthesetof syntactic crucialcues
to theverbmeanings theseformatsareabstract
just because surface esof themeanings
reflex .
... thereis verylittle informationin ' formatthatis for
attested some verb
,
anysinglesyntactic
for that formatserves manydistinctuses. However... thesetof subcategorizationframes
S. Pinker / Verb syntax and verb semantics
1 Note that some of the other linguistic contexts that Landau and Gleitman call ' subcategorization
frames' are not subcategorizationframes either, but frozen expressionsand collocations that
are probably idiosyncratic to English and henceno basis for learning. Theseinclude Look !, See?,
Look! The doggie is running!, See? The doggie is running!, Come see lhe doggie, and look like in
'
the senseof ' resemble. Since look and see are the only two verbs that Landau, Gleitman , and
their collaborators discussin detail, if their learning scenariosfor thesetwo verbs adventitiously
exploit particular properties of English, one has to be suspicious about the' feasibility of the
scenario in the general case. More generally, Fisher, Gleitman, and Gleitman s claim that there
are something like 100 distinct syntactic subcategorization frames, hence, in principle, 2100
syntactically distinguishable verbs, appears to be a severe overestimate. I think most linguists
would estimate the number of syntactically distinct frames as an order of magnitude lower, which
would make the estimated number of syntactically distinguishable verbs a tiny fraction of what
Fisher et aI. estimate.
S. Pinker I Verb syntax and verb semantics
tion frames. She intended the term to suggestan opposition to my ' semantic
'
bootstrapping (Pinker 1982, 1984, 1987, 1989), and one of the sectionsin her
1990paper is even entitled ' Deciding betweenthe bootstrapping hypotheses'.
' '
Though the opposition semantic versus syntactic bootstrapping is catchy, I
suggestit be dropped. The opposition is a false one, becausethe theories are
theories about different things. Moreover, there is no relationship between
what Gleitman calls ' syntactic bootstrapping' and the metaphor of bootstraps
, so the term makes little sense.
Gleitman usesthe term ' semantic bootstrapping' to refer to the hypothesis
that children learn verbs' meanings by observing the situations in which the
verbs are used. But this is not accurate. ' Semantic bootstrapping' is not even
a theory about how the child learns word meanings. It is a theory about how
the child begins learning syntax. ' The bootstrapping problem' in grammar
acquisition (see Pinker 1987) arises becausea grammar is a formal system
consisting of a set of abstract elements, each of which is defined with respect
to other elements. For example, the ' subject' of a sentenceis defined by a set
of formal properties, such as its geometric position in the tree with respectto
the Sand VP nodes, its ability to force agreement with the verb, its
intersubstitutability with pronouns of nominative case, and so on. It cannot
be identified with any semantic role, sound pattern, or serial position . The
bootstrapping problem is: How do children break into the systemat the very
outset, when they know nothing about the particular language? If you know
that verbs agree with their subjects, you can learn where the subjects go by
seeingwhat agreeswith the verb - but how could you have learned that verbs
'
agree with their subjects to begin with , if you don t yet know where the
' '
subjectsgo? How can children lift themselvesup by their bootstraps at the
very outset of languageacquisition, and make the first basic discoveriesabout
the grammar of their languagethat are prerequisite to any further learning?
Pinker ( 1982), following earlier suggestionsof Grimshaw ( 1981), suggested
that certain contingenciesbetweenperceptualcategoriesand syntacticcategories ,
mediated by semantic categories, could help the child get syntax acquisition
started. For example, if the child was built with the universal linking rule that
agentsof actions were subjectsof active sentences , and they could infer from
a sentence's perceptual context and the meaningsof some of its content words
that a particular word referred to the agent of an action, the child could infer
that that word was in subject position . Once the position of the subject is
established as a rule or parameter of the child ' s nascent grammar, further
kinds of learning can proceed. For example, the child could now infer that
any new word in this newly-identified position must be a subject, regardless
S. Pinker I Verb syntax and verb semantics
of whether it is an agent; he or she could also infer that verbs must agree in
person and n':1mberwith the element in that position . SeePinker ( 1984) and
( 1987) for a more precisepresentation of the hypothesis.
The semantic bootstrapping hypothesis does require, as a background
assumption, the idea that the semantics of at least some verbs have been
acquired without relying on syntax. That is becausethe theory is about how
' '
syntax gets bootstrapped at the very beginning of learning; if all word
meaningswere acquired via knowledge of syntax, and if syntax were acquired
'
via knowledge of words meanings, we would be faced with a vicious circle.
The semantic bootstrapping hypothesis is agnostic about how children have
attained knowledge of these word meanings. Logically speaking, they could
have used telepathy, surgery, phonetic symbolism, or innate knowledge of the
English lexicon, but the most plausible suggestion is that the children had
attended to the contexts in which the words are used. Gleitman takes this
'
latter assumption (that the child s first word meanings are acquired by
attending to their situational contexts), generalizesit to a claim that all verb
meanings are acquired by attending to their situational contexts (i.e., even
verbs acquired after syntax acquisition is underway), and refers to the
' '
generalizedclaim as semantic bootstrapping . But this is a large departure
from its intended meaning.
' '
And what Gleitman calls, in contrast , syntactic bootstrapping , is not a
different theory of how the child begins to learn syntax . Thus it is not an
alternative to the semantic bootstrap ping hypothesis. (The only reason
they could be construed as competitors is that semantic bootstrap ping
assumesthat at least some verb meanings can be acquired before syntax,
'
so a very extreme form of Gleitman s negative argument , that no verb
meaning can be learned without syntax, is incompatible with it .) Moreover
' '
, since syntactic bootstrapping is a theory of how the child learns the
meanings of specific verbs, and since it can only apply at the point at
which the child has already acquired the syntax of verb phrases, it is not
' '
clear what it has to do with the bootstrap ping problem or the metaphor
'
of lifting oneself up by one s bootstraps . For these reasons, I suggest that
the term be avoided.
Here is a somewhat cumbersome, but trans~arent and accurate set of
'
replacements . ' Semanticcueing of syntax refers to the semanticbootstrapping
' '
hypothesis. Semantic cueing of word meaning refers to the commonplace
assumption that meaningsare learned via their semantic contexts (perceptual
' '
or linguistic) . Syntactic cueing of word meaning is the hypothesis defended
by Gleitman and her collaborators.
S. Pinker I Verb syntax and verb semantics
'
3.1. Arguments directed against 'observationlearning only refute learning by
associative pairing
The first problem is that Gleitman' s arguments are not aimed at ' observa-
tion ' in general. They are aimed at a particular straw theory of observation.
This foil , a version of one-trial associative pairing , has the child identify a
verb~s meaning with the sensory features activated by the situation at the
moment when a verb is uttered. But no one believesthis particular theory, so
refuting it is ineffective in establishing in -principle limitations on observation;
a few uncontroversial assumptionsmake Gleitman' s objections moot. Let me
examine the arguments in order.
someone who holds his ground), and move will be used for instances of
moving without pushing (e.g., sliding or walking) . To take another one of
Gleitman' s examples ( 1990: 14), even though a single event may be describable
as pushing, as rolling, and as speeding, most events are not. The child
need merely wait for an instance of rolling without pushing or speeding,
speedingwithout pushing or rolling , and pushing without rolling or speeding.
SeeGropen et al. ( 1991a) for experimental demonstrations that children use
this kind of information .
Gleitman ( 1990: 16; seealso Fisher et al. 1991: 380) suggeststhat there are
pairs of verbs that overlap 100% in the situations they refer to. For example,
there can be no giving without receiving, no winning without beating, no
buying without selling, and no chasingwithout fleeing.
In fact, I doubt that pairs of verbs that refer to exactly the same set of
situations exist (or if they do , they must be extremely rare.) Such pairs would
be exact synonyms, and there is good reason to believe that there are few if
any exact synonyms (Clark 1987, Bolinger 1977, Miller and Fellbaum 1991).
To take just theseexamples, I can receivea packageeven if no one gave it to
me; perhaps I wasn' t home. John, running unopposed, can win the election,
'
though he didn t beat anyone, and the second-place Celtics beat the last-place
Nets in the standings last year, though neither won anything. Several of my
gullible college friends sold encyclopediasdoor to door for an entire summer,
but in many cases, no one bought any; I just bought a Coke from the machine
across the hall, but no one sold it to me. If John fled the city , no one had to
be chasinghim ; Bill can chaseFred even if Fred isn' t fleeing but hiding in the
garbagecan.
I would certainly not claim that the learning of all thesedistinctions awaits
the child ' s experienceof the crucially disambiguating situation. But a lot of it
could, and more important , the in-principle arguments for an alternative that
are based on putative total overlap among verb meanings are not valid if
meaningsrarely overlap totally .
3.1.3. Thesubsetproblem
child who mistakenly thought that Verb B had the same meaning as Verb A
could never reject that hypothesis by observing how Verb B is used; all
instances would fit the A meaning, too. The only disconfirming experience
would be overt correction by parents, and there is good reason to believe that
children cannot rely on such corrections. This argument is parallel to one
commonly made in the acquisition of syntax (see, e.g., Pinker 1984, 1989;
Wexier and Culicover 1980; Berwick 1985, Marcus 1993) . For example, move,
walk, and saunter are in a superset relation ; any child that thought that
saunter meant walk would do so forever, becauseall examplesof sauntering
are also examplesof walking .
But this is only a problem if the child is allowed to maintain synonyms in
his or her vocabulary. If children do not like to keep synonyms around (see
Carey 1982, Clark 1987, Markman 1989, for evidencethat they do not ), then
if they have a verb A (e.g., walk), and also a verb B (saunter) that seemsto
mean the same thing, they know something is wrong . They can look for
additional meaning elementsfrom a circumscribed set to make the meaning
of B more specific (like the manner of motion ) . Pinker ( 1989: ch. 6) outlines a
mechanismfor how this procedure could work .
example, with the conclusionsdrawn by Lederer et ale 1989.) All that matters
is whether a child can recognize situations in which that correct concept
applies.
Gleitman ( 1990) then turns to a stronger argument. Even when one
examinesgenuineinstancesof the concept corresponding to a verb' s meaning,
one finds a poor correlation with instances of the parent uttering the verb.
For example, in one study put was found to be used 10% of the time when
there was no putting going on. Similarly , open was used when there was no
opening 37% of the time. As Gleitman notes, this is not a surprise when one
realistically considers how parents interact with their children. When a
mother, arriving home from work , opens the door , she is likely to say, What
'
did you do today?, not I m openingthe door. Similarly , she is likely to say Eat
your peas when her child is, say, looking at the dog, and certainly not when
the child is already eating peas. Indeed, Gleitman ( 1990: 15) claims that
'
positive imperatives pose one of the most devastating challenges to any
schemethat works by constructing word -to-world pairings' .
The problem with this argument is that it , too , only refutes the nonviable
theory of learning by associatepairing , in which verb meanings are acquired
via temporal contiguity of sensory features and utterances of the verb. It
doesn't refute any reasonableaccount, in which the child keeps an updated
mental model of the current situation (created by multisensory object- and
event-pe~ception faculties), including the likely communicative intentions of
other humans. The child could use this knowledge, plus the lexical content of
the sentence, to infer what the parent probably meant. That is, chldren need
not assumethat the meaning of a verb consists of those sensoryfeatures that
are activated simultaneously with a parental utterance of the verb ; they can
assume that the meaning of a verb consists of what the parent probably
meant when he or she uttered the word. Thus imperatives, where the child is
'
not performing the act that the parent is naming, are not ' devastating .
Certainly when a parent directs an imperative at a child and takes steps to
enforce it , the child cannot be in much doubt that the content of the
' '
imperative pertains to the parents wishes, not the child s current activities.
'
Gleitman considers this the ' most serious challenge to the idea that
children learn verb meanings by attending to their nonsyntactic contexts.
Mental verbs like think, know, guess, wonder, know, hope, suppose , and
understandinvolve private events and states that have no external perceptual
S. Pinker / Verb syntax and verb semantics
correlates . Therefore children could not possibly infer their meanings obser -
vationally .
One problem I see with this argument is that although children may not be
able to observe other people thinking and the contents of others ' beliefs , they
can observe themselves thinking and the contents of their own beliefs .
Similarly , children may not know what their mothers are feeling , but they
certainly know what they are feeling . And crucially , in many circumstances so
do their mothers . When a parent comments on what a child is thinking or
feeling , that constitutes information about the meanings of the mental state
verbs they use.
'
Moreover , there surely are ways to infer a person s mental state from his or
her behavior . Indeed , the standard way that humans explain each other ' s
behavior is to assume that it is caused by beliefs and intentions , which can
only be inferred . This must be how adults , during ordinary speech production
, know when to use mental verbs based on their own mental state or
'
guessesabout others , even though there is no obvious referent event . There is
no principled reason that children could not infer meanings of new mental
verbs using exactly the same information that adults employ to use existing
mental verbs accurately .
3.1.6. Does a richer system of mental representation hurt or help the child ?
imperatives), and that open is not used when opening is taking place (e.g.,
when someoneenters the house), if it is relevant at all , pertains in full force to
the ' delicate and specific' aspectsof the meaning of open (i.e., those aspects
that differentiate it from syntactically identical close). Similarly, parents
surely cannot be counted on to use break or tear when and only when
breaking or tearing are taking place, respectively. Nonetheless, Gleitman
concedesthat the meanings specific to open, break, and tear are somehow
learned by observation. Thus it is not true, as she suggests( 1990: 48), that
'
semantically relevant information in the syntactic structures can rescue
'
observational learning from the sundry experiential pitfalls that threaten it .
There are pitfalls , to be sure, but for most of the ones Gleitman originally
discussed , syntax offers no rescue. What we need is abetter , non-associationist
theory of observational learning.
'
3.3. Conclusionsabout Gleitmans argumentsagainst observationallearning
The first question we must ask is, what do we mean by ' a verb' ? The term
is ambiguous in a critical way, becausein most languagesa verb can appear
in a family of forms, each with a distinct meaning component, plus a
common meaning component that runs throughout the family . For example,
many verbs can appear in transitive, intransitive , passive, double-object,
prepositional object, and other phrases. In some cases the verb actually
changesits morphological form acrossthesecontexts, though in English only
the passiveis marked in this way. Following standard usagein morphology ,
we can say that all the forms of a given verb share the same verb root. We
can then call the syntactically distinct forms of a given root its frames. For
example, consider the matrix of verb forms in figure 1, where the existenceof
a given root in a given frame is marked with an ' x' .
The meanings of the x ' s differ along two dimensions. Let me use the term
root meaningto refer to the aspectsof meaning that are preservedin a given
root across all the frames it appears in ; that is, whatever aspectsof meaning
The water boiled and I boiled the water have in common, and fail to share
with The door openedand I opened the door. Let me use the term frame
meaningto refer to the orthogonal dimension: the aspectsof meaning that are
shared across all the roots that appear in a given frame; that is, whatever
aspectsof meaning differentiate The water boiled from I boiled the water, and
that The water boiled and The door openedhave in common.
Note that root meanings are much closer to what people talk about when
worrying about acquisition of word meaning. That is, the main problem in
learning boil is learning that it is about hot liquid releasing bubbles of gas.
This is the aspect of boil that is found in both its transitive and intransitive
uses, that is, its root meaning. The root meaning corresponds to what we
396 S. Pinker I Verb syntax and ve.pbsemantics
Frames
Roots
eat x x
move x x
boil x x
open x x
kill x
die x
think x
tell x
know x x x
see x x x
look x x
Fig. I
think of as the content of a verb. The frame meaning - the fact that there
must be an agent causing the physical change when the verb is used in the
transitive frame, and that the main event being referred to is the causation,
not the physical change- is just as important in understanding the sentence,
but it is not inherently linked to the verb root boil. It is linked to the
transitive syntactic construction, and would apply equally well to melt, freeze,
open, and the thousands of other verb roots that could appear in that frame.
This is a crucial distinction .
The first question that follows is, What can be learned from hearing a verb
in one frame? Something, clearly, for frame semanticsand frame syntax are
highly related. For example, it is a good bet that in A glips B to C, glip is a
verb of transfer. The regularities that licensethis inference are what linguists
call linking rules (Carter 1988, Jackendoff 1987, 1990; Pinker 1989, Gropen et
al. 199Ia) . For example, if A is a causal agent, A is the subject of a transitive
verb. Linking rules are an important inferential mechanism in semantic
bootstrapping (semantic cueing of syntax at the outset of language acquisition
), in predicting how one can use a verb once one knows what it means,
and in governing how verbs alternate between frames (see Gropen et al.
1991afor discussion) .
S. PinkerI Verbsyntaxandverbsemantics 397
One might now think : If syntax correlates with semantics, why not go both
'
ways? If one can infer a verb s syntax from its semantics(e.g., in semantic
'
bootstrapping), couldn t one just as easily infer its semanticsfrom its syntax?
As Gleitman puts it ( 1990: 30) :
' The
syntacticbootstrapping proposalin essence turns semanticbootstrapping on its head.
Accordingto this hypothesis, thechild who understandsthe mappingrulesfor semantics on to
syntaxcan usethe observedsyntacticstructuresas evidencefor deducingthe meanings . The
learnerobservesthe real-world situation but also observesthe structuresin which various
wordsappearin the speechof the caretakers . Suchan approachcan succeed because, if the
syntacticstructuresare truly correlatedwith the meanings , the rangeof structureswill be
informativefor deducingwhichword goeswith whichconcept .'
that the book is interesting; between shoot the man and shoot at the man;
perhaps even between think, eat, and go. Third , the syntax of a sentencecan
help identify which argument can be construed as the agent (viz., the subject)
in caseswhere the inherent properties of the arguments (such as animacy)
leave it ambiguous, for example, in kill versus is killed by, and chaseversus
flee . Similarly , syntactic information can distinguish the experiencerfrom the
'
stimulus in ' psych-verbs with ambiguous roles, such as Bill feared Mary and
Mary frightened Bill . Fourth , syntactic information can help identify which
' '
argument is construed as affected (viz., the syntactic object) in events where
several entities are being affected in different ways. For example, in load the
hay and load the wagon, on cognitive grounds either the hay or the wagon
' '
could be interpreted as affected : the hay, becauseit changeslocation , or the
wagon, becauseit changesstate from not full to full (similar considerations
apply to the .pair of verbsfill and pour. The listener has to notice which of the
two arguments (content or container) appearsas the direct object of the verb
' '
to know which one to construe as the affected argument for the purpose of
understanding the verb in that frame. Gleitman and her colleaguesgive many '
'
examplesof these forms of learning, which I have called reverselinking (see
Pinker 1989 and Gropen et al. 1991a, b for relevant discussion and experimental
data) .
'
Unfortunately , while one can learn somethingabout a verb form s meaning
from the syntax of the frame it appears with , especially when there are a
small number of alternatives to selectamong, one cannot learn much, relative
to the full set of English verbs, becauseof the many-to -one mapping between
the meanings of specific verbs and the frames they appear in. For example,
one cannot learn the differences among slide, roll , bounce, skip, slip, skid,
tumble, spin, wiggle, shake, and so on, or the differencesamong hope, think,
pray , decide, say, and claim; among build, make, knit , bake, sew, and crochet;
among shout, whisper, mumble, murmur, yell , whimper, whine, and bluster;
among fill , cover, tile, block, stop up, chain, interleave, adorn, decorate and
face, and so on. Indeed, Gleitman herself ( 1990: 35) concedesthis point in the
quote reproduced above.
In sum, learning from one frame could help a learner distinguish frame
meanings, that is, what the water boiled has in common with the ball bounced
and does not have in common with I boiled the water. But it does not
distinguish root meanings, that is, the difference the water boiled and' the ball'
bounced. And the root meaningsare the ones that correspond to the content
' ' '
of a verb, what we think of as the verb s meaning , especially when a given
verb root appears in multiple frames.
S. Pinker I Verb syntax and verb semantics 399
The frame meanings (partly derivable from the frame) are closer to the
' '
perspective that one adopts relative to an event: whether to focus on one
actor or another, one affected entity or another, the cause or the effect.
Indeed in some restricted cases, differencesin perspectiveare most of what
distinguishes pairs of verb roots, such as kill and die, pour and fill , or
Gleitman' s example of chaseand flee . Gleitman ( 1990) and Fisher et al. (this
volume) adopt a metaphor in which the syntax of a verb frame servesas a
' zoom lens' for the
aspectsof the event referred to by the verb. This metaphor
is useful, becauseit highlights both what verb syntax can do and cannot do.
The operation of lens when aimed at a given scenegives the photographer
three degreesof freedom, pan, tilt , and zoom, which have clear effects on the
perspectivein the resulting picture. But no amount of lens fiddling can fix the
vastly greater number of degreesof freedom defined by the potential contents
of the picture - whether the lens is aimed at a still life , a nude, a ' 57 Chevy, or
one' s family standing in front of the Grand Canyon.
So I have no disagreement with Gleitman' s arguments that a syntactic
frame can serve as a zoom lens, helping a learner decide which of several
perspectiveson a given type of event (discernedby other means) a verb forces
on a speaker. But becausethis mechanismcontributes no information about
a verb' s content, it cannot offer significant help in explaining how children
learn a verb' s content despite blindness, nor in explaining how children learn
a verb' s content despite the complexity of the relationship between referent
event and parental usage.
The claim that inspection of multiple frames can predict a verb' s meaning
' '
quite closely appearsto contradict the earlier quote in which Gleitman notes
that syntactic information in general is not ' delicate and specific enough to
400 S. Pinker I Verb syntax and verb semantics
" '
distinguish among ... semantically close items . To seeexactly how close the
'
syntax can get the learner to a correct meaning, we must ask, What can be
'
learned from hearing a verb in multiple frames? In particular , can a root
'
meaning - the verb s content - be inferred from its set of frames, and if so,
how?
Unfortunately , though Gleitman and her collaborators give examples of
how children might converge on a meaning from several frames, almost
always using the problematic example of see(seefn. 1), they never outline the
inferential procedure by which children do so in the general case. In Fisher et
al. (this volume) they suggest that the procedure is simply the zoom lens
' '
(single-frame) procedure applied iteratively . They give the procedure as
'
follows : In assigning a gloss to the verb, satisfy all semantic properties
'
implied by the truth conditions of all its observed syntactic frames . But this
cannot be right , for reasons they mention in the next paragraph. The truth
conditions (what I have been calling ' frame meaning' ) that belong to a verb
form in one frame do not belong to it in its other frames. So satisfying all of
them will not give the root meaning or verb' s content. If we interpret
' '
satisfying all semantic properties as referring to the conjunctionof the frame
meanings, we get the meaning of its most restrictive frame, which will be
incompatible with its lessrestrictive frames. For example, the truth conditions
for transitive boil include the presenceof a causal agent. But presenceof a
causal agent cannot be among the semantic properties of boil across the
board, for its intransitive version ( The water boiled) is perfectly compatible
with spontaneous boiling in the absenceof any agent. But if we interpret
' '
satisfying all semanticproperties to be the disjunctionof frame meanings, the
aggregation leads to virtually no inference at all. Consider again the frame
involved in The water boiled. This intransitive frame tells the learner that the
meaning of boil in the frame consists of a one-place predicate. Now consider
a second frame, the one involved in I boiled the water. This transitive frame
tells you (at most) that the meaning of boil in the frame consistsof causation
'
of some one-place predicate. What do they have in common? One-place
' . Which is not
predicate very useful. It says nothing whatsoever about the
root meaning of boil, that is, that it pertains to liquid , bubbles, heat, and so
on.
This is a problem even for verbs that appear in many frames, for which the
syntax would seem to provide a great deal of converging information (see
Levin 1985, Pinker 1989) . For example sew implies an activity . Sew the shirt
implies some activity performed on an object. Sew me a shirt implies an
activity creating an object to be transferred to a beneficiary. Sew a shirt out
S. Pinker I Verb syntax and verb semantics
of the rags implies an activity transforming material into some object. What
'
do theseframe meaningshave in common? Only ' activity ' . Not ' sewing .
' '
The conclusion is clear: you can t derive a verb s root meaning or content
by iterating the zoom lens procedure over multiple frames and taking the
resulting union or intersection of perspectives.
4 ' '
Actually , the question and answer should be stated in tenDs of a family of notions , not
' notion ' becauseverbs like seethat can take either
, objects or clausal complementsdo not exhibit
' ' ' '
a single content meaning across these frames: seeNP does not mean the same thing as see S .
The latter is not even a perception verb: [ seethat the meal is ready does not entail vision. (Clearly
'
not , becauseyou can t visually perceive a proposition .) Similarly , [ feel that the fabric is too
'
smooth does not entail palpation ; it s not even compatible with it . And Listen! [ hear that the
orchestrais playing is quite odd. ( Theseobservations are due to Jane Grimshaw.) Clearly there is
' '
a commonality running through each of thesesets, but it is a metaphorical one; knowing can be
' '
construed metaphorically as a kind of perceiving .
402 S. Pinker I Verb syntax and verb semantics
or because' rnuch of the [sernantic] information can be read off frorn the
subcategorization frarnes thernselvesby a general scheme for interpreting
thesesemantically' (Landau and Gleitman 1985: 142) . Moreover, the premises
that would drive this riddle-solving are far rnore irnpoverished than the
'
prernisesderived frorn inferring a speakers rneaning frorn the context. The
latter can include any concept the child is capable of entertaining (sewing,
boiling , and so on); the former are restricted to a srnaller set of abstract
concepts like causability and taking a propositional argument.
There is a second way that sets of syntactic frarnes could assist sernantic
learning. That is via narrow argument-structure alternations. Often the verbs
that can appear with a particular set of syntactic frarnes have surprisingly
specific rneanings (see Levin 1985, in press; Pinker 1989, for reviews) . For
exarnple, in English, the verbs that can appear in the double-object form but
not the to-object form are verbs of irnplied deprivation like envy, bet, and
.
begrudge(e.g., I envied him his good looks/ envied his good looks of him) .
Sirnilarly, verbs of rnanner of rnotion can alternate betweencausative-transitive
and inchoative-intransitive forms (e.g., I slid the puck/ The puck slid) , but
verbs of direction of rnotion cannot (e.g., I brought the book/ . The book
brought) . An astute learner, in principle, could infer , frorn hearing I glipped
him those things and frorn failing to hear I glipped those things of him, that
glip involves sorne intention or wish to deprive sorneoneof sornething. But
note ' that these regularities are highly specific to languagesand to dialects
within a language. (For exarnple, . 1 suggestedher somethingis grarnrnatical in
Dutch , and . I pushedhim the box is grammatical in sornedialects of English;
seePinker 1989.) Exploiting thern requires first having acquired these subtle
subclass esand their syntacticbehavior in the dialect, presumablyby abstracting
the subclass es frorn the sernanticsand syntax of individual verbs, acquired by
other rneans. This kind of inference dependson a good deal of prior learning
of verbs' rneaningsin a particular language, and thus is rnost definitely not a
'
caseof ' bootstrapping performed by a child to acquire the rneaningsof the
verbs to begin with .
'
In general, learning a verb s content or root rneaning frorn its set of
' '
syntactic frames ( syntactic bootstrapping ) is fundamentally different frorn '
'
learning its perspectiveor frame rneaning frorn a single frarne ( zoorn lens ) .
'
Thus I disagreewith Gleitman s ( 1990) suggestionthat they are versions of a
single procedure, or Fisher et alis suggestion(this volume) that one is sirnply
the iteration of the other. There is a clear' reason why they are different.
While there rnay be a universal rnapping betweenthe rneaning of a frarne and
the syntax of that frame (allowing the lens to zoorn), there.is no universal
S. Pinker / Verb syntax and verb semantics
mapping betweenthe meaning of a root and the set of frames it occurs in (see
Talmy 1985 and Pinker 1989 for reviews) . For examplet universal linking
rules impl Yt roughlYt that an inchoative verb can appear in an intransitive
framet and a causativeverb can appear in a transitive frame. And itts clearly
possible for some roots to be able to have both causative and inchoative
meanings (and hence to appear in both frames) . But itts an accident of
English that slide appears in both framest but come and bring appear in one
each. Thus the kinds of learning that are licensed by universalt reliablet
grammatical linking regularities are restricted to differencesin perspective. A
verbts content is not cued by anyone of its syntactic framest and at best
might be related to its entire set of frames in a tenuoust language-specific
way.
One set of studies (Hirsh -Pasek et ale 1988) presented27-month -olds with
two video screens. One showed a pair of puppets performing a simple action
like flexing their own arms; the other showed one puppet causing the second
to perform the action, by manipulating its arms. A recorded voice used a verb
unknown to the child in one of two syntactic frames, either the intransitive
' ' '
Big Bird is flexing with Cookie Monster or the transitive Big Bird is flexing
'
Cookie Monster . Children looked at the screenshowing the noncausal scene
more often when the sentence was intransitive, and looked at the screen
showing the causal scenemore often when the sentencewas transitive.
Note that this study provided children with only a single frame, not
multiple frames. And note as well that what the children were sensitiveto was
a frame-specific perspective (causation by an agent), not a root -specific
404 S. Pinker I Verb syntax and verb semantics
content (flexing) . The content (flexing) was acquired through observation, not
syntax; it was depicted on the video screen, and the child was watching it .
Thus at best the children were demonstrating use of the zoom lens procedure;
'
there was no opportunity for multiple frames to cue the verb s content. (At
worst, the children were not acquiring any information about the verb at all ,
but were ignoring the verb and merely responding to the transitive and
intransitive sentenceframes themselveswhen directing their attention.)
the meaning of the verb in its particular frame. The sentenceplayed to the
first group of children told them that gorp is a two-place predicate, presumably
a causative. It means either 'cause to pop up and down' or ' cause to make
arm circles' . The sentenceplayed to the secondgroup told them that gorp is a
one-place predicate, presumably an activity . It means either ' make arm
circles' or ' pop up and down' . Once again, the syntactic frame cued only the
coarse information of how many arguments were immediately related by the
verb; the rest came from observation.
One might object at this point that Naigles' s experiment has demonstrated
what I have been arguing is impossible: children appear to have learned
about a verb' s content (in the case, up-and-down versus make-circles) from
the sentencein which it is used, and could not have learned that content from
observation alone. But this is misleading. Successdependedcompletely on the
fact that Naigles engineeredan imaginary world in which perspective and
content were confounded, so that when children were using syntax to choose
the right perspective, they got the right content, too , by happy accident. Note
there is no grammatical constraint forcing or preventing either ' popping up
and down' or ' making arm circles' from being exclusively transitive, exclusively
intransitive, or alternating. Nor is there any real-world constraint that
could cause creatures to make arm circles and to pop up and down in
tandem. But Naigles' s teaching example exemplified both such constraints:
only popping up-and-down was causable, and such causation took place in
the presenceof arm circles. It was only these artificial contingencies that
made the forms learnable by syntax rather than observation. Consider what
would have happenedif the children had been shown a scenedepicting circles
without pushing up and down or vice-versa. In that case, observation would
have been sufficient to distinguish the two actions, with no syntax required.
Now consider what would happen if the children had been shown an arm-
circling rabbit causing the duck simultaneously to pop up and down and to
make arm circles. This is no more or less bizarre than the conjunction that
Naigles did show children, where causation of popping up and down was
simultaneous with uncaused arm-circling . In that case, neither the sentence
The duck is gorping the rabbit nor the sentence The duck and rabbit are
gorping would have distinguished the two kinds of motion . This shows that
syntax is neither necessarynor sufficient- to distinguish the alternative content
meanings of gorp, across all the different scenesin which it can be used;
observation, in contrast, is sufficient. In sum, Naigles simply selected a
contrived set of exposure conditions that penalized observation while letting
syntax lead to the right answer by coincidence.
406 S. Pinker I Verb syntax and verb semantics
there is no reason to think that the retrieval cue that the experimenter
provides now, for existing knowledge, was ever used as a learning cue, in
order to acquire that knowledge originally . Substitute ' transitive syntax' and
' intransitive ' ' ' ' '
syntax for p .... and f .... and one has the Fisher et al.
experiment (this volume) - a test of whether children can use transitivity
correctly as a retrieval cue for previously learned words when the content of
the words is available observationally.
6. Conclusions
References
Chomsky , N., 1965 . Aspectsof the theoryof syntax. Cambridge , MA : MIT Press .
Clark, E.V., 1987 . The principleof contrast: A constrainton languageacquisition . In : B. Mac-
Whinney(ed.), Mechanisms of languageacquisition . Hillsdale, NJ: Erlbaum.
Dowty, D., 1991 . Thematicproto-rolesand argumentselection . Language67, 547- 619.
Fisher, C., H. Gleitmanand L.R. Gleitman, 1991 . On the semanticcontentof subcategorization
frames . CognitivePsychology 23, 331- 392.
Fisher, C., G. Hall, S. Rakowitzand L.R. Gleitman, 1994 . Whenit is betterto receivethan to
give: Syntacticand conceptualconstraintson vocabularygrowth. Lingua 92, 333- 375 (this
volume).
Gleitman, L.R., 1990 . The structuralsourcesof verb meaning . LanguageAcquisitionI , 3- 55.
Grimshaw , J., 1979 . Complementselectionand the lexicon. Linguisticinquiry 10, 279- 326.
Grimshaw , J., 1981 . Fonn, function, and the languageacquisitiondevice . In: C. L. Baker,
J.J. McCarthy(eds.), The logicalproblemof languageacquisition165- 182. Cambridge , MA :
M IT Press .
Grimshaw , J., 1990 . Argumentstructure.Cambridge , MA : MIT Press .
Gropen, J., S. Pinker, M. Hollanderand R. Goldberg, 1991a . Affectedness and direct objects:
The role of lexicalsemantics in the acquisitionof verbargumentstructure.Cognition41, 153-
195.
Gropen, J., S. Pinker, M. Hollander and R. Goldberg, 1991b . Syntax and semanticsin the
acquisitionof locativeverbs. Journalof Child Language18, 115- 151.
Hirsh-Pasek , K., H. Gleitman, L. R. Gleitman, R. Golinkoff and L. Naigles, 1988 . Syntactic
bootstrap ping : Evidence from .
comprehensionPaperpresented at the 13th Annual Boston
UniversityConference on LanguageDevelopment .
jackendoff, R.S., 1987 . Thestatusof thematicrelationsin linguistictheory. LinguisticInquiry 18,
369- 411.
JackendoffiR.S., 1990 . Semanticstructures . Cambridge , MA : MIT Press .
Katz, B., G. Bakerand J. Macnamara , 1974 . What's in a name? On the child' s acquisitionof
properand commonnouns. Child Development 45, 269- 273.
Landau, B. and L. R. Gleitman, 1985 . Languageand experience . Cambridge , MA : Harvard
UniversityPress .
Lederer , A., H. Gleitman and L.R. Gleitman, 1989 . Input to a deductiveverb acquisition
procedure . Paperpresentedto the 14thAnnual BostonUniversityConferenceon Language
Development .
Levin, B., 1985 . Lexical semanticsin review: An introduction. In: B. Levin (ed.), Lexical
semantics in review. LexiconProjectWorkingPapers# I. Cambridge , MA : MIT Centerfor
CognitiveScience .
Levin, B., in press . Englishverbclassesand alternations : A preliminaryinvestigation . Chicago,
IL : Universityof ChicagoPress .
Marcus, G.F., 1993 . Negativeevidencein languageacquisition . Cognition46, 53- 85.
Markman, E.M., 1989 . Categorization in children: Problemsof induction. Cambridge , MA : MIT
Press / BradfordBooks.
Markman, E.M., 1990 . Constraintschildrenplaceon word meanings . CognitiveScience14, 57-
77.
Miller, G.A., 1991 . The scienceof words. NewYork: ScientificAmericanLibrary.
Miller, G.A. and C. Fellbaum, 1991 . Semanticnetworksof English. Cognition41, 197-229.
Moravcsik , J.M.E., 1981 . How do wordsgettheirmeanings ? TheJournalof Philosophy 78, 5-24.
410 S. Pinker / Verb syntax and verb semantics
Naigles. L.. 1990 . Childrenusesyntaxto learn verb meanings . Journalof Child Language17.
357- 374.
. A theoryof theacquisitionof lexicalinterpretivegrammars
Pinker. S.. 1982 . In: J. Bresnan(ed.).
The mentalrepresentation of grammaticalrelations.Cambridge . MA : MIT Press .
Pinker. S.. 1984. Languagelearnabilityand languagedevelopment . Cambridge . MA : Harvard
UniversityPress .
Pinker. S.. 1987. The bootstrap ping problemin languageacquisition . In: B. MacWhinney(ed.).
Mechanisms of languageacquisition . NJ: Erlbaum.
. Hinsdale
Pinker. S.. 1988 . Resolvinga learnabilityparadoxin the acquisitionof the verb lexicon. In :
M. Rice. R. Schiefelbusch (eds.). The teachabilityof language . Baltimore. MD : Brookes .
. Learnabilityand cognition. The acquisitionof argumentstructure.Cambridge
Pinker. S.. 1989 .
MA : MIT Press / BradfordBooks.
Quine. W.V.O.. 1960 . Word and object. Cambridge . MA : MIT Press .
Talmy. L.. 1985. Lexicalizationpatterns: Semanticstructurein lexicalforms. In: T. Shopen(ed.).
Languagetypology and syntacticdescription . Vol. III : Grammaticalcategoriesand the
lexicon. NewYork: CambridgeUniversityPress .
Talmy. L.. 1988 . Forcedynamicsin languageand cognition. CognitiveScience12. 49- 100.
Wexier. K. and P. Culicover. 1980 . Formal principlesof languageacquisition . Cambridge. MA :
MIT Press .
) 411- 430. North-Holland
Lingua92 ( 1994
*
Lexicalreconciliation
Jane Grimshaw
Departmentqf Linguisticsand Centerfor CognitiveScience . 18 Seminary
. RutgersUniversity
Place. NeM' BrunsM'ick. NJO8903
. USA
In the context of current research in lexical representation there are two fundamental ideas
about lexical learning. One holds that the semantics of a word is critically involved in the
acquisition of its syntax, another holds that the syntax of the word is critically involved in the
acquisition of its semantics. This paper examines the positive results and limitations of the two
views, and proposes a reconciliation of the two in which a hypothesized meaning based on
observation is the input to the linguistic mapping principles. Thesederive a predicted s-structure,
checkedagainst observedsyntax. Learning occurs when the two match.
1. Introduction
. This paper grew out of joint work with Steve Pinker, which was presented at the Boston
University Language Development Conference in 1990, and owes much to his input and
influence. Many discussionswith Lila Gleitman helped to clarify the arguments immensely, as did
conversations with Barbara Landau. Thanks are also due to Alan Prince, the Rutgers-Penn
lexicon group. participants at the University of Pennsylvania Lexical Learning Conference in
1992. and the Workshop on Thematic Roles in Linguistic Theory and LanguageAcquisition held
at the University of Kansas in 1993, and an anonymous reviewer.
. Science8 .v . All
- 3841/94/$07.00 @ 1994- Elsevier
0024 rights reserved
-
SSDI 0024 3841( 93) EO045 9-
412 J. GrimshawI Lexical reconciliation
lexical items, and the grammatical processes which involve them (Grimshaw
1990, Jackendoff 1990, Levin and Rappaport 1992, Pinker 1989, Williams , in
press, Zu bizarreta 1987). With this has come the recognition that lexical
systems,just like other linguistic domains, are subject to universal constraints
and a high degree of language-internal consistencey. ( For a particularly
interesting perspectiveon the relationship between the theory of words and
the notion of lexical listing , seeWilliams 1992.) A change in perspectivehas
resulted. If the endpoint of the acquisition processis highly principled, then
the acquisition processitself must be interesting.
The acquisition issuesthat have been addressedin recent researchseemto
fall into two subgroups. First , especiallysince Baker ( 1979) there has been an
important seriesof studies which attempt to uncover the principles behind the
acquisition of patterns of alternation shown by verbs: including Bowerman
( 1974, 1990), Choi and Bowennan ( 1991), Grimshaw ( 1989), Gropen et al.
( 1991), Pinker ( 1989), Randall ( 1990). These questions have been the concern
in particular of theories which attempt to make use of mapping between
semanticsand syntax. Second, some recent work has investigatedthe question
of how the relatively gross meaning of a word can be acquired (Landau and
Gleitman 1985, Gleitman 1990, Naigles 1990, Fisher et al. 1991), looking into
theories which make use of mapping of some kind between syntax and
meaning. Research of the first kind asks how learners determine that give
' '
participates in the dative alternation ( We gave a book to someone , We gave
someonea book) whereas donate does not ( We donated a book to someone ,
* We donatedsomonea book) . Researchof the second kind asks how learners
' ' ' '
detennine that a particular morpheme means give , rather than hold .
Whether these are ultimately different questions, with different answers,
remains to be seen.
The claim that observation of the world is inadequate for the acquisition of
lexical syntax is no surprise - the argument here is just the same as the
argument for any other kind of syntax. The theory of lexical syntax involves
well-formedness principles for argument structure and its interface with
lexical semantics and d-structure: the principles are universal, and not
plausibly learned. With respect to lexical meaning, though, this position is
perhaps more surprising, although arguments from lexical semantic universals
, patterns of errors in learning and so forth , show the effects of va in
J. Grimsha~' I Lexical reconciliation 413
consider
Idealizing ably, wecanpose
thelearning asin (2):
problem
(2) SITUATION\ OBSERVER > WORD
REPRESENTATION
SENTENCE /
The input to the learning systemis a pair of an s-structure, and a representation
of a world situation. This representation will be an interpretation of a
perceived event, for example, based on observation, previous observations,
surrounding discourse, etc. How then does a learner arrive at the lexical
representation for the verb in the sentence, given this kind of learning
situation? Two ideas form the focus of a large amount of current researchon
the question.
The first idea is that analysis of the situation can make it possible to
determine the meaning of a word , and that the meaning of a word in turn
makes it possible to determine its lexicals Yl:ltax. This type of proposal has
been very fully developed by Pinker ( 1989), in an important extension of
earlier work , which concentrated on the question of how meaning might play
a role in allowing the child to perform an initial syntactic analysis upon which
the syntactic system of the languagecould be erected, and on ways in which
the meaning of a verb could make aspects of its syntax predictable to a
learner; Grimshaw ( 1981), Pinker ( 1984, 1989) . The second idea which has
emerged on how linguistic relations play their part in acuisition, is that
analysis of the sentencemakes it possible ta determine (parts of) its semantics
(seeLandau and Gleitman 1985, Fisher et al. 1991, Fisher et al., this volume,
Gleitman 1990) . These two ideas about mapping are sometimes contrasted
'
under the rubrics ' semantic bootstrapping' and ' syntactic bootstrapping (but
seePinker, this volume, for discussion) .
' Semantic '
bootstrapping has one extremely important property : it makes
direct use of the principles mapping from lexical meaning to lexical syntax
discussedearlier. I will not illustrate this in detail - the works referred to
above contain many examples. Nonetheless, it would probably be incorrect to
maintain that an analysis of the situation a/one is the input to the learner.
Becauseof the indirect relationship between events and semantic representations
of verbs, discussedin the previous section, it is not easy for an observer
'
to determine a verb s meaning from an event. Gleitman and colleagues(see
Gleitman 1990and Fisher et al., this volume, for examples) have looked into
this point , showing that eventstypically have multiple construals, hencemany
416 J. GrimshawI Lexical reconciliation
The problem for word learning from world situations is that circumstances
that can be described by one member of the pair of verbs, can often be
describedequally well by the other: seethe discussion of give and receivein
Fisher et al. ,(this volume) for example. The causative entails the change of
state, hencewheneverthe causativedescription is true of some state of affairs
in the world , the change-of -state description is true also (although not vice
versa) . So if a learner guessesthe causative (kill ) when in fact the verb has a
change-of -state meaning (die), and if word learning is based on world
observation alone, recovery is only possible if some situation tells the learner
that even when there is no possibility of construing the event as involving an
' '
agent, die is still used. If a learner choosesthe change-of -state meaning ( die )
' '
for a: morpheme which is in fact the causative ( kill ), there is no way to
correct the mistake, becausethere is no world situation where the inchoative
is inappropriate and the causative appropriate. The correct meanings must
therefore be assignedto the membersof the pairs by some other means, since
learning by world observation seemsat worst to be impossible, and at best to
require that juvenile speakershave accessto a disambiguating situation for
every pair of meanings like this.
But if observation of the world is not enough, other means must be
available for lexical learning. This is where key information contained in the
sentencein (2) offers the most promising avenue for successfulword learning,
very roughly along the lines sketched by Gleitman ( 1990). The fundamental
idea is that the linguistics of words itself makes learning words possible. It is
the language, and not the world , that supports the processof word learning.
Returning to the kill / die problem, although the situations in which the two
morphemesare used are not good sourcesof information about the meanings
of the verbs, the two are linguistically different in a crucial respect, and this
linguistic difference makes word learning possible. What I will for now
term, loosely, the ' transitivity ' of kill and causative melt contrasts with the
J. GrimshawI Lexical reconciliation 417
" '
intransitivity of die and inchoative mell, and this property is correlated with
the meaningsin a way that can be exploited by a languagelearner.
What aspects of verbal meaning can in principle be deduced from the
.
syntactic context a verb appears in ? The proposal advanced by Landau and
Gleitman ( 1985) is that learners use information about the surface syntax of a
clause to determine (aspects of) the meaning of the verb in the clause.
Specifically, they suggestthat the subcategorization frame of a verb contains
the critical information . It is easyto seethat this idea will be useful in solving
the kill / die problem : since kill is subcategorizedfor an NP complement and
die is not , a learner who knew the subcategorizationsin advance could use
them to choose the right morpheme for the right meaning. (Presumably
variability resulting from parametric variation can be factored out of the
situation, and a sufficiently abstract view of the subcategorization sets will
make it possible to treat the subcategorization of a verb in one language as
being the same as the subcategorization of a verb with the same meaning in
another language, despite superficial differencesin the syntactic systemsin the
two cases.) When the issueis consideredin a more precisefashion, however, a
number of considerations suggestthat subcategorization frames are not the
optimal source of information . (See Pinker, this volume, and Fisher et al.,
this volume, for additional remarks on the limitations of frames for learning
semantics.)
First , -observable context alone cannot determine what the subcategorization
frames of a verb are. Arguments figure in subcategorization frames,
adjuncts do not. However both occur on the same side of the verb in a
languagelike English, hencethere is no positional evidenceto distinguish one
from the other. This problem arises wherever adjuncts and arguments have
the same form , e.g. with PPs. Consider the examplesin (4) : while write takes
an optional PP adjunct, put has an obligatory PP argument. Similarly , last
has a temporal argument, while wriggle occurs with a temporal adjunct.
So the fact that a verb takes an NP complement is not very informative as far
as its meaning is concerned. (Of course it is not completely uninformative ,
since it does make it possible to eliminate a number of candidates: ' put ' and
' die' for
, example, are not possible meanings for any of the verbs in (5H7 ) .)
The reason that the syntax here is comparatively uninformative is that many
different meaningsare mapped onto a syntactic expressionlike NP.
Syntactic mapping based on a single subcategorization frame will be
maximally effective when the mapping is one-to-one or just few-to -one. This
is probably the casewith , for example, sentential complements. A verb with a
sentential complement must draw its meaning from a relatively small set of
possibilities, which include verbs of communication (say, announce, state),
verbs of logical relation (entail, presuppose ), and verbs of propositional
attitude (hope, believe) .
Being aware of the limitations and problems of mapping from subcategorization
frames to meaning in cases such as the ones just discussed, the
researchersworking on syntactic mapping have investigated the idea that sets
of subcategorizationframes, and not just single frames, playa role in the
acquisition of verbal meaning (see especially Landau and Gleitman 1985,
Fisher et al. 1991, and the discussion of ' frame range' in Fisher et al., this
volume) . The learner will examine the set of subcategorizationsthat a verb
appearsin , and discover properties of its meaning in this way.
Note that one strong disadvantageof this position is that it is not possible
to learn a meaning for a morpheme on exposure, even repeatedexposure, to a
single sentencetype. Analysis acrosssentencetypes will be required, since the
J. Grimshaw/ Lexical reconciliation 419
entire set of frames that the verb appearsin , or some approximation thereof,
must be observedbefore its meaning can be detennined.
There is an important precondition for the successof the ' sets of frames'
idea. In order for learners to be able to use a verb' s subcategorizationset to
determine its meaning, the learner must know in advancewhich set goeswith
which meaning. This is possible only to the extent that the subcategorization
set/ meaning mapping is cross-linguistically stable, or shows only parametric
variation. So the question is whether UG detenninesthe subcategorizationset
associatedwith a verb, or not. This issueturns out to be highly problematic -
the reason is that the total subcategorizationset of a verb is a function of the
set of subcategorizationsin which each senseof the verb participates. And the
way sensesare distributed acrossmorphemesis not unifonn acrosslanguages.
(This point is also made in Pinker, this volume.)
Let us consider why this should be so. The relevant principles of UG
govern the mapping between the meaning of a verb and its syntax in that
meaning. In addition , UG regulates the set of possible semantic alternations
that a morpheme may participate in , such as the causative/change-of -state
relation. If each morpheme had exactly one sense, or if the alternative senses
of a morpheme were always regulated by UG , this would exhaust the
situation. But in fact, a morpheme can have severalsenses , each with its own
UG detennined syntax. and each participating in alternations in the ways
prescribedby UG . UG says little or nothing about the complete set of senses
the verb has, and therefore little or nothing about the total set of subcategori-
lations of the morpheme. UG only detennines the properties of the individual
sensesand those that are related grammatically.
As an example, consider the verb shoot. It has at least two senses: one
exemplified in She shot the burg/ar , and one in The burglar shot out of the
room. In the first sense, shoot takes an NP complement, in the second a PP.
In both casesthe subcategorizations are highly predictable. shoot- l is like,
say, stab, and takes an individual / NP as its complement, while shoot-2 is like ,
say raced, and takes a directional PP as its complement. The subcategorization
of each senseis completely in accordancewith the theory of grammar,
but nothing about the theory of grammar determines that shoot will have
these two senses . Hence the theory of grammar cannot possibly predict that
shoot will have these two subcategorization frames, and a learner could not
know this in advance.
In caseslike these, the sensesof the verb do not seemto be related by UG
at all , even though they are all realized by a single morpheme. Presumably
they are related by association, which depends on semantic field and other
420 J. GrimshawI Lexicalreconciliation
learning device tracks the syntactic context of the morpheme and attempts to
determine its meaning based on t.his information . It will collect up all
observed subcategorizationsfor a morpheme, disregarding the fact, which it
'
cannot by definition be sensitive to , that verb s meaning differs across these
subcategorizations. It will then assign to the verb the meaning that would be
assigned to a verb which, under a single meaning, occurred in this set of
frames. But this will be incorrect, except in the caseof a verb with exactly one
sense. For verbs with multiple senses , if the procedure succeedsin assigninga
meaning at all , it will give one meaning to the verb instead of several, and the
meaning it gives might well be completely unrelated to any actual meaning of
the verb being learned. Without knowing about meaning it is not possible to
know which subcategorizationsshould be grouped ,ogether, and which shouldbe
kept separate.
What of lexical entries in which the relationship betweenthe alternatives is
regulated by UG , such as the causative/change of state examples discussed
above (e.g. melt/ melt) ? Supposethat there are two subcategorization frames
associatedwith each of these verbs, one transitive and one intransitive. Can
this set of frames be exploited to learn the meaning of the verb? (Note that
the existenceof the alternation cannot be crucial for learning the meaningsof
the members, becausemany verbs, such as kill and die do not alternate, but
are still learned.) Here again the answer is negative, becausethe two occur-
rencesof the morpheme have different meanings, under standard assumptions
about lexical representation. 1 So there cannot be a single sensespecified by
UG as associatedwith this subcategorization set. Note that it also cannot be
the casethat the shared meaning, i.e. what the two casesof the verb have in
common, is learned from the subcategorization set either, becauseall causative
/change of state pairs have the same subcategorization set, even though
each pair has its own meaning.
Clearly what we must aim for is a learning procedure that uses the
alternation as a clue to the semantic analysis of the verbs, that might tell the
learner that these verbs have changeand causedchangemeanings. However,
What will guarantee successis to take into account the properties of the
arguments, and not subcategorization alone; it is the fact that the subject of
one caseof the verb correspondsto the object of the other casesthat reliably
distinguishes change/ causedchange pairs from eat and leave.
One final point concerning the causative/change-of -state pairs. In fact both
are ' transitive' in d-structure, according to the unaccusativehypothesis (Perl-
mutter 1978, Burzio 1986), which means that technically they both have
transitive subcategorizations. Again , this suggests that subcategorization
frames are not exactly the right place for the child to look for help in figuring
out verb meaning.
We can summarizethe conclusionsso far as follows. Mapping from meaning
onto syntax can success fully exploit a set of principles of Universal Grammar
which relate lexical meaning ultimately to surface syntax. However, determining
the meaningjust from observation of the world seemsto require multiple
exposuresacrosssituations in many cases, and may be impossibleunder certain
circumstances , when the world is particularly uninformative. On the other hand,
mapping from syntax onto meaning promises to success fully exploit linguistically
encodedinformation about a verb' s meaning. However as formulated so
far, it seemsto require multiple exposuresacross sentences , and also may be
in
impossible many cases, when syntax is uninformative.
4. Reconciliation
syntax, and to use surface syntax to regulate the semantic analysis. One way
in which it is possible to combine the essentialgood effects of both types of
mapping gives them different roles in the learning process: the semantics-to-
syntax mapping principles provide a predictive mechanism, and the observed
s-structure provides a checking mechanism. This is the basisof Reconciliation.
(Wilkins 1993 explores a rather similar model for lexical learning, with
particular emphasison the acquisition of morphology .)
Reconciliation
(1) The learner interprets a scene or situation , hears a sentence and detects the
verb.
(2) The learner finds a relationship R among participants in the situation
( entities , propositions etc.) that is sensible given the interpretation of the
observed situation .
The learner checks that R involves participants consistent with the content
of the ( candidate argument ) expressions in the sentence, and rejects an R
that does not meet this requirement .
The learner constructs a lexical conceptual structure which is consistent
with R , and assigns candidate argument expressions in the sentence to
argument positions in the lexical conceptual structure .
This lexical conceptual structure is fed through the semantics- to -syntax
mapping principles of UG in their language particular instantiation .
(6) The s-structure predicted by step 5 is compared to the observed s-structure .
( 7) If they do not match then no learning takes place .
(8) If they do match then the morpheme is entered into the lexicon with the
hypothesized lexical conceptual structure .
not through the syntactic fonn of the arguments of the predicate but through
gross consistency between the (candidate) arguments and the relationship
expressedby a candidate R. This step is prelinguistic , in the sensethat it does
not rely on the linguistic representation of the arguments, just on their gross
meaning. It is quite different, then, from the effects of the linguistic mapping
and checking involved in later stages. An interesting question is whether some
' '
of the effectsdescribedin the literature on syntactic bootstrapping are really
due to this (non-linguistic) process, see Pinker (this volume) . At Step 4,
however, the procedure is now linguistic in character. If R is a causal
relationship between two entities, a causative lcs will be constructed. The
system is now working with a linguistic representation, rather than just with
construals of events, and conceptual properties of relationships and entities.
With respect to the final steps, note that an incorrect representation will be
entered into the lexicon, if the match discoveredin R6 is accidental. We must
assumethat general principles of decay will eventually result in eradication of
mistaken entries (cf. Draine 1971), since they will result in matches less often
'
than correct entries. Moreover, the notion of a ' match requires further
explication. The definition could require identity of fonn between the observed
sentence and the predicted sentence, or it could allow for limited
inconsistencies , in particular with respect to various ways of reducing the
number of arguments that are actually expressedin a clause. Imperatives lack
an overt subject, for example, there are elliptical fonns , and there are verbs
like wash and eat, which can be syntactically intransitive while apparently
' '
maintaining a two -place semantic structure. Cognate objects (e.g. to die a
' '
peaceful death) and fake reflexives (e.g. to behaveoneself) pose the opposite
problem, and a more detailed treatment is required than I will give here.
The reconciliation model incorporates aspectsof both semanticand syntactic
' '
bootstrapping . It crucially involves mapping from a posited meaning to a
syntactic fonn . It also exploits the surface syntax to constrain solutions.
A simple result of the model is that the number, the position , and the fonn
of the syntactic arguments of a predicate will constrain the semantic representation
it is given. This is becauseof the grammatical principles regulating the
lcs- syntax relation.
Suppose for example, that a learner hears a sentencecontaining the verb
give: Mary is giving the package to the boy, and observesan event in which
one individual hands a package to another individual . Suppose the learner
interprets the event as involving a three-place logical relationship of transfer
of possession . This ' R ' will be consistent with the content of the candidate
argument expressionsin the sentence: Mary , the package, and the boy, since
J. GrimshawI Lexical reconciliation 425
not a causative, that kill can be causativebut not change-of -state, and that melt
can be both. This can be determinedjust on the basis of a single observation
each for kill and die, and two observationsof melt, one in each complement
structure. A verb which means to causesomething to change state must have
two arguments, a verb which means to change state must have one. Hence,
since kill has two syntactic arguments, it must have two semantic arguments
and cannot have a change-of-state meaning. Since die has one syntactic
argument it must have one semantic argument and cannot be causative. The
syntactic form provides the information that there are two semanticarguments,
which provides the necessaryinformation about possibleverb meanings.
As we saw in section 3, in order to properly identify instances of the
alternation betweencausativeand change-of -state meanings, it is necessaryto
take into account the fact that the subject of one caseof the verb corresponds
to the object of the other : We melted the ice, The ice melted. The properties of
the arguments are essentialto distinguishing theseverbs from verbs like leave
and eat. If intransitive eat is mistakenly analyzed as having a change of state
meaning, its predicted s-structure will have what is eaten in subject position
The ice ate. The observed s-structure will have the eater in subject position
We ate. Hence the wrong analysis will be rejected. Similarly , if a learner fails
to assigna change-of -state analysis to melt, treating it instead as having an lcs
like intransitive leave or eat, the predicted s-structure will have the agent in
- the
subject position ( We melted), while the observed s structure will have
entity undergoing the change of state in subject position ( The ice melted ).
Once again, the error will be avoided.
The syntactic form of an argument will similarly constrain meaning. The
the
syntactic form dependson the semantic properties of the argument, hence
can information about its semantics , and
syntax of an argument provide
hence about the semantics of a predicate. Sentential complements, for instance
, will occur with verbs of propositional attitude (e.g. believe), verbs of
logical relation (e.g. entail) and verbs of saying (e.g. announce ) . Reconciliation
has the desired effect that this fact will prevent certain types of errors by
learners. Supposethe event is one in which a child is playing roughly with a
dog and an adult says either I think that you are being mean to the dog or
' ' and the learner thinks it
That dog will kill you. If the verb means think
' ' - while
means kill , the predicted s structure will contain an NP complement
the observed s-structure will contain a clausal complement . Similarly if the
' kill ' and the learner thinks it means ' think ' , the predicted s-
verb means
-
structure will contain a sentential complement while the observed s structure
will contain an NP.
J. Grimshaw/ Lexical reconciliation 427
S. Conclusion
words which have absolutely identical linguistics. Such setsinclude the setsof
causative verbs (kill , melt, burn) and change-of -state verbs (die, melt, burn) .
Semantics-to -syntax mapping guaranteesthat each member of the set will be
syntactically indistinguishable (just as the names of animals, cat versus dog
for example, are not syntactically distinct ) . So examination of the surface
'
syntax will not inform a learner as to whether a verb means to become
' ' '
liquid or to becomesolid . The general situation is that it is possible to use
the surface syntax to constrain analysesof the semanticstructure of a verb,
but not its semanticcontent: the fact that a verb is a change-of -state verb, but
not the fact that it expresses a particular change of state. This is a matter of
semanticcontent only and is not reflected in the syntax of a verb at all.
Presumably, then, the semantic differences among members of these sets
must be learned from observation about the world in some sense. However,
this does not necessarilymean that the differencesare perceptual in character.
A vast quantity of information about the world can be encodedlinguistically
but is not linguistic itself. Thus a child can observe that melt is used of , for
example, ice, while burn is used of , for example, paper. This is sufficient for
the child to conclude that the meaningsare as they are rather than the other
way around. In this way, it is possiblefor a child to know what a word means
without ever having observed an event which would count as an occurrence
of what the verb describes. Indeed if this were not the case it would be
impo'Ssibleto understand how meaning differencesamong unobservableverbs
are acquired: think, hope, imagine.
For this reason, language is a source of essential information for lexical
learning in two respects. As just discussed, languagecan convey information
about word meaning which is orders of magnitude more informative than
observation of the world can be. Second, by virtue of the grammatical
principles that govern it , language constrains the possible representationsof
words in ways that learnerscan exploit in word learning. Reconciliationis one
way in which this might happen.
References
Baker, C.L., 1979 . Syntactictheoryand the projectionproblem, LinguisticInquiry 10, 533- 581.
Baker, M., 1988 . Incorporation: A theory of grammaticalfunction changing . Chicago, IL :
Universityof ChicagoPress .
Bowerman , M., 1974 . Learningthe structureof causativeverbs: A study in the relationshipof
cognitive, semantic and syntacticdevelopment . Papersand Reports on Child Language
Development 8. StanfordUniversityDepartmentof Linguistics .
J. Grimshaw/ Lexicalreconciliation 429
Procedures
for verb learning
) 433- 470. North-Holland
Lingua92 ( 1994 433
M.R. Brent
Departmentoj Cognitive Science. Johns Hopkins University. Baltimore. MD 21218. USA
1. Introduction
Individual words differ in the syntactic types of the phrases that can
representtheir semantic arguments. For example, watch takes a noun-phrase
argument while look does not.
( I a) JanewatchedBob
( I b) *Janelooked Bob
. This work benefited enormously from conversations with Bob Berwick, Lou Ann Gerken,
Lila Gleitman, Jane Grimshaw, and Elissa Newport . Any remaining faults ... they probably tried
to talk me out of.
. ScienceB.V. All
0024 > 1994- Elsevier
- 3841/94/ $07.00 ce rights reserved
SSDJ0024- 3841(93) EO046- A
434 M . R. Brent I Surface cuesand robust inference
1.1. Theproblem
'
Both semanticand syntactic bootstrapping depend on two-year-olds ability
to learn the subcategorization frames of some words without relying on their
-
meanings. This is obvious in the caseof syntactic bootstrapping if children
are to rely on the subcategorizations of some particular word to infer its
meaning they cannot also rely on its meaning to infer its subcategorizations.
In the case of semantic bootstrapping, the problem is that languages vary
1.2. Hypotheses
This paper investigatesthe possibility that children first learn the syntactic
functions of a few words that are extremely common and highly informative
about syntactic structure, then exploit these words as probabilistic cues to
key syntactic structures in the input utterances. The function morphemes
- prepositions, determiners, inflection , pronouns ,
auxiliary verbs, and
complementizers - are typically the shortest, most common , and most
syntactically informative words in a .language - ideal starting points for
learning syntax (Morgan et al. 1987, Valian and Coulson 1988) . The fact
that children do not use function morphemes consistently until quite late
436 M . R. Brent ! Surface cuesand robust inference
has suggestedto some that young children do not know them. Logically ,
however, children must know some function morphemes by the time they
are learning subcategorization frames - they cannot observe that look
subcategorizes for a prepositional phrase without the ability to recognize
even one preposition . Further , the next section reviews a substantial body of
empirical evidence that English-speaking children know the syntactic privileges
of a number of function morphemes before they are two -and-a-half
years old .
Although function morphemes provide a great deal of infonnation about
syntactic structure, they do not provide enough for complete, unambiguous
syntactic parsing. This paper investigatesthe hypothesis that young children
can learn lexical syntax on the basis of partial and uncertain syntactic
analyses of input utterances, and that they can cope with the resulting
misconstruals using statistical inference.
1.4. Organization
One of the major hypotheses advanced here is that children rely on the
distribution of function morphemesin the early acquisition of subcategorization
frames. This hypothesis is only plausible if young children recognize the
syntactic privileges of function morphemes. In addition , the model presented
in section 3 assumesthat young children know some proper names by the
time they need to learn syntactic frames. Finally , the model assumesthat
children can detect the ends of utterances using prosodic cues such as pause
M. R. Brent! Surface
cuesandrobustinference 439
length, pitch , vowel duration , and volume. This section reviews the empirical
basesfor theseassumptions.
The fact that young children do not use function words reliably has
sometimesbeenexplainedby claiming their grammarslack function morphemes
'
( Radford 1988, 1991). It has been further proposed that children s putative
ignorance of function morphemesresults from a perceptual failure - they do
not hear or cannot distinguish function morphemes in most languages
becausethey are unstressed(Gleitman et al. 1988) . However, a substantial
body of convergentevidencehas accumulatedagainst eachof thesehypotheses .
Briefly, it has been shown that :
The direct evidence that children are sensitive to the presenceand correct
usageof function morphemesis summarized below:
( I ) On imitation tasks, young children (age 1; 11- 2;6, MLU 1.30- 2. 00, M =
1.73) omit English function morphemes more often than unstressed
nonsense syllables (Gerken et al. 1990) . This effect persists when the
sentencesare synthesizedautomatically to control for possible prosodic
differencesbetweengrammatical and ungrammatical sentencesspoken by
humans (age 2;0- 2;6, MLU 1.57- 2. 60, M = 2. 07), and when the nonsense
words are composed of phonemic segmentssimilar to those of English
function morphemes (age 2;0- 2;6, M MLU = 2.20) . This indicates that
children younger than two-and-a-half distinguish between English function
morphemes and similar nonsense syllables - i.e. they know the
440 M. R. BrentI Surfacecuesandrobustinference
2.2. Prosody
3. An implementation
3.1. Overview
This subsection explains how the model works in general terms ; the following
subsections fill in the details .
First , note that subcategorization frames , as described by Chomsky ( 1965),
do not distinguish between tensed clauses and infinitives - both are described
as sentences. Sentential complements are realized as tensed clauses or infinitives
as a result of lexical features and transformations . The cues described below ,
however , recover only the syntactic forms of arguments as they occur in
utterances . (Boguraev and Briscoe , 1987, provide evidence that it is usually
'
possible to recover the lexical features required in Chomsky s analysis from
surface argument types mechanically , but that question is not addressed
here.) In addition , this implementation recovers some information that is
-
usually thought of as function word selection , rather than subcategorization .
To avoid confusion , the representations recovered in this implementation are
referred to as syntactic frames , or simply frames . The syntactic frames
inves.tigated here , excluding selection for prepositions and complementizers ,
are shown in table I .
Table I
The syntacticframesstudiesin the simulation
( I ) Which words have occurred both with and without -inK, not counting
occurrencesprecededby a preposition or a detenniner. These words are
.
treated as having verbal senses
(2) How many times each word with a verbal sense has occurred, not
c' Ountingoccurrencesprecededby a preposition or a detenniner.
( ) How many of those occurrenceswere followed by cues for each of the
3
syntactic frames.
Table2
, for the first 7Sverbsin alphabeticalorder
, beforestatisticalprocessing
Rawobservations
V NP NPNPNPclNPinNPinfNPonNPtoNPwhcl in inf on to wh
add
21
-
ask
-
be 118
blow 17
bring 52
brush 54
button 25
buy 12
call 88 19
carry 6 3
catch 13 5
cheat 2 1
chew 15 1
clean 10
close 97 43
color 99 1
comb 25 6
come 298 6 2 1 1
cook 19 1
cry 42 5
cut 16 5
dance 21 1
do 142418410 162 1 2 7
draw 23 I
drink 33 4 1
drive 19 4 3
eat 254 64
even 13
fall 46
feed 25 4
feel 55 15
find 103 34 2
fish 17
fix 32 15
fly 6
forget 6 I
get 398 84 3 3 9 3 I
give 91 4 8 I
go 982 5 24 113 3 26
grow 3
have 506 59 3 S6 3
446 M.R. BrentI Surfacecuesandrobustinference
Table2 (Cont.)
pretend
pull S7 9 I I
push 44 S I I
put 99S46 I 2S 5
rain 7
for an infinitive . The observations table is the input to the statistical inference
procedure.
The statistical inference component of this implementation has two parts :
one estimatesthe error rate for each frame, and the other usesthis estimate to
weigh the evidenceprovided about that frame by multiple occurrencesof each
verb. In particular , the latter component considers the number of times the
M . R. Brent I Surface cuesand robust inference 447
verb occurred with cues for the frame and the number of times it occurred
without , much as one would consider the number of headsand the number of
tails turned up by a coin to determine whether or not it is fair . The method of
estimating the error rate was designed specifically for this implementation,
while the method for weighing the evidence in light of the error rate is a
standard one.
The error estimation procedure is based on some simplifying approximations
. First , for each frame , a single source of error dominates all
others in terms of frequency. Second, errors from this source are distributed
evenly across verbs. Third , verbs that in fact have a given frame
occur with cues for that frame with significantly higher probability than
verbs that do not have it . Put another way, false alarms are less frequent
than true alarms. To the extent that these approximations hold , the verbs
whose co-occurrence
' with cues for a given frame consist entirely of false
alarms should have a relatively low rate of co-occurrence with those cpes,
and their rates of co-occurrence should be distributed binomially . These
facts can be used to identify a sample of verbs whose co-occurrence with
cues for a frame consist entirely of false alarms, or miscues. This sample
can then be used to estimate the error rate.
The remainder of this section provides a more detailed discussion of the
implementation and the rationale behind its design. Readers interested
primarily in the experimental results can proceed to section 4 without loss
of continuity .
(3a) is grammatical because the verb pretend takes a tensed clause (CP)
argument; (3b) is ungrammatical because the verb play does not ; (3c) is
grammatical becauseplay is functioning as a noun and the following clauseis
448 M . R. Brent / Surface cuesand robust inference
a relative clause rather than an argument of play . Any common noun can
have a relative clause, so noticing a tensedclauseis not informative about the
syntactic frames of a preceding noun, but it is informative about those of a
preceding verb.
A similar difficulty arises from the fact that , in English, verbs can take NP
complementsbut nouns cannot:
'
(4a) We plannedto [" purchase ] [NPthe world s largestbank]
'
(4b) . Our [N purchase
] [NPthe world s largestbank] wasprofitable
' '
(5) Our [N purchase] of [NP the world s largest bank] was profitable
Thus if observations of purchase the noun and purchase the verb were not
distinguished, they would provide contradictory evidence about whether or
not purchasetakes an NP complement. This has consequencesfor nouns that
are not ambiguous as well - cues indicating that a word has an NP
complement should be disregarded in the presenceof evidencethat the word
is a noun.
The proposed statistical inference method adds to the importance of
distinguishing between nouns and verbs. It is based on the simplifying
assumption that miscuesfor a given frame are distributed fairly evenly across
words. This assumption is violated if data from nouns and verbs are pooled.
To see this, suppose that no discrimination were made between nouns and
verbs. Further , suppose that a clause following a word W were taken as
evidencethat W takes a tensedclause argument. Example (3) shows that the
likelihood of that evidencebeing incorrect is much higher if W is a noun than
if W is a verb. Thus, distinguishing betweennouns and verbs is important for
obtaining a relatively uniform distribution of errors. This would be true even
if there were no lexical ambiguity .
Cuesfor finding verbs. In English, there are several function -morpheme
cues that provide evidenceabout major category, but none of them is ideal.
In general, the simplest cues tend to be either less reliable or less common
than more complex ones. For example, a simple, reliable cue is that words
following a form of be and ending in - ing (progressives) are very likely to be
verbs. Unfortunately , a number of words occur in the progressiveconstruc-
.MR . Brent / Surface cuesand robust inference 449
tion very rarely, if ever. These include some of the most common and most
syntactically interesting words in child -directed speech, such as know, want,
like and see.
The following two-part procedure for identifying verbs is a reasonable
compromise among simplicity , frequency, and reliability : first , identify
words that occur as verbs ; second, when such a word occurs, assume it is
functioning as a verb , except in the presence of evidence to the contrary .
All words that can occur as verbs can occur both with and without the
final syllableing , and all but a few words with that property can occur as
verbs. Even mental state verbs like know, which rarely occur in the
progressive, have nominalizations ending in -ing (Knowing how to read a
map is useful) . A learner who has identified potential verbs can now
proceed by trying to weed out their non -verbal occurrences. Occurrences
following determiners
' , prepositions , or non -auxiliary verbs, for example,
are unlikely to be verbal .
The simulations are based on the -ing cue for potential verbs and the
determiner and preposition cues for non -yerbal occurrences.2 However ,
these cues are not without difficulties . Nominalizations of mental states
like knowing, linking , and wanting appear to be rare in child -directed
speech, perhaps becausethey are semantically abstract. A second problem
is that some ambiguous words occur as nouns more often than they occur
as verbs, even when occurrences following determiners and prepositions
are excluded.
2 In ' '
speechthere is only one allomorph of the morpheme ling / . In written English the ing
suffix can induce spelling changes at the end of the stem, and the simulation includes some
techniques for negotiating those spelling changes. Description of these techniques has been
omitted since the orthographic problem they solve does not occur in auralleaming .
450 M.R. Brent/ Surfacecues and robust inference
Table3
The cues used by the simulation for identifying complements
. LNP (Lexical NP) includes
pronounsand personalnames. NPLE (NP left edge) includesLNP and the determiners
(6) John put [NP the toy Mary wanted] on the floor
3.3. Robustinference
4 Given a verb V , the outcomes of the coins for different S' s are treated as approximately
independent, even though they cannot be perfectly independent. Their dependencecould be
modeled using a multinomial rather than a binomial model.
M . R. Brent I Surface cuesand robust inference 453
n '.
P (m,nip) = pm(lp ) n- m
m! (n - m) !
n
P(m+ ,nip) = LP (i ,nip)
ism
Analogously, P(m + ,nix _s) gives the probability that m or more occurrences
of a - S verb V will be followed by a cue for S out of n occurrencestotal .
If m out of n occurrences of V are followed by cues for S, and if
P(m+ ,nix _s) is quite small, then it is unlikely that V is - So That is, the
observeddata would be quite unlikely if V were - S and hencehad probability
X- s of being followed by a cue for S. Traditionally , a threshold less than
or equal to 0.05 is set, such that a hypothesis is rejected if , assuming the
hypothesis were true, the probability of outcomes as extreme as the observed
outcome would be below the threshold. The confidence attached to this
conclusion increasesas the threshold decreases .
4. Experiment
once for drawing lexical conclusions. On the first pass, a verb is ignored after some preset sample
size has been reached. Multiple passesare not possible for a child leamer, so a new variant of the
procedure was developed for the current problem.
M.R. Brent/ Surfacecuesandrobustinference 457
4.1. Methods
The input corpus for the experiment consists of 31,782 utterancesby adult
caretakers to children between 1;0 and 2;6. The utterances were taken from
the CHILDES database(MacWhinney 1991). The particular trtanscripts were
as follows : all of Bates's ' free play' and ' snack' transcripts, but not the
reading aloud transcripts (Bates et al. 1988); all of the Bernstein- Ratner
transcripts ( Bernstein- Ratner 1987) ; all of the Higginson transcripts for
children betweenages 1;0 and 2;6 (Higginson 1985) ; and the Warren-Leubec-
ker (Warren- Leubecker and Bohannon 1984) transcripts for children 2;6 and
younger. In all cases, exactly one adult and one child were present throughout
. All lines except the adult ' s speechwere removed. The pause markers
( # ) were left in and taken as utterance boundaries, along with the ordinary
punctuation marks. All diacritics between square brackets were removed.
Angle brackets (used as scope markers for diacritics) were removed but the
words between them were not. A few changeswere made to enforce consistent
'
use of the compound marker ' + ' , including the placement of a ' + '
rather than a space between two-part names like Santa Claus and Mickey
Mouse. Otherwise, transcription errors were not corrected.
Collectingobservations
. The observations were collectedexactly as described
in section3.
Statistical inference. The algorithm given in the Appendix was used with I
= 40 and minsample = 40. Given the estimated error rate for each
syntactic frame, verbs are reported as + S if their rate of co-occurrencewith
cuesfor S would have probability below 0.02 under the null ( - S) hypothesis.
Vocabu/ary . I n order to make use of the cues, the learner needs to know
some function morphemes. In this experiment the program made use of an
initial vocabulary chosen from the most common words in the corpus
'' ' ' ' ' ''
(table 4) . Besidesthe four utterance-boundary markers, . , 1 , # , and ! ,
these include the pronouns you, that, it , this, I , we, he. and they; the
determiners the, a, and your ; the complementizerswhat, that, and where; and
458 M . R. Brent / Surface cuesand robust inference
Table 4
The SOmostcommonorthographicforms in the corpus
the prepositions to, in and on. Auxiliaries and some modals are also common,
and these can be exploited by cues. However, one of the goals of this
experiment was to determine how little lexical knowledge the child could
make do with , so these function morphemeswere not exploited. The demonstrative
' '
pro -PPs here and there were not exploited either, though they would
probably be very useful for learning about locative PPs. Interjections, negations
, and conjunctions are of little use. The fifty most frequent words also
include the open classverbs put , look, want, and like, but the simulation does
not know any open class words at the outset.
In addition , the simulation started with a lexicon of all 158 proper names
that occurred at least twice in the corpus.
4.2. Results
~I0
Table5
The 76 verbs that were assignedat least one syntactic frame
QQ
~~~~-~~ l
NP Z NPNP NPin NPon NPto NPwh cl in inf on to wh
NPwh
~
Z
ZZZZ
:(~
z~0
ZQZQ
ZZZZ
ZZZZZZZZZZZZZZ
Q Q
NPNP
z&0 z!0
.c
NPNP
ZZZ
Q
z ~0c
ZZQ
zzzz
~
"
460 M . R. Brent I Surface cuesand robust inference
Table5 (Coot
.)
NP NPNP NPin NPon NPto NPwh cl in inf on to wh
pull NP
push NP
read NP NPNP
record wh
ride NP in
rock NP
roll NP NPto
say NP
set NP
sew NP
sing NP NPNP
sit in on
sleep on
smell NP
spill NP
NP in
splash
start inf
take NP
talk on to
tell NP NPwh
think . cl
throw NP NPto
touch NP
trot inf
NP inf
try
turn NP
understand NP wh
wait NP
walk in on
wash NP
watch NP wh
water on
wear NP
Table6
The 50 words that were identified as verbs but were not assignedany syntactic frames
add button clean color come cook dancefraw evenfall fish fly forget grow hide inventjump keep
kick knock laugh lay lie move mow nap pick pitch pour rain ring rub run scratch share shavesnow
stack stand star step stick swim swing umm wake wavework write zip
M . R. Brent / Surface cuesand robust inference
These results should be interpreted in light of the fact that the statistical
' or
inference technique does not answer 4yes 4no' as to whether a given verb
has a given frame. Rather, it answers4yes ' or 4noreliable indication of
yes so
'
far .
It should also be noted that a number of the verbs discovered have
auxiliary or modal functions, including be, do, go and have. These are
generally omitted from the following discussion, since they have special
properties that must in any case be learned by some process that does not
directly involve subcategorization frames.
Now consider the results for each syntactic frame.
4.3. Discussion
clause, and come with a cue for an infinitive . As a result of the inference
procedure, none of these miscues lead to errors in the lexicon shown in
table 5.
This investigation began with the question, How could children possibly
acquire their first subcategorization frames? Knowing verb meanings does
not help until the language-particular correspondencebetween meaning and
subcategorizationalternations can be learned. Direct observation of syntactic
structures might work , except that it is difficult to imagine how children
could parse an utterance reliably without already knowing the syntactic
properties Qf the words in it . However, function morphemes are among the
most frequent morphemes, and they would seem to provide a good deal of
-
probabilistic information about syntactic structure. Further , English speaking
children appear to have passiveknowledge of the syntactic privileges of some
individual function morphemes by age two -and-a-half. The question is how
useful the information carried by function morphemes is and how it can be
exploited to learn subcategorization frames reliably .
To answer that question, a collection of simple cues was devised for
English, using a vocabulary of twenty frequent function morphemes, under
the assumption that children recognize the common proper names in their
environment as NPs and can detect utterance boundaries prosodically .
Becausethe information provided by these cues is probabilistic , inferences
must be drawn from multiple occurrencesof each verb. Thus, a statistical
inference procedure was developed. The cues and the inference procedure
were simulated on transcripts of child -directed speech. This experiment
showed that probabilistic surface cuescombined with statistical inference can
determine syntactic frames with surprisingly high accuracy. Thus, young
children could learn subcategorizationframes from sentenceswhosemeanings
-
they do not fully understand. There is no doubt that two- year olds comprehend
many utterances, and this probably helps them parse in some cases. However,
the current experimentssuggestthat they can also learn from utterancesthey
do not understand and cannot parse completely.
However, some important questions remain open. Specifically, the hypothesis
that young children can use their knowledge of function morphemesas
cues for specific phrasal categories remains unproven. Even if they can, one
would still want an explanation of how English-speaking children come to
M.R. BrentI Surface
cuesandrobustinference 465
For Veverbs
Add in verbs that were + 8 under the previous hypothesis about the ::i:Scutoff,
but are - 8 under the new hypothesis
For Veverbs
f (ViS
verb - bin = roundoff ( bin - count *
N( V )
If ( verb - bin = cutoff - bin ) And ( N ( V ) ~ min - sample )
observed - dist [ verb - bin ] = observed - dist [ verb - bin ] + 1
total - cue - occurrences = total - cue - occurrences + f ( V, S )
total - verb - occurrences = total - verb - occurrences + N(V )
- -
raet = total - cue -occurrences
total verb occurrences
For Veverbs
distribution for verb V, scale it to bin - count bins, and add it to the
mixture.
References
Adams, L. and T. Macfarland, 1991 . Testingfor adjuncts . In: Proceedings of the 2nd Annual
Meeting of the Formal LinguisticsSocietyof Midamerica . Formal LinguisticsSocietyof
Midamerica .
Bates, E., I. Brethertonand L. Snyder , 1988 . From first wordsto grammar: Individualdifferences
and dissociable mechanisms . Cambridge , MA : CambridgeUniversityPress .
Bernstein -Ratner, N., 1987 . The phonologyof parentchild speech . In: K. Nelson, A. vanKleeck
'
(eds.), Childrens language , Vol. 6. Hillsdale, NJ: Erlbaum.
Bever, T.G. , 1991 . Thedemonsand the beast- Modularand nodularkindsof knowledge . In: N.
, N. Sharkey ( .
eds), Connectionist approach es to language processing, 212- 252 . Hillsdale,
Reilly
NJ: Erlbaum.
Boguraev , B. and T. Briscoe , 1987 . Largelexiconsfor naturallanguageprocessing : Utilising the
grammarcodingsystemof Idoce. ComputationalLinguistics13(3), 203- 218.
Brent, M.R., 1991 . Automaticacquisitionofsubcategorization framesfrom unrestrictedEnglish.
Ph.D. thesis , Massachusetts Institute of Technology .
Brent, M. R., 1993 . Fromgrammarto lexicon: Unsupervised learningof lexicalsyntax. Computa-
tional Linguistics19, 243 262. -
Chomsky , N., 1965 . Aspectsof the theoryof syntax. Cambridge , MA : MIT Press .
M . R. Brent ! Surface cuesand robust inference 469
Chomsky . N.. 1981 . Lectureson government and binding. the Pisalectures , Volume9 of Studies
in generative grammar. Dordrecht: Foris.
Chomsky . N., 1986 . Knowledgeof language . Convergence . Westport, cT : Praeger .
Demuth. K.. 1993 . Issuesin the the acquisitionof the sesothotonal system . Journalof Child
Language20. 275- 301.
Fernald. A. and P. Kuhl, 1987 . Accousticdeterminants of infant preference of motheresespeech .
Infant Behaviorand Development10, 279- 293.
Fisher. C.. H. Gleitmanand L.R. Gleitman, 1991 . On the semanticcontentofsubcategorization
frames . Journalof CognitivePsychology 23(3), 331- 392.
Fisher. C. and H. Tokura, 1993 . Acousticcuesto clauseboundariesin speechto infants: Cross-
linguisticevidence . Manuscript , Departmentof Psychology , Universityof Illinois.
Gelman. S.A. and M. Taylor, 1984 . How two-year-old childreninterpretproper and common
namesfor unfamiliarobjects . Child Development 55, 1535 - 1540 .
Gerken. L.. 1992 . Young children's representationof prosodic phonology: Evidencefrom
-speaker ' s weak
English syllableproductions . Departmentof Psychology , SUNY Buffalo.
Gerken. L.. B. Landauand R.E. Remez , 1990 . Functionmorphemes in youngchildren's speech
perceptionand production. Developmental Psychology 26( 2), 204- 216.
Gerken. L. and B.J. McIntosh, 1993 . The interplayof functionmorphemes and prosodyin early
language . Developmental Psychology 29, 448- 457.
Gleitman. L.. 1990. The structural sourcesof verb meanings . LanguageAcquisition 1( 1), 3-
56.
Gleitman, L.R., H. Gleitmanand E. Wanner, 1988 . Wherelearningbegins: Initial representations
for languagelearning, 3- 48. New York: CambridgeUniversityPress .
Gold. E.M.. 1967 . Languageidentificationin the limit. Informationand Control 10, 447- 474.
Grimshaw . J., 1981 . Form, function, and the languageacquisitiondevice, 165-210. Cambridge ,
MA : MIT Press .
Gropen. J.. S. Pinker. M. Hollanderand R. Goldberg, 1991 . Affectedness anddirectobjects: The
role of lexicalsemanticsin the acquisitionof verb argumentstructure . Cognition41, 153-
195.Higginson . RiP.. 1985 . Fixing-assimilationin languageacquisition . PhiD. thesis , Washington
StateUniversity.
Hirsh-Pasek . K., D.G. Kemler-Nelson, P.W. Jusczyk , K. Write and B. Druss, 1987 . Clausesare
perceptualunits for younginfants. Cognition26, 269- 86.
Jusczyk . P.W.. K. Hirsh-Pasek , DiG. Kemler-Nelson, L. Kennedy , A. Woodwardand J. Piwez,
1993 . Perceptionof accousticcorrelatesof major phrasalunits by young infants. Cognitive
Psychology (in press ).
Jusczyk . P.W. and E. Thompson , 1978 . Perceptionof a phoneticcontrast in multisyllabic
utterances by 2-month-old infants. Perceptionand Psychophysics 23, 105- 109.
Katz. N.. E. Bakerand J. MacNamara , 1974 . What' s in a name? A studyof how childrenlearn
commonand propernames . Child Development 45, 469- 73.
Kemler-NelsonD.G., K. Hirsh-Pasek , P.W. Jusczykand K.W. Cassidy , 1989 . How theprosodic
cuesin motheresemight assistlanguagelearning. Journalof Child Language16, 55- 68.
Kimball. J., 1993 . Sevenprinciplesof surfaceparsingin naturallanguage . Cognition2( 1), 15- 47.
Landau, B. and L. Gleitman, 1985 . Languageand experience . Cambridge , MA : Harvard
UniversityPress .
Lederer. A. and M.H. Kelly, 1992 . Prosodicinformation for syntacticstructurein parental
speech . Manuscript , Departmentof Psychology , Universityof Pennsylvania .
M . R. Brent I Surface cuesand robust inference
Mark Steedman
Departmentof Computerand Information Science , 200 South
, Universityof Pennsylvania
33rdStreet, Philadelphia
, PA -
191046389, USA
1. Introduction
The question of how children acquire lexical entries for verbs, and in
particula~ their subcategorisation frames is one of the central questions
'
concerning the child s acquisition of syntax. Its importance is enhanced by
the recent tendency in theories of grammar to gravitate to a lexicalist
position, and the role of verbs as the head of their clause. How do children
do it , given the non-determinacy and automata-theoretic complexity of the
syntax itself, and the unsystematic presentation and error-proneness of the
'
linguistic data that they apparently have to make do with ? Michael Brent s
paper in this volume shows how the statistical technique of binomial error
estimation can be used to minimise the effect of contamination in the data
available to the child languagelearner, arising either from errors in the input
itself or errors in the child ' s analysesof the input sentences
. The technique is
demonstratedby applying it to the sentencesof a corpus of actual adult- child
conversations, to derive subcategorisation frames for verbs from analyses
basedon imperfectly reliable local syntactic cues defined in terms of sequences
of inflectional morphemes, function words and lexical NPs. As Brent
points out , these two aspects of the work are quite independent : binomial
error estimation could be used to minimise the influence of errors arising
from imperfect analysis procedures of any kind at all , including those
based on semantic and prosodic information , as well as syntactic . The
present paper considers the part that all of these sources of information
may play .
2. Syntax
How can the child avoid erroneously subcategorising meet like put ? Brent
'
points out that it doesn t matter if they do, because it is (presumably
universally) the case that the relative frequency with which the sequence
V N P PPOll occurs will be significantly higher for verbs like put that
subcategorisefor NPs and on PPs than for those like meet which subcate-
gorise for NP and only allow PP as an adjunct. Binomial error estimation is
able to distinguish the two distributions , and reject the child ' s spurious
evidence from analyses suggesting that meet subcategorises for the PP.
Similar results seemto follow for spurious occurrencesof subcategorisations
arising from extraction, as in who did you put on the bus, which might appear
otherwise to suggestthat put might subcategorisefor PP alone.
It is therefore reasonableto ask whether the child languagelearner actually
makes use of such purely syntactic cues to learn the lexical categories of
verbs. Here Brent is extremely cautious, and goesout of his way to acknowledge
the possible involvement of prosodic and semanticcuesas well. He notes
in passing that there are a number of open questions that need answering
before we can be quite comfortable with the assumption that the child is
using the closed-class cues. The most important is that both the function
words themselvesand the cue sequencesbasedon them are language-specific.
The question arisesof how the child can possibly catch on to the fact that it ,
that and the are cue words, much lessthat the sequenceV it the suggeststhat
V is probably a ditransitive verb, while the sequenceV that the suggeststhat
V is probably a complement verb. It is hard to see that there is any
alternative to knowing , besidesthe set of possible subcategorisation frames,
(a) the precisesyntactic significanceof each closed class word as NP , Specof
CP, etc., and (b) some statistics about possible corpora, including facts such
as that complement-taking verbs are more common than ditransitives. 1
I shall remain equally cautious in the face of such open questions, and
certainly would not wish to claim that the child cannot be using such cues.
However, as long as these questions remain open, it also remains unclear
whether we have escapedwhat Brent identifies as the ' chicken-or -egg prob-
lem' of apparently needing to know some syntax to apply this procedure.
This suggeststhat there may be some point to asking ourselves what other
resources the child could call upon, and in particular whether the two
alternatives that Brent mentions, prosody and semantics, can help a child
1 It is not
enough to assumethat the child simply looks out for verbs followed by all possible
' ' ' '
sequences cue words, classifying verbs as ;t + the verbs , that + the verbs , etc. Such a
of
classification does not determine a subcategorisation.
474 M . SteedmanI Acquisition of verb categories
3. Prosody
Z Brent suggeststhat spurious analyses of verbs like ring as subcategorising for PP can be
eliminated by observing sets of subcategorisations, presumably meaning that we can reclassify
verbs that have been assigned subcategorisationsof both PP and NP + P. However, this is a
distinct (and language-specific) complication to the proposal, and appears likely to conflict with
the other usesthat have been proposed for such sets. The prosodic cues discussedbelow would
allow this particular complication to be eliminated from his account.
M . Steedman/ Acquisition o/ verb categories 475
4. Semantics
4 The use of the words ' little more' rather than ' nothing more' is important . It would not be
-
surprising to find that some part of syntax perhaps the observed constraints upon consistent
orders across headsand complements - had its origin elsewherethan in semantics.
5 I am not saying that logicians and my fellow computer scientists do not have interesting
fonnalisms for representingconceptual structures. In fact these systemsare the main source of
fonnal theoretical devices that linguists have to draw on. But as knowledge representation
systems, none of them as yet seemparticularly close to the human one.
M. Steedman
I Acquisitionof verbcategories 477
'
whether the child s conceptual representation really can be used reliably in
this fashion , which Pinker ( 1989) has called ' semantic bootstrapping ' , and , if
not . how the child can cope with its unreliability .
S. Syntacticandsemantic
Gleitman ( 1990) argues very persuasively that the child must often find
itself in a situation which is ambiguous with respect to the conceptual
representation. To take one of her examples, a child who is being read a story
from their picture-book about a fox and a rabbit may have insufficient
information from their understanding of the story and from a picture of the
fox running after the rabbit to tell whether an unknown verb in an adult
sentenceof the form ' The fox is V ER Sing the rabbit ' should be associated
with the concept of chasing, or the concept of fleeing. In a number of elegant
experiments, she and Landau and their colleagueshave shown that children
who are artificially placed in this situation identify whichever conceptual
representationis consistent with the syntactic form of the sentence. Since this
is the only information that appearsto be available to the children as a basis
for the decision, they argue that children are capable of using a process of
' '
syntactic bootstrapping to aid them in learning the subcategorisationsof
verbs. Such a processrequires the child to generalisefrom its existing partial
'
knowledge of syntax and verbal subcategorisations, perhaps via the linking
'
ru1es discussedby Pinker and his colleagues(cf. Gropen et al. 1991).
Gleitman' s proposal is in principle entirely consistent with semantic boot-
'
strapping in the senseoutlined here. It is actually quite likely that the child s
'
conceptual representation isn t much more underspecifiedthan being ambiguous
as to whether this situation is an instance of transitive chasing or
transitive fleeing, and a few other equally relevant propositions. That is, we
can probably assumethat the child knows what it is to be ' read a story' , that
the story is ' about the fox trying to eat the rabbit ' , and a lot of relevant facts
such as that ' to eat something you have to catch it ' , ' rabbits don' t like being
' '
eaten , and so on. (If they don t know stuff like this, then they may be in a
position to learn the nouns, but probably not to learn the verbs.) In this case,
there aren' t likely to be that many other possibilities, and syntactic bootstrap-
ping may well reduce the set to one possible meaning. If so, it may well do so
in the face of syntactic complications that are irrelevant to subcategorisations,
such as the presenceof modifiers or adjuncts. Their presencewill not prevent
the child from acquiring the meaning of the verb, which it must have already
478 M . Steedman/ Acquisition of verb categories
6 Two
problems which Siskind leaves open are the problem of polysemous verbs, and the
problem that ariseswhen the set of putative meaningsderived from an occurrenceof the verb are
all spurious. Both of these eventualities will lead to empty intersections. One simple tactic that
might serveto distinguish them and thereby be used to maintain a coherent lexicon would be to
respond to an empty intersection by keeping both entries, relying on a tactic like binomial error
estimation to distinguish between true polysemous lexical entries and spurious ones on distributional
grounds.
1 For
example, Brown ( 1973), Bowerman ( 1973) and Clark ( 1973), and Carey ( 1982), the last
including an extensivereview.
M. Steedman
I Acquisitionof verbcategories 479
'
even more bizarre ' complexes, revealed in non-standard lexical meanings.
There is also evidencethat children predict new lexical entries that they have
not actually encountered, via lexical rules such as the rule that generates
causative verbs from certain adjectives, such as cool. This process may on
occasiongive rise to non-standard lexical causatives, as in # It colds my hand,
either becauseof slightly non-standard lexical rules, or becausestandard rules
are applied to slightly non-standard lexical entries. (SeeBowerman 1982and
referencestherein.)
The way in which children successivelymodify non-standard lexical items
to approximate the adult lexicon is the most challenging and least well-
understood part of the process. But the undoubted fact that the processes of
syntactic and semantic bootstrapping appear to iterate in this way suggests
that together they may constitute the processby which children gain accessto
concepts which are not immediately available to prelinguistic sensory-motor
cognition , and may thereby provide the force behind the explosive change in
cognitive abilities that coincides, both in evolutionary and in child -developmental
terms, with the appearanceof language. 8 Computational models of
the kind proposed by Brent and Siskind will continue to provide the only way
in which theories of this process, such as syntactic, prosodic and semantic
' '
bootstrapping , can be developed and evaluated.
References
8 SeeVygotsky( 1962 ) for someearlyspeculations on the natureof this processand its relation
-motor development
to Piagetiansensory . and seeOleron ( 1953 ) and Furth ( 1961) for some
earlystudiesof the effectsof deprivation.
suggestive
480 M . Steedman/ Acquisition of verb categories
Clark, Eve, 1982 . The young word maker. In: Eric Wanner, Lila Gleitman(eds.), Language
acquisition:The stateof the art, 39()..428. Cambridge : CambridgeUniversityPress .
Chomsky , Noam, 1965 . Aspectsof the theoryof syntax. Cambridge , MA : MIT Press .
Clifton, Charlesand FernandaFerreira, 1989 . Ambiguity in context. Languageand Cognitive
Process es4, 77- 104.
Furth, Hans, 1961 . The influenceof languageon the development of conceptformationin deaf
children. Journalof Abnormaland SocialPsychology 63, 186- 389.
Gleitman, Lila, 1990 . The structuralsourceof verbmeanings . LanguageAcquisitionI , 3- 55.
Grimshaw , Jane, 1990 . Argumentstructure. Cambridge , MA : MIT Press .
Gropen, Jess , StevenPinker, MiceileHollanderand RichardGoldberg, 1991 . Affectednessand
direct objects: The role of lexicalsemanticsin the acquisitionof verb argumentstructure.
Cognition41, 153- 196.
Kelly, MichaelH., 1992 . Usingsoundto solvesyntacticproblems . Psychological Review99, 349-
364.
Kelly, MichaelH. andS. Martin, 1994 . Domain-generalabilitiesappliedto domain-specifictasks:
to in
Sensitivity probabilities perception , cognition, and language . Lingua92, 105- 140. (this
volume).
Lederer , Anneand MichaelKelly, 1992 . Prosodicinformationfor syntacticstructurein parental
speech . Paperpresented to 32ndMeetingof the Psychonomic Society , SanFrancisco , 1991
.
Miller, George , 1967 . The psychologyof communication . Harmondsworth : Penguin.
Oleron, Paul, 1953 . Conceptualthinkingof the deaf. AmericanAnnalsof th~ Deaf98, 304- 310.
Pinker, Steven , 1979 . Formalmodelsof languagelearning. Cognition7, 217- 283.
Pinker, Steven , 1989 . Learnabilityand cognition. Cambridge , MA : MIT Press .
Siskind, Jeffrey, 1992 . Naivephysics , eventperception , lexicalsemantics , and languageacquisition
. UnpublishedPhD . dissertation , MIT , Cambridge , MA.
Steedman , Mark and Gerry Altmann, 1989 . Ambiguity in context: A reply.. Languageand
CognitiveProcess es4, 105- 122.
Vygotsky , Lev , 1962 . Thoughtand language . Cambridge , MA : MIT Press .
Winogard, Terry, 1972 . Understanding naturallanguage . New York: AcademicPress .
Appendix
--
Symbolsfromthe
Phonetic
International
, with
Alphabet examples
-
a st.QP , C2P m man, him
a ~ , ~ ther
Im n ,n.ose, aImoy
.. f.@
:t, pl.@:cid IJ wi!!,l, sink
A byt, ft.Q.Qd ou gQ, SQY1
aJ ~ , kite .') w.@ .w
.lk , I.@
au ~ I, h~ 0 w.@ .nt, astron.@ .Y.t
b ,hoss , ho,ho .')1 v
bQil, .Qice
tl ~ balo, pi~ P eie, nae
d go, hag r rim, parrot
d3 w~ e, rase s .'-imon, bog
el b.@:!t, d~ I mip, f~ al
~ gmong, readily t lawn, walkeg
w~ ther, ! DY 9 !hiD, mytJ !
f felt, left 6 !l!en, wea!lier
g So, les u r.QQf , ft~
h hat, how u l.QQk , fyll
'
h Ohio, ahead 3r bird, wQ!St
I ~ I, ~ v yoice, giye
I if, remit w m , qyail
j ~et, can~on x (Spanish ) proteFr ,
(Dutch) ka~ el
k ~ , bac -k z ~ I, ~
I lip, bill 3 lealure, garale
Bibliography
Cohen, A . ( 1965) . Versprekingen als verklappers: bij bet proces van spreken en
verstaan. Forum der Letteren, 6, 175- 186.
Cohen, A . ( 1968). Errors of speechand their implication for understanding the
-
strategy of the languageusers. Zeitschrift fUr Phonetik, 21, 177 181.
'
Cohen, A ., Collier , R., & Hart , J. t ( 1982) . Declination: Construct of intrinsic
feature of speechpitch? Phonetica, 39, 254- 273.
'
Cohen, A ., & Hart , J. t ( 1965) . Perceptual analysis of intonation patterns. Proceedings
of the Vth ICA , A16. Liege, Belgium.
Collier , R., & Gelfer, C. E. ( 1983) . Physiological explanations of FO Declination.
In M . P. R. Van den Broecke & A . Cohen (Eds.), Proceedingsof the Tenth International
Congressof PhoneticSciences . Dordrecht: Foris.
Fay, D ., & Cutler , A . ( 1977) . Malapropisms and the structure of the mental
lexicon. Linguistic Inquiry , 8, 505- 520.
Feigenbaum, E. ( 1963) . The simulation of verbal learning behavior. In E. Feigen-
baum & J. Feldman ( Eds.), Computersand thought. New York : McGraw -Hill .
Fel' dman, A . G. ( 1966a). Functional tuning of the nervous systemwith control of
movement or maintenanceof a steady posture. II . Controllableparameters of the
muscles. Biophysics, JJ, 565- 578.
Fel' dman, A . G. ( 1966b) . Functional tuning of the nervious system with control
of movement or maintenanceof a steady posture. III . Mechanicographic analysis
of the execution by man of the simplestmotor tasks. Biophysics, JJ, 766- 775.
Fillmore , C. J. ( 1968) . The casefor case. In E. Bach & R. T . Harms (Eds.), Universals
of linguistic theory. New York : Holt , Rinehart and Winston.
Fillmore , C. J. ( 1973) . May we come in? Semiotica, 9, 98- 115.
Fillmore , C. J. ( 1977) . The case for case reopened. In P. Cole & J. M . Sadock
( Eds.), Syntax and semantics: Vol. 8. Grammatical relations. New York : Academic
Press.
Fillmore , C. ( 1982). Towards a descriptive framework for spatial deixis. In R. J.
Jarvella & W. Klein ( Eds.), Speech,place, and action: Studiesin deixis and related
topics. Chichester: John Wiley .
Flores d' Arcais, G. B. ( 1975) . Some perceptual determinants of sentenceconstruction
. In G . B. Flores d ' Arcais ( Ed.), Studiesin perception. Milan : Martel lo .
Flores d ' Arcais, G . B. ( 1987a). Automatic processes in languagecomprehension.
In G. Denes, C. Semenza, P. Bisiacchi, & E. Andreewsky ( Eds.), Perspectivesin
CognitiveNeuropsychology . Hillsdale, NJ: Lawrence Erlbaum.
'
Flores d Arcais, G . B. ( 1987b). Perceptual factors and word order in event descriptions
. In G. Kempen ( Ed.), Natural languagegeneration: New results in artificial
intelligence, psychologyand linguistics. Dordrecht: Martinus Nijhoff .
Flores d' Arcais, G. B., & Schreuder, R. ( 1987). Semanticactivation during object
naming. PsychologicalResearch , 49, 153- 159.
Fodor , J. A . ( 1975) . Thelanguageof thought. Hassocks: Harvester Press.
Fodor , J. A . ( 1983) . Themodularity of mind. Cambridge, MA : MIT Press.
Fodor , J. A . ( 1985) . Precis of the modularity of mind. The Brain and Behavioral
Sciences. 8 , 1- 42 .
Fodor , J. A . ( 1987). Modules, frames, fridgeons, sleepingdogs, and the music of
the spheres. In J. L. Garfield (Ed.), Modularity in knowledgerepresentationand
natural-languageunderstanding.Cambridge, MA : MIT Press.
Fodor , J. A ., Bever, T . G ., & Garrett , M . F. ( 1974) . The psychologyof language:
An introduction to psycholinguistics and generative grammer. New York :
McGraw - Hill .
Bibliography 512
-
subsequent recall . Memory and Cognition , 1, 213 216.
Garding, E. ( 1983 ). A generativemodelof intonation. In ACutlerD . R. Ladd
(Eds.), Prosody : Models andmeasurements . :
Heidelberg .
Springer
Garfield, J. L. (Ed.). ( 1987). Modularityin knowledge andnatural-
representation
language understanding . Cambridge, MA: MIT Press .
Garnham, A., Shillcock, R. S., Brown, G. D. A., Mill , A. I. D., & Cutler, A.
). Slips of the tonguein the London-Lund corpus of spontaneousconversations
( 1982
. In A. Cutler(Ed.), Slipsof thetongueandlanguage . Berlin:
production
Mouton.
Garnsey , S. M., & Dell, G. S. ( 1984). Someneurolinguisticimplicationsof pre-
articulatoryeditingin production. BrainandLanguage , 23, 64- 73.
Garrett, M. F. ( 1975 ). The analysisof sentenceproduction. In G. Bower
( ), Psychologyof learningand motivation
Ed. : Vol. 9. New York: Academic
Press.
Garrett, M. F. ( 1976). Syntacticprocess esin sentenceproduction. In R. J. Wales
& E. Walker (Eds.), New approach es to languagemechanisms . Amsterdam :
-
North Holland.
514
Jaffe, J., Breskin, S., & Gerstman, L. J. ( 1972). Random generation of apparent
speechrhythms. Languageand Speech, 15, 68- 71.
James, D . ( 1972) . Some aspects of the syntax and semantics of interjections.
Papersfrom the Eighth RegionalMeeting. Chicago Linguistic Society.
James, D . ( 1973) . Another look at, say, somegrammatical constraints on, ok , interjectio
and hesitations. Papersfrom the Ninth Regional Meeting. Chicago
Linguistic Society.
James, W. ( 1892) . Textbook of psychology. London: Macmillan .
James, W. ( 1893) . Theprinciples of psychology: Vol. 1. New York : Holt .
Jarvella, R. J, & HermanS . J. ( 1972). Clause structure of sentencesand speech
processing. Perceptionand Psychophysics , 11, 381- 384.
Jarvella, R. J., & Klein , W. (Eds.) . ( 1982). Speech, place and action. Studies in
deixis and related topics. Chichester: John Wiley,
Jassem, W ., Hill , D . R., & Witten , I . H. ( 1984) . Isochrony in English speech: Its
statistical validity and linguistic relevance. In D . Gibbon & H . Richter (Eds.), Intonation
, Accent, and Rhythm. Berlin: De Gruyter .
Jefferson, G. ( 1978) . Sequential aspects of story-telling in conversation. In J.
Schenkein ( Ed.), Studies in the organization of conversationalinteraction. New
York : Academic Press.
Jefferson, G. ( 1985) . An exercisein the transcription and analysis of laughter. In
T . A . van Dijk ( Ed.), Handbook of discourseanalysis: Vol. 3. London: Academic
Press.
Johnson-Laird , P. N . ( 1983) . Mental models. Cambridge University Press.
Johnson-Laird , P. N ., & Garnham, A . ( 1980) . Descriptions and discourse
models. Linguistics and Philosophy, 3, 371- 393.
Jones, H . G. V ., & Langford, S. ( 1987) . Phonological blocking in the tip of the
tongue state. Cognition, 26, 115- 122.
Jorgensen, J., Miller , G. A ., & Sperber, D . ( 1984) . Test of the mention theory of
irony . Journal of Experimental Psychology: General, 113, 112- 120.
Kaisse, E. M . ( 1985). Connectedspeech: The interaction of syntax and phonology.
New York : Academic Press.
Kamp , H . ( 1984) . A theory of truth and semantic representation. In J. Groenen-
dijk , Th . Jansen, & M . Stokhof (Eds.), Truth , interpretation and information.
Dordrecht: Foris.
Kao , H . S. R., Galen, G. P. van, & Hoosain, R. ( Eds.) . ( 1986) . Graphonomics
:
Contemporaryresearchin handwriting. Amsterdam: North - Holland .
Keele, S. W. ( 1981) . Behavioral analysisof movement. In V . Brooks ( Ed.), Handbook
of physiology. The nervoussystem: Vol. 2. Motor control. Bethesda, MD :
American Physiological Society.
Bibliography 519
" "
Keenan, E. L . ( 1976). Toward a universal definition of subject . In C. N . Li
( Ed.), Subjectand topic. New York : Academic Press.
Keenan, J., MacWhinney, B., & Mayhew, D . ( 1977) . Pragmatics in memory: A
study of natural conversation. Journal of Verbal Learning and Verbal Behavior,
16, 549- 560.
Kelly , M . H ., & Bock, J. K . ( 1988). Stressin time. Journal of Experimental Psychology
: Human Perceptionand Performance, 14, 389- 403.
Kelso, J. A . S., Holt , K . G ., Kugler , P. N ., & Turvey , M . T . ( 1980) . On the concept
of corrdinative structures as dissipative structures: II . Empirical lines of convergence
. In G. E. Stelmach & J. Requin (Eds.), Tutorials in motor behavior.
Amsterdam: North - Holland .
Kelso, J. A . S, Saltzman, E. L., & Tuller , B. ( 1986) . The dynamical perspectiveon
-
speechproduction data and theory . Journal of Phonetics, 14, 29 59.
" "
Kelso, J. A . S., Tuller , B., & Harris , K . S. ( 1983) . A dynamic pattern perspective
on the control and coordination of movement. In P. F. MacNeilage ( Ed.), The
production of speech.New York : Springer.
Kelso, J. A . S., Tuller , B., & Harris , K . S. ( 1986) . A theoretical note on speech
-
timing . In J. S. Perkell & D . Klatt ( Eds.), In varianceand variability in speechpro
cesses. Hillsdale, NJ: LawrenceErlbaum.
Kelso, J. A . S., Tuller , B., Vatikiotis , E., & Fowler , C. A . ( 1984). Functionally
specificarticulatory cooperation following jaw perturbations during speech: Evidence
for coordinative structures. Journal of Experimental Psychology: Human
Perceptionand Performance, 10, 812- 832.
Kempen, G. ( 1987). A framework for incremental syntactic tree formation . Proceedings
of the Tenth International Joint Conferenceon Artificial Intelligence. Los
Altos , CA : Morgan Kaufmann .
Kempen, G. ( 1988) . Language generation systems. In I . Batori , W . Lenders, &
W . Putschke(Eds.), Computationallinguistics: An international handbookon computer
orientedlanguageresearchand applications. Berlin: de Gruyter .
Kempen, G., & Hoenkamp, E. ( 1982) . Incremental sentencegeneration: Implications
for the structure of a syntactic processor. In J. Horecky ( Ed.), Proceedings
of the Ninth International Conference on Computational Linguistics.
Amsterdam: North -Holland .
Kempen, G., & Hoenkamp, E. ( 1987) . An incremental procedural grammar for
sentenceformulation . CognitiveScience, 11, 201- 258.
Kempen, G., & Huijbers , P. ( 1983) . The lexicalization process in sentenceproduction
and naming: Indirect election of words. Cognition, 14, 185- 209.
Kenstowicz, M ., & Kisseberth, C. ( 1979) . Generativephonology. New York : Academic
Press.
Kent , R. D . ( 1986) . Is a paradigm change needed? Journal of Phonetics, 14,
111- 115.
Bibliography 520
Lindblom B., & MacNeilage, P. ( 1986) . Action theory: Problems and alternative
approaches. Journal of Phonetics, 14, 117- 132.
Linde, C., & Goguen, J. ( 1978) . The structure of planning discourse. Journal of
Social and Biological Structures, 1, 219- 251.
Linde, C., & Labov , W. ( 1975) . Spatial networks as a site for the study of language
and thought . Language, 51, 924- 939.
Lindsley, J. R. ( 1975). Producing simple utterances: How far do we plan? Cognitive
Psychology, 7, 1- 19.
Lindsley, J. R. ( 1976). Producing simple utterances: Details of the planning process
. Journal of PsycholinguisticResearch,5, 331- 351.
Linell , P. ( 1979). Psychological reality in phonology. Cambridge University
Press.
Longuet-Higgins, H . C. ( 1987). Mental Processes. Cambridge, MA : MIT Press.
Lounsbury, F. G. ( 1954). Transitional probability , linguistic structure, and systems
of habit-family hierarchies. In CE . Osgood & T . Sebeok ( Eds.), Psycholing
: A survey of theory and researchproblems. Bloomington : Indiana
University Press.
Lyons, J. ( 1977) . Semantics(2 Vols.) . Cambridge University Press.
Lyons, J. ( 1981). Language, meaningand context. Bungay: Fontana Press.
MacKay, D . ( 1970) . Spoonerisms: The structure of errors in the serial order of
speech. Neuropsychologia , 8, 323- 350 (Reprinted in V. A . Fromkin ( Ed.) . ( 1973) .
Speech errors as linguistic evidence . The Hague: Mouton .)
MacKay , D. ( 1972). The structure of words and syllables: Evidence from errors
in speech. CognitivePsychology, 3, 210- 227.
MacKay , D . ( 1982) . The problem of flexibility , fluency, and speed-accuracy
trade-off in skilled behavior. PsychologicalReview, 89, 483- 506.
MacKay , D . ( 1987). The organization of perception and action: A theoryfor language
and other cognitiveskills. New York : Springer.
Maclay, H ., & Osgood, C. E. ( 1959) . Hesitation phenomena in spontaneous
English speech. Word, 15, 19- 44.
MacNeilage, P. F. Motor control and serialorderingof speech
. Psychological
Review
, 1970, 77, 182- 196.
MacNeilage, P. F. ( 1980 ). Emergingconceptsin speechcontrol. In G. S.
Stelmach& J. Requin(Eds.), Tutorialsin motor behavior. Amsterdam: North-
Holland
.
MacNeilage, P., &. Ladefoged, P. ( 1976) . The production of speechand language.
In: E. c . Carterette &. M . P. Friedman ( Eds.), Handbook of perception: Vol. 7.
New York : Academic Press.
Bibliography 525
in linguisticperformance
: Slipsof thetongue
, ear, pen, andhand. New York: Academi
Press .
Motley, M. T., Baars, B. J., & Camden , C. T. ( 1981 ). Syntacticcriteriain prearti-
culatoryediting: Evidencefrom laboratory-inducedslipsof the tongue. Journal
Research
of Psycholinguistic , 10, 503- 522.
Motley, M. T., Camden , C. T., & Baars, B. J. ( 1982 ). Covert formulationand
editingof anomaliesin speechproduction: Evidencefrom experimentally elicited
slipsof thetongue.Journalof VerbalLearningandVerbalBehavior , 21, 578- 594.
Munro, A. ( 1977). Speechact understanding . Unpublisheddoctoraldissertation ,
Universityof California, SanDiego.
Nakatani, L. H., O' Connor, J. D., & Aston, C.H. ( 1981 ). Prosodicaspectsof
AmericanEnglishspeech rhythm. Phonetica , 38, 84- 106.
Neilson, M. D., & Neilson, P. D. ( 1987
). Speechmotor control and stuttering: A
computationalmodelof adaptivesensory -motor processing
. SpeechCommunication
, 6, 325- 333.
527
Tent, J., & Clark , J. E. ( 1980) . An experimental investigation into the perception
of slips of the tongue. Journa/ of Phonetics, 8, 317- 325.
Terken, J. M . B. ( 1984). The distribution of pitch accents in instructions as a
function of discoursestructure. Languageand Speech,27, 269- 289.
Thomassen, A . J. W. M ., & Teulings, H . L . ( 1985) . Time, sizeand shapein handwriting
: Exploring spatio- temporal relationships at different levels. In J. A .
Michon & J. Jackson( Eds.), Time. mind and behavior. Heidelberg: Springer.
Thorson, N . ( 1983) . Two issuesin the prosody of standard Danish. In A . Cutler
& R. D. Ladd(Eds.), Prosody: Models,and measurements. Heidelberg : Springer .
Treiman, R. ( 1983) . The structure of spoken syllables: Evidence from novel word
games. Cognition, 15, 49- 74.
Treiman, R. ( 1984). On the status of final consonant clusters in English syllables.
Journal of Verbal Learning and Verbal Behavior, 23, 343- 356.
Turner , E., & Rommetveit, R. ( 1968) . Focus of attention in recall of active and
passivesentences. Journal of Verbal Learning and VerbalBehavior, 7, 543- 548.
Turvey, M . T . ( 1977) . Preliminaries to a theory of action with referenceto vision.
In R. Shaw & J. Brandsford ( Eds.), Perceiving, acting, and knowing: Toward an
ecologicalpsychology. Hillsdale, NJ: Lawrence Erlbaum.
Tyler, L ., & Wessels, J. ( 1983). Quantifying contextual contributions to word -
recognition processes. Perceptionand Psychophysics , 34, 409- 420.
Umeda, N . ( 1975) . Vowel duration in American English. Journal of the Acoustical
Societyof America, 58, 434- 445.
Vaissiere, J. ( 1983) . Language-independent prosodic features. In ACutlerD
. R. Ladd (Eds.), Prosody: Models and measurements . Heidelberg: Springer.
Valian , V. V. ( 1971) . Talking, listening and linguistic structure. Unpublished doctoral
dissertation, Northeastern University , Boston.
Van Bezooijen, R. ( 1984) . Characteristicsand recognizability of vocal expression
of emotion. Dordrecht: Foris.
Van den Broecke, M . P. R., & Goldstein, L. ( 1980). Consonant features in speech
errors. In V . A . Fromkin ( Ed.), Errors in linguistic performance: Slips of the
tongue, ear, pen, and hand. New York : Academic Press.
Van der Sandt, R. A . ( 1982) . Kontekst en presuppositie. Nijmegen Institute of
Semantics.
Van Dijk , T . A . ( 1985). Handbook of discourseanalysis. (4 Vols.). London : Academic
Press.
Van Galen, G. P., & Teulings, H . L . ( 1983) . The independentmonitoring of form
and scalefactors in handwriting . Acta Psychologica, 54, 9- 22.
Van Wijk , C. ( 1987). The PSY behind PHI : A psycholinguistic model for performance
, 16, 185- 199.
structures. Journal of PsycholinguisticResearch
Bibliography 536
Van Wijk , C., & Kempen, G. ( 1987). A dual system for producing self-repairs in
spontaneousspeech: Evidencefrom experimentally elicited corrections. Cognitive
Psychology, / 9, 403- 440.
-
Vorberg, D . ( 1985). Unerwartete Folgen von zufalliger Variabilitat : Wettlauf
Modeile fur den Stroop- Versuch. Zeitschrift fiir experimentelle und angewandte
Psychologie,32, 494- 521.
Wales, R., & Toner, H . ( 1980). Intonation and ambiguity . In W. Cooper & E.
Walker ( Eds.), Sentenceprocessing: Psycholinguisticstudiespresentedto Merrill
Garrett. Hillsdale, NJ: LawrenceErlbaum.
Walton , D . N . ( 1982). Topical relevance in argumentation. Amsterdam: John
Benjamins.
Webber, B. L . ( 1981) . Discourse model synthesis: Preliminaries to reference. In
A . K . Joshi, B. L . Webber, & I . A . Sag (Eds.), Elementsof discourseunderstanding
. Cambridge University Press.
Webelhuth, G. ( 1985). German is configurational . The Linguistic Review, 4,
203- 246.
Weissenborn, J., & Klein , W . (Eds.) . ( 1982) . Here and there: Cross-linguistic studies
on deixis and demonstration. Amsterdam: John Benjamins.
Wells, G., MacLure , M ., & Montgomery , M . ( 1981) . Some strategies for sustainin
conversation. In P. Werth (Ed.), Conversationand discourse: Structure
and interpretation. London: Croom Helm.
Wells, R. ( 1951). Predicting slips of the tongue. The Yale Scientific Magazine, 26,
9- 30. (Also in V. A . Fromkin ( Ed.), ( 1973) . Speecherrors as linguistic evidence.
The Hague: Mouton .)
Whorf , B. L . ( 1956). Language. thought. and reality . Cambridge, MA : MIT Press.
Wickelgren, W. A . ( 1969) . Context-sensitive coding, associative memory, and
serial order in (speech) behavior. PsychologicalReview, 76, 1- 15.
Wickelgren, W . A . ( 1976) . Phonetic code and serial order. In E. C. Carterette &
M . P. Friedman ( Eds.), Handbook of Perception: Vol. 7. New York : Academic
Press.
Willems, N . ( 1983) . English intonation from a Dutch point of view: An experiment
-
investigationof English intonation producedby Dutch native speakers. Dor
drecht: Foris.
Williams , E. ( 1983) . Semantic and syntactic categories. Linguistics and Philo-
sophy, 6, 423- 446.
Wilshire, C. ( 1985) . Speecherror distributions in two kinds of tongue twisters.
Unpublished bachelor thesis, Monash University, Clayton , Victoria .
'
WilsonD ., & Sperber, D . ( 1981) . On Grice s theory of conversation. In P. Werth
(Ed.), Conversationand discourse: Structure and interpretation. London: Croom
Helm.
Bibliography 537
WilsonD ., & Sperber, D . ( 1988) . Mood and the analysis of non-declarative sentences
. In J. Dancy. J. Moravsik . & C. Taylor (Eds.), Hunwmagency: Language.
dutyandvalue.StanfordUniversityPress .
Wingfield, A. ( 1968 ). Effectsof frequencyon identificationand namingof objects
. AmericanJournalof Psychology , 8/ , 226- 234.
Winterhoff-Spurk, P., Herrmann, T., & Weindrich, D. ( 1986 ). Requestingrewards
: A study of distributivejustice. Journalof Languageand SocialPsychology
, 5, 13- 31.
Wunderlich, D. ( 19] 6). Towardsan integratedtheoryof grammaticaland pragmatical
meaning . In A. Kasher(Ed.), Languagein focus: Foundations , methods
andsystems . Essaysin memoryof Yehoshua Bar-Hillel. Dordrecht: Reidel.
Wunderlich , D. ( 1979). Foundations . CambridgeUniversityPress
of linguistics .
Wunderlich, D. ( 1981 ). Linguisticstrategies . In F. Coulmas(Ed.), A Festschrift
for nativespeaker . TheHague: Mouton.
Wunderlich,D., &.. Reinelt, R. ( 1982 ). How to gettherefrom here. In R. Jarvella
& W. Klein (Eds.), Speech , place, andaction: Studiesin deixisandrelatedtopics.
Chichester: JohnWiley.
Wundt, W. ( 1896 ). GrundrissderPsychologie . Leipzig: Kroner.
Wundt, W. ( 1900 ). Die Sprache (2 Vols.). Leipzig: Kroner.
'
Yarmey, A. D. ( 1973 ). I recognizeyour facebut I cant rememberyour name:
- -
Furtherevidenceon the tip- of the tonguephenomenonMemoryand Cognition
. ,
1, 287- 290.
Yngve, V. H. ( 1970 ). On gettinga word in edgewise . In M. A. Campbell(Ed.),
Papersfrom the Sixth Regional Meeting , Chicago Linguistic Society. University
of ChicagoPress .
Zubin, D. A., & Choi, S. ( 1984). Orientationand gestalt: Conceptualorganizing
principlesin the lexicalizationof space . In D. Testen, V. Mishra & J. Drogo
(Eds.), Papersfrom the Parasession on Lexical Semantics . ChicagoLinguistic
Society.
Zwanenburg , W., Ouweneel ). La frontieredu
, G. R. E., & Levelt, W. J. M. ( 1977
mot enFran~ is. Studiesin Language , 1, 209- 221.
Zwicky, A. M. ( 1972). On casualspeech . In P. M. Peranteau , J. N. Levi, & G. C.
.
Phares(Eds), Papersfrom theEighthRegionalMeetingof the ChicagoLinguistic
Society , 607- 615.
Zwicky, A. M., & Pullum, G. K . ( 1983 ). Cliticizationversusinflection: English
n' t. Language , 59, 502- 513.
Author Index
Motley, M. T., 356, 461- 462, 464- 466, Pollock , M. D., 414
468- 469, 474, 477 Pols,L. C. W., 432
Munro, A., 139 Posner , M. 1., 20, 73
Power , M. J., 127
Nakatani, L. H., 388- 390, 392- 393 Premack , D., 73
Neilson, M. D., 451 Prentice , J. L., 262- 263
Neilson, P. D., 451 Priller,J., 320
Nelson, L. J., 471 Prince, A., 298
Nespor, M., 302- 303, 370, 373, 378, 381, Prince, E., 114 , 146, 148
392, 408 Pullum , G. K., 375- 376,378,408
Newell, A., 17, 143, 321 Pylyshyn , Z. W., 22
Ninnaier, H., 51
Nolan, F. J., 440 Quinlan, P. T., 230
Noordman, L. G. M., 212 Quirk, R., 2, 43
Noordman-Vonk, W., 183
Nooteboom , S., 149, 273, 350, 357, 387- Reason , J., 320
390, 463- 464, 470, 478, 480, 485 Recanati , F., 59
Nonnan, D. A., 74 Redeker , G., 148
Nunberg,G., 47- 48, 53, 134 Reich, P. A., 218, 351, 356
Reichenbach , H., 56
Ochs-Keenan . E.. 41- 42 Reinelt, R., 112, 139
O'Connell . D. C.. 391 Reinhart,T., 98- 99, 137, 265
O'Connor . J. D.. 307.310 Remez , R. E., 438, 450, 453
O'Connor . K. D.. 388 Richardson, G., 438
Ohala . J. J.. 400 Riddoch, M. J., 230
Ohman . S. E. G.. 453 RochesterS. R., 113, 259
Ojemann . G. A.. 441 Rommetveit , R., 262- 263
Oldfield . R. C.. 199 . 230 Rooth, M., 89
Oiler. D. K.. 446 Rosch, E., 155, 223
Olson . D. R.. 129 Ross,J. R., 65, 264
Osgood , C. E., 23, 44, 148 , 203,267-
, 199 Roudet, L., 387
269,272 Rubin, D. C., 320
Ouweneel , G. R. E., 166 , 367 Rubin, P., 438, 450, 453
Rumelhart , D. E., 19, 74
Paccia -Cooper,J. M., 14, 389 Ryalis, J., 400
Paivio, A., 264 Ryave,A. L., III
Pa1enno , D. S., 216
Parker, F., 444 Sacks, H., 23, 31, 33, 35- 36, 39, 111,459
Parkes , C. H., 201 Sadock , J., 63
Partee , B., 89 Saltzman , E. L., 439- 440, 446, 450
Pearson , M., 35 Samuels, S. J., 20
Pechmann , T., 46, 129- 133, 150, 273 Sanford, A., 113 , 119
Perdue , , 42
C. , D., 35
Schaffer
Perfetti, C. A., 263 Schank, R. C., 74, 90, 139
Perkell,J. S., 442- 444, 446, 453, 456 Schegioff, E. A., 23, 31, 36, 112
, 458,
Perry, J., 74 460-461,463,497
Petrie, H., 126 Schenkein , J., 122 , 273
Pick, H. L., 463 , K., 102
Scherer , 307
Pierrehumbert , J., 307 , D., 38
Schiffrin
Pike, K., 332 Schlesinger , I. M., 23, 104
Pinker, S., 192, 194, 197- 198 Schmerling , S., 165
Pinson,E. N., 426, 431 Schneider , W., 20
Author Index S44
, D. A., 52
Zubin
Zwanenburg , W., 166 , 367
Zwicky. A.. 63. 369. 375 - 376
. 378
. 408
Subject Index
Dative. SeeCase
Deaccentuation , 147- 150, 283
Deactivation , decayof activation,234, 3
359
Deafness , 444, 446
Debate, 112, 126
Decision-makingstage , 228
Decisiontable, 207- 210, 213, 233
Declarativeknowledge . SeeKnowledge
Declarativemood, sentence type, 101
Subject
Index 552
Genioglossus , 430
Gesamtvonte Uung,23. 26
Gestaltof object/scene . 131- 132. 156. 159
Gesture . 8. 23. 35. 45. 47. 53. 59. 68
articulatory. 291. 295. 297. 318. 326- 327.
346. 362. 364. 388, 409. 414. 434- 435.
438. 440. 444, 446
Giveninformation. 6. 98. 114. 311. 315
Givenness . 99. 100
Glottal speechsounds . 381. 433
Glottal stop. 381
Glottis. 309. 424. 427- 429. 433. 445
Goal
communicative . 5. II . 107. 117. 128. 136.
282
elaboration . 127
Grammar. 192. 236
message , 70
IncrementalProduction . 235. 249. 282
LexicalFunctional. 162. 235
of repair. 489
Grammatical
encoding . 9. II . 105, 179. 181- 283passim .
375, 422
relations. 15. 27. 161. 179. 285
Grammaticalfunction. 97. 152. 161- 163.
167- 170. 179- 180. 189- 190. 192- 195.
198. 222. 233, 235- 236. 238- 239. 244-
245, 250. 256. 260- 261. 264- 270. 275.
279. 281. 283
hierarchyof, 192. 194. 233, 260. 266. 269.
275
Gravityanddeixis. 49
Grid. metrical. 298- 299. 304- 305. 317. 323.
366. 374. 378. 380- 384. 411
Haplology. SeeErron
Harmonics . 427
Head/modifierstructures . 79. 82. 89, 96
Head-of-phrasefunction. 167- 168. 238
Head-of-phraseprocedure . 239. 247
Hemispheric lateralization. 22. 154
Henderson graph. 127
Hesitantspeech . 127- 128. 204
Galvanicskin response , 469 Hesitation.5- 6. 203- 204. 257- 259, 460.
Gap (pause ), 33, 78 484, 494
Gaze , 203, 204, 421
prelexical
in convenation, 8, 35, 36, 59, 68 Homonym,208
movementanddi~ tional tenDs, 153- 154 Hyperbole , 44
tour, in scenedescriptions, 153 Hypemym , 183
, 216,220
Gender
, 168, 231,238 Hypemym problem, 201,204,207,210,
Generic
verb,206 212- 214,233
Subject Index 555
I-marker, 23 Interference
Idea conceptual . 217- 218. 227. 229. 234
sensory -motor, 94 plan-internal. 222
unit, 23, 36 Interjectionin repairs.482. 484
Idealdelivery, 128, 384- 385, 391, 402 Interlinear-toneticnotation. 309- 310
. SeeDeixis
function
Identifying Internalloop in self-monitoring. 472- 473
- 187
Idiom. 186 Internalspeech . 12- 15. 21. 27- 28. 469-
Illocutionaryforce, 58- 65, 108, 123- 124, 473. 498
151
. 157
. 398
. 402 InternationalPhoneticAlphabet. 12. 295.
andaccessibility
Imageability , 264- 266 500
, 63- 65, 67, 71, 100
Imperative - 101, 106
, Interrogative . 63- 64. 67. 78. 100- 102. 106.
109144
, , , 157 170- 171 , , 403
240 109. 144. 157. 170- 171. 240. 403
Implicature Interruptionof speech . 28. 112- 113.459.
conventional, 44 473. 478- 48. 489. 492
conversational, 41- 44, 66- 67, 124, 138 delayed . 478. 480. 482. 484. 490- 491
Inclination of pitch, 404 immediate . 479- 480. 495. 498
Incremental , 2, 23- 28, 109
production , 157
, Main InterruptionRule. 478. 480- 482.
162 235
, , 239- , - 254
245 253 , 284 , 365, 498
371 - 384
, 373 , 396
, 398, 400, 405, 411, within-word. 479- 481. 484
Interview. 3- 5. 111- 112. 126. 158. 391
Incremental Production Grammar , 235, Intonation. intoning. 15. 18. 23- 24. 33.
249
. 282 62- 64. 102. 108. 165. 166. 171. 179- 180.
field.46
Indexical 255. 290. 306- 307. 315- 317. 365. 368.
Indignation . 30. 102. 171 406- 407
Indirect speechact. SeeSpeechact generationof. 398- 405
Inferability of referents. 99- 100. Seealso Intonational
Accessibility meaning . 102. 171. 312- 317. 366. 405
Infinitive . SeeTense lexicon. 171. 316
, 55, 104
Inflection , 165
, 161 , 183 , 251,
, 198 phrase . 259. 303- 304. 306- 311. 314- 317.
286,288-290,316,322,344
clitics
versus . 378
asdiacritic . 191
. 183
feature . 256
Knowledge(cont.) Lemma,6
encyclopedic , 9- 10 access , 198- 213, 320
mutual, shared , 8, 14, 29, 44- 45, 47, 69, access failure, 214- 221
84, 114- 115, 123, 125, 146, 158 substitutionin self-correction , 489
mode, 72- 74 Legitimacyof request , 134- 135
procedural , 9- 11, 72, 105, 124- 125, 149, Levelsof processing , 231- 232, 234, 319,
185- 186, 236 321, 330, 343- 345, 361- 362, 367, 369,
representation , 72- 74 468
situational, 10, 72 Lexicalaccess , 186, 199, 200, 205- 206,
store, II , 28 213- 214, 230, 234, 277, 468
failureof, 214- 221, 462
Labialspeech sounds , 295, 340, 442 theoriesof, 199, 201, 206, 211
Labio-dentalspeech sounds , 433 timecourseof, 181, 222- 232
Languages Lexicalaction, 208- 210
Cantonese , 268 Lexicalbiasin speech errors, 355- 356, 359,
Danish, 307 363
Dutch, 71, 103, 122, 130, 140, 167, 220, editingaccountof, 465, 477
237, 238, 307, 350, 433, 483, 485 activationspreadingaccountof, 356, 359
Dyirbal, 55 Lexical
Finnish, 185, 268, 289 categories , 167, 288
French, 231. 238, 297, 302, 307, 367- 368, encoding , 186- 187, 233
387, 392, 429, 433 entry, 182- 183, 186- 188, 231, 234, 322,
German,26, 45, 102, 165, 167, 179, 238, 344
248- 249, 254, 263, 265, 289, 350 error. SeeErrors
Hausa, 50 form, 12, 231, 258, 285, 320- 321
Hebrew,268, 433 gap, 209
Hungarian , 265, 268, 272- 273, 289 hypothesis , 181, 235
inflectional , 289 intrusion, 184
Italian, 262, 272- 273 pointer, 165, 167, 174, 180, 193, 195, 344,
Japanese , 103- 104, 186, 286, 272, 289, 329 362, 373, 375- 378
Kannada,269 LexicalFunctionalGrammar, 162, 235
Kiliyila, 104 Lexicalrule(passive , dativeshift), 193
Malay, 104 Lexicallydrivenformulation, 181, 198,
Malayalam , 163, 167, 180, 270 235- 236, 282
Quiche,497 Lexicon
Serbo-Croatian, 268 intonational, 171, 316
Spanish , 53, 102- 104, 268, 392- 393, 433 lemma,9, 187, 199
Swedish , 307 mental, 6, 11- 13, 18, 22, 27, 162, 167,
-
stresstimed~373, 392 393 - 181- 234, 290, 302, 316
syllable-timed, 392- 393 form, 12, 163, 187, 258, 284- 301, 323,
Thai, 497 329, 344, 352- 355, 363, 368. 373. 408
Turkish, 185- 186, 268, 289 Liaison, 295. 302. 367- 368. 454
Tuyaluan,497 Linearization . 107. 138. 138- 144, 153. 159.
Warlpiri, 169 460
Language -specificconceptualizing , 103- content-relateddeterminants . 138- 139
105, 157 principles , 138. 140. 143
Language of thought, 71, 73- 74, 105 process -relateddetenninants . 138- 144
Lapse(pause ), 2- 3, 33 strategies , 153. 159
Laryngealspeech sounds , 295, 316 Lip. 295- 296. 413. 428- 430. 432- 434. 436.
Larynx, laryngealsystem , 295, 413, 423- 430 440. 442, 446. 449- 452
Lecturing, 30 spreading /rounding. 409. 430. 434. 443
Subject Index 557
singular,47, 168, 187, 197, 238- 240, 242, Passive , 169, 181, 262, 268
289, 301, 344 elicitationof, 261- 264, 266, 268, 488
Numerals , 104, 273 lemma,245
lexicalrule, 193- 194, 233
' 's 3
Obligations , speakers andaddressee , , syntacticprimingof, 274- 275, 278
- -
20, 31 32, 61, 63 65, 135 truncated , 275
Object. SeealsoCase Path, asconceptualcategory , 90- 91, 94-
categorization , 223 96, 178, 189, 195, 237
naming, 130- 131, 158, 232. Seealso Pathfunction, 90- 91
Naming Patient. SeeThematicrole
orientation,49, 154- 155 Pause , pausing , 23, 33, 35- 37, 126- 128,
recognition , 230 203, 236, 257- 259, 305, 308, 364, 366-
reference , 133- 134 367, 372, 381, 385- 387, 390- 391, 409-
size, 155, 228- 229, 276 410, 420, 459, 482
Oblique. SeeCase Perceptual -loop theoryof monitoring,
Obstructionof pitch, 300. SeealsoPitch 469- 474, 476, 498
accent, movement Performativeverbs, 64- 68
Obstruents, 334, 434 Perseveration , 331, 347, 358- 359, 468
Omissions. SeeErrors Personasconceptualcategory , 74- 98,
Onset 113- 115, 189, 241
ofsyllable
, 290 - 294
, 293 , 299 - 344
. 324 . asdiacriticvariable, 191, 197, 233, 238,
- 349
347 , 353- 354357
, , 359- 362
. 367
. 240- 242, 250, 273, 289, 322, 331
369, 407- 410 Persondeixis. SeeDeixis
maximizationof, 294, 335 Perspective
of word. SeeWord addressee 's 52
,
Oralcavity, 413, 428, 430, 445, 455 informational/thematic,5, 11, 70, 96, 105,
Orbicularisoris, 430 109- 110, 132, 138, 145, 152- 159, 260,
Orderingerrors. SeeErrors 269
'
Orderingfor expression . SeeLinearization speakers, 94
Origin, origo. SeeDeicticorigin visual, 226
Orosensory goals, 442- 445, 456 Pharynx,428- 430
Overspecification in objectnaming, 129, Phonation , 424
158 Phonationperiod, 127
Overtone , 427 Phone,292, 294- 297, 328, 346, 362- 363,
Overtspeech , 2, 9, 12- 14, 16, 21, 27, 469- 438, 440- 441, 444, 472
472, 474, 498 Phoneme , 22, 211, 292, 298, 301, 320- 363
passim , 464, 475, 477
Palatalspeech sounds , 433 Phonemic
Palate(hard, soft), 423, 429- 431, 433 clause , 23
Palato-alveolarspeech sounds , 433 code, 297, 300
Parallelprocessing , 2, 20- 21, 24, 27- 28 similarityconstraint,350, 358
in articulation, 449 Phonetic
in grammaticalencoding , 238, 240, 242- feature,295- 297, 316, 350, 428, 432- 434,
243, 245- 246, 248- 251, 256, 259, 282 440, 442, 455
in lexicalaccess , 199- 202, 207- 211, 222, plan, 7- 8, 12, 24- 28, 284- 421passim ,
233- 234, 372 434- 436, 444, 449, 454- 457, 470, 473.
in phonologicalencoding , 355, 363 SeealsoArticulatoryplan
Parenthetical , 308, 386, 408, 497 Phoneticspellout, 319, 321, 324, 326- 329,
Pars-pro-toto principle, 124 332, 335, 338, 344, 348- 349, 361- 363
Parsedspeech (in monitoring), 13, 26, 470 in connected speech , 364- 370, 375, 387,
Particle,64, 375- 376 389, 400, 405- 412, 417, 436
classificatory, 104 Phonicduration, 387- 392
Subject Index 560
Phonological Planning
encoding , 7, 11- 12, 18, 20, 24, 26, 102, articulatory , 297, 420- 421
161- 162 , 165, 166, 171, 179, 181, 187, conceptual, S, II , 20, 24, 204. Seealso
231, 233, 236, 239- 242, 246, 255, 259, Macroplanning , Microplanning
275- 276, 278- 279, 282- 284, 303, 316, of connectedspeech, 364- 412
318- 412pa.rsim, 414, 416, 419- 422, of discourse, 115, 119
435- 436, 462, 468- 469 message , II , 119, 126, 259, 275, 467
feature,23, 350 metrical, 304, 317, 373, 397- 399, 402, 4OS ,
phrase , 259, 303- 307, 317, 373, 378- 381, 410- 411
390- 391, 411, 419- 421, 454, 473 motor, 420, 435, 443- 444
priming, 279- 280, 354 phonetic,296, 31S,321, 329, 341, 343-
word, 302, 317, 361, 370- 380, 389- 392, 346, 359, 365, 473
398, 406- 421passim , 454, 473 prosodic, IS, 364- 365, 396, 405, 410
Phonotactic speechact, 10S - I09, 474
possibility, 294, 335- 338, 344, 362, 369, andsyntax, 25S- 259
482 unit, 23, 256- 257, 259, 420
rule, 294 utterance , 3- 4, IS, 277, 2S2
PhrasalFocusRule, 174- 176, 178- 179 Planningdiscourse , 112
Phrase . SeeIntonational, Phonological , Pleas , 61
Phrasestructure Pledging , 61, 64
Phrasestructure, 163, 166- 167, 169, 179- Plosivespeech sounds , 290, 413, 433, 441,
180, 365, 369, 373, 384, 391, 408, 498 472
Pitchaccent , 7, 12, 100, 149- 151, 166, Plural. SeeNumber
174- 178, 237, 244, 299- 300, 306- 307, Politerequests , 66- 69, 135
310, 315, 317, 323, 364, 374, 382- 386, Politeness , 3- 6, 21, 30, 37, 65- 69, 124,474
389, 398- 400, 404- 405, 410. Seealso Posteriorspeechsounds , 296, 433
Accent, Accentuation Pragmatics , 39
andaccessibility of referent , 270, 273 Prearticulatoryediting. SeeEditing
contrastive , 172, 177, 247, 273, 307, 309, Precisionof articulation, 22, 396, 410, 434,
401, 495- 496 462
default, 177 Predication , 70- 71, S9, 90, 9S- 99, 113- 115,
asdiacriticfeature, 165, 180, 191, 233, 150- 1S4 , 159, 171, 261, 30S
366, 373, 382, 408 Prefix. SeeMorpheme
andfocus, 100, 174, 243, 246, 496 Prenuclear tune, 311- 312, 403- 405
monitoring, 463 Preposition , 4S, 122, 166- 167, 17S , 192,
andprominence , 307, 386 194- 195, 19S , 236, 251, 273, 2SS , 369,
andnuclearstress , 308- 309, 384- 385, 371, 37S,421
402 Prepositionalobject, 23S,248, 264, 274
Pitch-AccentProminence Rule, 382 Prepositionalphrase , II , 167, 169, 17S, ISO,
Pitch-AccentRule, 175 192, 195, 23S,266, 26S, 304, 485, 487,
Pitch 490
contour, 7, 171, 255, 300, 307- 310, 316, Presentatives , 272
364, 366, 373, 398, 405, 411, 427, 498 Prespecification in self-repairs,491
modulation,413 Presupposition , 102, 113, 117- 119, 171,
movement , 298, 309- 310, 327- 328, 367, 175, 212, 216, 219. SeealsoBackward
372, 381, 385, 391, 396, 399- 405, 409- suppletion
410, 427 existential , liS
obstruction , 300 factive, liS
raising, 113, 315 Pretonicaccent.SeePrenuclear tune
stranding , 247, 282, 372 Preverbalmessage , 9, II , 14- 15, 72- 74,
withholding , 147 76, 7S, 93, 101, 105, 107- 110, 157, 160,
Placefeatures , 295, 316, 350, 443 170, 179- ISI, IS6
Subject Index 561
Priming Prototype,84
andactivationspreading , 352, 354, 474- Psychologicalattitude. SeeAttitude
477, 498 Psychologicalreality, 79, 124, 206, 257
lexical, 221- 226, 230, 275, 476
phonological , 279- 280, 354 Quantifier, 76, 196
syntactic , 273 - 275 Question , 30- 31, 35, 38, 60, 65- 68, 89,
Principle 118, 255
of connectivity , 140 answering , 36, III , 122, ISO, 172, 179,
of contrast, 200, 213 221, 254, 273
cooperative , 39- 40, 53 intonation, 102,403- 404
core, 213 tacit, 78, 86
linearization , 138, 140, 143 tag, 34, 308, 386
naturalorder, 138, 159 andtopic, 262- 266
of specificity , 213- 214 wh-, 101, 404
uniqueness , 213 yes/no, 101, 255
Proceduralknowledge . SeeKnowledge
Procedure . SeeCategorial , Functional Radiotalk, 29, 112
Processing Raisingverb, 190, 250
capacity,20 Rate(speed)
components , 1- 2, 8, 13- 15, 20- 24, 28, 70, of activationspreading
- , 35I, 359,366
365- 368, 468- 469 of errordetection , 466, 470- 473, 498
stage , 24, 234 of lexicalaccess , 199- 20I , 206, 225, 228,
Procliticization , 376- 377 233- 234, 277
Productions(condition/actionpairs), of grammaticalencoding , 246, 454
17- 18, 82, 124, 207, 321 of processing , 26, 28
Pro-form, 148- 149 of speech , 22, 108, 199, 222, 306- 308, 316,
Projectable units(in turn-taking), 31, 35 355- 359, 363, 366, 368- 370, 374- 375,
Prominence 385- 386, 389- 391, 392- 393, 407, 410-
conceptual , 147, 149- 151, 165, 172, 175- 411, 413- 414, 422, 432, 447, 449, 453,
176, 178, 180, 236- 237, 243, 266, 280, 457
307, 398 Rationality, 41, 67- 68, 158
contrastive , 158, 172, 176, 307, 398, 496 Reading , 259, 308, 390- 392, 394, 396, 400,
prosodic , 131, 147, 149, 152, 171, 180, 405, 415- 416
305- 306, 309, 311, 382, 384, 386, 401- -. -timeprocessing
Real , 15- 16, 18, 28, 206,
402 210, 212
syntactic , 260- 261, 264- 266, 268, 275, Reassuring intonation, 313, 402
-
278 281 Recall(verbatum ), 121- 122, 263- 264
Promising , 60- 61, 64 Recipient . SeeThematicrole
Pronominalization , 99, 149, 271- 273, 283 Recursiveness of conceptualstructure,82,
Propositional 87
format/modeof representation , 71, 105, Redundancy in objectnaming, 129- 132,
108, 110, 144- 145, 152- 155, 159, 188 158. SeealsoEndophoric , Exophoric
network, 75 Reduction
Propositionalization , 145 phonological , 166, 302, 364, 369- 371,
Proprioception in articulation, 445, 451 376- 378, 388, 406, 409, 464
ProsodyGenerator , 323, 365- 412passim , of referringexpressions , 5- 6, 147- 150,
416- 417 176, 271- 272
Prosodic Reference
focus, 171- 172, 176, 271, 496 deictic. SeeDeixis
planning, 18, 364- 365, 396, 405, 410 demonstrative , 48, 53
stress, 132, 273. SeePitchaccent geographic , 51
unit, 259, 373, 436 intrinsic. SeeIntrinsic
Index
Subject 562
Spreading activation. SeeActivation structure.7- 8. II . 15. 17- 18, 24, 26- 28.
spreading 161- 180. 191, 198. 217, 222, 235- 283,
Spurt, 23 284- 285. 299, 302- 303, 305- 308, 317-
Stackprinciplein linearization , 143 318. 322, 343. 353, 362, 365, 368, 370-
Stages . SeeProcessing 371. 375- 379. 380- 381, 383- 384, 386.
Standard 398, 400- 406
linguistic, 30, 410- 411, 418 Swallowingreflex, 448, 450
of conversation , 31, 39, 41, 63, 61, 461 Syllabicity,292, 295, 323, 365. 370- 371,409
Statefunction, 80, 251 Syllabification , 284, 294, 318, 365, 375, 405.
Statusof addressee , 45- 46, 69 SeealsoResyllabification
Stem, 180, 183- 185, 288, 290, 323- 325, Syllable
329- 331, 344, 348- 349, 352, 354, 362, articulation, 452- 454
312, 318 boundaries , 22. 292, 294, 300. 332, 481-
Stimulusonsetasynchrony , 226, 230, 234 482
Stops,433, 441- 442 coda. SeeCoda
Storytelling, 131 constituent . SeeConstituent
Stranding . SeeErrors duration(length), 298, 318. 327, 364. 367.
StressS, 12, 298, 310, 403. SeealsoPitch 369- 370. 373, 381, 387- 396. 398, 409-
accent 411, 416- 417. 436, 449
alternation,299, 306, 314, 394- 396 heavy/weak, 299, 316
assignment , 298- 299, 306, 325, 382- 384, inventory, 318, 327
390 latencyeffect, 414- 419
clash, 299, 306, 314, 381, 384, 394- 396 node, 18- 19, 292, 357- 358, 475
contrastive , 241 nuclear,309, 313, 325, 402
error. SeeErrors nucleusof. SeeNucleus
group, 420 omission , 396- 397
nuclear.SeeNuclear onset. SeeOnset
pattern, 25, 186, 298, 301, 323, 344, 355, program, 319, 327, 361- 362, 417, 436
313, 382, 391- 398 . rime. SeeRime
prosodic . See Prosodic stressed / unstressed . SeeStress
rules, 298- 299, 305- 309, 314, 382- 384, tier, 290, 292- 295, 298, 301, 325
390, 394- 396 unit, 292, 452
strandingof, 241, 312, 282 Syllable/ stress -timedlanguage . See
syllable,291- 299, 306, 308- 310, 311- 318, Languages
320, 325, 328- 329, 332, 341- 342, 344, Synergism , 447
359, 369, 312- 314, 381- 390, 392, 391, Synonym , 183, 200, 213, 220, 234, 254
404, 408, 411 Syntactic
tertiary, 309- 310 bias281, 465
word, 25, 255, 281, 293, 298- 301, 311, buffer, 12, 13, 26
324, 315, 381, 384, 389 specialist , 168, 249, 275
Stridentspeechsounds , 296, 30I , 434 Syntagma , 23
Subcategorization , 169- 110, 115, 180, Syntax,26, 35- 36, 182
194- 196 cohesive , 271, 273- 275
Subglottalair pressure , 399, 422, 421 andmeaning , 88
Submissiveness , 102
Substitutions . SeeErrors
Subvocalspeech , 285. SeealsoInternal
speech
Supralaryngeal system , 12, 295, 291, 316,
413, 422- 423, 428, 434, 455
Surface
clause . SeeClause
Index
Subject 565
159
Writing.435
Vowel
back, 431
errors. SeeErrors
front , 431
lax/ tense, 350
lengthening
- - , 34, 36, 309. SeealsoDrawl
semi-, 434
Vowel space, 431
WeU-formedness
of conversation . 37- 38
of message , 70
monitoringof, 14, 498
Rule, 486, 489, 491, 497, 499
of self-repairs,485- 487
of syllable/cluster, 329, 369, 482
of syntax, 282, 468
Wh-movement , 253