The Language Instinct: How the Mind Creates Language (25 page)

BOOK: The Language Instinct: How the Mind Creates Language
2.04Mb size Format: txt, pdf, ePub

 

 

One extraordinary feature of the lexicon is the sheer capacity for memorization that goes into building it. How many words do you think an average person knows? If you are like most writers who have offered an opinion based on the number of words they hear or read, you might guess a few hundred for the uneducated, a few thousand for the literate, and as many as 15,000 for gifted wordsmiths like Shakespeare (that is how many distinct words are found in his collected plays and sonnets).

The real answer is very different. People can recognize vastly more words than they have occasion to use in some fixed period of time or space. To estimate the size of a person’s vocabulary—in the sense of memorized listemes, not morphological products, of course, because the latter are infinite—psychologists use the following method. Start with the largest unabridged dictionary available; the smaller the dictionary, the more words a person might know but not get credit for. Funk & Wagnall’s
New Standard Unabridged Dictionary
, to take an example, has 450,000 entries, a healthy number, but too many to test exhaustively. (At thirty seconds a word, eight hours a day, it would take more than a year to test a single person.) Instead, draw a sample—say, the third entry from the top of the first column on every eighth left-hand page. Entries often have many meanings, such as “
hard
: (1) firm; (2) difficult; (3) harsh; (4) toilsome…” and so on, but counting them would require making arbitrary decisions about how to lump or split the meanings. Thus it is practical only to estimate how many words a person has learned at least one meaning for, not how many meanings a person has learned altogether. The testee is presented with each word in the sample, and asked to choose the closest synonym from a set of alternatives. After a correction for guessing, the proportion correct is multiplied by the size of the dictionary, and that is an estimate of the person’s vocabulary size.

Actually, another correction must be applied first. Dictionaries are consumer products, not scientific instruments, and for advertising purposes their editors often inflate the number of entries. (“Authoritative. Comprehensive. Over 1.7 million words of text and 160,000 definitions. Includes a 16-page full-color atlas.”) They do it by including compounds and affixed forms whose meanings are predictable from the meanings of their roots and the rules of morphology, and thus are not true listemes. For example, my desk dictionary includes, together with
sail
, the derivatives
sailplane, sailer, sailless, sailing-boat
, and
sailcloth
, whose meanings I could deduce even if I had never heard them before.

The most sophisticated estimate comes from the psychologists William Nagy and Richard Anderson. They began with a list of 227,553 different words. Of these, 45,453 were simple roots and stems. Of the remaining 182,100 derivatives and compounds, they estimated that all but 42,080 could be understood in context by someone who knew their components. Thus there were a total of 44,453 + 42,080 = 88,533 listeme words. By sampling from this list and testing the sample, Nagy and Anderson estimated that an average American high school graduate knows 45,000 words—three times as many as Shakespeare managed to use! Actually, this is an underestimate, because proper names, numbers, foreign words, acronyms, and many common undecomposable compounds were excluded. There is no need to follow the rules of Scrabble in estimating vocabulary size; these forms are all listemes, and a person should be given credit for them. If they had been included, the average high school graduate would probably be credited with something like 60,000 words (a tetrabard?), and superior students, because they read more, would probably merit a figure twice as high, an octobard.

Is 60,000 words a lot or a little? It helps to think of how quickly they must have been learned. Word learning generally begins around the age of twelve months. Therefore, high school graduates, who have been at it for about seventeen years, must have been learning an average of ten new words a day continuously since their first birthdays, or about a new word every ninety waking minutes. Using similar techniques, we can estimate that an average six-year-old commands about 13,000 words (notwithstanding those dull, dull
Dick and Jane
reading primers, which were based on ridiculously lowball estimates). A bit of arithmetic shows that preliterate children, who are limited to ambient speech, must be lexical vacuum cleaners, inhaling a new word every two waking hours, day in, day out. Remember that we are talking about listemes, each involving an arbitrary pairing. Think about having to memorize a new batting average or treaty date or phone number every ninety minutes of your waking life since you took your first steps. The brain seems to be reserving an especially capacious storage space and an especially rapid transcribing mechanism for the mental dictionary. Indeed, naturalistic studies by the psychologist Susan Carey have shown that if you casually slip a new color word like
olive
into a conversation with a three-year-old, the child will probably remember something about it five weeks later.

 

 

Now think of what goes into each act of memorization. A word is the quintessential symbol. Its power comes from the fact that every member of a linguistic community uses it interchangeably in speaking and understanding. If you use a word, then as long as it is not too obscure I can take it for granted that if I later utter it to a third party, he will understand my use of it the same way I understood yours. I do not have to try the word back on you to see how you react, or test it out on every third party and see how they react, or wait for you to use it with third parties. This sounds more obvious than it is. After all, if I observe that a bear snarls before it attacks, I cannot expect to scare a mosquito by snarling at it; if I bang a pot and the bear flees, I cannot expect the bear to bang a pot to scare hunters. Even within our species, learning a word from another person is not just a case of imitating that person’s behavior. Actions are tied to particular kinds of actors and targets of the action in ways that words are not. If a girl learns to flirt by watching her older sister, she does not flirt with the sister or with their parents but only with the kind of person that she observes to be directly affected by the sister’s behavior. Words, in contrast, are a universal currency within a community. In order to learn to use a word upon merely hearing it used by others, babies must tacitly assume that a word is not merely a person’s characteristic behavior in affecting the behavior of others, but a shared bidirectional symbol, available to convert meaning to sound by any person when the person speaks, and sound to meaning by any person when the person listens, according to the same code.

Since a word is a pure symbol, the relation between its sound and its meaning is utterly arbitrary. As Shakespeare (using a mere tenth of a percent of his written lexicon and a far tinier fraction of his mental one) put it,

 

 

What’s in a name? that which we call a rose

By any other name would smell as sweet.

 

 

Because of that arbitrariness, there is no hope that mnemonic tricks might lighten the memorization burden, at least for words that are not built out of other words. Babies should not, and apparently do not, expect
cattle
to mean something similar to
battle
, or
singing
to be like
stinging
, or
coats
to resemble
goats
. Onomatopoeia, where it is found, is of no help, because it is almost as conventional as any other word sound. In English, pigs go “oink”; in Japanese, they go “boo-boo.” Even in sign languages the mimetic abilities of the hands are put aside and their configurations are treated as arbitrary symbols. Residues of resemblance between a sign and its referent can occasionally be discerned, but like onomatopoeia they are so much in the eye or ear of the beholder that they are of little use in learning. In American Sign Language the sign for “tree” is a motion of a hand as if it was a branch waving in the wind; in Chinese Sign Language “tree” is indicated by the motion of sketching a tree trunk.

The psychologist Laura Ann Petitto has a startling demonstration that the arbitrariness of the relation between a symbol and its meaning is deeply entrenched in the child’s mind. Shortly before they turn two, English-speaking children learn the pronouns
you
and
me
. Often they reverse them, using
you
to refer to themselves. The error is forgivable.
You
and
me
are “deictic” pronouns, whose referent shifts with the speaker:
you
refers to you when I use it but to me when you use it. So children may need some time to get that down. After all, Jessica hears her mother refer to her, Jessica, using
you;
why should she not think that
you
means “Jessica”?

Now, in ASL the sign for “me” is a point to one’s chest; the sign for “you” is a point to one’s partner. What could be more transparent? One would expect that using “you” and “me” in ASL would be as foolproof as knowing how to point, which all babies, deaf and hearing, do before their first birthday. But for the deaf children Petitto studied, pointing is not pointing. The children used the sign of pointing to their conversational partners to mean “me” at exactly the age at which hearing children use the spoken sound
you
to mean “me.” The children were treating the gesture as a pure linguistic symbol; the fact that it pointed somewhere did not register as being relevant. This attitude is appropriate in learning sign languages; in ASL, the pointing hand-shape is like a meaningless consonant or vowel, found as a component of many other signs, like “candy” and “ugly.”

 

 

There is one more reason we should stand in awe of the simple act of learning a word. The logician W. V. O. Quine asks us to imagine a linguist studying a newly discovered tribe. A rabbit scurries by, and a native shouts, “Gavagai!” What does
gavagai
mean? Logically speaking, it needn’t be “rabbit.” It could refer to that particular rabbit (Flopsy, for example). It could mean any furry thing, any mammal, or any member of that species of rabbit (say,
Oryctolagus cuniculus
), or any member of that variety of that species (say, chinchilla rabbit). It could mean scurrying rabbit, scurrying thing, rabbit plus the ground it scurries upon, or scurrying in general. It could mean footprint-maker, or habitat for rabbit-fleas. It could mean the top half of a rabbit, or rabbit-meat-on-the-hoof, or possessor of at least one rabbit’s foot. It could mean anything that is either a rabbit or a Buick. It could mean collection of undetached rabbit parts, or “Lo! Rabbithood again!,” or “It rabbiteth,” analogous to “It raineth.”

The problem is the same when the child is the linguist and the parents are the natives. Somehow a baby must intuit the correct meaning of a word and avoid the mind-boggling number of logically impeccable alternatives. It is an example of a more general problem that Quine calls “the scandal of induction,” which applies to scientists and children alike: how can they be so successful at observing a finite set of events and making some correct generalization about all future events of that sort, rejecting an infinite number of false generalizations that are also consistent with the original observations?

We all get away with induction because we are not open-minded logicians but happily blinkered humans, innately constrained to make only certain kinds of guesses—the probably correct kinds—about how the world and its occupants work. Let’s say the word-learning baby has a brain that carves the world into discrete, bounded, cohesive objects and into the actions they undergo, and that the baby forms mental categories that lump together objects that are of the same kind. Let’s also say that babies are designed to expect a language to contain words for kinds of objects and words for kinds of actions—nouns and verbs, more or less. Then the undetached rabbit parts, rabbit-trod ground, intermittent rabbiting, and other accurate descriptions of the scene will, fortunately, not occur to them as possible meanings off
gavagai
.

But could there really be a preordained harmony between the child’s mind and the parent’s? Many thinkers, from the woolliest mystics to the sharpest logicians, united only in their assault on common sense, have claimed that the distinction between an object and an action is not in the world or even in our minds, initially, but is imposed on us by our language’s distinction between nouns and verbs. And if it is the word that delineates the thing and the act, it cannot be the concepts of thing and act that allow for the learning of the word.

I think common sense wins this one. In an important sense, there really are things and kinds of things and actions out there in the world, and our mind is designed to find them and to label them with words. That important sense is Darwin’s. It’s a jungle out there, and the organism designed to make successful predictions about what is going to happen next will leave behind more babies designed just like it. Slicing space-time into objects and actions is an eminently sensible way to make predictions given the way the world is put together. Conceiving of an extent of solid matter as a thing—that is, giving a single mentalese name to all of its parts—invites the prediction that those parts will continue to occupy some region of space and will move as a unit. And for many portions of the world, that prediction is correct. Look away, and the rabbit still exists; lift the rabbit by the scruff of the neck, and the rabbit’s foot and the rabbit ears come along for the ride.

Other books

The 11th Floor: Awakening by Culver, Charles
One Bite Per Night by Brooklyn Ann
The World's Most Evil Gangs by Nigel Blundell
Bright Angel by Isabelle Merlin
Just One Kiss by Isabel Sharpe
The End Games by T. Michael Martin
The Grand Banks Café by Georges Simenon