Notes from Two Scientific Psychologists: Language isn't magical (but it is special)

Saturday, 19 May 2012

Language isn't magical (but it is special)

One of the most common comments about ecological psychology is that it's hard to imagine how it could apply to things like language. The sense is that language is a completely different kind of beast than perception-action and that it requires a completely different theoretical account (cognitive psychology). Andrew and I disagree. In this post I outline the similarities and differences between language and other types of perceptual information. The main idea is that language is indeed the same type of thing as perception-action, but there are key differences between them in the relationship between the information and what it means. These differences permit language to be flexible according to context, culture, and goals; to be expandable according to changing needs; and to be portable, allowing us to access information about things that are not currently in the environment. These properties make language special, but not magical.

Event Perception

Events in the world are defined in terms of their underlying dynamics. For example, two instances of a bouncing ball are instances of the same type of event - a bouncing-ball-event - because the dynamical equations of motion are the same in both cases. The two instances might be different in their parameters (e.g. the initial height of the ball) but they are still examples of the same event in the world.
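
To make the "same dynamics, different parameters" point concrete, here is a minimal Python sketch. It assumes an idealised ball (no air resistance, a fixed coefficient of restitution); the function names, the restitution value, and the drop heights are all illustrative assumptions rather than part of any published analysis.

```python
import math

G = 9.81  # gravitational acceleration in m/s^2 (assumed)


def peak_heights(initial_height, restitution=0.8, n_bounces=4):
    """Return successive peak heights of an idealised bouncing ball.

    Between impacts the ball follows y'' = -G; at each impact it keeps
    a fixed fraction (the coefficient of restitution) of its speed.
    Both values are illustrative assumptions.
    """
    peaks = [initial_height]
    for _ in range(n_bounces):
        impact_speed = math.sqrt(2 * G * peaks[-1])   # speed on reaching the ground
        rebound_speed = restitution * impact_speed    # speed just after the bounce
        peaks.append(rebound_speed ** 2 / (2 * G))    # height that speed carries it to
    return peaks


def decay_ratios(peaks):
    """Ratio of each peak to the one before it."""
    return [later / earlier for earlier, later in zip(peaks, peaks[1:])]


# Two instances of the event: same dynamics, different parameter (drop height).
low_drop = peak_heights(initial_height=1.0)
high_drop = peak_heights(initial_height=2.0)

print(decay_ratios(low_drop))
print(decay_ratios(high_drop))
```

The absolute heights differ between the two drops, but the ratio between successive peaks is the same in both (the restitution value squared). That shared pattern is one simple sense in which the two drops count as instances of the same event.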

Events in the world create information. The light reflecting off a bouncing ball is structured according to the laws of ecological optics by the specific motion of that ball. This structure (optic flow) is specific to the event in question and any organism that detects this information can therefore directly perceive the event in the world. The meaning of the information in this case is the dynamics of the event in the world and this is the meaning that the organism must learn. If the organism can use this information to successfully control its behaviour, we take this as evidence that the organism has access to this meaning.

Speech is a type of event (well, probably a series of nested events, but I'll get to this later). The act of speaking structures the acoustic array according to the laws of ecological acoustics. This structure is specific to the speech event, and any organism that detects this structure can therefore directly perceive the speech event. But in this case, the meaning of the information is not the dynamics of the articulation of the word. The meaning the organism must learn is the conventional meaning of the word that was spoken and, if the organism acts in a manner consistent with that conventional meaning, this again is evidence for access to that meaning.

From a first person perspective, both cases require learning the meaning of information. I argue that the mechanism of learning this meaning is identical for both types of event.

If, for analysis purposes, we adopt a third person perspective it is possible to see an important difference between speech events and events such as bouncing balls. The difference is in the relationship between the information and what that information means. For the bouncing ball, the fact that the optic flow pattern means 'bouncing ball' is underwritten by the lawful process by which the ball's motion was projected into the optic array, and the form of the information therefore relates to the underlying event. For the speech event, the fact that the acoustic array pattern means, for example, 'Hello' is not underwritten by a lawful process, and

...there is no intrinsic similarity between the sounds of most words and their referents: the form of the word dog gives us no hints about the kind of thing to which it refers. And nothing in the similarity of the forms of dig and dog conveys a similarity in meaning.

Smith & Gasser, 2005, p. 22

Language as an information medium

For the sake of clarity, I will reserve the term perception to refer to the apprehension of structure in an energy array when the meaning of this information is underwritten by a specification relationship between the information and the world. Using this definition, hearing the word "dog" is not an act of perception. This is because hearing a spoken word involves the apprehension of structure in an energy array when the meaning of this information is underwritten by a conventional relationship between the information and the world. When I use the term perceptual information, I mean information whose meaning is underwritten by a specification relationship. Auditory information is about sounds. Visual information is about visual properties of the environment. In contrast, linguistic information (in whatever modality it is conveyed) is about the conventional meanings of linguistic events, which might refer to sounds, sights, ideas, etc.

Because the auditory events of spoken language are about the conventional meanings of linguistic events rather than the physical act of speech articulation, it may be helpful to think of language as its own medium. The medium of language permits the flow of linguistic information through the modalities of sound (speech), vision (writing, sign language), and touch (Braille).

This difference is obviously important and the consequences of it have been noted by many linguists and psychologists. For one thing, humans are uniquely adept at using linguistic information. Learning the meaning of a linguistic event is more difficult than learning the meaning of perceptual information. I can say the word "dog" whether or not there is an actual dog nearby, but the sound of a dog barking is usually going to mean that there is a dog within earshot. Humans have some adaptations that help them with this problem. We are good at establishing joint attention to something in the environment. We are also incredibly motivated to communicate with one another, which means that we're willing to persevere with this difficult learning problem. The process of learning the meaning of linguistic information will be an important part of a research programme on ecological approaches to language. That said, the task analysis that I will undertake in the next post will focus on proficient language users rather than on beginners.

To summarise, then: the route from language to its meaning is not underwritten by laws, the way the route from perceptual information to its meaning is. But, critically, from the first person perspective of the organism, there is no difference in what it is interacting with, and I argue that the organism will therefore apply the same tools to both problems. The different outcomes reflect the differences in the routes to meaning, and not fundamental differences within the organism.

In the next post I will consider the types of tasks for which linguistic information might be useful. An actual research programme would select one specific task from one of these types, but I want to begin by thinking broadly about when and how linguistic information guides behaviour. I will also introduce the idea of perceptual-linguistic systems, which will be central to understanding how the meaning of a linguistic event is understood. Finally, I will discuss why this approach to language is explicitly non-representational.

References

Smith, L. & Gasser, M. (2005). The development of embodied cognition: Six lessons from babies. Artificial Life, 11 (1), 13-30.

40 comments:

Sabrina Golonka, 20 May 2012 at 20:34
Hi Amy

You asked:

I see how your theory could account for learning a word like "dog," but how would it explain learning words like "promise?"

https://twitter.com/#!/amy_tabor/status/204217380015054848

The gist of your question seems to be this: There is an animal that we refer to with the word "dog." This animal is a thing in the environment, so in terms of word learning it is easy to imagine having enough stable perceptual experience with this animal that we can begin to link it to the word "dog." In contrast, a promise can't be seen - it's not a concrete thing - so it's harder to imagine having stable enough perceptual experience to learn the meaning of the speech event "promise."

I can think of a few potential responses to this at the moment, and I don't know which is the most fruitful, so I'm just going to rattle them off and we'll see what you think.

1) Although "dog" refers to a familiar animal, the word is used in a variety of ways. "Dog days of summer", "Dog-ear a page", etc. Wiktionary provides 12 meanings for the noun form and 6 for the verb form. If we consider how flexibly we can actually use the word "dog" (the accounting of which would far exceed the number of discrete definitions it is assigned) it becomes much less obvious why this word should be somehow more amenable to this ecological account than the word "promise" (and at this point you might think my ideas are even less plausible than before you read this comment!). This is especially the case since "promise" is fairly well-behaved, having 2 meanings as a noun and 1 meaning as a verb. In any case, it is not clear that "dog" as a word with a variety of potential meanings is obviously more straightforward than a word like "promise", which actually has a fairly well-defined meaning.

(cont'd)
Sabrina Golonka, 20 May 2012 at 20:35
2) Although in the previous point I appealed to the number of definitions of a word to illustrate flexibility in word-meaning, I argue against thinking of words as having stable core meanings. For example, the Wiktionary definition of "promise" discusses making vows and oaths as acts of promising. By this definition, simply declaring " I promise you that " constitutes a promise. In reality, our judgement of whether or not a promise has occurred is much more complex and dependent on perceptual and other factors (what I call perceptual-linguistic systems). Tone of voice (sarcasm, sincerity), situation (marriage ceremony, movie containing a marriage ceremony), and prior knowledge about the speaker, all participate fully in the creation of meaning during the speech act. This is equally true for the meaning of the speech act "dog." Even if we restrict this act to the noun form that refers to the animal, the actual referent of "dog" in a conversation could be a huge variety of things from a photograph of a dog, to a line drawing of a dog, to a cat dressed in a dog suit (in a line-up of cats dressed like other animals you can easily imagine someone referring to this as a "dog"), to a cloud shaped like a dog, etc. This is the type of thing that Smith & Jones (1993) refer to when they say that the thing that makes cognition smart isn't stability, it's flexibility. If we let go of the idea that words like "dog" are especially easy, then their contrast with words like "promise" is less apparent. Granted, children (Western, English-speaking children) are more likely to learn the word "dog" before the word "promise" and this is related to the fact that there is at least one stable perceptual referent for "dog" that we can label in kids' books. But, this early word learning is far from the eventual complexity that characterises adult usage of the word.

3) In this approach to language, evidence of understanding word meaning comes from appropriate behaviour (either using a word in a successful communication or acting sensibly in response to linguistic information in the environment). Again, the intuition is that "dog" is somehow easy because it is a concrete noun and concrete nouns are "content rich". Abstract words like "promise" and "democracy" are often assumed to require representations because it is unclear to people how we can use these words appropriately if we can't learn their meanings from perceptual experience (I think event perception will be an important answer to this, but I'll get to that in a later post). However, it is clear that we are able to learn the meaning of many "content poor" words (in that we can use the words correctly) even though they are entirely without physical referents. For instance, words like "than" and "of" only serve to string other linguistic content together. They have functions, but not meanings, in the sense that nouns and verbs have meanings. Yet, we are able to use the words perfectly well, we miss them when they are absent, and we notice when they are used inappropriately. And yet, there is not a temptation to invoke representations to explain this ability (I doubt you feel strongly that you have a representation of the concept "of").
Anonymous, 21 May 2012 at 07:18
Hi, I would like to ask if you (Andrew and Sabrina) could tell me the main differences between Ecological Psych and Radical Behaviorism.
Thanks!
Eric Charles, 21 May 2012 at 08:13
Anonymous,
There are many differences between ecological psychology as conceived by Gibson and Radical Behaviorism as conceived by Skinner. This is a long discussion. One problem is that there are a few versions of each of these systems. Though Radical Behaviorism is now often considered synonymous with Skinnerian Behaviorism, there were several other candidates, and, of course, all scientific systems are evolving.

There are core similarities though. Both descend (historically/intellectually) from the lineage of American Philosophy, i.e., Pragmatism and Radical Empiricism. You can pick up a bit of that here. Or, if you just want a quick intro to eco psych, look here.
Eric Charles, 21 May 2012 at 08:25
Amy (assuming you are reading this),
In a system like this, no word 'means' anything, so the question is only how you come to use the word and how others come to respond. So, even though we might ask for an explanation for people's use of the word "promise", that must be a shorthand for asking why the word is said by specific types of people in specific situations. (You know, because the word is used in so many different ways, and we could presumably have a different explanation for each use.) So... to get us started, I'll provide the specifics: Why would a person say "I promise I will take out the trash?"

Now that we are more specific, I don't really have an answer :- )

Skinner (1957) created a category called "autoclitic" to handle words like "promise" in situations like this. One large category of autoclitics are words/phrases that modify the strength of another part of what was said. In that case, "promise" is simply a stronger response than "I will take out the trash", which is itself a stronger response than "I think I will take out the trash". The analogy is to a forceful vs. normal vs. weak pushing of the lever (or any other operant behavior).

This is brilliantly clever, but I don't know if it stands up, and I don't know what the current field of verbal behavior analysis thinks about that aspect of Skinner's work.
Sabrina Golonka, 21 May 2012 at 09:53
Anonymous,

That's a good question, and if it had occurred to me I would have put something about this in the introduction post. There are many differences between radical behaviourism and ecological psychology, but two of these are particularly salient for a discussion of language.

First, ecological psych includes a theory for how information enters the system via perception, while radical behaviourism does not. Ecological psych's idea of event perception is critical to understanding language because it opens up the possibility of many different layers of structure that can convey meaning in a speech event. For example, there are event structures for words, but there are also probably (and this is open to rigorous testing) structures for sentence types, moods, conversation types, etc. Ecological psych also tells us how to look for these (instances of the same event will have the same dynamics). Without a theory of perception, radical behaviourism isn't quite sure what information is available to support language. The obvious candidates are words, but words are only a part of the story and no account that focuses on words alone will be able to explain the complexity, flexibility, and unreasonable success of language.

The second difference between radical behaviourism and ecological psych that is relevant to language is the level of behavioural analysis. Radical behaviourism tells us that the response to a stimulus depends on our history with that stimulus. Using these principles, we can train people and other animals to do an astonishing variety of things. For instance, we could probably train a rat to ride a tiny bicycle. But, radical behaviourism doesn't explain HOW the rat comes to learn to ride the bicycle. Ecological psych fills this gap with the theory of information and perception-action systems. With ecological psych we can figure out what perceptual variables the rat uses in the continuous control of her behaviour while riding a tiny bicycle.

These two differences between radical behaviourism and ecological psych make the latter a much better candidate for studying language. It gives us a theory of information, which we can use to identify the event structures (beyond just words) that convey meaning during speech events. It gives the proper priority to perception, so that we can appropriately identify the types of things language can be used for (and the types of things it cannot be used for). And, it explains how information can be used in the continuous control of action (with the support of the dynamical systems literature).
Sabrina Golonka, 21 May 2012 at 10:23
Eric said:

"Skinner (1957) created a category called "autoclitic" to handle words like "promise" in situations like this. One large category of autoclitics are words/phrases that modify the strength of another part of what was said. In that case, "promise" is simply a stronger response than "I will take out the trash", which is itself a stronger response than "I think I will take out the trash"."

I am not a fan of this idea. I've worked a little on some computational linguistics programmes that used this notion to develop a full vocabulary using only a set of universal semantic primitives (a la Wierzbicka). The hope of the computer programmers on the project was that we could describe all verbs in terms of vectors where each cell corresponded to a value on a particular dimension, like strength, with respect to a primitive verb. The idea is intuitively appealing. One problem is that there is no principled reason for particular words to have particular dimensions. The relationship between physical forces and words like "promise" versus "intend" is obviously only metaphorical and the fact that we use one metaphor rather than another is related to culture/convention rather than some natural fit between them. This means that autoclitics can only be used to describe existing phenomena. We can't look at a set of unfamiliar, but semantically related words, and generate any sensible hypotheses about how these might relate to one another in the language of physical forces.

Another problem with autoclitics is that the idea strips out the nuance in meaning. The difference between "say" and "shout" is more than just one of valence, even if that is one way to describe the difference. "Shout" isn't just "say" turned up. It has its own emotional connotations that are not rooted in a magnitude difference with "say." This is why the phrase "to shout" is not equivalent to the phrase "to say extremely loudly."
sjgknight, 21 May 2012 at 11:15
I wonder to what extent this is 'translating' other areas of psych (e.g. sociocultural psych, discursive psych) into the language of ecological?

Also, I know Andrew read one of Andy Clark's books (I forget which), but Clark has a nice analogy on p.81 of 'Natural-Born Cyborgs', the Mangrove Swamp, in which meaning is emergent from language (rather than labelled by it). He's also written some articles (e.g. I think one called "Magic Words") which may relate to some of these issues.
Neuroskeptic, 21 May 2012 at 16:27
"The meaning the organism must learn is the conventional meaning of the word that was spoken ... linguistic information (in whatever modality it is conveyed) is about the conventional meanings of linguistic events"

This seems sensible, but haven't you just admitted that words have "meanings" (i.e. they point to something in the world, without being it) and we learn those meanings... in other words, that our minds contain representations of the world?

In which case - haven't you thrown out radical embodiment?

To put it another way, many people would say that as soon as you admit that words have meanings, you have become a representationalist.

And (I'm no expert) but didn't the later Wittgenstein specifically deny that words had meanings (in the conventional sense anyway) - for exactly that reason...?
afauno, 21 May 2012 at 16:44
sjgknight has a point imho. One is baffled by the continuous reinventing of things... Btw Skinner's autoclitics are for instance remarkably similar to the medieval modus vs dictum distinction. I'd like to come back to this thought at the end.

But I have a more precise remark concerning your approach. To explain my point, I'd like to introduce a classical dichotomy in the discussion: (1) language as a subjective activity of coding/decoding/understanding (Saussure's 'langage') versus (2) language as a social construct (as in the expression 'the English language'), where one may assume that many subjective processes resulted in a functional entity (which corresponds to Saussure's 'la langue').

It seems to me your approach has a huge blind spot for everything related to the latter. It's quite apparent when you dismiss the computational research on 'vectorizing' Wierzbicka's primitives as 'cultural'... You seem to think that cultural phenomena are irrelevant to psychology because they are not universal. But one might argue that even if no semantic primitive is really universal, all languages have some kind of primitives and therefore a certain cultural 'tendency to have primitives' is universal... that would make it very relevant to studies of the mind.

To me, it's as if you claimed that the study of chemistry renders biology irrelevant because all biological phenomena are in the end chemical. Yet obviously the higher-level constructs studied by biology have a reality of their own, with laws and properties that can/should be studied as such. It doesn't always involve going all the way down to the chemical properties.

In other words, I claim that cultural/conventional phenomena in language have rules of their own. You could see language as a shared (social) construct created by the speakers. The goal would be to objectivize their subjective understanding into an intersubjective construct or artifact. One of the properties of this social artifact is to classify the reality into shared categories. In that view, trying to find major emerging dimensions within the categories, which your computational linguists were trying to do, seems quite an interesting approach.

So I don't say your view is 'bad', I'm just saying it tackles low-level routines that may assemble into higher-level phenomena when you move from the 1st person view to the cultural constructs.

An interesting question then is: What are the interactions between the lower level perceptual mechanisms and the higher level semantic constructs? To tackle that, I'd like to go back to the modalisation/autocliticity discussion. Some linguists argue that the binary opposition you noted between 'autoclitics' (aka modalisers and grammatical words) and lexemes (content words) is just the simplification of a continuous gradient between two poles. A theory called grammaticalization explains that content words can (under certain circumstances) have their meaning become less and less concrete and more and more grammatical. An example would be Low Latin 'casa' (=house) becoming French 'chez' (=at). This process, among many others, indicates there are degrees of universality in the semantic constructs. In the context of your theory, it could perhaps be interpreted as the tendency to re-create important general patterns from the 'small change' of available content words.
afauno, 21 May 2012 at 20:45
Thanks for your answers... Sorry if I underestimated the importance of the cultural for you. I'll be happy to read your next post (yet I'm still convinced there's a dual embodied/disembodied nature of language...)
Eric Charles, 21 May 2012 at 20:50
Well, one big difference between what you are trying to do and what Skinner was trying to do is the organism of interest. Skinner called his book 'Verbal Behavior' because it was about the behavior of saying things (writing things, signing things, etc.). It was not a theory of language, but an extension of operant theory to explain why certain people say certain things in certain situations. Your theory, it seems, will be focused on perception, i.e., on why listeners do certain things, including responding with more words.

This should lead to a difference in perspective, but whether it leads to incompatibility is a separate issue. Also, even if it is incompatible with Skinner's approach to verbal behavior, that doesn't mean it is incompatible with radical behaviorism. So far as I can tell, the first use of 'radical behaviorism' was by Mary Calkins in 1916, long before Skinner came on the scene.

Eric
Charles T. Wolverton, 22 May 2012 at 04:14
the question is only how you come to use the word and how others come to respond

I'm curious how far Eric, Sabrina, or anyone else is willing to go down this path. My inclination is to go pretty far (perhaps too far?). Hence, although I am on board with the claim that "the mechanism of learning this meaning is identical for both types of event" (those that unfold in accordance with natural laws and those that unfold in accordance - more or less - with social conventions, eg, language), I'd go even further.

I take the meaning of a linguistic event to be the response (possibly latent) intended by the linguistic agent. So, the agent's objective is not to convey "information" to someone but to effect action by someone, immediately or subsequently. If the target of the event responds as intended, the meaning was "understood". That view seems to extend easily to the concept of the "meaning" of any stimulus caused by an agent, eg, perceiving a bouncing ball hit by a tennis opponent. And it can be extended to allow attribution of "meaning" to an agentless event by assuming perfect "understanding" on the part of the perceiver so that the virtual meaning of the event can be interpreted as being whatever response results.

In developing tennis skills, one has to learn to produce a multitude of instances of a bouncing ball in an attempt to cause desired responses from an opponent - ie, to master the "meanings" (in the above sense) of produced bouncing tennis balls: those hit to/from the forehand/backhand, with/without top/back/side spin, long/short or high/low on fast/slow surfaces, etc, all in a multitude of complex and rapidly time-varying contexts. Is the process of learning how to play tennis at a given skill level really dramatically different from learning to play a similarly challenging language game with comparable skill, ie, to skillfully produce linguistic "bouncing balls"? In both cases the production clearly needs to be "flexible according to context [] and goals [,and] to be expandable according to changing needs". I'm less clear whether in either case they need to be "portable" in the sense of allowing a player to "access information about things that are not currently in the environment". The intentional idiom is convenient for those who insist on including mentalese in their vocabularies, and background knowledge of a tennis opponent's behavioral patterns can be an advantage. But is either necessary? Finally, I can't quite parse "flexible according to [] culture" and therefore have nothing to say about that proposed distinguishing feature of linguistic events. (All of this also applies, of course, to responding to stimuli.)

I fail to appreciate the significance of the Smith and Gasser quote. While an utterance considered only as an abstract sound obviously conveys no information about its intended referent (if any), that seems quite irrelevant to meaning. At the basic level appropriate to consideration of "dog", if we insist on ascribing meaning to the stand-alone word, it is initially merely an association between simultaneous experiences of neural activity due to visual stimulation consequent to light reflected from a present dog-object and aural sensory stimulation due to utterance of the word. Only later does meaning in the more complex sense of tool use emerge. Why in either case would anyone expect a sound per se to be similar (using any measure) to an object? In any event, this evolution of skilled word usage seems to me to parallel closely the development of a skill at handling bouncing tennis balls.
Charles T. Wolverton, 22 May 2012 at 04:31
Nitpicking:

1. While I understand that the concept of stimuli having "information content" is central to eco-psych (or at least the version to which you and Andrew subscribe), "information" still strikes me as being one of those overloaded terms that may cause more trouble than it's worth. Eg, isn't it likely to lead the unwary to think in terms of representation and computation - both perhaps required for third-person analysis and simulation but not necessarily for first-person implementation? Why not something more neutral like your own "[dynamic] structure in an energy array"?

2. Trying to distinguish law-driven stimuli and language by considering the latter to be a "medium" seems potentially confusing. In the post, transmission of language is said to be via "modalities", thereby leaving "medium" available for describing language itself. But I think most comm theory people would consider such a physical "modality" to be the communication "medium". In terms of layered comm protocols (which - intended or not - is the flavor of that part of the post), the physical layer is at the bottom of the protocol stack, and language arguably isn't even in the stack. Consistent with the protocol stack metaphor, one could indeed call language the "medium for meaning", but I see no benefit to doing so that would offset the possible consequent confusion.
Charles T. Wolverton, 22 May 2012 at 04:42
In other words, I claim that cultural/conventional phenomena in language have rules of their own. ... One of the properties of this social artifact is to classify the reality into shared categories.

afauno -

In case you don't already know, Donald Davidson has addressed this idea at length, most notably here.