Sabrina has been working on a series of posts on an ecological analysis of language (here, here and here, plus more on the way). Her focus has been on the nature of the information for language, and on the similarities and differences between this information and the information for perception. We're working some of this analysis into a paper, and writing that got me thinking about this in a little more detail.
Our main move on language is to reject the assumption that language is a qualitatively different kind of task than perception & action. The goal is to find ways to talk about these behaviours using the same basic analysis tools. Part of that is to draw the analogy to how perceptual information gets its meaning and use that to describe how linguistic information gets its meaning.
What I want to do here is just map this analogy out a little, because I ended up in an interesting place and I want feedback from people who know more about this than we do on whether it's just plain crazy. In particular, if you know anything about the relationship between neural dynamics and the dynamics of speech, we think this is going to be relevant!
How perceptual information gets its meaning
When we talk about meaning, we're asking how an organism can come to learn what an information variable is information about. Perceptual information is about the underlying dynamics of the event that created the information.
What I mean by this is that events in the world can be distinguished and identified only in terms of their dynamics. A dynamical description is one that describes how something changes over time, and it includes reference to the underlying forces that caused that change. A fly ball in baseball looks and acts the way it does because it is an example of the projectile motion dynamic. The dynamical equation describing projectile motion events includes terms for the size and mass of the object, the initial speed and angle, gravity, and drag (air resistance). You can use this description to plot out exactly how the position of the ball changes over time.
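To see the two kinds of description side by side, here's a minimal simulation sketch (Python; every parameter value is an illustrative assumption, not a measurement from anywhere). The update rule is the dynamics - it needs mass, gravity and drag - while the trajectory it outputs is pure kinematics, position changing over time:

```python
import numpy as np

# Illustrative parameters for a fly ball (assumed values, for demonstration only)
m = 0.145      # mass of a baseball (kg)
g = 9.81       # gravitational acceleration (m/s^2)
k = 0.0013     # lumped quadratic drag coefficient (kg/m), assumed
v0 = 35.0                  # initial speed (m/s)
angle = np.radians(40)     # launch angle

dt = 0.01
pos = np.array([0.0, 1.0])     # start 1 m off the ground
vel = np.array([v0 * np.cos(angle), v0 * np.sin(angle)])

trajectory = [pos.copy()]
while pos[1] > 0:
    speed = np.linalg.norm(vel)
    # The dynamical description: acceleration from gravity plus drag (mass and force terms)
    acc = np.array([0.0, -g]) - (k / m) * speed * vel
    vel = vel + acc * dt
    pos = pos + vel * dt
    trajectory.append(pos.copy())

# The kinematic description: just position over time, with no force terms left in it
trajectory = np.array(trajectory)
print(f"Flight time ~{len(trajectory) * dt:.2f} s, range ~{trajectory[-1, 0]:.1f} m")
```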
A dynamical event such as a fly ball creates information by interacting with, say, light; but this information is only kinematic, not dynamic. A kinematic description of an event is one that describes how something changes over time, but without reference to the underlying forces. In practical terms, this means that you can use variables like time, position, velocity and the other temporal derivatives of position but you can't use variables that include mass or force. The visual perceptual information for a dynamical event is therefore a pattern in the optic array that can be described in terms of things changing over time.
It turns out that it is possible for a kinematic pattern to specify a dynamic property. What this means is that an aspect of the dynamical event creates one and only one kinematic pattern as it unfolds over time. If this is the case, detecting the kinematic pattern is equivalent to perceiving that aspect of the dynamic event, and this is the mechanism for direct perception of the world.
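A worked example of specification (this is the Runeson-style kinematic specification of dynamics, and Runeson comes up again in the comments below; the numbers are made up for illustration): in a collision, conservation of momentum guarantees that the mass ratio of the two objects - a dynamic property - creates one and only one pattern of velocity changes - a kinematic pattern. Detecting the velocity changes is therefore equivalent to perceiving the relative mass:

```python
def mass_ratio_from_kinematics(v1_pre, v1_post, v2_pre, v2_post):
    """Recover the dynamic property m1/m2 from kinematics alone.

    Conservation of momentum: m1*(v1_pre - v1_post) = m2*(v2_post - v2_pre),
    so m1/m2 = (v2_post - v2_pre) / (v1_pre - v1_post).
    Mass never has to be measured directly; the kinematic pattern specifies it.
    """
    return (v2_post - v2_pre) / (v1_pre - v1_post)

# Made-up collision: object 1 slows from 4 to 1 m/s, object 2 speeds up from 0 to 1.5 m/s
print(mass_ratio_from_kinematics(4.0, 1.0, 0.0, 1.5))  # -> 0.5: object 1 is half as massive
```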
Information is all we have access to, and you never get to peek behind the curtain to check what the dynamics are up to. So in order to learn what a given kinematic pattern means, you have to use that pattern to control some action. If that pattern lets you, say, intercept a fly ball, then that pattern comes to mean the catchable-ness of the ball (the affordance). In other words, perceptual information comes to mean the dynamics of the event that created the kinematic pattern.
How linguistic information gets its meaning (the analogy)
Linguistic information is also created by a dynamic event, but a much more complicated one. Take speech (but the idea works just as well for writing and gesture). The information that is created is kinematic patterns in the acoustic array. These patterns are caused by the underlying dynamics of articulation (how the lips, tongue and vocal cords change over time). However, and this is a big however, linguistic information does not come to mean the dynamics of articulation. When you detect a pattern in the acoustic array, you don't perceive what your conversation partner's throat is up to - you perceive the meaning of the word that was produced.
Remember, the goal is to apply the analysis of how perceptual information gets its meaning to how linguistic information gets its meaning; but we've run into a mismatch. My solution is to remember that the dynamical system producing speech is actually much more than just the articulators. A critical player in speech is the brain, and one of the main reasons the articulators move the way they do is that this is what happens when you couple the neural dynamics of language to an articulation system.
The crazy notion that emerges from this analysis is that linguistic information comes to mean the dynamics of the broader system, the dynamical system formed by the coupling of language-related neural dynamics to an articulation system. This means the analogy holds (the kinematic information is about an underlying dynamical event in the world).
Initial problems
For perception, events such as projectile motion have the dynamics they do because of physics (see Turvey, Shaw, Reed and Mace, 1981 for the details of this analysis). The dynamics of projectile motion are simply a description of how an object changes its state over time when it has been fired off with an initial speed and angle and then left to do its thing.
This is not true for language. Why do the neural dynamics have the form they do? One crude answer from applying the analogy is that they are like this because that's how an extensively trained nervous system changes its state over time when it's producing that sentence rather than another. Obviously this isn't all that satisfactory, but it's all I have just now.
It's also even more complicated than this, because the dynamics from which linguistic information arises also include the conversational and social context, and so on. It's possibly an intractable mess, although people are applying dynamical systems to all kinds of tasks these days.
So this analogy only gets us so far; but it does push the ecological analysis quite a long way into the problem, which I like.
The coupling between neural and articulation dynamics
There is apparently a bit of a literature on this (thanks to Tom Hartley and Jon Brock for links). The debate in the literature right now seems to be about whether syllables can be described as oscillators. If they can be, then you can start to talk about things like coupling and entrainment between syllable production and the underlying neural oscillations you can measure during speech production. This recent paper in Frontiers in Language Sciences by Fred Cummins is skeptical, though I think only because he considers the syllable the wrong place to look; importantly, it has links to all the key papers on this topic. We'll get into that literature eventually, but for now I'm still trying to come to grips with this analysis and with whether this literature fits it and might help us.
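To give a flavour of what 'coupling and entrainment' mean formally, here is a minimal sketch, assuming a Kuramoto-style pair of phase oscillators (a deliberate toy: the frequencies, the coupling strength, and the very idea of boiling 'neural' and 'syllable' dynamics down to one phase each are illustrative assumptions, not claims from that literature). With strong enough coupling, two oscillators with different natural frequencies settle into a fixed phase relationship - entrainment; set K = 0 and they drift apart:

```python
import numpy as np

# Kuramoto-style coupling: d(theta_i)/dt = omega_i + K * sin(theta_j - theta_i)
omega_neural = 2 * np.pi * 5.0      # 'neural' oscillator (rad/s); ~5 Hz, assumed
omega_syllable = 2 * np.pi * 4.5    # 'syllable' oscillator (rad/s); assumed
K = 3.0                             # coupling strength; locking needs |domega| < 2K here
dt, T = 0.001, 10.0

theta_n, theta_s = 0.0, np.pi / 2   # arbitrary initial phases
for _ in range(int(T / dt)):
    dtheta_n = omega_neural + K * np.sin(theta_s - theta_n)
    dtheta_s = omega_syllable + K * np.sin(theta_n - theta_s)
    theta_n += dtheta_n * dt
    theta_s += dtheta_s * dt

# If entrained, the phase difference settles to a constant instead of growing
phase_diff = np.angle(np.exp(1j * (theta_n - theta_s)))
print(f"Final phase difference: {phase_diff:.3f} rad")
```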
If anyone has any bright ideas, questions, comments, papers that might help/hinder this analysis, anything at all, let us know in the comments. This is all a work in progress!
References
Cummins, F. (2012). Oscillators and syllables: A cautionary note. Frontiers in Psychology, 3:364. DOI: 10.3389/fpsyg.2012.00364
Hartley, T. (2002). Syllabic phase: A bottom-up representation of the temporal structure of speech. In Progress in Neural Processing 14 (pp. 277-288). Singapore: World Scientific. Download
Turvey, M. T., Shaw, R. E., Reed, E. S., & Mace, W. M. (1981). Ecological laws of perceiving and acting: In reply to Fodor and Pylyshyn (1981). Cognition, 9(3), 237-304. DOI: 10.1016/0010-0277(81)90002-0 Download
Your work on language, I think, would probably greatly benefit from reading some of Mark Bickhard's work on language. It was a problem he spent many years working on once he developed his interactive model of representation (I know you don't like that word, but it is a notion of representation radically different from the usual correspondence or "encodingist" models and is entirely consistent with radical embodied cognition). His theorizing overlaps quite a bit with what you have been discussing on this blog with respect to language.

As you say, we have access to information, but we can't know what it is information about (that is, we don't know what caused it). When we engage in interactions with information that successfully satisfy the conditions for engaging in other interactions, then the interactions themselves become meaningful as indicators for further interaction*. Taking the outfielder as an example, a particular visual scan indicates that catching is afforded by moving backwards and to the left and holding up his glove. In turn, catching the ball indicates that throwing is afforded, etc. The outfielder's actions are directed toward transforming the situation, and the relative success or failure of his actions changes how he characterizes the situation (a characterization that is constituted as a web of conditional interactions). If he catches the ball, then the batter is out. If he drops it, then he has to pick it up and throw it to prevent the batter from getting on base. Etc. So, his actions don't just recharacterize the situation for him, but for everyone else on the field through their own webs of conditional interactions. For instance, for the first-baseman, standing in position to catch the ball is conditional on seeing the outfielder drop the ball, but not on seeing the outfielder catch the ball.

Speaking is no different. Assuming that the people you are speaking to know how to use the language you are using (via situation conventions), you engage in speech acts intended to change how others characterize the situation (that is, you attempt to manipulate their perceptions of the affordances available and thus what actions are appropriate for the situation). For instance, the types of successful responses available when I say "I love you" and "I hate you" are entirely different (although there could be overlap—for instance, both might indicate that running away is the right thing to do!).
Anyway, you might find the references below useful. The article is the shortest and probably easiest to read, but the 1980 book is where the model was most thoroughly explicated.
*This is, basically, the model for representation. Interactions are what constitute representation when they function as indicators for potential further interaction. It's consistent with radical embodied cognition because the kind of representation available to an organism depends on the kinds of actions it can engage in, which depends on the organism's body (which includes the organism's nervous system). This kind of representation is "about" the environment in the sense that it (that is, the interaction) is presupposed to be appropriate for the environment. Environments (and situations) are implicitly defined, rather than explicitly defined, in terms of potential interactions (and the webs of conditional interactions implied by them).
Bickhard, M. H. (2004). The social ontology of persons. In J. I. M. Carpendale & U. Muller (Eds.), Social interaction and the development of knowledge (pp. 111-132). Mahwah, NJ: Lawrence Erlbaum Associates. (PDF: http://www.lehigh.edu/~mhb0/SocOntPersons.pdf)
Bickhard, M. H. (1980). Cognition, communication, and convention. Praeger.
Campbell, R. J. (2011). The concept of truth. Palgrave Macmillan.
Adrian,
Thanks for the interesting comment! I suspect Andrew and Sabrina will be even more hardline than you think. While the representations you describe do sound like they could be compatible with some forms of embodied cognition, the ecological psychologists are about as anti-representation as you can get. I, for one, am not sure there is any value in talking about mental representations, though I am fine with the notion of physical representations - as when a picture of Obama re-presents what Obama looks like. Some eco-psych people even get twitchy about that!
Eric
Well, an argument can be made (and is in Bickhard and Richie's book on Gibson—here's a PDF if you are interested: http://www.lehigh.edu/~mhb0/BickhardRichieRepresentations.pdf) that the problem eco-psych has with representation is with correspondence (encodingist) models of representation, not with representation per se. That is, the argument Gibson and others have launched against representation is an argument against correspondence (encodingist) models of representation. These arguments are not, however, arguments against an action-based model of representation, such as Bickhard's interactivist model. To the contrary, Gibson's model of perception is an interactive model of perception, and interactivism can be thought of as an extension of it to other areas of cognition.

As I said before, it is actions (really, interactions) that are representational in that, if successful, they indicate the possibility of other interactions. For a frog, seeing a blurb cross its visual field indicates the opportunity to flick its tongue and eat, but it does so functionally, not by creating some kind of structurally isomorphic stand-in for the blurb, as a correspondence model would suggest. The visual interaction itself is indicative that tongue flicking is appropriate (which, unlike with correspondence models, does not require that the indication is correct—the blurb could be a stone flicked by a scientist and not something nutritious like a fly). What is being represented (that is, re-presented) is an action (or an interaction), not a set of correspondences to something in the environment. The aboutness, of course, is something in the environment, but the aboutness arises as a function of the action's appropriateness for the environment rather than being some kind of picture of, correspondence with, or stand-in for it. As Richard Campbell argued, you can call that something other than representation, but why? Why not just argue that correspondence models are wrong?
Anyway, the point was not to argue in favor of interactive representation, but to suggest that Bickhard's model of language is massively congruous with Andrew and Sabrina's project, and it would be highly useful for them to check it out. I only mentioned representation because I know they are allergic to the word "representation" and might discount his work on the grounds that he talks about it. However, his work is entirely consistent with radical embodied cognition and ecological psychology, and he covers much of the same ground (conventions, for instance), so it would be unfortunate to dismiss it on such unwarranted grounds. That's all.
Adrian:
First, thanks for the info - pointers like this are exactly what I was fishing for here.
Second, I like the sound of Bickhard's approach. We are inclined to say 'why bother calling that representation'; you correctly identify that our beef is with correspondence models but as far as we're concerned those models are just what representation means in psychology. The word therefore has baggage we don't want, so in order to "engage in speech acts intended to change how others characterize the situation" we think using that word is more trouble than it's worth :)
All that said, thanks for highlighting the meat of this work - it sounds like we should hold our nose on this use of the word and get into the content!
Hi, I was just looking through my files and found another couple references that might be more useful than what I mentioned above, as they are specifically about language. (Note that both refs have download URLs).
Bickhard, M. H. (2007). Language as an Interaction System. New Ideas in Psychology, 25(2), 171-187. (URL: http://www.lehigh.edu/~mhb0/BickLangInterSysNIP.pdf)
Bickhard, M. H. (1995). Intrinsic Constraints on Language: Grammar and Hermeneutics. Journal of Pragmatics, 23, 541-554. (URL: http://www.lehigh.edu/~mhb0/IntrinCon.pdf)
I have two broad, very open questions:
1) How would you compare gesture to spoken language in this account? That is, presumably the dynamics of a gestural event reflect the dynamics of an underlying neural event, and yet it feels to me as though the dynamics of gesture would lend themselves more easily to analysis from an ecological perception perspective. Both are communicative. Both, it has been argued, have grammatical form. Perhaps gesture may provide a ladder to get from action to language. Likewise, sign language and playing a musical instrument...
2) The second question is related to the above: where does the 'social' come into this account? Not to get all Wittgensteinian, but it doesn't seem as though language (or gesture or music) can 'mean' very much at all without conspecific agents and a network of these to pressurise the development and use of some particular neural-articulatory dynamic forms over others. Linguistic events, to me, seem more constrained by broadly 'social' laws (very loosely speaking) than by physical laws that constrain events like balls flying through the air.
I am very interested to hear your thoughts.
Matthew,
Regarding your first point, there IS a literature in animal behavior trying to tackle communication from an ecological perspective, most notably the work of Don Owings and of Nicholas Thompson. The question is where the boundary is (if there is one!) between "mere communication" and "language". Then we would need to know which category those gestures fell into.
Hi Eric,
Thanks for the pointer on the animal behaviour literature - I am not up on this so I will look it out. My feeling is that there is probably not an ontological gap between 'mere communication' and 'language', and that the task of explaining language from an ecological perspective may benefit from 'scaling-up' from potentially more tractable cases of communication. I accept though that this may only take one so far. A deictic gesture may function like the utterance, "look at that!" The difficult cases, such as "The past is a foreign country", I'm not so sure about.
1) How would you compare gesture to spoken language in this account?
I would treat it the same way, as you describe, just replacing 'articulator dynamics' with 'gestural dynamics'. It would produce visual rather than acoustic information, but the goal is always to keep the analysis in the same form.
Now, you rightly point out that I don't have the social aspect in here. This is true, and this is needed to ground the meaning. The meaning of perceptual information is grounded in the physics of the relevant event; the meaning of linguistic information will eventually be grounded in social convention. This analysis only pushes the search for meaning back as far as a trained nervous system, so there's clearly work to do. I was just curious to hear people's reaction to this basic analysis at this point.
The problem is always this: there's nothing in the word 'dog' that makes it get attached to the object dog. The relationship is arbitrary. However, Sabrina's been arguing (and I agree) that while the initial link may be arbitrary, that link then gets actively maintained by the language community in a manner analogous to the way physics actively maintains the link from event to information. That linguistic maintenance is less stable, but then so is language, so that's ok. Also, once one link exists it structures related links; words share roots, etc.
So your point stands and is entirely correct, but we think we can just keep pushing the analogy and keep talking about the whole thing in the same terms. That's the goal - keeping language and perception-action as the same kinds of things.
Andrew and Sabrina -
I'm all for thinking of processing speech as just a special case of the general task of processing perceptual input. The question then becomes where in the spectrum between psychology and physics one should focus attention. Injecting a concept of "meaning" seems clearly too far toward psychology, while looking to the physiological "dynamics" of speech production seems to me too far into the physical details. So, I'd like to list the features of speech processing on which I think we agree, then identify features on which we seem to disagree and reasons why.
1. We seem to agree on a Quinean stimulus-response paradigm for thinking about perception in general and speech processing in particular.
2. I think of stimuli in terms of the resulting patterns in neural activity. Your paragraph contrasting kinematic and dynamic patterns seems a bit confusing (I suspect there's a typo or two), but I gather your "kinematic patterns" roughly correspond to my neural activity patterns. Here, I'll go with your term.
3. We seem to agree that the objective of perceptual processing is to produce responses - which I refer to as "behavioral dispositions" to emphasize that they may be latent rather than immediate.
4. I assume we agree that some responses to stimuli must be learned, either via direct training by family/community or indirectly by trial and error - not that every response must be preprogrammed, only that a priori learning of a (large) set of stimulus-response pairs is required.
5. I assume that extraction of the "information" in a kinematic pattern is successful (its "meaning" is understood) if the pattern is sufficient to support determination of a response. Eg, in the case of a simple pattern it might be sufficiently close (in some sense) to a learned pattern that the behavioral disposition paired with the stored pattern suffices.
It is at this point (probably before) that we diverge with respect to speech processing. I can understand why "projectile dynamics" (as manifest in kinematic patterns) might be useful in analyzing perceptual tasks like the fly ball. If one assumes the "tracking" (as opposed to the "predicting") approach to reaching a point of intersection, then the processing must continue tracking those dynamics/kinematics up to the point where the fielder no longer needs to relocate (ie, to the point at which Adrian's succession of interactive tasks moves on to the task of actually catching the ball). One might call reaching that point "achieving perceptual sufficiency". In those terms, the frog achieves perceptual sufficiency when its perceptual processing has reached the point where its tongue flicking is likely to intersect the path of the "blurb". But I contend that speech perception isn't dynamic like tracking but instead is more like predicting or pattern recognition. Perceptual sufficiency can be achieved long before a speech event has been completed (see note below), which seems to make the detailed dynamics of the production process largely irrelevant.
Note: If you doubt this, google "who said [small fragment of your favorite movie line in quotes, ie, entered as a phrase]"; eg, 'who said "what we"', 'who said "make my"', 'who said "frankly"'. (Some context-dependence seems necessary, so google must assume a movie context for such queries; for us, context is often explicit - eg, "Do you remember which actor said ... ?") We presumably can do at least as well as google's computers.
----- cont'd ---
1. We seem to agree on a Quinean stimulus-response paradigm for thinking about perception in general and speech processing in particular.
I believe you, but I don't know the details of this approach off the top of my head. Can you summarise?
2. I think of stimuli in terms of the resulting patterns in neural activity. Your paragraph contrasting kinematic and dynamic patterns seems a bit confusing (I suspect there's a typo or two), but I gather your "kinematic patterns" roughly correspond to my neural activity patterns. Here, I'll go with your term.
Yes there were typos, sorry - now fixed.
By kinematics I'm being very literal. A kinematic variable is one which changes over time but cannot include mass in the units. A dynamical system is the thing in the world that produces this pattern, and the description can have mass in the units (although it doesn't have to). At this point, I'm suggesting that neural dynamics, coupled to articulator dynamics, produce linguistic kinematics which another person can detect in order to perceive the speech event.
Points 3, 4, 5: Yes and yes and yes (Sabrina is getting into point 4 in way more detail for an upcoming post).
But I contend that speech perception isn't dynamic like tracking but instead is more like predicting or pattern recognition. Perceptual sufficiency can be achieved long before a speech event has been completed (see note below) which seems to make the detailed dynamics of the production process largely irrelevant.
I agree with this fact of the matter. Of course, a skilled outfielder doesn't actually have to watch the whole event to know whether the ball is catchable; when I played 1st base in softball I got to the point where I could see very quickly if a fly was heading to the outfield or not. Event structure (both the underlying dynamics and the related kinematics) extends over space and time, and the kinematic structure at the beginning can come to be informative about how the rest of the event will unfold, especially when the underlying dynamics are stable enough in the way they unfold (as in projectile motion or, in many cases, language). See this post on EB Holt.
The dynamics of how the event actually unfolds do matter; perceiving what's coming up on the basis of what's happening now is a little risky (less so in perception where the dynamics are more compulsory, but think about garden path sentences in language).
Can you summarise [a Quinean stimulus-response paradigm]?
At a high level, it's just a matter of focusing on sensory input itself (eg, kinematics) as opposed to focusing on the source of the input, and taking the objective to be determining responsive actions instead of, for example, constructing representations. Dynamics are, I take it, a feature of the source, but when the dynamics and the kinematics are isomorphic, one can focus on the latter, thereby fitting the stimulus-response paradigm.
At a lower level, the answer is a simple "no". I'm just finishing up Hylton's "Quine", which has been a tough slog, and I won't really be able to put it all together until I do my mandatory reread (if even then - really poor reading retention).
OK; that sounds like we're in the same ballpark. Just checking :)
--- cont'd ---
To repeat, I enthusiastically support your general direction - thinking of speech processing as just a special case of perceptual processing - but have doubts about your specific path. Here are some problems I see:
The information that is created [by speech] is kinematic patterns in the acoustic array. These patterns are caused by the underlying dynamics of articulation (how the lips, tongue and vocal cords change over time).
We understand artificial speech just fine despite there being no physiological dynamics involved, just computer generated sounds. A problem with the analogy may be that the relevant "dynamics" of successful speech are largely interpersonal, hence psychological. Anomalous monism (rightly or wrongly) suggests that they are therefore not subject to strict physical laws.
the dynamical system producing speech is actually much more than just the articulators. A critical player in speech is the brain
The question is how important such production details are in determining a behavioral disposition in response to a stimulus. In determining the detailed dynamics of a ball in flight, critical factors are the composition and trajectory of the bat, the point and angle of impact on the ball, et al. But does the fielder need to know any of those dynamics in order to move so as to intersect the ball's flight? OTOH, in dealing with a spinning tennis ball such factors are important. But they tend to be gleaned partly from perception of the hitter's stroke and therefore are essentially contextual, or are detected from the ball's trajectory and therefore are part of the kinematic pattern. I suspect that features of the larger "system" that includes the brain are similarly separable from, or imbedded in, the kinematic patterns consequent to heard speech.
[the neural dynamics] are like this because that's how an extensively trained nervous system changes its state over time when it's producing that sentence rather than another.
I agree with this; it's part of an explanation of how we learn to respond to stimuli by speaking. Presumably, we create (notionally) tables of recognizable stimuli and motor muscle commands that can produce a speech event. But I don't see how this enters into the hearer's complementary task of determining an appropriate behavioral disposition. The hearer also has learned such a (notional) table and again (for a simple utterance) just does a table look-up using a combination of kinematic and contextual patterns. [By "notional" I mean to emphasize that I'm not suggesting an actual implementation, just a heuristic aid.]
We understand artificial speech just fine despite there being no physiological dynamics involved, just computer generated sounds.
This is the language equivalent of virtual reality: a system that creates the right information using a different underlying dynamic. If you get your information right, the system never notices because information is all we have. Another way to say this is that you have described an equivalent configuration, and Runeson's already demolished this as a problem for ecological psychology :)
I do take your point that the relevant dynamics are going to entail more than articulation or neural activity. As I said to Eric, I think my analysis is heading in the right direction but it's incomplete.
The question is how important such production details are in determining a behavioral disposition in response to a stimulus.
Absolutely. What the system actually cares about is always an empirical question. But you have to constrain your search, and you do that by characterising the relevant task dynamics and identifying the full set of kinematic consequences. You then poke the latter one at a time until something breaks.
You can do more than 'glean' things from perception. The dynamical task analysis helps, though: I can say that the outfielder doesn't care about the composition of the bat etc because these factors get 'folded into' the outcome 'release angle and velocity', and I can say this because the dynamics of projectile motion don't include terms for the production of those variables. (How a person produces those variables is an interesting question in its own right, but it's not part of the task facing the outfielder). I'm agreeing with you, I think; I'm just trying to tidy the language.
But I don't see how this enters into the hearer's complementary task of determining an appropriate behavioral disposition.
Right. This is the remaining problem of 'why do the neural dynamics have the meaning they have'. I don't have an answer for this. Perceptual information means what it means because it's about a physical event and the meaning is compulsory. Language doesn't work like this. However, at a first pass, language is grounded in a culture that uses it in a particular way. So there's no reason why we call dogs 'dogs', but once we do, that arbitrary link becomes sustained and maintained by the language community. You can change that link because it's arbitrary, but there will be resistance to that change, suggesting some degree of stability.
The thing I like about thinking this way is that stability, etc, are all dynamical quantities you can measure and understand. So we're back where Sabrina and I want to be, talking about language as the same kind of thing as perception.
So yes, plenty still to do and I agree with your caution! My replies have mostly just been about tightening up the language we're using to make sure we stay on track :)
Perceptual information means what it means because it's about a physical event and the meaning is compulsory. Language doesn't work like this.
We seem to be converging. This quote highlights the remaining disconnect. If one adopts a stimulus-response paradigm and takes the purpose of perception - whether of linguistic or non-linguistic events - to be a response, then the conclusion that seems natural to me is that the "meaning" of a perceptual event is the response "intended" by the perceived object, not dynamical (or equivalent kinematic) information about the object. The latter presumably will play a role in structuring the details of the response, but it seems to me that the "meaning" has to be at a higher level (in a sense to be discussed).
In casual conversation, we often implicitly assume that the main purpose of communication is to provide someone with "information". For example, I have a fact that I think it would benefit you to have, and I try to convey it to you. You have understood the "meaning" of the communication if you now "have" (in some sense) that fact at your disposal.
But that's a potentially misleading description of the communication process. In general, the "information" that is conveyed is (notionally) just the value of an index used to select from a list. Until there is prior agreement between sender and receiver on what items comprise the list and which index value goes with which item, there is no "meaning" (semantic content) communicated.
In terms of this (greatly simplified) model, the act of perception extracts kinematic information which is used to access an entry in a (notional) list of innate or learned behavioral dispositions. Of course, in general the "agreement" necessary for determining those entries may be implicit, either innate or learned by trial and error. But in the specific case of language, the agreement is typically (though not always) established by convention and learned via training. Meaning is successfully communicated if the perceiver acts as intended by the source of the stimulus. (Of course, in the general case, such "intent" is metaphorical.)
So, how does the indexed list of actions get created? In the case of a simple "fact", an item may be a disposition to assert the fact, and the "casual conversation" model, though incomplete, is on the right track. We need to specify what is stored where and indexed how when a fact is learned. My hypothesis is that in this simple case, at a minimum the motor neuron commands necessary to assert the fact orally are "saved", indexed by the kinematic information (in the form of neural activity patterns) attendant to the learning process.
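To make the notional look-up concrete, a toy sketch (every pattern and disposition below is invented for illustration; again, this is a heuristic aid, not a proposed implementation):

```python
# Notional model: a kinematic pattern indexes a learned list whose entries
# are behavioral dispositions. All entries here are invented for illustration.
learned_table = {
    "fly ball expanding up and left": "move back and left, raise glove",
    "heard: 'I love you'": "smile (or, per Adrian, run away)",
    "heard: 'I hate you'": "run away",
}

def respond(kinematic_pattern):
    # Meaning is communicated iff the look-up yields the disposition the
    # source intended; an unlearned pattern yields no disposition at all.
    return learned_table.get(kinematic_pattern, "no learned disposition")

print(respond("heard: 'I love you'"))
```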
The model is discrete whereas perception in general is continuous. We've previously discussed approaches to resolving this, so I'll ignore that issue in this comment.
All very sketchy, but hopefully enough to work with.
Andrew...
I'll admit I'm pretty skeptical here. My intuition is that language does not lawfully reflect the brain states in the manner proposed (either because it just doesn't, as in the super-redundancy of the swimming lobster, or because it does, but not the brain states we might care about). My bet is that Eco Psych will make the most progress with language by first doing a solid analysis of the functions of language. I think we need some solid progress on social psych and some solid progress on communication more generally, before "language" proper.
That said, everything you said sounds very intriguing, and I wouldn't be sad to read more as you wrestle with it. In particular, I like the sentence:
"The crazy notion that emerges from this analysis is that linguistic information comes to mean the dynamics of the broader system"
That seems appealing, but I suspect 1) that such a matching is idiographic to the speaker. Presumably, the broader dynamics revealed by my saying "God save the Queen" are different than the broader dynamics revealed when you say it. Also, 2) we would need to allow a very multi-level reading of "dynamics of the broader system". For example, a speaker saying "Ow!" is revealing less of their broad dynamics than a speaker saying "Are there really still voters who are undecided about Obama?!?"
My intuition is that language does not lawfully reflect the brain states in the manner proposed
I tend to agree, actually; I don't think neural dynamics are the entirety of the system generating the information. So my chain of reasoning above is incomplete, I think.
That said:
(either because it just doesn't, as in the super-redundancy of the swimming lobster,
I actually think degeneracy works in my favour. That work shows that there are many different neural configurations that can produce the same neural dynamics; so you and I can produce the same dynamics underpinning the sentence 'look out for that tiger' even though we don't share a connectome. It's the way the system evolves over time, not the mechanism producing that change, that's key here.
or because it does, but not the brain states we might care about).
This seems at least somewhat unlikely; at some point my language use has to be about conveying something I know to you, and that's going to entail some relevant neural activity. However we want to characterise what the brain is doing, it has to be doing something relevant to the task at hand.
My bet is that Eco Psych will make the most progress with language by first doing a solid analysis of the functions of language.
Sabrina's posts are certainly focused here, and I think this is perfectly good. This post is just another piece of the puzzle, potentially.
Also you're right about the dynamics that the information comes to mean; it's going to be a very broadly distributed system of neurons, culture, context, etc etc. Possibly intractable, but our goal is to just try and talk about it all using the same language as we use in perception/action.
I'd lean in the same direction Eric has on this one - if there are sufficiently reliable relationships of interest they'll be in the broader social context rather than between anything and the brain. What use (as regards action-control) is knowing what someone's brain is up to? And why would they provide that information through speech? It seems language and speech are much more about the coordination of actions between users (this might be the same user at different times) than the coordination of actions and brains.
The Gibsonian anthropologist Tim Ingold puts it best, I think:
"We 'feel' each other's presence in verbal discourse as the craftsman feels, with his tools, the material on which he works; and as with the craftsman's handling of tools, so is our handling of words sensitive to the nuances of our relationships with the felt environment."
His 2000 book might have some useful essays for you, as regards task analyses of language, but will be a long way from the kinds of measurable specifics you'd like:
Ingold, T. (2000). The perception of the environment: essays on livelihood, dwelling and skill. London: Routledge.
What use (as regards action-control) is knowing what someone's brain is up to? And why would they provide that information through speech? It seems language and speech are much more about the coordination of actions between users (this might be the same user at different times) than the coordination of actions and brains.
Broadly I think this is true. This analysis is currently about a given act of language, so while the final grounding has to be further into the social dynamics, the speaker is using their nervous system, which is fully trained in those dynamics, to convey something. So at a given moment, it may be that the task is to convey information about some brain dynamics. In the bigger picture, the meaning of those dynamics is grounded in other things.
I'm agreeing that this is incomplete, just trying to refine the description of what's going on.
Matthew -
ReplyDeleteA deictic gesture may function like [an] utterance
In the case of a primitive "language" as in W's PI §2, it seems clear that the choice of medium is arbitrary. In comm theory terms, symbols can be modulated onto any available medium: eg, uttering the sounds "slab" or "block" for aural, holding up one or two fingers for visual, tapping someone on either the right or left arm for tactile. By mutual agreement, each symbol will be paired with a behavioral disposition. Eg, in PI §2 the builder and helper have agreed that a symbol indicates the item that the helper is to bring to the builder.
In PI §6, W suggests that the meaning of a "sentence" (equivalently, a symbol) has been understood if the helper responds in accordance with the builder's intent. I have found this an attractive way of thinking about "meaning".
Once you start thinking in those terms, you recognize that even if a much more complex language has a finite vocabulary and a simple enough grammar, all sentences of the language could in principle be encoded as a finite number of symbols, each of which - by mutual agreement - could be paired with a response (eg, a behavioral disposition). Again, the symbols (equivalently, the sentences) could be transmitted via any medium. This seems to argue against taking the meaning of language to be the dynamics of the production of the sentences, ie, the "modulation" of symbols onto media.
Of course, the grammar of a natural language supports an infinite number of sentences, so "table look-up" won't work in general. But since complex sentences are often constructed from simple sentences, the applicability to relatively unsophisticated language speakers doesn't seem obviously implausible. In fact, it's implicit in the phrase "knee-jerk response".
BTW, I obviously find that to "get all Wittgensteinian" is quite helpful in these matters - as well as to get all Davidsonian, Quinean, Sellarsian, et al.
Chris Green, one of the big players in history of psych, has a very nice series of podcasts about prominent people and events (at the bottom of the page here). I am teaching history of psych as a breadth class for the first time, and we are listening to several of them. This past week we listened to John Shook being interviewed regarding Dewey. He does a very good job explaining how one of the key elements of the American scene at the time was reversing the S-R formula, to point out that the person produced the movement that created the so-called "sensory input". He also emphasized how that reversal was key (for Dewey) in making the information "meaningful", because it meant that the stimulation was always, at least in part, about what you had done to produce the observed changes.
That same logic is certainly displayed in Gibson's work - perhaps most clearly in how he points out that the view-from-here-at-this-moment tells me at least as much about myself as about the world. That is, seeing the front of the TV as a top-heavy trapezoid tells me that I am in front of the TV, looking up at it, but that doesn't tell me much about the TV itself.
It occurred to me while listening that this is one of the things I think is missing from the TSM treatment of meaning.
---------------
P.S. Shook is a co-editor of the incipient Neuro-pragmatism book. I sent out another feeler recently as to the book's progress and will get back to you soon. Apparently, holding out for Oxford is a good way to delay publication.
P.P.S. The best part of the podcast is definitely the parallels drawn between the philosophical, psychological, and sociological implications of this way of thinking, which ties together the otherwise disparate-seeming aspects of Dewey's work. It is all about how actors create the responses of the world.