Spatial analogies pervade complex relational reasoning: Evidence from spontaneous gestures

Cooperrider, Kensy; Gentner, Dedre; Goldin-Meadow, Susan

doi:10.1186/s41235-016-0024-5

Original article
Open access
Published: 07 December 2016

Spatial analogies pervade complex relational reasoning: Evidence from spontaneous gestures

Cognitive Research: Principles and Implications volume 1, Article number: 28 (2016) Cite this article

3664 Accesses
9 Citations
8 Altmetric
Metrics details

Abstract

How do people think about complex phenomena like the behavior of ecosystems? Here we hypothesize that people reason about such relational systems in part by creating spatial analogies, and we explore this possibility by examining spontaneous gestures. In two studies, participants read a written lesson describing positive and negative feedback systems and then explained the differences between them. Though the lesson was highly abstract and people were not instructed to gesture, people produced spatial gestures in abundance during their explanations. These gestures used space to represent simple abstract relations (e.g., increase) and sometimes more complex relational structures (e.g., negative feedback). Moreover, over the course of their explanations, participants’ gestures often cohered into larger analogical models of relational structure. Importantly, the spatial ideas evident in the hands were largely unaccompanied by spatial words. Gesture thus suggests that spatial analogies are pervasive in complex relational reasoning, even when language does not.

Significance

Many phenomena in the natural and social world involve complex causal systems. Biological organisms, financial markets, ecosystems, and mechanical devices, for instance, all exhibit different types of feedback relationships, in which multiple causal factors in a system change and bring about changes in each other. Feedback systems are central to the STEM fields, as well as to urgent societal issues such as climate change. Here we investigate the possibility that people reason about such causal patterns in part by using spatial analogies—that is, spatial models of the relations involved. To investigate this possibility, we had people explain the differences between positive and negative feedback systems and examined their gestures. Gesture is ubiquitous in communication and classroom instruction, and it sometimes expresses spatial ideas not found in speech. We found that people gestured copiously when explaining the feedback systems, even though such systems are not inherently spatial. For example, people located different elements of the system in different parts of space, showed causal relationships as movements between locations, and modeled the overall behavior of the systems with complex movements. Interestingly, most of these spatial gestures were produced without spatial words. These findings suggest that spatial analogies may be a pervasive part of people’s on-the-fly reasoning about complex relational systems, and, further, that gesture may be an important medium through which such analogies are expressed. The pervasiveness of spatial analogy in reasoning about complex phenomena has implications for how such phenomena are taught in a range of fields.

Backround

Permafrost once covered much of the Arctic, but it is now thawing. This has a number of consequences, but one is especially insidious: as the frozen ground melts it releases methane, which then enters the atmosphere and accelerates the very warming that caused the ground to thaw in the first place (Anthony, 2009). This self-reinforcing pattern is an example of a complex causal system: multiple causal factors—in this case, atmospheric temperature, melting permafrost, and methane release—are changing over time and are causing changes in each other. Such systems are legion in the environmental sciences, but are hardly limited to them. Further examples can be found in financial markets, in the human body, in household appliances, in plant physiology, and beyond. Indeed, complex relational systems underlie phenomena throughout the natural and social world, in all domains and on all scales. Yet, despite the ubiquity of such systems—and their centrality to STEM (Science, Technology, Engineering, and Mathematics) fields and to urgent societal issues—much remains to be learned about the cognitive processes involved in understanding them. To fully grasp the alarming implications of permafrost melt, for example, we not only need to be able to understand the system as it is now, but also to project its behavior into the future. How do we do this?

Current evidence suggests that complex relational reasoning presents challenges even for adults. For example, undergraduates have considerable difficulty detecting higher-order causal patterns such as positive feedback and negative feedback, two important examples of complex relational systems (Rottman, Gentner, & Goldwater, 2012; see also Day, Motz, & Goldstone, 2015) and also the focus of the present paper. Expertise in identifying these and other causal patterns can develop, either through exposure to the same patterns across a range of domains (Rottman et al., 2012) or through a scaffolded process of comparing examples (Goldwater & Gentner, 2015). Considerable work has also examined the kinds of information that people use to make inferences about other types of causal structure in the world (Lagnado, Waldmann, Hagmayer, & Sloman, 2007; Sloman, 2005), and has detailed the challenges posed by complex systems in particular (Chi, Roscoe, Slotta, Roy, & Chase, 2012; Jacobson, Kapur, So, & Lee, 2011). An important open question, however, concerns the nature of the representations that people form as they reason about causal structure. What is the format of these representations and, moreover, how can we gain insight into them?

Possible clues come from research on how people understand complex systems more generally. Much previous research has investigated how people understand mechanical processes with multiple causal components, such as sets of gears and pulleys. A major finding of this line of work is that people often develop mental models of the system that are visuospatial in nature (Gentner & Stevens, 1983; Hegarty, 2004). One line of evidence for the visuospatial character of these models is that, when reasoning about or explaining such systems, people often produce diagrams (Forbus, Usher, Lovett, & Wetzel, 2011; Novick, 2001; Tversky, 2011) or gestures (e.g., Kang, Tversky, & Black, 2014; Nathan & Martinez, 2015; Schwartz & Black, 1996). Based on such observations, it seems plausible that people develop spatial mental models of other types of complex systems, such as the causal patterns under consideration here. However, there is a crucial difference between mechanical systems and positive and negative feedback systems. Positive feedback and negative feedback are consummate abstractions. They are relational patterns that may sometimes be instantiated in mechanical or concretely spatial systems—e.g., a flush valve toilet, which is an example of a negative feedback system—but their relational essence transcends any one concrete instantiation. It might thus seem unhelpful, or even counterproductive, to recruit visuospatial reasoning processes when thinking about such pure abstractions.

At the same time, a separate line of research has investigated how spatial concepts provide a foundation for more abstract ideas. This tendency can be seen in the spatial words and grammatical structures people draw on to talk about abstract domains, including time and others (Brugman, 1988; Clark, 1973; Heine, 1997; Jamrozik & Gentner, 2015; Traugott, 1978). In fact, evidence has now accumulated that this is not just a linguistic phenomenon—people draw on spatial representations when reasoning online about abstract concepts, whether or not language is involved (Boroditsky, 2001; Casasanto & Bottini, 2014). One vivid and naturalistic source of evidence for the use of space in abstract reasoning comes from the gestures people produce (Cienki, 1998; McNeill, 1992; Núñez, 2008). To date, perhaps the best-studied case of abstract spatial analogy in gesture—in which space is used to represent relations that are not inherently spatial—is the representation of a temporal sequence as a line. For instance, an English speaker might reference a future event in speech while locating it in gesture to their right, tacitly implying a relation between it and other unnamed events (for a review, see Cooperrider, Núñez, & Sweetser, 2014). Such temporal gestures use a simple spatial structure—a line—to capture a simple relational structure—order along a dimension—but they raise the possibility that people may create more complex spatial structures in gesture to represent more complex relational structures.

The above observations lead us to a hypothesis about how people reason about complex relational patterns like positive and negative feedback: they may do so, at least in part, by creating abstract spatial models of the relational structures involved—that is, spatial analogies. Furthermore, if this hypothesis is correct, then gesture should provide a powerful window onto this phenomenon. Gesture is well suited to the expression of spatial ideas (Alibali, 2005), and it has been shown to reveal implicit aspects of understanding that people have difficulty verbalizing (Broaders, Cook, Mitchell, & Goldin-Meadow, 2007; Goldin-Meadow, 2003). Moreover, it has been noted that the spatial information revealed in people’s abstract gestures sometimes goes beyond what is found in the language co-produced with those gestures (Casasanto & Jasmin, 2012; Cienki, 1998).

In the present studies, we explore this spatial analogy hypothesis by having people read a lesson about two types of complex relational systems—positive and negative feedback—and then explain the key differences between them. In our materials, we attempted to encourage highly abstract representations of such systems, devoid of the kinds of imagery that might prompt concrete spatial gestures. The most interesting possibility is that gesture might nonetheless reveal spatial analogies—that is, spatial models of relational structure—even though the relations involved in positive and negative feedback are not inherently spatial.

What would constitute evidence for such spatial analogies? A first kind of evidence would come from individual gestures that represent abstract relations spatially. These might include gestures representing simple relational structures, such as the use of a downward gesture to represent the notion of a decrease. Though interesting, such gestures do not necessarily go beyond prior observations about gestures representing simple relational structures, such as those described previously in the domain of time. More interestingly, people could represent complex relational structures within the confines of a single gesture. To our knowledge, analogical gestures of this more complex type have not been investigated in detail. A second kind of evidence may come from how gestures cohere over the course of an explanation. People may express complex relational structure—not in a single gesture—but by building up a spatial model of that relational structure over time. Gestural models have been observed in extended descriptions of concrete spatial layouts (Emmorey, Tversky, & Taylor, 2000), but gestural models of purely abstract relationships have not been described previously. Note, however, that the abstract nature of causal systems leaves open the possibility that people might not spatialize much of anything in their explanations. After all, gesture is thought to stem from vivid visuospatial or motoric imagery (e.g., Hostetter & Alibali, 2008), which was left out of our lesson by design.

If analogical gestures do occur, a secondary question concerns how those gestures relate to what people are saying when they produce them. If people’s abstract spatial gestures are most often produced along with corresponding spatial language—such as an upward gesture to represent an increase accompanied by “rise”, or a circular gesture when describing a feedback “loop”—gesture cannot be considered a uniquely revealing source of evidence for the presence of spatial analogies. Such gestures may be triggered by local lexical effects rather than by a stable, systematic spatial model of the system (see Gentner, 2001). However, to the extent that people’s gestures represent spatial ideas not found in speech, this would suggest that gesture plays a vital role in understanding the representations people draw on when reasoning about complex relational systems.

Study 1: Basic causal systems lesson

Methods

The study was approved by the University of Chicago Institutional Review Board, under protocol #IRB13-0248.

Participants

Twenty three adults from the University of Chicago community participated for course credit or cash. Four participants were excluded from the analyses: three because their gestures were largely occluded on the video; one for producing no gestures at all. In all, data from 19 participants (10 female; mean age = 20.8 years) are reported in the analyses.

Materials and procedure

Familiarization phase

After giving consent to participate and to be videotaped, participants carried out a series of activities that served both to familiarize them with causal systems and to assess their understanding of the systems. First, participants completed an adaptation of the Ambiguous Sorting Task (AST) used previously by Rottman et al. (2012) and Goldwater and Gentner (2015). In this type of sorting task, participants are given a set of vignettes printed on index cards and are asked to sort the cards into categories. Each vignette is an example of one of several types of causal systems (e.g., positive feedback) instantiated in one of several domains (e.g., economics). Participants are also given seed cards—vignettes just like those that need to be sorted but which serve as anchors for the categories to be used. A key feature of the task is that the seed cards leave the relevant categories open to interpretation: a participant may correctly categorize the vignettes according to the type of causal system described (e.g., positive feedback) or, more concretely, by the domain in which that system is couched (e.g., economics). In the adaptation of the AST used here, participants were presented with three seed cards: a first unlabeled card describing the phenomenon of stock market bubbles (a positive feedback system, economics domain), a second unlabeled card describing predator-prey relationships (a negative feedback system, natural world domain), and a third card that was blank except for the word ‘other.’ Participants were then presented with 11 new vignettes and were given five minutes to sort them. The vignettes described a range of phenomena, including internet routers, perspiration, interest rates, and so on, each instantiating a causal system.

After the sorting was complete, the experimenter removed the materials and prompted the participant to explain the main difference between the different categories involved in the sorting task. Most participants at least attempted to characterize differences in causal structure between the categories. This phase is the pre-lesson explanation. Together, the sorting task and the pre-lesson explanation served to familiarize participants with causal systems and perhaps prepared them to get more out of the lesson they would go on to study and explain (e.g., Little & Bjork, 2016).

Causal systems lesson

Next, participants were given a one-page written lesson (‘Causal Systems Lesson’) explaining the differences between positive and negative feedback systems (though without using those labels). The lesson was grounded in the seed cards used in the sorting task. It explained how the stock market and predator-prey vignettes exemplify different types of causal systems (see Appendix 1). The lesson also moved beyond the particular examples, characterizing in more abstract terms how each type of system involves different relationships between causal factors. Importantly, the lesson used no concrete spatial imagery. This decision was made to encourage participants to view the systems as abstractly as possible. It is well known that people gesture to represent concrete imagery (e.g., McNeill, 1992), but our interest is in the less studied question of how people gesture when viewing a phenomenon abstractly in terms of its causal structure. Additionally, the lesson used very little abstract spatial language. Participants were instructed to study the lesson for three minutes and were told that they would later be asked to explain it to another participant.

When the three minutes were up, the experimenter removed the lesson and brought in the ‘student’ (who was actually a confederate). The experimenter then prompted the participant as follows: “Please explain the lesson you just read. Go into as much detail as possible, but focus on the differences between the two types of causal systems”. The instructions made no mention of gesture. This phase is the post-lesson explanation.

Analysis

Videos of participants’ pre- and post-lesson explanations were transcribed and analyzed using ELAN video annotation software (https://tla.mpi.nl/tools/tla-tools/elan/). Participants gestured during both explanations. However, during the pre-lesson explanations, participants were highly variable in how much they talked (and thus how much they gestured), in what they talked about, and at what level of detail. The analyses reported here thus focus on the post-lesson explanations, which were more consistent across participants.

A first step in the gesture analysis was to identify all gestures in the post-lesson explanations that were “representational” (e.g., Chu, Meyer, Foulkes, & Kita, 2013). Representational gestures depict some property of a referent (commonly called “iconic” or “metaphoric” gestures), or point to a referent’s location (commonly called “deictic” gestures). In the present data, the representational gestures were abstract in nature—that is, they used location, movement, and spatial arrangement to depict ideas that themselves had no concrete location, did not actually move, and had no visible spatial properties. Once representational gestures were identified, they were then categorized into gesture types according to the type of system element they represented. We then classified the properties of these gestures, first considered individually and then considered in relation to other gestures in the explanation. Reliability was assessed by having a second coder classify the representational gestures in five randomly selected explanations (26% of the data), and it is reported with each analysis. The judgments of the primary coder were retained for all analyses.

Results

Sorting performance

Participants sorted the vignettes according to the appropriate causal structure the majority of the time (M = 0.62, SD = 0.22). The variability in sorting performance was limited, however, with three quarters of participants scoring above 50% (interquartile range = 0.50 to 0.76). This limited variability made it difficult to explore possible relations between participants’ sorting performance and their gestures, and such relations are not discussed further.

Spatial ideas in speech

Before analyzing the spatial ideas evident in people’s gestures, we first examined the spatial ideas evident in their speech. Based on analogy theory (Gentner, 1983; Gentner & Markman, 1997), as well as on pilot studies involving similar materials, we distinguished four elements of feedback systems that people mention in their explanations: 1) the factors involved in the system; 2) the changes those factors undergo; 3) the causal relations between the factors; and 4) the behavior of the whole system. We analyzed how often people’s references to these different aspects of the systems involved overtly spatial words. A word was considered spatial if the first dictionary definition for the word uniquely described a spatial idea, such as location, orientation, size, or motion (based on American Heritage Dictionary (5th edition); see Appendix 3 for further details). On this criterion, words such as “increase,” which has a first definition that is not uniquely spatial, were not considered spatial; however, words such as “growth,” which has a first definition referring uniquely to a change in size, were. In their post-lesson explanations, participants expressed these four system elements using spatial words an average of 6.4 (SD = 4.9) times per explanation.

Spatial ideas in gesture

Rates and types

On average, participants’ post-lesson explanations lasted 88.3 seconds (SD = 33.1) and involved 35.0 gestures (SD = 13.0), for a mean rate of 24.1 (SD = 4.4) representational gestures per minute speaking. The abstract nature of the Causal Systems Lesson thus did not stand in the way of eliciting representational gestures. Indeed, these rates are comparable to—or higher than—gesture rates reported in studies using concrete visual stimuli (e.g., Alibali, Heath, & Myers, 2001). The participants’ gestures spatialized the four elements of feedback systems in the following ways. First, people located the factors (e.g., the predator and prey populations in the negative feedback example) by placing their gestures in space or by pointing to locations. These we call factor reference gestures (see Fig. 1). Second, people represented changes to the factors (e.g., an increase in the predator population) as movements. These we call factor change gestures. Third, people represented causal relations in the system (e.g., how the change in the predator population causes a change in the prey population) as movements, sometimes between previously established locations. These we call causal relation gestures. Fourth, people used movements to characterize the behavior of the system as a whole (e.g., the equilibrium that is reached in the predator-prey system). These we call whole system gestures. People also gestured in a variety of other ways, such as to spatially contrast the two systems they read about (with the first system in one location and the second in another), or to illustrate an aspect of the system that was not one of the intended causal factors (such as the increase in investor confidence when stock values rise).

Some gestures were unclear and could not be definitively classified into one of the four categories or the ‘other’ category. Of those gestures that both coders judged to be codable (80% of gestures analyzed), 79% (N = 177, Cohen’s K = 0.67) were assigned to the same category.

Overall, the majority (476/666, or 71%) of participants’ representational gestures fell into one of the four spatial types, with an average of 25.1 (SD = 9.0) such gestures per explanation.^{Footnote 1} In other words, participants spatialized the system elements in gesture (25.1 per explanation) almost four times as often as they spatialized them in speech (6.4 per explanation). Further details about the counts and rates observed for each gesture type are given in Table 1. Subsequent analyses focus on those gestures that fit into the categorization scheme.

Table 1 Gesture counts and rates by type

Full size table

Properties of gestures considered individually

We next analyzed the properties that these four gesture types exhibited, starting with the spatial axis employed in each gesture: left-right, toward-away, up-down, a combination of axes (e.g., left-right and up-down), or no single axis. Factor reference gestures were analyzed for the axis along which the factors were located; factor change gestures, causal factor gestures, and whole system gestures were analyzed for the axis along which movement was represented. Coders agreed 89% (N = 114, Cohen’s K = 0.84) of the time in judging the spatial axis involved.

The vast majority of factor reference gestures located the factors on the left-right axis (202/248, or 81%), most often with the first-mentioned factor on the left and the second-mentioned factor on the right. Factor change gestures were more spatially variable, with 21% (37/176) depicting increases and decreases as movements along the left-right axis, 29% (51/176) along the toward-away axis, 26% (45/176) along the up-down axis, and the remaining 24% (43/176) involving either some combination of these axes (e.g., simultaneously upward and rightward) or a more complex movement. Causal relation gestures most commonly (24/32, or 75%) depicted causation as movement along the left-right axis. The spatial properties of whole system gestures were much more variable (see below), with the majority not representing movement along a single axis (11/20, or 55%).

Factor reference gestures locate abstract entities in distinct spatial locations. They either do this explicitly, as when locating both factors simultaneously (Fig. 1a), or implicitly, as when locating a single factor on one side of space and implying a contrast with an unrepresented factor (Fig. 1e). The other three types of gestures represent abstract relations spatially and thus provide a first type of evidence that spatial analogies are at play. Factor change gestures represent changes in quantity (a relation between an entity at one point in time and the same entity at another point in time) as movements. Causal relation gestures represent a higher-order relation (that a change in one factor results in a change in the other factor) as an action through space. Finally, whole system gestures represent the system’s behavior through time as movement.

Importantly, these gestures differ in the complexity of the relations represented. In particular, whole system gestures summarize an entire system over time—not just a single relation such as an increase—and thus would be expected to be more spatially complex. To examine this possibility, we analyzed the type of stroke involved in the different gesture types. The stroke is the part of the gesture that is meant to represent what is being described (McNeill, 1992). The canonical gesture stroke is simple in that it involves a single movement phase proceeding in a single direction, but gesture strokes can also be more complex in several ways. In the case of concrete iconic gestures, complex gesture strokes might occur, for instance, when someone traces the outline of a square, shows a repetitive action such as hammering, or depicts alternating action such as two weights oscillating on a scale (for discussion of different stroke types, see McNeill, 1992, p. 308; Kita, Van Gijn, & Van der Hulst, 1998, 2014). We considered gesture strokes complex if they had multiple movement phases (see Fig. 1d, h). Some strokes were unclear with respect to complexity; for those gestures that both coders considered codable (97% of gestures analyzed), agreement as to whether the stroke was simple or complex was 96% (N = 137, PABAK = 0.91).^{Footnote 2}

As predicted, whole system gestures involved complex strokes in 45% of cases (9/20), compared to 13% (58/456) for the other gesture types (Fig. 2). While factor change gestures, for instance, usually involve single, smooth movements along particular spatial axes, whole system gestures involve incremental upward movements, widening spirals, alternating upward and downward movements, and so on (Fig. 3). We also examined the number of participants who favored simple or complex strokes for the different gesture types. A significant number of participants favored simple strokes for factor reference (19/19, two-tailed exact binomial test, p < 0.001), factor change (17/18, two-tailed exact binomial test, p < 0.001), and causal relation gestures (10/13, two-tailed exact binomial test, p = 0.03). In contrast, a non-significant majority of participants favored complex strokes for their whole system gestures (5/9, two-tailed exact binomial test, p = 0.25).

Properties of gestures considered across the explanation

We next considered properties of people’s gestures across their entire explanation. Our primary interest was whether participants’ gestures were integrated into a larger analogical model—in other words, whether people produced an interconnected system of spatial gestures to represent an interconnected system of entities and relations. This kind of parallel interconnected structure is characteristic of analogical mapping (Gentner, 1983). If a participant maintains consistent, contrasting spatial locations for the factors, these gestures would be considered model-integrated. By contrast, if a participant locates the factors erratically from one mention to the next, these gestures would not be considered model-integrated. Further, if a participant produces a gesture representing an increase to a factor that incorporates the location of that factor as used over the course of the explanation, the gesture would be considered model-integrated. If, on the other hand, the increase was depicted in neutral space, the gesture would not be considered model-integrated. Similarly, if a participant produces a gesture representing a causal relation between two factors as a movement between the locations of the two factors as used over the course of the explanation, the gesture would be considered model-integrated (for examples, see Fig. 1c and g). Alternatively, if the gesture represented causation as a movement in neutral space, it would not be considered model-integrated. Note that we did not expect whole system gestures, which depict a high-level summary of the behavior of the system, to be closely integrated with the low-level causal details depicted in the other gestures.^{Footnote 3} This gesture type was thus not included in the analysis. Importantly, model-integration goes beyond mere spatial consistency. A speaker could produce spatially consistent gestures when representing, for instance, the notion of increase, without ever integrating those increase gestures into a larger model.

Examples of two different gesture sequences are illustrated in Fig. 4, one with factor change gestures that are model-integrated and one with factor change gestures that are not. For some gestures it was hard to make a judgment about whether or not they were model-integrated, and they were coded as unclear. For those factor reference, factor change, and causal relation gestures that both coders judged to be codable (85% of gestures analyzed), agreement as to whether the gestures were model-integrated or not was 91% (N = 95, PABAK = 0.81).

Overall, 90% (224/248) of participants’ factor reference gestures were model-integrated (mean percentage = 90%), as were 49% (86/176) of their factor change gestures (mean percentage = 53%), and 72% (27/32) of their causal relation gestures (mean percentage = 64%). All participants (19/19) produced at least one model-integrated factor reference gesture, 84% (16/19) of participants produced at least one model-integrated factor change gesture, and 47% (9/19) produced at least one model-integrated causal relation gesture. Three participants produced only model-integrated gestures in their explanations: in one case, 14 gestures (three factor reference, nine factor change, two causal relation); in another, 15 gestures (five factor reference, ten factor change); and in another, 12 (all factor reference gestures). Thus, although sometimes gestures expressed unconnected flashes of spatial imagery (see Fig. 4, sequence 2), they often cohered over the course of an explanation into a systematic analogical model.

Gestures in relation to speech

Finally, we analyzed the relationship between participants’ gestures and the language with which they were co-produced. Most often, in 94% of cases (448/476), the gestures represented aspects of the system that were simultaneously mentioned in speech. For example, a participant would produce a factor reference gesture while referring in speech to “the first factor” or a factor change gesture while mentioning an “increase”. Interestingly, however, gestures sometimes filled in where speech left off. Of the 20 whole system gestures, three (15%) were produced without actually verbalizing the system behavior. For example, a speaker describing a positive feedback system said, “so it’s like this sort of…” trailing off in speech but providing a spatial characterization in gesture. Conveying information in gesture and not in speech occurred numerically less often for the other gesture types: in 5% of cases for factor reference gestures, 5% for factor change gestures, and 9% for causal relation gestures. The higher rate observed for whole system gestures may stem from difficulty in verbalizing the overall system dynamics. However, given the small number of observations, more data are needed to confirm this pattern.

Lastly, we investigated the relationship between the different gesture types and overt spatial language. If the gesture’s verbal affiliate—the word or phrase that expresses the same meaning as the gesture—included a spatial word, the gesture was classified as occurring with spatial language; otherwise, it was classified as occurring with non-spatial language. Table 2 provides examples of both spatial and non-spatial language that was co-produced with the different gesture types.

Table 2 Examples of spatial and non-spatial language co-produced with gestures

Full size table

Of the gestures that represented aspects of the system that were simultaneously mentioned in speech, the overwhelming majority (84% (403/448)) were co-produced with non-spatial language. Given the paucity of spatial ideas expressed in speech overall, as described earlier, this result is unsurprising. However, the different gesture types varied in how often they were produced with spatial language. For example, factor reference gestures, though consistently involving the left-right axis, were not once co-produced with a reference to “left” or “right” and were only co-produced with a spatial word in a single instance (“external variable”). The other gesture types did occur with spatial language, although less than half the time: factor change 32%, causal relation 45%, and whole system behavior 35%. Overall, these results confirm that much of the spatial content in people’s explanations comes out not in their words, but in their hands.

Discussion

When explaining positive and negative feedback systems (two examples of complex relational patterns), participants recruited spatial representations in abundance. Elements of these systems were expressed spatially in gesture almost four times as often as they were expressed spatially in speech. These spatial gestures provided several types of evidence that people were drawing extensively on spatial analogy. First, participants produced individual gestures that used space to represent abstract relations. These included both simple relations, as in factor change gestures that represented increases and decreases as movements along axes, and complex relations, as in the whole system gestures used to summarize the behavior of the entire system. Second, considered in relation to each other, participants’ gestures were often integrated into larger analogical models, as when depicting a causal relationship as movement between previously established locations. Finally, the spatial analogies as revealed in gesture very often occurred without spatial language and, in some cases, without any language at all. Spatial analogies are thus pervasive in spontaneous gesture during complex relational reasoning, and are not merely a manual echo of spatial speech.

One limitation of Study 1 was that, although the lesson was largely devoid of rich imagistic content, it did include a sprinkling of abstract spatial words. For example, the words “direction” and “reversed” were used to describe changes in the system. Importantly, participants produced the same basic gesture types in their pre-lesson explanations as in their post-lesson explanations—thus, the lesson did not seem to tip participants off to the idea of spatializing aspects of the system. Nonetheless, it remains possible that the presence of spatial language increased participants’ propensity to spatialize their mental models of the system. To address this possibility, in Study 2, we removed the remaining spatial language from the Causal Systems Lesson. If the abundant use of analogical gestures in Study 1 was cued by the spatial language in our lesson, these behaviors should be considerably dampened. A second aim of Study 2 is to again examine whole system gestures (which were relatively infrequent in Study 1)—specifically, whether we will again find that they tend to involve complex strokes.