Using analogy to learn about phenomena at scales outside human perception

Resnick, Ilyse; Davatzes, Alexandra; Newcombe, Nora S.; Shipley, Thomas F.

doi:10.1186/s41235-017-0054-7

Original article
Open access
Published: 20 March 2017

Cognitive Research: Principles and Implications volume 2, Article number: 21 (2017)

Abstract

Understanding and reasoning about phenomena at scales outside human perception (for example, geologic time) is critical across science, technology, engineering, and mathematics. Thus, devising strong methods to support acquisition of reasoning at such scales is an important goal in science, technology, engineering, and mathematics education. In two experiments, we examine the use of analogical principles in learning about geologic time. Across both experiments we find that using a spatial analogy (for example, a time line) to make multiple alignments, and keeping all unrelated components of the analogy held constant (for example, keep the time line the same length), leads to better understanding of the magnitude of geologic time. Effective approaches also include hierarchically and progressively aligning scale information (Experiment 1) and active prediction in making alignments paired with immediate feedback (Experiments 1 and 2).

Significance statement

Many fundamental science, technology, engineering, and mathematics (STEM) phenomena occur at extreme scales that cannot be directly perceived. For example, the Geologic Time Scale, discovery of the atom, size of the universe, and rapidly developing field of nanotechnology are all based on phenomena occurring at scales that humans cannot directly experience. Unfortunately, novices have trouble reasoning about phenomena outside human perception, at least in part, because they do not understand the relative magnitudes at these scales. Across two experiments we develop and test two successful instructional analogies designed to teach geologic time. These findings add to our theoretical understanding of how people reason about scale, and the role of analogical reasoning and active prediction in learning. We find that reasoning about different kinds of magnitude (that is, temporal and abstract) at different scales (that is, inside and outside human perception) shares properties, and that people are able to use a spatial representation of temporal magnitude to develop a more accurate understanding of geologic time. Our findings suggest that structural alignment and having multiple opportunities to make alignments are critical in supporting reasoning about scale, and that the progressive and hierarchical organization of scale information may provide salient landmarks for estimation. Finally, we found that active prediction and corrective feedback are valuable in fostering a linear representation of magnitude. These findings have practical implications, as they can guide development of supports for the acquisition of reasoning at extreme scales, which is an important goal in STEM education.

Background

How can we improve students’ reasoning about large magnitudes?

Skills in reasoning about size and scale are central to performance across STEM disciplines (for example, Hawkins, 1978; Tretter, Jones, Andre, Negishi, & Minogue, 2006). Many fundamental science, technology, engineering, and mathematics (STEM) phenomena occur at extreme scales that cannot be directly perceived. For example, a core geologic concept is that geologic events can last billions of years (for example, the Earth formed approximately 4.6 billion years ago). Reasoning about geologic time allows geologists to reconstruct the surface conditions of ancient Earth, produce an accurate time line of Earth’s history, and understand the imperceptibly slow processes that have led to the current environment. Practically, understanding geologic time allows people to reason about the sustainability of non-renewable resources and the consequences of anthropogenic climate change. Given the importance of reasoning about scale, it should be no surprise that the National Research Council in A Framework for K-12 Science Education (National Research Council, 2011) and the American Association for the Advancement of Science in Benchmarks for Science Literacy (American Association for the Advancement of Science, 1993) identified “size and scale” as fundamental scientific concepts, and suggested them as a unifying theme in science education.

Unfortunately, novices have trouble reasoning about phenomena outside human perception. Although novices are sometimes reasonably accurate at ranking phenomena in a correct sequence, they have difficulty comparing the magnitude between phenomena at extreme scales (for example, Delgado, Stevens, Shin, Yunker, & Krajcik, 2007; Jones, Tretter, Taylor, & Oppewal, 2008; Libarkin, Anderson, Dahl, Beilfuss, & Boone, 2005). For example, although students are fairly accurate in identifying a correct sequence of events in Earth’s geologic history (Trend, 2001), they fail to understand the magnitude of time between events (Tretter et al., 2006).

Analogy is potentially valuable for learning and reasoning about phenomena outside human perception because such phenomena cannot be directly experienced (Jones, Taylor, & Broadwell, 2009). Analogy refers to a process of aligning structural similarities between a base concept and a target concept (Gentner, 1983). In fact, analogy is frequently used to teach phenomena at extreme scales (for example, Clary & Wandersee, 2009; Petcovic & Ruhf, 2008; Semken et al., 2009; Sibley, 2009). Unfortunately, even with a wide range of analogies, students struggle to comprehend phenomena outside human perception (for example, Delgado et al., 2007; Jones et al., 2008; Libarkin et al., 2005). Moreover, analogies can mislead students (Brown & Salter, 2010; Duit, 1991). For example, geologic time is often represented using a spatial analogy that compresses the time before life on Earth becomes relatively more complex. Although functional, learning from this nonlinear representation can mislead students into thinking that biologic events occurred very early in the Earth’s history (Resnick, Davatzes, Newcombe, & Shipley, 2016; Resnick, Newcombe, & Shipley, 2016).

Different kinds of spatial analogies may elicit different kinds of cognitive barriers to aligning extreme scales with human scales (for review see Resnick, Davatzes, et al., 2016). For example, a common analogy is to map an extreme scale (for example, Earth’s history) onto a spatial structure, such as the Eiffel Tower (Clary & Wandersee, 2009). However, without knowledge of the base concept (How tall is the Eiffel Tower?), it is difficult to identify corresponding relations between the base concept and target concept (for example, Gentner, 1983; Kotovsky & Gentner, 1996). It can also be difficult to identify the relevant relations to align if the base concept and target concept are different in many ways (Gentner, 1983; Gentner, 2001; Markman & Gentner, 1996, 1997; Kokinov & French, 2003). For example, Earth’s history is also commonly mapped onto a 24-hour clock. However, this analogy contains at least two differences in addition to differences in magnitude (24 hours versus billions of years): clocks are cyclical whereas Earth’s history is linear, and clocks are composed of equal temporal divisions whereas geologic time comprises unequal temporal divisions based on major geologic events. Thus, students may not be able to identify the appropriate analogy to make and, subsequently, draw incorrect conclusions (Brown & Salter, 2010; Gentner, 1983). In this instance, students may erroneously believe that, just like the 24-hour clock, periods of Earth’s history are also evenly spaced, and fail to make the appropriate analogy between relative magnitudes of time between events.

A review of analogical reasoning literature offers three techniques that may be useful in the development of effective analogies for phenomena at extreme scales. First, the base concept and target concept should be structurally aligned; that is, as similar as possible with just one “alignable difference” (Goldstone, 1994; Markman & Gentner, 1993a, 1993b; Medin, Goldstone, & Gentner, 1993). An alignable difference is a common relation shared by the base concept and target concept which differs along one dimension. In the case of aligning an extreme scale with a human scale, both scales should be constructed in the same format (for example, a linear number line), with the only difference being magnitude.

However, because the difference in magnitude between extreme and human scales is so vast, it may not be possible for the base concept and target concept to be structurally aligned. Thus, a second technique to align very different concepts is called progressive alignment (Kotovsky & Gentner, 1996; Thompson & Opfer, 2010). Using progressive alignment, an analogy may consist of more than one analogical step, beginning with a comparison of a base concept and a highly similar intermediate concept. Comparing two very similar concepts as an intermediate analogical step will help extend the analogy to the subsequent alignment of increasingly unfamiliar concepts (Gentner & Namy, 2006). For example, instead of mapping Earth’s history directly onto a spatial time line, the analogy may first map a human lifespan, and increase the amount of time the spatial time line represents in separate analogies (for example, American history, human evolution, and so on) until all of Earth’s history is included in the spatial time line. In learning about scale information, progressive alignment may alleviate conceptual dissimilarity by providing structural alignment across smaller increases of scale (Resnick, Davatzes, et al., 2016).

Finally, a third technique is called hierarchical alignment (Resnick, Newcombe, et al., 2016). Hierarchical alignment advocates the hierarchical organization of all analogical steps within each new analogy to highlight common relational structures between the base, intermediate, and target concepts. In the Earth’s history example, the learner would identify where each previous division in time (for example, human lifespan) would be located relative to the new spatial time line (for example, American history). This process of hierarchical alignment provides salient internal anchor points, which emphasize corresponding relations between each analogical step (Resnick, Newcombe, et al., 2016). Hierarchical alignment specifically supports learning about phenomena outside human perception by highlighting the proportional relation between magnitudes across multiple scales.

In addition to the principles of analogical reasoning discussed above, immediate (Coulter & Grossen, 1997) and corrective (Hao, 1991; Sharpe, Lounsbery, & Bahls, 1997) feedback has also been found effective in increasing student learning. In particular, corrective feedback has been found to be effective for learning about unfamiliar magnitudes in young children (Thompson & Opfer, 2010). Even a single trial of feedback can increase estimation accuracy by providing learners with a salient anchor (Opfer & Siegler, 2007; Opfer & Thompson, 2008; Opfer, Thompson, & Kim, 2016; Thompson & Opfer, 2008).

The current studies

In the current studies we ask whether analogies can foster accurate reasoning about phenomena at scales outside human perception and, if so, which techniques for teaching scale information are most efficient and effective. While the instructional analogies developed here could be designed for use in teaching any magnitude-based context, we examine analogical reasoning in the context of a large temporal magnitude: geologic time.

Over two experiments we develop two instructional analogies using a combination of structural alignment, progressive alignment, and hierarchical alignment. Of importance, all three techniques provide multiple opportunities to practice making relevant analogies. Thus, both Experiments 1 and 2 examine the efficacy of providing multiple opportunities to align geologic time to a spatial linear representation using structural alignment to improve understanding and reasoning about geologic time. Across both experiments, students are also provided with corrective feedback. A key difference between Experiments 1 and 2 is that Experiment 1 also assesses the benefit of hierarchical and progressive alignment, whereas Experiment 2 assesses structural alignment and corrective feedback alone, without progressive or hierarchical alignment, in an effort to devise a more time-efficient means of intervention.

Experiment 1

The hierarchical alignment activity, an instructional spatial analogy that employs structural, progressive, and hierarchical alignment, increased accuracy in reasoning about temporal and spatial magnitudes in a lab-based setting (Resnick, Newcombe, et al., 2016). Here, we assess the efficacy of the hierarchical alignment activity in a classroom setting. Learners begin by aligning a familiar scale to a linear spatial representation (a number line). They are then provided with multiple opportunities to align increasingly larger and unfamiliar scales to the spatial linear representation (progressive alignment). Structural alignment is maintained by keeping the length of space the same for each analogical step; only the magnitude of time changes. In this activity, every time students align a new temporal scale to space, they are also asked to locate all previous scales relative to the current scale (hierarchical alignment). After completing each intermediate analogy, learners are provided with corrective feedback.

The hierarchical alignment activity is contrasted with a conventional spatial analogy for geologic time called a stratigraphic column. A stratigraphic column is a spatial representation of the vertical location and age of rock units. Of importance, the hierarchical alignment activity and the stratigraphic column activity differ only in the use of analogical reasoning principles (structural, progressive, and hierarchical alignment) in their presentation of geologic time. Students in both activities learned the names and magnitudes of geologic time periods by aligning time to space. Thus, if the hierarchical alignment activity fosters more accurate understanding and reasoning about geologic time than the conventional stratigraphic column activity, this would suggest the importance of structural, progressive, and hierarchical alignment in fostering more accurate reasoning about phenomena at scales outside human perception.

Methods

Participants

The participants were 107 students (49 in the experimental group and 58 in the control group) enrolled in an undergraduate introductory-level geoscience course at a large urban university. Although the demographics of the participants could not be obtained, students were randomly assigned to either condition, which accounts for any individual differences. Demographic information was obtained in subsequent semesters of this same course during Experiment 2. While there may be some variation from semester to semester in demographic composition, Table 1, which contains information gathered in Experiment 2, provides characteristic demographic information of this general education course aimed primarily at non-majors. The geoscience course had twice weekly lectures and a weekly laboratory period. All lectures were given by the same faculty member; the students were divided into eight sections for the laboratory period. One teaching assistant (TA) covered four sections and two TAs covered two sections each.

Table 1 Demographics of enrollment by class and condition in Experiment 2

Intervention activity

In the hierarchical alignment activity, students aligned time to space (a 1-meter ruler) beginning with a familiar scale: their own personal time line. The students then aligned successively longer intervals of time to the same 1-meter space. Ten time lines were constructed, each ending at the present day and extending backwards in time to include: personal history, an average human lifespan (from 75 years ago), American history (520 years ago), recorded history (5512 years ago), human evolution (6 million years ago), Cenozoic Era (65 million years ago), Phanerozoic Eon (542 million years ago), Proterozoic Eon (2.5 billion years ago), Archean Eon (3.8 billion years ago), and Hadean Eon (4.6 billion years ago – the full Geologic Time Scale). To populate and relate each time line, students indicated the time line’s length, located specific events, and located where all previous time lines would begin on the current time line (see Fig. 1). To determine where specific events and previous time lines were located, the students had to calculate how many years each centimeter represented, and then how many centimeters were needed to represent a given event or time line. The students made these calculations using two equations, which were provided. Students received corrective feedback as required to make accurate time lines.
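The two calculations described above can be sketched in a few lines of Python. This is only an illustration: the article does not reproduce the two equations given to students, so the function names and the exact form of each formula are assumptions. The spans are the ones listed in the activity description; the personal time line is omitted because its length differs for each student.

```python
# Illustrative sketch only: function names and formulas are assumptions,
# not the equations actually handed to students in the study.
RULER_CM = 100.0  # every time line is drawn on the same 1-meter ruler


def years_per_cm(span_years, ruler_cm=RULER_CM):
    """Years represented by one centimeter of the ruler."""
    return span_years / ruler_cm


def position_cm(years_ago, span_years, ruler_cm=RULER_CM):
    """Distance (cm) from the present-day end at which an event falls."""
    return years_ago / years_per_cm(span_years, ruler_cm)


# Spans (years before present) taken from the activity description.
TIMELINES = [
    ("human lifespan", 75),
    ("American history", 520),
    ("recorded history", 5512),
    ("human evolution", 6e6),
    ("Cenozoic", 65e6),
    ("Phanerozoic Eon", 542e6),
    ("Proterozoic Eon", 2.5e9),
    ("Archean Eon", 3.8e9),
    ("Hadean Eon", 4.6e9),
]


def hierarchical_marks(current_index, timelines=TIMELINES):
    """Where each previous time line would begin on the current one."""
    _, span = timelines[current_index]
    return {
        name: position_cm(start, span)
        for name, start in timelines[:current_index]
    }


if __name__ == "__main__":
    # Locate every earlier time line on the full 4.6-billion-year ruler.
    for name, cm in hierarchical_marks(len(TIMELINES) - 1).items():
        print(f"{name}: {cm:.6f} cm from the present")
```

Under these assumptions, the whole Phanerozoic Eon occupies under 12 cm of the full 1-meter time line, and recorded history occupies far less than a millimeter, which is the proportional relation the activity is meant to make visible.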

Control activity

In the control activity, students also aligned time to space by completing a stratigraphy laboratory. Stratigraphy is a branch of geology concerned with the order, relative position, and ages of rock layers (strata). During the stratigraphy laboratory, students learned about the age and distribution of rock types and the types of environments in which those rocks are formed by making and examining stratigraphic columns. A stratigraphic column is a spatial representation of the vertical location and age of rock units.

Of importance, stratigraphic columns involve aligning geologic temporal information to space. Thus, students in both the intervention and control conditions received practice aligning geologic time to space and exposure to magnitude information. The important difference between the intervention and control conditions is the use of structural, progressive, and hierarchical alignment in the presentation of magnitude information.

Measures

To assess students’ understanding of geological scales, students were assessed on three questions and four number line estimations that were added to their regularly scheduled laboratory examination. Two questions assessed understanding of the magnitude of geologic time. The first item, referred to here as the "Geoscience Concept Inventory question," was from the Geoscience Concept Inventory. The Geoscience Concept Inventory is a multiple choice assessment instrument of geoscience understanding, which has been validated and is unbiased relative to demographic variables (Libarkin & Anderson, 2006, 2007). The Geoscience Concept Inventory question is commonly used to assess an individual’s understanding of geologic time on a linear scale (for example, Libarkin et al., 2005; Libarkin & Anderson, 2007; Petcovic & Ruhf, 2008; Teed & Slattery, 2011). This item contains five multiple choice response options, shown as five time lines with the same four geologic events in different locations. Four of the time lines represented common student misconceptions (response option A – life occurred when Earth formed, option B – humans and dinosaurs coexisted, option C – dinosaurs appeared much earlier than they did, option E – all life formed at the beginning of Earth’s history), and one time line showed the events in the correct relative locations (option D). Students were asked to choose the most correct time line. Incorrect response options A and B reflect relatively small magnitude errors (that is, the magnitude of error is less than 1 billion years) while incorrect response options C and E reflect relatively large magnitude errors (that is, the magnitude of error is greater than 2 billion years).

The second item assessing understanding of geologic magnitude, referred to here as the Geologic Time Scale diagram question, is a measure of geologic time developed for use with middle school students as part of a large-scale project being conducted by the 21st Century Center for Research & Development in Cognition & Science Instruction (Barghaus & Porter, 2010). This item is a multiple choice item that requires students to identify which duration-based statement is true using a conventional diagram of the Geologic Time Scale. Two of the incorrect response options reflect incorrectly reading the direction of time (response option C – The Jurassic Period ended 205 million years ago; response option D – The Pre-Archean Eon is the most recent time span). The correct choice is option A (The Proterozoic Eon lasted much longer than the Phanerozoic Eon). While numerical information is provided in the diagram, the correct choice may not be obvious to novices in the standard diagram because the spatial intervals of the eons do not proportionally correspond to their temporal lengths. This type of compressed representation is how the Geologic Time Scale is typically depicted, and serves a functional purpose in the field. In past work the most commonly chosen incorrect response was a statement that is consistent with the visible spatial intervals (response option B – The Phanerozoic lasted much longer than the Proterozoic).

The participants also completed a knowledge-based question that did not require an understanding of relative magnitude. Thus, this item served as a control for potential group differences (for example, motivation). This knowledge-based question was taken from the 21st Century Center for Research & Development in Cognition & Science Instruction study (Barghaus & Porter, 2010), and asked participants to identify when mammals were the dominant land animal.

A fourth kind of assessment examined transfer effects to estimation of large abstract numerical magnitude. Here we use number line estimations. Siegler and Booth (2004) noted that number line estimations are thought to be ecologically valid, as students often make number lines in class. A 2-year longitudinal study by Jordan et al. (2013) found performance on a number line estimation task to have a high internal consistency (Cronbach’s alpha 0.89). Students were presented with four horizontal number lines. The horizontal number lines were 10 cm long, with the right labeled 0 and the left labeled 4.6 billion. Students were asked to locate the following numerical values on the respective number lines in the following order: 230 million, 65 million, 3.5 billion, and 6 million. These magnitudes were chosen to be a numerical analog to the Geoscience Concept Inventory item (humans appear, 6 million years ago; dinosaurs disappear, 65 million years ago; dinosaurs appear, 230 million years ago; and life appears, 3.5 billion years ago). These number line estimations are abstract because no units (time or space) were indicated.
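To make the scoring of this task concrete, the following sketch computes where each of the four values should fall on a 10 cm line running from 0 to 4.6 billion. It assumes linear scaling and measures distance from the end labeled 0; the function names and the signed-error convention are hypothetical and are not taken from the article's scoring procedure.

```python
# Illustrative sketch only: names and the error convention are assumptions.
LINE_CM = 10.0    # length of each printed number line
LINE_MAX = 4.6e9  # value labeled at the far end of the line

# The four values students were asked to locate, in presentation order.
ITEMS = {
    "230 million": 230e6,
    "65 million": 65e6,
    "3.5 billion": 3.5e9,
    "6 million": 6e6,
}


def target_cm(value, line_cm=LINE_CM, line_max=LINE_MAX):
    """Linearly correct distance (cm) from the end labeled 0."""
    return value / line_max * line_cm


def signed_error_cm(estimate_cm, value):
    """Positive when the mark is placed too far from 0 (an overestimate)."""
    return estimate_cm - target_cm(value)


if __name__ == "__main__":
    for label, value in ITEMS.items():
        print(f"{label}: {target_cm(value):.3f} cm from the 0 end")
```

On a line this short, three of the four targets sit within half a centimeter of the 0 end, so even a mark placed almost on the endpoint yields a small absolute error.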

Procedure

Prior to the intervention, the researcher met with the main instructor and TAs. The TAs described their prepared lessons on geologic time, and were instructed to not change their lessons in any way. The overarching aim of the experiment was then described: the development of a spatial analogy to teach geologic time. No details were given regarding the specific hypotheses of the intervention. The intervention and control activities were administered during a laboratory period on stratigraphy. The students in the intervention condition participated in the hierarchical alignment activity (1.5 hours) after a shortened stratigraphy laboratory (30 minutes). The guest lecturer and TA did not compare the hierarchical alignment activity and stratigraphy laboratory; they were presented as entirely separate activities. The students in the control condition completed the full stratigraphy laboratory (2 hours). Students were randomly assigned to either the intervention or control condition so that both conditions were evenly distributed across the TAs to control for instructor-based differences. All students completed the outcome measures 1 month after the stratigraphy laboratory as part of a laboratory examination.

The intervention was conducted by the first author as a guest lecturer. Fidelity of implementation was achieved by having the first author develop and administer the intervention. The control activity was the full stratigraphy laboratory administered by the course TAs, as part of their normal course instruction. Both the intervention and control conditions also received instruction on the Geologic Time Scale as part of the normal class curriculum in addition to the hierarchical alignment activity and stratigraphy laboratory. Students were required to memorize the major divisions in the Geologic Time Scale. Furthermore, before beginning a lecture on a new time division, the students were shown a conventional image of the Geologic Time Scale, with the respective time division(s) highlighted (see Fig. 2). Students also learned other concepts that are explicitly related to geologic time; students completed two fossil laboratories, which included identifying fossils from different divisions in time. Thus, students across conditions had multiple opportunities to compare different representations of geologic time; the only difference being that the intervention condition had the hierarchical alignment activity whereas the control condition had additional stratigraphic problems within the stratigraphy laboratory.

Results

The intervention conditions were administered in half of each TA’s sections of the course and the control conditions in the other half. No differences among the TA sections were found on any of the outcome measures and, thus, subsequent analyses compared the intervention and control conditions across TA sections. The intervention group performed better than the control group on the two items that assessed understanding of geologic magnitude. First, on the Geoscience Concept Inventory question, students in the intervention group were significantly less likely to make large magnitude errors than students in the control group, χ²(1) = 6.08, p = .01, Φ = .24. That is, they were significantly less likely to choose C, a large magnitude error and the most common error, χ²(1) = 7.35, p = .01, Φ = .26. An effect size for two binary variables (Φ) of .20 to .40 is considered a moderate association (Rea & Parker, 1992). See Fig. 3 for distribution of student responses. However, the groups did not differ in choosing the completely correct option, χ²(1) = .07, p = .79.

Second, on the Geologic Time Scale diagram question, which required students to identify the true duration-based statement using a conventional Geologic Time Scale diagram, the intervention group (37% correct) was more accurate than the control group (30% correct), χ²(1) = 3.99, p = .05, Φ = .19. In addition, students in the intervention group were less likely to select a visually misleading item that constitutes the most common error (48% of the time), whereas students from the control group selected it 69% of the time, χ²(1) = 4.41, p = .04, Φ = .20. Of importance, the intervention and control groups did not differ significantly on the knowledge-based test item, which did not require an understanding of magnitude, χ²(1) = .03, p = .86.
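For readers who want to check the reported effect sizes, Φ for a 2 × 2 table can be recovered from the chi-square statistic as Φ = √(χ²/N). The sketch below applies that standard formula to the four significant tests reported above. It assumes that all 107 participants contributed to each test, which the article does not state explicitly; the function name and dictionary keys are hypothetical.

```python
import math

# Assumption: every one of the 107 participants is included in each test.
N_PARTICIPANTS = 107


def phi_from_chi2(chi2, n=N_PARTICIPANTS):
    """Phi coefficient for a 2 x 2 table (1 degree of freedom)."""
    return math.sqrt(chi2 / n)


# Chi-square statistics as reported in the Results (labels are ours).
REPORTED_CHI2 = {
    "GCI large magnitude errors": 6.08,
    "GCI option C": 7.35,
    "Time Scale diagram accuracy": 3.99,
    "Time Scale diagram misleading option": 4.41,
}

if __name__ == "__main__":
    for label, chi2 in REPORTED_CHI2.items():
        print(f"{label}: phi = {phi_from_chi2(chi2):.2f}")
```

Under that assumption, the recomputed values round to the Φ values reported in the text.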

Transfer was assessed by the line estimation items. Here, there were no significant differences in performance between the experimental and control groups (t(105) = .02, p = .93). Failure to find differences may have resulted from the task being too easy. Mean errors (mainly overestimations) were low for both groups, ranging from .62 to 1.61 cm with standard deviations from .97 to 2.13. The observed high accuracy may have been due to the selection of items. Values near boundaries tend to show more accurate estimations than values that are distant from a boundary (Haun, Allen, & Wedell, 2005; Huttenlocher, Hedges, & Prohaska, 1988; Shipley & Zacks, 2008); six million and 65 million are both close to the beginning of the geologic number line, and 3.5 billion is close to 3/4 of the number line (breaking the number line up into quadrants). Thus, while these items were designed to be aligned with the geologic events in the Geoscience Concept Inventory item, in their numerical form they may not have been ideal for capturing variance in the underlying representations.

Discussion

Students who completed the hierarchical alignment activity demonstrated a better sense of the relative durations of geological events and a reduction in the magnitude of temporal location errors relative to the control group on the two items assessing geologic time. Of importance, the intervention and control groups did not differ significantly on the knowledge-based item, indicating that the hierarchical alignment activity acted on the understanding of the magnitude of geologic time, and did not necessarily simply increase effort or motivation. Notably, these effects were evident 1 month after the intervention, clearly indicating the effect was durable. Nevertheless, these positive findings need to be tempered by the fact that the intervention, although more effective than regular class instruction, did not take students to desirable levels of accuracy.

Given the context of a geology classroom we were unable to systematically align two aspects of the intervention condition (hierarchical alignment) and control condition (stratigraphy laboratory only). A guest lecturer presented the intervention condition; however, due to scheduling constraints, the regular TA presented the control condition. This may have resulted in other unaccounted differences between the intervention and control conditions. For example, the guest lecturer may have introduced novelty and may have been more motivated to have the students learn from their carefully designed intervention. The hierarchical alignment activity also contained information on a student’s personal time line, a human lifespan, American history, and human evolution that was not covered in the control condition. Arguably, this may have benefited the control condition, because they spent more time working with the geologic time periods they were ultimately assessed on. However, this was not the case; beginning with non-course content improved students’ understanding of geologic time divisions despite spending less time working with them. Regardless, our findings were consistent with Resnick, Newcombe, et al. (2016), who were able to more tightly control for such differences in their laboratory-based assessment of the hierarchical alignment activity, and found the hierarchical alignment activity was effective for fostering more accurate reasoning about phenomena at extreme scales.

Another limitation of Experiment 1 was the limited measures for assessing understanding of geologic time (two items) and the high accuracy on the items selected for the line estimation task. Thus, one option for further investigation would be to replicate the intervention in Experiment 1 with more sensitive dependent variables. However, there was an additional, more practical, concern: a 1.5 hour intervention would be impractical for wide adoption in an already packed curriculum. Therefore, in Experiment 2, we had three goals:

(1) Develop and test a spatial analogy that includes multiple opportunities to align time to space while maintaining structural alignment (as in Experiment 1).
(2) Develop and test an activity that could be more easily integrated into classrooms.
(3) Develop more sensitive measures.

Experiment 2

Experiment 2 was conducted over 2 years in the same introductory-level geoscience course as in Experiment 1. In Experiment 2, the course was taught by two instructors as separate classes. The instructors worked together to make the two classes as similar as possible; they developed materials together (slides, lectures, curricular sequence, exercises, and examinations), followed the same schedule of topics, and used the same textbooks and laboratory activities. However, there was a critical difference in the way in which they administered the spatial analogy activity developed for assessment here (detailed below). Consequently, we can conceptualize the interventions administered in either class as separate interventions, which we will refer to as the clicker feedback activity and the linear visualization activity. All comparisons between the intervention and control are thus made within activity/instructor and not between activities/instructors. Taken together, the results from Experiments 1 and 2 provide evidence of the effect of spatial analogies using structural, progressive, and hierarchical alignment paired with corrective feedback, as instantiated within each instructor’s teaching style.

The clicker feedback activity and the linear visualization activity both share many features with the hierarchical alignment activity assessed in Experiment 1. All the activities provide multiple opportunities to align time to a spatial linear representation using the same amount of space for each alignment (structural alignment) and provide students with corrective feedback. Of importance, the clicker feedback activity and linear visualization activity differ from the hierarchical alignment activity by not progressing from small familiar scales to geological scales (progressive alignment) and not hierarchically organizing all previous scales within the current scale (hierarchical alignment). Rather, the clicker feedback activity and the linear visualization activity ask students to align the divisions of the Geologic Time Scale to a linear scale in their sequential order. In addition, the exercises offered spaced practice in time estimation, which is important because spacing effects are a well-established principle in the optimization of learning (Pashler et al., 2007).

In the clicker feedback activity students align the non-linear representation of the Geologic Time Scale from their textbook with a linear time line, and are provided with corrective feedback using the clicker response system (Turning Technologies, LLC, 2013). The clicker response system is a handheld electronic device that can be used to answer multiple choice questions. Such electronic devices have been found to improve learning and engagement, particularly when paired with immediate feedback (Kay & LeSage, 2009). Submitting a clicker response involves the student in making a specific prediction, which is then confirmed or disconfirmed; this is a process that has also been found to improve understanding (for example, Howe, Rodgers, & Tolmie, 1990).

In the linear visualization activity, only the corrective feedback information about the Geologic Time Scale was presented visually as a linear representation. That is, students did not make a prediction about how the Geologic Time Scale from their textbook would align with the linear time line. Rather, they saw only the final image that aligned the two scales. They did not use the clicker response system.

More, and more sensitive, measures were also developed for Experiment 2. Twelve items were designed to assess reasoning about geologic content that required magnitude knowledge. A more sensitive number line task was developed, using a spatially longer time line and numbers farther away from salient boundaries. Finally, examination-level performance and demographic information were obtained for the sample.