Learning strategy impacts medical diagnostic reasoning in early learners

Sheldon, Signy; Fan, Carina; Uner, Idil; Young, Meredith

doi:10.1186/s41235-023-00472-3

Original article
Open access
Published: 09 March 2023

Learning strategy impacts medical diagnostic reasoning in early learners

Signy Sheldon¹,
Carina Fan²,
Idil Uner¹ &
…
Meredith Young³

Cognitive Research: Principles and Implications volume 8, Article number: 17 (2023) Cite this article

2155 Accesses
Metrics details

Abstract

Relating learned information to similar yet new scenarios, transfer of learning, is a key characteristic of expert reasoning in many fields including medicine. Psychological research indicates that transfer of learning is enhanced via active retrieval strategies. For diagnostic reasoning, this finding suggests that actively retrieving diagnostic information about patient cases could improve the ability to engage in transfer of learning to later diagnostic decisions. To test this hypothesis, we conducted an experiment in which two groups of undergraduate student participants learned symptom lists of simplified psychiatric diagnoses (e.g., Schizophrenia; Mania). Next, one group received written patient cases and actively retrieved the cases from memory and the other group read these written cases twice, engaging in a passive rehearsal learning strategy. Both groups then diagnosed test cases that had two equally valid diagnoses—one supported by “familiar” symptoms described in learned patient cases, and one by novel symptom descriptions. While all participants were more likely to assign higher diagnostic probability to those supported by the familiar symptoms, this effect was significantly larger for participants that engaged in active retrieval compared to passive rehearsal. There were also significant differences in performance across the given diagnoses, potentially due to differences in established knowledge of the disorders. To test this prediction, Experiment 2 compared performance on the described experiment between a participant group that received the standard diagnostic labels to a group that received fictional diagnostic labels, nonsense words designed to remove prior knowledge with each diagnosis. As predicted, there was no effect of diagnosis on task performance for the fictional label group. These results provide new insight on the impact of learning strategy and prior knowledge in fostering transfer of learning, potentially contributing to expert development in medicine.

Significance statement

Diagnostic reasoning is a complex task that requires retrieving information from a variety of sources, including previously encountered patient cases as well as established prior knowledge. An essential skill for diagnostic reasoning and a core component of expertise is the ability to effectively transfer learned information from prior cases to diagnose a new case. In this report, we leveraged findings from classic cognitive psychological research to show that engaging novice diagnosticians in active retrieval when exposed to patient cases enhanced the ability to transfer of learning to novel cases. We also found that prior knowledge about diagnoses affected the ability to engage in transfer of learning, indicating that prior knowledge gained outside of a training context affects diagnostic reasoning. Together, our results provide new insights in the role of learning and knowledge on guiding the reliance on previous cases for diagnosis. These findings further our understanding of how case-based knowledge, and rehearsal strategy, can support medical learners in developing diagnostic expertise.

Introduction

In medicine, diagnostic reasoning requires learners to gain new knowledge about diseases as well as efficiently apply that knowledge to new situations to make diagnoses, referred to as transfer of learning (Barrows & Feltovich, 1987; Boshuizen & Schmidt, 1992; Norman, 2005; Woods, 2007). Expertise in diagnostic reasoning has been characterized in a variety of different ways—from accuracy (Eva, 2005; Monteiro et al., 2019; Norman et al., 2007; Wood, 2014) to speed Sherbino et al., 2012) to adaptability (Croskerry, 2018; Mylopoulos & Woods, 2009)—and consistent across these definitions is the idea that expert diagnosticians effectively engage in transfer of learning. Thus, an important question to ask is how to effectively facilitate transfer of learning in order to promote the development of expertise in early medical learners. To answer this question, we turned to research in cognitive psychology that has demonstrated that transfer of learning is engaged when information is learned actively and through experience (for a review, see Roediger & Butler, 2011). Thus, the aim of the current study was to explore how promoting an active retrieval strategy when learning diagnostic cases (i.e., exemplars of patients that require diagnoses) affects transfer of learning in early learners.

A historical finding in cognitive psychology is that repeatedly rehearsing information during learning enhances memory for the rehearsed information, which is more likely to be used to guide subsequent decisions (Ebbinghaus, 1885). However, the way that information is rehearsed during learning—the strategy implemented—is a determining factor of the effects on memory (Craik & Lockhart, 1972). Research has noted a distinction between passive rehearsal and active retrieval learning strategies (Karpicke & Roediger, 2008; Nairne, 1986; Roediger & Butler, 2011). Whereas passive rehearsal involves repeating information during encoding (e.g., rote memorization), active retrieval involves transforming or manipulating information in the mind. An example of active retrieval is practicing recall (testing) right after encoding. The evidence suggests that engaging in active retrieval during learning creates a stronger and more flexible memory trace than engaging in passive rehearsal, often referred to as a retrieval practice or testing effect (Roediger & Butler, 2011). For example, a landmark study had participants learn a series of vignettes, either by repeatedly restudying the vignettes or by answering questions about the vignettes, therefore engaging in active retrieval. The participants were tested for their ability to recall as well as apply information they learned from the vignettes one week later. Only the vignettes that were learned via answering questions, a form of active retrieval, improved participants’ ability to recall as well as apply what they learned from the vignettes (Butler, 2010; also see, Butler et al., 2017). A recent meta-analysis revealed that, across several learning situations including medical diagnoses, engaging in active or elaborated rehearsal strategies during learning is a determining factor for how well a person can later answer concept-based and application-based questions (Pan & Rickard, 2018).

Although the benefit of active retrieval has been explained in several ways, all of these explanations share the idea that an active learning strategy effectively engages particular episodic memory processing during learning (Gureckis & Markant, 2012). According to multiple memory systems models (e.g., Ashby et al., 1998; Schacter & Tulving, 1994), different types of representations of experiences are processed in distinct modules with different properties. Within the episodic memory system, representations of past events can be formed at a general and flexible level or as a very rote “reproductive” representation. Classic memory theory suggests that learning with active retrieval, in comparison to passive rehearsal, promotes the creation of a generalized memory representation—one that captures the gist aspect of an event (Underwood, 1969; Reyan & Brainerd, 1995). This has been confirmed with more recent cognitive neuroscientific findings illustrating that forming generalized memory representations imbue mnemonic flexibility that are more easily be applied to new scenarios, supporting transfer of learning (Eichenbaum & Cohen, 2001). Moreover, engaging in active retrieval might also help promote a stronger incentive to engage in the acquired material that rote rehearsal further promoting flexibility in the use of memory, as predicted by motivation-cognitive theories (Maddox & Markman, 2010).

Transfer of learning is a central component of case-based reasoning (CBR), where one solves a current task by retrieving a similar past scenario (Kolodner, 1992). CBR is often employed in real-world reasoning scenarios that are not clearly defined (i.e., no established specific means to reach a solution), and it is also a core component of modern medical education (Eshach & Bitterman, 2003). CBR is a frequent and well-used approach in medical education as one way to gain practice recognizing and translating patient-described symptoms that are more opaque than learned lists into the clinical language of signs and symptoms (Dore et al., 2012; Lingard et al., 2003; Young et al., 2007).

Research has suggested that CBR is one way to engage in diagnostic reasoning as it flexibly draws on previous patient cases that have some similarity in symptoms or characteristics with a current case to help shape diagnostic reasoning (Eva, 2005; Norman, 2005; Young et al., 2007, 2011). Indeed, several reports have shown that expert diagnosticians increasingly rely on previous experiences to solve current problems (Dore et al., 2012; Eva, 2005; Norman, 2005; Sherbino et al., 2012), indicating that CBR as a learning tool could lead to more expert-like behaviour in diagnostic reasoning. In fact, a recent report described the efficacy of engaging in CBR for medical learners. This study found that students that learned via case-related readings and simulated patient cases (via the use of actors) showed significant improvements on a clinical assessment, due to an enhanced ability of the students to engage in transfer of learning, when compared to a control group (Turk et al., 2019; although see Himmelbauer et al. (2018) for a discussion of the importance of affect in determining the benefit of simulated cases on medical learning).

The main aim of the current study was to unite the above-described lines of research to explore how the learning strategy used in CBR (active versus passive rehearsal) impacts transfer of learning during a diagnostic reasoning task in early learners. Specifically, we tested the hypothesis that transfer of learning will be enhanced when learners engaged in active retrieval, compared to passive rehearsal, during a diagnostic task using written patient cases. To test this hypothesis, we implemented a between-subjects experimental design in which we manipulated participants’ strategy when learning about case vignettes (Experiment 1). One group of participants studied example cases by reading and then freely recalling the cases from memory (active retrieval) and another group studied these example cases via reading the cases twice in the practice session (passive rehearsal). Across the groups, we controlled key factors known to alter learning, such as the presence of feedback and exposure (Roediger & Butler, 2011). In our design, we used “real-world” diagnostic labels (e.g., Schizophrenia, Mania) that learners likely have different levels of familiarity with or pre-existing conceptual knowledge about. Following theories that suggest that established familiarity and knowledge with a concept can affect associated memory and reasoning tasks (Gilboa & Marlatte, 2017), we further explored for differences in transfer of learning across diagnoses. Following results from this exploratory analysis that indicated the presence of diagnostic label differences in transfer of learning, we conducted a second Experiment that tested if these differences across diagnostic labels would remaining without the presence of real-world labels, effectively removing access to established familiar knowledge of the diagnoses (Ashby & Maddox, 1993; Bordage & Zacks, 1984; Brooks, 1978; Brooks et al., 1991; Hatala et al., 1999; Hintzman, 1986; Medin, 1989; Medin & Schaffer, 1978; Young et al., 2007).

Experiment 1

Design overview

This experiment included three phases (Fig. 1): a learning phase in which participants learned a list of symptoms associated with the four diagnoses included in the experiment; a practice phase in which one group of participants (active retrieval group) learned and recalled detailed descriptions of case vignettes and another group (passive rehearsal group) instead read these vignettes twice, gaining similar exposure to the example cases without active engagement; and a test phase in which all participants categorised new “test” case vignettes. These test vignettes were associated with two equally valid diagnoses: one diagnosis that was supported by two familiar symptom instantiations (i.e., case-specific detailed descriptions of symptoms) drawn from an earlier practice case, and one diagnosis supported by two novel symptom descriptions.

Participants

In order to study the influence of practice approach on early learners, we invited novices (i.e., those with no formal undergraduate medical education training) to participate in this study. Sixty-four entry-level undergraduate psychology student participants were recruited from McGill University’s Psychology Human Participant Pool. All participants were fluent in English and had normal or corrected-to-normal vision. Tested participants were excluded from analysis if they withdrew (n = 1), had a history of major head injury, seizures, or disability (n = 5), had implausibly short test times (i.e., 2 SDs below the mean; n = 2), or did not complete the task as instructed (i.e., when asked to recall details from patient case vignettes, they instead inferred the diagnoses; n = 1). In all, 48 females and 7 males were included for analysis, with ages ranging from 18 to 34 (M = 20.8 years, SD = 2.3). The active retrieval group included 28 participants (24 female), and the passive rehearsal group included 27 participants (24 female). Informed consent was obtained from all participants.

Stimuli

The stimuli set for the learning phase included four symptom lists associated with four common psychiatric diagnoses (mania, schizophrenia, paranoid personality disorder (PPD), and obsessive compulsive disorder (OCD); Table 1), each adapted from the diagnostic rules listed in the Diagnostic and Statistical Manual for Mental Disorders (4th ed., tex rev.; DSM-IV-TR; American Psychiatric Association, 2000; similar to those used in Young et al., 2007, 2011; material available upon request). There was also a written example case vignette for each diagnosis. The stimuli set for the practice phase included three written example case vignettes for each diagnosis that contained all four symptoms, presented in a unique manner (e.g., the symptom “hallucinations” would be presented as “she is following something with her eyes that no one else can see”) and contained personally identifying, specific episodic content (e.g., name, age, type of employment, familial situation). Finally, the stimuli set for the test phase included 12 test case vignettes that were designed to contain two equally probable diagnoses—each case contained novel personally identifying “patient” information, as well as two familiar symptom instantiations drawn from an earlier practice case supporting one diagnosis, and two novel symptom descriptions supporting another diagnosis. Across the 12 test cases, all four diagnoses were equally paired with every other diagnosis, and all diagnoses appeared as both the familiar and novel instantiated features.

Table 1 Medical diagnoses and symptoms used, adapted from Young et al., (2007, 2011)

Full size table

Procedure

All materials were presented on a computer screen, programmed with RunTime Revolution Version 2.5 (RunTime Revolution Ltd, Edinburgh, UK).

Learning phase

This phase involved learning the four diagnoses in an order randomly assigned to each participant. For each diagnosis, participants first studied the four associated symptoms in list form (e.g., the symptoms for mania were: increased energy, decreased sleep, inflated self-esteem, and more talkative). After learning the symptoms for a diagnosis, participants took a quiz in which they had to identify the four symptoms of the diagnosis from a list of 16 symptoms. If they did not correctly identify all four symptoms, participants re-studied the symptom list for that diagnosis and took the quiz again; this process was repeated until they passed the quiz. Participants then were shown an example case of each diagnosis which contained all four of the associated symptoms, and they identified the symptoms by typing them into text boxes. They were then shown the correct symptoms in the text boxes, and relevant text in the case vignette was highlighted. Once all four diagnoses had been studied in this manner, participants were shown the full list of 16 symptoms and had to identify the four symptoms for each diagnosis. Participants needed to correctly match 15 of the 16 symptoms to the corresponding diagnosis to advance to the practice phase. This ensured that all participants were equally familiar with the diagnoses as presented in the experiment, prior to the practice phase.

Practice phase

Participants were randomly assigned to either the active retrieval or passive rehearsal group. Both groups were shown 12 case vignettes that contained unique instantiations of all four symptoms learned during the learning phase, as well as other episodic details that were unrelated to any diagnosis (see Fig. 1 for an adapted example vignette). Both groups of participants first studied a case and reported their diagnoses by typing a percent likelihood (i.e., diagnostic probability) in a text box beside the name of each diagnosis. Participants were told to distribute their percentages as they saw fit, with the only restriction being that they must sum to 100%. They were also asked to report the symptoms they thought were relevant for the diagnosis in each case by typing the symptoms into text boxes. Participants were free to report either the symptom label (e.g., “more talkative”) or the symptom in its “instantiated” form (e.g., “incredibly fast-talking”) and received feedback regarding the correct diagnosis (i.e., the diagnosis that was represented in the case). Participants in the passive rehearsal group saw the example case vignette again and assigned probabilities a second time. The active retrieval group was shown a blank text box and asked to type all the details they could remember from the example case vignette. These participants were instructed that no detail was too small to remember, and they had no time limit. After recalling these details, they assigned probabilities to the four possible diagnoses for this case. This process was repeated until participants had seen and diagnosed all 12 example case vignettes, which were presented in random order.

Test phase

In this phase, both participant groups were presented with the same set of 12 test case vignettes in random order. As outlined in the above description of experimental stimuli, each test case included two familiar symptom descriptions that were drawn from one of the cases from the practice phase (i.e., the symptoms were described in the same form as in the cases), as well as two “novel” symptom descriptions in the context of a written case vignette. These novel symptom descriptions had not been seen by participants before and were unique instantiations of the learned symptom lists associated with each diagnosis. For each case, participants were again asked to report their diagnoses by assigning a percent likelihood to each of the four diagnoses (diagnostic probability). If participants are referencing all four symptoms equally to assign these percentages, then they should assign diagnostic probabilities as: 50% to the diagnosis supported by the familiar symptom descriptions, and 50% to the diagnosis supported by the novel symptom descriptions. However, if a participant is biased towards using information from the familiar practice cases, then there will be a deviation in the diagnostic decision from 50:50 in favour of the diagnosis supported by the familiar descriptions (aligned with Young et al., 2007, 2011). Thus, our outcome variable was the percentage assigned to each diagnosis, or diagnostic probabilities. The diagnostic probability assigned to the diagnosis supported by familiar symptom descriptions was our metric of transferring learning from the example cases. The order of the stimulus materials was always randomised across participants within each phase of the experiment.

Statistical analysis

The primary analyses of interest were linear mixed models that estimated the diagnostic probabilities that participants assigned to the test cases, modelled as a function of diagnostic decision (that supported by familiar or novel symptom instantiations), diagnosis (mania, OCD, PPD, schizophrenia), experimental group (active retrieval; passive rehearsal), and the interactions between these variables, with a random intercept for participant. Independent t-tests on the average time spent in each experimental phase were conducted between the groups.

Results

Independent t-tests on the average time spent within each phase between the groups confirmed no significant difference in the time spent during the learning phase (t(45) = 1.435, p = 0.158) nor the test phase (t(45) = 0.650, p = 0.519), yet a difference in the time spent during the practice phase (t(45) = 8.250, p < 0.001). Those in the active retrieval group (M = 366.6 s, SD = 1.22 s) spent longer in this phase than those in the passive rehearsal group (M = 148.4 s, SD = 4.30 s), which was not unexpected given the task demands of the active retrieval versus passive rehearsal experimental conditions.

Focusing on the results from the test phase, a linear mixed model was constructed to estimate diagnostic probability assigned to test cases with the factors of group, diagnostic decision, and diagnosis. Since practice time was different between the groups, practice time was included as a covariate in the model. This model revealed three statistically significant effects (Table 2). First, there was a main effect of diagnostic decision, such that participants, regardless of group, assigned higher diagnostic probabilities to the diagnosis supported by familiar symptom descriptions (M = 53.4, SD = 24.0) than the diagnosis supported by novel symptom descriptions (M = 42.7, SD = 24.4). Second, the factor of diagnostic decision interacted with experimental group. Compared to the passive rehearsal group, the active retrieval group assigned an even higher probability to the diagnosis supported by the familiar symptom descriptions than to the diagnosis supported by novel symptom descriptions (Fig. 2). Finally, and somewhat surprisingly, there was an interaction between diagnostic decision and diagnosis across both groups. Pairwise contrasts between levels of diagnostic decision showed that only the OCD contrast was statistically significant, χ²(1) = 8.01, p = 0.02, all other ps > 0.58, such that when OCD was the diagnosis supported by familiar symptom descriptions, participants assigned a higher probability to OCD, suggesting some role of knowledge from outside of the experimental context interacting with diagnostic labels. To explore this effect, Experiment 2 was conducted.

Table 2 Model parameter estimates from Experiment 1

Full size table

Experiment 2

Findings from Experiment 1 suggest that transfer of learning on a diagnostic reasoning task is enhanced when patient cases are actively retrieved compared to when they are passively rehearsed. Results from this Experiment also revealed that performance on the diagnostic reasoning task differed across the given diagnostic labels, such that familiar symptoms were more likely to contribute to an OCD diagnosis than other diagnostic labels. One possible explanation is that the diagnoses included in Experiment 1 differed in terms of how much knowledge participants had about these diseases prior to the experiment. Prior work has indicated that previous experience with a given diagnosis does influence diagnostic reasoning (Dore et al., 2012; Eva, 2005; Norman, 2005; Sherbino et al., 2012). As well, research has found familiar stimuli, those with established memory representations, are more likely to facilitate memory and reasoning than less familiar stimuli (e.g., Reder et al., 2012). Thus, we hypothesized that removing differences in familiarity or pre-existing knowledge among disorder labels should reduce any distinctions in the use of the associated disorder cases for the diagnostic reasoning. To test this hypothesis, we ran Experiment 2 in which we compared diagnostic reasoning when participants were given the established labels of diagnoses to when they were given fictional diagnoses, effectively removing the ability to access associated representations.