Verifying unfamiliar identities: Effects of processing name and face information in the same identity-matching task

Trinh, Anita; Dunn, James D.; White, David

doi:10.1186/s41235-022-00441-2

Original article
Open access
Published: 12 October 2022

Verifying unfamiliar identities: Effects of processing name and face information in the same identity-matching task

Cognitive Research: Principles and Implications volume 7, Article number: 92 (2022) Cite this article

2437 Accesses
11 Altmetric
Metrics details

Abstract

Matching the identity of unfamiliar faces is important in applied identity verification tasks, for example when verifying photo ID at border crossings, in secure access areas, or when issuing identity credentials. In these settings, other biographical details—such as name or date of birth on an identity document—are also often compared to existing records, but the impact of these concurrent checks on decisions has not been examined. Here, we asked participants to sequentially compare name, then face information between an ID card and digital records to detect errors. Across four experiments (combined n = 274), despite being told that mismatches between written name pairs and face image pairs were independent, participants were more likely to say that face images matched when names also matched. Across all experiments, we found that this bias was unaffected by the image quality, suggesting that the source of the bias is somewhat independent of perceptual processes. In a final experiment, we show that this decisional bias was found only for name checks, but not when participants were asked to check ID card expiration dates or unrelated object names. We conclude that the bias arises from processing identity information and propose that it operates at the level of unfamiliar person identity representations. Results are interpreted in the context of theoretical models of face processing, and we discuss applied implications.

Significance statement

Face-matching tasks for unfamiliar faces are prevalent in many important applied settings, for example, passport screening and security checkpoints. Existing research has identified a tendency for novices and professional staff in these settings to make “match” biases when presented with unfamiliar face pairs in identity documents. This “match” bias can have detrimental impacts on border and national security, such as allowing fraudulent identity documents to be processed. Understanding the mechanisms and causes of these biases enables future research to develop a means of mitigating these biases. Here, we found that individuals were more likely to conclude that an unfamiliar face pair is a “match” after being shown matching name information, even when this information was irrelevant for the face-matching task. This bias appears to be specific to matching name information, suggesting that it is related to the automatic construction of identity representations. This result has implications for the design of workflow systems in applied settings where people verify the identity of unfamiliar people.

Background

Matching the identity of unfamiliar facial images is an important component of real-world identity verification and identity management, and human performance on these tasks has implications for forensic investigations, criminal trials, and security settings. In spite of this, face-matching errors are quite common with standard participant groups making 20–30% errors on average despite optimal viewing conditions (e.g. Bruce et al., 1999; Burton et al., 2010). More problematically, these error rates are observed in tests of practitioners who perform face matching in their daily work, for example in passport control, police and security settings (White et al., 2014, 2020).

Compounding these high error rates, recent work has shown that biases can be induced by extraneous visual elements that are often present in real-world tasks. For instance, participants are more likely to make a “same face” decision when one of two facial images in a face-matching task is embedded in a passport frame (McCaffery & Burton, 2016). A similar “match” bias has also been observed in other forms of photo ID, such as driving licenses and student ID cards (Feng & Burton, 2019). This initial research suggests that contextual information can negatively impact the outcome of unfamiliar face-matching decisions, either through the reduction in overall face-matching accuracy or the generation of response biases.

The underlying causes of contextual information bias on face matching have not been explored systematically. However, early evidence appears to suggest that the presence of biographical information is an important factor. For example, while the validity of biographical information presented on an ID card does not impact face-matching accuracy (McCaffery & Burton, 2016), removing the biographical information from the ID card appears to remove the match bias (Feng & Burton, 2019).

Understanding why biographical information has been found to trigger the “match bias” is critical for applied settings. For example, when processing passport applications, staff often have to review biographical information like names, addresses and date of birth to determine whether they match existing records. Similar parallel processing of identity cues is common in other settings, for example police investigation, and so it is practically important to understand perceptual and cognitive causes of bias in face-matching decisions.

The question of how biographic information is interactively processed with perceptual information is also important theoretically. The Interactive Activation and Competition (IAC) model (Burton et al., 1990) provides a mechanistic account of how face and other personal information may be aggregated in person identity judgments. Although the IAC is intended to model the representation of familiar people, it can also be adapted to explain how unfamiliar identities are mentally represented. Central to this model is the idea of a “person identity node” (PIN) that aggregates input received from perceptual face information, identity-level details (such as a person’s name), and semantic information (such as a person’s nationality or occupation). Identification decisions occur at the level of PINs, with pooled activation from semantic, name, and perceptual inputs producing person recognition once a certain threshold of PIN activation is achieved. This means that the likelihood of recognising a familiar face is increased when a familiar name is mentioned, or when the face is presented with semantic information associated with the identity, for example when the US president appears beside an American flag.

Associative identity networks akin to the IAC model could also influence the processing of unfamiliar faces in applied settings. A recent review of neuroscientific evidence suggests that networks of brain areas responsible for encoding semantic and perceptual person information are both activated when we initially encounter faces (Kovacs, 2020; see also Shoham et al., 2021, Todorov et al., 2007). In addition, experiments by Menon et al. (2015) show that the formation of identity representations of unfamiliar faces are influenced by linking with identity labels (see also Dunn et al., 2021), and Schwartz and Yovel (2016) found improved facial recognition accuracy when unfamiliar faces are associated with name labels during learning. Together, this evidence suggests that the processing of identity-specific details in real-world identity verification tasks, such as name information, is likely to influence concurrent face-matching performance.

Here we report a series of experiments that were designed to test whether matching names biases subsequent face-matching decisions, and whether the mechanisms of such a bias align with an associative identity network account. Four experiments were designed to measure these biases, examine whether they operate at the level of person identity representations (Experiment 2) and identify whether they are also induced by tasks that require matching non-biographical information present on identity cards (e.g. card expiry date, Experiment 4).

We also examined whether the strength of contextual biases is modulated by the quality of perceptual information (Experiments 1–3). Perceptual ambiguity can be introduced to unfamiliar face images in multiple ways and often leads to decrease face-matching accuracy. Examples include pixelation (Bindemann et al., 2013), poor lighting (Johnston et al., 1992), and greater camera-to-subject distances (Noyes & Jenkins, 2017). However, to our knowledge, there is no existing research exploring how such decreases in facial image quality affect the processing of unfamiliar faces.

The processing of other perceptually ambiguous visual stimuli has consistently been found to correlate with increased activation in brain regions associated with top-down processing (Heekeren et al., 2008; Li & Yang, 2012; Maksimenko et al., 2020; also see Karimi-Rouzbahani et al., 2021). This increased activation may be interpreted as an increased reliance on contextual information to aid in disambiguating uncertain visual stimuli (e.g. Klink et al., 2012). A relationship between perceptual ambiguity and contextual reliance has been observed in object recognition (Oliva & Torralba, 2007), action recognition (Wurm & Schubotz, 2017), and across a range of different visual tasks (Dror et al., 2005; Qi et al., 2018). For instance, individuals are more likely to conform to the decisions of collaborators when performing perceptually difficult discrimination tasks (Qi et al., 2018). If unfamiliar faces are perceptually processed in a similar manner to other visual stimuli, we would expect an interaction between perceptual ambiguity and contextual biases. According to this prior work, the more perceptually ambiguous a facial image is to process, the greater the contextual bias should be. On the other hand, influential face processing models propose additive contributions of perceptual and semantic information in person identification decisions (Bruce & Young, 1986; Burton et al., 1990). If these contributions are additive as existing models suggest, then according to the logic of additive factors (see e.g. Sternberg, 1969, 2011), we would not expect to see an interaction between the amount of perceptual evidence and the effect of context.

Clearly, understanding how the magnitude of contextual biases interacts with image quality is important on a theoretical level in disentangling contrasting theories about how such an interaction occurs with unfamiliar faces. However, it is also important in an applied sense, because face identification decisions are often made based on low-quality CCTV images in criminal trials and investigations (Davis & Valentine, 2009; Edmond et al., 2010; Porter, 2009; Walker & Tough, 2015).

Experiment 1

In Experiment 1, we examined two questions: firstly, whether matching name information biases subsequent face-matching decisions; and secondly, whether this bias is increased when there is greater perceptual uncertainty. We asked participants to complete sequences of name and face-matching decisions. For half of the total trials, the name pair type was consistent with the face trial type (e.g. if names matched then faces also matched) and in the other half it was inconsistent. We predicted that matching names would elicit a bias for participants to make more “match” responses to face pairs. Given that stronger effects of context have been observed when perceptual evidence is ambiguous (e.g. Dror et al., 2005; Qi et al., 2018; Wurm & Schubotz, 2017), we predicted that this bias would be stronger when face image quality was reduced for one of the two facial images.

Method

Participants

Ninety-four undergraduate students from UNSW Sydney were recruited for Experiment 1. Sample size was based on Experiment 1 of the McCaffery and Burton (2016) study, the first known reported instance of the document bias, with additional participants to account for data exclusions. The data of one participant were removed because they did not complete the experiment, and two participants were omitted due to scoring below 85% accuracy on the name-matching task (task described in further detail below). A total of 91 participants (gender = 60 female, 30 male, 1 unspecified; M_age = 19.2 years, SD_age = 2.3 years) were included in the final analysis.

Stimuli

One hundred and sixty-eight image pairs were taken from the Expertise in Facial Comparison Test (EFCT), an unfamiliar face-matching test previously used to compare the accuracy of novices and professionals (White et al., 2015). Half of the image pairs show the same face (match pairs), and the remaining identity pairs show two different faces (non-match pairs). To manipulate image quality in our experiments, we downsampled one image in each pair in Adobe Photoshop using a mosaic pixelation with a 16-pixel diameter (i.e. 1/16 fewer pixels per cm). An example of low and high image quality photographs is shown in Fig. 1. Participants completed the experiment on a desktop computer with a monitor resolution of 1920 × 1080 pixels, and participants were seated approximately 60 cm away from the screen (approximately 50° visual angle). Image pairs were presented on-screen in colour at a size of 400 × 600 px (approximately 10.5 × 16 cm).

Common first names were generated for each face to match the gender of the facial image; otherwise, the names were assigned at random. The names were presented in black capitalised Arial font. As shown in Fig. 2a, all names in the name task and facial images in the face task were displayed side-by-side and were equidistant from the centre of the screen (approximately 0.3° visual angle). All stimuli and written instructions were presented on a grey background.

Design and procedure

We used a 2 × (2 × 2) mixed factorial design, with Name Pair Type (same, different) and Image Quality (low, high) as the two within-subjects conditions. The Name Pair Type factor refers to whether name pairs in each trial are presented as matching or non-matching—“same” if the name pairs presented within a trial are matching, and “different” if the name pairs do not match. A factor of Task Order (name-first or face-first) was also included as a between-subjects condition, which modulated the order in which participants completed matching decisions for each identity (i.e. either matching name pairs first before face pairs, or vice versa). We would not expect face-matching decisions to be biased by name information if they were presented before the name-matching task, and so this task order provided a control condition. Participants were randomly allocated to one of the task order conditions—however, due to participant exclusions on the basis of name-matching performance accuracy, there was an unequal distribution of participants across the face-first (n = 45) and name-first conditions (n = 46).

The experiment was programmed using PsychoPy 3.0 (Peirce & MacAskill, 2018). Participants were instructed to assume the role of a surveillance officer checking the identities of employees who were entering and exiting a work building. Within this scenario, participants were required to check each presented identity against “database records” in a security system by indicating whether pairs of names and faces presented on-screen were of the same person or different people. Participants were instructed to make their decisions as accurately as possible.

Examples of experimental trials are shown in Fig. 2a. For each trial, participants were first shown a black fixation cross for 700 ms, followed by either name or face pairs (depending on task order allocation). For each name and face decision in a given trial, participants were instructed that they were to indicate “match” or “non-match” via key press. Two different pairs of keys were used to prevent accidental presses of the same key for separate decisions (“E” and “C” for the first matching decision; “I” and “M” for the second matching decision). The stimuli pair was displayed on-screen until the participants entered a valid keyboard input. Upon completion of the first matching decision, the stimuli pair for the second matching decision would immediately appear on-screen.

Each participant completed 168 trials. Participants received two short breaks after each third of the trials had been completed. There were an equal number of match and non-match face pairs in the experiment and match and non-match face pairs were equally likely to follow match and non-match name pairs and vice versa.

We originally planned for trials to be split equally across image quality conditions (84 high image quality, 84 low image quality). However, a coding error resulted in an unequal distribution across iterations of the experiment. The first 39 participants received 88 trials with low image quality and 80 trials with high image quality. The remaining 53 participants received 80 trials with low image quality and 88 trials with high image quality.

Results

We analysed face-matching performance using signal detection measures of sensitivity and criterion (Stanislaw & Todorov, 1999). Two-way ANOVAs with factors of image quality (low, high) and name pair type (match, non-match) were conducted separately for the name-first and face-first conditions. Analysing results in this way enabled us to separate changes in perceptual discrimination (indexed by the sensitivity measure, d-prime) from changes in response biases (indexed by criterion). Because the main purpose of the name-matching task was to ensure participants processed name information, name-matching performance data were analysed only for the purposes of participant-level exclusions and an overall accuracy calculation. Name data are not analysed further within current and subsequent experiments.

Because we were interested in the biasing effect of name matching on face-matching decisions, our discussion here focuses primarily on face-matching criterion. Across all experiments, we found a consistent and expected main effect of image quality on sensitivity, in that a lower image quality led to significantly reduced sensitivity scores. As there were no other effects of note, we report all details on sensitivity scores in Additional file 1. On average, participants scored 77.7% accuracy on face-matching decisions (SD = 6.9%). The average performance across all participants in the name-matching decision was 98.0% (SD = 2.1%).

Criterion scores for the name-first and face-first conditions are shown in Fig. 2b. Name pair type had a large and significant effect on face-matching response biases in the name-first condition (F_{1, 45} = 33.35, p < 0.005, ηp² = 0.43) which was not present in the face-first condition (F_{1, 44} = 0.38, p = 0.54, ηp² = 0.01). The direction of this effect aligns with our predictions; participants in the name-first condition were more likely to make a “match” response for a face-matching decision when it was preceded by matching name pairs than when it was preceded by a pair of non-matching names. Image quality did not shift response biases in either condition (face-first: F_{1, 44} = 0.51, p = 0.48, ηp² < 0.01; name-first: F_{1, 45} = 0.76, p = 0.39, ηp² = 0.02), and the interaction between name pair type and image quality was non-significant for both conditions (face-first: F_{1, 44} = 3.34, p = 0.07, ηp² = 0.07; name-first: F_{1, 45} = 1.01, p = 0.32, ηp² = 0.02), showing that effects of name decision on the face-matching task were not modulated by image quality.

In the above analysis, we found clear evidence that when name-matching decisions preceded the face-matching task, there was a bias in participant responses. This pattern was observed only for the name-first condition. It came to our attention that, due to our experimental design, participants may have experienced the experiment as one continuous stream of unrelated matching decisions, as opposed to distinct trials with names assigned to facial identities. We hypothesised that if this were the case, all name decisions would bias subsequent face-matching decisions, regardless of which facial identity the name decision pertained to.

To test this possibility, we performed a post hoc analysis using data from the face-first condition where the face-matching decision had been preceded by a name-matching decision in the previous trial. We then conducted a 2 × 2 ANOVA for the face-first condition face-matching trials with factors previous name pair type (same, different) and image quality (high, low). We found a significant main effect of previous name pair type on response biases (F_{1, 44} = 5.83, p = 0.02, ηp² = 0.12) whereby participants completing the face-first condition were more biased towards making a “match” face decision when the previous identity trial presented matching names. In other words, the name decision bias was not bound to singular trials. Results of this post hoc analysis are visualised in Additional file 1: Figure S5.

Discussion

In our first experiment, we found that face-matching decisions were biased by prior name-matching decisions. In other words, participants were more likely to state that face pairs were a “match” when they were previously shown matching name pairs. Interestingly, this tendency did not interact with image quality, as has been observed in previous studies where context has been found to have a significant biasing effect on perceptual decisions.

Another unexpected result was that the response bias was not confined within individual experimental trials; the post hoc analysis revealed that name-matching decisions biased the decision of subsequent face-matching trials, regardless of the facial identity to which the names were assigned. In our second experiment, we aimed to test whether it was possible to restrict the locus of the bias observed in Experiment 1 to a single identity decision. To do this, in Experiment 2 we include a mock ID frame design that binds name and face information for each trial. We also aimed to minimise the possibility that participants were consciously altering their response behaviour because they believed that matching faces were more likely after matching names. Hence, we emphasised instructions that explicitly instructed participants that the name information should not inform face-matching decisions. Finally, we replaced key-press responses with mouse clicks as the decisional input.