Implicit-association test

The implicit-association test (IAT) is an assessment intended to detect subconscious associations between mental representations of objects (concepts) in memory.

The IAT was introduced in the scientific literature in 1998 by Anthony Greenwald, Debbie McGhee, and Jordan Schwartz.

History

Implicit cognition and measurement

In 1995, social psychology researchers Anthony Greenwald and Mahzarin Banaji asserted that the idea of implicit and explicit memory can apply to social constructs as well. If memories that are not accessible to awareness can influence our actions, associations can also influence our attitudes and behavior. Thus, measures that tap into individual differences in associations of concepts should be developed. This would allow researchers to understand attitudes that cannot be measured through explicit self-report methods due to lack of awareness or social-desirability bias. In essence, the purpose of the IAT was to reliably assess individual differences in a manner producing large effect sizes. The first IAT article was published three years later in 1998.

Since its original publication date, the seminal IAT article has been cited over 4,000 times, making it one of the most influential psychological developments over the past couple of decades. Furthermore, several variations in IAT procedure have been introduced to address test limitations, Finally, as is characteristic of any psychological instrumentation, discussion and debate of the IAT's reliability and validity has continued since its introduction, particularly because these factors vary between different variations of the test.

The IAT is thought to measure implicit attitudes: "introspectively unidentified (or inaccurately identified) traces of past experience that mediate favorable or unfavorable feeling, thought, or action toward social objects." In research, the IAT has been used to develop theories to understand implicit cognition (i.e. cognitive processes of which a person has no conscious awareness). These processes may include memory, perception, attitudes, self-esteem, and stereotypes. Because the IAT requires that users make a series of rapid judgments, researchers believe that IAT scores may also reflect attitudes which people are unwilling to reveal publicly.

Procedure

Task 1 (practice):

{| width="250" style="background:black;"

| align="left" style="color:white;"|Black

| align="right" style="color:white;"|White

| align="center" style="color:white;" colspan="2"| <big>Aaliyah</big>

| style="color:silver;" colspan="2" align="center" |

Task 2 (practice):

{| width="250" style="background:black;"

| align="left" style="color:lime;" |Pleasant

| align="right" style="color:lime;" |Unpleasant

| align="center" style="color:lime;" colspan="2"| <big>Suffering</big>

| align="center" style="color:silver;" colspan="2"| Press E to classify as Pleasant or I to classify as Unpleasant

Tasks 3 and 4 (data collection):

{| width="250" style="background:black;"

| align="left" style="color:white;"|Black/

| align="right" style="color:white;"|White/

| align="left" style="color:lime;" |Pleasant

| align="right" style="color:lime;" |Unpleasant

| align="center" style="color:lime;" colspan="2"|<big>Happiness</big>

| align="center" style="color:silver;" colspan="2"| Press E to classify as Black or Pleasant or I to classify as White or Unpleasant

Task 5 (practice):

{| width="250" style="background:black;"

| align="left" style="color:white;"|White

| align="right" style="color:white;"|Black

| align="center" style="color:white;" colspan="2"| <big>Eminem</big>

| align="center" style="color:silver;" colspan="2"| Press E to classify as White or I to classify as Black

Tasks 6 and 7 (data collection):

{| width="250" style="background:black;"

| align="left" style="color:white;"|White/

| align="right" style="color:white;"|Black/

| align="left" style="color:lime;" |Pleasant

| align="right" style="color:lime;" |Unpleasant

| align="center" style="color:white;" colspan="2"|<big>Shanice</big>

| align="center" style="color:silver;" colspan="2"| Press E to classify as White or Pleasant or I to classify as Black or Unpleasant

Example of a typical IAT procedure

</div>

A typical IAT procedure involves a series of seven tasks. In the first task, an individual is asked to categorize stimuli into two categories. For example, a person might be presented with a computer screen on which the word "Black" appears in the top left-hand corner and the word "White" appears in the top right-hand corner. In the middle of the screen a word, such as a first name, that is typically associated with either the categories of "Black" or "White". For each word that appears in the middle of the screen, the person is asked to sort the word into the appropriate category by pressing the appropriate left-hand or right-hand key. On the second task, the person would complete a similar sorting procedure with an attribute of some kind. For example, the word "Pleasant" might now appear in the top left-hand corner of the screen and the word "Unpleasant" in the top right-hand corner. In the middle of the screen would appear a word that is either pleasant or unpleasant. Once again, the person would be asked to sort each word as being either pleasant or unpleasant by pressing the appropriate key. On the third task, individuals are asked to complete a combined task that includes both the categories and attributes from the first two tasks. In this example, the words "Black/Pleasant" might appear in the top left-hand corner while the words "White/Unpleasant" would appear in the top right-hand corner. Individuals would then see a series of stimuli in the center of the screen consisting of either a name or word. They would be asked to press the left-hand key if the name or word belongs to the "Black/Pleasant" category or the right-hand key if it belongs to the "White/Unpleasant" category. The fourth task is a repeat of the third task but with more repetitions of the names, words, or images.

The fifth task is a repeat of the first task with the exception that the position of the two target words would be reversed. For example, "Black" would now appear in the top right-hand corner of the screen and "White" in the top left-hand corner. The sixth task would be a repeat of the third, except that the objects and subjects of study would be in opposite pairings from previous trials. In this case, "Black/Unpleasant" would now appear in the top right-hand corner and "White/Pleasant" would now appear in the top left-hand corner. The seventh task is a repeat of the sixth task but with more repetitions of the names, words, or images. If the categories under study (e.g. Black or White) are associated with the presented attributes (e.g. Pleasant/Unpleasant) to differing degrees, the pairing reflecting the stronger association (or the "compatible" pairing) should be easier for the participant. the Brief-IAT An idiographic approach using the IAT and the SC-IAT for measuring implicit anxiety showed that personalized stimulus selection did not affect the outcome, reliabilities and correlations to outside criteria.

The Go/No-go Association Test (GNAT) is a variation of the IAT that assesses implicit attitudes or beliefs by measuring the relationship between a target concept and two different extremes of an attribute. During the ST-IAT, respondents complete a discrimination block of the evaluative stimuli. The second block consists of sorting target concepts and positive items with one response key and negative items with the other. In the last block, respondents are required to sort target stimuli and negative items together with one key, and positive items with the other key. In comparison to the IAT, which uses contrasts in latency between two concepts and two attributes, the ST-IAT focuses on latency differences in relation to one concept and two attributes. On the other hand, only half of Black individuals prefer Blacks over Whites (cf. the earlier "doll experiment" developed by psychologists Kenneth and Mamie Clark during the early civil rights era). Similarly, the Age IAT generally shows that most individuals have an implicit preference for young over old, regardless of the age of the person taking the IAT. The Weight IAT indicates that medical students have lower implicit biases towards obese individuals compared to the general public, but increased explicit biases, although public explicit and implicit biases remained stable. Research with the Sexuality IAT shows that heterosexual individuals have an implicit preference for heterosexuals, associating them with more positive attributes. In contrast, bisexual individuals indicated a preference for heterosexuals over homosexuals, specifically as a result of attributing homosexuals to negative attributes. Neither of these trends of attributing more positive or negative attributes to a specific sexual identity are seen with homosexual respondents. Some other valence IATs include the Weight IAT, the Sexuality IAT, the Arab-Muslim IAT, and the Skin-tone IAT.

Stereotype

Stereotype IATs measure associations between concepts that often reflect the strength to which a person holds a particular societal stereotype. For example, the Gender-Science IAT reveals that most people associate women more strongly with liberal arts and men more strongly with science.

Self-esteem

The self-esteem IAT measures implicit self-esteem by pairing "self" and "other" words with words of positive and negative valence. Those who find it easier to pair "self" with positive words than negative words are purported to have higher implicit self-esteem. Generally, measures of implicit self-esteem, including the IAT, are not strongly related to one another and are not strongly related to explicit measures of self-esteem.

Brief

The Brief IAT (BIAT) uses a similar procedure to the standard IAT but requires fewer classifications. It involves approximately four to six tasks rather than seven, only uses combined tasks (corresponding most closely to tasks 3, 4, 6, and 7 on the standard IAT), and has fewer repetitions. Additionally, it requires specification of a focal concept in each task as well as a single attribute, instead of two. For example, although, White, Black, Pleasant, and Unpleasant stimuli all appear, participants would press one key when White and Pleasant words appear and another key when "anything else" appears. Subsequently, participants would press one key when Black and Pleasant words appear and another key when "anything else" appears. Unlike the GNAT, the Brief IAT doesn't not use accuracy of correctly identifying the specific concept and attribute requested. Instead, the latency is used to acquire results.

Child

The Child IAT (Ch-IAT) allows for children as young as four years of age to take the IAT. Rather than words and pictures, the Ch-IAT uses sound and pictures. For example, positive and negative valence are indicated with smiling and frowning faces. Positive and negative words to be classified are voiced out loud to children.

Studies using the Ch-IAT have revealed that six-year-old White children, ten-year-old White children, and White adults have comparable implicit attitudes on the Race IAT.

The interpretation that the IAT provides a "window" to unconscious mental contents has been challenged by Hahn and colleagues, whose results indicated that people are highly accurate in predicting their own IAT scores for a variety of social groups.

De Houwer theorizes that the IAT is a measure of a response compatibility effect, in which participants first learn to associate positive and negative words and concepts with pressing specific keys on the keyboard. Later in the test, when participants are instructed to sort words and concepts that are both negative and positive with the same keyboard key, De Houwer argues that much of the latency and incorrect responses that result from this change are due to the increased cognitive complexity of the task, and not necessarily a reflection of implicit bias.

Brendl, Markman, and Messner have proposed a random walk model process to explain responses in the critical portions of the IAT. They theorize that test respondents base their responses on a process of mental evidence-gathering that continues until the evidence for one option or the other (right or left key) reaches a threshold, at which time a decision is made, and action is taken. This requires consideration of both the concept and the attribute, which can be congruent or incongruent - all factors that affect decision speed. All evidence during the compatible block of the test is congruent, allowing for fast decision-making. However, incongruent concept and attribute in the incompatible task leads to longer processing time. Increased task difficulty also increases evidence threshold criterion, further decreasing decision speed.

Balanced-identity theory and design

Heider's balance theory

In 1958, Fritz Heider proposed the balance theory, which stated that a system of liking and disliking relationships is balanced if the product of the valence of all relationships within the system is positive. In the theory, there are concepts and associations. Concepts are persons, groups, or attributes; and among attribute concepts, there are positive and negative valences. Associations are relations between pairs of concepts, and the strength of association is the potential for one concept to activate another, either by external stimuli or by excitation through their associations with other, already active, concepts. The theory followed the assumption of associative social knowledge: an important portion of social knowledge could be represented as a network of variable-strength associations among person concepts (including self and groups) and attributes (including valence).

Balance–congruity principle

When two unlinked or weakly linked nodes are linked to the same third node, the association between these two should strengthen. This is the principle of balance–congruity. The nodes in the principle of balance–congruity are equivalent to the concepts in Heider's balance theory, and the three involved nodes/concepts make up a system. Since every relationship within the system here is positively associated, this, according to a derivation of Heider's theory, also represents a balanced system where the product of the direction of all associations within the system is positive. and in the public sphere (e.g. in the Wall Street Journal). For example, it has been interpreted as assessing familiarity, For instance, feedback may report that someone has a [minimal/slight/moderate/strong] automatic preference for [European Americans/African Americans], though critics contest the degree to which such conclusions can be drawn from an IAT. Proponents of the IAT have responded to these charges but the debate continues. According to an article in The New York Times, "there isn't even that much consistency in the same person's scores if the test is taken again". In addition, researchers have recently claimed that results of the IAT might be biased by the participant's lacking cognitive capability to adjust to switching categories, thus biasing results in favor of the first category pairing (e.g. pairing "Asian" with positive stimuli first, instead of pairing "Asian" with negative stimuli first).

Validity research

Since its introduction into the scientific literature in 1998, a great deal of research has been conducted in order to examine the psychometric properties of the IAT as well as to address other criticisms on validity and reliability. Oswald's team found that implicit measures were only weakly predictive of behaviors and no better than explicit measures. Some research has found that the IAT tends to be a better predictor of behavior in socially sensitive contexts (e.g. discrimination and suicidal behaviour) than traditional "explicit" self-report methods, mental health (e.g. a self-injury IAT differentiated between adolescents who injured themselves and those who did not), medical outcomes (e.g. medical recommendations by physicians), employment outcomes (e.g. interviewing Muslim-Arab versus Swedish job applicants), education outcomes (e.g. gender-science stereotypes predict gender disparities in nations' science and math test scores), and environmentalism (e.g., membership of a pro-environmental organisation).

The Brief Death Implicit Association Test is a short computerized task that assesses how strongly a person automatically identifies the self with death rather than with life, and higher self-death identification has been associated with elevated suicide risk in clinical samples, while it remains a matter of debate what precisely this test measures for any given person.

In applied settings, the IAT has been used in marketing and industrial psychology. For example, in determining the predictors of risk-taking behaviour of pilots in general aviation, attitudes towards risky flight behaviour as measured through an IAT have shown to be a more accurate forecast of risky flight behaviour than traditional explicit attitude or personality scales. The IAT has also been used in clinical psychology research, especially anxiety and addiction research.

Salience asymmetry

Researchers have argued that the IAT may measure salience of concepts rather than associations. Whereas IAT proponents claim that faster response times when pairing concepts indicate stronger associations, critics claim that faster response times indicate that concepts are similar in salience (and slower response times indicate that concepts differ in salience). There is some support for this claim. For example, in an old-young IAT, old faces would be more salient than young faces. As a result, researchers created an old-young IAT that involved pairing young and old faces with neutral words (non-salient attribute) and non-words (salient attribute). Response times were faster when old faces (salient) were paired with non-words (salient) than when old faces (salient) were paired with neutral words (non-salient), supporting the assertion that faster response time can be facilitated by matching salience.

Although proponents of the IAT acknowledge that it may be influenced by salience asymmetry, they argue that this does not preclude interpreting the IAT as a measure of associations.

Culture versus person

Another criticism of the IAT is that it may measure associations that are picked up from cultural knowledge rather than associations actually residing within a person.

To address the possibility that the IAT picks up on cultural knowledge rather than beliefs that are present in a person, some critics of the standard IAT created the personalized IAT. The primary difference between a standard valence IAT and the personalized IAT is that rather than using pleasant and unpleasant words as category labels, it uses "I like" and "I don't like" as category labels. Additionally, the Personalized IAT does not provide error feedback for an incorrect response as in the standard IAT. This form of the IAT is more strongly related to explicit self-report measures of bias.

Proponents of the standard IAT argue that the Personalized IAT increases the likelihood that those taking it will evaluate the concept rather than classify it. This would increase its relationship with explicit measures without necessarily removing the effect of cultural knowledge. In fact, some researchers have examined the relationship between perceptions of general American attitudes and Personalized IAT scores and have concluded that the relationship between the IAT and cultural knowledge is not decreased by personalizing it. However, it is important to note that there was no relationship between cultural knowledge and standard IAT scores either.

Internal validity

Ability to fake result

The IAT has also demonstrated a reasonable amount of resistance to social-desirability bias. Individuals asked to fake their responses on the IAT have demonstrated difficulty in doing so in some studies. For example, participants who were asked to present a positive impression of themselves were able to do so on a self-report measure of anxiety but not an IAT measuring anxiety. Nonetheless, faking is possible, and recent research indicates that the most effective method of faking the IAT is to intentionally slow down responses for pairings that should be relatively easy. Most subjects, however, do not discover this strategy on their own, so faking is relatively rare. An algorithm developed to estimate IAT faking can identify those who are faking with approximately 75% accuracy.

There is a recent study showing that participants can even speed up their responses during the relatively difficult response pairings in an autobiographical implicit association test that aims to test the veracity of autobiographical statement. Specifically, participants who were instructed to speed up their responses to fake the test were able to do so. The effect was larger when participants were trained in speeding up. Most importantly, guilty participants who speed up their responses during the difficult response pairing successfully beat the test to obtain an innocent result. In other words, participants can reverse their test outcome without being detected.

Susceptibility to conscious control

Distinct from faking (the deliberate obscuring of a true association), some studies have shown that heightening awareness about the nature of the test can change the outcome, potentially by activating different fluencies and associations. For example, in one study, a simple reminder from the experimenter ("Please be careful not to stereotype on the next section of the task") was sufficient to significantly reduce the expression of biased associations on a race IAT. Notably, there was not a significant decrease in overall reaction time in this experiment, indicating that this "control" may also be implicit.

Familiarity

A common criticism of the IAT is that it may be difficult to associate positive attributes with less familiar concepts. For example, if a person has had less contact with members of a particular ethnic group, they may have a more difficult time associating members of that ethnic group with positive words simply because of this lack of familiarity. There is some evidence against the familiarity based on studies that have ensured equal familiarity with the African American and White names as well as the faces appearing on the Race IAT.

Order

As the IAT relies on a comparison of response times in different tasks pairing concepts and attributes, researchers and others taking the IAT have speculated that the pairing on the first combined task may affect performance on the next combined task. For example, a participant who begins a gender stereotype IAT by pairing female names with family words may subsequently find the task of pairing female names with career words more difficult. Research has indeed shown a small effect of order. As a result, it is recommended to increase the number of classifications required in the fifth IAT task. Older subjects also tend to have more extreme IAT scores, and this may be related to cognitive fluency, or slower overall response times.

An improved scoring algorithm for the IAT, which reduces the effect of cognitive fluency on the IAT, has been introduced. A summary of the scoring algorithm can be found on Greenwald's webpage.

Experience

Repeated administrations of the IAT tend to decrease the magnitude of the effect for a particular person. This issue is somewhat ameliorated with the improved scoring algorithm. Results from four pre-registered experiments demonstrated that completing a Race IAT resulted in increases in White participants' negative automatic racial evaluations of Black people as measured by two different implicit measures (Single Category IAT and the Affective Misattribution Procedure) but did not generalize to another measure of automatic racial bias (Shooter Bias Task).

Reliability

The IAT demonstrates inconsistent internal consistency and its test-retest reliability stands at 0.60, a relatively weak level. Similarly, the Race IAT scores for an individual may indicate bias, but that bias is diminished on another IAT administered after associating with a mixed-race group. In fact, Race IAT scores can be changed even more easily; administering the IAT in different languages yields significantly different scores for bilingual individuals. For example, studies conducted with Moroccan participants fluent in both French and Arabic showed that participants are biased when completing an IAT in their native language; however, that bias is diminished when completing an IAT in another language. Similar results were found in the United States when administering an English and Spanish IAT on bilingual Hispanic Americans. however, this may be more to do with who completes the task at each time of day than a function of circadian rhythms.

In popular culture

After establishing the IAT in the scientific literature, Greenwald, along with Mahzarin Banaji (Professor of Psychology at Harvard University) and Brian Nosek (Associate Professor of Psychology at the University of Virginia), co-founded Project Implicit, a virtual laboratory and educational outreach organization that facilitates research on implicit cognition.

The IAT has been profiled in major media outlets (e.g. in the Washington Post) and in the popular book Blink, where it was suggested that one could score better on the implicit racism test by visualizing respected black leaders such as Nelson Mandela. The IAT was also discussed in a 2006 episode of The Oprah Winfrey Show.

In the episode "Racist Dawg" on King of the Hill, Hank and Peggy take an IAT, colloquially referred to as the "racist test" to see if they prefer the company of white or black people.

References

External links

Project Implicit – take the test
IAT Review chapter in Implicit measures of attitudes: Procedures and controversies (seven years after IAT creation)
IAT Review chapter in Zeitschrift für Experimentelle Psychologie (three years after IAT creation)
Discussion of IAT critiques in American Psychological Association article
Further Readings on Unconscious Bias from John Tierney New York Times column on Bias of Bias test
Televised Lecture "The Psychology of Blink: Understanding How Our Minds Work Unconsciously - Part 1 of 2" (recorded March 5, 2008)

History

Implicit cognition and measurement

Procedure

Stereotype

Self-esteem

Brief

Child

Balanced-identity theory and design

Heider's balance theory

Balance–congruity principle

Validity research

Salience asymmetry

Culture versus person

Internal validity

Ability to fake result

Susceptibility to conscious control

Familiarity

Order

Experience

Reliability

In popular culture

See also

References

External links