Abstract
Can people accurately perceive who is authentic? Laypeople believe they can tell who is authentic, and they report that authenticity is an important attribute in others (Studies 1a and 1b; N = 369). However, when we directly tested the accuracy of perceived authenticity, we found no significant correlation between self- and other-rated authenticity in two cohorts of adult students in randomly assigned teams (Studies 2 and 3; 4,040 self-other observations). In addition, we found that perceived authenticity was biased in two ways: (a) Other-rated authenticity showed a positivity bias compared with self-ratings, and (b) other-rated authenticity was biased by the rater’s own authenticity. In Study 3, we also investigated authenticity meta-perceptions; although people expect their authenticity to be accurately perceived by others, their meta-perceptions did not correlate with other-rated authenticity. That is, beliefs about the visibility of one’s authenticity are similarly not accurate. Overall, we found no evidence that people can accurately identify who is authentic.
In the 2016 presidential election, Donald Trump was dubbed the “authenticity candidate” despite a prolific record of deception (Hahl et al., 2018). On the other hand, his opponent Hillary Clinton was accused of coming across as cold, calculated, and inauthentic—even being described as “at pains to convince Americans that she is a real person” (Byers, 2015). The perception that one is authentic is nontrivial—individuals who are perceived to be authentic are more liked (Kernis & Goldman, 2005; Liu & Perrewe, 2006) and more trusted (Wang & Hsieh, 2013). Indeed, authenticity is considered to be the “gold standard” for leaders (Ibarra, 2015, p. 54). If authenticity is used as a metric for evaluating other people, it is important to know whether these judgments are accurate.
Being accurate at recognizing authenticity is no small feat because it requires knowing someone’s inner self, observing their behavior, and comparing the two. Functional perspectives on perceived authenticity assume that individuals can discern “real,” “true,” or “genuine” people in the same way they can judge an authentic product (Newman, 2019). In past research, it has been implicitly assumed that judgments of authenticity are related to the target’s authenticity, as reflected in research on the antecedents of authenticity. For instance, perceived authenticity is associated with the target’s morality (Bai et al., 2019), their perceived self-knowledge, and self-consistency (Peus et al., 2012). These antecedents parallel variables linked to individual expression of authenticity (Gino et al., 2015; Kraus et al., 2011). However, the accuracy of authenticity judgments remains untested. 1
Our core proposition in the present research is that perceived authenticity (a) is not accurate (i.e., other-ratings of authenticity are not related to self-ratings of authenticity) and (b) is biased by the rater making the authenticity judgment. To develop our hypotheses regarding perceived authenticity, we integrate the authenticity literature with work on person perception and deception detection. From this, we identify several hurdles in the pursuit of accurate authenticity perception and suggest biases that may occur in perceived authenticity.
Theoretically, a person is authentic when their behavior is genuine, that is, their behavior reflects their true inner qualities and feelings (Beer & Brandler, 2021; Kernis & Goldman, 2006; Ryan & Ryan, 2019). As a multidimensional construct, authenticity can be thought of as an individual difference, with some individuals being generally more authentic than others (“trait authenticity”; e.g., Wood et al., 2008). Authenticity can also be described as an affective experience when one feels connected to their core, true self (“state authenticity”; e.g., Lenton et al., 2013). Given that authenticity is tied to a core or true self unique to each person, authenticity is expressed differently between individuals. This constitutes a challenge to accurate authenticity perception. Past research has found that people are accurate at recognizing certain individual differences, such as personality traits (Ambady & Rosenthal, 1992). However, judging someone’s authenticity is not like judging how extraverted someone is; for extraversion, what is high (vs. low) extraversion is relatively standard between individuals. This is not true when estimating someone’s authenticity, as one person’s inner self is not the same as another person’s inner self. Thus, the self at the center of authenticity suggests that perceived authenticity is a uniquely challenging type of person perception.
Second, laypeople are “intuitive psychologists” who tend to assume that the behavior of other individuals is driven by dispositional characteristics rather than by a complex interaction with the situation (Ross, 1977). Heider (1958) provides the classic example of seeing someone eating. The intuitive psychologist would say that “she eats because she is hungry.” That is, the observed behavior of eating is linked to a simple internal motivation in the eyes of other people. When perceived authenticity is considered from this view, it suggests that people tend to assume that the behavior of others emerges from within (i.e., is authentic).
Third, decades of research on deception detection have found that individuals are barely able to accurately detect lies above chance (for a review, see Bond & DePaulo, 2006). Whereas inauthenticity and lying are related, lying is an extreme version of inauthenticity. Indeed, there are many ways that individuals can be more or less authentic that do not involve outright lies, such as sharing selective positive features on dating websites or omitting negative aspects in the recall of an experience. If individuals have low accuracy at detecting lies or truths, this suggests that accuracy may be even lower when perceiving authenticity.
Statement of Relevance
Popular press pieces tout authenticity as a key metric for selecting friends, judging potential mates, and electing leaders, implicitly assuming that there is a kernel of truth embedded in these judgments. We suggest that this type of person perception—requiring both knowing someone’s inner self and observing their behavior—is a prohibitively difficult social judgment to make. The current studies examined both the accuracy (whether there is congruence between self- and other-rated authenticity) and biases that distort authenticity judgments. As predicted, authenticity was difficult to accurately perceive and biased by the person making the judgment.
Given that the standards against which to judge authenticity are relatively obscure, the process is prone to bias. Past research has robustly demonstrated predictable asymmetries between self- and other-perception, which occur because people are biased in how they see themselves and how they see others (Ross, 1977). Our theoretical perspective suggests that a “perceived-authenticity bias” will be evidenced in at least two ways.
First, we suggest that other-rated authenticity will be more positive compared with self-rated authenticity. We derived this hypothesis given that deception research has found that people reliably exhibit a bias toward truth (DePaulo et al., 1997; Zuckerman et al., 1981). When people are not informed about the base rates of lies to truths, they overestimate the number of truths, potentially as a means of maintaining cohesion and reducing social costs (ten Brinke et al., 2016). In addition, attributing behavior to simple internal characteristics suggests that observed behavior will be assumed to be authentic. This is in contrast to the self-knowledge that an individual has of their own motivations, ranging from impression management and social desirability to authenticity. Although we do not suggest that individuals consider themselves inauthentic, pondering one’s own authenticity brings to mind the small ways in which self-presentation is often edited in the social sphere.
Second, we propose that a perceived-authenticity bias will present as egocentric judgments, whereby raters will project their authenticity onto other people. Egocentrism suggests that individuals view and judge others in line with how they view themselves (Nickerson, 1999; Ross, 1977). Indeed, when individuals attempt to understand the inner states of others, they begin at their own point of view and adjust only incrementally (Epley et al., 2004; Tamir & Mitchell, 2013). Therefore, we hypothesized that a rater’s authenticity will be projected onto their authenticity ratings of other people.
Research Overview
We first sought to confirm lay beliefs about the ability to detect authenticity and the importance of authenticity in interpersonal relationships in Studies 1a and 1b. In Studies 2 and 3, we tested the accuracy and biases in authenticity ratings using a unique interpersonal setting: randomly assigned teams of students with little to no prior interactions with one another. The random assignment to these groups mitigated issues in selection that typically occur in both personal and professional relationships. We followed these groups over 6 weeks, surveying them at multiple time points. Participants rated their own authenticity and the authenticity of every other person in their group. This allowed us to directly test the congruence between self- and other-ratings of authenticity as well as test for biases in authenticity judgments. In Study 3, we also examined authenticity meta-perceptions; that is, what do targets assume about the visibility of their authenticity to other individuals? This allowed us to confirm whether authentic people expect to be perceived as authentic by others. Second, we examined whether targets who expect to be perceived as authentic actually are perceived as such. All studies were approved by the Columbia University Institutional Review Board.
Studies 1a and 1b
Our goal in Studies 1a and 1b was to test assumptions that laypeople have about whether they can tell when other individuals are being authentic. To do this, we surveyed two samples: an online sample and a sample of master of business administration (MBA) students. Participants were asked whether they felt they could tell when other people were being authentic. In addition, we asked participants the importance they placed on authenticity in close others.
Method
Studies 1a and 1b were identical surveys given to two samples and are therefore summarized together.
Participants
For Study 1a, we recruited 150 participants from Amazon’s Mechanical Turk (MTurk). After applying exclusions based on an attention check (“Please select disagree”), we retained 140 participants for our final sample (54 women, 86 men; age: M = 39.53 years, SD = 11.84). Sample size was determined by financial limitations.
For Study 1b, we surveyed 229 participants from an incoming class of MBA students (92 women, 137 men; age: M = 28.52 years, SD = 2.65) for course credit. Sample size was determined on the basis of course enrollment.
Measures
Perceptions of authenticity
To assess the participants’ self-reported ability to perceive the authenticity of other individuals, we created a face-valid five-item measure. Participants responded to the following prompts on a 7-point scale ranging from 1 (strongly disagree) to 7 (strongly agree): “I can tell when people are being real,” “I know when others are being authentic,” “It’s hard for me to tell when people are being themselves” (reverse-coded), “I know when people are being genuine,” and “I can tell when people are being fake” (Study 1a: M = 5.16, SD = 0.76, α = .73; Study 1b: M = 5.44, SD = 0.89, α = .89).
Importance of authenticity
We asked participants how important authenticity was to them as an attribute of other people. Participants responded to the following items on a slider from 0 (extremely unimportant) to 100 (extremely important): “In general, how important is authenticity to you?” “How important is it to have leaders who are authentic?” “How important is it to have friends that are authentic?” and “How important is it to you that other people be authentic?” (Study 1a: M = 78.24, SD = 16.14, α = .90; Study 1b: M = 85.60, SD = 12.24, α = .73).
Results
The means and standard deviations for all variables, as well as correlations between all variables, are available in Tables S1 and S2 in the Supplemental Material available online.
First, we considered whether participants felt that they could tell when other people were being authentic. We conducted a one-sample t test comparing the mean of the response distribution with the midpoint of the scale (4). This test confirmed that the average response fell above the midpoint; that is, the majority of participants believed that they could—at least somewhat—tell when other people are being authentic in the sample of MTurk participants (difference = 1.16), t(139) = 18.03, p < .001, Cohen’s d = 1.52, and students (difference = 1.44), t(228) = 24.36, p < .001, Cohen’s d = 1.61 (see Fig. 1).

Distribution of participant ratings regarding the self-reported ability to perceive authenticity (left column) and the importance of perceived authenticity in other individuals (right column) in Studies 1a and 1b. Solid black lines indicate the midpoints of the scales. Dashed lines indicate average responses.
We expected to find that, on average, participants would report that authenticity was an important attribute to them. To evaluate this hypothesis, we conducted a one-sample t test comparing the mean of the response distribution with the midpoint of the scale (50). This test confirmed that the average response of participants fell above the midpoint, meaning that both MTurk participants (difference = 28.24), t(139) = 20.70, p < .001, Cohen’s d = 1.75, and MBA students (difference = 35.60), t(228) = 44.02, p < .001, Cohen’s d = 2.91, agreed that authenticity was an important attribute in other individuals. Taken together, these results support the assumption that people feel that they can tell when others are being authentic and consider authenticity an important attribute in others.
Study 2
Our goal in Study 2 was to compare self- and other-rated authenticity in randomly assigned working groups. We hypothesized that the relationship between self- and other-rated authenticity would be nonsignificant. In addition, we tested for perceived-authenticity biases. First, we compared other-rated authenticity with self-rated authenticity, hypothesizing that other-rated authenticity would be higher than self-rated authenticity. Second, we hypothesized that the rater’s own authenticity would predict their authenticity ratings of other people.
Method
Participants
Participants were 207 incoming MBA students (84 women, 123 men; age: M = 28.01 years, SD = 2.85). Two-hundred six participants rated at least one other participant for a total of 1,033 observations. The sample size was determined on the basis of class enrollment and completion of the course surveys.
Participants completed several surveys over 6 weeks as part of their introductory course. Figure 2 displays the timeline of the surveys. We administered Survey 1 when students entered the MBA program. Next, participants began the two-stage process of round-robin ratings for their team (self- and other-ratings). First, participants were instructed to complete their own trait self-evaluations (self-rated authenticity and personality) in Survey 2. Four weeks later, they were asked to evaluate their group members on their traits (other-rated authenticity and personality) in Survey 3. They had 2 weeks to complete the evaluations of their group members (1 week before and following Survey 4). The final survey (Survey 4) was taken at the end of the 6 weeks. All surveys were completed for course credit.

Timeline of Study 2. For Surveys 1, 2, and 4, the arrows mark the due date; however, late responses were accepted. Survey 3 was a two-step process, in which students first nominated evaluators and then received the nominees’ evaluations. Survey 3 remained open for other-ratings for 3 weeks following this point.
When students entered the program, they were each randomly assigned to small workgroups (4–6 students) with whom they took all first-semester courses as well as completed a number of graded assignments. These groups are an important part of the program, and groups spend a significant amount of time together. As part of the curriculum, groups meet to develop a charter of norms, rules, and responsibilities designed to foster successful working relationships. 2 This setting is relatively consistent in a number of important ways that strengthen the test of accuracy. First, in our sample, the majority of interactions occurred while group members were together, either while taking courses or engaging in coursework. Thus, participants were largely engaging with one another at the same time. Second, interactions were largely in the same or a similar setting (the classroom or school facilities). Therefore, participants were seeing one another in a controlled yet ecologically valid setting.
Self-rated measures
Our primary comparison was between self-rated trait authenticity and other-rated trait authenticity. However, it is possible that raters rely on alternative expressions of authenticity (e.g., feelings of authenticity) when estimating the trait authenticity of a target. To test for this, we also asked targets to report how authentic they felt, whether they felt they could act authentically with their teammates, and whether they felt that their teammates know who they really are. We then tested the congruence of these additional forms of self-rated authenticity with other-rated trait authenticity. We first captured a measure of whether the target had an “authentic personality” (e.g., “I would describe myself as an authentic person”). Second, we measured state, or experiential, authenticity at two time points: Survey 1 and Survey 4. In Survey 4, participants self-reported whether they felt they could act authentically with their teammates. Finally, participants self-reported whether they felt their teammates knew who they really are. We describe each of these authenticity measures in more detail below.
Trait authenticity (Survey 2)
Participants rated their authenticity using a combination of items modified from the Authentic Personality Scale (Wood et al., 2008) and items created for this study. The items were “I am true to myself in most situations,” “I would describe myself as an authentic person,” “I am more sincere in my interactions than strategic,” and “I could be more authentic” (the final item was reverse-coded; α = .66). Exploratory factor analysis confirmed that these four items loaded onto a single factor (for more information, see the Supplemental Material).
We primarily examined the congruence of self- and other-ratings along this trait-authenticity measure. Authenticity is a multifaceted construct, considered from both a trait perspective (e.g., Wood et al., 2008) and a state perspective (Lenton et al., 2013, 2016). Between these two perspectives, authenticity can either be relatively stable (trait perspective) or vary significantly across time and context (state perspective). Because trait-based views of authenticity assume greater stability, other-ratings of authenticity were collected along the trait dimension. We then compared the congruence of self-rated trait authenticity and other-rated trait authenticity. We return to this point in the General Discussion.
State authenticity (Surveys 1 and 4)
Participants rated their feelings of authenticity at two time points: at the beginning of the program (Survey 1) and after 6 weeks (Survey 4). They responded to the following two face-valid items, which we reverse-coded, as a measure of authenticity: “I feel like I’m pretending to be something that I am not” and “I feel fake” (Survey 1: r = .76, p < .001; Survey 4: r = .78, p < .001).
Acting authentically (Survey 4)
To estimate whether participants felt they could act authentically, we asked them to respond to the prompt, “So far in the MBA program . . .” with the following three, face-valid items: “there have been times where I felt like I couldn’t be myself with my classmates” (reverse-coded), “I could be authentic with my classmates,” “I have felt like I was who I wanted to be with my classmates” (α = .65).
Feeling known (Survey 4)
Participants responded to a two-item measure based on prior research regarding the perception that their classmates know them: “My team members know who I really am” and “My friends at [university] know who I really am” (r = .71, p < .001; Purvanova, 2013).
Personality (Survey 2)
Participants rated their Big Five personality traits using the Ten-Item Personality Inventory (Ehrhart et al., 2009). The Big Five model of personality is predictive of a number of important life outcomes (Goldberg, 1992). We used participant personality in three ways. First, prior research has shown that other-ratings of personality traits are relatively accurate (Ambady & Rosenthal, 1992). Therefore, we can compare self- and other-rated personalities to create a benchmark for the accuracy of authenticity. Second, personality traits were used as a control variable to test the robustness of our models. Third, in exploratory analyses, we examined whether certain personality traits were associated with authenticity ratings. Participants responded to the prompt, “I see myself as . . .” followed by the 10 short descriptors, two per personality trait. For example, agreeableness was measured as the average of “critical, quarrelsome” (reverse-coded) and “sympathetic, warm.”
Demographic information
Participants reported their gender, age, and race, which we used as control variables. In exploratory analyses, we considered whether authenticity ratings were accurate among rater-target pairs of the same gender or same race. We found null or conflicting results of same-gender and same-race pairs on authenticity accuracy, which we describe in more detail in the Supplemental Material (see the Authenticity Perceptions in Homophilous Subsets of Study 2 and 3 section).
Other-rated measures
Two-hundred six participants rated at least one other participant, for a total of 1,033 observations. Round-robin ratings of group members were required for course credit; that is, each participant who submitted a self-rating was required to complete other-ratings for each member of their team for the course. Participants entered these team members into the survey software, which automatically generated the other-rated surveys. If participants did not rate each of their team members, they were reminded by the course administrators via email.
In addition, participants were allowed to request feedback from additional classmates from their program. These ratings were voluntary and would typically occur if the classmates were working together on other projects or were friends (72 observations). Unless noted otherwise, we used other-ratings from the full sample and controlled for familiarity (described below) between the rater and target.
Trait authenticity (Survey 3)
Participants were rated by their team members on the perception that they are authentic along the same four items in the trait-based self-ratings. Raters responded to four items: “[Target name] is true to themselves in most situations,” “I would describe [target name] as an authentic person,” “[Target name] is more sincere in their interactions than strategic,” and “[Target name] could be more authentic” (the final item was reverse-coded; α = .80).
Rater authenticity
Given that this was a round-robin rating scenario, we were able to match rater authenticity (trait, state, acting authentic, and feeling known) from their self-rated surveys.
Authenticity-inaccuracy score
For some hypotheses, we were interested in what predicts the gap between self- and other-ratings. Therefore, we calculated an authenticity-inaccuracy score as the signed difference between self- and other-rated authenticity. This signed difference was estimated by subtracting self-rated trait authenticity from other-rated trait authenticity; a positive authenticity-inaccuracy score indicated that the other-rating was higher compared with self-rated trait authenticity. Zero indicated accurate alignment between self- and other-ratings. A negative authenticity-inaccuracy score indicated that other-rated trait authenticity was lower than self-rated trait authenticity.
Personality (Survey 2)
To compare self- and other-rated personalities, we asked raters to complete the Ten-Item Personality Inventory regarding the target. The prompt read, “I see [target name] as . . .” followed by the same 10 descriptions that the target completed. For example, extraversion was measured as the average of “extraverted, enthusiastic” and “reserved, quiet” (the final item was reverse-coded).
Familiarity
Finally, the raters completed a single, face-valid item that assessed how familiar they were with the target. The item read, “How well do you know [target name]?” and participants responded using a scale ranging from 1 (not well at all) to 4 (very well).
Results
The means and standard deviations for all variables, as well as correlations between all variables, are presented in Table 1. All continuous variables were standardized before analyses were conducted.
Descriptive Statistics and Correlations for All Variables in Study 2
Note: Data were averaged at the target level. All continuous measures were captured on a 7-point Likert-type scale with the exception of familiarity, which was measured on a 4-point scale. Two variables were other-rated: other-rated trait authenticity (1) and familiarity (12).
p < .05. **p < .01.
The accuracy of authenticity judgments
Trait authenticity
We first considered whether authenticity ratings were accurate. Given the nested structure of the data, with multiple raters rating each target, we tested our hypotheses using multilevel models nesting the target-rated responses at the rater level. This approach captured variance between the raters (results of all models were replicated when we used ordinary least squares (OLS) regressions; see the Supplemental Material). We first compared self-rated trait authenticity with other-reported trait authenticity on the same four items. We found that self-rated trait authenticity was not predictive of other-rated trait authenticity (b = 0.04, SE = 0.03, 95% CI = [−0.01, 0.09], p = .132; see Fig. 3).

Scatterplots showing the relationship in Study 2 between self- and other-ratings of trait authenticity (top left), state authenticity (top right), ability to act authentically (bottom left), and sense that the rater is known by other people (bottom right). In each graph, the line indicates the best-fitting regression, and the error band represents ±1 SE.
State authenticity
Similarly, other-rated trait authenticity was not related to self-rated state authenticity in either the first or final surveys (Survey 1: b = 0.01, SE = 0.03, 95% CI = [−0.04, 0.06], p = .608; Survey 4: b = 0.02, SE = 0.03, 95% CI = [–0.03, 0.08], p = .348; see Fig. 3).
Acting authentically and feeling known
Other-rated trait authenticity was also not related to the target’s perception that they can act authentically with their teammates (b = 0.00, SE = 0.03, 95% CI = [−0.05, 0.05], p = .980). Finally, other-rated trait authenticity was not predicted by participants’ ratings that their teammates knew them (b = −0.04, SE = 0.03, 95% CI = [−0.09, 0.01], p = .142). Taken together, this pattern of results suggests that other-rated authenticity was not correlated with self-rated authenticity (see Fig. 3).
Bayesian analyses for the null relationship between self- and other-rated trait authenticity
For robustness, we estimated Bayesian regressions for the relationship between self- and other-rated authenticity, with noninformative priors, to obtain credible intervals of the regression coefficients (Wagenmakers et al., 2016). We found that the 95% credible interval for the coefficient of other-rated trait authenticity on self-rated trait authenticity ([–0.05, 0.07]) included zero.
In addition, we tested the null relationship using the region of practical equivalence (ROPE) approach (Kruschke & Liddell, 2018). This method examines whether the 90% highest density interval (HDI) lies outside of the ROPE (i.e., a negligible magnitude). We found a complete overlap (100.00%) between the HDI and the ROPE for the relationship between self- and other-rated trait authenticity. This suggests that the null hypothesis was not rejected; there was not a significant relationship between self- and other-rated authenticity.
What accounts for the variance in ratings? Social-relations modeling (SRM)
To examine which aspect of the relationship explained the variance in ratings of authenticity, we conducted SRM (Kenny & La Voie, 1984), which decomposes self- and other-ratings into variance explained by (a) the one making the judgments, (b) the target characteristics causing the judgments, or (c) variance unique to that self-other pair. In addition, SRM provides reliability effects of the other-ratings compared with the self-ratings. Finally, SRM estimates the partial correlation of self- and other-ratings controlling for group membership (i.e., what is the correlation between being authentic and being perceived as authentic?) and the correlation of the rater’s ratings on the other-ratings they give (i.e., what is the correlation between authentic raters and their perception that other people are also authentic?).
To conduct the SRM, we limited the data set in three ways, following the recommendation of Schönbrodt and colleagues (2012). Our outcome variable of interest was the authenticity rating at the trait level, which participants rated for both themselves and others on the same items. First, we limited the data set to only group members. Next, we limited our data set to groups of three or more. Finally, we limited the data set to groups for which the round-robin ratings were completely captured; that is every person in the group submitted a self-rating and rated every other person in the group (36 groups, average group size = 5.69).
The results of the SRM suggest that the variance on behalf of the target, that is, differences between targets, explained about 6% of the variance (SE = 0.02, p = .003). Differences between raters, on the other hand, explained 41% of the variance in authenticity ratings (SE = 0.06, p < .001), and differences in self-other relationships explained 45% of the variance in authenticity ratings (SE = 0.03, p < .001). This suggests that the majority of the variance in authenticity ratings had to do with differences between the ones making the ratings, and across unique relationships, rather than differences in the target.
In further support of the earlier results, the partial correlation of self-rated trait authenticity and other-rated authenticity, controlling for group membership, was not significant, r = .12, t(167) = 1.53, p = .129. That is, self-rated trait authenticity was not correlated with other-rated trait authenticity. However, there was a significant partial correlation between the rater’s authenticity and their perception that the target is authentic, controlling for group membership, r = .34, t(167) = 4.69, p < .001. We return to this in more detail in the Perceived-Authenticity Biases section.
Finally, SRM provides estimates regarding the reliability of the target and perceiver effects. This reliability provides some sense of the consensus regarding authenticity ratings; that is, does the group generally see Person A as more authentic than Person B? The reliability of the rater effect was .80; in comparison, the reliability of the partner effect was .37. This suggests that there is little consensus between raters about who in the group is authentic and who is not.
Is a lack of accuracy unique to authenticity?
An alternative explanation for our results is that raters in this sample are not accurate judges of their targets. To rule this out, we compared self-rated personality traits with other-rated personality traits. We again conducted a series of multilevel models, using self-rated personality traits to predict other-rated personality traits (e.g., does a target’s rating of their own extraversion predict other-ratings of their extraversion?) clustered at the rater level. A positive relationship indicated that self-rated personality positively predicted other-rated personality, suggesting that raters were accurate in their personality ratings. We found a significant and positive relationship between self- and other-rated personality on four out of five personality traits.
Specifically, self-rated extraversion positively and significantly predicted other-rated extraversion (b = 0.46, SE = 0.03, 95% CI = [0.41, 0.51], p < .001), self-rated conscientiousness positively and significantly predicted other-rated conscientiousness (b = 0.14, SE = 0.03, 95% CI = [0.08, 0.19], p < .001), self-rated agreeableness positively and significantly predicted other-rated agreeableness (b = 0.18, SE = 0.03, 95% CI = [0.13, 0.23], p < .001), and self-rated neuroticism positively and significantly predicted other-rated neuroticism (b = 0.18, SE = 0.03, 95% CI = [0.13, 0.24], p < .001). Self-rated openness to experience did not predict other-rated openness to experience (b = 0.03, SE = 0.03, 95% CI = [−0.02, 0.08], p = .290; see Table S3 in the Supplemental Material).
Perceived-authenticity biases
Recall that we had hypothesized that authenticity judgments would be biased in two ways. First, we proposed that other-rated trait authenticity would be higher than self-rated trait authenticity. We tested this by comparing self-rated trait authenticity with the average other-rating of trait authenticity that a participant gave. We found that other-rated trait authenticity (M = 5.84) was significantly higher than self-rated trait authenticity (M = 5.27; mean difference = 0.57, 95% CI = [0.43, 0.71]), t(315) = 7.98, p < .001, Cohen’s d = 0.79. That is, participants were more likely to rate other individuals as more authentic than they rated themselves.
Raters’ trait authenticity
We then considered whether raters are biased by their own trait authenticity. To test this, we used the rater’s trait authenticity, nested at the target level, to predict the authenticity rating that they made of targets. We found a positive and significant relationship between raters’ trait authenticity and the ratings of authenticity they gave to other individuals (b = 0.23, SE = 0.03, 95% CI = [0.17, 0.29], p < .001). This effect was robust when analyses controlled for the rater’s personality, demographic characteristics, and familiarity with the target (b = 0.20, SE = 0.03, 95% CI = [0.13, 0.26], p < .001).
Raters’ state authenticity
Recall that self-ratings were submitted weeks before the other-ratings. However, self- and other-ratings were the same and embedded within mirrored surveys. Thus, an alternative explanation for our findings is that raters were biased as a result of the similarity in items, not their own authenticity. To account for this, we examined the relationship between the other authenticity variables (state, acting authentic, feeling known) and authenticity ratings.
First, we found a significant and positive effect of state authenticity at both time points on authenticity judgments of other individuals (Survey 1: b = 0.12, SE = 0.03, 95% CI = [0.06, 0.18], p < .001; Survey 4: b = 0.11, SE = 0.03, 95% CI = [0.04, 0.17], p < .001).
Raters’ acting authentically and feeling known
Similarly, raters’ perceptions that they can act authentically were positively predictive of their authenticity judgments of other people (b = 0.10, SE = 0.03, 95% CI = [0.04, 0.17], p = .002) as was their feeling known by their group members (b = 0.19, SE = 0.03, 95% CI = [0.12, 0.25], p < .001; see Fig. 4).

Estimates for the five measures of raters’ authenticity (trait, state, acting authentic, feeling known) as predictors of their authenticity judgments of other individuals, nested at the target level. The dotted black line indicates zero.
We then used a rater’s authenticity as a predictor of the signed-difference authenticity-inaccuracy score. The rater’s trait authenticity predicted a positive authenticity-inaccuracy score (b = 0.24, SE = 0.03, 95% CI = [0.18, 0.30], p < .001). That is, authentic raters perceived other people to be more authentic relative to the target’s rating of their own authenticity.
Who is perceived as authentic?
We then considered who is more likely to be perceived as authentic by raters. In exploratory analyses, we entered the demographic and personality variables of the targets, clustered at the rater level, as predictors of other-rated trait authenticity. We found that no individual differences predicted other-rated trait authenticity (all ps > .11).
Study 3
Study 3 replicated Study 2 in a larger set of students 6 months following the first data collection. This set of students was observed during the COVID-19 pandemic in the United States, and coursework was conducted in a mix of in-person and online environments. Thus, this sample allowed us to test whether the effects we found in face-to-face interactions generalized to the online context. In addition to replicating the results of Study 3, we also measured authenticity meta-perceptions, considering whether participants have a sense of whether their authenticity is observable by other people.
Method
Participants
Participants were 571 MBA students (233 women, 336 men, two did not report demographic information; age: M = 27.94 years, SD = 2.51). 3 Participants in the study completed a number of surveys over 6 weeks for course credit. They followed the same timeline as in Study 2 (see Fig. 2). Completion of all surveys was done for course credit. Five-hundred seventy-six participants completed at least one survey rating a classmate, for a total of 3,007 observations (this includes five students not enrolled in the course).
Measures
Self-rated trait authenticity (Survey 2)
Participants rated their authenticity using the same four items from Study 2 (α = .61).
Self-rated state authenticity (Survey 1)
In addition, participants responded to their state-level authenticity using the two items from the first sample (“I feel like I’m pretending to be something that I am not” and “I feel fake”; both items were reverse-coded). For additional face validity for this measure, we added a single item that read, “I feel authentic.” These three items were averaged as a measure of state authenticity (α = .77).
Self-rated personality (Survey 2)
Participants rated their Big Five personality traits using the Ten-Item Personality Inventory (Ehrhart et al., 2009).
Self-rated authenticity meta-perceptions
We asked participants about their perceptions that other people see them as authentic. Participants responded to the following three items: “Other people see me as authentic,” “Other people see me as sincere,” and “Other people perceive me as being real” (α = .89). This measure allowed us to test the implicit assumption of targets—that more authentic targets expect to be seen as more authentic by other individuals, and vice versa. In addition, we were able to test whether targets who expected to be seen as authentic were actually perceived as more authentic.
Demographic information
Participants reported their gender, age, and race. As mentioned in Study 2, we found null or conflicting results of same-sex and same-race pairs on authenticity accuracy, which we describe in more detail in the Supplemental Material (see the Authenticity Perceptions in Homophilous Subsets of Study 2 and 3 section).
Other-rated trait authenticity (Survey 3)
Each rater was rated by their classmates on the perception that they are authentic using the same four items as self-reported trait authenticity. They responded to the following: “[Target name] is true to themselves in most situations,” “I would describe [target name] as an authentic person,” “[Target name] is more sincere in their interactions than strategic,” and “[Target name] could be more authentic” (the final item was reverse-coded; α = .76).
Other-rated raters’ authenticity
We matched other-rated authenticity (trait and state) from their self-rated surveys using their full name.
Other-rated authenticity-inaccuracy score
We calculated an authenticity-inaccuracy score as the signed difference between self- and other-rated trait authenticity. A positive authenticity-inaccuracy score indicated that other-rated trait authenticity was higher than self-rated trait authenticity. Zero indicated accurate alignment between self- and other-ratings. Finally, a negative authenticity-inaccuracy score meant that other-rated trait authenticity was lower than self-rated trait authenticity.
Other-rated personality
As in Study 2, the raters also completed the Big Five personality traits of the targets using the same Ten-Item Personality Inventory (Ehrhart et al., 2009).
Familiarity
Finally, the raters completed a single, face-valid item that assessed how familiar they were with the target. The item read, “How well do you know [target name]?” and participants responded using a scale ranging from 1 (not well at all) to 4 (very well).
Results
The means and standard deviations for all variables, as well as correlations between all variables, are presented in Table 2. All continuous variables were standardized before analyses were conducted. All results from Study 2 were replicated or strengthened in Study 3. For the sake of brevity, below we summarize the results around the core hypotheses of Study 2. Full results are available in the Supplemental Material.
Descriptive Statistics and Correlations for All Variables in Study 3
Note: Data were averaged at the target level. All continuous measures were captured on a 7-point Likert-type scale with the exception of familiarity, which was measured on a 4-point scale. Two variables were other-rated: other-rated trait authenticity (1) and familiarity (10).
p < .05. **p < .01.
The accuracy of authenticity judgments
Trait authenticity
We first compared self- and other-rated trait authenticity. Given the nested structure of the data, we used self-rated authenticity, nested at the rater-level, to predict other-rated trait authenticity. As in Study 2 we found that self-rated trait authenticity was not predictive of other-rated trait authenticity (b = 0.01, SE = 0.02, 95% CI = [−0.02, 0.04], p = .585).
State authenticity
Similarly, self-rated state authenticity did not predict other-rated trait authenticity (b = 0.02, SE = 0.02, 95% CI = [−0.01, 0.05], p = .256).
Bayesian analyses for the null relationship between self- and other-rated trait authenticity
As in Study 2, we confirmed the null effect using Bayesian analyses. We found that the 95% credible interval for the coefficient of other-rated trait authenticity on self-rated trait authenticity ([–.03, .04]) included zero. In addition, we tested the null relationship using the ROPE approach. We found a complete overlap (100.00%) between the HDI and the ROPE for the relationship between self- and other-rated trait authenticity.
What accounts for the variance in ratings? SRM
Next, we conducted SRM to examine what accounts for the variance in ratings. We limited the data set in three ways: First, we limited the data set to only group members. Next, we limited our data set to groups of three or more. Finally, we limited the data set to groups for which the round-robin ratings were completely captured; that is, every person in the group submitted a self-rating of trait authenticity and rated every other person in the group (90 groups, average group size = 5.89).
The results of the SRM suggest that the variance on behalf of the target, that is, differences between targets, explained about 4% of the variance (SE = 0.01, p < .001). Differences in raters, on the other hand, explained 34% of the variance in trait-authenticity ratings (SE = 0.03, p < .001), and differences in self-other relationships explained 39% of the variance in trait-authenticity ratings (SE = 0.02, p < .001). This suggests that the majority of the variance in trait-authenticity ratings had to do with differences between the ones making the ratings, and across unique relationships, rather than differences in the target. In addition, the reliability of the actor effect was .80, whereas the target-effect reliability was low (.35), suggesting low consensus between raters about which targets are authentic.
In further support of the findings of Study 2, the correlation of self-ratings of authenticity and other-rating trait of authenticity controlling for group membership was not significant, r = .01, t(439) = 0.26, p = .792. That is, self-rated trait authenticity was not correlated with other-rated trait authenticity. However, there was a significant correlation between the rater’s trait authenticity and their perception that the target is authentic controlling for group membership, r = .26, t(439) = 5.60, p < .001.
Is a lack of accuracy unique to authenticity?
Next, we tested whether our raters were accurate on another dimension—personality. We found that all five self- and other-rated personality traits were positively and significantly correlated at the p < .001 level, suggesting that a lack of accuracy was unique to authenticity (for more details, see the Is Inaccuracy Unique to Authenticity? section and Table S5 in the Supplemental Material). This suggests that our raters are relatively accurate judges of the targets’ personalities.
Perceived-authenticity biases
As in Study 2, we compared average other-ratings of trait authenticity at the rater level with self-rated trait authenticity. We found that other-rated trait authenticity (M = 5.97) was significantly higher relative to self-rated trait authenticity (M = 5.27; mean difference = 0.70, 95% CI = [0.62, 0.78]), t(857) = 17.38, p < .001, Cohen’s d = 1.03.
We then considered whether raters were biased in their ratings of other peoples by their own authenticity.
Raters’ trait authenticity
We first tested whether raters were biased by their own trait authenticity when rating the trait authenticity of a target. To test this, we ran a multilevel model with participants’ self-rated authenticity nested at the target level predicting participants’ ratings of the target’s authenticity. We found a positive and significant relationship between rater trait authenticity and the ratings of authenticity they give to other individuals (b = 0.16, SE = 0.02, 95% CI = [0.13, 0.20], p < .001). This effect was robust when analyses controlled for the rater’s personality, familiarity with the target, and demographic characteristics (b = 0.13, SE = 0.02, 95% CI = [0.09, 0.16], p < .001). We then used the rater’s trait authenticity as a predictor of the signed-difference authenticity-inaccuracy score. We found that raters’ trait authenticity predicted a positive authenticity-inaccuracy score (b = 0.15, SE = 0.02, 95% CI = [0.12, 0.19], p < .001).
Raters’ state authenticity
Similarly, we found a significant and positive effect of rater state authenticity on their trait-authenticity judgments of other individuals (b = 0.14, SE = 0.02, 95% CI = [0.10, 0.17], p < .001). Thus, authentic raters were biased by their own authenticity, rating other people as more authentic than the targets rated themselves.
Who is perceived as authentic?
In exploratory analyses, we again considered who is more likely to be perceived as authentic. We entered the demographic and personality variables of the targets, nested at the rater level, as predictors of other-rated trait authenticity. We found that agreeableness positively predicted other-rated trait authenticity (b = 0.04, SE = 0.02, 95% CI = [0.00, 0.07], p = .031). In addition, age was a positive and significant predictor of other-rated trait authenticity (b = 0.01, SE = 0.01, 95% CI = [0.00, 0.03], p = .018).
Meta-perceptions of authenticity
In addition to replicating the results of Study 2, we also collected meta-perceptions about authenticity from targets. With this information, we were able to test two things. First, was a target’s authenticity positively correlated with their meta-perceptions about authenticity? That is, did authentic targets expect to be perceived as more authentic? Second, we were able to test whether a target’s meta-perceptions about their authenticity predicted other-rated authenticity. This would reveal whether targets were accurate in their meta-perceptions.
Using OLS regressions, we found that participant trait authenticity predicted expectations that other people would see them as authentic (b = 0.36, SE = 0.04, 95% CI = [0.28, 0.43], p < .001). Similarly, participants’ state authenticity predicted expectations that other people would see them as authentic (b = 0.23, SE = 0.03, 95% CI = [0.18, 0.29], p < .001). This suggests that authentic people expect to be perceived as authentic by others.
Next, we tested whether participants’ expectations that they would be perceived as authentic were related to other-rated trait authenticity. We estimated the relationship between authenticity meta-perceptions and other-rated trait authenticity using meta-perceptions nested at the rater level to predict other-rated trait authenticity. We found that there was not a significant relationship between target-rated expectations of their perceived authenticity and other-rated trait authenticity (b = 0.02, SE = 0.02, 95% CI = [−0.01, 0.05], p = .278). Thus, targets were also not accurate with their perceptions of how other individuals judge their authenticity.
General Discussion
If authenticity is used as a criterion for conferring status, societal value, and morality judgments, perceived authenticity must be accurate. In this research, we first confirmed that individuals assume they can discern authenticity in others. We then directly compared self- and other-rated authenticity in a real social setting with two samples of randomized working teams over 6 weeks. We found that other-rated authenticity did not significantly correlate with any self-rated measure of authenticity. In addition, we found that perceived authenticity was biased. First, other-ratings of authenticity were more positive than self-ratings. Second, authentic raters rated other individuals as more authentic; that is, raters were biased by their own authenticity. Finally, we found that meta-perceptions (expectations that other people will see you as authentic) were similarly uncorrelated with other-ratings of authenticity.
A crucial assumption embedded in our research is that we consider an individual’s ratings of their own authenticity to be the ground truth and compare other-rated authenticity against it. The source of truth is a longstanding debate in accuracy research. A constructivist point of view (Kruglanski, 1989) would suggest that authentic judgments are a function of the collective view and that other-rated authenticity is more valid than self-rated authenticity. In line with this, research has found that self-rated authenticity can be biased. For example, individuals consider their positive actions to be more authentic than negative ones (Jongman-Sereno & Leary, 2016), and self-rated authenticity is more strongly predicted by positive mood than trait-consistent expression (Lenton et al., 2013). If self-rated authenticity is biased by situational or valence characteristics, it adds additional strain to its perceptibility. Regardless of the starting point for judging accuracy, the self- and other-ratings of authenticity in our studies were never significantly correlated.
There are a number of additional explanations for why it is difficult to accurately perceive authenticity in other people. First, authenticity may be hard to observe, whereas inauthenticity is not. Research on deception accuracy has found a similar pattern—people are more accurate at recognizing lies as falsehoods than they are at recognizing truths as truthful (Bond & DePaulo, 2006). Second, it may be easier to fake the earmarks of authenticity than it is to authentically meet them. Consider someone faking an illness: Someone who is pretending to be sick is more likely to exhibit all of the symptoms of that illness relative to someone who is authentically ill, presenting some symptoms but not others. Future research should focus on which, if any, of these factors play a role in the inaccuracy of authenticity judgments.
A limitation of our studies is that we did not observe the usefulness of these authenticity judgments. It may be beneficial in interpersonal relationships to believe that other people are authentic even when they are not. As Swann (1984) noted, “An accurate belief is an instrumental belief” (p. 461). Indeed, self-rated authenticity was high in our samples (Study 2: M = 5.27, Study 3: M = 5.27), as in past research (Lenton et al., 2016). Therefore, assuming that other individuals are being authentic may be strategically beneficial. Future research should focus on whether groups that view each other as authentic perform better and have increased trust and cohesion, relative to groups who do not view each other as authentic.
An additional limitation of our studies is that we compared self- and other-rated authenticity in a specific context, in randomly assigned teams, and in specific populations. First, given that the setting was not strictly social, it may be that settings in which self-disclosure is more common (e.g., support groups or religious/spiritual groups) provide boundary conditions for when perceived authenticity can be accurate. In addition, there are potential limits to the generalizability of the findings, given the study populations (MTurk workers, MBA students).
Finally, we chose to compare authenticity ratings at the trait level, assuming that trait-level authenticity would be easier to discern. This decision was made given that trait-based views of authenticity assume more stability within an individual and across time. However, there are cases in which state-based measures may be more perceptible to other individuals, such as a first interaction or a moment of particularly high or low state authenticity. Alternatively, cues as to an “authentic person” may be different from the cues about an authentic expression. Future research is needed to fully align perceived authenticity with the multifaceted construct of authenticity.
Future research should consider when and why perceived-authenticity biases occur. For instance, we posit that other-rated authenticity is higher than self-rated authenticity because the deception-detection literature has found that individuals overestimate truth relative to lies (DePaulo et al., 1997; Zuckerman et al., 1981). An alternative explanation is that individuals are modest in their self-ratings of authenticity. In addition, there is asymmetry of information available when these ratings are being made—that is, people may rely on different types of information when evaluating their authenticity relative to another’s authenticity. Further empirical evidence is needed to more deeply understand this bias.
Because judgments of another’s authenticity require insight into that person’s true self, one would assume that the better you know someone, the better you should be at judging that person’s authenticity. In contrast, a surprising finding in our data was that familiarity did not increase the accuracy of perceived authenticity; rather, the greater the familiarity between a rater and their target, the less accurate their authenticity ratings became. Specifically, as familiarity increased, other-rated authenticity grew increasingly more positive relative to the target’s self-rated authenticity (b = 0.22, SE = 0.03, 95% CI = [0.16, 0.28], p < .001; see Fig. S1 in the Supplemental Material). This could result from a number of motivations. For instance, the better we know someone, the more motivated we are to believe that that person (and our relationship with them) is authentic. Future research should further delve into the relationship between familiarity and perceived authenticity.
Our core contribution is to the authenticity literature. Prior research has implicitly assumed that perceived authenticity is related, on some level, to experienced authenticity. This has driven research on the antecedents of authenticity to focus on characteristics of an individual that should be related to self-rated authenticity. A notable exception is the work by Kokkoris and Kühnen (2014), who found that self-expression is perceived as authentic only when it fits with societal norms, suggesting that perceived authenticity may have more to do with social norms than unobstructed self-expression. By unhooking the individual experience of authenticity from perceptions of authenticity, we hope to broaden the scope for future research to explore antecedents of perceived authenticity. In addition, we add caution to sweeping judgments of who is authentic and who is not, suggesting that the truth of someone is hard to perceive in everyday life.
Supplemental Material
sj-docx-1-pss-10.1177_09567976211056623 – Supplemental material for Are You for Real? Perceptions of Authenticity Are Systematically Biased and Not Accurate
Supplemental material, sj-docx-1-pss-10.1177_09567976211056623 for Are You for Real? Perceptions of Authenticity Are Systematically Biased and Not Accurate by Erica R. Bailey and Aharon Levy in Psychological Science
Footnotes
Acknowledgements
We thank Adam Galinsky, C. Blaine Horton, Sheena Iyengar, Jon Jachimowicz, Michael Slepian, Daniel Stein, and Julianna Pillemer for critical feedback that substantially improved this article.
Transparency
Action Editor: Kate Ratliff
Editor: Patricia J. Bauer
Author Contributions
E. R. Bailey and A. Levy designed and conducted the studies. E. R. Bailey analyzed the data. Both the authors wrote the manuscript and approved the final version for submission.
Notes
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
