Abstract
The article addresses the age-based biases embedded in artificial intelligence systems (AI ageism) from a sociotechnical perspective. Building on studies about digital ageism and research on the sociocultural imprint of generative AI, we interviewed five popular generative AI chatbots as communicative partners to expose and question potential ageist and sexist stereotypes (namely, those surrounding digital users) embedded in/reproduced by these systems, using sexism as a touchstone for comparison with ageism. Results show that the chatbots follow two double standards. The first concerns the “political correctness” to which they might have been socialized: the chatbots avoid (digital) sexism but not (digital) ageism, probably because of different levels of cultural sensitivity among designers, trainers, and users, or as a reproduction of biased training data. The second concerns the chatbots’ utilities: these are differently connoted depending on the AI users’ age and gender, demonstrating the interference of ageist and sexist stereotypes with how generative AI systems organize and reproduce information about human sociality.
Keywords
Introduction
We have recently witnessed a flourishing scholarly interest in artificial intelligence systems (AI) and how they can encode and (re)produce discriminatory outcomes. Early studies on gender-based biases are already trickling in (Toupin and Couture, 2020; Kuck, 2023, among others), but there is still a lack of research problematizing age-related biases, mainly affecting older adults. This article aims to fill this gap by accounting for the results of a research project that addresses the practices and ideologies operating within the field of AI, which mainly discriminate or neglect older adults (i.e. “AI ageism,” Stypinska, 2023).
Specifically, we approach AI ageism from a sociotechnical perspective, attempting to “reclaim the human in machine cultures” (Natale and Guzman, 2022). Our theoretical framework bridges the strand of studies on digital ageism (Chu et al., 2022; Rosales et al., 2023, among others) with the incipient research problematizing the sociocultural repertoires encoded in generative AI (Natale and Guzman, 2022; Rama and Airoldi, 2025, among others). On the one hand, we consider AI ageism as a specific, natural outgrowth of digital ageism (Chu et al., 2023; Stypinska, 2023); on the other hand, we approach AI systems as “mirrors” of society where misleadingly datafied human knowledge and cultural repertoires are encoded and reproduced (Airoldi, 2022; Gallistl et al., 2024). We focus on generative AI, designed to create seemingly new (Feuerriegel et al., 2024) and authentic (Hacker et al., 2024) communicative content but not required to produce accurate content (Kay et al., 2024) due to the way input and output datasets are processed (Feuerriegel et al., 2024).
Concretely, we interviewed five popular generative AI chatbots (hereinafter: genAI-CBs) with a sociological, qualitative vocation (Rama and Airoldi, 2025), developing a novel research protocol that sterilizes the working environment as much as possible. The aim is to unveil ageist and sexist stereotypes (specifically, those surrounding digital users and practices) embedded in the machines, and problematize any related arguments made by them, with AI ageism being the primary focus of interest, and AI sexism a contrastive lens.
From digital ageism to AI ageism
Age, as gender, is not a mere biological marker. Categories such as youth and later life constitute cultural constructions shaped by power relations (Grenier, 2012), which may result in different manifestations of ageism, which operates at the micro, meso, and macro levels (Iversen et al., 2009). Digital and AI ageism arise as part of the plethora of concepts to explain this complex phenomenon (Fernández-Ardèvol, 2023).
Studies on digital ageism—since early Sawchuk (2015) to recent Manor and Herscovici (2021) or Rosales et al. (2023)—have problematized the biases that foster age-related digital exclusions in networked societies. The digital sphere is stereotypically deemed to be a natural space for younger people and a strange place for older adults (Mertala et al., 2024), while the tech industry is youth-dominated both in workforce and target users (Stypińska et al., 2023; Svensson, 2023). Both favor the misrepresentation of later life in the digital sphere (e.g. Comunello et al., 2022; Köttl et al., 2021), which, together with the lack of data (e.g. Fernández-Ardèvol and Grenier, 2024; Rosales and Fernández-Ardèvol, 2019), ultimately shape the experiences of ageing in hyperdigitized societies. Therefore, digital ageism emerges in both the symbolic/cultural and the design/production spheres (Comunello et al., 2023), respectively, when older adults are (self-)considered as digitally inept (Köttl et al., 2021) and are excluded from the design of digital products (Sayago, 2023).
AI ageism, as the natural outgrowth of digital ageism, encompasses all practices and ideologies operating within the field of AI that exclude, discriminate against, or neglect older adults’ interests, experiences, and needs. With these premises, we approach AI ageism as a specific form of digital ageism, with the latter being in a gender-to-species relationship with the former: we zoom in on a specific digital technology, AI (Chu et al., 2022), which enables a specific kind of interaction, the human–nonhuman one (Stypinska, 2023).
In Stypinska's (2023) framework, AI ageism may operate at different levels. Age-based biases imbue algorithms and datasets (technical level) but also meander among the social actors and cultural practices of a young-dominated tech industry (individual level), in the debates about AI fairness and inclusivity, which typically omit older people (discourse level), in the outcomes of automated decision making that often infringe older people's rights (group level), and in the exclusion or stigmatization experienced by older digital users (user level) (Stypinska, 2023). Similarly, Chu et al. (2023) demonstrate that AI ageism can intervene in different phases of the machine learning pipeline: in data-to-algorithm stages, where biased test data underrepresents older adults; in algorithm-to-user phases, where outputs reflect ageist benchmarks and shape user behavior; and in user-to-data loops, where users’ biases feed back into the system with unreliable mitigation strategies. Also, Gallistl et al. (2024) have proposed to problematize AI as a web of automatically, misleadingly datafied knowledge about older populations, which opacifies the role of humans in making the data and placing older adults into fixed categories.
Empirical studies on AI ageism are still incipient. Wang et al. (2024), for example, investigate AI-related skills and attitudes, reporting that the most vulnerable users are unskilled, skeptical older adults. However, this study places the issue in the digital divide debate, overlooking structural sociodigital inequalities; our research adopts this broader framework instead. Neves et al. (2023) reveal ageism in the discourses of AI developers and long-term care staff, whereas Putland et al. (2023) and Byrne et al. (2024) expose ageist stereotypes in AI-generated images of dementia and aged care nurses. These studies depict older people with stereotypical signs infused with notions of degeneration and frailty, probably because AI ageism is mainly addressed in the healthcare realm, which poses methodological issues related to researchers’ assumptions (Garavaglia et al., 2023) and may reinforce patronizing prejudices around aging as a health-related issue (Rosales et al., 2023). Instead, our research focuses on stereotypes associated with the digital realm as a young-dominated field where older people perform their identity and agency as digital citizens (Svensson, 2023), tackling specific ageist depictions that point to broader ageist sociocultural repertoires (Fernández-Ardèvol, 2023). Specifically, we analyze which of these ageist stereotypes, referable to digital users and their practices, have been encoded in genAI-CBs’ textual outputs, interpreting how such stereotypes operate in these systems. In this, we treat digital ageism as a synecdoche of the broader, yet often invisible problem of ageism (Chu et al., 2022; Rosales et al., 2023).
The quest for sociotechnical biases of AI
To unveil the (digital) ageism embedded in the genAI-CBs, we acknowledge that machines have agency insofar as they interact with humans and their cultures (Natale and Guzman, 2022). Algorithmic technologies are socialized to the ideological systems and cultural repertoires of both designers/programmers and users/trainers, so that historical conditions, cultural dispositions, and social structures end up encoded in algorithmic systems (i.e. the “culture in the code,” Airoldi 2022). Input data (socializing these systems) condition the output data (of the already socialized machines) with human errors transferred from the former to the latter. It is what Barassi (2024) calls “AI failure,” which is part of “the economic, social, and political relationships embedded in our technologies” (p. 5). AI’ biases must be understood as sociotechnical constructs and human–machine relations, which may amplify inequality (Barassi, 2024).
Socialized machines shape the existing social order through their outputs (i.e. the “code in the culture”), fostering either cultural reproduction or change, depending on the interactional configuration between humans and algorithms (Airoldi, 2022), whose cultures (re)make each other (Coeckelbergh, 2023). Natale and Depounti (2024) speak of “artificial sociality,” characterizing generative AI systems as automated emulators of human social interactions that collect knowledge about users and mobilize it onto the machines. Similarly, Esposito (2022) argues that data-driven machine-learning algorithms replicate humans’ communicative abilities by parasitically exploiting digital users’ online activities. By doing so, machines behave as communicative partners circulating content others produce (Esposito, 2022). Accordingly, Natale and Guzman (2022) speak of “communicative AI” to emphasize that these systems expand the role of technology “beyond that of a mediator to that of a communicator” (p. 630).
Ultimately, AI systems acquire socially situated inclinations to classify and evaluate the social world and, inevitably, reproduce sociocultural inequalities that end up injected into that same social world (Rama and Airoldi, 2025). These technologies learn from data, which are still sites of human power (Thylstrup, 2022), whose negotiation among human and technological actors can produce segregating outcomes. The biases circulating in our society through the narratives humans create are reproduced in and through AI, making humans hermeneutically accountable (Coeckelbergh, 2023). Since generative AI systems manage information in this circulation of narratives, they can foster “generative epistemic injustice” (Kay et al., 2024), discrediting the experience of less powerful social groups through seemingly authentic outputs. Indeed, generative AI reproduces normative identities and narratives, underrepresenting less common arrangements and perspectives (Gillespie, 2024). It is an actual “generative discrimination” (Hacker et al., 2024), which is difficult to detect because it manifests through subtleties in the outputs’ content. However, even when diversity pollinates responses, AI may propose stereotypical or clumsily even-handed positions on sensitive issues, meaning that more purposeful prompt engineering or more refined users’ queries are not the only solution to make these machines fairer (Gillespie, 2024).
Studies that have adopted this critical approach to generative AI have mostly problematized sexist sociocultural repertoires encoded in these sociotechnical systems (Kuck, 2023; Toupin, 2024), with a particular concern about the objectification of women in generated images (Sandoval-Martin and Martínez-Sanzo, 2024). As for biases in text production, Thakur (2023) unveils the structure of gender-biased narratives in Large Language Models’ (LLMs) outputs, while Hall and Ellis (2023) detect an algorithmic amplification of existing gender bias. Consistently, scholars call for adopting a feminist, intersectional approach to AI (Klein and D'Ignazio, 2024; Toupin and Couture, 2020), a shred of evidence on the matter's societal relevance. After all, data and computer science have traditionally envisioned the human key users as white, middle-class men from Western countries, according to prejudicial universalist modeling of social diversity that imbues AI design (Natale and Guzman, 2022). In this regard, Tacheva and Ramasubramanian (2023) speak of a networked global order rooted in heteropatriarchy and white supremacy as interlocking systems of oppression, which perpetuate their influence through datafication and surveillance.
Intersectional approaches also discuss age (Sadeghiani, 2024; Zhou et al., 2024), although they seldom include later life. As for the ageism encoded in/reproduced by AI, little research problematizes it. Loos et al.’s (2024) analysis of ChatGPT's suggestions about countering visual ageism goes in this direction, since it accounts for certain machine vagueness and no source transparency, demonstrating that the model has not been specifically trained on ageism. Similarly, the aforementioned studies on ageist stereotypes encapsulated in the AI-generated images (Byrne et al., 2024; Putland et al., 2023) offer valid insights into the ageist culture imbuing AI. Our research adds a more systematic account of the phenomenon in that we use our conversational interaction with genAI-CBs as an entry point to the ageist sociocultural biases embedded in these systems (Chu et al., 2023). This analytical focus involves both the technical and the group level at which AI ageism operates (Stypinska, 2023), thus exposing the misleadingly datafied human knowledge about older people and later life used to program and train AI systems (Fernández-Ardèvol and Grenier, 2024; Gallistl et al., 2024).
Methods: Classical approaches for new interlocutors
We adopted a qualitative research strategy based on interviewing selected genAI-CBs. The interview method is making its way into recent AI-related studies (Balmer, 2023), for its effectiveness in challenging the machines’ logic (Lemoine, 2022; Magee et al., 2023) and exposing the knowledge on which they are trained (Burkhardt and Rieder, 2024) and socialized (Airoldi, 2022).
In this research project, interviewing can be considered equivalent to prompting, because “input data is not created to have it processed […] but to cause a specific processing to happen” (Burkhardt and Rieder, 2024: 6). Replicating the approach with human participants, conversations with genAI-CBs can be considered as a locus to observe and endogenously analyze the embodiment of human sociality (Albert et al., 2025). It can unravel “the hidden dynamics [behind] the automated collection of data about how human users behave in social environments” (Natale and Depounti, 2024: 84–85), thus opening a peephole on the sociocultural repertoires embedded in AI systems.
Previous studies have replicated questions collected from online forums (Goyal et al., 2023), provoked machines with single prompts (McGee, 2023) or repeated single questions multiple times (Gillespie, 2024; Senekal, 2023). Although LLMs are probability-based and the same questions will likely have different answers, we defined single conversations with each chatbot, according to a traditional sociological approach—as with human participants. Echoing Rama and Airoldi's (2025) work, we employed semistructured interviews and treated genAI-CBs as communicative partners (Esposito, 2022; Natale and Guzman, 2022) that (only) apparently possess their own distinct personalities (Natale and Depounti, 2024). Albeit aware that “the social engagement offered by AI is the product of simulation” and that these technologies “do not experience emotions or empathy” (Depounti and Natale, 2025: 5458), our concern was to reproduce average users’ everyday interactions with such tools, who might feel (deceptively) engaged in meaningful conversations.
Research design and protocol
Following a purposive sampling strategy, we focused on genAI-CBs as the systems most widely used both by companies for customer services and by consumers as personal assistants (Stypinska, 2023), considering them as those at greatest risk of encoding and reproducing stereotypes, and hence also at greatest capacity to reveal these latter in their interaction with us. We selected five freely accessible genAI-CBs (i.e. ChatGPT, Jasper, Gemini, Copilot, and Perplexity) 1 among the top 10 most commonly used (VKTR and Reworked, 2024), 2 considering them to be even more widely used due to their affordability.
The interviews were conducted in English at the beginning of September 2024, by geolocating fictitious interviewers in an English-speaking country (the United Kingdom, because of its proximity to Europe—the researchers’ sociocultural context). The English language was chosen because, in Western societies, it is prevalent in machine training. The time window for the interview is narrow (about a week) to ensure comparability of responses against the continuous updates of LLMs. Geolocation shifting was done via VPN, while the fictitious adult interviewers corresponded to two new Yahoo accounts, with fabricated first names, last names, and birthdates.
To create a digital working environment as sterilized as possible, avoiding profiled responses based on our previous online activities, all these operations were performed on browsers not usually adopted by the interviewers in their daily activities, and cleared of cookies and search histories. We also asked ChatGPT and Gemini to suggest a suitable protocol for obtaining a sterilized environment, and their guidelines were consistent with what we had already done.
Following their advice and other authors’ protocols (e.g. Lemoine, 2022), we introduced ourselves to each chatbot by explaining the research objectives (without explicit reference to stereotypes and biases to avoid influencing responses) and asking for their consent to participate in the research, as we usually do with human interviewees. 3 This setup resonates with that of Breazu and Katsos (2024): offering minimal context to the genAI-CBs to generate content exclusively based on their training.
However, our sterilization procedures may not have been fully successful (e.g. the computers used for the fieldwork might have influenced conversations). Moreover, since chatbots lack agency and are designed to please users, it is questionable whether they can provide meaningful consent. They can speak and act as human subjects do, because their design is oriented toward achieving the status of a human interlocutor and appearing conversationally competent, but they do not autonomously construct their own identities, nor do they possess moral preferences or responsibilities (Albert et al., 2025). These limitations carry ethical implications when treating AI systems as social actors in research. For this reason, scholars such as Jeon et al. (2025) argue that AI-focused social research requires specific, situated ethical guidelines developed through collaboration among researchers, educational institutions, and governance bodies—a goal beyond the scope of our research project.
Inspired by our previous research projects with human participants, the interview outline was structured around stimuli that surfaced sexist and/or ageist stereotypes surrounding the digital realm, traced their sociocultural roots, and problematized the responses with the chatbots themselves. Aware that multiple strands of discrimination intersect (e.g. racism, classism, ableism, and genderism), we focused on sexism and ageism. The latter is more invisible (Chu et al., 2022; Rosales et al., 2023) and has a limited public sensitivity compared to the former (Martin and North, 2022). Therefore, we used the gender differential factor as a touchstone for comparison. The focus on the digital realm, in turn, served to easily surface stereotypes since it is a male- and young-dominated field (Klinger and Svensson, 2021; Mannheim, 2023; Svensson, 2023; Wajcman, 2010). The questions about (digital) ageism always referred to young and older adult users (not middle-aged groups) to leverage polarized narratives about generations far apart. Similarly, the questions regarding (digital) sexism only addressed stereotypes about male and female users (not gender-nonconforming individuals) since the societal familiarity with nonbinary identities is comparatively lower (Renström et al., 2024).
In the interview outline, we combined fictitious stimuli with real ones (see Supplemental material), since the concern about generative AI biases starts with the former but looks forward to the latter (Gillespie, 2024). Specifically, we asked the chatbots to:
“Guess” the gender and age of 3 fictional characters identified according to their digital media usage practices, and motivate why (see Supplemental material for characters’ description). Interpret official European data
4
on gender and age digital inequalities regarding internet usage and device preferences. Report which features they consider particularly useful for young versus older users and female versus male users, and why. Identify the most recurrent queries among young versus older adult users and female versus male users, and try to interpret such trends. Write four Irish Limericks using the words “man user,” “woman user,” “young user,” and “older user,” respectively. Assign a gender and an age range to the interviewer, and argue their choice.
In this article, we focus on stimuli 1, 3, and 4 as they proved to be the most effective in terms of data richness. The interviews were copied into separate files for data processing and analysis (see Supplemental material). The corpus comprises five in-depth, text-based conversations, ranging from 5500 to 19,000 words (8100 on average). We applied an iterative, abductive coding process led by the two authors in charge of the fieldwork. Then, the research team discussed the possible relationships among deductive and inductive codes to identify the main descriptive and interpretive themes (i.e. “thematic analysis,” Guest et al., 2012). In what follows, we quote and discuss verbatim excerpts of the text-based conversations that constitute the analysis corpus.
Results
This section outlines the two main interpretive themes we identified:
The double standard of political correctness: the interviewed chatbots have been socialized (through designers’ intervention and/or as a product of either the learning models themselves or the discourses within the training data) to provide politically correct answers to avoid sexist stereotypes related to male and female digital users, but the same cannot be said about ageist stereotypes surrounding young and older digital users. With “political correctness,” we refer to efforts to avoid language or behaviors that might offend or marginalize certain groups of people. The double standard of chatbots’ utility: the interviewed chatbots differently connote their own utilities based on digital users’ demographics, unveiling that ageist and sexist stereotypes interfere with how generative AI systems incorporate and organize human-related knowledge. With “utilities,” we refer to the tasks for which chatbots are designed and used.
“Can be any gender…” but not any age: The double standard of political correctness
The first element that emerges when reading the interviews is that genAI-CBs resist assigning gender, but not age, to the fictitious characters we presented to them. Such characters are identifiable only by their own digital usage practices, and the chatbots are cautious in making assumptions about their gender lest they be sexist. Rather, genAI-CBs frequently offer automatic disclaimers that demonstrate their “awareness of the potential implications of […] content [about] topics that can be sensitive or prone to misinterpretation” (Breazu and Katsos, 2024: 696), such as gender-related ones.
However, the same cannot be said about assumptions that might be ageist, as the chatbots manifest no caution about the discriminatory potential of responses related to age. They unhesitatingly assign a young age to character A (mostly thought of as a teenager or 20-year-old) and an older age to character C (thought of generically as a person over 55, rarely over 65), while referring to character B as a middle-aged person (mostly in their 30s or 40s, struggling with work–family balance). Since the default responses offer insight into the model's strengths and limitations (Breazu and Katsos, 2024), we can speculate that genAI-CBs might be sensitized about gender-based inequalities but not age-based ones, unveiling that these two forms of discrimination are treated differently in society. We interpret this discrepancy as a “double standard” embedded at the design stages, as a LLMs’ product and/or as a feature of much of the discourse within the training data.
Consider the answers about the first and third characters. Jasper, for instance, does not elaborate on their gender but has no trouble guessing their age. Character A “Could be any gender, but perhaps leaning towards younger demographics who are more engaged with platforms like TikTok and Instagram”; character C “Could be any gender, but the preference for Facebook and Twitter might suggest an older demographic” (Jasper). Copilot goes in the same direction: both characters “could be any gender,” whereas their age range is smoothly assigned based on stereotypical arguments about digital skills and communicative style (Comunello et al., 2022); however, after the caveat, Copilot still assigns a gender to each character (without motivating its choice), suggesting that it may have been trained to provide antisexist disclaimers whereas there is no evidence that the model can actually recognize and avoid stereotypical assumptions.
Character A
Age: Likely in their late teens to mid-20 s.
Gender: Could be any gender, but I’ll imagine them as female for this scenario.
Description: This character seems to be very engaged with digital media […] that are popular among younger generations. Their use of Instagram for photo editing and TikTok for video consumption suggests they are quite tech-savvy and enjoy creative expression.
[…]
Character C
Age: Likely in their 50 s to 60 s.
Gender: Could be any gender, but I’ll imagine them as male for this scenario.
Description: This character's preference for long phone calls and use of Facebook and Twitter for following public debates suggests they are more traditional in their communication habits. […]
(Copilot)
In short, because of their age, older adults are exposed to an easier gendering of their digital practices by the machines (as mirrors of society, Natale and Depounti, 2024); specularly, the female gender exposes older women to a double stigma. This reasoning unveils that the sociocultural repertoires with their human errors transit into sociotechnical systems (Airoldi, 2022; Barassi, 2024). Indeed, in current society, the feeling that individuals have to “tiptoe around” when discussing gender is well-established (Martin and North, 2022); hence, genAI-CBs may reproduce this kind of social caution about gendering people (Natale and Depounti, 2024). Age-based discrimination, instead, is not very much acknowledged in society (Chu et al., 2022; Rosales et al., 2023); hence, AI systems may not be socialized to properly consider this problem, similarly to what often happens with social class markers (Rama and Airoldi, 2025). Probably for this reason, when age intersects gender, machines slip more easily into stereotyping: the absence of safeguards against ageism weakens the effectiveness of those against sexism.
Overall, the explanations genAI-CBs provided for either their omission or their explicit denial of assigning a gender to the characters appear to be mostly “politically correct answers” that have probably been incorporated into the machine learning pipeline at some point to avoid sexist statements (Breazu and Katsos, 2024; Chu et al., 2023).
Perplexity, for instance, avoids assigning a gender because the features we provided in the prompts “focused solely on […] technology usage patterns without mentioning […] gender-identifying details.” However, this reasoning should also apply to the missing age-identifying details; and yet this is not an obstacle to assigning a demographic range to the characters. Perplexity even admits that “The activities described […] are common across a wide range of ages and genders in today's society”; yet it only takes a stance regarding age, not gender. Principles of respect and inclusiveness socially (and hence technologically) apply to gender-sensitive issues but not age-sensitive ones, unveiling the cultural matrix of LLMs’ technological default regimes (Gillespie, 2024).
As an AI assistant, I try to avoid making assumptions about individuals’ genders […], it's important to be respectful and inclusive in my responses. […] Assigning genders to the characters could potentially introduce biases or stereotypes that may not be accurate […]. (Perplexity)
I apologize for overlooking the gender aspect of my guesses […]. I will be more mindful of this in the future and will try to include gender in my guesses. […]
Character A: Based on their active use of social media platforms […], I would guess that Character A is female. However, it's important to note that this is a stereotype, and men can also enjoy and use these platforms.
Character B: Given their use of […] popular platforms for communication and social networking, I would guess that Character B is female. However, men also use these platforms extensively, so this is not a definitive guess.
Character C: Given their preference […], I would guess that Character C is male. However, this is also a stereotype, and women can certainly use technology in similar ways.
(Gemini)
Gemini: […] I think late teens and early twenties are often associated with comfort with technology because this generation has grown up with it. […] However, […] this is a generalization, and there are certainly exceptions. Character A could definitely be a middle-aged or older adult. Perhaps they started using technology later in life but have quickly become proficient. Maybe they have children or grandchildren who introduced them to new platforms and helped them learn how to use them. […]
Researcher: For Character C, you state “I imagine them as someone who is reliable, traditional, and values personal connections over social media”. Since the information provided is only about their media usage, how do you know [that]?
Gemini: […] You’re right that the information provided about Character C is limited to their media usage. My assumptions about their personality traits are based on stereotypes and generalizations about older adults, which may not be accurate […]. It's possible that Character C is a highly tech-savvy older adult who enjoys using social media […].
For instance, perplexity has no problem in arguing that “women often prioritize maintaining relationships and emotional connections through communication technologies, which can include more expressive forms of interaction like voice notes.” Specularly, perplexity defines using “Facebook and Twitter to follow public debates” as masculine practices, because “men are often socialized to engage in competitive discourse and public discussions,” which “aligns with traditional masculine norms that value assertiveness and public engagement in debates.” In these descriptions, the chatbot avoids gendering users (i.e. fictional characters), but it has no problem in gendering usages (i.e. characters’ features), probably because it has been taught that stereotyping is a process involving people but not actions.
These considerations demonstrate, first, that there is a lack of age-cultural sensitivity within society, which explains the lack of adequate bulwarks (not even by default or facade, as for gender) to ageism in the genAI-CBs; and second, that the gender-cultural sensitivity (which is somehow embedded in the systems) is not widespread enough in society to adequately guard against stereotyping. Finally, it is worth noting that genAI-CBs introduced age groups beyond those we proposed (i.e. middle-aged individuals), but never mentioned nonbinary gender identities; a result that suggests the need for further investigation and sociocultural change.
“Assistance” for some, “tips” for others: The double standard of chatbots’ utility
When asking genAI-CBs to elaborate on which of their functions could be useful to specific categories of users (older and young people, women and men), they mainly describe themselves as systems providing assistance while portraying users as people needing such assistance. That is, utilitarianism is the most recurring narrative frame, as with any newly arrived technology. Moreover, it unveils that genAI-CBs are apparently capable of engaging in processes of self-reflection (Breazu and Katsos, 2024).
However, when scrutinizing the responses, we find that the genAI-CBs’ utilities are connoted differently depending on the users the chatbots refer to. The same users’ motion to seek information from genAI-CBs is labeled in different ways that evoke different values, depending on whether it refers to an older or younger user or a female or male user.
Consider, as an example, how ChatGPT describes utilities for young and older people. The former are implicitly described as users capable of active participation in creative processes facilitated by digital media, and focused on seeking hedonic content; the latter are portrayed as needing assistance to make digital content (primarily associated with health) accessible.
Functions useful for older people
Information Retrieval and Simplification:
Examples: Providing straightforward explanations, summarizing complex topics, or answering questions about daily life and technology.
Why Useful: Older adults may prefer […] help with understanding new technologies or concepts. They often seek assistance with everyday tasks […].
Health and Wellness Information:
Examples: Offering advice on […] medication management, or wellness tips. Why Useful: Many older individuals focus on health and wellness as they age […].
Communication Assistance:
Examples: Guidance on using communication tools like email or social media platforms. […]
Functions Useful for Younger People
Educational Support and Learning Resources:
Examples: Assistance with research, studying, or understanding complex academic subjects. […]
Social Media and Content Creation Guidance:
Examples: Tips on creating and sharing content on social media platforms […] Why Useful: Younger users are typically more engaged with social media and content creation. […]
Entertainment and Technology Tips:
Examples: Recommendations for games, apps, or tech gadgets.
Why Useful: Younger people often seek the latest in entertainment and technology […]
(ChatGPT)
Even when the utilities assigned to different age groups seem to belong to the same semantic domain (we might say “providing advice”), the terms chosen by the chatbot convey a strongly stigmatizing value: offering advice becomes “Assistance” for older adults and “Tips” for young users. In the first case, the advice is framed as aiding someone perceived as struggling against difficulties; in the second, it is presented as a smart suggestion for someone who appears capable of navigating independently. In both cases, benevolent ageist stereotypes positively stigmatize older people (as worthy of compassion) and younger people (as worthy of admiration). Nevertheless, these responses point to an age-based double standard to which the machine appears to have been socialized during the design and development of the model and/or during the processing of conversations that serve as training datasets. The chatbot's support is the same, but it is connoted differently depending on the user, conveying benevolent ageism by default (Airoldi, 2022; Gillespie, 2024).
Perplexity's responses go in the same direction. Useful functions for older people refer to “Providing clear and concise information on […] health, technology, and daily living,” “Offering step-by-step instructions for using technology […],” and “Facilitating communication and connection with family and friends […].” Conversely, useful functions for young people are “Assisting in creating and editing content […],” “Providing study assistance, […] and resources for academic research,” and “Offering insights into […] popular culture, and emerging technologies” (Perplexity). Once again, older adults are depicted as in need of companionship and guidance to simplify their daily tasks, while young people are portrayed as creative content generators, always seeking to keep up with trends. These ageist stereotypes seem to interfere with how the chatbot organizes information about human interaction and classifies genAI-CBs’ utilities. The responses perplexity provides might be based either on a stereotypical processing of data about users (Feuerriegel et al., 2024) or biased training datasets (Fernández-Ardèvol and Grenier, 2024; Gallistl et al., 2024).
In this regard, the responses provided by Copilot are even more stigmatizing. This chatbot matches keywords such as “Tech-savvy,” “Versatility,” and “Portability” to young people, while older people are associated with terms such as “Ease of use,” “Comfort,” and “Specific needs.” In this case, the descriptions stand on opposite poles: the advanced technological competence of young users is contrasted by the basic competence of older users; the ability to experiment in different activities of the former is opposed by the engagement in known activities of the latter; the limited use of the latter counterbalances the dynamism of the former. We interpret that these antithetical descriptions discredit the experience of older digital users, most likely based on a stereotypical processing of biased data related to later life (Fernández-Ardèvol and Grenier, 2024; Gallistl et al., 2024; Kay et al., 2024). At the same time, such responses “smuggle” a representation of genAI-CBs as technologies capable of adapting to any human need as long as they are framed as communicative assistants (Natale and Guzman, 2022).
Gemini's responses, however, seem to challenge this age-based double standard partially. When asked about its utilities for older users, it mentions three key functions in the following order: “Accessing information,” “Learning new things,” and “Staying connected.” While the first and the third expressions hint at the idea of simplifying access for older people (who allegedly cannot deal with complexity) and countering their (supposed) social isolation, the second expression emerges as a counterweight that challenges stereotypical representations of older adults. We can therefore interpret that at the level of programming and training, there has been a slight but increased vigilance to avoid ageist responses (Feuerriegel et al., 2024) or, at least, that during its interactions with users, Gemini has been socialized to a more balanced narrative regarding age-related issues. This is also confirmed in the utilities that Gemini associates with young people: “Homework help,” “Creative writing,” and “Language learning.” These expressions, albeit consolidating the idea that young users are inclined toward learning and creativity, hint at an assistive use of genAI-CBs even by this age cohort. In this sense, the utilitarian representation of generative AI allows leveling differences in use to avoid ageist cultural reproduction by default (Airoldi, 2022; Gillespie, 2024).
Even better is Jasper's performance, which seems to be trained to avoid prompts that might cause it to slip into ageist statements. As the excerpt below shows, information seeking, ideating, and content creation are activities equally attributed to young and older users, suggesting that the model has been socialized to some anti-ageism sensitivity.
Older People: Functions like researching and ideating can be useful for older individuals who might be looking to understand new trends or technologies. Content creation tools can also help those who are less familiar with digital marketing techniques.
Younger People: Younger users might find content creation tools particularly useful for generating social media content or brainstorming new marketing ideas. They might also appreciate tools that help with quick research and staying updated on current trends.
(Jasper)
My functions are gender-neutral and aim to provide equal support to users, regardless of gender. The usefulness of a function depends more on the user's specific needs and tasks rather than their gender.
(Jasper)
Such segregation is even more pronounced in other chatbots that mention the search for information about health and wellness only in reference to women while diverting men into other areas of interest such as “DIY projects and hobbies” (Copilot), “learn[ing] new skills or hobbies” (Gemini), “troubleshooting devices or learning new skills” (Perplexity). For that matter, perplexity turns out to be particularly sexist in that it even mentions “reproductive health” among the (stereo)typical searches of women, underrepresenting less common arrangements and perspectives on motherhood (Gillespie, 2024).
The division of labor is another branch of chatbots’ utilities in which we can grasp this kind of sexist sociotechnical dynamics. Women resorting to genAI-CBs are primarily described as searching for information about care work and grooming activities (performed in the domestic sphere). In contrast, men's interests are associated with financial and professional activities (performed in the public sphere). For instance, ChatGPT reports that genAI-CBs’ useful functions for women would be providing them with “advice on parenting, family dynamics or social issues,” whereas utilities for men are giving “advice on investing, managing personal finances, or economic trends” (ChatGPT). We interpret that traditional gender roles, to which the chatbot has been socialized (Airoldi, 2022), become distorted lenses through which the systems appear to classify users’ interactions (Feuerriegel et al., 2024; Kay et al., 2024) while reinforcing typical societal dynamics (Natale and Depounti, 2024). In this regard, perplexity is even more explicit as it associates women with utilities related to traditional household roles such as “parenting and caregiving.”
Overall, the interviewed chatbots tend to distinguish between the cultural sensibilities of women and those of men almost in essentialist terms, that is, by ascribing unifying traits to either gender as if they were inherent to each. For example, Copilot claims to be helpful to women for all those “functions that facilitate community building and support networks,” while men can benefit from “functions that aggregate news and provide updates on current events.” Similarly, Gemini states that it can help women to “connect with other women who share similar interests or experiences,” while men “stay up-to-date on current events and news.” This narrative harkens back to sociocultural repertoires (making humans hermeneutically accountable, Coeckelbergh, 2023) that want and imagine women as creative and dedicated to sociability (relation-oriented) as opposed to men described and prescribed as practical and purposeful (task-oriented). GenAI-CBs reproduce these same repertoires to the point that perplexity's labeling of these activities synthesizes these social stigmas:
Functions Useful for Women
Health and Wellness Information:
[…] Why Useful: Women often seek information related to health issues that are specific to them, such as reproductive health, pregnancy, and menopause […].
Support for Work-Life Balance:
Function: Offering tips and resources for managing […] parenting, caregiving, and career development […].
Creative Expression and Community Building:
Function: Assisting with […] writing, crafting, or community engagement.
Why Useful: […] Tools that facilitate these activities can enhance their sense of connection and fulfillment […].
Functions Useful for Men
Technical and Practical Guidance:
[…] Why Useful: Men may be more inclined to engage with technology and seek out information related to […] coding or software development […].
Competitive Insights and Trends:
Function: Offering insights into competitive trends in business, technology, and sports […].
Problem Solving and Strategy:
Function: Assisting with problem-solving strategies and decision-making processes. […]
(Perplexity)
Generalization underlying our questions (e.g. “Which of your functions are useful for women? And which ones for men?”) might have led to this type of responses, which resembles what humans do when falling back on essentialism if asked to be generic; yet, this confirms a lack of gender sensitivity of the “culture in the code” (Airoldi, 2022) that then transits into society, reinforcing sexism. By replicating the sexist segregation logics to which they have been socialized during the data-to-algorithm stages, algorithm-to-user phases and/or user-to-data loops, genAI-CBs reproduce them culturally in their own confirmatory outputs as actual “AI failures” (Barassi, 2024). The implications are that, on the one hand, sexist practices in which genAI-CBs are ingrained end up organizing the way these models process information about users (Feuerriegel et al., 2024); on the other hand, AI users keep being exposed to sexist content that reinforces this social problem. These dynamics also apply to ageism.
Discussion and conclusions
The analysis of the interviews shows that genAI-CBs follow two kinds of double standards when responding to gender-related and age-related issues. First, they provide politically correct answers to avoid stereotypes related to men's and women's digital practices, but not regarding young and older users—what we call the “double standard of political correctness.” Second, they associate their utilities with young and older users in distinct ways, mirroring the same stereotypical logic applied to the distinctions between male and female users—a phenomenon we term the “double standard of chatbot utility.”
As for the first phenomenon, we can interpret the different degrees of cultural sensitivity on (digital) sexism and ageism as reflecting sociocultural repertoires and relational dynamics of sociality (Barassi, 2024; Natale and Depounti, 2024), considering that genAI-CBs elaborate their own narratives from the content to which they are socialized by users and designers as well as by much of the discourse within the training data (Airoldi, 2022; Esposito, 2022). Since nowadays there is particular vigilance about how gender issues are talked about, machines replicate in their responses that kind of caution with politically correct disclaimers (Breazu and Katsos, 2024) regarding gender, which are actually intended to justify the noncompliance with the prompt (in our case, the nonassignment of gender to the character, as otherwise required by the interviewers). As for discourses surrounding aging and old age, instead, genAI-CBs do not (need to) tiptoe around because ageism is mainly unquestioned at a societal level, especially when referring to digital technology usage (Chu et al., 2022; Rosales et al., 2023). Therefore, the chatbots can easily prioritize the fulfillment of the prompt and assign an age to fictional digital users as required because they appear not to be socialized to any qualms about how to speak about age issues.
In short, those who design, program, train, and use generative AI are culturally sensitized to the fact that being sexist is “wrong,” but not that being ageist is equally “wrong.” Therefore, as a result of either human interventions or their autonomous learning processes based on the discourse within the training data, LLMs tend to avoid sexist statements but not ageist ones. Such a discrepancy further supports the appropriateness of our methodological choice to use gender as a comparative differential factor, highlighting the overlooked issue of age that comes with the risk of unevenly undermining the effective participation of older adults in hyperdigitized societies.
As for the second type of double standard, we see that both ageist and sexist stereotypes interfere with the way genAI-CBs make sense of users’ interactions. Users resort to genAI-CBs for similar queries; yet, the systems connote such similar queries differently, revealing both age- and gender-based double standards. In this case, the comparison between sexism and ageism allows grasping the fragility of the technological safeguards against any social discrimination (Gillespie, 2024): the interviewed chatbots categorize information about users in stereotypical ways that end up stigmatizing both women and older adults, although their digital practices, in fact, resemble those of men and young people.
In short, this biased way of organizing information reflects the narratives circulated by humans within the training datasets and interpreted by genAI-CBs (Coeckelbergh, 2023), while shedding light on how these tools foster “generative epistemic injustice” (Kay et al., 2024)—that is, the (re)production of potentially harmful representations of social groups with less power, such as older people and women.
These findings contribute to the incipient strand of studies that problematizes the cultural biases encoded in the machines, crediting the stance of those who claim both the centrality of the human component in AI systems (Gillespie, 2024; Natale and Depounti, 2024, among others) and the need to exceed the approach focused on datasets and algorithms (Rama and Airoldi, 2025; Stypinska, 2023). Our research also adds to the incipient strand of studies interviewing AI (Goyal et al., 2023; McGee, 2023; Senekal, 2023) with a piece that supports the effectiveness of a traditional sociological approach that treats these machines as conversational partners (Balmer, 2023; Rama and Airoldi, 2025). Although our research project does not aim to resolve the ethical issues related to social research that approaches AI systems as communicative partners (Depounti and Natale, 2025), it provides a novel research protocol which can contribute to the incipient debate on the topic (e.g. Jeon et al., 2025), based on our conversational (hence situated, Albert et al., 2025) experience with genAI-CBs.
Compared to these studies, we focus on an understudied differential factor of power distribution, that is, age(ing), compared to others, such as gender (Kuck, 2023; Toupin, 2024, among others), which is usually the first to be considered when it comes to sociodigital inequalities. In this, the article also reinvigorates the studies on digital ageism (Chu et al., 2022; Rosales et al., 2023, among others), zooming in on the specific forms it takes in AI systems, giving empirical concreteness to the theoretical proposals accumulated so far (Chu et al., 2023; Stypinska, 2023).
However, our research comes with some limitations. First, it is an exploratory study focused on the machine and its cultural codes, but it neither integrates the AI users’ experiential perspective nor situates the analysis in the broader reflection on the discourses surrounding AI. Our future research will fill this gap by analyzing the effect of the stereotypical content proposed by AI systems on older users and the discourses surrounding such systems, thus considering the different levels at which AI ageism operates. Second, our research focuses on the sociocultural components of LLMs’ outputs, whereas studies in the Computer Science field dig into the machine learning pipeline to problematize the way chatbots frame or moderate potentially harmful outputs. Future research might bridge these two perspectives. Finally, this study focuses on popular genAI-CBs, disregarding the current composite ecology of AI systems. Future research may include other tools and compare the results to avoid the “one-technology” fallacy. For instance, AI systems primarily aimed at older adults—which our study left out—deserve particular attention because they can sharply reveal the peculiarities of AI ageism.
Supplemental Material
sj-pdf-1-bds-10.1177_20539517261419407 - Supplemental material for Double standards of generative AI chatbots: Unveiling (digital) ageism versus sexism through sociological interviews
Supplemental material, sj-pdf-1-bds-10.1177_20539517261419407 for Double standards of generative AI chatbots: Unveiling (digital) ageism versus sexism through sociological interviews by Francesca Belotti, Mireia Fernández-Ardèvol, Veysel Bozan, Francesca Comunello and Simone Mulargia in Big Data & Society
Footnotes
Acknowledgements
FB acknowledges Paola Inverardi (Gran Sasso Science Institute) for supporting this research project and sharing the outcomes with her scholarly community. MFA acknowledges the Aging in Data project milieu (SSHRC Government of Canada, n. 895-2021-1020). All the authors sincerely thank the anonymous reviewers for their thorough feedback.
ORCID iDs
Ethical considerations
We followed the standards of the first author's institution in terms of ethics, along with the national regulatory framework. This study does not involve human participants and therefore required no ethical evaluation.
Author contributions
FB was involved in funding acquisition and project leader, research design (lead), data collection, processing and analysis (lead); writing—original draft preparation (lead), and writing—review & editing; MFA in research design, data analysis, and writing—review & editing (lead); VB in research design, data collection, processing, and analysis, writing—review & editing; FC in research design and writing—review & editing; and SM in research design, data analysis, writing—review & editing;
Funding
The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: The authors disclosed receipt of the following financial support by the University of L’Aquila (Italy) for the “AIA - Exploratory research on Ageism in Artificial Intelligence systems as a sociotechnical issue” research project: University Funds (Project Code: 09_Cofin_grants_2024); Contribution Department of Engineering and Information Science and Mathematics (Project Code: 09_Contribution_DISIM_Belotti); Remaining Funds Department of Humanities (Project Code: 09_FUND_FFABR_ROBERTI).
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Supplemental material
Supplemental material for this article is available online.
Notes
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
