- Library Home /
- Search Collections /
- Open Collections /
- Browse Collections /
- UBC Theses and Dissertations /
- A bottom-up framework for cross-cultural evaluation...
Open Collections
UBC Theses and Dissertations
UBC Theses and Dissertations
A bottom-up framework for cross-cultural evaluation of GPT-4o’s social norm biases via implicit narrative invocation Liu, Zhuozhuo
Abstract
Large Language Models (LLMs) have been demonstrated to align with the values of Western or North American cultures. Prior work predominantly showed this effect through leveraging surveys that directly ask – originally people and now also LLMs – about their values. However, it is not clear that these explicitly stated beliefs actually correspond to the slant that LLMs take in real tasks. To address that, we take a bottom-up approach, asking LLMs to recall cultural norms invoked by narratives from different cultures. We find that GPT-4o tends to generate norms that, while not necessarily incorrect, are significantly less culture-specific. In addition, while it avoids overtly generating stereotypes, the stereotypical representations of certain cultures are merely hidden rather than suppressed in the model, and such stereotypes can be easily recovered. Addressing these challenges is a crucial step towards developing LLMs that fairly serve their diverse user base.
Item Metadata
Title |
A bottom-up framework for cross-cultural evaluation of GPT-4o’s social norm biases via implicit narrative invocation
|
Creator | |
Supervisor | |
Publisher |
University of British Columbia
|
Date Issued |
2025
|
Description |
Large Language Models (LLMs) have been demonstrated to align with the values of Western or North American cultures. Prior work predominantly showed this effect through leveraging surveys that directly ask – originally people and now also LLMs – about their values. However, it is not clear that these explicitly stated beliefs actually correspond to the slant that LLMs take in real tasks. To address that, we take a bottom-up approach, asking LLMs to recall cultural norms invoked by narratives from different cultures. We find that GPT-4o tends to generate norms that, while not necessarily incorrect, are significantly less culture-specific. In addition, while it avoids overtly generating stereotypes, the stereotypical representations of certain cultures are merely hidden rather than suppressed in the model, and such stereotypes can be easily recovered. Addressing these challenges is a crucial step towards developing LLMs that fairly serve their diverse user base.
|
Genre | |
Type | |
Language |
eng
|
Date Available |
2025-09-09
|
Provider |
Vancouver : University of British Columbia Library
|
Rights |
Attribution-NonCommercial-NoDerivatives 4.0 International
|
DOI |
10.14288/1.0450081
|
URI | |
Degree (Theses) | |
Program (Theses) | |
Affiliation | |
Degree Grantor |
University of British Columbia
|
Graduation Date |
2025-11
|
Campus | |
Scholarly Level |
Graduate
|
Rights URI | |
Aggregated Source Repository |
DSpace
|
Item Media
Item Citations and Data
Rights
Attribution-NonCommercial-NoDerivatives 4.0 International