A bottom-up framework for cross-cultural evaluation of GPT-4o’s social norm biases via implicit narrative invocation

UBC Theses and Dissertations

Featured Collection

UBC Theses and Dissertations

A bottom-up framework for cross-cultural evaluation of GPT-4o’s social norm biases via implicit narrative invocation Liu, Zhuozhuo

Abstract

Large Language Models (LLMs) have been demonstrated to align with the values of Western or North American cultures. Prior work predominantly showed this effect through leveraging surveys that directly ask – originally people and now also LLMs – about their values. However, it is not clear that these explicitly stated beliefs actually correspond to the slant that LLMs take in real tasks. To address that, we take a bottom-up approach, asking LLMs to recall cultural norms invoked by narratives from different cultures. We find that GPT-4o tends to generate norms that, while not necessarily incorrect, are significantly less culture-specific. In addition, while it avoids overtly generating stereotypes, the stereotypical representations of certain cultures are merely hidden rather than suppressed in the model, and such stereotypes can be easily recovered. Addressing these challenges is a crucial step towards developing LLMs that fairly serve their diverse user base.

Item Metadata

Title	A bottom-up framework for cross-cultural evaluation of GPT-4o’s social norm biases via implicit narrative invocation
Creator	Liu, Zhuozhuo
Supervisor	Shwartz, Vered
Publisher	University of British Columbia
Date Issued	2025
Description	Large Language Models (LLMs) have been demonstrated to align with the values of Western or North American cultures. Prior work predominantly showed this effect through leveraging surveys that directly ask – originally people and now also LLMs – about their values. However, it is not clear that these explicitly stated beliefs actually correspond to the slant that LLMs take in real tasks. To address that, we take a bottom-up approach, asking LLMs to recall cultural norms invoked by narratives from different cultures. We find that GPT-4o tends to generate norms that, while not necessarily incorrect, are significantly less culture-specific. In addition, while it avoids overtly generating stereotypes, the stereotypical representations of certain cultures are merely hidden rather than suppressed in the model, and such stereotypes can be easily recovered. Addressing these challenges is a crucial step towards developing LLMs that fairly serve their diverse user base.
Genre	Thesis/Dissertation
Type	Text
Language	eng
Date Available	2025-09-09
Provider	Vancouver : University of British Columbia Library
Rights	Attribution-NonCommercial-NoDerivatives 4.0 International
DOI	10.14288/1.0450081
URI	http://hdl.handle.net/2429/92274
Degree (Theses)	Master of Science - MSc
Program (Theses)	Computer Science
Affiliation	Science, Faculty of; Computer Science, Department of
Degree Grantor	University of British Columbia
Graduation Date	2025-11
Campus	UBCV
Scholarly Level	Graduate
Rights URI	http://creativecommons.org/licenses/by-nc-nd/4.0/
Aggregated Source Repository	DSpace

Open Collections

UBC Theses and Dissertations

UBC Theses and Dissertations

A bottom-up framework for cross-cultural evaluation of GPT-4o’s social norm biases via implicit narrative invocation Liu, Zhuozhuo

Abstract

Item Metadata

Item Media

Item Citations and Data

Rights