High-dimensional perception with the double machine learning lens model equation

UBC Theses and Dissertations

Featured Collection

UBC Theses and Dissertations

High-dimensional perception with the double machine learning lens model equation Li, Raymond

Abstract

Traditional models of perception are ill-equipped for the high-dimensional data, such as text embeddings, that are central to modern AI and psychological science. To address this, we introduce the Double Machine Learning Lens Model Equation (DML-LME), an integrated framework combining a high-dimensional lens model with a suite of interpretability techniques. We apply this framework to analyze how an AI perceives social class from 9,513 aspirational essays, comparing a 384-dimension embedding model (all-MiniLM-L6-v2) with a 4,096-dimension model (NV-Embed-v2). While both models achieved similar mediation of the AI’s judgment (64.7% vs. 68.6%), our analysis revealed that both failed to effectively predict the actual environmental criterion (e.g., R² = −.05). Crucially, our interpretability suite uncovered a systematic linguistic bias: essays with poor writing quality were 2.6 times more likely to receive a low social class rating from the AI than to actually originate from a low social class background. This bias was strong enough to override other valid cues, causing the AI to misjudge essays discussing traditional upper-class markers like equestrian activities when they were paired with grammatical errors. The DML-LME, combined with robust interpretability tools, thus enables researchers to not only quantify perception in high-dimensional settings but also to uncover the specific, and potentially discriminatory, heuristics that guide AI judgment.

Item Metadata

Title	High-dimensional perception with the double machine learning lens model equation
Creator	Li, Raymond
Supervisor	Biesanz, Jeremy C.
Publisher	University of British Columbia
Date Issued	2025
Description	Traditional models of perception are ill-equipped for the high-dimensional data, such as text embeddings, that are central to modern AI and psychological science. To address this, we introduce the Double Machine Learning Lens Model Equation (DML-LME), an integrated framework combining a high-dimensional lens model with a suite of interpretability techniques. We apply this framework to analyze how an AI perceives social class from 9,513 aspirational essays, comparing a 384-dimension embedding model (all-MiniLM-L6-v2) with a 4,096-dimension model (NV-Embed-v2). While both models achieved similar mediation of the AI’s judgment (64.7% vs. 68.6%), our analysis revealed that both failed to effectively predict the actual environmental criterion (e.g., R² = −.05). Crucially, our interpretability suite uncovered a systematic linguistic bias: essays with poor writing quality were 2.6 times more likely to receive a low social class rating from the AI than to actually originate from a low social class background. This bias was strong enough to override other valid cues, causing the AI to misjudge essays discussing traditional upper-class markers like equestrian activities when they were paired with grammatical errors. The DML-LME, combined with robust interpretability tools, thus enables researchers to not only quantify perception in high-dimensional settings but also to uncover the specific, and potentially discriminatory, heuristics that guide AI judgment.
Genre	Thesis/Dissertation
Type	Text
Language	eng
Date Available	2025-08-28
Provider	Vancouver : University of British Columbia Library
Rights	Attribution 4.0 International
DOI	10.14288/1.0449943
URI	http://hdl.handle.net/2429/92146
Degree (Theses)	Master of Arts - MA
Program (Theses)	Psychology
Affiliation	Arts, Faculty of; Psychology, Department of
Degree Grantor	University of British Columbia
Graduation Date	2025-11
Campus	UBCV
Scholarly Level	Graduate
Rights URI	http://creativecommons.org/licenses/by/4.0/
Aggregated Source Repository	DSpace

Open Collections

UBC Theses and Dissertations

UBC Theses and Dissertations

High-dimensional perception with the double machine learning lens model equation Li, Raymond

Abstract

Item Metadata

Item Media

Item Citations and Data

Rights