- Library Home /
- Search Collections /
- Open Collections /
- Browse Collections /
- UBC Theses and Dissertations /
- High-dimensional perception with the double machine...
Open Collections
UBC Theses and Dissertations
UBC Theses and Dissertations
High-dimensional perception with the double machine learning lens model equation Li, Raymond
Abstract
Traditional models of perception are ill-equipped for the high-dimensional data, such as text embeddings, that are central to modern AI and psychological science. To address this, we introduce the Double Machine Learning Lens Model Equation (DML-LME), an integrated framework combining a high-dimensional lens model with a suite of interpretability techniques. We apply this framework to analyze how an AI perceives social class from 9,513 aspirational essays, comparing a 384-dimension embedding model (all-MiniLM-L6-v2) with a 4,096-dimension model (NV-Embed-v2). While both models achieved similar mediation of the AI’s judgment (64.7% vs. 68.6%), our analysis revealed that both failed to effectively predict the actual environmental criterion (e.g., R² = −.05). Crucially, our interpretability suite uncovered a systematic linguistic bias: essays with poor writing quality were 2.6 times more likely to receive a low social class rating from the AI than to actually originate from a low social class background. This bias was strong enough to override other valid cues, causing the AI to misjudge essays discussing traditional upper-class markers like equestrian activities when they were paired with grammatical errors. The DML-LME, combined with robust interpretability tools, thus enables researchers to not only quantify perception in high-dimensional settings but also to uncover the specific, and potentially discriminatory, heuristics that guide AI judgment.
Item Metadata
Title |
High-dimensional perception with the double machine learning lens model equation
|
Creator | |
Supervisor | |
Publisher |
University of British Columbia
|
Date Issued |
2025
|
Description |
Traditional models of perception are ill-equipped for the high-dimensional data, such as text embeddings, that are central to modern AI and psychological science. To address this, we introduce the Double Machine Learning Lens Model Equation (DML-LME), an integrated framework combining a high-dimensional lens model with a suite of interpretability techniques. We apply this framework to analyze how an AI perceives social class from 9,513 aspirational essays, comparing a 384-dimension embedding model (all-MiniLM-L6-v2) with a 4,096-dimension model (NV-Embed-v2). While both models achieved similar mediation of the AI’s judgment (64.7% vs. 68.6%), our analysis revealed that both failed to effectively predict the actual environmental criterion (e.g., R² = −.05). Crucially, our interpretability suite uncovered a systematic linguistic bias: essays with poor writing quality were 2.6 times more likely to receive a low social class rating from the AI than to actually originate from a low social class background. This bias was strong enough to override other valid cues, causing the AI to misjudge essays discussing traditional upper-class markers like equestrian activities when they were paired with grammatical errors. The DML-LME, combined with robust interpretability tools, thus enables researchers to not only quantify perception in high-dimensional settings but also to uncover the specific, and potentially discriminatory, heuristics that guide AI judgment.
|
Genre | |
Type | |
Language |
eng
|
Date Available |
2025-08-28
|
Provider |
Vancouver : University of British Columbia Library
|
Rights |
Attribution 4.0 International
|
DOI |
10.14288/1.0449943
|
URI | |
Degree (Theses) | |
Program (Theses) | |
Affiliation | |
Degree Grantor |
University of British Columbia
|
Graduation Date |
2025-11
|
Campus | |
Scholarly Level |
Graduate
|
Rights URI | |
Aggregated Source Repository |
DSpace
|
Item Media
Item Citations and Data
Rights
Attribution 4.0 International