- Library Home /
- Search Collections /
- Open Collections /
- Browse Collections /
- UBC Undergraduate Research /
- From Embeddings to Entities : A Comparative Analysis...
Open Collections
UBC Undergraduate Research
From Embeddings to Entities : A Comparative Analysis of RAG Architectures in Academic Domains Harjono, Karel Joshua
Abstract
Retrieval-Augmented Generation (RAG) systems are transforming how AI models access and utilize external knowledge, specifically in domainspecific applications such as education. Traditional RAG methods typically rely on vector store retrieval, which excels in semantic similarity but struggles with transparency and structured reasoning. This thesis explores an alternative approach, GraphRAG, which uses knowledge graphs to encode explicit relationships between entities from a given passage, potentially offering improved context relevance and explainability. Through a controlled evaluation involving curated datasets across seven academic disciplines and six question types, this thesis compares the performance, retrieval accuracy, and transparency of GraphRAG and vector-based RAG systems. Results show comparable performance across most metrics, with GraphRAG offering notable advantages in source traceability and structured retrieval. Additionally, this study introduces a domain-specific benchmark dataset to assess RAG systems in educational contexts. The findings highlight the value of structured retrieval in enhancing trust and interpretability in AIassisted learning environments and suggest directions for future research on evaluation methodologies and user interface improvements.
Item Metadata
Title |
From Embeddings to Entities : A Comparative Analysis of RAG Architectures in Academic Domains
|
Creator | |
Date Issued |
2025-04
|
Description |
Retrieval-Augmented Generation (RAG) systems are transforming how
AI models access and utilize external knowledge, specifically in domainspecific
applications such as education. Traditional RAG methods typically
rely on vector store retrieval, which excels in semantic similarity but struggles
with transparency and structured reasoning. This thesis explores an
alternative approach, GraphRAG, which uses knowledge graphs to encode
explicit relationships between entities from a given passage, potentially offering
improved context relevance and explainability. Through a controlled
evaluation involving curated datasets across seven academic disciplines and
six question types, this thesis compares the performance, retrieval accuracy,
and transparency of GraphRAG and vector-based RAG systems. Results
show comparable performance across most metrics, with GraphRAG
offering notable advantages in source traceability and structured retrieval.
Additionally, this study introduces a domain-specific benchmark dataset to
assess RAG systems in educational contexts. The findings highlight the
value of structured retrieval in enhancing trust and interpretability in AIassisted
learning environments and suggest directions for future research on
evaluation methodologies and user interface improvements.
|
Genre | |
Type | |
Language |
eng
|
Series | |
Date Available |
2025-05-12
|
Provider |
Vancouver : University of British Columbia Library
|
Rights |
Attribution-NonCommercial-NoDerivatives 4.0 International
|
DOI |
10.14288/1.0448869
|
URI | |
Affiliation | |
Peer Review Status |
Unreviewed
|
Scholarly Level |
Undergraduate
|
Rights URI | |
Aggregated Source Repository |
DSpace
|
Item Media
Item Citations and Data
Rights
Attribution-NonCommercial-NoDerivatives 4.0 International