An intelligent class : the development of a novel context capturing method for the functional auto-classification of records

UBC Theses and Dissertations

Featured Collection

UBC Theses and Dissertations

An intelligent class : the development of a novel context capturing method for the functional auto-classification of records Payne, Nathaniel

Abstract

The need to accurately classify records is a core problem in many domains. Historically, the classification of records was done manually as records were received and then categorized. Unfortunately, due to a significant growth in the volume of records, the need for robust auto-classification methods that can effectively “read” and classify records, is high. Today, significant challenges remain to the development of effective auto-classification processes for records. This is because the records traditionally require functional classification based on context, not topic classification based on content. Functional classification traditionally has been a challenge for both humans and machines, with little research on how to effectively functionally classify a record. In order to move research forward, this thesis will address the challenges of both human and machine classification of records. Firstly, this thesis, will seek to evaluate the efficacy of human manual classifiers on a classification task, using knowledge from this process to articulate a process for automated functional classification that utilizes a record’s archival diplomatic context. Secondly, this thesis will compare the efficacy of manual versus machine (i.e., auto-classification) using a record set with over 500,000 records, using a novel auto-classification approach that leverages a record’s context, not just its content, to improve classification accuracy. As this thesis will discuss, there is significant variance between expert human (i.e., records managers) during the manual classification process, with statistically significant differences in their ability to accurately classify both administrative and operational records. Moreover, this thesis will demonstrate that an auto-classifier, when trained using key elements of context, can statistically outperform a group of expert human classifiers on a classification task.

Item Metadata

Title	An intelligent class : the development of a novel context capturing method for the functional auto-classification of records
Creator	Payne, Nathaniel
Supervisor	Lemieux, Victoria L., 1963-
Publisher	University of British Columbia
Date Issued	2024
Description	The need to accurately classify records is a core problem in many domains. Historically, the classification of records was done manually as records were received and then categorized. Unfortunately, due to a significant growth in the volume of records, the need for robust auto-classification methods that can effectively “read” and classify records, is high. Today, significant challenges remain to the development of effective auto-classification processes for records. This is because the records traditionally require functional classification based on context, not topic classification based on content. Functional classification traditionally has been a challenge for both humans and machines, with little research on how to effectively functionally classify a record. In order to move research forward, this thesis will address the challenges of both human and machine classification of records. Firstly, this thesis, will seek to evaluate the efficacy of human manual classifiers on a classification task, using knowledge from this process to articulate a process for automated functional classification that utilizes a record’s archival diplomatic context. Secondly, this thesis will compare the efficacy of manual versus machine (i.e., auto-classification) using a record set with over 500,000 records, using a novel auto-classification approach that leverages a record’s context, not just its content, to improve classification accuracy. As this thesis will discuss, there is significant variance between expert human (i.e., records managers) during the manual classification process, with statistically significant differences in their ability to accurately classify both administrative and operational records. Moreover, this thesis will demonstrate that an auto-classifier, when trained using key elements of context, can statistically outperform a group of expert human classifiers on a classification task.
Genre	Thesis/Dissertation
Type	Text
Language	eng
Date Available	2024-01-19
Provider	Vancouver : University of British Columbia Library
Rights	Attribution-NonCommercial-NoDerivatives 4.0 International
DOI	10.14288/1.0438751
URI	http://hdl.handle.net/2429/87326
Degree (Theses)	Doctor of Philosophy - PhD
Program (Theses)	Library, Archival and Information Studies
Affiliation	Arts, Faculty of; Information, School of
Degree Grantor	University of British Columbia
Graduation Date	2024-05
Campus	UBCV
Scholarly Level	Graduate
Rights URI	http://creativecommons.org/licenses/by-nc-nd/4.0/
Aggregated Source Repository	DSpace

Open Collections

UBC Theses and Dissertations

UBC Theses and Dissertations

An intelligent class : the development of a novel context capturing method for the functional auto-classification of records Payne, Nathaniel

Abstract

Item Metadata

Item Media

Item Citations and Data

Rights