Open Collections

UBC Theses and Dissertations

Grammaticus ex machina: Tone inventories as hypothesized by machine. Fry, Michael David, 2020.



Full Text

Grammaticus ex machina: Tone inventories as hypothesized by machine

by Michael David Fry

A thesis submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in the Faculty of Graduate and Postdoctoral Studies (Linguistics)

The University of British Columbia (Vancouver)

April 2020

© Michael David Fry, 2020

The following individuals certify that they have read, and recommend to the Faculty of Graduate and Postdoctoral Studies for acceptance, the thesis entitled:

Grammaticus ex machina: Tone inventories as hypothesized by machine

submitted by Michael David Fry in partial fulfillment of the requirements for the degree of Doctor of Philosophy in Linguistics.

Examining Committee:

Molly Babel, Linguistics (Supervisor)
Douglas Pulleyblank, Linguistics (Supervisory Committee Member)
Paul Tupper, Simon Fraser University, Mathematics (Supervisory Committee Member)
Valter Ciocca, Audiology & Speech Sciences (University Examiner)
Gunnar Hansson, Linguistics (University Examiner)

Abstract

A fundamental task of linguistics is to accurately describe the sound patterns of a language. In the field of phonology, this often starts with identifying the set of contrastive sounds in the language, its phoneme inventory. If the language under investigation is a tone language, then identifying the contrastive tones in the language, its tone inventory, is also needed. Historically, phonologists have identified phoneme and tone inventories through lengthy elicitation sessions in order to determine contrasting units. Yet, given the recent advances in machine learning, there may be another way. In this thesis, I argue, by way of demonstration, that machine learning has become a valuable tool for field and theoretical linguists in the description of language and in the development of linguistic theory. Specifically, I present empirical support, using machine learning methods, for the theory of Emergent Phonology, which holds that phonology emerges as the "consequence of accumulated phonetic experience" (Lindblom, 1999, p. 195). This support comes in the form of hypothesized tone inventories (part of one's phonology) that emerge, via an unsupervised learning model, from acoustic-phonetic data for a given language. Since the hypothesized inventories match fairly well with the tone inventories standardly reported in the literature, an aspect of phonology is shown to have emerged from phonetics, and support for Emergent Phonology is achieved. To test the robustness of the unsupervised learning method, it is applied to four languages: Mandarin, Cantonese, Fungwa and English. Finally, since the identification of tone inventories has hitherto been under the purview of human linguists, success in this project provides a first step towards creating a grammaticus ex machina: a linguist (grammarian) from the machine.

Lay Summary

The primary goal of this thesis is to demonstrate that machine learning is a valuable tool that can be used by linguists to analyze language. To achieve this, machine learning is used to generate tone inventories for several languages. Tones are distinctive pitch patterns in a language that change the lexical or grammatical meaning of a word. For example, in the tone language of Mandarin, /ma/ produced with a high-flat pitch means 'mother' and /ma/ produced with a falling-then-rising pitch means 'horse.' A tone inventory lists all distinctive pitch patterns that occur in a language.
In the past, it has been the job of human linguists to determine the tone inventory for a language; this thesis, however, considers how machine learning may be used to automate the process.

Preface

This dissertation is original work by the author. I wrote all chapters and computer code used in this project. None of this work has been published in written form, but aspects of it have been presented at the Acoustical Society of America conference held in Victoria, BC in November 2018 (Fry, 2018). The idea for this thesis is primarily my own, but early conversations with the late Dr. Eric Vatikiotis-Bateson and, subsequently, with my committee helped refine the project. As the work herein is entirely computational, no ethics approval was required.

Table of Contents

Abstract
Lay Summary
Preface
Table of Contents
List of Tables
List of Figures
Acknowledgments
Dedication
1 Introduction
  1.1 Unpacking the introduction
    1.1.1 Machine learning as a tool for linguists
    1.1.2 Bridging computational and theoretical linguistics
      1.1.2.1 A word on biases and machine learning
    1.1.3 The intersection of language acquisition
  1.2 Outlining the project
    1.2.1 Operationalizing Phonological Units and Processes
    1.2.2 Operationalizing the unsupervised learning model
    1.2.3 Operationalizing acoustic parameters of digitized speech
  1.3 Summary of the project: Thesis statement
  1.4 Thesis Structure
2 Literature Review: Emergent phonology, tone, machine learning and the overlap
  2.1 The linguistics side: Emergent Phonology and tone
    2.1.1 Emergent Phonology
      2.1.1.1 Phonology
      2.1.1.2 Emergence
      2.1.1.3 Phonetic Experiences
      2.1.1.4 Emergent Phonology Summary
    2.1.2 Tone
      2.1.2.1 Phonetics of Tone
      2.1.2.2 Phonology of Tone
    2.1.3 The linguistic side: Summary statement
      2.1.3.1 An important caveat
  2.2 The computational side: Machine learning
    2.2.1 Supervised Learning
      2.2.1.1 A brief overview of neural networks
      2.2.1.2 Supervised learning in neural networks
      2.2.1.3 The rise of deep learning
    2.2.2 Unsupervised Learning
      2.2.2.1 Autoencoders
      2.2.2.2 Hierarchical clustering
    2.2.3 The computational side: Summary
  2.3 The overlap: Combining computation and linguistics
3 Methodology: Implementation and Explication
  3.1 Method Summary
  3.2 Data Preprocessing
    3.2.1 Syllable Demarcation
      3.2.1.1 A word on syllable-frames
    3.2.2 Acoustic Parameters
      3.2.2.1 Implementation
  3.3 Dimensionality Reduction
    3.3.1 Training an adversarial autoencoder
    3.3.2 Model Parameters
    3.3.3 Evaluating training
    3.3.4 Reducing the dimensionality of acoustic parameters
  3.4 Clustering
    3.4.1 Evaluation Metrics
    3.4.2 Visualizing Results
  3.5 Chapter Summary
4 Case Studies
  4.1 Case Study I: Mandarin
    4.1.1 Mandarin Tones
    4.1.2 Motivation for inclusion
    4.1.3 Corpus Data
    4.1.4 Data Preprocessing
    4.1.5 Results
      4.1.5.1 Adversarial Autoencoder Performance
      4.1.5.2 Hypothesized Tone Inventories
      4.1.5.3 Cluster Evaluation
    4.1.6 Discussion
  4.2 Case Study II: Cantonese
    4.2.1 Cantonese Tones
      4.2.1.1 Cantonese Tone Mergers
    4.2.2 Motivation for inclusion
    4.2.3 Corpus Data
      4.2.3.1 Data Preprocessing
    4.2.4 Results
      4.2.4.1 Adversarial Autoencoder Performance
      4.2.4.2 Hypothesized Tone Inventories
      4.2.4.3 Cluster Evaluation
    4.2.5 Discussion
  4.3 Case Study III: Fungwa
    4.3.1 Motivation for inclusion
    4.3.2 Corpus Data
      4.3.2.1 Data preprocessing
    4.3.3 Results
      4.3.3.1 Adversarial Autoencoder Performance
      4.3.3.2 Cluster Evaluation
    4.3.4 Discussion
  4.4 Case Study IV: English
    4.4.1 Motivation for Inclusion
    4.4.2 Corpus Data
      4.4.2.1 Data Preprocessing
    4.4.3 TIMIT Results
      4.4.3.1 Adversarial Autoencoder Performance
      4.4.3.2 Hypothesized Tone Inventories
      4.4.3.3 Cluster Evaluation
    4.4.4 Buckeye Results
      4.4.4.1 Adversarial Autoencoder Performance
      4.4.4.2 Hypothesized Tone Inventories
      4.4.4.3 Cluster Evaluation
    4.4.5 Discussion
  4.5 Cross-Language Comparison
5 General Discussion, Future Directions and Conclusion
  5.1 Summary of the project and results
  5.2 Presently available uses of the method
    5.2.1 Allotones in Mandarin
    5.2.2 Speaker differences in Cantonese
  5.3 Future applications of the method in phonological research
    5.3.1 The emergence of phonological patterns
    5.3.2 Comparison to language acquisition
    5.3.3 Language Typology
  5.4 Refinements for the method
    5.4.1 Incorporating additional acoustic-phonetic features
    5.4.2 New clustering evaluation metrics to discern optimal clustering
    5.4.3 Achieving a fully unsupervised method
  5.5 Conclusion
Bibliography
A Supplementary Figures
  A.1 Distribution of groundtruth labels in clusters (hypothesized tones) identified using the method for Mandarin
  A.2 Hypothesized tones for Mandarin by clustering latent codes from a vanilla autoencoder
  A.3 Hypothesized tones for Mandarin by clustering acoustic-parameters without an autoencoder

List of Tables

Table 1.1: Mandarin Tones, adapted from (Xu, 1997).
Table 2.1: The questions outlining the exposition of this chapter.
Table 2.2: Mandarin Tones, adapted from (Xu, 1997).
Table 2.3: Mandarin Tones, adapted from (Xu, 1997).
Table 3.1: Method Summary.
Table 4.1: Mandarin tones, adapted from (Xu, 1997).
Table 4.2: Cantonese tones, adapted from (Lam et al., 2016).
Table 4.3: A comparison of the optimal number of tones for a language as determined by the method with that standardly reported in the literature.

List of Figures

Figure 1.1: A visualization of feature extraction from (Unni, 2018). The columns correspond to categories; the rows (from bottom up) correspond to increasingly higher-level features for the categories.
Figure 1.2: An example of clustering in two dimensions with idealised data.
Figure 2.1: A zoomed-in waveform of a sustained /i/. With 6 quasi-periodic cycles present in 0.043 s, this utterance has an approximate f0 of 140 Hz.
Figure 2.2: F0 contours of a high-level and a rising tone exemplar of Mandarin, produced by a native male talker of Mandarin. Tones were produced as the first syllable of a two-syllable utterance.
Figure 2.3: F0 contours overlaid on a single graph. The left-hand image overlays f0 contours for all tones produced by a single talker in the Mandarin Chinese Phonetic Segmentation and Tone Corpus (Yuan et al., 2015); the right-hand image overlays f0 contours for only the rising tone for the same talker.
Figure 2.4: An autosegmental example of a tone associating with a phoneme.
Figure 2.5: Phonological structure for the English word "level".
Figure 2.6: An image of a neuron and its dendrites and axon.
Figure 2.7: A simple ANN.
Figure 2.8: Example MNIST images.
Figure 2.9: A simple neural network design for learning MNIST digits.
Figure 2.10: A simple autoencoder design with labels for the encoding portion (to the latent code) and the decoding portion (from a latent code to reconstruction).
Figure 2.11: A dendrogram of linked data points/clusters. Each arch corresponds to the distance needed to connect two points or clusters. These data are from the Fungwa case study in §4.3.
Figure 3.1: MFA output for Cantonese. The result is a Praat TextGrid (Boersma et al., 2002) that has syllables and sound segments aligned to audio.
Figure 3.2: A visualization of the Adversarial Autoencoder model generated using Tensorboard (Mané et al., 2015). The autoencoder, like that described in §2.2.2.1, is outlined in blue.
Figure 3.3: Adversarial Autoencoder loss functions.
Figure 3.4: A dendrogram of linked data points/clusters. The upper image presents just the dendrogram. The lower image presents the same dendrogram with the longest distance cut, suggesting the optimal clustering for these data is four.
Figure 3.5: Hypothesized Tones.
Figure 4.1: Mean f0 contours of the four contrastive Mandarin tones and the neutral tone. These contours were derived from the corpus of this case study using the ground-truth tone labels.
Figure 4.2: Adversarial autoencoder convergence for the Mandarin corpus data, as shown in the reduction of reconstruction error on the test set.
Figure 4.3: Reconstructed Mandarin tones represented as normalized f0 contours. The top row shows ground-truth exemplars; the bottom row shows corresponding reconstructions.
Figure 4.4: Hypothesized tones as generated by the method for Mandarin, visualized as f0 contours. Each pane corresponds to the set of hypothesized tones for a preset number of tones. Error bars represent variability around the median f0 values.
Figure 4.5: Visualization of the variability of each tone cluster identified by the method for Mandarin for a preset number of tones. Each plotted line corresponds to an f0 contour within the identified cluster.
Figure 4.6: Hypothesized tones for Mandarin with an inventory comprising two tones (left) and three tones (right).
Figure 4.7: Hypothesized tones for Mandarin with an inventory comprising four tones (left) and five tones (right).
Figure 4.8: Hypothesized tones for Mandarin with an inventory comprising six through nine tones.
Figure 4.9: Dendrogram evaluation of Mandarin tone clusterings. By cutting the longest distance of the dendrogram, the optimal clustering comprises five tones.
Figure 4.10: Variance evaluations of Mandarin tone clusterings. The CH-Index (left) indicates the optimal number of clusters is two; the DB-Index (centre) indicates two; and the Silhouette Index (right) also indicates two.
Figure 4.11: A comparison of the standard analysis of Mandarin tones (left) with hypothesized tone inventories (generated by the method). The hypothesized inventories contain: (a) the same number of tones as is standardly reported for the language; (b) the optimal number of tones as determined by variance metrics; and (c) the optimal number of tones as determined by the dendrogram.
Figure 4.12: Mean f0 contours of the six contrastive Cantonese tones and the high-falling variant of Tone 1. These contours were derived from the corpus of this case study using the ground-truth tone labels. Note: the high-falling variant of Tone 1 was added manually (an average of raw exemplars manually extracted from the corpus) because it was not annotated in the corpus data.
Figure 4.13: Adversarial autoencoder convergence for the Cantonese corpus data, as shown in the reduction of reconstruction error on the test set.
Figure 4.14: Reconstructed Cantonese f0 contours. The top row presents ground-truth exemplars; the bottom row presents corresponding reconstructions.
Figure 4.15: Hypothesized tones as generated by the method for Cantonese, visualized as f0 contours. Each pane corresponds to the set of hypothesized tones for a preset number of tones. Error bars represent variability around the median f0 values.
Figure 4.16: Visualization of the variability of each tone cluster identified by the method for Cantonese for a preset number of tones. Each plotted line corresponds to an f0 contour within the identified cluster.
Figure 4.17: Hypothesized tones for Cantonese with an inventory comprising two tones (left) and three tones (right).
Figure 4.18: Hypothesized tones for Cantonese with an inventory comprising four tones (left) and five tones (right).
Figure 4.19: Hypothesized tones for Cantonese with an inventory comprising six tones (left) and seven tones (right).
Figure 4.20: Hypothesized tones for Cantonese with an inventory comprising eight through eleven tones.
Figure 4.21: Dendrogram evaluation of Cantonese tone clusterings. By cutting the longest distance of the dendrogram, the optimal clustering comprises ten tones.
Figure 4.22: Variance evaluations of Cantonese tone clusterings. The CH-Index (left) indicates the optimal number of clusters is nine (although five is quite close); the DB-Index (centre) indicates five; and the Silhouette Index (right) also indicates five.
Figure 4.23: A comparison of the standard analysis of Cantonese tones (left) with hypothesized tone inventories (generated by the method). The hypothesized inventories contain: (a) the same number of tones as is standardly reported for the language; (b) the optimal number of tones as determined by variance metrics; and (c) the optimal number of tones as determined by the dendrogram.
Figure 4.24: Hypothesized tones for Cantonese with an inventory comprising five tones (left) and six tones (right). The focus here is on the separation of the low-level tone in the five-tone analysis into a low-level and a low-falling tone in the six-tone analysis. This pattern mirrors the on-going tone merger of T4 and T6 in Cantonese.
Figure 4.25: Hypothesized tones for Cantonese with an inventory comprising six tones (left) and seven tones (right). The focus here is the separation of the low-rising tone in the six-tone analysis into a low-rising and a high-rising tone in the seven-tone analysis. This pattern mirrors the on-going tone merger of T2 and T5 in Cantonese.
Figure 4.26: F0 contours of the two tones in Fungwa. These contours are exemplars of high and low tones taken from a single male speaker in the corpus of this case study.
Figure 4.27: Adversarial autoencoder convergence for the Fungwa corpus data, as shown in the reduction of reconstruction error on the test set.
Figure 4.28: Reconstructed Fungwa f0 contours. The top row presents ground-truth exemplars; the bottom row presents corresponding reconstructions.
Figure 4.29: Hypothesized tones as generated by the method for Fungwa, visualized as f0 contours. Each pane corresponds to the set of hypothesized tones for a preset number of tones. Error bars represent variability around the median f0 values.
Figure 4.30: Visualization of the variability of each tone cluster identified by the method for Fungwa for a preset number of tones. Each line corresponds to an f0 contour within an identified cluster.
Figure 4.31: Hypothesized tones for Fungwa with an inventory comprising two tones (left) and three tones (right).
Figure 4.32: Hypothesized tones for Fungwa with an inventory comprising four through six tones.
Figure 4.33: Hypothesized tones for Fungwa with an inventory comprising seven through nine tones.
Figure 4.34: Dendrogram evaluation of Fungwa tone clusterings. By cutting the longest distance of the dendrogram, the optimal clustering comprises four tones.
Figure 4.35: Variance evaluations of Fungwa tone clusterings. The CH-Index (left) indicates the optimal number of clusters is two; the DB-Index (centre) indicates two; and the Silhouette Index (right) also indicates two.
Figure 4.36: A comparison of the standard analysis of Fungwa tones (left) with hypothesized tone inventories (generated by the method). The hypothesized inventories contain: (a) the same number of tones as is standardly reported for the language; (b) the optimal number of tones as determined by variance metrics; and (c) the optimal number of tones as determined by the dendrogram.
Figure 4.37: Adversarial autoencoder convergence for the TIMIT corpus (English) data, as shown in the reduction of reconstruction error on the test set.
Figure 4.38: Reconstructed English (TIMIT) f0 contours. The top row presents ground-truth exemplars; the bottom row presents corresponding reconstructions.
Figure 4.39: Hypothesized tones as generated by the method for English (TIMIT), visualized as f0 contours. Each pane corresponds to the set of hypothesized tones for a preset number of tones. Error bars represent variability around the median f0 values.
Figure 4.40: Visualization of the variability of each tone cluster identified by the method for English (TIMIT) for a preset number of tones. Each line corresponds to an f0 contour within an identified cluster.
Figure 4.41: Hypothesized tones for English (TIMIT) with an inventory comprising two tones (left) and three tones (right).
Figure 4.42: Hypothesized tones for English (TIMIT) with an inventory comprising four tones (left) and five tones (right).
Figure 4.43: Hypothesized tones for English (TIMIT) with an inventory comprising six through nine tones.
Figure 4.44: Dendrogram evaluation of English (TIMIT) clusterings. By cutting the longest distance of the dendrogram, the optimal number of clusters is five.
Figure 4.45: Variance evaluations of English (TIMIT) clusterings. The CH-Index (left) indicates the optimal number of clusters is two; the DB-Index (centre) indicates two; and the Silhouette Index (right) also indicates two.
Figure 4.46: Adversarial autoencoder convergence for the Buckeye corpus (English) data, as shown in the reduction of reconstruction error on the test set.
Figure 4.47: Reconstructed English (Buckeye) f0 contours. The top row presents ground-truth exemplars; the bottom row presents corresponding reconstructions.
Figure 4.48: Hypothesized tones as generated by the method for English (Buckeye), visualized as f0 contours. Each pane corresponds to the set of hypothesized tones for a preset number of tones. Error bars represent variability around the median f0 values.
Figure 4.49: Visualization of the variability of each tone cluster identified by the method for English (Buckeye) for a preset number of tones. Each line corresponds to an f0 contour within an identified cluster.
Figure 4.50: Hypothesized tones for English (Buckeye) with an inventory comprising two tones (left) and three tones (right).
Figure 4.51: Hypothesized tones for English (Buckeye) with an inventory comprising four tones (left) and five tones (right).
Figure 4.52: Hypothesized tones for English (Buckeye) with an inventory comprising six through nine tones.
Figure 4.53: Dendrogram evaluation of English (Buckeye) clusterings. By cutting the longest distance of the dendrogram, the optimal number of clusters is four.
Figure 4.54: Variance evaluations of English (Buckeye) clusterings. The CH-Index (left) indicates the optimal number of clusters is two; the DB-Index (centre) indicates two; and the Silhouette Index (right) also indicates two.
Figure 4.55: Hypothesized tones for English given the optimal clusterings identified by the evaluation metrics.
Figure 5.1: Hypothesized tones for Mandarin (six and seven tones) and the proportion of ground-truth tone labels that occur within the cluster corresponding to that tone. The green squares highlight tones that have a significant portion of ground-truth Tone 3s (>18%).
Figure 5.2: Hypothesized tones for Mandarin (eight and nine tones) and the proportion of ground-truth tone labels that occur within the cluster corresponding to that tone. The green squares highlight tones that have a significant portion of ground-truth Tone 3s (>18%).
Figure 5.3: Hypothesized tone inventories for four speakers of GuangZhou Cantonese. Each inventory comprises seven tones because GuangZhou Cantonese contains seven tones.
Figure 5.4: An observation of the structure seen in the f0 locations at which a tone begins and ends.
Figure A.1: Hypothesized tones for Mandarin (two through nine tones) and the proportion of ground-truth tone labels that occur within the cluster corresponding to that tone. These results were generated using the adversarial autoencoder described in the thesis.
Figure A.2: Hypothesized tones for Mandarin (two through nine tones) and the proportion of ground-truth tone labels that occur within the cluster corresponding to that tone. These results were generated using a vanilla autoencoder (in contrast to the adversarial autoencoder described in the thesis).
Figure A.3: Hypothesized tones for Mandarin (two through nine tones) and the proportion of ground-truth tone labels that occur within the cluster corresponding to that tone. These results were generated using only acoustic parameterization and no abstraction from an autoencoder.

Acknowledgments

To begin, I would like to acknowledge my late supervisor, Dr. Eric Vatikiotis-Bateson. It is hard for me to state the effect you have had on my life in a few lines, but it was substantial. Perhaps most importantly, you taught me to challenge preconceptions and you taught me how to write (or, as you say, how to tell the story). I will forever miss your contrarian disposition, and I am thankful to have enough ludicrous memories to fill a small anthology.

To my primary supervisor, Dr. Molly Babel, thank you for adopting me as your supervisee in Eric's stead. Your feedback, professionalism, general support and deadlines have been remarkable (and instrumental in getting me to the end of my thesis). Your consistent cheerfulness and natural curiosity rubbed off on me, which helped me to become a better scholar. Thank you for spending as much time as you did on me. To my supervisory committee member Dr. Doug Pulleyblank, thank you for the enlightening conversations and genuine enthusiasm. I hope this thesis will be a piece in the puzzle that demonstrates phonology is an emergent system. To my supervisory committee member Dr. Paul Tupper, thank you for your push-back to make sure I actually understood the models that I was implementing. Thanks for sharing your math wisdom.

To my fellow graduate students, thank you for the laughs and encouragement. I would particularly like to thank Samuel Akinbo for sharing the data from his Fungwa fieldwork.

To my mom, dad, and sisters, thank you for cheering me on and listening to my (lengthy) project descriptions. If I had not had the fortunate life that I did, I would not be here; I will forever be thankful to you all for the opportunities I have had. I would also like to thank my in-laws for their continued support and encouragement.

To Brandon Mak, Tae Yoon, James Raynor, Sarah Kerrigan, and Deckard Cain, thank you for distracting me when I needed a break. I have a lot of fond memories of hanging out together and I look forward to reuniting in the future.

Finally, to my wife Helen and son Arion, you two quite literally kept me alive and sane throughout these graduate years. You tolerated my late nights, encouraged my early mornings, commiserated with my frustrations and celebrated my successes. Thank you for your patience in these years; hopefully it will pay off for us in the future. I love you guys.

Dedication

To the impending A.I. Singularity: I hope this thesis amuses you.
Chapter 1

Introduction

One of the hallmarks of intelligence is the effective use of tools (Parker and Gibson, 1977). In this thesis, I argue that the recent advances in machine learning make it a valuable tool for field and theoretical linguists in the description of language and in the development of linguistic theory. In doing so, I aim to strengthen the impact of computational linguistics on other areas of linguistics, phonology in particular. My argument is instantiated in the domain of language acquisition (focusing on the theory of Emergent Phonology in particular) as it is a natural intersection for phonology, fieldwork and machine learning. I chain these three areas together by providing support for Emergent Phonology through the use of machine learning methods that may also be used to hypothesize sound inventories for undocumented languages. These hypothesized inventories are akin to formal descriptions derived by human linguists, providing a first step towards a grammaticus ex machina: a grammarian (or linguist) from the machine. The remainder of this chapter unpacks this dense introduction.

1.1 Unpacking the introduction

1.1.1 Machine learning as a tool for linguists

In the last decade, machine learning has revolutionized machine performance in areas such as image and facial recognition (Russakovsky et al., 2015), automatic speech recognition (Chiu et al., 2017; Toshniwal et al., 2018), speech synthesis (Van Den Oord et al., 2016; Taylor, 2009), and the playing of games (Silver et al., 2017; Vinyals et al., 2019). These advances are not trivial; for example, the state-of-the-art Go (the board game) automaton plays with human-like creativity, using novel strategies that have not been recorded in the 3000+ year history of the game. Indeed, Go champions are now studying with machine coaches to strengthen their playstyle (e.g. AlphaGo Teach). Presently, the increasingly human-like efficacy of machines enables researchers to offload work to those machines that previously required humans. A concrete example of this exists in the contrast between the army of trained undergraduate students needed to create the speech-sound-segmented TIMIT speech corpus (Garofolo, 1993) and the current method of forced alignment (e.g. McAuliffe et al., 2017). Forced alignment, if provided a pronunciation dictionary, generates, refines and applies speech-sound models to automatically demarcate sound segments in speech, and is now regularly used to generate segmented corpora such as the Mandarin Chinese Phonetic Segmentation and Tone corpus (Yuan et al., 2015).
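Concretely, aligners such as the Montreal Forced Aligner emit their demarcations as Praat TextGrids (as Figure 3.1 shows later for Cantonese). The snippet below is a minimal sketch of how little effort it then takes to consume such output; it assumes the third-party Python `textgrid` package, and the file name and tier name are hypothetical placeholders, not artifacts of this thesis.

```python
# A sketch only: assumes the third-party `textgrid` package
# (pip install textgrid); "utterance.TextGrid" and the tier name
# "syllables" are hypothetical placeholders.
import textgrid

tg = textgrid.TextGrid.fromFile("utterance.TextGrid")

for tier in tg.tiers:
    if tier.name != "syllables":
        continue
    for interval in tier:
        if not interval.mark:  # unlabeled stretches are silence/pauses
            continue
        # Each labeled interval is one aligned syllable: exactly the
        # demarcation a researcher once had to produce by hand.
        print(f"{interval.mark}\t{interval.minTime:.3f}\t{interval.maxTime:.3f}")
```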
It will come as no surprise that work offloaded to machines saves researchers time, effort, and money. However, the case for utilizing machine learning goes well beyond minimizing the bottom line. Machines do not suffer from human error (Senders and Moray, 1995): they may introduce systematic disfluencies while processing data, but those disfluencies are qualitatively different from human error in their systematicity and are often readily patched.[1] Further, expeditious processing and an impeccable memory grant machines a precision that goes beyond human ability, which may well facilitate the identification of patterns that are as yet undiscovered by human researchers. Machines quite literally circumvent the perceptual and temporal constraints of humans, enabling researchers to go beyond what would otherwise be possible.[2] Google's DeepMind research group summarizes this notion pointedly, stating simply that machines are not "constrained by the limits of human knowledge" (DeepMind, 2017, p. 1).

[1] There are on-going discussions of biases, which may reflect human biases, in the training data (Ngan et al., 2015; Furl et al., 2002); a brief discussion of biases and machine learning is provided in §1.1.2.1 below.
[2] This is not to say that humans have no advantages over machines, but such advantages are not presently of import.

The case for incorporating machine learning into linguistic research comes from within the field itself. Historically, it has been common for linguists to make broad generalizations based on introspection and intuition.[3] Side-stepping arguments from Universal Grammar or I-Language for now (cf. Den Dikken et al., 2007), one of the main reasons for this has been convenience (Phillips, 2009; Featherston, 2005). In the past, cost, access to subjects, and the intractable feat of manually sifting through available data (e.g. books, television broadcasts) made surveying a large population challenging, so relying on the introspection of a select few was sensible. A pointed commentary on this idea comes from Hopper in his seminal work on Emergent Grammar, in which he states that "[he] can only choose a tiny fraction of data to describe" (Hopper, 1987, p. 141). Today, however, we can survey thousands with Amazon Web Services (Buhrmester et al., 2011), process terabytes of data with Google's Cloud Computing Platform (Krishnan and Gonzalez, 2015), and even turn a profit from the data we collect (Nguyen, 2018). Accessible data and resources now make it easier than ever to test the robustness of a hypothesis on a large scale, and such tests are poised to become easier still as machine learning advances as a field.

[3] This has been particularly true in the linguistic subfield of syntax (cf. Phillips, 2009).

Although machine learning (1) has already done remarkable things, (2) saves researchers time and resources, (3) enables analyses of beyond-human precision, and (4) provides a way to consider more language data than previously possible, the historical collaborations between machine learning scientists and linguists leave one wanting. This is particularly true if one looks beyond the subfield of computational linguistics to linguistics as a whole. This thesis aims to contribute to what is sure to be a growing dialogue between these fields in the future.

1.1.2 Bridging computational and theoretical linguistics

Despite the impressive growth of computational linguistics as a field, its substantive contributions to the classical areas of linguistics (syntax, semantics, phonetics, and phonology) have been limited (Johnson et al., 2011; Steedman, 2011). In part, this is because computational linguists are primarily concerned with solving practical challenges (e.g. machine translation, speech recognition, sentiment analysis), and modern, strictly engineering solutions such as sequence-to-sequence models often outperform solutions that incorporate linguistic knowledge (Schuster et al., 2016; Chiu et al., 2017; Sutskever et al., 2014; Wang et al., 2017). One reason for this may be that many linguistic analyses and descriptions are based on a simplified problem space (such as reducing speech to discrete sound units, when in actuality the transitions between chunks are crucial (cf. Furui, 1986)).
In the end, there is often not much need for a computational linguist to collaborate with a theoretical linguist in the current research climate.

Notwithstanding this discussion, collaborations between computational and theoretical linguists have still occurred; however, such collaborations generally have information flowing in only one direction, from linguistics to computation. For example, the TIMIT speech corpus (Garofolo, 1993) was created using students trained in phonetics, and the corpus was invaluable in the development of early speech recognition systems. This pattern is paralleled in syntax with the Penn Treebank (Marcus et al., 1993), a corpus that has been at the heart of part-of-speech tagging and improvements in parsing. Even in more contemporary research, such as Kaskari et al. (2017), the desire to improve machine performance with linguistic knowledge is prevalent. In this thesis, however, I reverse the flow of information, from computation to linguistic theory. I argue that machine learning can and should be used by linguists who are asking theoretical questions. If nothing else, machine learning provides a new perspective from which to view language problems, and it is a perspective that avoids some of the "metacognitive overtones" (Edelman and Christiansen, 2003, p. 60) that come from human investigation. Linguists, regardless of what field they are in, approach each problem with the cognitive biases of their own perceptual and lexical systems. Categorical perception and the desire to categorize words as nouns or verbs are a few such examples. Machines, however, approach each problem with an (in principle) unbiased perspective. The impartiality of machines has the additional benefit of allowing researchers to use them to probe biases in human language learning (cf. Gagliardi and Lidz, 2014).

1.1.2.1 A word on biases and machine learning

While it is true that a machine has no preconceived notions of how to answer language problems, this does not mean that machine learning itself is free of biases. For example, there can be unintentional concomitants of random initialization, such that the optimal solution identified by a learning algorithm varies from run to run. There are also in-built biases and limitations in the training data used as input to the learning algorithm (cf. Ngan et al., 2015; Furl et al., 2002). One such bias comes from mismatches in the number of tokens of each data type; another comes from how the data are represented, a distinction made by Gagliardi and Lidz (2014) using the terms input and intake. Input refers to the "actual information present in the linguistic environment" (Gagliardi and Lidz, 2014, p. 4), while intake refers to "the information... utilized by the learning mechanism" (Gagliardi and Lidz, 2014, p. 4). This distinction is particularly relevant for the current project because I restrict the acoustic information to which my learning model has access (§3.2). Lastly, there can also be tendencies in learning algorithms themselves (regression, neural net, SVM), but such a discussion is well beyond the scope of this thesis.

1.1.3 The intersection of language acquisition

For my investigation, I have chosen the testing ground of language acquisition, the acquisition of speech sounds in particular.
Language acquisition is an appropriate choice because it has natural ties to machine learning, theoretical phonology and linguistic fieldwork.

Language acquisition encapsulates, among other things, the process of learning how to decompose continuous speech into meaningful sound chunks, a process that requires the learner to simultaneously identify what those chunks are in the first place (Jusczyk, 1995). This process parallels unsupervised learning, in the machine learning sense, nicely: unsupervised learning refers to learning without labels, much as a child does not have a priori sound-category labels when learning a language. Even though the mechanism by which a child learns a language is not fully known, it is clear that the discretization of continuous speech requires levels of abstraction, the concept whereby lower levels represent more detail and higher levels represent more abstract concepts (Colburn and Shute, 2007). In the area of language acquisition, this idea has been termed the "ladder of abstraction" (Munson et al., 2011, p. 291). The use of the word ladder here is intended to denote climbing up successive rungs that start at the detailed, low-level realization of raw speech (which exists as acoustic energy) and end at the abstract, high-level cognitive concept of a phonological category.[4] Thus, the ladder acts as a map from phonetics to phonology (cf. Pierrehumbert, 1990; Yu, 2011).

[4] The ladder is necessarily multi-directional: higher levels can have feedback on processing at lower levels.

The notion of levels (or the ladder) of abstraction parallels the high-level feature extraction performed in modern deep learning systems (e.g. Le, 2013). This is immediately evident when looking at the field of image recognition (or computer vision more broadly) in the last decade. Feature extraction is the process by which machine learning models learn increasingly useful features to improve performance on a given task. For example, edges and line orientation are fundamental concepts in image recognition, and it has been demonstrated that learning models extract similar features when performing image classification tasks. The data science post by Unni (2018) provides a good demonstration of this; in it, the author demonstrates how current convolutional neural networks extract features of variable detail. A reproduced graph from the article is shown in Figure 1.1. In Figure 1.1, the four columns correspond to four high-level categories: faces, cars, elephants, and chairs. The lowest row in the figure corresponds to low-level features learned by a convolutional neural network. These low-level features can be visualized and are shown to correspond primarily to lines of varying orientation. These lines can then be combined in various ways to generate intermediate-level features that begin to separate out the four categories under consideration. As is seen, through combining low-level features in different ways, higher-level features that match subparts of the broader categories are generated. In the case of faces, these intermediate features are things like eyes and eyebrows. Finally, the highest-level features (seen in the top row of the figure) correspond to the categories themselves.

Figure 1.1: A visualization of feature extraction from (Unni, 2018). The columns correspond to categories; the rows (from bottom up) correspond to increasingly higher-level features for the categories.

Figure 1.1 provides a clear visualization of a machine learning system learning high-level features from visual data.
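As a concrete sketch of the kind of architecture that induces such a feature hierarchy (this is not the model from Unni (2018); the layer sizes and the four-way output are illustrative assumptions), a stack of convolutional layers might look as follows, with early layers positioned to learn edge-like features and deeper layers positioned to learn increasingly category-like ones:

```python
# Illustrative only: a small convolutional classifier whose stacked
# layers mirror the low-, intermediate- and high-level features of
# Figure 1.1. All sizes are assumptions, not values from Unni (2018).
import tensorflow as tf

model = tf.keras.Sequential([
    tf.keras.Input(shape=(64, 64, 3)),                 # raw pixels
    tf.keras.layers.Conv2D(16, 3, activation="relu"),  # low level: edges, line orientations
    tf.keras.layers.MaxPooling2D(),
    tf.keras.layers.Conv2D(32, 3, activation="relu"),  # intermediate: parts (eyes, wheels)
    tf.keras.layers.MaxPooling2D(),
    tf.keras.layers.Conv2D(64, 3, activation="relu"),  # high level: category-sized patterns
    tf.keras.layers.GlobalAveragePooling2D(),
    tf.keras.layers.Dense(4, activation="softmax"),    # e.g. faces, cars, elephants, chairs
])
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy")
```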
It is therefore reasonable to expect a machine learning system that is trained on acoustic data to also learn high-level features of that data. This establishes the crude parallel between child language acquisition and machine learning that I have previously stated; specifically, that both children and machine learning systems learn levels of abstraction of the language they receive as input. In this thesis, I instantiate 'machine learning systems' as an unsupervised learning model known as an autoencoder (Kramer, 1991; Le, 2013; Makhzani et al., 2015). I make no claims that an autoencoder is (in substance) the same mechanism used by a child to learn language, but I maintain that levels of abstraction and feature extraction are processes that may achieve similar results.

With the link between language acquisition and machine learning established, I now address how language acquisition links to phonology and fieldwork. Phonology is a study of the "function, behaviour, and organization of sounds" (Lass, 1984, p. 1) in language. As language acquisition typically involves a child learning the sounds and patterns of sounds in their language (Jusczyk, 1995; Munson et al., 2011; Werker and Tees, 1984; Mielke, 2008; Maye et al., 2008; Pierrehumbert, 2003), phonology is a fundamental component of language acquisition.

In the previous section, it was stated that a child learns the mapping from a speech signal in the physical world to a concept associated with a lexical item (i.e. a word). In that statement, the role that most would argue is played by phonology in language processing was skipped over. The brain does not go directly from a signal to a word; first, the signal is abstracted into sound units (Pierrehumbert, 2003; Munson et al., 2011; Mielke, 2008). There are many levels of interconnected sound units assumed in phonology (more detailed discussions of phonology are provided in §2.1.1.1 and §2.1.2.2); an example of an oft-discussed unit is the segmental phoneme (henceforth referred to simply as a phoneme) (Eimas et al., 1987; Kuhl et al., 2005; Werker and Tees, 1984). A phoneme is a minimally contrastive sound chunk, such as the /k/, /æ/, and /t/[5] that make up the English word cat (they are contrastive because if one unit were switched to another, say /k/ to /b/, a new word would arise, i.e. bat). Infants attune to the phonemes of their native language around nine months of age (Werker and Tees, 1984) and are able to use the frequencies of phoneme transitions to identify word boundaries (Saffran et al., 1996); both of these results provide evidence of the crucial role played by phonology in the acquisition of language.

[5] Transcribed phonemes are in the standard International Phonetic Alphabet (Decker et al., 1999).

Finally, I establish the link from language acquisition to linguistic fieldwork. Learning the units of a language is not a task restricted to children; any adult who learns another language must also learn the units of that language (Ortega, 2014). This is often the initial goal of a field linguist, who must first identify the contrastive sounds of a language in order to transcribe it and subsequently describe syntactic, morphological and phonological patterns. There is indeed a large body of research, targeted at fieldworkers, on how to learn the sounds of a language. Of particular relevance to this thesis is a series of articles in the journal Language Documentation and Conservation that focus on tools and techniques to identify the tones of a language (Hyman, 2014; Coupe, 2014; Bird and Lee, 2014).
Bird and Lee (2014) in particular present a tool, called Toney, for identifying tones in early elicitation. Toney allows a researcher to listen to words elicited from a consultant and drag them into clusters of similar pitch melodies. If the entire process can be automated, which is something this thesis works towards, it will certainly find use among field researchers.

1.2 Outlining the project

The specific focus of this thesis takes root in one of the primary theoretical considerations of language acquisition: whether humans acquire language via general learning mechanisms or via an innate language-specific learning mechanism (Chomsky and Halle, 1968; Hopper, 1987). The most prominent rationale for positing a language-specific mechanism comes from the linguistic subfield of syntax, in the problem of the Poverty of the Stimulus (PoS) (Chomsky et al., 2006). PoS states that children do not encounter enough of their language to isolate all and only the important features of their language (i.e. the language stimulus is impoverished). In other words, the language a child hears is consistent with many possible grammatical systems, and there is no way for the child to derive only their target language. Consequently, the argument goes, a genetic endowment that is present in all children restricts the set of possible languages that can be learned (Chomsky, 2007). This a priori set of possible languages is known as Universal Grammar (see Hauser et al., 2002, for further explication). Antithetically, the theory of Emergent Grammar holds that language "structure... comes out of discourse and is shaped by discourse" (Hopper, 1987, p. 142); it does not postulate a priori language-specific restrictions on learning and concludes that language is acquired through general learning mechanisms.

In phonology, the PoS argument is less discussed because a learner is exposed to all the speech sounds (although not all acoustic variants) of their language.[6] That is, the speech signal is not impoverished in the same way that grammatical sentence exemplars would be in syntax.[7] Nonetheless, there is still an active discussion as to the nature of phonology: whether it is universal, emergent, or both (Kenstowicz, 1994; Lindblom, 1999; Mielke, 2008; Samuels, 2009; Archangeli and Pulleyblank, 2012; Dresher, 2015; Archangeli and Pulleyblank, 2015). Two topics of particular focus are the nature of phonological features (Mielke, 2008) and of constraints in a theory such as Optimality Theory (Smolensky and Prince, 1993). There is, however, mounting support in the literature for phonology to be thought of as an emergent system (see Archangeli and Pulleyblank, 2017), formalized in the theory of Emergent Phonology (Lindblom, 1999).

[6] In fact, there is a group of researchers, both in syntax and phonology, who disregard the PoS argument altogether. They, in turn, discuss the great extent of data exemplars learners encounter, termed "the richness of the stimulus" (Silverman, 2006, p. 5).
[7] Although this may not be true when considering phonological patterns at large. The patterns of phonology may also be consistent with a variety of phonological systems that are actually constrained by a pre-established Universal Grammar set (such as a universal constraint set in Optimality Theory (Smolensky and Prince, 1993)).

Emergent Phonology proposes that phonology emerges as the "consequence of accumulated phonetic experience" (Lindblom, 1999, p. 195). I take this statement to be empirically testable given the high-level abstractions that machine learning, deep learning in particular, now affords. In order to perform this investigation, the following questions need to be addressed: (1) what is phonology (i.e. what is the thing that emerges)? (2) what does it mean for phonology to emerge? and (3) what are phonetic experiences?
These questions are briefly answered here and are answered in more depth throughout Chapter 2.

With regard to (1), phonology can be thought of as encompassing (usually sound[8]) units and processes (or, perhaps, patterns). As stated, one oft-discussed unit is the phoneme (recall, e.g., the /k/, /æ/, and /t/ in cat). Phonological processes are interactions between (often adjacent) phonological units, such as the English plural of cat being /kæt-s/ due to the singular form ending in the voiceless obstruent /t/, and the plural of dog being /dɑg-z/ due to the singular form ending in the voiced obstruent /g/.

[8] Although, there is on-going research into the phonology of sign language (e.g. Brentari, 2019).

With regard to (2), phonology can be said to have emerged if phonological units and phonological processes arise without a priori information in the learning process (beyond general cognitive abilities). It is unclear what a priori information would be for a child learning language (or what it would look like in a human brain), but it may be interpretable computationally. A process can be said to arise without a priori information if the task performed is wholly unsupervised; that is, the system has no knowledge of what, or how many whats, it is trying to learn.[9]

[9] This statement is a simplification in that a priori information is not considered with respect to the input or the learning algorithm (see §1.1.2.1). This is reasonable for the current project in that Universal Grammar posits a universal set of possible languages that constrains what a grammar may be. By having no knowledge of what (or how many whats) arises in the learning process, we are not constraining the learned grammar as such.

With regard to (3), phonetic experiences can be thought of as the auditory perception of a speech signal. For a child, this perception is dependent on the physiology of the aural system, from the ear canal, through the tympanic membrane and connected bones, to the basilar membrane and into the auditory nerve; for a machine, it is simply a matter of mathematics. Computationally, phonetic experiences amount to a set of acoustic parameters derived from a digitized speech signal.

With these explications in place, the claim of Emergent Phonology (that phonology emerges from phonetics) can be evaluated by considering whether phonological units and phonological processes can arise, without a priori information, from solely processing acoustic parameters of digitized speech in an unsupervised learning model. A positive result would provide a significant piece of evidence to support Emergent Phonology.

I now move to the specifics of the project at hand by operationalizing the phonological units, the unsupervised learning model and the acoustic parameter set used herein. Admittedly, there are many ways in which these things could have been operationalized; the choices made here were an effort to be sensible, actionable and consequential. The following operationalizations are brief and are expanded in Chapters 2 and 3.

1.2.1 Operationalizing Phonological Units and Processes

I have elected to study the phonological units of lexical tones. Lexical tones are distinct, contrastive pitch patterns that enable lexical (word-meaning) contrasts. Yip (2002) defines tones plainly by stating that they are pitch patterns that "change the meaning of the word" (Yip, 2002, p. 1). The contrast among the Mandarin words glossed 'mother'/'hemp'/'horse'/'scold' is an oft-used example of lexical tone (Xu, 1997). Each word is realized with the same phoneme sequence (/ma/) but remains lexically distinct because it is realized with a distinct pitch pattern. This contrast is presented in Table 1.1.

Character             媽          麻          馬           罵
Pinyin                mā          má          mǎ           mà
Gloss                 'mother'    'hemp'      'horse'      'scold'
Pitch pattern (IPA)   ˥ (55)      ˧˥ (35)     ˨˩˦ (214)    ˥˩ (51)
Description           high        rising      fall-rise    falling

Table 1.1: Mandarin Tones, adapted from (Xu, 1997).

There are several reasons to prefer tones for this project over, say, the previously discussed phonological units of phonemes, the most pronounced being acoustic simplicity. Acoustically, to identify phonemes one must consider frequencies across the entire spectrum audible to humans (Johnson, 2004). Tones, in contrast, are largely identifiable from the single acoustic measurement of fundamental frequency (Rose, 1987), as fundamental frequency (f0) is the primary acoustic correlate of vocal fold vibration, which is itself the primary articulatory mechanism used to produce tones.[10] There is also additional redundancy between f0 and other acoustic measures such as amplitude and duration, but those are fairly simple acoustic measures as well (Whalen and Xu, 1992). Acoustic simplicity is an important factor when dealing with learning models because fewer acoustic parameters result in fewer machine calculations and less computational time. What is more, the simpler acoustic measures for tone provide robustness to noise; there are a variety of ways to check and recheck whether a calculated fundamental frequency is correct. Finally, f0, since it depends on vocal fold vibration, is limited by human physiology: an f0 signal is relatively smooth and occurs within a limited range of frequencies.

[10] Voice quality has also been shown to interact with tone (e.g. Yu and Lam, 2014), but it is not utilized in this thesis. A brief discussion of voice quality is provided in §5.4.
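To make the claim of acoustic simplicity concrete, the sketch below estimates f0 from a single voiced frame with nothing more than autocorrelation. It is a bare-bones illustration rather than the pitch tracker used in this thesis, and the 75-500 Hz search range is an assumed bound on speech f0; production trackers (e.g. Praat's) add voicing decisions, octave-error checks and smoothing.

```python
# A bare-bones autocorrelation f0 estimator (illustrative; not the
# thesis's actual pitch tracker). The 75-500 Hz range is an assumption.
import numpy as np

def estimate_f0(frame, sample_rate, fmin=75.0, fmax=500.0):
    """Estimate f0 (Hz) for one windowed frame of voiced speech."""
    frame = frame - frame.mean()                  # remove DC offset
    ac = np.correlate(frame, frame, mode="full")  # autocorrelation
    ac = ac[len(ac) // 2:]                        # non-negative lags only
    lo = int(sample_rate / fmax)                  # shortest candidate period
    hi = int(sample_rate / fmin)                  # longest candidate period
    best_lag = lo + int(np.argmax(ac[lo:hi]))     # strongest periodicity
    return sample_rate / best_lag

# A synthetic 140 Hz "vowel" frame, echoing the sustained /i/ of Figure 2.1.
sr = 16000
t = np.arange(int(0.043 * sr)) / sr
print(estimate_f0(np.sin(2 * np.pi * 140 * t), sr))  # ~140 Hz
```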
The contrast among the Mandarin words glossed as "mother," "hemp," "horse" and "scold" is an oft-used example of lexical tone (Xu, 1997). Each word is realized with the same phoneme sequence (/ma/) but remains lexically distinct because it is realized with a distinct pitch pattern. This contrast is presented in Table 1.1.

    Character            媽          麻         馬           罵
    Pinyin               mā          má         mǎ           mà
    Gloss                "mother"    "hemp"     "horse"      "scold"
    Pitch pattern (IPA)  ˥˥          ˧˥         ˨˩˦          ˥˩
    Description          high        rising     fall-rise    falling

Table 1.1: Mandarin tones, adapted from Xu (1997).

There are several reasons to prefer tones for this project over, say, the previously discussed phonological units of phonemes – the most pronounced being acoustic simplicity. Acoustically, to identify phonemes one must consider frequencies across the entire spectrum audible to humans (Johnson, 2004). Tones, in contrast, are largely identifiable from the single acoustic measurement of fundamental frequency (Rose, 1987), as fundamental frequency (f0) is the primary acoustic correlate of vocal fold vibration, which is itself the primary articulatory mechanism used to produce tones.[10] There is also additional redundancy between f0 and other acoustic measures such as amplitude and duration, but those are fairly simple acoustic measures as well (Whalen and Xu, 1992). Acoustic simplicity is an important factor when dealing with learning models because fewer acoustic parameters result in fewer machine calculations and less computational time. What is more, the simpler acoustic measures for tone provide robustness to noise; there are a variety of ways to check and recheck whether a calculated fundamental frequency is correct. Finally, f0, since it depends on vocal fold vibration, is limited by human physiology – an f0 signal is relatively smooth and occurs within a limited range of frequencies.

[10] Voice quality has also been shown to interact with tone (e.g. Yu and Lam, 2014), but it is not utilized in this thesis. A brief discussion of voice quality is provided in §5.4.

Going beyond acoustic simplicity, tones are also preferable to phonemes for this project because tone inventories are much smaller than phoneme inventories and tones are more readily demarcated in language corpora. Typologically, the most common tone inventory has just two tones, high and low (Maddieson, 1978). This contrasts favorably with the average phoneme inventory of some 29 phonemes (Maddieson, 2013a,c). Also, tones tend to occur over longer time-frames (e.g. syllables) than phonemes, which allows some flexibility when automating the demarcation of tones in a corpus via, e.g., forced alignment. Thus, there are compelling reasons to think that a learning model will more easily identify tones than, say, phonemes.

With tones selected as the phonological units under investigation, the most natural phonological process to investigate is tone sandhi. Tone sandhi is a process in which the phonetic realization of a tone changes depending on its surrounding tones. The 3-3 tone sandhi rule in Mandarin is one such example, in which the initial 3rd tone of a 3-3 tone sequence is phonetically realized as something similar to a 2nd tone (Duanmu, 2007); a toy rendering of the rule is sketched below. I do not currently aim to demonstrate the emergence of a phonological process in this thesis, but a discussion of how to extend this work to do so is provided in §5.3. Nonetheless, demonstrating the emergence of phonological units is a required precursor to demonstrating the emergence of phonological processes.
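To make the shape of such a process concrete, the following toy function applies the 3-3 rule to a sequence of Mandarin tone categories. It is purely illustrative – a sketch of the rule as stated above, not part of the thesis method – and it glosses over the intricacies of sandhi in longer sequences (Duanmu, 2007).

    def apply_33_sandhi(tones):
        """Toy Mandarin 3-3 sandhi: a 3rd tone immediately preceding
        another 3rd tone is realized as (something similar to) a 2nd tone."""
        out = list(tones)
        for i in range(len(out) - 1):
            if out[i] == 3 and out[i + 1] == 3:
                out[i] = 2  # phonetic realization shifts toward tone 2
        return out

    print(apply_33_sandhi([3, 3]))  # [2, 3], as in ni3 hao3 -> ni2 hao3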
1.2.2 Operationalizing the unsupervised learning model

For the unsupervised learning model, a combination of an Adversarial Autoencoder (Makhzani et al., 2015) and a hierarchical clustering algorithm (Johnson, 1967) is used. An in-depth discussion of the model is provided in Chapter 3, but a brief discussion is provided here.

Autoencoders learn a low-dimensional representation, called a latent code, from some higher-dimensional parameterization. Autoencoders are now prolific; they were at the forefront of what has become the current era of deep learning (Hinton and Salakhutdinov, 2006) and are applied to such ubiquitous tasks as image comparison and compression (Toderici et al., 2017) and facial recognition (Larsen et al., 2015). Leaving the details until later (§2.2.2.1), autoencoders are useful because of what they achieve – latent codes. A latent code is a lower-dimensional representation of some higher-dimensional data. In this discussion, lower- and higher-dimensionality refer to the length of the vector needed to represent the information in computer memory. For example, a 100x100 pixel image, when stored as a whole, requires 10^4 values in computer memory. However, given that there are statistical patterns in the image (e.g. similarity of adjacent pixels, repeated patterns, etc.), an algorithm can compress that image to far fewer values in computer memory[11] (i.e. a latent code). Beyond the superficial benefit of minimizing computational memory requirements, latent codes are also generally thought of as a higher-level abstraction of the original data[12] (Le, 2013). This combination, being both a higher-level abstraction and a representation in lower-dimensional space, makes latent codes suitable for clustering.

[11] The algorithm, to be useful, also needs to 'remember' how to reconstruct the initial image from that lower-dimensional representation. It is also the case that such reconstructions will often not be perfect (i.e. lossless).

[12] This nicely parallels the vocabulary used by Munson et al. (2011), who refer to levels of representation in their phonological ladder of abstraction as latent variables (Munson et al., 2011, p. 290).

Clustering is a computational method for identifying related groups in a dataset based on some metric of similarity or, inversely, distance (Jain, 2010). Figure 1.2 provides a simple visualization of identified clusters in a two-dimensional space. This graph is idealised because all points that are close together are part of the same cluster and there is a large distance between different clusters. In the wilderness of big data, no dataset is this idealised; there is virtually always overlap between clusters.[13]

[13] This is not to say, though, that choosing appropriate parameterizations/embedding dimensions cannot help in separating clusters more effectively.

Figure 1.2: An example of clustering in two dimensions with idealised data.
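As a minimal illustration of the idea behind Figure 1.2, the sketch below generates three idealised, well-separated clusters in two dimensions and recovers them with SciPy's hierarchical clustering routines (introduced more fully in §2.2.2.2). The data are synthetic, and the linkage method (Ward) is an assumption made only for this illustration, not necessarily the setting used in Chapter 3.

    import numpy as np
    from scipy.cluster.hierarchy import linkage, fcluster

    # Three well-separated 2-D clusters (50 points each), as in Figure 1.2.
    rng = np.random.default_rng(0)
    centers = [(0, 0), (4, 0), (0, 4)]
    points = np.vstack([rng.normal(c, 0.2, size=(50, 2)) for c in centers])

    Z = linkage(points, method="ward")               # build the dendrogram
    labels = fcluster(Z, t=3, criterion="maxclust")  # cut it into 3 clusters
    print(np.unique(labels))                         # [1 2 3]

With idealised data like this, every point receives the label of its true cluster; the interesting cases, taken up below, are what happens when clusters overlap and when the data live in many dimensions.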
Given that clustering algorithms group data points based on distance (Euclidean, Manhattan, etc.), they turn out to perform progressively worse in higher-dimensional spaces. This is because the significance of distance between data points becomes increasingly obfuscated in higher-dimensional space (Beyer et al., 1999): a value that is meaningful along one dimension may not be so along another. It is for this reason that the current project uses a combination of autoencoders and clustering. The latent codes learned by an autoencoder are low-dimensional representations; as such, they are more effectively clustered than the raw acoustic parameters of a speech signal (evidence of this is provided in Appendix A). Those familiar with this research area will immediately think of Principal Components Analysis or Linear Discriminant Analysis as alternatives to an autoencoder, but the non-linearities and multiple layers of feature extraction in autoencoders have recently been demonstrated to perform well with clustering (Nousi and Tefas, 2018). In combination, the autoencoder+clustering method is able to derive meaningful clusterings of acoustic parameterizations of speech in an entirely unsupervised manner. Further, the clusters in latent space can be reverse engineered to reconstruct acoustic parameterizations, allowing one to compare the acoustic patterns identified by this learning system to the acoustic patterns reported by, e.g., linguists for a given language.

1.2.3 Operationalizing acoustic parameters of digitized speech

As the phonological units under investigation in this project are lexical tones, I have elected to use fundamental frequency and tone-duration as the acoustic parameters for the current project. These measures were selected for their known efficacy in capturing important aspects of tone (Yu, 2011; Gauthier et al., 2007; Whalen and Xu, 1992); motivation for these choices is presented in §2.1.2.

For digitized speech, I have selected corpora for several tone languages – Mandarin, Cantonese and Fungwa – and I compare the unsupervised learning model's performance on these languages to its performance on English as a control, non-tone language (see §4.4 for discussion). The languages were selected because they have tone inventories that contrast in both the number of tones and the complexity (i.e. level, simple contour, complex contour) of tones (see §2.1.2.2 for discussion).

1.3 Summary of the project: Thesis statement

With the terms operationalized, we can now revisit the primary investigation of my work. I aim to demonstrate that machine learning is a useful analysis tool for theoretical linguists by providing support for the theory of Emergent Phonology, showing that the lexical tones of a language arise, without a priori information, from an unsupervised learning model trained on the acoustic parameters of fundamental frequency and tone-duration for that language. Additionally, if the tones that arise, or are hypothesized, match well with those derived by human linguists, a first step in creating a linguist from the machine (i.e. a 'grammaticus ex machina') will have been achieved.

1.4 Thesis Structure

Chapter 2 provides the background information needed to inform readers about tone and machine learning. Chapter 3 details the precise computational method used, including descriptions of adversarial autoencoders and hierarchical clustering. Chapter 4 presents the results of the method from Chapter 3 applied in case studies of four languages: Mandarin, Cantonese, Fungwa, and English. Finally, Chapter 5 demonstrates how the method may be used for other phonological investigations, and discusses future enhancements and applications.

Chapter 2

Literature Review: Emergent phonology, tone, machine learning and the overlap

The goal of this chapter is to make the contents and contribution of this thesis accessible to all readers.
The interdisciplinary nature of this project provides a valuable forum for sharing perspectives, but it also necessitates a review of relevant research to ensure the project is situated correctly in the literature. Referring back to the summary of this project in §1.3, this chapter is intended to answer the questions of Table 2.1. These questions can be grouped into two categories, linguistics and machine learning, and the chapter is organized along the same lines.

    Section  Subject           Question
    §2.1     Linguistics       What is Emergent Phonology?
                               What is lexical tone?
                               What are the acoustic parameters of tone?
    §2.2     Machine Learning  What is machine learning?
                               What is unsupervised learning?
    §2.3     Overlap           What can unsupervised learning tell us about
                               Emergent Phonology and tone systems?

Table 2.1: The questions outlining the exposition of this chapter.

The first section (§2.1) of this chapter considers the linguistic side of this investigation. Specifically, it outlines Emergent Phonology and provides an overview of tone in linguistics. By considering tone from phonological and phonetic perspectives, the groundwork is laid for evaluating the claim of Emergent Phonology that phonology emerges as the "consequence of accumulated phonetic experience" (Lindblom, 1999, p. 195). Next, §2.2 provides an overview of the theory behind, and implementations of, machine learning. It first introduces supervised learning and then extends the discussion to unsupervised learning models such as autoencoders and clustering. The final section (§2.3) addresses why unsupervised learning is a useful tool for supporting the theory of Emergent Phonology.

2.1 The linguistics side: Emergent Phonology and tone

2.1.1 Emergent Phonology

At its most rudimentary, Emergent Phonology holds that a speaker's phonology emerges from the language they hear (from their phonetic experiences) and their general cognitive abilities (Archangeli and Pulleyblank, 2017; Lindblom, 1999). This statement seems simple, yet when faced with the immensity of all that phonology encompasses, it is far from it. Indeed, the introduction to phonology provided in the next section barely scratches the surface of the field. Additionally, to properly consider Emergent Phonology, what emergence is and what phonetic experiences are need to be clearly established. This section aims to provide a sufficient introduction to the ideas of phonology, emergence and phonetic experience so that the complexity of the claim that phonology emerges can be framed properly. At the same time, excessive detail is avoided to keep the investigation from becoming opaque.

2.1.1.1 Phonology

Extending the quote from Chapter 1, phonology is the study of the "function, behaviour, and organization of sounds as linguistic items" (Lass, 1984, p. 1). The additional words 'as linguistic items' are crucial because they make it explicit that the sounds under consideration are explicitly concerned with language (i.e. linguistic) and are individual, discrete units (i.e. items). These qualities are typified in the previously mentioned phonological unit of the phoneme – a contrastive speech sound/segment. Phonemes are contrastive in that, if a phoneme in a word were changed, the result would be either a different word or a non-word. The concept of a phoneme is fairly intuitive and easily demonstrated, as in the English contrast between the words hiss and his. Hiss comprises three phonemes: /h/, /ɪ/, and /s/; his also comprises three phonemes: /h/, /ɪ/, /z/.
Appealing to the notion of contrastiveness, comparing hiss and his allows one to conclude that /s/ and /z/ are different phonemes in English.

Going deeper into phonological theory, the distinction between /s/ and /z/ can actually be reduced to one of voicing.[1] Voicing, or phonation, refers to vocal fold vibration throughout the articulation of a speech sound. The sound /s/ is voiceless (there is no vocal fold vibration) and /z/ is voiced. Consequently, the distinction between /s/ and /z/ can be described as /s/ being associated with some [-voice] property and /z/ being associated with some [+voice] property. This description is formalized in phonology in what is called Feature Theory (Jakobson et al., 1951). In Feature Theory, the [±voice] property is termed a voicing feature. Feature Theory posits that speech sounds, such as phonemes, can be thought of as bundles of features and, as such, features are often considered to be the basic units of phonology. To further demonstrate phonological features, two readily interpretable features of a speech sound are [±nasal] (for whether or not a sound is articulated with nasal airflow) and [±continuant] (for whether a sound is articulated with continuous airflow through the oral cavity). While an in-depth knowledge of Feature Theory is not crucial for this thesis, two pertinent points follow from this introduction: (1) there are multiple levels of phonological analysis (e.g. features, phonemes, syllables, etc.); and (2) the existence of features raises the question of whether they are innate or emergent (see Mielke, 2008, for evidence towards emergence). The former point has relevance for the phonology of tone discussed in §2.1.2.2; the latter point is a natural tie-in to Emergent Phonology, outlined in §2.1.1.2. The reader interested in Feature Theory is encouraged to follow up with other sources (Jakobson et al., 1951; Chomsky and Halle, 1968; Clements, 1985; Mielke, 2008; Samuels, 2009).

[1] Note that this discussion is specific to phonology. In phonetics, the difference between hiss and his is often durational (shorter duration for the 'voiced' segment or longer duration for the vowel preceding a 'voiced' segment).

In the terminology of the first chapter, phonemes and phonological features are examples of phonological units. In general, phonological units are thought of as abstractions; for example, the phoneme /p/ remains constant regardless of who utters it, which word a speaker is saying, or whether a speaker is whispering or yelling. In other words, phonological units remain invariant in the face of phonetic variability. There are also other phonological units, or phonological categories, such as tones and stress. Consider the English contrast between the verb per.MIT and the noun PER.mit (capitals denote stress; '.' denotes a syllable boundary); a fluent speaker of English may vary in how they stress one syllable or another (e.g. varying phonetic cues associated with stress such as duration, pitch and amplitude), but there is a categorical, invariant distinction as to which syllable is stressed, and that distinction has consequences for which word class an utterance of 'permit' belongs to (see Hayes, 1995, for an introduction to metrical stress). The categorical contrast of stress may be clearer still with the nouns desert and dessert in a phrase such as 'I like the dessert/desert'.
To borrow an eloquent description from Rose, phonological categories capture "the Accentual and Linguistic content of the acoustic stimulus [separated] from the components determined by the individual speaker" (Rose, 1987, p. 343).[2]

[2] The restriction of this phrase to phonological categories is of my own design.

In light of this discussion, Lass's definition of phonology can be reframed as the study of the "function, behaviour, and organization" of phonological categories. Thus, after a phonologist has identified the categories of a language (e.g. phonological features, phonemes, tones, etc.), their job becomes investigating how those categories pattern in the language. An example of such patterning was shared in Chapter 1, in which the plural of cat-s has a voiceless /s/ and the plural of dog-s has a voiced /z/. This thesis, however, stops short of patterns in phonology and focuses solely on the emergence of phonological categories. Nonetheless, there are an inordinate number of phonological patterns in the world's languages, and the interested reader is encouraged to learn about them in other sources (e.g. Hayes, 2011; van Oostendorp et al., 2011; Goldsmith et al., 1995).

2.1.1.2 Emergence

In linguistics, emergence refers to language "structure... com[ing] out of discourse and [being] shaped by discourse" (Hopper, 1987, p. 142). This definition conveys the two key aspects of emergence: (1) discourse itself gives rise to structures, and (2) discourse continually refines those structures. I interpret 'structure coming out of discourse' to mean that there are regularities in discourse that naturally chunk into structural units. This interpretation is consistent with that of other researchers, such as Leong and Goswami (2015), who state that "children may use acoustic spectro-temporal patterns in speech to derive phonological units" (Leong and Goswami, 2015, p. 1). Reframing the terminology to suit this thesis, 'discourse' refers to spoken communication, 'regularities' refers to repeating acoustic patterns, and 'structural units' refers to phonological categories. As I am primarily concerned with simulations, I do not speak to what such natural chunking means in human cognition; computationally, however, it may refer to the results of an unsupervised learning system.

As there are no labels in unsupervised learning (§2.2.2), there is no a priori knowledge of the number of categories to learn or the content of those categories. Thus, if an unsupervised learning system is able to discern the correct number of categories and the correct content of those categories (verifiable via some preestablished standard), we can state that those categories have emerged from the data processed by the unsupervised learning system. For example, an unsupervised learning model applied to spoken Mandarin could be evaluated by comparing how well its hypothesized tones match the standard analysis of tone in Mandarin Chinese.

2.1.1.3 Phonetic Experiences

The term 'phonetic experiences' is somewhat nebulous because (from a philosophical standpoint) each person's experiences are unknowable to every other person (Nagel, 1974). Thus, instead of referring to an experience, we will refer to aspects of the speech signal that phoneticians often investigate, namely acoustics and audition.
Acoustics considers how speech exists in the physical world; audition considers how speech is processed by the human auditory system (Johnson, 2004; Ladefoged and Johnson, 2014).

In the physical world, speech is simply sound: pressure fluctuations through a medium (the standard medium in human communication being air).[3] These pressure fluctuations, which originate as manipulations of air in a speaker's vocal tract, flow like waves through the air, and those waves can be measured by instruments such as a microphone, which transduces acoustical energy (i.e. pressure) into electrical energy that can be digitized and read into a computer. The pressure fluctuations of speech are not random. They occur at regular intervals of time, and those intervals have corresponding frequencies (a frequency is 1/interval). As such, speech contains many component frequencies, and those frequencies are what is analyzed by (acoustical) phoneticians. Thus, one interpretation of phonetic experiences is having knowledge of the component frequencies of a speech stimulus.[4]

[3] Note that this ignores the role visual or tactile information play in speech, as well as signed communication.

[4] To be precise, the knowledge would be of the frequency, its amplitude and its phase at a given point in (and then throughout) time.

The field of auditory phonetics can help refine this interpretation by considering how the human auditory system processes sound. The process involves the tympanic membrane of the ear transducing acoustical energy (sound) into mechanical energy, which is then transferred via the bones of the inner ear to the cochlea. The cochlea houses two fluid-filled sacs and the basilar membrane, and has a tapered shape. As the mechanical energy enters the cochlea, vibrations resonate at specific frequency-sensitive sections of the cochlea (because of the tapered shape and differences in stiffness). These vibrations are picked up by sensory hair cells within the basilar membrane, which convert the mechanical energy into the electrical impulses that feed into one's auditory cortex via the auditory nerve (Schuknecht, 1993; Dallos and Fay, 2012). Researchers have determined that the human aural system has more resolution for lower frequencies than higher ones, and that our frequency response follows a logarithmic curve.[5] Thus, the interpretation of phonetic experiences can be further refined as knowledge of the logarithm of the component frequencies of a speech stimulus.

[5] Stating that frequency selectivity is logarithmic in nature is a simplification; in fact, the aural system is argued to be best modeled as a series of equivalent rectangular bandwidth (ERB) filters (Glasberg and Moore, 1990).
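To ground this, the snippet below converts a frequency in Hz to a log scale and to the ERB-rate scale of footnote 5, using the Glasberg and Moore (1990) formula. This is only a sketch of how a machine might approximate the ear's warped frequency resolution; the exact parameterization used in this thesis is specified in Chapter 3.

    import numpy as np

    def erb_rate(f_hz):
        """Number of ERBs below f_hz (Glasberg & Moore, 1990)."""
        return 21.4 * np.log10(4.37 * f_hz / 1000.0 + 1.0)

    for f in (100, 200, 1000, 2000):
        print(f, round(np.log2(f), 2), round(erb_rate(f), 2))
    # Equal frequency ratios are equal steps on the log2 scale; the ERB
    # scale is roughly linear in Hz at low frequencies and roughly
    # logarithmic above ~500 Hz.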
Finally, phonetic experiences happen in time, so the 'temporal resolution' (Yu, 2017, p. 127) with which one samples component frequencies also needs to be considered. This is particularly true given the computational nature of the project, because the learning algorithm used herein requires a fixed-size vector of input parameters. The role of time in phonetic experiences is addressed further in §3.2.2.

2.1.1.4 Emergent Phonology Summary

Emergent Phonology holds that a speaker's phonology emerges from their phonetic experiences. In §2.1.1.1, phonology was reduced to the identification of phonological units; this is a simplification, but it is an important starting point that is in line with other researchers in the field (e.g. Mielke, 2008). In §2.1.1.2, emergence was reduced to an unsupervised learning system generating the correct quantity and quality of phonological units. Finally, in §2.1.1.3, phonetic experiences were reduced to knowledge of the component frequencies of speech. Thus, the present investigation into Emergent Phonology can be specified as an investigation into whether an unsupervised learning system is able to discern the correct number and correct content of the phonological units of a language when provided component frequencies of speech in that language.

It should be reinforced here that this project approaches Emergent Phonology from a limited viewpoint – it has both reduced Emergent Phonology to its most rudimentary claim and simplified phonology as a field of study. There is much more to Emergent Phonology than has been reported here, not least the drastic undertaking of formalizing a framework through which Emergent Phonology occurs (e.g. Archangeli and Pulleyblank, 2017). Nonetheless, a successful result in this project would provide a significant piece of evidence for the field – evidence that phonetics does concretely give rise to aspects of phonological structure (phonological units in particular). Comparing the results of the present learning simulations with a formal framework is left for the future.

2.1.2 Tone

As lexical tones are the phonological units under investigation in this project, I now provide an overview from the perspectives of phonetics and phonology (for an in-depth introduction to tone, see Yip, 2002). The SIL Glossary of Linguistic Terms defines tone as 'a pitch element added to a syllable to convey grammatical or lexical information.' This definition is succinct and accessible, but it is not quite an accurate representation of how linguists think of tone. In particular, the terms pitch and syllable here may be contested by researchers who work on tone – contentions left for the sections on the phonetics of tone and the phonology of tone, respectively. Lexical tone, as distinct from grammatical tone, distinguishes the meaning of words. This means that, in a language with lexical tones, pitch is used to differentiate words that are otherwise composed of the same sequence of speech sounds/segments. Table 2.2 presents a classic example of lexical tone in Mandarin. Here, the same phoneme sequence /ma/ has four distinct meanings depending on which pitch pattern is used while uttering it. With a high-flat pitch, /ma/ means 'mother'; with a rising pitch, /ma/ means 'hemp'; with a falling-then-rising pitch, /ma/ means 'horse'; and with a falling pitch, /ma/ means 'scold'. As will be discussed in §2.1.2.1, the actual pitch trajectories of these patterns can be visualized using fundamental frequency contours. Such visualizations are useful for studying the pitch range and timing qualities of a tone.

    Character            媽          麻         馬           罵
    Pinyin               mā          má         mǎ           mà
    Gloss                "mother"    "hemp"     "horse"      "scold"
    Pitch pattern (IPA)  ˥˥          ˧˥         ˨˩˦          ˥˩

Table 2.2: Mandarin tones, adapted from Xu (1997).

Grammatical tone, in contrast to lexical tone, distinguishes the grammatical function of a word, such as its tense, aspect or number. For example, in the Central Sudanic language Ngiti, tone is used to distinguish singular and plural number. In Ngiti, the phoneme sequence /kamà/, with a low-flat pitch on the second syllable, means the singular 'chief,' and the phoneme sequence /kámá/, with high-flat pitch patterns on both syllables, means the plural 'chiefs' (Kutsch Lojenga, 1994, p. 135).
To limit the scope of this project, I focus only on lexical tones.

Much like other units in phonology, lexical tones are thought of as abstractions. A lexical tone (henceforth tone) remains constant regardless of who is speaking or how they are speaking. This constancy is, in some regards, surprising; tone depends on pitch, yet pitch varies drastically across talkers. The pitch of a high-level tone of a male is, on average, much lower than that of a female, which is in turn lower than that of a child. Thus, a tone is abstracted from pitch – abstracted from the "components determined by the individual speaker" (Rose, 1987, p. 343).

The contrast of tone and pitch is somewhat mirrored in the distinction between phonology and phonetics. As discussed in §2.1.1, phonology is the study of the function and organization of discrete linguistic items – tone is a speaker-independent, discrete unit of language; phonetics is concerned with the speech signal as it exists in the physical world – pitch is a speaker-dependent signal that changes continuously through time. Thus, as was stated in §1.1.3, mapping phonetics to phonology proceeds through levels of abstraction (cf. Yu, 2011). To fully appreciate this process, both the phonetics of tone and the phonology of tone need to be addressed in more detail. An introduction to both is provided now.

2.1.2.1 Phonetics of Tone

Up until this point, the term pitch has been used. However, pitch is contentious in linguistics. This is because pitch describes one's auditory perception (of, say, a voice or musical instrument), and humans are unable to experience each other's perceptions (Nagel, 1974). As an alternative to pitch, phoneticians rely on the fundamental frequency of a sound. The fundamental frequency, or f0, of a sound is the greatest common divisor of the frequencies of its component sine waves, which in speech is (nearly always) the lowest-frequency component of the (quasi)periodic wave. With regards to the articulation of speech, f0 corresponds to the rate at which the vocal folds vibrate during a voiced segment of speech. F0 is also noted to be the "basic acoustic correlate of perceived pitch" (Rose, 1987, p. 343).

As a demonstration of calculating f0, consider Figure 2.1, which presents the waveform of a recorded /i/. The wave is quasi-periodic, evidenced by the fact that it repeats at quasi-regular intervals. Given that the wave repeats 6 times in 0.043 s, we can calculate its fundamental frequency as 6/0.043 ≈ 140 Hz. This in turn means that the speaker's vocal folds were vibrating (i.e. opening and closing) ≈140 times per second during this speech interval.

Figure 2.1: A zoomed-in waveform of a sustained /i/. With 6 quasi-periodic cycles present in 0.043 s, this utterance has an approximate f0 of 140 Hz.
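The period-counting arithmetic above is easy to automate. As a hedged sketch (not the pitch-tracking method used in this thesis; see §3.2), the following estimates f0 from a synthetic 140 Hz quasi-periodic wave by locating the strongest peak of its autocorrelation:

    import numpy as np

    sr = 16000                                  # sampling rate (Hz)
    t = np.arange(int(0.043 * sr)) / sr         # 0.043 s of signal, as in Figure 2.1
    x = np.sin(2 * np.pi * 140 * t) + 0.3 * np.sin(2 * np.pi * 280 * t)

    ac = np.correlate(x, x, mode="full")[len(x) - 1:]  # autocorrelation, lags >= 0
    min_lag = sr // 400                         # ignore lags above a 400 Hz ceiling
    period = min_lag + np.argmax(ac[min_lag:])  # lag of the strongest repetition
    print(sr / period)                          # ~140 Hz

The autocorrelation peaks at the lag where the signal best matches a shifted copy of itself, i.e. at one period, so dividing the sampling rate by that lag recovers the fundamental frequency.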
Because fundamental frequency can be calculated at various points throughout an utterance, we can plot f0 contours through time. F0 contours (sometimes termed pitch traces) are regularly used to visualize the pitch patterns associated with a tone. Figure 2.2 presents f0 contours for two Mandarin tones (high-level and rising) from a male, native speaker of Mandarin.[6]

[6] Thanks to my colleague Yadong Liu for providing the recordings.

Figure 2.2: F0 contours of a high-level and a rising tone exemplar of Mandarin, produced by a native male talker of Mandarin. Tones were produced as the first syllable of a two-syllable utterance.

Although Figure 2.2 gives the appearance that there is a simple correspondence between f0 contours and what would be described as a tone, it is somewhat misleading. This is because the phonetic variability produced in the real world is exorbitant. To demonstrate this, Figure 2.3 presents two graphs: the first overlays all of the f0 contours produced by a single talker in the Mandarin corpus used in this thesis (Yuan et al., 2015); the second restricts the f0 contours to only the rising tone of that talker. There is extensive variability in both images.

Figure 2.3: F0 contours overlaid on a single graph. The left-hand image overlays f0 contours for all tones produced by a single talker in the Mandarin Chinese Phonetic Segmentation and Tone Corpus (Yuan et al., 2015); the right-hand image overlays f0 contours for only the rising tone of the same talker.

Visualizations of f0 contours are used extensively throughout the case studies in Chapter 4, and are further discussed in §3.4.2.

The above discussion of f0 pulls from three separate subfields of phonetics: auditory phonetics, articulatory phonetics and acoustic phonetics.

• Auditory phonetics studies the processing/hearing of speech by the human auditory system. Auditorily, f0 is converted into perceived pitch, which is an integral component of tone.

• Articulatory phonetics studies the generation of speech via the manipulation of airflow by the speech organs of the human vocal tract. Articulatorily, f0 originates as glottal pulses – the opening and closing of the vocal folds while vibrating.

• Acoustic phonetics studies the physical speech signal, such as one recorded via a microphone. Acoustically, f0 is visible in a waveform and measurable by dividing 1 by the period of the wave.

This thesis considers tone information extracted from the acoustic-auditory signal, but a brief discussion of the proposed articulatory mechanisms behind tone is appropriate here to contextualize the phenomenon. Tones are phonological categories abstracted from the pitch patterns of a speaker's utterance, and pitch is correlated with the fundamental frequency of an acoustic speech signal. Fundamental frequency is, in turn, determined by the rate of vocal fold vibration. As such, articulatory phoneticians who study tone are primarily interested in the mechanisms responsible for modulating the rate of vocal fold vibration. The standard proposal is that the rate of vocal fold vibration depends on the tension of the vocal folds (Stevens and Halle, 1971). Duanmu (2007) states that the cricothyroid muscle is likely the primary mechanism used to change vocal fold tension. In addition to a single 'pitch-control mechanism' (Edmondson and Esling, 2006, p. 188), however, Esling and Harris (2005) propose that there is a series of six valves that interact to facilitate the production of, for example, tones. These valves also interact to enable voice quality distinctions such as creaky voice and breathy voice, providing an explanation for the frequently occurring pattern of certain voice qualities corresponding to certain tone categories in a variety of languages (Edmondson and Esling, 2006). The mechanisms that enable the production of tonal contrasts are still being unraveled.

In this thesis, the only data used are acoustic (audio recordings from a corpus). As such, I consider only the acoustic parameterization of tones. There are multiple ways to parameterize the speech signal for an investigation of tone; for example, Surendran (2007) considers f0, f0 change, intensity, intensity change, duration (i.e. length of sampling window) and voice quality metrics in his investigation of tone classification in Mandarin. The present project, however, is not a classification task; it is an unsupervised exploration of data. As such, there is a trade-off in using too many metrics to parameterize tone, because additional types of data introduce additional noise (making it harder for unsupervised learning algorithms to identify what is meaningful). Additionally, parameterizations that are useful for machine classification may not be clearly relatable to human experience[7]; indeed, such parameterizations may not be attended to by humans at all. In light of this discussion, I restrict my acoustic parameterization of tone to three metrics: two time-series parameterizations and one scalar parameterization. For the time-series data, fundamental frequency and its differential are used. For the scalar data, the duration of the tone is used.

[7] This is part of a much larger conversation on how engineering solutions relate to human experience.
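As a rough sketch of how these three metrics might be bundled for a learning model – assuming an f0 track has already been extracted and resampled to a fixed number of points per syllable-frame (the thesis's actual preprocessing is specified in §3.2) – consider:

    import numpy as np

    def tone_parameters(f0_track_hz, frame_duration_s):
        """Bundle f0, its differential (d1), and duration into one vector.

        f0_track_hz: f0 samples (Hz) taken at evenly spaced points across
        one syllable-frame; frame_duration_s: the frame's length in seconds.
        """
        f0 = np.log(np.asarray(f0_track_hz))  # log-scaled, per §2.1.1.3
        d1 = np.gradient(f0)                  # how f0 changes over time
        return np.concatenate([f0, d1, [frame_duration_s]])

    # A schematic rising tone sampled at 8 points over a 0.25 s syllable-frame:
    print(tone_parameters(np.linspace(120, 180, 8), 0.25))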
Fundamental frequency (f0) is the primary correlate of pitch, so its inclusion in the acoustic parameterization of tone is self-evident. Next, as the differential of a function measures how it changes over time, the differential of the fundamental frequency (d1) captures how f0 changes over time. In light of this, d1 has clear utility for identifying contour tones – tones that change pitch through time, such as the falling tone in Mandarin (Gauthier et al., 2007). Finally, duration provides a partial cue for distinguishing different tones because the more complex a tone is, the more time it generally takes to realize it phonetically (Gordon, 2001). For example, the fall and then rise of Mandarin's 2-1-4 tone regularly takes more time than the fall of Mandarin's 5-1 tone (Whalen and Xu, 1992). Note that I have not stated the duration of what – this is addressed in the following section. The acoustic parameterization of tone is fully incorporated into the method of this thesis and is further discussed in §3.2.

2.1.2.2 Phonology of Tone

As previously stated, a tone is an abstract phonological unit. In phonology, these units are generally described as relative pitch targets in a speaker's range. The most common (typologically) number of pitch targets in a tone language is two: high (H) and low (L) (Odden, 1995). Maddieson states that there may be up to five pitch targets in a given language: extra high, high, central, low, extra low (Maddieson, 1978, p. 340). In the Sinological tradition (which will generally be used throughout this thesis), these targets are labeled with numbers, a value of 5 corresponding to the highest pitch target (or level) in a speaker's range and a value of 1 corresponding to the lowest (Chao, 1930). The high tone in Mandarin, then, can be described as a 5-5 tone, meaning it starts and ends at the highest pitch level within a speaker's register. Similarly, the rising tone in Mandarin can be described as a 3-5 tone, the falling-rising tone as 2-1-4, and the falling tone as 5-1.
These pitch level descriptions are shown in Table 2.3.

    Character            媽          麻         馬           罵
    Pinyin               mā          má         mǎ           mà
    Gloss                "mother"    "hemp"     "horse"      "scold"
    Pitch levels         5-5         3-5        2-1-4        5-1
    Pitch pattern (IPA)  ˥˥          ˧˥         ˨˩˦          ˥˩
    Description          high        rising     fall-rise    falling

Table 2.3: Mandarin tones, adapted from Xu (1997).

Despite the use of the five pitch levels throughout this thesis, it is important to note that the number of pitch levels depends on a phonologist's analysis, theoretical framework and choice of convention. For example, Gussenhoven et al. (2004) present an analysis of Mandarin with only a high (H) and low (L) tone (5-5 = H; 3-5 = LH; 2-1-4 = L; 5-1 = HL) (Gussenhoven et al., 2004, p. 28). There are well-grounded theoretical reasons for doing this, but they are beyond the scope of this thesis.

Like phonemes, tones are primarily motivated by the notion of contrastiveness. Two tones are said to contrast if, all else being equal, interchanging them results in distinct words or non-words. For example, in Mandarin the falling tone contrasts with the rising tone in that /ma/ with a falling tone means 'scold' and /ma/ with a rising tone means 'hemp.' This brings us to an important practical consideration for tonal phonology – tones do not generally exist as isolated entities; they are associated with other parts of the phonological structure abstracted from a speech signal. These associations are formalized in the theory of Autosegmental Phonology (Goldsmith, 1976). In Autosegmental Phonology, a tone exists on a 'tone tier,' which can be associated with other parts of the phonological structure, such as a 'phoneme tier.'[8] As a demonstration, Figure 2.4 provides a simple visualization of the high tone of Mandarin (as analyzed by Gussenhoven) being associated with the vocalic portion of the phoneme string /ma/.

[8] Given the vertical associations assumed in Autosegmental Phonology, it is considered to be part of a broader theory termed Non-linear Phonology (i.e. phonology does not occur as beads-on-a-string through time).

    Tone tier:       H
                     |
    Phoneme tier:  / m a /

Figure 2.4: An autosegmental example of a tone associating with a phoneme.

Associations such as the one shown in Figure 2.4 are the tip of the iceberg for phonologists who wrestle with phonological structure, because standard phonological structure contains many tiers. The standard tiers include the aforementioned phonological features and phonemes, but also syllables and their constituents, words and phrases (Ewen and Van der Hulst, 2001; Gussenhoven and Wright, 2015). A visual representation of this structure is given in Figure 2.5 for the bisyllabic English word level.

Figure 2.5: Phonological structure for the English word level (/lɛvəɫ/): the word dominates two syllables (/lɛ/ and /vəɫ/); each syllable comprises an onset and a rime; each rime comprises a nucleus and, optionally, a coda; and each of these positions is filled by a phoneme, itself a bundle of phonological features.

This structure can also be broken down in prose to clearly highlight the compositionality of phonology (a computational rendering is sketched below):

• words comprise syllables
• syllables comprise onsets and rimes
• rimes comprise nuclei and codas
• onsets, nuclei and codas comprise phonemes
• phonemes comprise phonological features
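This compositionality lends itself naturally to a nested data structure. The rendering below of Figure 2.5's word level is purely illustrative; feature bundles are elided to keep the sketch small.

    # Nested-dictionary rendering of Figure 2.5 for 'level' (/lɛvəɫ/).
    # Each phoneme would itself expand into a bundle of phonological features.
    level = {"word": [
        {"syllable": {"onset": "l", "rime": {"nucleus": "ɛ", "coda": None}}},
        {"syllable": {"onset": "v", "rime": {"nucleus": "ə", "coda": "ɫ"}}},
    ]}

    # e.g. the coda of the second syllable:
    print(level["word"][1]["syllable"]["rime"]["coda"])  # ɫ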
This sidebar into phonological structure has been necessary because it shows that determining which tier tones are associated with is non-trivial in phonology. As shown in Figure 2.4, one could associate a tone with a vowel, but this is not the only option. A tone could be incorporated into a set of phonological features, or associated with a vowel, a rime, a syllable or a word. In fact, there are arguments that tones are associated with an additional part of phonological structure not discussed in this thesis, the phonological unit known as a mora – a timing unit in the rime of a syllable (Hyman, 1985; Gussenhoven and Teeuw, 2008; Hock, 1986; Pulleyblank, 1994). The challenge of associating tones with other parts of phonological structure is further exacerbated by the fact that one could individually associate the pitch level components of a tone with different units (or even different tiers). Thus, if one assumes the Mandarin falling tone exists as the pitch target sequence HL, the H could be associated with a nucleus and the L with a coda. Deciding how tones are associated with other tiers in phonological structure is important because one's association assumptions have consequences for what the tone inventory of a given language looks like. For example, although Mandarin has four lexically contrastive tones, perhaps that tone inventory is itself composed of only two of what are termed autosegments (H and L). A full consideration of this puzzle is well beyond the scope of the thesis, however, and it is put aside for now.

While a phoneme inventory for a given language comprises the contrastive speech sounds in that language, a tone inventory comprises a language's contrastive tones. As was stated at the beginning of this section, the most common tone inventory is composed of two level tones, a low and a high tone (Maddieson, 1978; Odden, 1995). From that baseline, tone inventories diversify significantly in both the number of contrastive tones and their shapes. Generally, tone inventories are small (2-5 contrastive tones), but some languages have been documented with nine (Yaohong and Guoqiao, 1998) or even 14 contrastive tones[9] (Bearth and Link, 1980). With respect to tone shape (the f0 contour), there are generally assumed to be three classes: level, simple contour and complex contour tones (Maddieson, 2013b). A level tone exists on a single pitch level (e.g. the 5-5 tone of Mandarin); a simple contour tone starts and ends on different pitch levels (e.g. the 5-1 tone of Mandarin); and a complex contour tone changes levels two or more times (e.g. the 2-1-4 tone of Mandarin). Again, the examples just given exemplify these tone classes only under a five-pitch-level analysis. For the purposes of this work, however, I do not distinguish tone classes.

[9] For those interested in learning more about tone inventories around the globe, I recommend visiting the World Atlas of Language Structures (Maddieson, 2013b).

In this thesis, I assume that a tone is associated with either the syllable, the syllable nucleus or the syllable rime, and I use the term syllable-frame as a catchall for all three options. Given this, the duration parameter identified in §2.1.2.1 is calculated as the length of time of a syllable-frame. Necessarily, only one type of syllable-frame is selected for a given language, to ensure duration values are comparable within a dataset. The choice is made explicit in each language case study of Chapter 4. It should be noted that the restriction of tone duration to that of a syllable-frame was motivated by phonological theory[10], in spite of evidence for the importance of coarticulation across syllables (Xu, 1997) and potential computational gains from considering larger temporal spans (Qian et al., 2007). See §3.2.1.1 for additional discussion.

[10] Exemplified in items on a tone tier being associated with a unit in a syllable in Autosegmental Phonology.
2.1.3 The linguistic side: Summary statement

Putting all of the pieces together, this thesis investigates whether the phonological units of tone, for a given language, emerge from the acoustic parameters of f0, d1 and the duration of a syllable-frame. A positive result would constitute a significant piece of evidence in support of Emergent Phonology.

2.1.3.1 An important caveat

There is one caveat in this investigation that has been glossed over but ought to be addressed – the notion of contrastiveness. Phonological categories are primarily motivated by contrastiveness, which requires the notion of meaning. Since the acoustic parameters used herein are naive to lexicality (i.e. there is no notion of meaning), tones that are shown to emerge cannot be said to be contrastive. They will, however, be distinct in their phonetic properties. This may actually be an interesting point for future consideration because, if contrast is not needed to identify tones, perhaps linguists' reliance on contrastiveness is overdone. Nonetheless, the incorporation of lexical meaning is left for future research.

2.2 The computational side: Machine learning

This section aims to answer the relevant questions from Table 2.1, restated here for convenience:

    Subject           Question
    Machine Learning  What is machine learning?
                      What is unsupervised learning?

In simplest terms, machine learning (most often) refers to a machine autonomously learning the implementation of some, oft-unknown, function. Mathematically, a function maps one set of inputs to another set of outputs. As an example, consider flagColours(), defined as a function that outputs the component colours of a country's flag. As such, flagColours(Canada) would return (Red, White); flagColours(Greece) would return (Blue, White); flagColours(Italy) would return (Red, White, Green); and flagColours(Hong Kong) would return (Red, White). In contrast to flagColours(), which is a deterministic and well-understood function, many functions that the human brain performs are stochastic and not fully understood.

Consider the following analogy. When asked whether there is a cat in a given image, nearly all typical humans will perform at ceiling with minimal effort. Despite this, the criteria used for deciding whether an image contains a cat or something else (e.g. a human, a dog, a mouse, a drawing of a cat, etc.) are hard to express. This is, in part, because human perception is a gestalt, integrating past experience with what is seen (Koffka, 1922). Additionally, humans generally base their interpretations on qualitative, not quantitative, criteria. For example, one might say that a cat has 'pointy ears' without defining what qualifies as 'pointiness' – a required angle? the inverse of a roundness criterion? Or, perhaps there is a cat with only one ear, which thus has the quality of 'pointy ear' and not 'pointy ears' – is it still a cat? These considerations illustrate why computer vision is a challenging area of research – the functions of visual classification, e.g. mapping an image to a hasCat quality, are not fully understood.

However, in spite of not being fully understood, humans still embody the imageHasCat() function, which outputs true if an input image contains a cat and false otherwise. This human ability enables researchers to construct datasets that consist of a set of images and the set of corresponding true or false values. These datasets can then be used to train machine learning algorithms in what is called supervised learning; a schematic example of such a dataset is given below.
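Schematically, a supervised dataset is nothing more than a collection of feature vectors paired with labels. The sketch below uses random arrays in place of real images purely to stay self-contained; it also previews the train/test split discussed in §2.2.1.2.

    import numpy as np

    rng = np.random.default_rng(0)
    features = rng.random((100, 784))      # 100 stand-in "images", 784 values each
    labels = rng.integers(0, 2, size=100)  # 1 = hasCat, 0 = otherwise

    train_x, test_x = features[:80], features[80:]  # seen during training
    train_y, test_y = labels[:80], labels[80:]      # held out for evaluation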
2.2.1 Supervised Learning

In supervised learning, there exists a dataset of inputs, termed features, and corresponding outputs, termed labels. Historically, these datasets were created by human researchers for use in training computers (e.g. Garofolo, 1993). This process – training machines on human-created datasets – has revolutionized machine performance in many areas, image/facial recognition (Russakovsky et al., 2015) and automatic speech recognition (Chiu et al., 2017; Toshniwal et al., 2018) in particular.

There is a wide variety of statistical models used for supervised learning. Some popular methods include support-vector machines (Ben-Hur et al., 2001), regression analysis (Lindley, 1990), discriminant analysis (Lachenbruch and Goldstein, 1979) and artificial neural networks (McCulloch and Pitts, 1943). For the remainder of this section, I review only neural networks, for two reasons: (1) neural networks are used in this thesis, and (2) neural networks are arguably the most active area of machine learning research right now, particularly in the area of deep learning (LeCun et al., 2015). I begin with a brief overview of neural networks and then explicate how supervised learning is implemented within a neural network using the popular example of the MNIST digit-recognition task (Deng, 2012).

2.2.1.1 A brief overview of neural networks

Artificial neural networks attempt to emulate how neurons function in the human brain. The typical structure of a neuron is shown in Figure 2.6. Neurons communicate with each other as follows: (1) dendrites receive electro-chemical signals from other neurons and transfer those signals to the soma, or cell body; (2) the soma processes the received signals and, if appropriate, propagates an electrical signal down its axon; finally, (3) the axon branches transfer the charge to the dendrites of other neurons.

Figure 2.6: An image of a neuron and its dendrites and axon.
Reproduced under a CC BY 3.0 license from https://en.wikipedia.org/wiki/Neuron#/media/File:Blausen_0657_MultipolarNeuron.png

An artificial neural network (ANN/NN) works in much the same way as networks of neurons in the brain. Figure 2.7 shows a simple ANN. In it, cell bodies are replaced with nodes, and axons/dendrites are replaced with mathematical connections to other nodes. In the brain, neurons send and receive electro-chemical signals, with the cell body determining whether the signals received from the dendrites should propagate forward down its axon. In an ANN, nodes receive and then propagate numerical values. The numerical value of a node is termed its activation. A node's activation is calculated by its activation function, which takes as input the sum of all connected nodes' activations multiplied by their respective connection weights, as sketched below.
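In code, a single node's behaviour is one line: a weighted sum of the incoming activations, passed through an activation function. The use of tanh below is only a common illustrative choice; the activation functions used in this thesis are specified in Chapter 3.

    import numpy as np

    def node_activation(incoming_activations, connection_weights, f=np.tanh):
        # Sum of connected nodes' activations, each scaled by its
        # connection weight, passed through the activation function f.
        return f(np.dot(incoming_activations, connection_weights))

    print(node_activation(np.array([0.5, -1.0, 0.25]),
                          np.array([0.8, 0.1, 2.0])))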
Figure 2.7: A simple ANN.
Reproduced under a CC BY-SA 3.0 license from https://en.wikipedia.org/wiki/Artificial_neural_network#/media/File:Colored_neural_network.svg

To my knowledge, the basic mathematical description of ANNs comes from McCulloch and Pitts (1943). Before the 1970s, neural networks were not widely used because weights were not automatically learned; they needed to be hard-coded. At the time, it made little sense to implement a function using NNs because manually calculating weights provided little insight and was not computationally efficient. This changed in 1974, when Werbos (1974) first described (what would later become) learning via the backward propagation of errors (i.e. back-prop). Back-prop enabled the efficient and autonomous learning of weights, effectively resulting in the dawn of supervised learning.

2.2.1.2 Supervised learning in neural networks

Back-prop provides the computational mechanism needed to make use of supervised learning datasets, which contain corresponding input features and output labels. Back-prop works as follows: (1) a NN is initialized with (often random) weights; (2) input features are fed into the NN input layer; (3) the input features flow through the NN via the network's weights, resulting in an output for that input; (4) the NN output is compared to the known correct output label for those input features using a loss function, often the sum-squared error, which calculates the error; (5) the error is then propagated back through the network, assigning a portion of the error to each neuron and updating the weights accordingly. The result of this process is that the next time the same input is fed into the NN, the NN output will be slightly closer to the known correct output label (i.e. the error will be less). These steps are repeated many times with many different input-output pairs, resulting in weights that minimize error across the entire dataset.

As a rough illustration, we will consider the popular MNIST dataset (Deng, 2012). The MNIST dataset contains 70,000 handwritten digits (0-9) represented as 28x28 pixel images (i.e. 784 values); example digits are shown in Figure 2.8.

Figure 2.8: Example MNIST images.
Reproduced under a CC BY-SA 4.0 license from https://en.wikipedia.org/wiki/MNIST_database#/media/File:MnistExamples.png

Each digit image also has a corresponding numerical label 0-9, represented as a one-hot 10x1 vector. This means that a value of 1 at a given index in the output vector corresponds to the label – e.g. a '1' would be represented by the vector [0 1 0 0 0 0 0 0 0 0] and a '6' would be represented by the vector [0 0 0 0 0 0 1 0 0 0]. Thus, with MNIST, our problem space is the mapping of a vector of 784 pixel values (28x28) to a vector of 10 values (one-hot digit labels). Figure 2.9 presents a visualization of what a neural network designed to learn the appropriate mapping function might look like.

Figure 2.9: A simple neural network design for learning MNIST digits.
Reproduced from https://mmlind.github.io/Simple_1-Layer_Neural_Network_for_MNIST_Handwriting_Recognition/

This network comprises only two fully-connected layers, meaning each input feature is directly connected to each node of the output layer. Generally, neural networks comprise input and output layers as well as hidden layers – layers of nodes between the input layer and output layer – but hidden layers are not needed for our illustration. I now walk through the steps of back-prop with respect to a given training exemplar and the network of Figure 2.9:

    Time step  Action
    T1         Network weights are randomly initialized.
    T2         A digit image is provided to the network, represented as a
               784x1 vector of pixel values.
    T3         Pixel values propagate forward via matrix multiplication
               with the weights.
    T4         The value of each output node is calculated via its
               activation function, given the results of T3.
    T5         The output values are compared to the known label for the
               input image using a loss function, often the sum-squared
               error; the output of the loss function is the error.
    T6         The gradient of the loss function is used to determine how
               the weights should change to reduce the error for the
               current exemplar.
    T7         The weights are updated according to the results of T6.

The consequence of this process (after training on all data points and reaching a minimum error) is that the weights of the network come to implement a good approximation of the function that maps input features to output labels – in this case, mapping 784 pixel values to a 10x1 vector whose highest-valued index corresponds to the digit class. It is important to note that the network will not learn the function perfectly, meaning it does not produce perfect one-hot vectors (i.e. nine 0s and one 1). Instead, the index of the output node with the highest value approximates the one-hot location.

Also, when training a model for supervised learning, it is important to separate the dataset into training and testing subsets. The training set is used, as the name implies, to train the network. In contrast, the testing set is not seen by the network during training; this allows it to be used to evaluate the generalizability of the network. The reasoning here is fairly intuitive. The network learns to minimize errors based on the exemplars it sees. As such, it is not surprising for the network to have a low error on those exemplars. A better evaluation is to test the network on how it generalizes to exemplars it has not seen. If the network produces low errors for unseen data, it has likely learned a good approximation of the desired function.
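The T1-T7 walkthrough above can be condensed into a few lines of NumPy. This is a hedged sketch of the two-layer network of Figure 2.9 (sigmoid outputs, sum-squared error), with random values standing in for actual MNIST pixels so the snippet stays self-contained:

    import numpy as np

    rng = np.random.default_rng(0)
    W = rng.normal(0.0, 0.01, size=(784, 10))  # T1: random initialization
    b = np.zeros(10)

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def train_step(x, y_onehot, lr=0.5):
        y_hat = sigmoid(x @ W + b)               # T2-T4: forward pass
        error = np.sum((y_hat - y_onehot) ** 2)  # T5: sum-squared error
        # T6: gradient of the loss with respect to the weights
        delta = 2.0 * (y_hat - y_onehot) * y_hat * (1.0 - y_hat)
        W[:] -= lr * np.outer(x, delta)          # T7: weight update
        b[:] -= lr * delta
        return error

    x = rng.random(784)           # a stand-in "digit image"
    y = np.zeros(10); y[3] = 1.0  # one-hot label for the digit 3
    print(train_step(x, y), train_step(x, y))  # error shrinks on the repeat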
In fact, one of the major limiting factors in the field now is the requirement of sufficiently large human-annotated datasets, which brings us to unsupervised learning.

2.2.2 Unsupervised Learning

In an unsupervised learning task, there are no labels. The goal, then, is not to learn an explicit function, but to learn something useful about the patterns and structure present in a dataset. These patterns are often utilized for tasks like dimensionality reduction or clustering. Dimensionality reduction, as the name implies, reduces the size of a data point, such as compressing an image from, say, 784 pixel values to something much smaller. The most obvious benefit of reducing the dimensionality of a data point is that it drastically reduces the computer memory needed to store it. For example, Google uses unsupervised learning models to compress images for its Google Photos app and for web searches (Toderici et al., 2017). Clustering refers to grouping data points based on similarity, often defined using a distance metric between the points. Clustering has many practical uses, including sorting humans into archetypes for better advertising or sorting articles into fake or real news (Miller et al., 2014; Bentolila et al., 2011).

One of the most notable results from unsupervised learning came out of research at Google Brain. Researchers applied an autoencoder (an unsupervised machine learning model) to thousands of static images from YouTube and were able to demonstrate that the network identified high-level features corresponding to human faces, body parts and even cat faces (Le, 2013). This result is quite gobsmacking; the network had not been given any sort of labels, yet activations in the network were seen to be sensitive to a hasCat property in much the same way as humans embody the imageHasCat function. In addition, the scale of this network was immense: it consisted of 1000 computers linking 16000 CPUs that trained on the dataset for 3 days. Fortunately, with the refinement of GPUs, such vast CPU arrays are not needed for most current research.

The Le (2013) article serves as part of the inspiration for the present project, particularly the result that a hasCat property was naturally conceptualized in an unsupervised learning system. This, of course, mirrors the current hypothesis: that the phonological category of tone will naturally take shape in an unsupervised learning system applied to acoustic parameters of natural speech. The unsupervised learning system used herein comprises an autoencoder and hierarchical clustering (with relevant evaluations of performance). I elucidate the components here.

2.2.2.1 Autoencoders

Autoencoders are a neural network model used to reduce the dimensionality of their input. To do this, they mimic a supervised learning task in which the output labels are set to the input features themselves. The trick is that the networks are designed with an hourglass shape, such that an intermediate hidden layer is of lower dimensionality than the input/output layers. Training occurs in the same way as in a supervised learning task: an input is provided, activations flow through the network and an error is calculated at the output layer. The difference here is that this error is not an error of classification; it is a reconstruction error, capturing how well the network reconstructed the input after passing through the lower-dimensional space.
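As a minimal sketch of this idea, assuming scikit-learn (with tanh standing in for the arc-tangent activations used later in this thesis, and toy data in place of a real corpus), an autoencoder is simply a regressor whose targets are its own inputs:

```python
import numpy as np
from sklearn.neural_network import MLPRegressor

X = np.random.rand(1000, 14)  # placeholder dataset of 14-dimensional vectors

# An hourglass-shaped network trained to reproduce its input through a
# 2-unit bottleneck: the output "labels" are the input features themselves.
ae = MLPRegressor(hidden_layer_sizes=(10, 2, 10), activation='tanh',
                  solver='adam', max_iter=2000, random_state=0)
ae.fit(X, X)

def encode(x, model):
    """Forward pass through the first two weight matrices, stopping at the
    2-unit bottleneck; the result is the low-dimensional code for x."""
    h = np.tanh(x @ model.coefs_[0] + model.intercepts_[0])
    return np.tanh(h @ model.coefs_[1] + model.intercepts_[1])

codes = encode(X, ae)  # shape (1000, 2): one latent code per data point
```

Once trained, only the first half of the network is needed to map new data points into the bottleneck space, which is the property exploited throughout this thesis.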
Figure 2.10 provides a schematic for a simple autoencoder.

Figure 2.10: A simple autoencoder design with labels for the encoding portion (to the latent code) and the decoding portion (from a latent code to reconstruction).
Reproduced under a CC BY-SA 4.0 License from https://en.wikipedia.org/wiki/Autoencoder#/media/File:Autoencoder_structure.png

After an autoencoder has minimized error across its dataset, it acts as a powerful dimensionality-reduction function, reducing a higher-dimensional input into the lower-dimensional space. The lower-dimensional space is termed the latent space, and the mapping of an input to this space returns a latent code (or encoding) for that input. This latent code acts as an abstraction, or, in the language of Le (2013), a high-level representation of the input. A trained autoencoder can be used to return the latent code of every exemplar in a dataset, providing a new dataset to study. This new dataset is of much lower dimensionality than the original, and as such it is much better suited to clustering analyses. This is the case because the distance metrics used in clustering become increasingly uninformative in high-dimensional spaces (Nousi and Tefas, 2018; Beyer et al., 1999). I employ hierarchical clustering in this project to cluster the latent codes returned from a trained autoencoder.

2.2.2.2 Hierarchical clustering

Like most clustering algorithms, hierarchical clustering groups data points based on their proximity to neighbouring data points. This is implemented in hierarchical clustering as follows:

1. The distance between each data point and all other data points in the dataset is calculated
2. The two closest data points globally are linked as a cluster, described by its centroid (there are other methods to describe a cluster, some of which incorporate variance as well; the centroid is used here because it can be reconstructed through the decoder network, §3.4.2)
3. The distance between each data point, each linked cluster and all other data points and linked clusters is calculated
4. The two globally closest data points or linked clusters are linked into a cluster, described by its centroid
5. Steps 3 and 4 are repeated until all data points are linked

The result of this is a dendrogram of linked data points/clusters, such as the one visualized using Scipy (Virtanen et al., 2019) in Figure 2.11.

Figure 2.11: A dendrogram of linked data points/clusters. Each arch corresponds to the distance needed to connect two points or clusters. These data are from the Fungwa case study in §4.3.

The full dendrogram has all points linked under one cluster. To separate the data into more clusters, a distance threshold can be chosen, and clusters that are connected by that distance (or larger) are separated. As the threshold is lowered, the number of clusters increases. There are also standard metrics to assess which number of clusters best fits the data (discussed in §3.4.1). An example of a cut dendrogram is presented in Figure 3.4.
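As an illustration of this procedure, the following sketch links two well-separated groups of toy 2-d points and cuts the resulting tree at a distance threshold; the data and threshold are placeholders, not values from any corpus in this thesis.

```python
import numpy as np
from scipy.cluster.hierarchy import dendrogram, fcluster, linkage

# Two well-separated groups of toy 2-d points.
rng = np.random.default_rng(0)
points = np.vstack([rng.normal(0.0, 0.3, (20, 2)),
                    rng.normal(3.0, 0.3, (20, 2))])

Z = linkage(points, method='ward')  # repeated pairwise linking of points/clusters
dendrogram(Z)                       # plot of the linkage tree, as in Figure 2.11

# Cutting the tree at a distance threshold separates it into clusters;
# lowering the threshold yields progressively more clusters.
labels = fcluster(Z, t=2.0, criterion='distance')
```

With the threshold of 2.0, fcluster returns two cluster labels for these toy data; lowering the threshold separates the tree into progressively more clusters.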
2.2.3 The computational side: Summary

Machine learning has revolutionized machine performance on many tasks (LeCun et al., 2015). The advancements are primarily thanks to supervised learning models trained on human-annotated datasets. Nonetheless, the field is changing to make use of unsupervised learning models and datasets that require minimal human annotation (such as sequence-to-sequence models). This thesis utilizes a combination of two unsupervised learning methods, autoencoders and clustering, to investigate whether the phonological category of tone can be created purely from continuous acoustic parameters.

2.3 The overlap: Combining computation and linguistics

The remaining question from Table 2.1 is: what can unsupervised learning tell us about Emergent Phonology? The answer, from my perspective, is that it can tell us whether Emergent Phonology, at its most rudimentary, is tractable, which is a valuable answer for researchers studying the acquisition of phonology or language acquisition as a whole (e.g. Dresher, 2015; Archangeli and Pulleyblank, 2015, 2012; Mielke, 2008).

Emergent Phonology proposes that phonological categories emerge (through general cognitive abilities) from phonetic experience. This requires that the variable, continuous phonetic realizations of speech naturally chunk into invariant, discrete phonological units. While machine learning cannot speak to the general abilities of humans that may lead to phonology emerging (such as memory, a sensitivity to frequency and a notion of similarity; Archangeli and Pulleyblank, 2017, p. 3), it can demonstrate that there is sufficient patterning in the phonetics to give rise to phonological categories. In fact, a positive result in this project would be quite conservative: the phonetic parameters have been restricted to f0 and duration (humans are able to perceive other acoustic-phonetic cues to tone, such as voice quality), and the model has no knowledge of lexicality (word contrasts cannot be used as cues to identify a separation of tone categories). If unsupervised learning can discern phonological categories in the current investigation, there is a strong case that a language-acquiring infant will also be able to discern such patterns (albeit via an alternative implementation).

Chapter 4 follows the methodology laid out next.

Chapter 3

Methodology: Implementation and Explication

This chapter details the precise computational method that I have developed to demonstrate that machine learning is a useful analysis tool for theoretical linguists. As noted in the thesis statement (§1.3), I do this by providing support for the theory of Emergent Phonology, showing that the lexical tones of a language arise from an unsupervised learning model trained on the acoustic-phonetic parameters of fundamental frequency, its differential and syllable-frame duration. To begin, a summary of the method is provided; this is followed by the specifics of implementation. Development was done using TensorFlow version 1.9 and Python version 3.5; the code is available on GitHub (https://github.com/mdfry).

3.1 Method Summary

The method of this thesis can be broken down into three sequential stages: (1) data preprocessing, (2) dimensionality reduction and (3) clustering. In the data preprocessing stage, acoustic parameters are extracted from syllable-frame-aligned, discretized chunks of audio; the background for this stage was provided in §2.1.2. Next, the acoustic parameters for each extracted chunk are reduced to a lower dimension, a latent code, using an autoencoder that has been trained on the same data; the background for this was provided in §2.2.2.1. Finally, the latent codes for all data points in the training data are clustered and the clusterings are evaluated for fit using several metrics; the background for this was provided in §2.2.2.2. The final goal of the method (i.e. after clustering evaluation) is to generate a hypothesized tone inventory for a language.
Ideally, a hypothesized tone inventory comprises both the optimal number of tones and the proposed shapes for those tones; however, the result for the optimal number of tones is often unclear, for reasons discussed in §3.4.1. Tone shapes are visualized as f0 contours. Table 3.1 provides an overview of these steps, and this chapter is organized along the same lines.

Stage                          Processes
(1) Data preprocessing:        Demarcate syllable-frames
                               Extract acoustic parameters
(2) Dimensionality reduction:  Train autoencoder
                               Generate latent codes of acoustic parameters
(3) Clustering:                Cluster latent codes
                               Evaluate clusters
                               Select and visualize optimal clusters

Table 3.1: Method Summary

3.2 Data Preprocessing

In this project, data preprocessing encompasses demarcating syllable-frames in audio recordings (from a corpus) and extracting acoustic parameters. The goal of these steps is to generate meaningful, uniformly formatted data (representing the acoustic-phonetic realization of exemplar tones) to be input into an unsupervised machine learning model. Meaningful data is critical in any machine learning project; no matter how powerful a learning model is, it will not perform its task if the data it is given is not meaningful for the task (this is what is meant by the common machine learning adage, "garbage in, garbage out"). In this project, apropos of tone, meaningful data is interpreted as audio chunked into syllable-frames (§2.1.2.2) that has been processed into the acoustic parameters of f0 and duration (§2.1.2.1). It is important to note that the data have been purposefully structured in a way that enables the generation of hypothetical tone inventories by the learning model. Crucially, the model has not had to learn a concept like the syllable, nor that the appropriate acoustic dimension to attend to for tone is f0. Future work will investigate how to expand the model so that such preprocessing is not necessary, ultimately working towards a system that can hypothesize tone inventories from raw acoustics (§5.3). One promising path may be to utilize the amplitude envelope as a cue for phonological unit demarcation (cf. Leong and Goswami, 2015).

3.2.1 Syllable Demarcation

Demarcated syllable-frames are attained by one of two alternatives in this project. The first alternative is simply to use timestamps, often annotated by humans, that are already available in a given corpus. The Buckeye Speech corpus (Pitt et al., 2005), used in §4.4, is an example of such a corpus. To extract syllable-frames in this way, a simple script extracts audio based on time-marked annotations. The second alternative is via forced alignment.

Forced alignment refers to the process in which "speech and its corresponding orthographic transcription are automatically aligned at the word and phone level, given a way to map graphemes to [phones] and a statistical model of how phones are realized" (McAuliffe et al., 2017, p. 1). In other words, forced alignment automatically aligns text to speech, given models of the acoustics of speech sounds in the relevant language and corresponding audio recordings and transcripts. As I have not developed my own method to do forced alignment, I do not detail the technical aspects of implementation. When forced alignment was necessary in this project, it was done with the Montreal Forced Aligner (MFA) (McAuliffe et al., 2017).
Figure 3.1 is an example of the output generated by the MFA for the Cantonese corpus used in §4.2. (Readers knowledgeable in phonetics will notice that the boundaries identified by the MFA are not precise, but they have been sufficient for the purposes of the project, as shown in the case studies of Chapter 4.)

Figure 3.1: MFA output for Cantonese. The result is a Praat TextGrid (Boersma et al., 2002) that has syllables and sound segments aligned to audio.

Once the audio and transcriptions have been force-aligned, syllable-frames can be extracted from the generated timestamps in the same way as with human-annotated timestamps.

3.2.1.1 A word on syllable-frames

As stated at the end of §2.1.2.2, the choice of restricting training exemplars to syllable-frames was motivated solely by phonological theory (e.g. a tone unit on a tone tier is associated with a unit in the syllable; see §2.4). This restriction goes against conventions established in the tone classification literature, where classification improves when temporal spans larger than the syllable are considered (Qian et al., 2007; Surendran, 2007) due to known variation of tones in different environments (cf. Xu, 2001, 1997). The primary reason for the restriction is to align the current investigation of Emergent Phonology with standard assumptions of phonology, in this case, that tone is associated with some part of the syllable. This is also reasonable because classification accuracy is not a goal of this thesis.

3.2.2 Acoustic Parameters

The second component of data preprocessing in this project is acoustic parameter extraction. As outlined in §2.1.2.1, the acoustic parameters used are: fundamental frequency (f0), the differential of the fundamental frequency (d1), and syllable-frame duration.

To measure f0 from the speech signal, I combine f0 estimates (generated at 5 ms intervals) from both Google's REAPER software (https://github.com/google/REAPER) and Praat (Boersma et al., 2002). Both REAPER and Praat have been demonstrated to perform f0 estimation of high quality (Strömbergsson, 2016; Jouvet and Laprie, 2017), and they were chosen because they complement each other in terms of their implementations: REAPER identifies glottal pulses in the frequency domain, whereas Praat standardly does autocorrelational analysis in the time domain. REAPER uses dynamic programming to track f0 estimates by glottal pulse, minimizing the cost of a given trajectory through the f0-by-pulse space. Praat, alternatively, uses an advanced autocorrelational analysis (there are other methods for estimating f0 in Praat, but the autocorrelational analysis is the default). I take points of agreement between the two methods (within 10Hz) to be highly accurate f0 estimates. Syllable-frames that have mismatched f0 estimates are discarded from the dataset; the percentage of data lost because of f0 estimation mismatches is reported in the individual language case studies of Chapter 4. By discarding mismatched f0 estimates, it is largely assured that the autoencoder will receive a clean input signal to learn from. This ensures less noise in the learning process, but it does mean that other potentially useful information within syllable-frames is lost. For example, the mismatched f0 estimates may result from non-modal phonation, because f0 estimation is challenging in such contexts.
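For concreteness, the agreement check just described can be sketched as follows. The sketch assumes parallel arrays of Praat and REAPER estimates sampled at the same 5 ms intervals; how agreeing estimates are combined is not specified in the text, so averaging them is an assumption of the sketch.

```python
import numpy as np

def consensus_f0(f0_praat, f0_reaper, tol_hz=10.0):
    """Return consensus f0 estimates for one syllable-frame, or None if the
    frame must be discarded. Inputs are parallel arrays of estimates (Hz)
    sampled at the same 5 ms intervals by Praat and REAPER."""
    if np.any(np.abs(f0_praat - f0_reaper) > tol_hz):
        return None                      # mismatched frame: discard it
    return (f0_praat + f0_reaper) / 2.0  # assumed combination: the mean
```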
Non-modal phonation, however, is relevant for tone identification, so understanding the root cause of the f0 mismatches and their impact on the machine learning process is an important task; investigating such cases will likely be fruitful, but it is left for future work.

After f0 values are estimated for a syllable-frame, they are transformed in several ways. First, the f0 values are downsampled to 7 samples per syllable-frame. This is done by fitting a cubic polynomial function to the f0 estimates and selecting 7 evenly spaced points throughout the function. The choice of 7 samples was based on the results of Yu (2017), which reports the best performance of a tone classification model in Cantonese with 7 samples per syllable-frame. As Cantonese's tone inventory is considered to be quite complex (with 6 or 7 contrastive tones; see §4.2), it is assumed that this resolution will generalize to simpler tone inventories as well. After the syllable-frame f0 values are downsampled, they are log-transformed and, finally, z-scored on a per-talker basis (following the method of Yu (2011) and Rose (1987)). A simple log-transformation was used to model the nonlinearities of the human perceptual system (a metric like ERBs, a nonlinear transformation that better matches the behaviour of the basilar membrane, would arguably have been a better choice). Z-scoring is used as a means of normalization.

The requirement for normalization in this project has two sources, one linguistic and one computational. Linguistically, normalization is needed to "separate the Accentual and Linguistic content of the acoustic stimulus from the components determined by the individual speaker" (Rose, 1987, p. 343). Speakers' voices vary considerably on multiple dimensions (e.g. age, gender, emotional state, environment), but most are not relevant to linguistic functionality. For example, in the present case of tone discrimination, the absolute pitch of a speaker's voice is inconsequential; it is the relative pitch within a speaker's range, high or low, that is critical. A perceiver must, therefore, normalize a talker's absolute pitch into a relative pitch based on properties of that talker's voice. This ensures that perceptions of different speakers' voices are perceptually comparable.

Computationally, normalization is also needed so that the data fed into a machine learning model propagate through the learning model congruously. If the data were not normalized, extreme value differences in the training data (e.g. the f0 of a male voice versus the f0 of a child's voice) could result in the model learning nothing at all (or learning something unintended).

As previously stated, to normalize f0 values in this project, they are z-scored on a per-talker basis. Z-scoring is a statistical method to convert a value within a dataset to its corresponding distance from the mean of that dataset (i.e. a fraction of the standard deviation from the mean). The practical result of this is that each talker's f0 values become normalized as a proportion of the standard deviation around a mean of 0; this allows f0 values across talkers to be compared. Finally, the z-scores are further compressed linearly into the range [-1,1].
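The per-frame f0 transformation can be sketched as follows, ordering the operations as in Algorithm 1 below. The compression constant z_max is an assumption: the text states that z-scores are linearly compressed into [-1,1] but does not give the constant, so a dataset-wide maximum absolute z-score is assumed here.

```python
import numpy as np

def normalize_f0(times, f0_hz, talker_mean, talker_std, z_max):
    """Transform one syllable-frame's f0 estimates into 7 normalized samples.
    talker_mean and talker_std are the talker's log-f0 statistics; z_max is
    the assumed dataset-wide compression constant."""
    z = (np.log2(f0_hz) - talker_mean) / talker_std  # log-transform and z-score
    z = z / z_max                                    # linear compression into [-1, 1]
    coeffs = np.polyfit(times, z, deg=3)             # fit a cubic polynomial
    t7 = np.linspace(times[0], times[-1], 7)
    return np.polyval(coeffs, t7)                    # 7 evenly spaced samples
```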
Compressing the values into [-1,1] ensures that the autoencoder will be able to faithfully reconstruct the training data (i.e. the normalized f0 values) with the arc-tangent activation functions used in the model.

With the transformation of the f0 values complete, their differential can be calculated. Numerical differentiation is a process that calculates the rate of change (direction and magnitude) of a function by taking the numerical difference between adjacent points of its output. In the present case, the normalized, downsampled f0 values are the function that is differentiated. The result of differentiating the 7 f0 values is 6 values for d1. These d1 values track how the f0 contour changes over time (i.e. whether it rises, falls, or is level).

The last acoustic parameter calculated is the duration of the syllable-frame, measured in seconds and calculated from timestamps in the corpus data.

Ultimately, data preprocessing results in a 14x1 numerical vector that captures a meaningful, acoustic-phonetic representation of tone for a syllable-frame: the acoustic parameter set comprises 7 f0 values, 6 d1 values and 1 duration value.

3.2.2.1 Implementation

Algorithm 1 presents the pseudocode for the acoustic parameter extraction in data preprocessing. This algorithm applies after speech has already been separated into syllable-frames by talker. These steps result in a vector representation of the speech acoustics, which is subsequently input into the learning model.

Algorithm 1: Algorithm to extract acoustic parameters

for each speaker in corpus do
    x = []                               ▷ Initialize variable to store f0 data
    for each utterance do
        for each syllable-frame do
            a ← estimateF0()             ▷ Calculate f0 throughout syllable-frame
            b ← log2(a)                  ▷ Take logarithm of f0
            x ← [x; b]
        end for
    end for
    xmean ← mean(x)
    xstd ← std(x)
    for each utterance do
        for each syllable-frame do
            c ← estimateF0()             ▷ Calculate f0 throughout syllable-frame
            d ← log2(c)                  ▷ Take logarithm of f0
            e ← z-score(d, xmean, xstd)  ▷ z-score with all of the talker's f0 values
            f ← downsample(e, 7)
            g ← differentiate(f)
            h ← duration()
            save [f g h]
        end for
    end for
end for

Walking through the algorithm in prose, the acoustic parameter representation of tone per syllable-frame per talker is generated by: (1) calculating all f0 values of that talker; (2) calculating the mean and standard deviation of the log-f0 values; (3) re-traversing each syllable-frame for the talker and z-scoring the log-f0 values; (4) downsampling the z-scored log-f0 values to 7 values per syllable-frame; (5) calculating the rate of change of the downsampled, z-scored log-f0 values using numerical differentiation; (6) recording the duration of the syllable-frame; and (7) compiling a vector of the downsampled, z-scored log-f0 values, the rate of change of those values, and the duration.
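The first pass of Algorithm 1, pooling a talker's f0 estimates to obtain the statistics used for z-scoring, can be sketched as follows (the per-frame transformation itself was sketched above; the data here are placeholders):

```python
import numpy as np

def talker_stats(f0_by_frame):
    """First pass of Algorithm 1: pool every f0 estimate produced for one
    talker (a list of per-syllable-frame arrays, in Hz) and return the mean
    and standard deviation of the pooled log-f0 values."""
    pooled = np.log2(np.concatenate(f0_by_frame))
    return pooled.mean(), pooled.std()

# Illustrative use with placeholder data for one talker:
frames = [np.array([180.0, 185.0, 190.0]), np.array([210.0, 200.0, 195.0])]
mean_log_f0, std_log_f0 = talker_stats(frames)
```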
3.3 Dimensionality Reduction

Once the acoustic parameters for all syllable-frames in the corpus are calculated, they form a derived dataset from the original corpus. This dataset is subsequently used to train an autoencoder. The autoencoder learns to reduce the dimensionality of the acoustic parameters, which results in high-level, low-dimensionality abstractions of them (cf. Schuster et al., 2016; Le, 2013). The abstractions are more effectively clustered than the acoustic parameters themselves (Beyer et al., 1999; Nousi and Tefas, 2018); Appendix A provides further evidence of this by comparing clustering done on the acoustic parameters themselves to clustering of latent codes.

In this project, an autoencoder architecture known as an adversarial autoencoder is used (Makhzani et al., 2015). An adversarial autoencoder combines a vanilla autoencoder (§2.2.2.1) with a Generative Adversarial Network (GAN) (Goodfellow et al., 2014). GANs pit two networks against each other during learning, hence the name adversarial. The first network, known as the generator, aims to generate good-quality fakes of some input data from a known (usually normal) distribution. The second network, known as the discriminator, aims to differentiate between fake data generated by the generator and real data from the dataset. As the network trains, the generator learns to produce better fakes, which makes it harder for the discriminator to distinguish fake data from real data. Simultaneously, the discriminator learns to discriminate between fake data and real data better, making it harder for the generator to 'trick' it with fake data. The objective of the training process is to have the generator generate fake data so well that the discriminator's final performance is at chance. When the network converges (i.e. finishes training), the system can generate very convincing (fake) data. The generative capabilities of this model have practical value for future applications such as speech synthesis, but those are not incorporated into this thesis.

In an adversarial autoencoder, the GAN is used to constrain the latent code learned by the vanilla autoencoder. To do this, samples drawn from a normal distribution via the generator (in this case, the generator simply generates normally distributed random numbers) are treated as 'real' data, and the discriminator tries to distinguish them from the 'fake' latent codes produced by the encoder. In contrast to a standard GAN, however, in an adversarial autoencoder the generator's weights are not updated to create more convincing fakes; rather, the encoder's weights are updated to create latent codes that look more similar to the output of the generator. The result of this is the generation of latent codes that are similar to the distribution of the generator (i.e. latent codes become more normally distributed). Although having normally distributed latent codes seems opposed to the goal of clustering (cf. Mukherjee et al., 2019), the results of the case studies in this thesis suggest a different story. As shown in Appendix A, the pressure to have normally distributed latent codes improves the tones hypothesized. One possible explanation for this is that the adversarial autoencoder is not only learning to generate latent codes with a normal distribution, it is also learning to minimize the reconstruction error of the autoencoder. The balancing of these two tasks may force the latent space to be used more efficiently, such that similar tone realizations are closer together in the latent space and thus clustered more effectively. Without the pull of a normal distribution, a vanilla autoencoder is free to use any part of the hidden space for any purpose, meaning similar tone realizations may be spread out (if that is how to best reduce error in the system), which makes interpretability of the latent space a challenge and may make clustering less effective.

3.3.1 Training an adversarial autoencoder

Figure 3.2 shows a diagram of the adversarial autoencoder used in this project, generated by Tensorboard (Mané et al., 2015).

Figure 3.2: A visualization of the Adversarial Autoencoder model generated using Tensorboard (Mané et al., 2015). The autoencoder, like that described in §2.2.2.1, is outlined in blue.
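The training procedure walked through below minimizes three loss functions. As a schematic rendering of them (a NumPy sketch with illustrative names, not the thesis TensorFlow code, which appears in Figure 3.3):

```python
import numpy as np

def aae_losses(x, x_hat, d_prior, d_latent, eps=1e-8):
    """Schematic adversarial autoencoder losses. x and x_hat are a batch of
    inputs and their reconstructions; d_prior and d_latent are the
    discriminator's output probabilities of 'real' for samples from the
    normal prior and for the encoder's latent codes, respectively."""
    recon = np.sqrt(np.mean(np.sum((x - x_hat) ** 2, axis=1)))             # autoencoder loss
    disc = -np.mean(np.log(d_prior + eps) + np.log(1.0 - d_latent + eps))  # discriminator loss
    gen = -np.mean(np.log(d_latent + eps))   # generator loss: updates only the
                                             # encoder, pulling codes toward the prior
    return recon, disc, gen

# Illustrative call with placeholder values:
x = np.random.rand(8, 14)
losses = aae_losses(x, x + 0.01, np.full(8, 0.7), np.full(8, 0.4))
```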
Three sections of Figure 3.2 are highlighted: the autoencoder, the generator and the discriminator. Training occurs in three simultaneous steps. In the first step, the autoencoder is trained as per normal: the input is fed through the network, compressed and reconstructed, and the reconstruction error is minimized. This step has the practical consequence of having the autoencoder learn how to optimally compress the higher-dimensional acoustic-parameter representation of tones from the data preprocessing stage. In the second step, the discriminator is trained to map the latent code from the autoencoder to a 'fake' label and the code produced by the generator (i.e. points sampled from a normal distribution) to a 'real' label. The practical consequence of this is to create a discriminator that treats normally distributed data as 'real.' In the third step, the discriminator is again fed the latent code, this time with the goal of generating a 'real' output label. This creates an error that is used to update the weights of only the encoder, so that latent codes are shifted to be normally distributed (crucially, the discriminator does not update its weights to accept latent codes as 'real').

In terms of technical implementation, an adversarial autoencoder minimizes three loss functions: the autoencoder loss, the discriminator loss and the generator loss. These are defined in the code snippet in Figure 3.3 on lines 204, 209 and 212 respectively. The autoencoder loss is the same as the reconstruction error: the root-mean of the sum-squared error between the reconstruction and the original. The discriminator loss is the cross-entropy of the discriminator mapping normally distributed data to 1 and latent codes to 0. Similarly, the generator loss is the cross-entropy of the discriminator mapping latent codes to 1 (with the aim of having the encoder change its weights to create latent codes that are normally distributed). As is seen on line 235, the overall objective of the learning model is to optimize the sum of the loss functions.

Figure 3.3: Adversarial Autoencoder loss functions.

3.3.2 Model Parameters

As with any machine learning model, there is an abundance of flexibility in model configuration. In the present model, the autoencoder has 5 layers with sizes [14 10 2 10 14]. The model was trained with a batch size of 64 data points, and an AdamOptimizer function was used with a learning rate of 0.001 and a beta of 0.9. Further, l2 regularization and arc-tangent activation functions are used within the autoencoder. The discriminator also uses l2 regularization, along with both sigmoidal and rectified linear unit activation functions. These parameters were selected empirically throughout prototyping of the method.

3.3.3 Evaluating training

After each learning batch, the model was evaluated on a test set of unseen data. Model performance was gauged using the reconstruction error on the test set. The reconstruction error for each data point in the test set was the root-mean of the sum-squared error between the reconstruction and the original data point. The error was averaged over all data points in the test set and recorded as a measure of the learning model's performance. After each batch, the error was compared to the model's performance at previous time steps.
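One way to implement such a stopping check is sketched below; the patience and threshold values are assumptions for illustration, as they are not reported in the text.

```python
def has_converged(test_errors, patience=5, min_delta=1e-4):
    """Assumed early-stopping heuristic: report convergence when the
    test-set reconstruction error has not improved by at least min_delta
    over the last `patience` evaluations."""
    if len(test_errors) <= patience:
        return False
    best_before = min(test_errors[:-patience])
    return min(test_errors[-patience:]) > best_before - min_delta
```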
The network was considered to have converged when there was minimal reduction of the error over several training epochs.

3.3.4 Reducing the dimensionality of acoustic parameters

Once the network converged, training was discontinued. The trained adversarial autoencoder was then taken as a wholesale function to reduce the dimensionality of the acoustic parameters generated in the data preprocessing stage. This resulted in a derived dataset comprising the latent codes of all data points from the training set.

3.4 Clustering

Once created, the dataset of latent codes was clustered using hierarchical clustering (Johnson, 1967). Hierarchical clustering groups data points based on inverse distance. Scikit-learn's agglomerative clustering algorithm (Pedregosa et al., 2011) is utilized in this project, with the distance function set to Euclidean distance and the linkage done using the Ward criterion. The steps of hierarchical clustering occur as follows:

1. the two most similar (least distant) points of a dataset are linked together to form a cluster
2. the next two most similar points or clusters are linked together to create a new or a larger cluster
3. step (2) is repeated until the final linkage between clusters has been made

After clustering is complete, the task becomes identifying the optimal number of clusters for the data. Ideally, this would be achieved by using evaluation metrics for clustering.

3.4.1 Evaluation Metrics

Determining the optimal number of clusters for a dataset is incredibly challenging. This is because clustering is entirely unsupervised: there are no labels to help distinguish whether one clustering is a better representation of the structure of a dataset than another. This is why, in his comprehensive survey of clustering techniques, Jain (2010) writes that "the validation of clustering structures is the most difficult and frustrating part of cluster analysis" (Jain, 2010, p. 222). Nonetheless, there are several heuristics that researchers can utilize to evaluate the overall fit of a clustering analysis.

One commonly used heuristic is achieved by visualizing the results of hierarchical clustering on a dendrogram. A dendrogram shows the distances needed to link two points/clusters on the y-axis and the points/clusters themselves, in arbitrary order, on the x-axis. For example, in the visualization of Figure 3.4, the distance between the final two clusters is just above 5 units of distance.

Figure 3.4: A dendrogram of linked data points/clusters. The upper image presents just the dendrogram. The lower image presents the same dendrogram with the longest distance cut, suggesting the optimal clustering for these data is four clusters.

Dendrograms are potentially useful for identifying the optimal number of tonal categories because each successive link has an associated distance. Thus, the distance needed to go from n clusters to n+1 clusters is quantifiable. Unfortunately, interpretation of a dendrogram is largely subjective. One technique to determine the optimal clustering on a dendrogram is to cut the longest distance and count the resultant number of clusters. An example of this is shown in Figure 3.4, where cutting the longest distance suggests the optimal number of clusters is four. The rationale for this heuristic is that the longest distance marks where clusters are least cohesive, and as such, those clusters should be kept separate.

Given that interpreting a dendrogram is subjective, most researchers rely on indices that combine intracluster cohesion and inter-cluster dispersion.
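Both the dendrogram heuristic and the indices discussed next are straightforward to compute with the tools already cited. The following sketch uses random points as a stand-in for the latent codes clustered in this project; the largest-gap rule used here is one common operationalization of "cutting the longest distance".

```python
import numpy as np
from scipy.cluster.hierarchy import linkage
from sklearn.cluster import AgglomerativeClustering
from sklearn.metrics import (calinski_harabasz_score, davies_bouldin_score,
                             silhouette_score)

codes = np.random.rand(500, 2)      # placeholder for the 2-d latent codes

# Dendrogram heuristic: cut the tree where the linkage distance grows most.
Z = linkage(codes, method='ward')
gaps = np.diff(Z[:, 2])                              # growth in merge distance
n_optimal = len(codes) - (int(np.argmax(gaps)) + 1)  # clusters left after the cut
print('dendrogram cut suggests', n_optimal, 'clusters')

# Variance-based indices over a range of candidate cluster counts:
for k in range(2, 10):
    labels = AgglomerativeClustering(n_clusters=k, linkage='ward').fit_predict(codes)
    print(k,
          calinski_harabasz_score(codes, labels),  # higher is better
          davies_bouldin_score(codes, labels),     # lower is better
          silhouette_score(codes, labels))         # higher is better
```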
The reasoning behind such indices is that an optimal clustering for a dataset should have clusters that have little variance within themselves and are well separated from other clusters. Unfortunately, in real-world scenarios there is inevitably considerable overlap between clusters, and it is often challenging to find an ideal balance between intracluster and inter-cluster variance. Consequently, these indices are again more of a heuristic for identifying the optimal clustering for some data. In this thesis, I use three common evaluation metrics for optimal clustering (with unlabeled data): the Davies-Bouldin Index (Davies and Bouldin, 1979), the Calinski-Harabasz Index (Caliński and Harabasz, 1974), and the Silhouette Index (Rousseeuw, 1987). The Davies-Bouldin Index evaluates a clustering by comparing each cluster's intracluster scatter to its similarity with other clusters. The Calinski-Harabasz Index evaluates the clusters by comparing inter-cluster dispersion and intracluster dispersion. The Silhouette Index evaluates clusters by calculating how similar each data point is to the data points within its cluster relative to the data points in other clusters. As I am neither a clustering expert nor a statistician, these metrics are largely taken off the shelf, given their implementations in Scikit-learn (Pedregosa et al., 2011). Importantly, higher values for the Calinski-Harabasz Index and Silhouette Index indicate better clustering, while lower values for the Davies-Bouldin Index indicate better clustering. Again, however, the unsupervised nature of clustering means these metrics may not accurately reflect the optimal clustering for a given dataset.

3.4.2 Visualizing Results

Once the data are clustered (regardless of whether an optimal clustering can be determined), each cluster can be visualized as an f0 contour. This is achieved by reconstructing data points from a cluster in the latent space via the adversarial autoencoder. Specifically, the decoding portion of the autoencoder (i.e. the decoder) can be used to generate a reconstruction from each data point in the latent space. The decodings are of the same type as the input into the autoencoder: vectors of 14 values (7 f0 values, 6 d1 values and 1 duration value). The f0 and duration values for a set of latent codes within a cluster can be averaged (i.e. a centroid of the cluster) and then plotted, producing a graph like that shown in Figure 3.5.

Figure 3.5: Hypothesized Tones

This graph presents the prototypical f0 contours of two clusters identified in the latent space. The x-axis shows time and the y-axis shows normalized frequency. This graph could be interpreted as a tone inventory comprising two tones that contrast in level (e.g. high and low). The majority of results presented in Chapter 4 are prototypical f0 contours that correspond to clusters identified in the latent space of the unsupervised learning model. As figures like this can be generated for clusterings of any number, hypothesized tone inventories for a language can be generated with an arbitrary number of tones. Thus, we can compare the tone inventories hypothesized by the method with the standard tone analysis of a language even if the evaluation metrics do not provide a clear indication of the optimal number of clusters.
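On one reading of this procedure (averaging in the latent space and decoding the centroid, as anticipated in §2.2.2.2), a prototypical tone can be produced as follows; `decoder` is assumed to stand for the trained decoder half of the adversarial autoencoder, and the stand-in used below is only for illustration.

```python
import numpy as np

def prototype_tone(cluster_codes, decoder):
    """Reconstruct one cluster's prototypical tone: average the cluster's
    latent codes into a centroid, decode it back to the 14-d acoustic
    parameter vector, and split that vector into its components."""
    centroid = np.mean(cluster_codes, axis=0)
    recon = decoder(centroid)                 # 14 values: 7 f0, 6 d1, 1 duration
    f0, d1, duration = recon[:7], recon[7:13], recon[13]
    # Plot f0 against np.linspace(0, duration, 7) to draw the contour.
    return f0, d1, duration

# Example with a stand-in linear decoder (a real decoder would be the
# trained network's decoding portion):
demo_decoder = lambda c: np.tile(c.sum(), 14)
f0, d1, dur = prototype_tone(np.random.rand(50, 2), demo_decoder)
```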
3.5 Chapter Summary

This chapter has detailed the method developed for the current investigation. Specifically, it has outlined the necessary data preprocessing and the implementation of the unsupervised machine learning model used to determine whether phonological units emerge from phonetic data. The following chapter reports the results of this method applied to several languages: Mandarin, Cantonese, Fungwa and English.

Chapter 4

Case Studies

This chapter reports the results of the method outlined in Chapter 3 (henceforth, the method) as applied to several languages. The chapter is organized as a series of case studies, with each case corresponding to a language that the method was applied to. The primary goal of each case study is to see whether the method hypothesizes a tone inventory that aligns with the tone inventory standardly reported for that language in the literature. If the hypothesized tone inventory matches the standard inventory, I consider it a clear demonstration of machine learning's usefulness as an analysis tool for linguists, as the learning system will have automatically derived a practical linguistic analysis. Additionally, such a result is argued to be evidence for the theory of Emergent Phonology, as tones (i.e. a part of phonology) will have emerged from parameterized acoustic data (i.e. phonetic experiences).

Each case study introduces a language and its tone inventory, then provides a brief motivation for the language's inclusion in the present project, and finally reports the results of the method for that language. The results are presented as a series of progressively larger (in number) hypothesized tone inventories for the language. The size of a hypothesized tone inventory corresponds to a preset number of clusters for a given clustering implementation. It is the role of the clustering evaluation metrics to determine the optimal number of clusters. Considering the tone inventories as they increase in number of tones has practical value. For instance, if the evaluation metrics fail to identify a single optimal clustering, each progressively larger tone inventory can still be compared to the standard tone analysis of the language. Additionally, there may be relationships between tones in a language, such as in-progress tone mergers (e.g. Yu, 2007), that manifest in the progression from fewer to more hypothesized tones.

Finally, following the discussion of visualizations provided in §3.4.2, each cluster is visualized as an f0 contour with a defined duration. Each f0 contour corresponds to a cluster identified in the latent space and is therefore the hypothesized tone corresponding to that cluster. As the method is ignorant of word meaning, the hypothesized tones should be thought of as distinct acoustic patterns in an acoustic-phonetic space and should not be thought of as contrastive in the phonological sense. There are four case studies in total, one each for Mandarin, Cantonese, Fungwa and English.

4.1 Case Study I: Mandarin

4.1.1 Mandarin Tones

Mandarin, also known as Standard Chinese, is a Sino-Tibetan language with approximately 1.3 billion first-language speakers. There are four contrastive tones in Mandarin: high-level, rising, falling-rising and falling (Chao, 1965; Xu, 1997).
An example of these contrasts is presented in Table 4.1.

Character             媽          麻         馬           罵
Pinyin                mā          má         mǎ           mà
Gloss                 "mother"    "hemp"     "horse"      "scold"
Pitch levels          5-5         3-5        2-1-4        5-1
Pitch pattern (IPA)   ˥           ˧˥         ˨˩˦          ˥˩
Description           high        rising     fall-rise    falling
Tone Label            Tone 1      Tone 2     Tone 3       Tone 4

Table 4.1: Mandarin tones, adapted from Xu (1997)

For Tones 1, 2 and 4, there is a largely felicitous match between phonetic realizations and their corresponding phonological analysis (e.g. using pitch level representations of 5-5, 3-5 and 5-1) (Duanmu, 2007). This is not the case, however, for Tone 3, which has several notable variants. First, Tone 3 is often (but not necessarily) realized with creaky voice quality (Gårding et al., 1986). Additionally, Tone 3 surfaces phonetically with a low-falling pitch in non-utterance-final positions. This variant is better described using pitch levels 2-1 (Xu, 1997). As the speech used in this case study is continuous, the majority of Tone 3s will be non-utterance-final. This, in turn, means that the majority of Tone 3s will occur as the low-falling variant in this dataset.

In addition to the four contrastive tones of Mandarin, a fifth neutral tone that occurs in unstressed syllables is regularly posited. The neutral tone generally surfaces on discourse particles or the second syllable of a reduplicant. For example, the discourse particle de (/də/) and the second syllable in 媽媽 (/ma55.ma/, 'mother') are analyzed as neutral tones. While the neutral tone does not have a prescribed pitch (i.e. pitch is not the most informative cue for the neutral tone (Surendran, 2007)), it generally occurs in the middle of a speaker's pitch range (i.e. lower than 5-5, higher than 2-1) and varies depending on the previous tone (Wang, 2004; Linge, 2015).

In order to practically evaluate the success of the method in Mandarin, a 'correct' tone inventory needs to be assumed. There are multiple ways to achieve this, such as using citation-form f0 contours of a single talker, using average f0 contours of a set of tones for a single talker, or using normalized, average f0 contours of a group of talkers. What is more, any 'correct' inventory will undoubtedly vary depending on the circumstances in which tones are elicited. To generate the most directly relevant 'correct' inventory for this case study, I have used the normalized, mean f0 contours of all talkers in the corpus of this case study, as marked with ground-truth labels. This ensures that the results of the method are appropriately evaluated. Figure 4.1 presents this visualization.

Figure 4.1: Mean f0 contours of the four contrastive Mandarin tones and the neutral tone. These contours were derived from the corpus of this case study using the ground-truth tone labels.

For my purpose, the tones of Figure 4.1 are assumed to be the 'correct' tones of Mandarin in the acoustic-phonetic space that corresponds to phonologically contrastive tones. By comparing the tones hypothesized by the method to the inventory in Figure 4.1, we can assess the performance of the method. As stated at the beginning of the chapter, if there is a match, it will have been shown that unsupervised machine learning can both (1) provide an analysis of a language that is similar to that of a human linguist, and (2) provide empirical support for Emergent Phonology.

4.1.2 Motivation for inclusion

Mandarin was chosen as the initial case study for the method for several reasons. First, it is a widely known tone language. Second, it is a language with prescriptively standardized pronunciation (Duanmu, 2007).
Given this, the assumed 'correct' tone inventory in Mandarin is well motivated in both the number of tones and their respective shapes. Finally, there is an abundance of data for Mandarin. In combination, these points make Mandarin an ideal choice for a proof-of-concept demonstration of the method.

4.1.3 Corpus Data

The Mandarin speech data used in this case study are from the Mandarin Chinese Phonetic Segmentation and Tone corpus (Yuan et al., 2015). This corpus comprises 7849 utterances derived from the Mandarin Broadcast News Speech corpus (Huang et al., 1998), totalling approximately 30 hours of continuous news-broadcaster read speech. The corpus contains 20 speakers, 13 male and 7 female. All recordings are single channel with a sampling rate of 16kHz.

4.1.4 Data Preprocessing

The Mandarin Chinese Phonetic Segmentation and Tone corpus (MCPST) contains phonetically aligned transcripts of syllable onsets and syllable rimes (known as initials and finals in Pinyin). The alignment was done using the LDC Forced Aligner (Yuan et al., 2014). Given the time labels, the only data preprocessing needed was the extraction and normalization of the acoustic parameters outlined in §3.2. For this dataset, duration was calculated over syllable rimes. As previously stated, utterances with f0 estimations that did not match between Praat and REAPER (within 10Hz) were discarded. In this corpus, that amounted to ≈15% of f0 samples being discarded (as f0 samples were discarded prior to aligning with text, I cannot report distributional information on which tone tokens were discarded). Thereafter, the corpus was divided randomly into 90% training data and 10% testing data. Once broken down into syllable-frame audio chunks, the training data consisted of 71260 syllable-frames with a tone category distribution of 16921 T1, 17621 T2, 10255 T3, 23989 T4 and 2474 neutral. While the method remains naive to labels, the distribution of ground-truth tones in an identified cluster can be calculated; these distributions are reported in Appendix A. Such visualizations are left for the Appendix because classification is not the present goal of this project.

4.1.5 Results

4.1.5.1 Adversarial Autoencoder Performance

The adversarial autoencoder was trained using two NVIDIA GTX 1070s, and training took approximately 3 hours. The adversarial autoencoder converged after ≈180 epochs through the dataset. This is seen in the reduction of root-mean-squared error on the testing set, shown in Figure 4.2.

Figure 4.2: Adversarial autoencoder convergence for the Mandarin corpus data, as shown in the reduction of reconstruction error on the test set.

A visual assessment of the model is also possible by comparing ground-truth f0 contours with those reconstructed from the autoencoder. Example reconstructions are shown in Figure 4.3.

Figure 4.3: Reconstructed Mandarin tones represented as normalized f0 contours. The top row shows ground-truth exemplars; the bottom row shows corresponding reconstructions.

The f0 contours are notably smoothed in the reconstructions. This is likely because a smooth line more optimally fits large batches of ground-truth data that have both positive and negative perturbations.

4.1.5.2 Hypothesized Tone Inventories

Once the adversarial autoencoder converged, each data point (training and testing data) was passed through the adversarial autoencoder to generate its corresponding 2-dimensional latent code (§3.3). All latent codes were then compiled and clustered using hierarchical clustering (§3.4).
Clustering was done for preset numbers of clusters ranging from two to nine. This range was chosen given hindsight of the results of the clustering metrics. By comparing tone inventories of varying sizes, the hope is that the clustering evaluation metrics can determine which clustering is ideal.

The hypothesized tone inventories of Mandarin are presented in three sets. The sets differ in terms of their visualization, but they are derived from the same data. In the first set, Figure 4.4, hypothesized tones are shown side by side with error bars corresponding to the range of variability seen in the cluster that is visualized as that tone.

Figure 4.4: Hypothesized tones as generated by the method for Mandarin, visualized as f0 contours. Each pane corresponds to the set of hypothesized tones for a preset number of tones (two through nine). Error bars represent variability around the median f0 values.

In the second set, shown in Figure 4.5, all reconstructions corresponding to a cluster identified in the latent space are visualized. Each frame provides a visualization of the variability seen in its associated cluster in Figure 4.4. Keeping with the discussion in §3.4.2, each line is a reconstructed f0 contour.

Figure 4.5: Visualization of the variability of each tone cluster identified by the method for Mandarin for a preset number of tones (two through nine). Each plotted line corresponds to an f0 contour within the identified cluster.

While visualizing the variability of an identified tone is useful, it is simplest to consider the mean of each cluster (i.e. a hypothesized tone) overlaid on a single graph (for a given number of clusters). Such visualizations are likely more familiar to researchers who work on tone languages, as they provide snapshots of the tone inventory of the language. The assumed tone inventory of Mandarin, shown in Figure 4.1, is an example of such a visualization. Figures 4.6-4.8 present mean f0 contours from the data presented in Figure 4.5, with mean duration also incorporated. That said, duration does not appear to vary significantly. This is likely because it is an average across a large dataset. One possible way to avoid this in the future could be to normalize syllable-frame duration with respect to speech rate, but that is left for future refinements of the method.

Figure 4.6 presents the results for Mandarin as analyzed with two and three tones. The lefthand graph presents the result of the method for Mandarin as analyzed with two acoustically distinct tones; the tone inventory consists of a high-level tone and a low-level tone. The righthand graph presents the results for Mandarin as analyzed with three tones; this tone inventory could be interpreted as comprising three level tones (high, mid, low), or perhaps a low tone and two diverging high contour tones.

Figure 4.6: Hypothesized tones for Mandarin with an inventory comprising two tones (left) and three tones (right).

The analysis of Mandarin with four and five tones is presented in Figure 4.7.
The results of the method for four tones include a high-level, a low-level, a rising and a falling tone. With five tones, there are a high-level, a mid-level, a low-level, a falling and a rising tone. Impressionistically, these results match well with the Mandarin tones assumed in Figure 4.1. The assumed tones of Mandarin also comprise a high-level (Tone 1), a mid-level (neutral), a low-level (Tone 3), a falling (Tone 4) and a rising tone (Tone 2).

Figure 4.7: Hypothesized tones for Mandarin with an inventory comprising four tones (left) and five tones (right).

Moving to six, seven, eight and nine tones, there is a consistent pattern of additional variants of level tones being identified (shown in Figure 4.8). With six tones, the mid-level tone from the five-tone analysis appears to have separated into two mid-level tones, while the other tones remain largely unchanged. Similarly, with seven tones, the high-level tone from the six-tone analysis appears to have separated into two high-level tones. With eight tones, the low-mid level tone from the seven-tone analysis appears to have separated into a low-mid falling tone and a low-mid level tone. Finally, with nine tones, the high-mid level tone from the seven-tone analysis appears to have separated into two variants.

Figure 4.8: Hypothesized tones for Mandarin with an inventory comprising six through nine tones.

4.1.5.3 Cluster Evaluation

While the tone inventory visualizations above are interesting, a goal of this work is to identify the correct number of tones for a given language without supervision or ascribing to prior knowledge. Unfortunately, the clustering metrics do not provide a consistent answer. As mentioned in §3.4.1, identifying the optimal number of clusters is challenging given the unsupervised nature of the task. Still, the application of metrics as heuristics may provide some value to researchers.

The evaluation metrics are presented in two parts. In the first, Figure 4.9, the dendrogram is analyzed via the heuristic method described in §3.4.1 (which is briefly recapitulated here).

Figure 4.9: Dendrogram evaluation of Mandarin tone clusterings. By cutting the longest distance of the dendrogram, the optimal clustering comprises five tones.

To identify the correct number of clusters in a dendrogram, a cut is made at the longest distance needed to join two clusters. The cut is shown by the black line in Figure 4.9. The dendrogram for Mandarin indicates that the optimal number of clusters is five.

The second set of metrics, shown in Figure 4.10, compares within- and between-cluster variance (as outlined in §3.4.1).

Figure 4.10: Variance evaluations of Mandarin tone clusterings. The CH-Index (left) indicates the optimal number of clusters is two; the DB-Index (centre) indicates two; and the Silhouette Index (right) also indicates two.

As previously stated, a high value is optimal for the CH-Index and the Silhouette Index, and a low value is optimal for the DB-Index. Thus, the CH-Index, DB-Index and Silhouette Index all indicate that the optimal number of clusters (i.e. tones) for Mandarin is two. Given the assumed inventory for Mandarin in Figure 4.1, the optimal number of clusters for Mandarin should be five. In this case study, then, the dendrogram appears to be the only evaluation metric to have identified the correct number of tones. That said, if one were to follow Gussenhoven et al.
(2004)'s analysis of Mandarin, which consists of only two tones (H and L), it appears the variance-based evaluation metrics were the ones to identify the correct number of tones. Regardless, the dendrogram evaluation does not match the variance-based evaluations, so the picture is not entirely clear here.

4.1.6 Discussion

The goal of this project is to demonstrate that machine learning is a useful analysis tool for theoretical linguists. To do this, I aimed to support the theory of Emergent Phonology by demonstrating that lexical tones emerge from the acoustic parameterization of speech in the language (§1.3). Despite the evaluation metrics not providing a consistent result, this goal has largely been achieved, because the method did hypothesize a tone inventory that is consistent with the standard tone analysis of Mandarin. This is shown in Figure 4.11.

Figure 4.11 presents a summary of the method for Mandarin, comprising four visualized tone inventories. The first inventory is the assumed correct inventory for Mandarin, which was generated using ground-truth labels for the corpus data. The subsequent three are all hypothesized tone inventories generated by the method. In order from left to right, the first hypothesized inventory matches the prescribed number of tones for Mandarin, five. Next, the hypothesized inventory identified as optimal by the variance-based metrics is presented. Finally, the hypothesized inventory identified as optimal by the dendrogram is presented.

Figure 4.11: A comparison of the standard analysis of Mandarin tones (left; five tones) with hypothesized tone inventories generated by the method. The hypothesized inventories contain: (a) the same number of tones as is standardly reported for the language (five); (b) the optimal number of tones as determined by the variance metrics (two); and (c) the optimal number of tones as determined by the dendrogram (five).

This result demonstrates that the correct tone inventory of Mandarin can arise solely from considering the surface phonetics of the language, and thus provides empirical support for the theory of Emergent Phonology (the neutral tone is located in a different location, but that is unsurprising given that duration is a better indicator of the neutral tone in Mandarin than f0 (Surendran, 2007)). That said, the disagreement among the evaluation metrics in determining the optimal number of tones for Mandarin means the method has not achieved the supplementary goal of this thesis: to create a grammaticus ex machina, a linguist (grammarian) from the machine. This may, however, be an understandable failure, because the method is naive to word meaning, while human linguists motivate their analyses by phonological contrasts. A larger discussion of this is provided in §5.4.

4.2 Case Study II: Cantonese

4.2.1 Cantonese Tones

Cantonese is a Sino-Tibetan language spoken primarily in Hong Kong, Macau, GuangDong and GuangXi. Cantonese encompasses several mutually intelligible dialects that share similar phonology and morphosyntax (Matthews and Yip, 2013).
The standard analysis of Cantonese contains six contrastive tones, shown in Table 4.2.

Table 4.2: Cantonese tones, adapted from Lam et al. (2016).

Character:     詩          史           試          時         市          是
Jyutping:      si1         si2          si3         si4        si5         si6
Gloss:         "poetry"    "history"    "to try"    "time"     "market"    "to be right"
Pitch levels:  5-5         3-5          3-3         2-1        1-3         2-2
Pitch (IPA):   ˥˥/˥˩       ˧˥           ˧˧          ˨˩         ˩˧          ˨˨
Description:   high-level  high-rise    mid-level   low-fall   low-rise    low-level
Tone label:    Tone 1      Tone 2       Tone 3      Tone 4     Tone 5      Tone 6

The tone inventory of Cantonese is more complex than that of Mandarin. In particular, Cantonese has tones that contrast both in terms of level (T1-T3-T6; T2-T5) and in terms of contour shape (e.g. T1-T2). In addition to the six standard tones of Cantonese, there are another three checked tones that correspond to level tones (T1, T3, T6) and occur in syllables that end with a stop (Yu, 2007). Given the acoustic parameters used in this thesis, checked tones would primarily be marked by a shortened duration of the vocalic portion of a syllable. However, as the syllable-frame used for Cantonese is the entire syllable,² the duration of the vocalic portion of the syllable is not reliable and checked tones are not distinguished from their non-checked counterparts. There is, however, an additional tonal variant of Tone 1 that is assumed in this project. The reason for this is that the data (Adrus et al., 2016) used in this case study comprise recordings of speakers of GuangZhou Cantonese, and there are two variants of Tone 1 reported in this dialect, a high-level and a high-falling tone (Ou, 2012). Shi (2004) reports that younger generations produce both variants in free variation, which means that the high-falling variant will be present in the data used herein. Henceforth, Tone 1a will refer to the high-level variant of Tone 1, and Tone 1b will refer to the high-falling variant.

2 The reason for this is to avoid inaccuracies of individual segment boundaries that occurred during the forced alignment process.

As with Mandarin, in order to practically evaluate the success of the method in Cantonese, a 'correct' tone inventory needs to be assumed. Figure 4.12 presents mean f0 contours for each tone, generated using the ground-truth labels of the corpus, except for the high-falling variant of Tone 1. The high-falling variant of Tone 1 was not annotated in the corpus data, so it was added manually (an average of raw exemplars manually extracted from the corpus) to this visualization. As with the Mandarin f0 contours, these f0 contours have been normalized for duration.

Figure 4.12: Mean f0 contours of the six contrastive Cantonese tones and the high-falling variant of Tone 1. These contours were derived from the corpus of this case study using the ground-truth tone labels. Note: the high-falling variant of Tone 1 was added manually (an average of raw exemplars manually extracted from the corpus) because it was not annotated in the corpus data.
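Duration normalization of this kind can be done by resampling every contour onto a common time base before averaging. The sketch below is one minimal way to do so; the use of linear interpolation and the choice of 30 samples per contour are illustrative assumptions on my part, not necessarily the procedure of §3.2.

```python
import numpy as np

def normalize_duration(f0: np.ndarray, n_points: int = 30) -> np.ndarray:
    """Resample an f0 contour to a fixed number of samples so that
    contours of different raw durations can be averaged and compared."""
    old_time = np.linspace(0.0, 1.0, num=len(f0))
    new_time = np.linspace(0.0, 1.0, num=n_points)
    return np.interp(new_time, old_time, f0)

# e.g., averaging two tokens of one ground-truth tone category
tokens = [np.array([110.0, 115.0, 122.0, 130.0]),
          np.array([112.0, 118.0, 125.0, 133.0, 138.0])]
mean_contour = np.mean([normalize_duration(t) for t in tokens], axis=0)
```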
4.2.1.1 Cantonese Tone Mergers

In tone languages, the contrast between two acoustically similar tones may be obscured diachronically (with perceivers relying more on contextual information than on the acoustic realization of the tone itself). When this occurs, it is called a tone merger. The process of two tones merging is not discrete and often occurs gradually over successive generations of new speakers. In Cantonese, several tone mergers are considered to be underway (Mok and Wong, 2010; Mok et al., 2013; Lam, 2018; Ou, 2012; Bauer et al., 2003). For GuangZhou Cantonese specifically, Ou (2012) identifies mergers for T3-T6 and T4-T6, and also notes perceptual confusion between T2 and T5. While it is unclear what effects these mergers will have on the method, it is likely that they will have some influence on the hypothesized tone inventories.

4.2.2 Motivation for Inclusion

Cantonese was selected as a language for this project for several reasons. First, it is a well-known tone language that has been studied extensively by the linguistic community (Matthews and Yip, 2013; Bauer and Benedict, 2011; Yip, 2002; Silverman, 1992). Second, the tone inventory of Cantonese is more complex than that of Mandarin, so it provides a more challenging test of the method's ability to hypothesize accurate tone inventories. Third, there is an abundance of data available for Cantonese (Adrus et al., 2016; Leung and Law, 2001; Lee et al., 2002). And finally, applying the method to Cantonese provides a unique opportunity to test whether it can provide evidence of the in-progress tone mergers of Cantonese.

4.2.3 Corpus Data

The Cantonese speech data used in this thesis are from the IARPA Babel Cantonese Language Pack (Adrus et al., 2016). It comprises 215 hours of conversational and scripted speech. The dataset contains talkers from GuangDong and GuangXi. All audio was recorded with a sampling rate of 8kHz. Only the scripted conversations were used in this work.

4.2.3.1 Data Preprocessing

The IARPA corpus contains audio recordings and transcriptions. There were no time markings in the transcripts. The scripted dataset consists of 16243 utterances, ranging in length from 1 syllable to 29 syllables. In order to extract syllable-frame chunks of audio, the transcriptions were force-aligned to the audio using the Montreal Forced Aligner (McAuliffe et al., 2017) and spot-checked for accuracy. The acoustic speech sound models of the aligner were trained simultaneously on the dataset itself. Before performing the forced alignment, the audio was resampled to 16kHz as required by the MFA. Thereafter, the acoustic parameters were estimated and normalized in accordance with §3.2. As previously stated, utterances with f0 estimations that did not match between Praat and REAPER (within 10Hz) were discarded. In this corpus, that amounted to ≈ 22% of f0 samples being discarded (as f0 samples were discarded prior to aligning with text, I cannot report distributional information on what tone tokens were discarded). The syllable-frame for the Cantonese data was each whole syllable itself. Once broken down into syllable-frame audio chunks, the training data consisted of 68375 syllable-frames with a tone category distribution of 19905 T1,³ 7732 T2, 11015 T3, 9605 T4, 5061 T5, and 15057 T6. The corpus was then divided randomly into 90% training data and 10% testing data. The model was naive to tone category labels.

3 The corpus does not annotate T1a and T1b separately.
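The cross-tracker agreement check used in preprocessing lends itself to a simple vectorized filter. The sketch below assumes the Praat and REAPER tracks have already been sampled at matching time points, and it arbitrarily retains the Praat estimate for agreeing samples; both choices are assumptions rather than details taken from the thesis.

```python
import numpy as np

def keep_agreeing_f0(praat_f0: np.ndarray, reaper_f0: np.ndarray,
                     tol_hz: float = 10.0) -> np.ndarray:
    """Discard f0 samples where the two trackers disagree by more than
    tol_hz; agreement serves as a proxy for tracking reliability."""
    mask = np.abs(praat_f0 - reaper_f0) <= tol_hz
    return np.where(mask, praat_f0, np.nan)   # NaN marks discarded samples

praat = np.array([120.0, 180.5, 95.0, 210.0])
reaper = np.array([121.0, 160.0, 96.5, 212.0])
print(keep_agreeing_f0(praat, reaper))  # second sample dropped (>10 Hz apart)
```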
4.2.4 Results

4.2.4.1 Adversarial Autoencoder Performance

The adversarial autoencoder was trained using two NVIDIA GTX 1070s and training took approximately 3 hours. The adversarial autoencoder converged after ≈ 150 epochs through the dataset. This is seen in the reduction of the root-mean-squared error on the testing dataset shown in Figure 4.13.

Figure 4.13: Adversarial autoencoder convergence for the Cantonese corpus data as shown in the reduction of reconstruction error on the test set.

A visual assessment of the quality of the trained adversarial autoencoder can be achieved by comparing original f0 contours with their corresponding reconstructions. Several reconstructed examples are shown in Figure 4.14. Like the reconstructions of the adversarial autoencoder trained on Mandarin, there is smoothing seen in the reconstructions.

Figure 4.14: Reconstructed Cantonese f0 contours. The top row presents ground-truth exemplars; the bottom row presents corresponding reconstructions.

4.2.4.2 Hypothesized Tone Inventories

Once the adversarial autoencoder converged, each data point (training and testing) was passed through the system to generate its corresponding 2-dimensional latent code (§3.3). The latent codes were compiled and then clustered using hierarchical clustering (§3.4).
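This compress–cluster–decode loop can be summarized in a few lines. In the sketch below, the trained encoder and decoder are replaced by random linear maps (and the 31-dimensional acoustic parameterization is likewise a stand-in) purely so the example runs end to end; only the overall shape of the pipeline is meant to mirror the method.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

rng = np.random.default_rng(0)

# Stand-ins for the trained encoder/decoder halves of the adversarial
# autoencoder (hypothetical: random linear maps keep the sketch runnable).
W = rng.normal(size=(31, 2))
encode = lambda x: x @ W          # (n, 31 acoustic params) -> (n, 2) codes
decode = lambda z: z @ W.T        # (n, 2) -> (n, 31) acoustic space

frames = rng.normal(size=(1000, 31))   # stand-in syllable-frame parameters
codes = encode(frames)                 # 2-D latent codes
Z = linkage(codes, method="ward")
labels = fcluster(Z, t=7, criterion="maxclust")   # preset number of tones

# Decode each cluster centroid back into the acoustic space; the decoded
# centroids are the hypothesized tones that get visualized.
centroids = np.stack([codes[labels == k].mean(axis=0) for k in range(1, 8)])
hypothesized_tones = decode(centroids)   # one contour-like row per tone
```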
The hypothesized tone inventories of the method for Cantonese are also presented in three sets. In the first set, Figure 4.15, hypothesized tones are shown side by side with error bars corresponding to the range of variability seen in the cluster that is visualized as that tone. Clustering was done for preset numbers of clusters ranging from two to eleven. This range was chosen given hindsight of the results of the evaluation metrics for Cantonese.

Figure 4.15: Hypothesized tones as generated by the method for Cantonese (two through eleven tones), visualized as f0 contours. Each pane corresponds to the set of hypothesized tones for a preset number of tones. Error bars represent variability around the median f0 values.

In the second set, Figure 4.16, all reconstructions corresponding to a cluster identified in the latent space are visualized. Each frame provides a visualization of the variability seen in its associated cluster.

Figure 4.16: Visualization of the variability of each tone cluster identified by the method for Cantonese for a preset number of tones (two through eleven). Each plotted line corresponds to an f0 contour within the identified cluster.

The hypothesized tone inventories for Cantonese are now considered as mean f0 contours overlaid on a single graph. Figure 4.17 presents the hypothesized tone inventories with two and three tones. Like Mandarin, the analysis of Cantonese with two tones consists of a high-level and a low-level tone. With three tones, the method hypothesizes a tone inventory comprising a high-level, a low-level, and a rising tone.

Figure 4.17: Hypothesized tones for Cantonese with an inventory comprising two tones (left) and three tones (right).

With four tones, the method hypothesizes a tone inventory consisting of a high-level, a mid-level, a low-level and a rising tone. With five tones, the inventory maintains four tones that are very similar to those of the four-tone analysis and adds a falling tone.

Figure 4.18: Hypothesized tones for Cantonese with an inventory comprising four tones (left) and five tones (right).

With six tones, the low-level tone from the five-tone analysis appears to have separated into a low-falling and a low-level tone while the other four tones remain largely unchanged. Moving to seven tones, the rising tone from the six-tone analysis has separated into two distinct rising contours, a low-rising and a high-rising tone. Impressionistically, the tone inventory with seven tones appears quite similar to the assumed tones of this dialect shown in Figure 4.12. The assumed tones also comprise three level tones, a low-falling tone and two rising tones.

Figure 4.19: Hypothesized tones for Cantonese with an inventory comprising six tones (left) and seven tones (right).

With eight tones, the mid-level tone from the seven-tone analysis appears to have separated into a slightly falling mid-level tone and a level mid-level tone with the other tones remaining unchanged. With nine tones, a very low level tone is added to the hypothesized tone inventory. With ten tones, the low-falling tone from the nine-tone analysis appears to have separated into two variants of low-falling tones. Finally, with eleven tones, the high tone from the previous ten-tone analysis appears to have separated into two high-level tones while the other tones remain largely unchanged.

Figure 4.20: Hypothesized tones for Cantonese with an inventory comprising eight through eleven tones.

4.2.4.3 Cluster Evaluation

The series of hypothesized tone inventories for Cantonese presented above provides an interesting view of how the Cantonese tone inventory may be structured, but again one of the goals of this work is to identify the correct number of tones. Unfortunately, as with Mandarin, the metrics do not provide a consistent answer. The dendrogram, shown in Figure 4.21, indicates the optimal number of tones for Cantonese is ten.

Figure 4.21: Dendrogram evaluation of Cantonese tone clusterings. By cutting the longest distance of the dendrogram, the optimal clustering comprises ten tones.

Figure 4.22 presents the results of the cluster evaluation metrics that balance within- and between-cluster variance. Again, a high value for the CH-Index and Silhouette Index indicates an optimal clustering, and a low value for the DB-Index indicates an optimal clustering.

Figure 4.22: Variance evaluations of Cantonese tone clusterings. The CH-Index (left) indicates the optimal number of clusters is nine (although five is quite close); the DB-Index (centre) indicates five; and the Silhouette Index (right) also indicates five.

For Cantonese, the CH-Index indicates that the optimal clustering would be nine (although five is quite close); the DB-Index indicates that the optimal clustering is five; finally, the Silhouette Index indicates the optimal clustering is five.

Given the tones assumed for this variety of Cantonese, presented in Figure 4.12, the assumed number of tones should be seven. None of the evaluation metrics has indicated such a clustering. However, there may be sensible explanations for the results of the evaluation metrics.
As there are three checked tones that were not included in the assumed inventory of Cantonese, if checked tones were added, the correct number of tones would be ten, which matches the dendrogram result. That said, I am hesitant to accept this explanation given the messiness of the hypothesized tone inventory that comprises ten tones.

Additionally, the variance-based metrics indicate five is the optimal number of tones, which aligns well with what Maddieson (1978) assumes to be the number of pitch levels in a language with a complex tonal system (which Cantonese does have). However, given that the hypothesized tone inventory comprising five tones is not composed of five level tones, it is hard to accept this specific justification. There is, however, an alternative to assuming that the five tones would need to be pitch levels. If the tones of the five-tone analysis are taken as autosegments, the inventory comprises high (H), mid (M), low (L), rising (R), and falling (F). These five tones could, in combination, elegantly describe the standardly reported Cantonese tone inventory as follows: for T1a, 5-5 = H; for T1b, 5-1 = HF; for T2, 3-5 = MR; for T3, 3-3 = M; for T4, 2-1 = LF; for T5, 1-3 = LR; for T6, 2-2 = L.
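For concreteness, that decomposition can be written out as a small lookup table. The tuple encoding below (a register autosegment plus an optional contour autosegment) is my own illustrative convention, not notation taken from this thesis.

```python
# The seven reported GuangZhou Cantonese tones expressed as combinations
# of the five hypothesized autosegments (H, M, L, R, F), following the
# mapping given in the text above.
CANTONESE_AS_AUTOSEGMENTS = {
    "T1a": ("H",),        # 5-5 high-level
    "T1b": ("H", "F"),    # 5-1 high-falling
    "T2":  ("M", "R"),    # 3-5 high-rise
    "T3":  ("M",),        # 3-3 mid-level
    "T4":  ("L", "F"),    # 2-1 low-fall
    "T5":  ("L", "R"),    # 1-3 low-rise
    "T6":  ("L",),        # 2-2 low-level
}
```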
Another interpretation of the identification of five as the optimal number of tones may relate to tone mergers. If T2-T5 and T4-T6 are collapsed into single categories, the optimal number of tones for this dialect of Cantonese would be five. This may be a tempting prospect, because language change is often overlooked in order to align one's results with historically established analyses of a language.

4.2.5 Discussion

The results for the Cantonese case study are quite similar to those of the Mandarin one. There is again a clear correspondence between the standardly assumed tone inventory for the language and the hypothesized tone inventory that comprises the same number of tones, which again provides support for Emergent Phonology. However, there is disagreement between the evaluation metrics. Figure 4.23 presents a summary of the method for Cantonese. It presents four visualized tone inventories. The first inventory is the assumed correct inventory for Cantonese, which was generated using ground-truth labels for the corpus data. The subsequent three are all hypothesized tone inventories generated by the method. In order from left to right, the first hypothesized inventory matches the prescribed number of tones for Cantonese, seven. Next, the hypothesized inventory identified as optimal by the variance-based metrics is presented. Finally, the hypothesized inventory identified as optimal by the dendrogram is presented.

Figure 4.23: A comparison of the standard analysis of Cantonese tones (left) with hypothesized tone inventories (generated by the method). The hypothesized inventories contain: (a) the same number of tones as is standardly reported for the language (seven); (b) the optimal number of tones as determined by variance metrics (five); and (c) the optimal number of tones as determined by the dendrogram (ten).

Again, the evaluation metrics did not consistently identify the optimal number of tones for Cantonese. What is more, no metric identified seven as the optimal number of tones. The dendrogram identified ten tones as the optimal number and the variance-based metrics identified five tones as the optimal number.

There is, however, a finding in this case study that is worth remarking on – the interesting parallel between the on-going tone mergers of Cantonese and the way in which the hypothesized tone inventories develop as they get larger. Ou (2012) identifies three pairs of confusable tones in GuangZhou Cantonese, T4-T6, T3-T6 and T2-T5. The T4-T6 merger is one between a low-level and a low-falling tone. This observation is paralleled as the hypothesized tone inventories expand from five to six tones, shown in Figure 4.24.

Figure 4.24: Hypothesized tones for Cantonese with an inventory comprising five tones (left) and six tones (right). The focus here is on the separation of the low-level tone in the five-tone analysis into a low-level and a low-falling tone in the six-tone analysis. This pattern mirrors the on-going tone merger of T4 and T6 in Cantonese.

Additionally, the T2-T5 merger is one between a low-rising and a high-rising tone. This observation is paralleled as the hypothesized tone inventories expand from six to seven tones, shown in Figure 4.25.

Figure 4.25: Hypothesized tones for Cantonese with an inventory comprising six tones (left) and seven tones (right). The focus here is the separation of the low-rising tone in the six-tone analysis into a low-rising and a high-rising tone in the seven-tone analysis. This pattern mirrors the on-going tone merger of T2 and T5 in Cantonese.

4.3 Case Study III: Fungwa

Fungwa is a Benue-Congo language spoken in Nigeria. It is spoken by approximately 1000 speakers along the Pandogari-Alawa road, in Rafi Local Government Area (LGA), Niger state. The language is, as yet, largely unstudied except for the recent work of my colleague Samuel Akinbo (Akinbo, 2018, 2019). Akinbo's preliminary analysis is that the language has two contrastive tones, a high and a low tone.⁴ The Fungwa data were recorded by Akinbo firsthand during a series of fieldwork trips between 2015 and 2018.

Keeping in line with Akinbo's initial analyses, the assumed 'correct' tone system of Fungwa comprises two tones, high and low. This inventory is presented in Figure 4.26, generated by estimating the f0 of two tokens from a single male talker of Fungwa. This was done because ground-truth tone labels were not incorporated into the forced-alignment process (due to data sparsity) and as such were not available for use in generating averaged f0 contours.

Figure 4.26: F0 contours of the two tones in Fungwa. These contours are exemplars of high and low tones taken from a single male speaker in the corpus of this case study.

4 Akinbo notes that a falling tone can occur on a word-final syllable, but it is not a contrastive tone.

4.3.1 Motivation for Inclusion

Fungwa was chosen because it is a largely unanalyzed language and, as such, it provides a unique opportunity to consider the practical usefulness of the method for a linguist doing fieldwork.

4.3.2 Corpus Data

The data were recorded from 42 speakers (19 females and 23 males) over a period of 10 months across 3 years (Akinbo, 2018). The data were recorded using a Rode NGT2 super-cardioid condenser shotgun microphone with a sampling rate of 48kHz.
The recordings consist of both elicited (i.e. wordlist) and spontaneous speech.

4.3.2.1 Data Preprocessing

Much like the data in the Cantonese case study, the Fungwa data comprise audio recordings and corresponding transcriptions. There were no time markings in the transcriptions, so forced alignment was used to demarcate speech sounds. This was again done using the Montreal Forced Aligner (McAuliffe et al., 2017), with the acoustic speech sound models of the aligner being trained simultaneously on the dataset itself. Alignment was spot-checked by the author. Thereafter, the acoustic parameters were estimated and normalized in accordance with §3.2. As previously stated, utterances with f0 estimations that did not match between Praat and REAPER (within 10Hz) were discarded. In this corpus, that amounted to ≈ 8% of f0 samples being discarded (aligned text did not have tone labels in this corpus, so I cannot report distributional information on what tone tokens were discarded). The syllable-frame for the Fungwa data was the syllable nucleus. Once broken down into syllable-frame audio chunks, the training data consisted of 1398 syllable-frames. The corpus was then divided randomly into 90% training data and 10% testing data.

4.3.3 Results

4.3.3.1 Adversarial Autoencoder Performance

The adversarial autoencoder was trained using two NVIDIA GTX 1070s and training took approximately 20 minutes. The adversarial autoencoder converged after ≈ 90 epochs through the dataset, as seen in the reduction of the root-mean-squared error on the testing data, shown in Figure 4.27.

Figure 4.27: Adversarial autoencoder convergence for the Fungwa corpus data as shown in the reduction of reconstruction error on the test set.

A visual assessment of the adversarial autoencoder is also possible by comparing ground-truth f0 contours with those reconstructed from the autoencoder. These are shown in Figure 4.28. Like the reconstructions of Cantonese and Mandarin, there is notable smoothing.

Figure 4.28: Reconstructed Fungwa f0 contours. The top row presents ground-truth exemplars; the bottom row presents corresponding reconstructions.

The hypothesized tone inventories are again presented in three sets. In the first set, Figure 4.29, hypothesized tones are shown side by side with error bars corresponding to the range of variability seen in the cluster that is visualized as that tone. Clustering was done for preset numbers of clusters ranging from two to nine. This range was chosen given hindsight of the results of the evaluation metrics.

Figure 4.29: Hypothesized tones as generated by the method for Fungwa (two through nine tones), visualized as f0 contours. Each pane corresponds to the set of hypothesized tones for a preset number of tones. Error bars represent variability around the median f0 values.

In the second set, Figure 4.30, all reconstructions corresponding to a cluster identified in the latent space are visualized. Each frame provides a visualization of the variability seen in its associated cluster.

Figure 4.30: Visualization of the variability of each tone cluster identified by the method for Fungwa for a preset number of tones (two through nine). Each line corresponds to an f0 contour within an identified cluster.

The hypothesized tone inventories for Fungwa are now presented as mean f0 contours overlaid on a single graph.
With two hypothesized tones, the analysis for Fungwa looks quite similar to those of the previous case studies, an inventory comprising two level tones, a high and a low level tone. With three tones, the inventory comprises three level tones.

Figure 4.31: Hypothesized tones for Fungwa with an inventory comprising two tones (left) and three tones (right).

The next set of clusterings of Fungwa (for four, five and six tones) presents an interesting pattern. With four tones hypothesized, the low-level tone from the three-tone analysis appears to have separated into a low-level and a low-falling tone, but the high-level and mid-level tones remain visually unchanged. Next, with five tones hypothesized, the mid-level tone from the four-tone analysis appears to separate in a similar fashion to the low-level tone previously, into a mid-level and a mid-falling tone. Finally, the pattern repeats a third time with six hypothesized tones, whereby the high-level tone from the five-tone analysis appears to have separated into two tones, a high-level and a high-rising tone. This sequence (in which three level tones appear to separate in a similar fashion) is only observed in the Fungwa case study.

Figure 4.32: Hypothesized tones for Fungwa with an inventory comprising four through six tones.

With seven tones, the method hypothesizes a low-level tone, a mid-low level, a low-falling, a mid-level, a mid-falling, a high-level and a high-rising tone. With eight tones, the mid-level tone of the seven-tone analysis appears to have separated into two mid-level variants with the other tones remaining largely unchanged. Finally, with nine tones, the low-falling tone from the seven- and eight-tone analyses appears to separate into two variants of a low-falling tone.

Figure 4.33: Hypothesized tones for Fungwa with an inventory comprising seven through nine tones.

4.3.3.2 Cluster Evaluation

As with the other case studies, the series of hypothesized tone inventories for Fungwa may provide useful information, but one of the goals of this work is to identify the correct number of tones. This goal is of more import in the current case study because these data come from a fieldworker who has been sorting out an analysis. Again, however, the metrics do not provide a consistent answer for the optimal number of tones for Fungwa. The dendrogram, shown in Figure 4.34, indicates the optimal number of tones is four.

Figure 4.34: Dendrogram evaluation of Fungwa tone clusterings. By cutting the longest distance of the dendrogram, the optimal clustering comprises four tones.

Figure 4.35 presents the results of the cluster evaluation metrics that balance within- and between-cluster variance. In this case study, as with the Mandarin case study, all three variance-based metrics are in agreement that the optimal number of tones for Fungwa is two.

Figure 4.35: Variance evaluations of Fungwa tone clusterings. The CH-Index (left) indicates the optimal number of clusters is two; the DB-Index (centre) indicates two; and the Silhouette Index (right) also indicates two.

Given the analysis of Akinbo, the assumed number of tones in Fungwa is two. This matches the result of all three variance-based metrics; however, it does not match the result of the dendrogram, which indicated the optimal number of clusters is four.
In light of the mismatches seen in the evaluation metrics of the previous case studies, it is difficult to interpret these results.

4.3.4 Discussion

The results of this case study mirror those of the previous case studies. In particular, a tone inventory (comprising two tones) for Fungwa that matches the analysis put forth by Akinbo (2018) was achieved. However, the evaluation metrics did not consistently identify that inventory. Figure 4.36 presents a summary of the method's analyses for Fungwa. It presents four visualized tone inventories. The first inventory is the assumed correct inventory for Fungwa, which was generated using ground-truth labels for the corpus data. The subsequent three are all hypothesized tone inventories generated by the method. In order from left to right, the first hypothesized inventory matches the prescribed number of tones for Fungwa, two. Next, the hypothesized inventory identified as optimal by the variance-based metrics is presented. Finally, the hypothesized inventory identified as optimal by the dendrogram is presented.

Figure 4.36: A comparison of the standard analysis of Fungwa tones (left) with hypothesized tone inventories (generated by the method). The hypothesized inventories contain: (a) the same number of tones as is standardly reported for the language (two); (b) the optimal number of tones as determined by variance metrics (two); and (c) the optimal number of tones as determined by the dendrogram (four).

As there is little previous research on Fungwa, it is challenging to contextualize these results. When more information becomes available, it may be fruitful to reconsider the tone inventories hypothesized by the method for Fungwa. One possible investigation could be to consider the phonological process of downstep, the successive lowering of high-tone pitch targets (see Chapter 2 in Pulleyblank, 1986).

4.4 Case Study IV: English

English is an Indo-European language spoken widely across the world both natively and as a second language. As English is not a tone language, this case study is fundamentally different from the previous three. Nonetheless, English speakers do make extensive use of pitch in both intonation and stress (Silverman et al., 1992; Gussenhoven and Wright, 2015; Gussenhoven, 2008), so it is unclear what the method will hypothesize when it is applied to English language data. Several outcomes are imaginable: (1) the method may hypothesize a tone inventory comprising two tones (high and low), which would match well with the acoustic cues of stress in English; (2) the method may hypothesize multiple level tones, which may match with how pitch varies throughout the intonation of a sentence; or (3) the method may hypothesize tones that are largely incomprehensible. As English is not a tone language, there is no assumed 'correct' ground-truth inventory.

4.4.1 Motivation for Inclusion

English was selected as a loose control for the method. While it is not the case that pitch has no function in English, it is the case that its function is not to lexically or grammatically distinguish words. Thus, this case study is motivated by a desire to see how the method hypothesizes tones for a language that does not have lexical or grammatical tone. Ideally, the results for English will be markedly different in some way from the previous three case studies.

4.4.2 Corpus Data

There are several time-aligned corpora of English data available to researchers.
Taking advantage of this fact, in this case study the method is applied to two English language corpora: the Buckeye Speech Corpus (Pitt et al., 2005) and the TIMIT speech corpus (Garofolo, 1993). The Buckeye Speech Corpus contains spontaneous speech that was elicited in an interview. The corpus contains approximately 300,000 words from 40 speakers of English from Columbus, Ohio. The TIMIT corpus contains read speech from 630 speakers, each speaking 10 sentences. The sentences were constructed to be phonetically compact, meaning there was good coverage of all phonemes of English. As the two corpora contain different kinds of speech (spontaneous versus read), analyzing both provides a unique opportunity to investigate the method's robustness and consistency (i.e. to see whether it returns similar results for variable types of speech).

4.4.2.1 Data Preprocessing

As both corpora used in this case study are time-aligned, the only preprocessing needed was to extract and normalize the acoustic parameters as discussed in §3. As stated, f0 estimates that were mismatched (beyond 10Hz) between Praat and REAPER were discarded. For TIMIT, this resulted in ≈ 16% of f0 samples being discarded. The syllable-frame for TIMIT was the syllable nucleus and the total training set size was 16718 syllable-frames. For Buckeye, ≈ 28% of f0 samples were discarded. The syllable-frame for Buckeye was also the syllable nucleus and the total training set size was 47297 syllable-frames. The results of the method for TIMIT are presented first, followed by the results for Buckeye. Discussion is left until after the hypothesized tone inventories for both corpora have been presented.

4.4.3 TIMIT Results

4.4.3.1 Adversarial Autoencoder Performance

The adversarial autoencoder was trained using two NVIDIA GTX 1070s and training took approximately 1.5 hours. The adversarial autoencoder converged after 120 epochs through the dataset. This is evident in the reduction of the root-mean-squared error of the reconstructions on the test data, shown in Figure 4.37.

Figure 4.37: Adversarial autoencoder convergence for the TIMIT corpus (English) data as shown in the reduction of reconstruction error on the test set.

A visual assessment of the model is also possible by comparing ground-truth f0 contours with those reconstructed from the autoencoder. These are shown in Figure 4.38.

Figure 4.38: Reconstructed English (TIMIT) f0 contours. The top row presents ground-truth exemplars; the bottom row presents corresponding reconstructions.

4.4.3.2 Hypothesized Tone Inventories

As with previous case studies, once the adversarial autoencoder converged, each data point (training and testing) was passed through the system to generate its corresponding 2-dimensional latent code. The latent codes were compiled and then clustered using hierarchical clustering.

The hypothesized tone inventories are again presented in three sets. In the first set, shown in Figure 4.39, hypothesized tones are shown side by side with error bars corresponding to the range of variability seen in the cluster that is visualized as that tone. Clustering was done for preset numbers of clusters ranging from two to nine. This range was chosen given hindsight of the results of the evaluation metrics.

Figure 4.39: Hypothesized tones as generated by the method for English (TIMIT), visualized as f0 contours (two through nine tones). Each pane corresponds to the set of hypothesized tones for a preset number of tones.
Error bars represent variability around the median f0 values.

In the second set, shown in Figure 4.40, all reconstructions corresponding to a cluster identified in the latent space are visualized. Each frame provides a visualization of the variability seen in its associated cluster.

Figure 4.40: Visualization of the variability of each tone cluster identified by the method for English (TIMIT) for a preset number of tones (two through nine). Each line corresponds to an f0 contour within an identified cluster.

The hypothesized tone inventories are now considered using single graphs with overlaid f0 contour means. With the two- and three-tone analyses for English in the TIMIT corpus, the method hypothesizes tone inventories that comprise level tones.

Figure 4.41: Hypothesized tones for English (TIMIT) with an inventory comprising two tones (left) and three tones (right).

With four tones, the method hypothesizes four level tones, a high, a mid-high, a mid-low and a low level tone. With five tones, the hypothesized inventory comprises four level tones and a fifth high-falling tone.

Figure 4.42: Hypothesized tones for English (TIMIT) with an inventory comprising four tones (left) and five tones (right).

When analyzed with six tones, the method hypothesizes an inventory that comprises five level tones and a high-falling tone similar to that in the five-tone analysis. With seven tones, the inventory comprises five level tones, a falling tone and a rising tone. With eight tones, the mid-high level tone of the seven-tone analysis appears to separate into two variants with the other six tones remaining largely unchanged. Finally, with nine tones, the previously identified falling tone appears to separate into two variants that contrast in steepness/slope.

Figure 4.43: Hypothesized tones for English (TIMIT) with an inventory comprising six through nine tones.

4.4.3.3 Cluster Evaluation

The cluster evaluation metrics for the TIMIT English data again do not provide a consistent result for the optimal number of clusters. Figure 4.44 presents the dendrogram evaluation, which indicates that the optimal number of clusters for the TIMIT data is five.

Figure 4.44: Dendrogram evaluation of English (TIMIT) clusterings. By cutting the longest distance of the dendrogram, the optimal number of clusters is five.

The second set of metrics, shown in Figure 4.45, compares within- and across-cluster variance. In this case study, as with the Mandarin and Fungwa case studies, all three variance-based metrics are in agreement that the optimal number of tones for the TIMIT data is two.

Figure 4.45: Variance evaluations of English (TIMIT) clusterings. The CH-Index (left) indicates the optimal number of clusters is two; the DB-Index (centre) indicates two; and the Silhouette Index (right) also indicates two.

4.4.4 Buckeye Results

4.4.4.1 Adversarial Autoencoder Performance

The adversarial autoencoder was trained using two NVIDIA GTX 1070s and training took approximately 3.5 hours. The adversarial autoencoder converged after ≈ 200 epochs through the Buckeye corpus data.
This is seen in the reduction of the root-mean-squared error of the reconstructions, shown in Figure 4.46.

Figure 4.46: Adversarial autoencoder convergence for the Buckeye corpus (English) data as shown in the reduction of reconstruction error on the test set.

A visual assessment of the model is possible by comparing ground-truth f0 contours with those reconstructed from the autoencoder. These are shown in Figure 4.47.

Figure 4.47: Reconstructed English (Buckeye) f0 contours. The top row presents ground-truth exemplars; the bottom row presents corresponding reconstructions.

4.4.4.2 Hypothesized Tone Inventories

Once the adversarial autoencoder converged, each data point (training and testing) was passed through the system to generate its corresponding 2-dimensional latent code. The latent codes were compiled and then clustered using hierarchical clustering.

The hypothesized tone inventories are again presented in three sets. In the first set, Figure 4.48, hypothesized tones are shown side by side with error bars corresponding to the range of variability seen in the cluster that is visualized as that tone. Clusterings were done for preset numbers of clusters ranging from two to nine (as with the TIMIT data).

Figure 4.48: Hypothesized tones as generated by the method for English (Buckeye), visualized as f0 contours (two through nine tones). Each pane corresponds to the set of hypothesized tones for a preset number of tones. Error bars represent variability around the median f0 values.

In the second set, Figure 4.49, all reconstructions corresponding to a cluster identified in the latent space are visualized. Each frame provides a visualization of the variability seen in its associated cluster.

Figure 4.49: Visualization of the variability of each tone cluster identified by the method for English (Buckeye) for a preset number of tones (two through nine). Each line corresponds to an f0 contour within an identified cluster.

The hypothesized tones are now considered as overlaid f0 means on a single graph. With two tones hypothesized for the Buckeye data, the method hypothesized a high-level and a low-level tone. With three tones, the inventory comprises three level tones.

Figure 4.50: Hypothesized tones for English (Buckeye) with an inventory comprising two tones (left) and three tones (right).

The tone inventory hypothesized for the Buckeye data with four tones comprises four level tones. It is worth noting that this is the same result as that of the TIMIT data. With five tones, the method hypothesizes a high-falling tone in addition to four level tones. This is again consistent with the fifth tone hypothesized for the TIMIT data, although the shape of the high-falling tone contour is different. In fact, this high-falling f0 contour is the least regular of all tones identified by the method in any case study.

Figure 4.51: Hypothesized tones for English (Buckeye) with an inventory comprising four tones (left) and five tones (right).

As the number of hypothesized tones increases to six and seven, the number of level tones also increases in parallel.
With eight tones, an additional rising tone is hypothesized. Finally, with nine tones, the previously second-highest level tone appears to have separated into two variants with the other tones remaining largely unchanged.

Figure 4.52: Hypothesized tones for English (Buckeye) with an inventory comprising six through nine tones.

4.4.4.3 Cluster Evaluation

With the Buckeye data, the dendrogram evaluation, presented in Figure 4.53, indicates that the optimal number of clusters is four.

Figure 4.53: Dendrogram evaluation of English (Buckeye) clusterings. By cutting the longest distance of the dendrogram, the optimal number of clusters is four.

The metrics that balance within- and between-cluster variance are shown in Figure 4.54. They are all in agreement that the optimal number of clusters for the Buckeye data is two.

Figure 4.54: Variance evaluations of English (Buckeye) clusterings. The CH-Index (left) indicates the optimal number of clusters is two; the DB-Index (centre) indicates two; and the Silhouette Index (right) also indicates two.

4.4.5 Discussion

There are a few notable aspects of the results for English from both the TIMIT data and the Buckeye data. One of the more critical findings is that the results from both datasets are fairly consistent with each other. This is particularly true for clusterings up to six tones, in which the method initially hypothesizes four level tones, followed by a high-falling tone, and finally an additional high level tone. What is more, the evaluation metrics between the two datasets are quite similar. In both TIMIT and Buckeye, the variance-based metrics indicate that the optimal number of clusters for English is two. The dendrogram results vary slightly, but not drastically, with the TIMIT dendrogram indicating the optimal number of clusters is five and the Buckeye dendrogram indicating four. For consideration, the two-cluster hypothesized tone inventories from both datasets and the five- and four-cluster hypothesized tone inventories from TIMIT and Buckeye, respectively, are presented in Figure 4.55.

Figure 4.55: Hypothesized tones for English given the optimal clusterings identified by the evaluation metrics: two tones (TIMIT) and two tones (Buckeye), top row; five tones (TIMIT) and four tones (Buckeye), bottom row.

This comparison suggests that the method is at least somewhat robust to speech style. Further, following up on the possible results hypothesized at the beginning of this case study, it seems the method is identifying a tone inventory of English that may be consistent with the stressed/unstressed patterning of English words. Finally, it is worth noting that the hypothesized tone inventories of English are the only ones in which the first four hypothesized tones are level tones. This result seems to set English apart from the other three languages.

4.5 Cross-Language Comparison

Before transitioning to the general conclusion of Chapter 5, it is worthwhile to briefly compare the case studies. To that effect, Table 4.3 presents the final results (i.e. the determined optimal number of tones) of the method for each case study.
It also includes the standardly reported number of tones for each language for comparison.

Table 4.3: A comparison of the optimal number of tones for a language as determined by the method with that standardly reported in the literature.

Language:                          Mandarin   Cantonese   Fungwa   English
Standard tone inventory size:      5          7           2        N/A
Hypothesized size (dendrogram):    5          10          4        5/4
Hypothesized size (CH-Index):      2          9/5         2        2
Hypothesized size (DB-Index):      2          5           2        2
Hypothesized size (Silhouette):    2          5           2        2

While it is immediately clear that the evaluation metrics did not consistently identify the 'correct' number of tones for the languages, there is still one clear pattern. Specifically, the metrics indicate that Cantonese is the language with the most complex tone inventory. This is seen in the fact that the values in the Cantonese column are consistently higher than the others. This is a nice indication that the evaluation metrics are at least on the right track towards helping achieve human-like language analyses.

A final observation worth remarking on is the similarity of hypothesized tone inventories across all languages. For all languages, the two-tone analyses match a low/high tone system. This may be a significant result given that the most common tone inventory typologically comprises a high and a low tone (Maddieson, 1978). As tone inventories get larger, there is often the addition of one or more level tones. Studying how inventories change from fewer to more tones with other language data or dummy data may be an interesting investigation in the future.

Chapter 5

General Discussion, Future Directions and Conclusion

This chapter provides a summary of the thesis, discusses possible next steps for the research program originated in this thesis, and then provides concluding remarks. The summary reiterates the original goals of this work and considers how well they were achieved. The next steps are separated into three sections, each with a specific aim. The first section (§5.2) investigates a pair of presently available uses of the method for phonological research; specifically, it considers the method's sensitivity to allotones and its ability to contrast individual speakers' tone inventories. The second section (§5.3) discusses several areas of on-going phonological research in which the method may be a useful tool. The third section (§5.4) outlines a list of refinements that should be implemented to improve the utility of the method in the future. The chapter ends with a brief conclusion.

5.1 Summary of the project and results

This thesis has argued that machine learning is a valuable analysis tool that should be used by linguists to address theoretical questions. By demonstrating that unsupervised machine learning is able to posit tone inventories from an acoustic parameterization of speech, the central tenet of Emergent Phonology, that phonology emerges from phonetics, was supported. The method that was used to generate the tone inventories comprised three stages: preprocessing raw speech into the acoustic parameters of f0 and syllable-frame duration; using an adversarial autoencoder to reduce the dimensionality of the acoustic parameters while simultaneously extracting higher-level features (in the sense of levels of abstraction); and clustering the higher-level features using hierarchical clustering. The centroids of the clusters were then reconstructed, via the autoencoder, back into the acoustic space and subsequently visualized.
The visualized tones were then compared to the standardly reported tones for three languages: Mandarin, Cantonese and Fungwa. The method also attempted to determine the optimal number of clusters for a given dataset using several clustering metrics and a dendrogram of the dataset.

Although the method did not consistently identify an optimal number of clusters, it was shown that the hypothesized tone inventories (of the same size as is standardly assumed for a language) matched fairly well with the standardly reported analyses of each language. This result was repeated in all three tone languages of the case studies. Given this, the method has shown that a language's tone inventory (part of its phonology) can emerge solely from its phonetics. This can be taken as a strong piece of evidence for the theory of Emergent Phonology and is a clear demonstration of machine learning's value as a tool to address theoretical questions in linguistics. Additionally, the application of the method to the understudied language of Fungwa suggests the method may, after refinements, have future use for fieldworkers.

The supplementary goals of this thesis were to strengthen the impact of computational linguistics on classical areas of linguistics (phonetics and phonology specifically), and to provide a first step towards a grammaticus ex machina – a linguist (grammarian) from the machine. The former goal has tacitly been achieved by the existence of this thesis, which has applied machine learning techniques to research on the phonetics-phonology interface. The latter goal may have been satisfactorily achieved as a first step, but there is still much to do. In particular, the method did not autonomously determine the optimal number of tones for a given language, and this is a crucial component of any linguist's analysis. However, the method was naive to the meaning of words, and contrasts of meaning are a fundamental piece of evidence used by linguists when determining the number of contrastive units in a language. A discussion of how meaning could be incorporated into the method is provided in §5.4. I now consider possible next steps for the method.

5.2 Presently available uses of the method

The computational nature of the method developed in this thesis allows it to be repurposed relatively straightforwardly for related research questions. Two examples of such repurposings are considered here. First, an investigation of allotones in Mandarin is presented; this is followed by a comparison of individual speakers' tone inventories in Cantonese.

5.2.1 Allotones in Mandarin

In the Mandarin case study (§4.1.1), the falling-rising tone of Mandarin (2-1-4) was assumed to be realized phonetically as a low-falling tone (2-1). This assumption was made because the corpus data used was continuous speech, and in non-utterance-final position the falling-rising tone surfaces as a low-falling one (Xu, 1997). These two variants, as they are not lexically contrastive and occur in complementary distribution (utterance-final versus non-utterance-final), are known as allophones. Allophones are "contextual variants" of a phoneme that are "non-contrastive" (De Lacy, 2007, p. 139). To provide an additional example, consider the two allophones of the phoneme /l/ in North American English. In syllable-initial position, /l/ surfaces phonetically as [l] (as in the word lit); in syllable-final position, however, the back of the tongue is raised and /l/ surfaces as the velarized [ɫ] (as in the word till) (Giles and Moll, 1975).
When allophones are variants of a phonological tone, they are often referred to as allotones (e.g. Yip, 2002, p. 120). Thus, the 2-1-4 variant and the 2-1 variant of the falling-rising tone in Mandarin are allotones.¹

1 The neutral tone in Mandarin may also be analyzed as an allotone given its dependence on preceding tones (Wang, 2004).

Given the existence of allotones, a sensible question to ask is whether the method identifies them. As the method already hypothesizes tone inventories of varying sizes (i.e. it is able to hypothesize tones well beyond the standard number reported in a language), this question becomes one of discerning whether the hypothesized tones that are not part of the standard tone inventory of a language are, in fact, allophonic variants. Ideally, the clustering evaluation metrics could be used to answer this question (i.e. clusterings that match the language data well are more likely to be ones that capture allophonic variation); unfortunately, as the metrics were somewhat puzzling in the case studies, an alternative is needed.

One option to determine whether the method identifies allotones is to use the ground-truth labels² from the dataset itself for evaluation. This option is only possible for corpora that have ground-truth labels available, but it is worth exploring for our current purpose. Figures 5.1 and 5.2 present a series of hypothesized tone inventories of Mandarin with the distribution of ground-truth tones below them. These hypothesized tone inventories are the same as the ones reported in the case study of §4.1.1. The corresponding distribution under each hypothesized tone represents the percentage of ground-truth tones (T1, T2, T3, T4, T0 (neutral)) that occur within the cluster that the tone corresponds to. As before, the first tone of the six-tone analysis of Figure 5.1 can be interpreted as a hypothesized high-level tone. Recall that that tone was reconstructed using a cluster identified in the latent space, and as such the ground-truth labels for the points of that cluster are available. For the high-level tone then, the corresponding distribution (underneath it) shows that approximately 50% of all ground-truth Tone 1s in the corpus are part of that cluster. Graphs containing the distribution of ground-truth tone labels for all clustering analyses of Mandarin are provided in Appendix A.

2 However, as the labels do not distinguish allophonic variants based on their phonological context, the labels may not be as informative as intended.

Figure 5.1: Hypothesized tones for Mandarin (six and seven tones) and the proportion of ground-truth tone labels that occur within the cluster corresponding to each tone. The green squares highlight tones that have a significant portion of ground-truth Tone 3s (>18%).

In an effort to identify the allotones of the falling-rising tone in Mandarin (Tone 3), the hypothesized tones that contain large proportions (>18%) of ground-truth Tone 3s are highlighted with green squares.

Figure 5.2: Hypothesized tones for Mandarin (eight and nine tones) and the proportion of ground-truth tone labels that occur within the cluster corresponding to each tone. The green squares highlight tones that have a significant portion of ground-truth Tone 3s (>18%).
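The distributions plotted in Figures 5.1 and 5.2 amount to a cross-tabulation of cluster assignments against ground-truth tone labels, column-normalized so that each cell gives the share of a tone category falling within a cluster. A minimal sketch, using stand-in labels rather than the actual corpus data:

```python
import numpy as np
import pandas as pd

# labels: cluster assignment per syllable-frame (e.g., from fcluster);
# truth: ground-truth tone label per frame (from the corpus annotations).
labels = np.array([1, 1, 2, 2, 2, 3, 3, 1])   # stand-in values
truth = np.array(["T1", "T3", "T3", "T3", "T0", "T4", "T4", "T1"])

table = pd.crosstab(pd.Series(labels, name="cluster"),
                    pd.Series(truth, name="tone"))
share = table / table.sum(axis=0)   # share of each tone per cluster

# Clusters holding a significant portion (>18%) of ground-truth Tone 3s,
# the criterion used for the green squares in Figures 5.1 and 5.2.
print(share["T3"] > 0.18)
```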
The hypothesized tone inventories comprising six and seven tones do not appear to provide much indication of the known allotones of Tone 3 in Mandarin (i.e. there is no rising variant). The eight- and nine-tone analyses, however, seem to provide some indication. In particular, they both contain four hypothesized tones with a significant portion of ground-truth Tone 3s. The variants comprise a low-level tone, a mid-level tone, a low-falling tone, and a low-rising tone. The low-rising f0 contour here is beginning to show similarities to the falling-rising tone of utterance-final Tone 3s in Mandarin. While this informal investigation into allophones leaves much room for improvement, it suggests there may be some merit in using the method to study allotones.

5.2.2 Speaker differences in Cantonese

A second potential use of the method in its current formulation is the visualization of individual speakers' tone inventories. These visualizations can be generated (once the adversarial autoencoder has been trained) by clustering the data of a single speaker. The process described in §3.3 and §3.4.2 is followed as normal, whereby data is compressed, clustered and reconstructed. The result is a hypothesized tone inventory for a single speaker. Four such inventories are presented in Figure 5.3. These inventories were generated for two female speakers and two male speakers from the Cantonese corpus used in §4.2 (Adrus et al., 2016); the choice of speakers was largely random. Given that GuangZhou Cantonese contains seven contrastive tones, only the inventories comprising seven tones are visualized. Nonetheless, as the method is able to hypothesize inventories of all sizes, comparing individual speakers' tone inventories of varying sizes is also possible. Doing so may well provide insight into, for example, the classification of speakers with respect to tone mergers (i.e. how much of a merger has taken place for a given speaker). The value of these visualizations lies in the fact that they were constructed without requiring labeled tone data. It is true that one could simply plot the 'average' of tones for a given speaker if all labels were known, but the unsupervised nature of the method means they are not necessary.

Figure 5.3: Hypothesized tone inventories for four speakers of GuangZhou Cantonese (Female Speakers 12631 and 76944; Male Speakers 76733 and 40123). Each inventory comprises seven tones because GuangZhou Cantonese contains seven tones.
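Producing such single-speaker inventories requires nothing more than filtering the latent codes by speaker before clustering. A minimal sketch, assuming a speaker ID accompanies each syllable-frame (the IDs and codes below are stand-ins):

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

rng = np.random.default_rng(1)
codes = rng.normal(size=(5000, 2))    # latent codes for all talkers
speaker_ids = rng.choice(["12631", "76944", "76733", "40123"], size=5000)

# Restrict to one talker's tokens, then cluster as usual; the decoded
# centroids of these clusters form that talker's hypothesized inventory.
speaker_codes = codes[speaker_ids == "12631"]
Z = linkage(speaker_codes, method="ward")
labels = fcluster(Z, t=7, criterion="maxclust")   # seven tones assumed
```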
Figure 5.3: Hypothesized tone inventories for four speakers of GuangZhou Cantonese (Female Speakers 12631 and 76944; Male Speakers 76733 and 40123). Each inventory comprises seven tones because GuangZhou Cantonese contains seven tones.

Although a detailed analysis is not provided here, there are a few observations from Figure 5.3 that warrant comment. First, there appears to be a difference in how the two rising tones of Cantonese are realized across speakers. The rising tones for the male speakers seem to originate at a similar f0 location and then separate. For the female speakers, however, the two tones appear to be separated throughout their realizations. Second, the phonetic spaces of Female Speaker 76944 and Male Speaker 40123 appear to be more compact (smaller frequency range) than those of the other two speakers. While there is no additional information describing the speakers in the corpus, one could imagine such compactness being correlated with other qualities of a speaker, such as intelligibility (cf. McCloy et al., 2015). Lastly, these figures may provide some insight into how tone inventories are structured within a speaker's phonology. The impression is completely anecdotal, but there appears to be structure in where a speaker's tones begin and end in their f0 range. For example, if we consider Female Speaker 76944, shown in Figure 5.4, there seem to be four f0-onset and four f0-offset positions.

Figure 5.4: An observation of the structure seen in the f0 locations at which Female Speaker 76944's tones begin and end.

Although the three observations just discussed are impressionistic, they provide a brief demonstration of how the method may be used for individual speaker comparisons. What is more, observations like these could be used to generate new research questions. As suggested, one could investigate how tone-inventory compactness interfaces with intelligibility, given that pitch range has been shown to positively correlate with intelligibility (McCloy et al., 2015).

Finally, there is one comment regarding the individual speaker comparisons that I would like to make – the fact that these comparisons do not need to be restricted to individual speakers at all. Tone inventories could be hypothesized for geographically adjacent speech communities or for speech recorded across generations. One could also hypothesize inventories for a single speaker at different points in their life, essentially creating snapshots of the longitudinal development of their speech. Such data may already be available, such as the Origins of New Zealand English (ONZE) corpus (Gordon et al., 2007), which has tracked speech in New Zealand for more than 100 years.

5.3 Future applications of the method in phonological research

While the above section considered potential uses of the method that are presently available, this section considers how the method may be updated and used in the future. Specifically, it discusses three areas of ongoing phonological research in which the method may be valuable: (1) providing additional support for Emergent Phonology by demonstrating the emergence of phonological patterns; (2) making predictions about the acquisition of phonology; and (3) classifying languages (i.e., language typology).

5.3.1 The emergence of phonological patterns

In §2.1.1.4, it was stated that this thesis considered Emergent Phonology reduced to its most rudimentary form – a demonstration of the emergence of a single class of phonological units (i.e., tones). Phonology, however, encompasses much more than just units. In fact, the vast majority of phonological research focuses on the patterns/processes that the units take part in. The reason for the simplification in this thesis comes from the fact that "phonological systems do not occur in isolation" (Archangeli and Pulleyblank, 2017, p. 1) and, to investigate patterns, researchers must consider "the interface with phonetics and with morphology" (Archangeli and Pulleyblank, 2017, p. 1). I chose to exclude morphology from the current investigation to simplify my problem space, given the interdisciplinary nature of this project.

Morphology is the study of morphemes, which Haspelmath and Sims (2013) define as "[t]he smallest meaningful constituents of words that can be identified" (p. 3). For example, the English words cat and dog denote entities that are, in fact, a cat or a dog. Further, the plural morpheme /-s/ denotes the concept of 'more than one.' Thus, cat-s and dog-s denote groups of more than one entity that are, in fact, cats or dogs. The plural morpheme of English also provides a useful demonstration of how phonology interfaces with morphology. As noted in §1.2, the phonetic realization of the /-s/ morpheme in cat-s is actually different from that of dog-s. The plural morpheme of the former is a voiceless [s] and the plural morpheme of the latter is a voiced [z]. In fact, the plural morpheme /-s/ in English comprises three phonologically distinct allomorphs (variants of a morpheme): [s], [z] and [ɪz] (as in walrus-es). Crucially, the phonological patterning seen in the [s], [z], [ɪz] variants is only meaningful because of its association with plurality. If meaning is incorporated into the method, then demonstrating the emergence of phonological patterns may be achievable. Additionally, as was previously alluded to, it may be the case that incorporating meaning provides the missing puzzle piece needed to allow the method to determine the optimal number of tones for a language (just as linguists use contrastiveness to guide their analyses). The most straightforward way to incorporate meaning into the model is to train the autoencoder with additional features alongside the acoustic parameters. Two possible features could be numeric indices that denote lexical identity or a word2vec representation of the relevant word (Rong, 2014).
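One way to picture this is as simple feature concatenation before training. The sketch below is only illustrative: the function and array names (`acoustic_params`, `word_vectors`, `lexical_ids`) are hypothetical, and the per-block standardization is just one of several reasonable scaling choices.

```python
import numpy as np

def build_training_matrix(acoustic_params, word_vectors, lexical_ids):
    """Concatenate acoustic parameters with 'meaning' features (a word2vec
    vector and a numeric lexical index) into one input matrix for the
    autoencoder. Each block is standardized so that no one feature type
    dominates the reconstruction loss."""
    def z(x):
        x = np.asarray(x, dtype=np.float32)
        return (x - x.mean(axis=0)) / (x.std(axis=0) + 1e-8)
    blocks = [z(acoustic_params),
              z(word_vectors),
              z(np.asarray(lexical_ids).reshape(-1, 1))]
    return np.concatenate(blocks, axis=1)
```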
After meaning is incorporated into the method, one practical goal could be to demonstrate tone sandhi. Tone sandhi is a phonological process in which the phonetic realization of a tone changes due to its surrounding context.³ For example, in Mandarin, the falling-rising tone surfaces phonetically as a rising tone when it precedes another falling-rising tone (Shih, 1997), such as 你好 (/ni˨˩˦hao˨˩˦/ – 'hello') surfacing as /ni˧˥hao˨˩˦/. The current implementation of the method would not be able to learn this tone sandhi pattern (˨˩˦ → ˧˥) because there is no lexical information to associate the two realizations of the word 你 (/ni/ – 'you').

³Note that tone sandhi is distinct from the phonetic process of tonal coarticulation (Shen, 1992).

5.3.2 Comparison to language acquisition

In the case studies, the results of the method were presented after the adversarial autoencoder had converged (i.e., after it finished learning). This is sensible given that the goal was to determine whether the method learned something like the tones of a language, but it is not the only way that the method could be used. For example, snapshots could be taken as the model trains, with clustering occurring throughout the learning process. Such snapshots could then be compared to stages of tone acquisition in children, or perhaps used to make predictions on their own.

Speculatively, one possible way to consider the time-course of learning in the machine could be to look at the stability of regions in the latent space. Given the structure of the autoencoder, in which each point in the latent space can be decoded, we can calculate the Jacobian matrix of the decoder at any point in the latent space. The Jacobian matrix uses differentiation to quantify how stable/dynamic a point in one space is by comparing how perturbations in that space affect its corresponding decoded space (Gale and Nikaido, 1965). One hypothesis is that areas of the latent space that stabilize first will be easier for a language-acquiring child to learn.
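A framework-agnostic way to estimate this is with finite differences, as in the sketch below. The `decode` callable is a hypothetical stand-in for the trained decoder applied to a single latent vector; an automatic-differentiation library would give the same matrix exactly rather than approximately.

```python
import numpy as np

def decoder_jacobian(decode, z, eps=1e-4):
    """Finite-difference Jacobian of the decoder at latent point z:
    J[i, j] ~ d(output_i) / d(latent_j)."""
    z = np.asarray(z, dtype=np.float64)
    base = np.asarray(decode(z)).ravel()
    J = np.zeros((base.size, z.size))
    for j in range(z.size):
        dz = np.zeros_like(z)
        dz[j] = eps  # perturb one latent dimension at a time
        J[:, j] = (np.asarray(decode(z + dz)).ravel() - base) / eps
    return J

def latent_stability(decode, z):
    """Smaller norm = perturbing the latent point barely moves the decoded
    f0 contour, i.e., a more stable region of the latent space."""
    return np.linalg.norm(decoder_jacobian(decode, z), ord="fro")
```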
Additionally, and in a similar vein, the method could be used to make predictions about the time-course of second language acquisition. For example, the adversarial autoencoder could be trained on data from a first language and then applied to data from a second language. One possible investigation would be to measure how much second-language data is needed to shift the hypothesized tone space learned from the first language.

5.3.3 Language typology

A third area in which the method may find future use is in the classification of languages. Language typology is an active area of linguistic research that, in part, groups languages based on some quality of the language. For example, one commonly used quality is the order of subjects, objects and verbs in the language (Bickel, 2007). As the method is able to generate tone inventories for a language, inventories can be compared between languages to calculate, say, how 'tonal' a language is. Perhaps the method could even be used to predict whether a tonal contrast will emerge in a language (i.e., tonogenesis).

5.4 Refinements for the method

While the method has had moderate success in achieving the goals of this thesis, there are many ways in which it can be improved (even if the aims remain the same as in this project). For one, additional acoustic-phonetic features that are known to interact with f0 (with respect to tone) could be incorporated into the method. Additionally, improved clustering evaluation metrics are needed to help identify optimal clusterings for a language. Finally, there are additional steps that could be taken to reduce the user's role while applying the method, ultimately leading to a fully unsupervised implementation.

5.4.1 Incorporating additional acoustic-phonetic features

While f0 is the primary acoustic correlate of pitch (and ultimately tone), there are other aspects of the acoustic signal that interact with f0 and contribute to the perception of tone, for example: phonation type, declination and phonological context (including downstep).

Phonation type, also known as voice quality, has been shown to interact with the realizations of tones. Common types of voice quality include breathy, modal or creaky voice (Johnson, 2004). Creaky voice, for example, has been argued to play a significant role in the identification of Tone 4 (low-falling) in Cantonese (Yu and Lam, 2014) and Tone 3 (falling-rising) in Mandarin (Duanmu, 2007). A challenge to incorporating acoustic measures of voice quality into the model is that, while voice quality contrasts are perceptually salient, the acoustic dimensions that contribute to the auditory impressions vary considerably cross-linguistically.

Declination refers to the "tendency [of f0] to decline gradually during the course of utterances" (Ladd, 1984, p. 53). In other words, the pitch of a speaker's voice will, generally, lower from the beginning to the end of an utterance. Declination has been shown to have practical consequences for pitch perception in both non-tone and tone languages (Leroy, 1984; Shih, 2000; Yuen, 2007). In particular, declination results in a high tone at the beginning of an utterance being higher than a high tone at the end. While human listeners are aware of this at some level (evidenced by how they compensate for it (Ladd, 1984)), machines are not. Thus, utilizing a parameter such as place-in-utterance is a sensible addition to the method.

Finally, downstep is a process similar to declination (in that pitch lowers over time), but instead of being a general phonetic trend, it occurs based on phonological context (normally successive high tones) (Laniran and Clements, 2003). This provides additional motivation to incorporate a measure like place-in-utterance into the method.
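Such a feature is easy to derive from the forced-alignment time-markings the method already uses. A minimal sketch follows; the argument names are hypothetical, and the resulting value would simply be appended to each syllable's acoustic parameter vector.

```python
import numpy as np

def place_in_utterance(syllable_onsets, utt_start, utt_end):
    """Normalize each syllable's onset time to [0, 1] within its utterance,
    giving the model a handle on declination (and downstep-like lowering):
    0.0 = utterance-initial, 1.0 = utterance-final."""
    onsets = np.asarray(syllable_onsets, dtype=np.float64)
    span = max(utt_end - utt_start, 1e-8)  # guard against zero-length spans
    return (onsets - utt_start) / span
```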
5.4.2 New clustering evaluation metrics to discern optimal clustering

Determining the optimal number of clusters in unsupervised clustering is a challenging problem (Jain, 2010). Given that I am neither a statistician nor a machine learning expert, there is little I am able to say in terms of practical next steps here. It is nonetheless clear, however, that a more informative evaluation metric is needed if the method is to find broader use in linguistic research.

5.4.3 Achieving a fully unsupervised method

Finally, the method used in this thesis is not fully unsupervised. In particular, time-markings were used to restrict the input of the adversarial autoencoder to syllable-frames, and transcripts were used to generate those syllable-frames via forced alignment. Ideally, these processes would be replaced with something like automatic syllable identification and chunking, without resorting to transcripts or forced alignment. There is ongoing research in this direction, such as that by Leong and Goswami (2015) or Räsänen et al. (2018), in which amplitude is used to autonomously identify syllable- or syllable-rime-sized units of audio. If amplitude can be used in this way, then it may be possible to demarcate syllable chunks using amplitude alone, feed those chunks into the method, and have the method generate a hypothesized tone inventory.
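As a toy illustration of the general idea (far cruder than the cited approaches), syllable-sized chunks can be approximated by treating minima of a smoothed amplitude envelope as boundaries. The cutoff frequency and minimum-gap values below are illustrative guesses, not values from the cited work.

```python
import numpy as np
from scipy.signal import butter, filtfilt, find_peaks

def syllable_chunks(waveform, sr, env_cutoff_hz=10.0, min_gap_s=0.1):
    """Low-pass the rectified signal to get an amplitude envelope, locate
    envelope minima, and treat the spans between successive minima as
    candidate syllable-sized chunks (returned as sample-index pairs)."""
    b, a = butter(2, env_cutoff_hz / (sr / 2), btype="low")
    envelope = filtfilt(b, a, np.abs(waveform))
    # Valleys of the envelope serve as candidate syllable boundaries.
    valleys, _ = find_peaks(-envelope, distance=int(min_gap_s * sr))
    bounds = np.concatenate(([0], valleys, [len(waveform)]))
    return [(int(bounds[i]), int(bounds[i + 1]))
            for i in range(len(bounds) - 1)]
```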
5.5 Conclusion

This thesis has been an interdisciplinary investigation that aimed to establish that machine learning is a valuable tool for linguists to use to address theoretical questions. Given the results reported herein, I believe this has been established. That said, this work has fallen short of producing a grammaticus ex machina – a linguist (grammarian) from the machine – because the method was unable to consistently identify the correct number of tones for a given language. Nonetheless, this work provides a starting point for what will certainly become impressive collaborations between machine learning scientists and linguists in the future.

Bibliography

Adrus, T., Dubinski, E., Fiscus, J., Gillies, B., Harper, M., Hazen, T., Hefright, B., Jarrett, A., Lin, W., Ray, J., Rytting, A., Shen, W., Tzoukermann, E., and Wong, J. (2016). IARPA Babel Cantonese language pack IARPA-babel101b-v0.4c LDC2016S02.

Akinbo, S. (2018). Documentation of Cifungwa folktales. Endangered Languages Archive, ELAR.

Akinbo, S. (2019). Minimality and onset conditions interact with vowel harmony in Fungwa. In Proceedings of the Annual Meetings on Phonology, volume 7.

Archangeli, D. and Pulleyblank, D. (2012). Emergent phonology: evidence from English. Issues in English Linguistics.

Archangeli, D. and Pulleyblank, D. (2015). Phonology without universal grammar. Frontiers in Psychology, 6:1229.

Archangeli, D. and Pulleyblank, D. (2017). Phonology as an emergent system. The Routledge Handbook of Phonological Theory, pages 476–503.

Bauer, R. S. and Benedict, P. K. (2011). Modern Cantonese Phonology, volume 102. Walter de Gruyter.

Bauer, R. S., Kwan-Hin, C., and Pak-Man, C. (2003). Variation and merger of the rising tones in Hong Kong Cantonese. Language Variation and Change, 15(2):211–225.

Bearth, T. and Link, C. (1980). The tone puzzle of Wobe. Studies in African Linguistics, 11(2):147–207.

Ben-Hur, A., Horn, D., Siegelmann, H. T., and Vapnik, V. (2001). Support vector clustering. Journal of Machine Learning Research, 2(Dec):125–137.

Bentolila, I., Zhou, Y., Ismail, L. K., and Humpleman, R. (2011). System, method, and software application for targeted advertising via behavioral model clustering, and preference programming based on behavioral model clusters. US Patent 8,046,797.

Beyer, K., Goldstein, J., Ramakrishnan, R., and Shaft, U. (1999). When is "nearest neighbor" meaningful? In International Conference on Database Theory, pages 217–235. Springer.

Bickel, B. (2007). Typology in the 21st century: Major current developments. Linguistic Typology, 11(1):239–251.

Bird, S. and Lee, H. (2014). Computational support for early elicitation and classification of tone. Language Documentation & Conservation, 8:453–461.

Boersma, P. et al. (2002). Praat, a system for doing phonetics by computer. Glot International, 5.

Brentari, D. (2019). Sign Language Phonology. Cambridge University Press.

Buhrmester, M., Kwang, T., and Gosling, S. D. (2011). Amazon's Mechanical Turk: A new source of inexpensive, yet high-quality, data? Perspectives on Psychological Science, 6(1):3–5.

Caliński, T. and Harabasz, J. (1974). A dendrite method for cluster analysis. Communications in Statistics – Theory and Methods, 3(1):1–27.

Chao, Y. R. (1930). A system of tone letters. Le Maître Phonétique, 45:24–27.

Chao, Y. R. (1965). A grammar of spoken Chinese.

Chiu, C.-C., Sainath, T. N., Wu, Y., Prabhavalkar, R., Nguyen, P., Chen, Z., Kannan, A., Weiss, R. J., Rao, K., Gonina, K., et al. (2017). State-of-the-art speech recognition with sequence-to-sequence models. arXiv preprint arXiv:1712.01769.

Chomsky, N. (2007). Approaching UG from below. Interfaces + Recursion = Language, 89:1–30.

Chomsky, N. et al. (2006). On cognitive structures and their development: A reply to Piaget. Philosophy of Mind: Classical Problems/Contemporary Issues, pages 751–755.

Chomsky, N. and Halle, M. (1968). The sound pattern of English.

Clements, G. N. (1985). The geometry of phonological features. Phonology, 2(1):225–252.

Colburn, T. and Shute, G. (2007). Abstraction in computer science. Minds and Machines, 17(2):169–184.

Coupe, A. R. (2014). Strategies for analyzing tone languages. Language Documentation & Conservation, 8:462–489.

Dallos, P. and Fay, R. R. (2012). The Cochlea, volume 8. Springer Science & Business Media.

Davies, D. L. and Bouldin, D. W. (1979). A cluster separation measure. IEEE Transactions on Pattern Analysis and Machine Intelligence, (2):224–227.

De Lacy, P. (2007). The Cambridge Handbook of Phonology. Cambridge University Press.

Decker, D. M. et al. (1999). Handbook of the International Phonetic Association: A guide to the use of the International Phonetic Alphabet. Cambridge University Press.

Deepmind, G. (2017). AlphaGo Zero: Learning from scratch.

Den Dikken, M., Bernstein, J. B., Tortora, C., and Zanuttini, R. (2007). Data and grammar: Means and individuals. Theoretical Linguistics, 33(3):335–352.

Deng, L. (2012). The MNIST database of handwritten digit images for machine learning research [best of the web]. IEEE Signal Processing Magazine, 29(6):141–142.

Dresher, B. E. (2015). The arch not the stones: Universal feature theory without universal features. Nordlyd, 41(2):165–181.

Duanmu, S. (2007). The Phonology of Standard Chinese. Oxford University Press.

Edelman, S. and Christiansen, M. H. (2003). How seriously should we take minimalist syntax? Trends in Cognitive Sciences, 7(2):60–61.

Edmondson, J. A. and Esling, J. H. (2006). The valves of the throat and their functioning in tone, vocal register and stress: laryngoscopic case studies. Phonology, 23(2):157–191.

Eimas, P. D., Miller, J. L., and Jusczyk, P. W. (1987). On infant speech perception and the acquisition of language.
Esling, J. H. and Harris, J. G. (2005). States of the glottis: An articulatory phonetic model based on laryngoscopic observations. A Figure of Speech: A Festschrift for John Laver, pages 347–383.

Ewen, C. J. and Van der Hulst, H. (2001). The Phonological Structure of Words: An Introduction. Cambridge University Press.

Featherston, S. (2005). Universals and grammaticality: Wh-constraints in German and English. Linguistics, 43(4):667–711.

Fry, M. D. (2018). It's time to collaborate: What human linguists can learn from machine linguists. The Journal of the Acoustical Society of America, 144(3):1804–1804.

Furl, N., Phillips, P. J., and O'Toole, A. J. (2002). Face recognition algorithms and the other-race effect: computational mechanisms for a developmental contact hypothesis. Cognitive Science, 26(6):797–815.

Furui, S. (1986). Speaker-independent isolated word recognition using dynamic features of speech spectrum. IEEE Transactions on Acoustics, Speech, and Signal Processing, 34(1):52–59.

Gagliardi, A. and Lidz, J. (2014). Statistical insensitivity in the acquisition of Tsez noun classes. Language, pages 58–89.

Gale, D. and Nikaido, H. (1965). The Jacobian matrix and global univalence of mappings. Mathematische Annalen, 159(2):81–93.

Gårding, E., Kratochvil, P., Svantesson, J.-O., and Zhang, J. (1986). Tone 4 and Tone 3 discrimination in Modern Standard Chinese. Language and Speech, 29(3):281–293.

Garofolo, J. S. (1993). TIMIT acoustic phonetic continuous speech corpus. Linguistic Data Consortium, 1993.

Gauthier, B., Shi, R., and Xu, Y. (2007). Learning phonetic categories by tracking movements. Cognition, 103(1):80–106.

Giles, S. B. and Moll, K. L. (1975). Cinefluorographic study of selected allophones of English /l/. Phonetica, 31(3-4):206–227.

Glasberg, B. R. and Moore, B. C. (1990). Derivation of auditory filter shapes from notched-noise data. Hearing Research, 47(1-2):103–138.

Goldsmith, J. A. (1976). Autosegmental Phonology, volume 159. Indiana University Linguistics Club, Bloomington.

Goldsmith, J. A., Riggle, J., and Yu, A. (1995). The Handbook of Phonological Theory. Wiley Online Library.

Goodfellow, I., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S., Courville, A., and Bengio, Y. (2014). Generative adversarial nets. In Advances in Neural Information Processing Systems, pages 2672–2680.

Gordon, E., Maclagan, M., and Hay, J. (2007). The ONZE corpus. In Creating and Digitizing Language Corpora, pages 82–104. Springer.

Gordon, M. (2001). A typology of contour tone restrictions. Studies in Language. International Journal sponsored by the Foundation "Foundations of Language", 25(3):423–462.

Graves, A. (2012). Supervised sequence labelling. In Supervised Sequence Labelling with Recurrent Neural Networks, pages 5–13. Springer.

Gussenhoven, C. (2008). Types of focus in English. In Topic and Focus, pages 83–100. Springer.

Gussenhoven, C. et al. (2004). The Phonology of Tone and Intonation. Cambridge University Press.

Gussenhoven, C. and Teeuw, R. (2008). A moraic and a syllabic H-tone in Yucatec Maya. Fonología instrumental: Patrones fónicos y variación, pages 49–71.

Gussenhoven, C. and Wright, J. (2015). Suprasegmentals. In Wright, J. D., editor, International Encyclopedia of the Social & Behavioral Sciences, volume 23, pages 714–721.

Haspelmath, M. and Sims, A. (2013). Understanding Morphology. Routledge.

Hauser, M. D., Chomsky, N., and Fitch, W. T. (2002). The faculty of language: What is it, who has it, and how did it evolve? Science, 298(5598):1569–1579.
Hayes, B. (1995). Metrical Stress Theory: Principles and Case Studies. University of Chicago Press.

Hayes, B. (2011). Introductory Phonology, volume 32. John Wiley & Sons.

Hinton, G. E. (2002). Training products of experts by minimizing contrastive divergence. Neural Computation, 14(8):1771–1800.

Hinton, G. E., Osindero, S., and Teh, Y.-W. (2006). A fast learning algorithm for deep belief nets. Neural Computation, 18(7):1527–1554.

Hinton, G. E. and Salakhutdinov, R. R. (2006). Reducing the dimensionality of data with neural networks. Science, 313(5786):504–507.

Hock, H. H. (1986). Compensatory lengthening: in defense of the concept 'mora'. Folia Linguistica, 20(3-4):431–460.

Hopper, P. (1987). Emergent grammar. In Annual Meeting of the Berkeley Linguistics Society, volume 13, pages 139–157.

Huang, S., Liu, J., Wu, X., Wu, L., Yan, Y., and Qin, Z. (1998). 1997 Mandarin Broadcast News Speech (HUB4-NE) LDC98S73. Web download.

Hyman, L. (1985). A Theory of Phonological Weight, volume 19. Walter de Gruyter GmbH & Co KG.

Hyman, L. (2014). How to study a tone language. Language Documentation & Conservation, 8:525–562.

Jain, A. K. (2010). Data clustering: 50 years beyond k-means. Pattern Recognition Letters, 31(8):651–666.

Jakobson, R., Fant, C. G., and Halle, M. (1951). Preliminaries to speech analysis: The distinctive features and their correlates.

Johnson, K. (2004). Acoustic and auditory phonetics. Phonetica, 61(1):56–58.

Johnson, M. et al. (2011). How relevant is linguistics to computational linguistics? Linguistic Issues in Language Technology, 6(7).

Johnson, S. C. (1967). Hierarchical clustering schemes. Psychometrika, 32(3):241–254.

Jouvet, D. and Laprie, Y. (2017). Performance analysis of several pitch detection algorithms on simulated and real noisy speech data. In 2017 25th European Signal Processing Conference (EUSIPCO), pages 1614–1618. IEEE.

Jusczyk, P. W. (1995). Language acquisition: Speech sounds and the beginning of phonology.

Kaskari, S. M., Mohan, A. K., Fry, M. D., and Neumann, D. W. (2017). Generation of phoneme-experts for speech recognition. US Patent 9,792,900.

Kenstowicz, M. J. (1994). Phonology in Generative Grammar, volume 7. Blackwell, Cambridge, MA.

Koffka, K. (1922). Perception: an introduction to the Gestalt-theorie. Psychological Bulletin, 19(10):531.

Kramer, M. A. (1991). Nonlinear principal component analysis using autoassociative neural networks. AIChE Journal, 37(2):233–243.

Krishnan, S. and Gonzalez, J. L. U. (2015). Google Compute Engine. In Building Your Next Big Thing with Google Cloud Platform, pages 53–81. Springer.

Kuhl, P. K., Conboy, B. T., Padden, D., Nelson, T., and Pruitt, J. (2005). Early speech perception and later language development: Implications for the "critical period". Language Learning and Development, 1(3-4):237–264.

Kutsch Lojenga, C. (1994). Ngiti: a Central-Sudanic language of Zaire.

Lachenbruch, P. A. and Goldstein, M. (1979). Discriminant analysis. Biometrics, pages 69–85.

Ladd, D. R. (1984). Declination: a review and some hypotheses. Phonology, 1:53–74.

Ladefoged, P. and Johnson, K. (2014). A Course in Phonetics. Nelson Education.

Lam, W. M. (2018). Perception of lexical tones by homeland and heritage speakers of Cantonese. PhD thesis, University of British Columbia.

Lam, Z., Hall, K. C., and Pulleyblank, D. (2016). Temporal location of perceptual cues for Cantonese tone identification. In 3rd Workshop on Innovations in Cantonese Linguistics (WICL-3), The Ohio State University.

Laniran, Y. O. and Clements, G. N. (2003). Downstep and high raising: interacting factors in Yoruba tone production. Journal of Phonetics, 31(2):203–250.
Larsen, A. B. L., Sønderby, S. K., Larochelle, H., and Winther, O. (2015). Autoencoding beyond pixels using a learned similarity metric. arXiv preprint arXiv:1512.09300.

Lass, R. (1984). Phonology: An Introduction to Basic Concepts. Cambridge University Press.

Le, Q. V. (2013). Building high-level features using large scale unsupervised learning. In Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on, pages 8595–8598. IEEE.

LeCun, Y., Bengio, Y., and Hinton, G. (2015). Deep learning. Nature, 521(7553):436.

Lee, T., Lo, W. K., Ching, P., and Meng, H. (2002). Spoken language resources for Cantonese speech processing. Speech Communication, 36(3-4):327–342.

Leong, V. and Goswami, U. (2015). Acoustic-emergent phonology in the amplitude envelope of child-directed speech. PLoS One, 10(12):e0144411.

Leroy, L. (1984). The psychological reality of fundamental frequency declination. Antwerp Papers in Linguistics, Wilrijk, (40):1–102.

Leung, M.-T. and Law, S.-P. (2001). HKCAC: the Hong Kong Cantonese adult language corpus. International Journal of Corpus Linguistics, 6(2):305–325.

Lindblom, B. (1999). Emergent phonology. In Annual Meeting of the Berkeley Linguistics Society, volume 25, pages 195–209.

Lindley, D. (1990). Regression and correlation analysis. In Time Series and Statistics, pages 237–243. Springer.

Linge, O. (2015). Understanding the neutral tone in Mandarin. https://blog.skritter.com/2015/01/understanding-the-neutral-tone-in-mandarin/.

Maddieson, I. (1978). Universals of tone. Universals of Human Language, 2:335–365.

Maddieson, I. (2013a). Consonant inventories. In Dryer, M. S. and Haspelmath, M., editors, The World Atlas of Language Structures Online. Max Planck Institute for Evolutionary Anthropology, Leipzig.

Maddieson, I. (2013b). Tone. In Dryer, M. S. and Haspelmath, M., editors, The World Atlas of Language Structures Online. Max Planck Institute for Evolutionary Anthropology, Leipzig.

Maddieson, I. (2013c). Vowel quality inventories. In Dryer, M. S. and Haspelmath, M., editors, The World Atlas of Language Structures Online. Max Planck Institute for Evolutionary Anthropology, Leipzig.

Makhzani, A., Shlens, J., Jaitly, N., Goodfellow, I., and Frey, B. (2015). Adversarial autoencoders. arXiv preprint arXiv:1511.05644.

Mané, D. et al. (2015). TensorBoard: TensorFlow's visualization toolkit.

Marcus, M., Santorini, B., and Marcinkiewicz, M. A. (1993). Building a large annotated corpus of English: The Penn Treebank.

Matthews, S. and Yip, V. (2013). Cantonese: A Comprehensive Grammar. Routledge.

Maye, J., Weiss, D. J., and Aslin, R. N. (2008). Statistical phonetic learning in infants: Facilitation and feature generalization. Developmental Science, 11(1):122–134.

McAuliffe, M., Socolof, M., Mihuc, S., Wagner, M., and Sonderegger, M. (2017). Montreal Forced Aligner: trainable text-speech alignment using Kaldi. In Proceedings of Interspeech, pages 498–502.

McCloy, D. R., Wright, R. A., and Souza, P. E. (2015). Talker versus dialect effects on speech intelligibility: A symmetrical study. Language and Speech, 58(3):371–386.

McCulloch, W. S. and Pitts, W. (1943). A logical calculus of the ideas immanent in nervous activity. The Bulletin of Mathematical Biophysics, 5(4):115–133.

Mielke, J. (2008). The Emergence of Distinctive Features. Oxford University Press.

Miller, Z., Dickinson, B., Deitrick, W., Hu, W., and Wang, A. H. (2014). Twitter spammer detection using data stream clustering. Information Sciences, 260:64–73.
Mok, P. P., Zuo, D., and Wong, P. W. (2013). Production and perception of a sound change in progress: Tone merging in Hong Kong Cantonese. Language Variation and Change, 25(3):341–370.

Mok, P. P.-K. and Wong, P. W.-Y. (2010). Perception of the merging tones in Hong Kong Cantonese: Preliminary data on monosyllables. In Speech Prosody 2010 – Fifth International Conference.

Mukherjee, S., Asnani, H., Lin, E., and Kannan, S. (2019). ClusterGAN: Latent space clustering in generative adversarial networks. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 33, pages 4610–4617.

Munson, B., Edwards, J., and Beckman, M. E. (2011). Phonological representations in language acquisition: Climbing the ladder of abstraction. Handbook of Laboratory Phonology, pages 288–309.

Nagel, T. (1974). What is it like to be a bat? The Philosophical Review, 83(4):435–450.

Ngan, M., Grother, P. J., and Ngan, M. (2015). Face Recognition Vendor Test (FRVT): Performance of automated gender classification algorithms. US Department of Commerce, National Institute of Standards and Technology.

Nguyen, N. (Collected on October 9th, 2018). A lot of apps sell your data. Here's what you can do about it.

Nickolls, J., Buck, I., Garland, M., and Skadron, K. (2008). Scalable parallel programming with CUDA. In ACM SIGGRAPH 2008 Classes, page 16. ACM.

Nousi, P. and Tefas, A. (2018). Evolving Systems, pages 1–14.

Odden, D. (1995). Tone: African languages. The Handbook of Phonological Theory, 1:444–75.

Ortega, L. (2014). Understanding Second Language Acquisition. Routledge.

Ou, J. (2012). Tone merger in Guangzhou Cantonese. PhD thesis, The Hong Kong Polytechnic University.

Parker, S. T. and Gibson, K. R. (1977). Object manipulation, tool use and sensorimotor intelligence as feeding adaptations in cebus monkeys and great apes. Journal of Human Evolution, 6(7):623–641.

Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., and Duchesnay, E. (2011). Scikit-learn: Machine learning in Python. Journal of Machine Learning Research, 12:2825–2830.

Phillips, C. (2009). Should we impeach armchair linguists? Japanese/Korean Linguistics, 17:49–64.

Pierrehumbert, J. (1990). Phonological and phonetic representation. Journal of Phonetics, 18(3):375–394.

Pierrehumbert, J. B. (2003). Phonetic diversity, statistical learning, and acquisition of phonology. Language and Speech, 46(2-3):115–154.

Pitt, M. A., Johnson, K., Hume, E., Kiesling, S., and Raymond, W. (2005). The Buckeye corpus of conversational speech: Labeling conventions and a test of transcriber reliability. Speech Communication, 45(1):89–95.

Pulleyblank, D. (1986). Tone in Lexical Phonology, volume 4. Springer Science & Business Media.

Pulleyblank, D. (1994). Underlying mora structure. Linguistic Inquiry, 25(2):344–353.

Qian, Y., Lee, T., and Soong, F. K. (2007). Tone recognition in continuous Cantonese speech using supratone models. The Journal of the Acoustical Society of America, 121(5):2936–2945.

Räsänen, O., Doyle, G., and Frank, M. C. (2018). Pre-linguistic segmentation of speech into syllable-like units. Cognition, 171:130–150.

Rong, X. (2014). word2vec parameter learning explained. arXiv preprint arXiv:1411.2738.

Rose, P. (1987). Considerations in the normalisation of the fundamental frequency of linguistic tone. Speech Communication, 6(4):343–352.

Rousseeuw, P. J. (1987). Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. Journal of Computational and Applied Mathematics, 20:53–65.
Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., Huang, Z., Karpathy, A., Khosla, A., Bernstein, M., et al. (2015). ImageNet large scale visual recognition challenge. International Journal of Computer Vision, 115(3):211–252.

Saffran, J. R., Aslin, R. N., and Newport, E. L. (1996). Statistical learning by 8-month-old infants. Science, 274(5294):1926–1928.

Samuels, B. D. (2009). The structure of phonological theory. Harvard University, Cambridge, MA.

Schuknecht, H. F. (1993). Pathology of the Ear, volume 1. Lea & Febiger, Philadelphia.

Schuster, M., Johnson, M., and Thorat, N. (2016). Zero-shot translation with Google's multilingual neural machine translation system. Google Research Blog.

Senders, J. W. and Moray, N. P. (1995). Human Error: Cause, Prediction, and Reduction.

Shen, X. S. (1992). On tone sandhi and tonal coarticulation. Acta Linguistica Hafniensia, 25(1):83–94.

Shi, Q. S. (2004). Yi bai nian qian Guangzhou hua de yin ping diao. Fangyan, 1:34–46.

Shih, C. (1997). Mandarin third tone sandhi and prosodic structure. Linguistic Models, 20:81–124.

Shih, C. (2000). A declination model of Mandarin Chinese. In Intonation, pages 243–268. Springer.

Silver, D., Schrittwieser, J., Simonyan, K., Antonoglou, I., Huang, A., Guez, A., Hubert, T., Baker, L., Lai, M., Bolton, A., et al. (2017). Mastering the game of Go without human knowledge. Nature, 550(7676):354.

Silverman, D. (1992). Multiple scansions in loanword phonology: evidence from Cantonese. Phonology, 9(2):289–328.

Silverman, D. (2006). A Critical Introduction to Phonology: Of Sound, Mind, and Body. A&C Black.

Silverman, K. E., Beckman, M. E., Pitrelli, J. F., Ostendorf, M., Wightman, C. W., Price, P., Pierrehumbert, J. B., and Hirschberg, J. (1992). ToBI: a standard for labeling English prosody. In ICSLP, volume 2, pages 867–870.

Smolensky, P. and Prince, A. (1993). Optimality theory: Constraint interaction in generative grammar. Optimality Theory in Phonology, page 3.

Steedman, M. (2011). Romantics and revolutionaries. Linguistic Issues in Language Technology, 6(11):1–20.

Stevens, K. and Halle, M. (1971). A note on laryngeal features. MIT-RLE Quarterly Progress Report, 101:198–213.

Strömbergsson, S. (2016). Today's most frequently used f0 estimation methods, and their accuracy in estimating male and female pitch in clean speech. In INTERSPEECH, pages 525–529.

Surendran, D. R. (2007). Analysis and automatic recognition of tones in Mandarin Chinese. The University of Chicago.

Sutskever, I., Vinyals, O., and Le, Q. V. (2014). Sequence to sequence learning with neural networks. In Advances in Neural Information Processing Systems, pages 3104–3112.

Taylor, P. (2009). Text-to-Speech Synthesis. Cambridge University Press.

Toderici, G., Vincent, D., Johnston, N., Hwang, S. J., Minnen, D., Shor, J., and Covell, M. (2017). Full resolution image compression with recurrent neural networks. In CVPR, pages 5435–5443.

Toshniwal, S., Sainath, T. N., Weiss, R. J., Li, B., Moreno, P., Weinstein, E., and Rao, K. (2018). Multilingual speech recognition with a single end-to-end model. In 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 4904–4908. IEEE.

Unni, K. (2018). Decrypting convolution neural network using simple images. https://towardsdatascience.com/convolution-neural-network-decryption-e323fd18c33. Accessed: 2019-08-16.
Van Den Oord, A., Dieleman, S., Zen, H., Simonyan, K., Vinyals, O., Graves, A., Kalchbrenner, N., Senior, A. W., and Kavukcuoglu, K. (2016). WaveNet: A generative model for raw audio. In SSW, page 125.

van Oostendorp, M., Ewen, C. J., Hume, E. V., and Rice, K. (2011). The Blackwell Companion to Phonology, 5 Volume Set, volume 1. John Wiley & Sons.

Vinyals, O., Babuschkin, I., Chung, J., Mathieu, M., Jaderberg, M., Czarnecki, W. M., Dudzik, A., Huang, A., Georgiev, P., Powell, R., et al. (2019). AlphaStar: Mastering the real-time strategy game StarCraft II. DeepMind Blog.

Virtanen, P., Gommers, R., Oliphant, T. E., Haberland, M., Reddy, T., Cournapeau, D., Burovski, E., Peterson, P., Weckesser, W., Bright, J., van der Walt, S. J., Brett, M., Wilson, J., Jarrod Millman, K., Mayorov, N., Nelson, A. R. J., Jones, E., Kern, R., Larson, E., Carey, C., Polat, İ., Feng, Y., Moore, E. W., VanderPlas, J., Laxalde, D., Perktold, J., Cimrman, R., Henriksen, I., Quintero, E. A., Harris, C. R., Archibald, A. M., Ribeiro, A. H., Pedregosa, F., van Mulbregt, P., and SciPy 1.0 Contributors (2019). SciPy 1.0 – fundamental algorithms for scientific computing in Python. arXiv e-prints, page arXiv:1907.10121.

Wang, J. (2004). The neutral tone in trisyllabic sequences in Chinese dialects. In International Symposium on Tonal Aspects of Languages: With Emphasis on Tone Languages.

Wang, Y., Skerry-Ryan, R., Stanton, D., Wu, Y., Weiss, R. J., Jaitly, N., Yang, Z., Xiao, Y., Chen, Z., Bengio, S., et al. (2017). Tacotron: Towards end-to-end speech synthesis. arXiv preprint arXiv:1703.10135.

Werbos, P. (1974). Beyond regression: new tools for prediction and analysis in the behavioral sciences. PhD thesis, Harvard University.

Werker, J. F. and Tees, R. C. (1984). Cross-language speech perception: Evidence for perceptual reorganization during the first year of life. Infant Behavior and Development, 7(1):49–63.

Whalen, D. H. and Xu, Y. (1992). Information for Mandarin tones in the amplitude contour and in brief segments. Phonetica, 49(1):25–47.

Xu, Y. (1997). Contextual tonal variations in Mandarin. Journal of Phonetics, 25(1):61–83.

Xu, Y. (2001). Fundamental frequency peak delay in Mandarin. Phonetica, 58(1-2):26–52.

Yaohong, L. and Guoqiao, Z. (1998). The Dong language in Guizhou Province. Trans. from Chinese by D. Norman Geary. Dallas and Arlington: Summer Institute of Linguistics, and University of Texas at Arlington.

Yip, M. (2002). Tone. Cambridge University Press.

Yu, A. (2007). Understanding near mergers: The case of morphological tone in Cantonese. Phonology, 24(1):187–214.

Yu, K. M. (2011). The learnability of tones from the speech signal. PhD thesis, University of California, Los Angeles.

Yu, K. M. (2017). The role of time in phonetic spaces: Temporal resolution in Cantonese tone perception. Journal of Phonetics, 65:126–144.

Yu, K. M. and Lam, H. W. (2014). The role of creaky voice in Cantonese tonal perception. The Journal of the Acoustical Society of America, 136(3):1320–1333.

Yuan, J., Ryant, N., and Liberman, M. (2014). Automatic phonetic segmentation in Mandarin Chinese: boundary models, glottal features and tone. In 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 2539–2543. IEEE.

Yuan, J., Ryant, N., and Liberman, M. (2015). Mandarin Chinese Phonetic Segmentation and Tone LDC2015S05. Linguistic Data Consortium.

Yuen, I. (2007). Declination and tone perception in Cantonese. Tones and Tunes: Experimental Studies in Word and Sentence Prosody, pages 63–78.
Appendix A

Supplementary Figures

A.1 Distribution of ground-truth labels in clusters (hypothesized tones) identified using the method for Mandarin

Figure A.1: Hypothesized tones for Mandarin (two through nine tones) and the proportion of ground-truth tone labels that occur within the cluster corresponding to that tone. These results were generated using the adversarial autoencoder described in the thesis.

A.2 Hypothesized tones for Mandarin by clustering latent codes from a vanilla autoencoder

Figure A.2: Hypothesized tones for Mandarin (two through nine tones) and the proportion of ground-truth tone labels that occur within the cluster corresponding to that tone. These results were generated using a vanilla autoencoder (in contrast to the adversarial autoencoder described in the thesis).

A.3 Hypothesized tones for Mandarin by clustering acoustic parameters without an autoencoder

Figure A.3: Hypothesized tones for Mandarin (two through nine tones) and the proportion of ground-truth tone labels that occur within the cluster corresponding to that tone. These results were generated using only acoustic parameterization and no abstraction from an autoencoder.
