Investigation of Gesture Control for Articulatory Speech Synthesis with a Bio-mechanical Mapping Layer

by

Johnty Yizhong Wang
BASc, University of British Columbia, 2008

A thesis submitted in partial fulfillment of the requirements for the degree of Master of Applied Science in The Faculty of Graduate Studies (Electrical and Computer Engineering)

The University of British Columbia (Vancouver)
September 2012

© Johnty Yizhong Wang, 2012

Abstract

In the process of working with a real-time, gesture-controlled speech and singing synthesizer used for musical performance, we have documented performer-related issues and provided some suggestions that will serve to improve future work in the field from an engineering and technician's perspective. One significant detrimental factor in the existing system is the sound quality, which is limited by the one-to-one kinematic mapping between the gesture input and the synthesized output. To address this, a force-activated bio-mechanical mapping layer was implemented to drive an articulatory synthesizer, and the results were compared with the existing mapping system for the same task from both the performer and listener perspectives. The results show that adding the complex, dynamic bio-mechanical mapping layer introduces more difficulty but allows the performer a greater degree of expression, which is consistent with existing work in the literature. However, to the novice listener, there is no significant difference in the intelligibility of the sound or the perceived quality. The results suggest that, for browsing through a vowel space, force and position input are comparable when considering output intelligibility alone, but for expressivity a more complex input may be more suitable.

Preface

This thesis is written from two perspectives. The first perspective is based on lessons and recommendations from the author's role as an engineer and studio technician with a musical background, working on the building, deployment and technical support of a gesture-controlled voice instrument (Chapter 3). The second perspective is based on the findings from the process of improving the synthesis system by exploring the use of an articulatory, force-activated bio-mechanical mapping; the main findings are to be presented at the 12th International Conference on New Interfaces for Musical Expression (NIME) (Chapter 4).

Publications

The following publications were the result of direct and indirect work related to this thesis: an extended abstract documenting the main proposed implementation [8], a poster demonstration [43] of a hybrid touch+force controller, a demonstration [7] of a mobile singing synthesis system, a performance [27] for "Interactivity", and a paper [42].

Ethics

Procedures and protocols employed in the work described by this thesis were approved by the UBC Behavioural Research Ethics Board, Certificate Number H11-02370.

Table of Contents

Abstract
Preface
Table of Contents
List of Tables
List of Figures
Glossary
Acknowledgments
1 Introduction
  1.1 Research Contributions
  1.2 Thesis Structure
2 Background and Related Work
  2.1 Human Speech Production
  2.2 Speech Synthesis
    2.2.1 Mechanical Articulatory Models
    2.2.2 Software Articulatory Models
    2.2.3 Filter Models
    2.2.4 Pressure vs Time Models
  2.3 Summary of Gesture to Speech Systems
  2.4 Gesture Mapping in the NIME Context
  2.5 Chapter Summary
3 Evaluation of the DiVA System
  3.1 History of the DiVA System
    3.1.1 GloveTalk
    3.1.2 GRASSP and DiVA
  3.2 Lessons Learnt
    3.2.1 Types of Musicians
    3.2.2 Semantics
    3.2.3 Precision, Accuracy and Consistency
    3.2.4 Robustness and Stability
    3.2.5 Sound Quality
    3.2.6 Summary of Learnings
  3.3 Motivation for Exploring New Mapping and Synthesis
    3.3.1 Suggested Avenue of Exploration
4 Implementation of Force and Position Input Controlling a Bio-mechanical Model for Articulatory Synthesis
  4.1 Overview
  4.2 Development Environment
  4.3 Inter-module Communication
    4.3.1 Open Sound Control
    4.3.2 Sender and Receivers
  4.4 Gesture Input
    4.4.1 Input Hardware
    4.4.2 Input Software
  4.5 Force to Muscle Mapping
  4.6 Bio-Mechanical Model
  4.7 Synthesizer
  4.8 Integration and Output
  4.9 (Re)Implementation of Position/Kinematic Input
5 Evaluation
  5.1 Overview
  5.2 Pilot Study
  5.3 Experiment
    5.3.1 Performer Experiment
    5.3.2 Listener Experiment
    5.3.3 Performer Evaluation Results
    5.3.4 Listener Evaluation Results
  5.4 Summary of Results
6 Conclusion
  6.1 Contribution Summary
  6.2 Suggested Future Work
    6.2.1 System Additions
    6.2.2 Input Mapping Strategies
    6.2.3 Musical Evaluation
  6.3 Final Thoughts
Bibliography
Appendix A: Consent Forms and Sample Questionnaires

List of Tables

Table 2.1 Gesture to Speech systems
Table 4.1 Development platforms
Table 5.1 Number of sample words
Table 5.2 Identification accuracy
Table 5.3 Qualitative comparisons of final and pilot experiments

List of Figures

Figure 1.1 The DiVA system used during performance and speech research
Figure 2.1 Simplified diagram of the human vocal process
Figure 2.2 The glottal source waveform [15]
Figure 2.3 Bellow and box cavity components of the von Kempelen machine [34]
Figure 2.4 Other mechanical synthesizers [34]
Figure 2.5 Screenshot of the VocaltractLab [1] tube model
Figure 2.6 Dudley's Voder [34]
Figure 2.7 Overview of the audio recording and playback process
Figure 2.8 "Zoomed in" display of the "ah" sound
Figure 2.9 The Handsketch controller for singing synthesis
Figure 2.10 The "three layer mapping strategy" according to [21]
Figure 3.1 The DiVA 2.0 system
Figure 3.2 Controllability vs naturalness
Figure 4.1 System diagram
Figure 4.2 Force sensor circuit
Figure 4.3 FSRs mounted on enclosure
Figure 4.4 Input system flow diagrams
Figure 4.5 Force input and mapping
Figure 4.6 The Artisynth vocal tract model
Figure 4.7 Jass synthesizer
Figure 4.8 UDP ports for OSC messages
Figure 4.9 Tongue in target vowel positions
Figure 4.10 Output spectrum for various tongue positions
Figure 4.11 Components of the RBF mapping patch
Figure 4.12 Force and kinematic input interfaces

Glossary

DIVA  Digital Ventriloquized Actor
FSR   Force Sensitive Resistor
GUI   Graphical User Interface
HCI   Human Computer Interaction
IP    Internet Protocol
NIME  New Interface(s) for Musical Expression
MIDI  Musical Instrument Digital Interface
OSC   Open Sound Control
RBF   Radial Basis Function
UDP   User Datagram Protocol
USB   Universal Serial Bus
UI    User Interface

Acknowledgments

I would like to thank Dr. Sidney Fels, Dr. Nicolas d'Alessandro and Dr. Robert Pritchard for providing their supervision, support and guidance over the years.
Thanks Mom, Dad, and all my friends for your love. I am also extremely grateful for the wonderful people at MAGIC, HCT and SUBCLASS that I’ve had the pleasure to work with.  x  Chapter 1  Introduction The Voice The human voice is one of the most interesting sonic instruments in existence. The voice is intimate, in the physical sense that it resides within the body, as well as its role as a communicative medium that forms the basis of social interaction. The voice is expressive, in the sense that a wide variety of sounds can be produced with subtle nuance to convey a vast spectrum of acoustic output. The voice is refined, in the sense that it takes many years to become a fluent speaker or singer. The voice is highly coordinated, in the sense that a huge number of muscles in the body must move in precisely timed trajectories to generate even a simple utterance. Finally, the voice is ubiquitous, in the sense that every (non-disabled) person is a trained listener and performer of the voice instrument at some level. Therefore, the study of the voice is of interest to many including engineers, health practitioners, linguists, and musicians. Engineers work on models that synthesize or recognize voices in order build communication tools and user interfaces. Clinicians are concerned with the physical anatomy of the vocal apparatus for corrective procedures and rehabilitation. Linguists work with the voice since spoken word is an integral part of all languages. Musicians use the voice as an instrument, converting artistic intent through oral output. In each of these examples, working with the voice requires 1  knowledge in more than one domain and as such, the study of voice is an interdisciplinary field. The DiVA Project  Figure 1.1: The DiVA system used during performance and speech research The Digital Ventriloquized Actor (diva) project is an interdisciplinary research project centred around the voice. The diva is a gesture controlled speech and singing synthesizer used primarily for musical performance as a new instrument. This thesis describes the lessons learnt while working with the diva over the last 3 years from the perspective of an engineer building and maintaining the system, a technician supporting musicians and artists in performing with the system and a researcher in attempting to integrate the experience and explore methods to improve gesture to voice controllers and gain a better understanding of the human voice.  2  1.1  Research Contributions  The main research contributions of the work described in this thesis are as follows: • By using an experimental system for performance, a number of insights have been collected and documented through building the system, setting it up for rehearsals and performance, and interaction with performers and the audience • In the process of improving the system, mapping concepts from the existing literature for new instrument design were applied specifically in the context of gesture controlled speech synthesis  1.2  Thesis Structure  The structure of this thesis is as follows: in Chapter 2, relevant existing work in the literature will be presented. In Chapter 3, an analysis of the existing diva system will be provided, documenting the lessons learnt from using a gesture to speech synthesizer in a performance setting, and motivation for further exploration of mapping and synthesis methods. Then,the implementation of a new mapping and synthesis system is described in Chapter 4 and a comparison with the existing scheme presented in Chapter 5. 
Finally, concluding remarks and suggested future work are presented in Chapter 6.

Chapter 2

Background and Related Work

In order to understand the significance of the work presented in this thesis, it is useful to first obtain a basic level of understanding of the underlying mechanisms of voice production, as well as of voice synthesis. This chapter first briefly describes how speech is produced in the human body, and then some of the methods with which speech can be produced artificially (synthesized), along with some historical examples. Then, a summary of these systems is provided with a discussion and comparison of the challenges of gesture control of speech. Finally, the topic of using gestures to control speech production models is placed in context with relevant research in the new instrument field on input mapping for general music and sound synthesis.

2.1 Human Speech Production

A simplified diagram of the human vocal process is shown in Figure 2.1 and can be broken down into four major steps. For a more in-depth description, Titze's book [33] describes the subject in much greater detail.

Figure 2.1: Simplified diagram of the human vocal process (1. the diaphragm increases pressure in the lungs; 2. the vocal folds vibrate (voiced) or open (unvoiced); 3. the articulators change the shape of the vocal tract; 4. the filtered sound is radiated)

1. The Lung and Diaphragm

The vocal process starts in the lungs, the vital organ of the human body that is responsible for breathing. The lungs are responsible for providing life-sustaining oxygen to the bloodstream during inhalation, and the expiration of waste carbon dioxide during exhalation. The voice, with the exception of gasping sounds, occurs during the exhalation cycle of the breath. The inward and outward flow of air from the lungs is caused by contraction and expansion of lung volume that create differences between internal (lung) and external (atmospheric) pressure. The diaphragm, a sheet muscle under the ribs above the abdominal cavity, is responsible for this expansion and contraction of the lungs. Therefore, one could say that the diaphragm is the starting point of the vocal process. It is not surprising that singers and professional speakers spend a considerable amount of time practising their breathing (control of the diaphragm) as a part of their musical training.

2. The Vocal Folds

Once there is an excess of pressure in the lungs (during the exhalation phase), air flows upwards past the vocal folds: a set of thin overlapping membranes that can open or close. The vocal folds are attached to the thyroid cartilage at the front and the arytenoid cartilages at the back. The area covered by the vocal folds and the space around it is called the glottis. As these structures move, the posture and tension of the folds change, creating differences in physical response to the air flow. Under the right conditions, the vocal folds will undergo opening and closing at a regular interval due to the air flow pushing the folds open and the elastic forces pulling them back together: a forced oscillation. The oscillations result in an airflow profile over time that typically looks like Figure 2.2, and this is an example of a glottal source waveform. The steep downward slope of the closing phase (the downward part of the waveform) contributes to a wide frequency range in the spectrum (in the same way that an impulse function is spectrally infinitely wide). The exact shape of the waveform determines its spectral composition, which affects the vocal quality, and is controlled by a large number of parameters including the lung pressure and the positions of various structures that change the tension and rest position of the vocal folds.

Figure 2.2: The glottal source waveform [15]
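To make the shape of this waveform concrete, the short sketch below generates one cycle of a simple parametric glottal pulse. It uses the classic Rosenberg approximation purely as an illustration (it is not the specific source model cited above), and the period, opening time and closing time are arbitrary example values.

```python
import numpy as np

def rosenberg_pulse(t0=0.01, tp=0.004, tn=0.0016, fs=44100):
    """One cycle of a Rosenberg-style glottal flow pulse.

    t0: cycle period in seconds (10 ms, i.e. a 100 Hz source)
    tp: duration of the opening phase
    tn: duration of the (steeper) closing phase
    fs: sampling rate in Hz
    """
    n = int(round(t0 * fs))
    t = np.arange(n) / fs
    g = np.zeros(n)
    opening = t <= tp
    closing = (t > tp) & (t <= tp + tn)
    g[opening] = 0.5 * (1.0 - np.cos(np.pi * t[opening] / tp))
    g[closing] = np.cos(np.pi * (t[closing] - tp) / (2.0 * tn))
    return g  # stays at zero during the closed phase

pulse = rosenberg_pulse()
# Repeating the pulse gives a harmonic-rich spectrum at multiples of 100 Hz.
spectrum = np.abs(np.fft.rfft(np.tile(pulse, 50)))
```

The abrupt return to zero at the end of the closing phase is what produces the wide, harmonic-rich spectrum described above.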
When the vocal folds are vibrating with a constant period, voiced sounds with a discernible fundamental frequency (or pitch) are produced. Vowels and voiced consonants (such as "m" or "n") belong in this category of sounds. For unvoiced sounds, such as consonants without discernible pitch, the vocal folds are completely open and air flows through the folds unobstructed. Since the configuration of the vocal folds is a continuous process, it is possible to have an "in between" glottal wave that is always partially open at its minimum (no complete closure) but with considerable oscillation behaviour as well, such as in the case of a "breathy" sound.

3. The Vocal Tract

The output from the glottis then goes through the pharyngeal and laryngeal cavities and then the oral and nasal cavities, collectively known as the vocal tract (which is also the upper airway). The varying cross sections along the vocal tract attenuate certain frequencies and amplify others. When a wide-band signal (such as a glottal waveform) is passed through the tract, a particular vocal tract shape will create a number of resonant peaks in the spectrum. The amplitude, frequency, and bandwidth of the peaks are known as formants, and their configuration defines the different sounds that make up distinct phonemes, the basic building blocks of spoken vocabulary. The vocal tract shape is manipulated through the activation of various articulators (muscles) connected to the internal structures of the vocal tract. In the upper airway, the largest and most dominant structure for controlling the overall tract shape is the tongue. With a large number of muscles and a huge range of movement, the tongue is responsible for the most significant variations in the overall shape of the vocal tract. As described in the previous section, for unvoiced sounds there is no vibration of the vocal folds and the folds have minimal effect on the air flow, but constrictions at certain points in the tract create turbulence which generates high-frequency noise, responsible for making consonant sounds like "s" and "f". These sounds contain high frequencies caused by turbulent flow at the constriction points and are called fricatives. At the branching point between the oral and nasal cavity, the velum (soft palate) can function
Bringing it Together Based on the above description, it should be plain to see that the act of speech and singing requires coordinated movements from a large number of structures related to the vocal process. Based on years of practice and refinement, when we speak this comes almost instinctively, but attempting to fully model the production and control of speech is highly challenging due to the complexity of the system consisting of these closely coupled components that have to work together with precise configuration and timing.  2.2  Speech Synthesis  The act of artificially producing (synthesizing) speech involves ultimately recreating the acoustic pressure waves that are radiated from the human body. This section describes some of the production models that have been employed to generate speech sounds. For each production model, there is also a directly coupled control process that affects how the user interacts with the system when attempting to speak with them, and these control systems will also be described. The production models are presented in order of increasing abstraction. First, physical mechanical models that attempt 8  to imitate the physical properties of the vocal tract are described, followed by articulatory synthesizers that model the various physical structures using software simulation. Then, a higher level of abstraction by representing the vocal process as a source-filter system is described. Finally, the pressure vs time model, based on the explicit specification of the output acoustic pressure waves produced is presented.  2.2.1  Mechanical Articulatory Models  Before the age of analogue and digital electronics, attempts have been made to recreate the human voice using physical mechanical models that imitated the features of the vocal tract. Mechanical systems, the earliest examples of speech synthesis, attempted to match the functionality of the physical anatomy by building a model vocal tract with an air pressure source to simulate the lung pressure, a vibrating membrane to simulate the vocal folds, and a deformable cavity that generated the resonance of the vocal tract made up of the laryngopharyngeal, oral and nasal structures.  Figure 2.3: Bellow and box cavity components of the von Kempelen machine [34] The von Kempelen machine [40], as shown in Figure 2.3, is an example 9  of an early speech synthesizer. Built in the 1700’s, the device consists of a bellow that provided the lung pressure that caused a reed to vibrate, and a boxed cavity with switchable branches to simulate the the vocal tract. A deformable leather tube allows the effective tract area to change to model the changing resonance of the tract. Later, more refined systems were built by Faber (1846) that included a tongue model and Riesz (1937) [34] with a more natural vocal tract model (Figure 2.4).  (a) Faber’s Euphonium  (b) Riesz’s Vocal Tract  Figure 2.4: Other mechanical synthesizers [34] These devices provide the physical mechanisms that mimic the vocal tract in acoustic response, but of course the activation requires user interaction for generating the equivalent glottal source and tract movements. The user directly manipulated these devices using mechanical actuators: bellows, levers, switches, foot pedals and deformable tubes that generated, or affected the output sound. The physical nature of these systems provides a direct correlation between the user input and the sound output. Mechanical levers and coupling provide mapping between manual user input and the articulators. 
Because these are physical acoustic instruments, their output is generally quite constrained by the physical properties of the components. In more recent times, robotic systems such as the Waseda Talker [17] have been built. These systems use electronically actuated physical models, which means sequences of articulator commands can be generated from a computer, allowing for automated playback of a large number of control signals at a time. Determining a way for a user to directly manipulate these signals for real-time control is still an open-ended question.

2.2.2 Software Articulatory Models

A software articulatory synthesizer, as its name implies, models the articulators in the human vocal mechanism to synthesize speech. However, unlike a mechanical model, the simulation is done in software. [25] provides a more comprehensive overview of the developments in articulatory synthesis and a description of various technical challenges, and VocaltractLab [1], as shown in Figure 2.5, is an example of the state of the art in articulatory synthesis.

Figure 2.5: Screenshot of the VocaltractLab [1] tube model

Generally, a software articulatory synthesizer contains the following components: a source model, a tube model, and a synthesis model. The following sections describe the basic functioning of each component.

Source excitation

The source model simulates the effect of airflow through the glottis that generates the glottal source waveform (Figure 2.2). There are two main methods of generating this waveform: parametric and dynamic. In parametric models the glottal waveform is defined explicitly as a function over time, and the shape of the function is adjusted based on desired vocal qualities [29]. In dynamic representations such as the 2-mass model [23], the glottal waveform is derived by modelling the vocal folds as a mass-spring system and solving the basic Newtonian equations. While it would be possible to tackle the source generation from an even more fundamental perspective (and indeed the entire acoustic simulation) using first-principles airflow modelling, the extensive computation times and incompleteness of such models make them unsuitable for synthesis at the speech level.

Tube model

The tube model simulates the relevant physical properties of the vocal tract in the form of a geometry that can be used to calculate the resonant parameters. Similar to the source excitation, the geometry can also be parametric or dynamic. In parametric representations, the explicit tube geometry is defined and varied over time based on pre-computed functions, while in dynamic bio-mechanical models the geometry is computed based on physical principles using numerical methods.

Synthesizer

Based on the source excitation waveform and the tube model configuration, the synthesizer computes the output audio samples that can be played back on a computer's digital-to-analog converter (sound card). The synthesis can be offline, which generates a series of samples that can be stored for later playback, or real-time, where the samples are generated on the fly and sent directly to the sound card at the same rate as the audio output. For the latter case it is critical that the system can produce the samples at a sufficient rate.

Controlling Articulatory Synthesis

Compared to a physical mechanical system, a software articulatory synthesizer allows easy adjustment of system parameters to create desired ranges of behaviour.
Sequences of control for the articulators can be automated to reproduce natural speech trajectories [24]. Automation of control values can reduce the input bandwidth required, but a trade-off must be made in the degree of control. As an example, one extreme case is for a single button press to trigger an entire sequence of trajectories controlling all the synthesis parameters, allowing pre-determined but inflexible input. At the other extreme, if each individual synthesis parameter is exposed to explicit control, a huge number of inputs would be required, likely exceeding the number of controls a user can effectively actuate at a time.

2.2.3 Filter Models

A higher-level abstraction is to look at the source-filter nature of the vocal process. Instead of working at the physical tube model level, the source-filter production model deals directly with the resultant resonant property of the vocal tract and its effect on the frequency spectrum of the glottal source, from the perspective of acoustic targets observed during actual speech. The source-filter model can be implemented in electronic circuits, or digitally on a computer.

Analog Electrical Circuits

An example of an analog circuit source-filter speech synthesizer is Dudley's Voder [10], as shown in Figure 2.6. The Voder consists of an oscillator to simulate the glottal source, a noise generator for unvoiced sounds and filters to represent the vocal tract shape. The system was controlled using a foot pedal and keyboard. The foot pedal controlled the frequency of the oscillator that determined the pitch of the glottal source, and the keyboard selected various resonant parameters. Each key's position determined the amplitude of a particular frequency band, so the user had direct control over the frequency response. This meant that the formants had to be explicitly
Figure 2.7: Overview of the audio recording and playback process During the recording process, acoustic pressure signals are captured using a sensor (microphone), and then converted into a representation suitable for the storage medium. The storage medium can be analog (phonograph or magnetic tape), or digital (Compact Disc or digital memory), but the fundamental goal is the same: to provide an explicit representation of the acoustic pressure waves over time that can be retrieved and reproduced at a later time. In the playback process, the stored representation of the pressure waves are retrieved and then converted back using an actuator (speaker). Figure 2.8 shows a small segment of digitally recorded segment of “ah”. The horizontal axis represents time and the vertical axis is the digital represen15  tation of the measured acoustic pressure.  Figure 2.8: “Zoomed in” display of the “ah” sound Of course, the playback of segments of audio does not allow for flexibility in terms of speech output produced. Nevertheless, systems based on the selective recording/playback of words, phrases and even entire sentences are employed in automated announcement and interactive user interfaces. One method of improving the degree of control is to segment recordings into short segments at the phoneme (or smaller) level and then recombining them based on the intended output. By segmenting the recording at appropriate periods, manipulating the samples and blending with others, it would be possible to generate a wide combination of words that may not have been in the original recorded dataset. This method is known as concatenative synthesis. Using suitable rules to select and stitch samples together can yield natural sounding speech, but the process does not lend to direct real-time manipulation of the underlying vocal characteristics required for expressive speech and singing. However, systems such as CataRT [30] have used concatenative based synthesis for real-time performance - in this particular case, the performer interaction is built upon the selection and manipulation of utterances rather than control of actual vocal parameters. Another system, the RAMCESS 16  synthesizer [5], provides an example of using concatenative synthesis at the glottal source level but then using a formant-based filtering system for vocal tract manipulation - essentially a hybrid approach. The Handsketch controller based on a Wacom tablet [6], as shown in Figure 2.9 was used to control this synthesizer. The Handsketch mapped the X-Y position of the tablet stylus to formant specifications and pitch, while the pressure of the stylus determined the glottal source parameters attributed to vocal quality.  Figure 2.9: The Handsketch controller for singing synthesis  17  2.3  Summary of Gesture to Speech Systems  A general overview of the human speech production mechanism and a survey of gesture to speech systems have been provided in this chapter. From the description of existing systems, we can classify these systems based on their production and control methods, as shown in Table 2.1. 
Table 2.1: Gesture to Speech systems

System                     Production                             Control
von Kempelen machine       Mechanical                             physical manipulation
Faber's Euphonium          Mechanical                             physical manipulation
Riesz                      Mechanical                             physical manipulation
Dudley's Voder             Source/Filter                          buttons and switches
Dunn's Electric VT         Source/Filter                          buttons and switches
d'Alessandro's Handsketch  Concatenative + Source/Filter hybrid   hand gesture, digital mapping

While each production model has distinct features and requirements, a common challenge faced by these speech synthesizers is the method by which they are controlled. Generally, it is possible either to provide detailed input parameters to synthesizers that lead to well-defined vocal trajectories, or to provide extensive control over individual synthesis parameters, but not both. As a result, there is generally a trade-off between being able to output a flexible range of vocal sounds and providing natural sounding output.

2.4 Gesture Mapping in the NIME Context

As one of the major challenges of gesture controlled speech is the mapping between input (gesture) and output (synthesis), the new instrument field provides some potential answers. New Interface(s) for Musical Expression (nime) deals with the design, construction, practice, composition and performance of music on novel interfaces. Many nimes consist of new sensors (or traditional sensors used in new ways) connected to various synthesis techniques, and the behaviour of the instrument is determined largely by how the inputs are mapped to the synthesis parameters. An effective mapping system allows the instrument to be intuitive and transparent to the listener while being controllable and expressive to the performer. However, because of the novel ways many nimes are constructed on both the sensing and synthesis side, this is often not the case. [21] proposed the mapping strategy shown in Figure 2.10 and suggests that the challenge of designing an effective nime is to establish a set of meaningful parameters in the mapping layer.

Figure 2.10: The "three layer mapping strategy" according to [21]

In terms of developing new musical interfaces, [26] provides a framework for selecting and evaluating musical input devices using tools from Human Computer Interaction (hci). Using comparisons with existing hci theories for movement and target acquisition, a list of recommended musical tasks was selected. [18] develops the concept of "feature-based" synthesis, a formalized framework for mapping acoustic and perceptual features onto synthesis. In [19] the same authors develop strategies for applying the concept in real-time performance. [38] categorizes musical functions into three basic groups: static (selection of range), relative dynamic (modulation of pitch, amplitude, or timbre) and absolute dynamic (selection of absolute pitch, amplitude, or timbre). Based on these categories, [41] provides physical descriptions of specific sensor technologies and their suitability for these tasks. Specifically, isometric force input (from sensors such as a Force Sensitive Resistor (fsr)) appears to be suitable for relative dynamic tasks, while a position input (from a touch screen) appears to be better for absolute tasks. Regarding the mapping scheme of new instruments, [22] suggested that complex mappings are more expressive compared to a direct one-to-one mapping between gesture inputs and synthesis output.
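As a concrete, simplified illustration of such a mapping layer (not taken from any of the cited systems), the sketch below blends between a few synthesis parameter targets using Gaussian radial basis functions, the same general idea used for target blending later in this thesis; the target positions, formant values and kernel width are invented for the example.

```python
import numpy as np

# Gesture-space targets (x, y) and the synthesis parameters (F1, F2 in Hz)
# associated with each target. All numbers are illustrative only.
targets = np.array([[0.2, 0.8], [0.5, 0.2], [0.8, 0.7]])
params = np.array([[730.0, 1090.0], [270.0, 2290.0], [300.0, 870.0]])

def rbf_map(hand_xy, width=0.25):
    """Blend the parameter targets with Gaussian radial basis functions."""
    d2 = np.sum((targets - hand_xy) ** 2, axis=1)   # squared distance to each target
    w = np.exp(-d2 / (2.0 * width ** 2))
    w /= np.sum(w)                                  # normalized weights sum to one
    return w @ params                               # weighted blend of the parameter sets

print(rbf_map(np.array([0.4, 0.5])))  # an in-between gesture yields blended formants
```

A one-to-one mapping would instead wire each sensor directly to a single parameter; the interest of a layer like this is that a low-dimensional gesture can drive many coupled synthesis parameters at once.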
While the findings in existing nime literature inform the development of a voice instrument, when applying these concepts to speech there are many unanswered questions such as the categorization of speech/singing tasks within the same frameworks.  2.5  Chapter Summary  In this chapter, we introduced some basic theory of human speech production and provided examples of how speech is synthesized. From there we see that speech is a complex phenomenon requiring a large number of precisely coordinated movements. Existing speech synthesis research does not deal with the question of how to control synthesis parameters from real-time input gestures. In terms of gesture mapping for nimes, existing literature suggests that a complex, meaningful mapping layer is required for an effective instrument that is intuitive, transparent and expressive.  20  Chapter 3  Evaluation of the DiVA System In this chapter, first an overview of the development of the diva system is presented. Then, the experience from working with the diva in performance is documented, describing various technical and musical issues encountered throughout the process. One of the issues discovered from working with the diva system is that the input mapping and synthesis system can be improved, and the final part of this chapter describes in further detail the motivation for exploring a new input mapping and synthesis approach.  3.1 3.1.1  History of the DiVA System GloveTalk  GloveTalkI [13] was a gesture to word synthesizer that detected input hand gestures using an instrumented glove (VPL Dataglove) and mapped the gestures to target words for synthesis through a neural network. GloveTalkII [14] used a more advanced glove and mapped input gestures to synthesis parameters in real-time. Hand position and posture was detected by a Polhemus Tracker and Immersion Cyberglove, and the data was sent to a neural network that outputs formant parameters corresponding to specific  21  vowel and consonant target sounds to drive a formant synthesizer. Specific hand position and postures were defined as targets (sets of frequencies and amplitudes) that defined particular vowels and consonants and as the hand browsed through the input space the system blended between the formant parameters to create a continuous range of output. The neural network was trained by back-propagation using predefined pairs of hand position and posture combinations (input training sample) and its intended formant output (target).  3.1.2  GRASSP and DiVA  Based on GloveTalk, GRASSP [28], and its sucessor, diva1.0 [12], were used for artistic applications as a gesture controlled voice instrument. One of the key differences between diva1.0 and GloveTalkII was the way the neural network was trained, reducing the samples needed. diva1.0 was implemented using Max/MSP, a visual programming environment designed for music and multimedia. Due to the instability of input device drivers in Max/MSP, a decision was made to re-implement the system in a more robust package. diva2.0, the most recent stable version of the system used in the latest performances, reflects the porting and re-design of the collection of Max patches into a standalone C++ application. Figure 3.1 shows a signal flow diagram of diva2.0. Various sensors for the two hands and foot function as input devices to drive a software formant synthesizer. The right hand sensors were the same as the ones used in GLoveTalkII (Polhemus Patriot tracker for hand position and CyberGlove for hand posture). 
The vertical position of the right hand controlled the fundamental frequency (pitch), while the horizontal position browsed between vowel targets in formant space when the hand is in an open posture. When the right hand is closed, specified closed hand postures mapped to consonants. The vowel and consonant targets were specified by formant parameters, and when the hand is in an in between position and posture, a continuous blending of the target parameters was performed based on a distance-based function. The left hand triggered the plosives by the means of  22  Figure 3.1: The DiVA 2.0 system a custom made contact glove that functioned as 8 individual on/off switches that triggered stops and plosive sounds. A foot switch turned on the main sound for vowels and consonants. While the overall functionality of the diva2.0 system was somewhat similar to GloveTalkII, there were distinct differences. First, the diva provided an user-adaptive training procedure using a radial basis function that drastically reduced the number of training samples required compared to back propagating a neural network. Secondly, considerable effort was spent during design based on the aesthetics and robustness of the system as required for use by musicians during rehearsal and performance. The following section of this chapter makes an attempt to document some of the performance and technical issues discovered through working with the diva systems as a technician supporting musicians in rehearsal and performance, as well as a developer working on various system components.  23  3.2  Lessons Learnt  One of the key features of the diva project is that, unlike many nimes where the builder and the performer is/are often the same technologically knowledgeable person, diva performers are skilled musicians with limited technical knowledge of the underlying system, and the instrument is used to perform a scored, composed piece. While using a new instrument in this setting creates certain challenges, it also provides a rich environment for exploring the musical implications through incorporating the creativity, experience and skill of trained musicians. In this section some general comments documenting the experience of working with diva musicians are presented.  3.2.1  Types of Musicians  Despite a small sample size of performers to date (3), it was clear from different observations that the type of training a musician received had an effect on their approach to the instrument. The DiVA group under discussion contained: • a classically trained singer • a trained singer - pianist with a great deal of experience in contemporary experimental music • a classical guitarist, who was also an untrained vocalist During the training process the guitarist became comfortable and adept with the diva much faster than the others. While there is an obvious distinction between singers and instrumentalists where the latter perform on physical, external instruments, there are also notable differences within instrumentalists that are based on the nature of interaction with the instrument. As an example, a pianist is never required to tune, regulate, restring or otherwise service their instrument (unless they have the additional vocation as a piano technician), whereas a guitarist is expected to tune the instrument (frequently) and restring (less frequently). 
On the other extreme, 24  many double reed players, such as oboists, make their own reeds, which requires a significant amount of invested effort in learning how to “build” a part of the instrument from scratch. It would be reasonable to suggest that when introducing a nime to a classically trained musician, the perceived mutability of the instrument has an effect depending on the performer’s musical background.  3.2.2  Semantics  When presenting a new instrument to a performer, it is important to delineate the boundaries between setting up the instrument, tuning and practising/performing. The description of each activity and expected outcome should be clearly communicated to the performer so the appropriate amount of effort can be applied to each task. The performer should have an idea of the limitations of the system so that a proper balance of time is spent tuning the system compared to rehearsing. Musicians and engineers have very different vocabularies. The same words can mean different things so extra caution should be exercised when communicating. As an example, if the term “training” is used to describe the process of recording preset target positions, there is an embedded meaning that feeding the system with more samples would somehow improve the performance. As another example, many musicians make strong distinctions between the terms “practice” and “rehearse”, distinctions that are not always apparent to engineers. Simply going by the definitions of words is insufficient: often there is a need to explicitly define certain processes to avoid ambiguity and misinterpretation.  3.2.3  Precision, Accuracy and Consistency  Repetition and reproducibility is a key aspect of musical practice. When a pianist approaches the keyboard and strikes a key it is expected that the same note and timbre is emitted from the instrument as on a previous occasion, assuming that an identical gesture was applied. The diva system has a high level of precision, and continuously blends various vocal sounds using  25  sensitive devices to measure input gestures with great resolution. However, due to slight variations in sensor mounting (caused by the physical nature of the wearable interfaces), the system did not always produce consistent results between practise sessions, and as the performer progressed to a certain level it was a source of frustration since it negated the effect of fine-detail practice and tuning. Therefore, methods of maintaining and checking for consistency should be implemented from both the technical perspective (in terms of tools and indicators) as well as practice routines (e.g. defining setup procedures, neutral positions for sensors, etc). A even larger consistency issue lies in the dynamic nature of new instruments. Traditional instruments (with the exception of the saxophone and electric guitar) are the result of hundreds of years of slow, incremental development. In the nime community, instruments are based on rapidly changing technologies and are often transformed in a very short time. When dealing with a device that is designed to serve multiple users with different, changing needs, the rapid development and iteration cycle, as mentioned by [35], is a positive attribute. However, when a musical instrument requires a great deal of practice to reach a certain level of virtuosity for performance, it is crucial that the physical dimensions and the responsiveness remain the same. Any “improvements” that are imposed upon the performer are potential setbacks.  
3.2.4  Robustness and Stability  The issue of system robustness has crept up on numerous occasions with varying degrees of detriment. A musician typically spends a significant amount of time (on the order of months or even years) preparing for a performance that may last a few brief minutes. To quote Bill Buxton, an expert in interface and instrument design, on the reliability of instruments: “... in the grand scheme of things, there are three levels of design: standard spec., military spec., and artist spec.”[2] More specifically we have found some of “Perry’s Principle”s, such as avoiding battery powered and wireless devices [3] to be extremely relevant  26  here. Despite improvements in wireless technology (as mentioned in the updated article [4]) there is still a strong motivation to use wired connections for performance whenever and wherever possible. As an example, despite the ongoing developments in making Bluetooth communications robust, the instrumented gloves using this technology were the source of many problems throughout various iterations of the diva project, problems that included crashes and severely degraded performance during concerts. As one can imagine, the robustness and stability of the system is closely related to the confidence of the performer (see the previous section on consistency above).  3.2.5  Sound Quality  A major technical issue of the existing system is the sound quality. From both the performer and audience perspectives, the perceived synthesis quality of the existing diva system was poor in terms of expressiveness, naturalness and intelligibility. Some performers even admitted that the sound quality had a negative effect on their motivation for practising.  3.2.6  Summary of Learnings  Up to this point in the chapter we have described some of the lessons learnt through working with the diva system as a performance instrument, and they appear as follows:  • The type of musician may affect their attitude and approach towards new instruments, and consideration should be made (where appropriate) when selecting performers. • The semantics is important especially when working in cross disciplinary teams. • Musical instruments need to be consistent in function and behaviour. • Musical instruments, especially those intended for stage use, must be reliable. • The sound quality of the current system is an issue 27  With exception of the final item, the above points can be directly applied as guidelines when dealing with new instruments and performers. The sound quality issue requires in depth investigation, and will be discussed in the remainder of this chapter.  3.3  Motivation for Exploring New Mapping and Synthesis  Sound Quality: A Mapping Problem? In Chapter 2, examples of speech synthesis production and control methods were presented. From there we identified the trade-off between being able to fully control synthesis parameters and well defined input trajectories that provided natural sounding output. The challenge faced by the current diva system can be attributed to this trade-off: compared to Dudley’s Voder where the amplitude of each frequency band is individually controllable, the diva mapping is more constrained since it can only produce output existing within a pre-determined formant space. This formant space is chosen to resemble specific speech targets defined by a vocabulary. 
Although having these targets means that the output is closer to what is expected in natural speech (and much easier to control compared to individual manipulation of frequencies), when making gesture trajectories between these formant targets, the interpolations in frequency space do not necessarily correspond to the physical resonance behaviour of the vocal tract when moving between the corresponding articulatory targets. It may be possible to carefully craft these trajectories to match natural speech, but the gestures required can be very complex. As an example, the offline-tuned preset trajectories developed by Holmes for his synthesizer, as mentioned previously, do create quite natural sounding speech, but the gestures involved would be extremely difficult to make. The three cases, along with our ultimate goal, are presented relative to each other in Figure 3.2.

Figure 3.2: Controllability vs naturalness

3.3.1 Suggested Avenue of Exploration

To find a way of moving towards our goal of creating more natural sounding speech while preserving controllability, we need to look at the problem from both the synthesis and input mapping perspectives. Since browsing through trajectories linearly interpolated in formant space does not necessarily translate to natural trajectories in the physical, vocal space, a reasonable approach is to synthesize the sound within a space that has a closer connection to the physical anatomy. Articulatory-based synthesis methods appear to be an obvious choice. As mentioned in Chapter 2, articulatory synthesis systems can employ a parametrically defined kinematic tube geometry or a dynamic one. Referring back to Figure 2.10 in Section 2.4, in the context of mapping from the nime perspective, the bio-mechanical model serves as the middle layer providing a "meaningful" [21] translation between input and synthesis parameters, since the model constrains the vocal tract shape (and hence the resonant response) to physically meaningful configurations, as opposed to arbitrary frequency values and trajectories. Additionally, from the findings of [22] on direct vs complex mapping, the extra layer imposed by the bio-mechanics adds complexity (compared to directly driving the formant values) and should produce an interface that is more expressive. In terms of the actual input device, the existing diva system used kinematic controllers where the spatial position is measured. There are also dynamic (force) controllers which, when dealing with a dynamic model driven by force activations, appear to make sense. It is not clear which input method is best for controlling speech. Based on the above discussion, the proposed course of action is to investigate an articulatory-based synthesis system and evaluate the effect of bio-mechanics and various input devices as the starting point for exploring ways of creating more natural sounding, but still controllable (expressive), speech.

Chapter 4

Implementation of Force and Position Input Controlling a Bio-mechanical Model for Articulatory Synthesis

This chapter describes the implementation of the bio-mechanically based mapping layer that drives an articulatory synthesizer and the process of setting it up to evaluate different input and mapping strategies. First an overview of the system will be provided, showing the relationship between various components and the signal flow from the starting point of user hand gestures to the final audio output.
Then, a more detailed description of each individual module is presented, followed by the results of the integrated system. Finally, kinematic mapping is added to the system to allow comparison with the input mapping strategy of the existing diva system.

4.1 Overview

Figure 4.1 shows the overall system diagram in terms of signal flow. The gesture input from the hands is detected and sent to a mapping layer. From there the raw input values are scaled into muscle activations that are fed into a bio-mechanical model of the vocal tract, which calculates the geometry that then drives an articulatory synthesizer that finally outputs audio in real time.

Figure 4.1: System diagram

4.2 Development Environment

Table 4.1 shows the various system components and the environment in which they were developed.

Table 4.1: Development platforms

Component             Platform
Gesture Input         Arduino
Input Mapping         Max/MSP
Bio-mechanical Model  Java
Synthesizer           Java

The bio-mechanical model and synthesizer contained modified and extended parts of existing systems, while the input system and mapping layers were implemented from scratch. Max/MSP was chosen for the mapping layer due to its graphical representation and manipulation of signals. The graphical nature of the platform allowed rapid implementation, tuning and debugging of the mapping parameters, which, while feasible in other languages such as Java or C++, would have required a considerable increase in development overhead.

4.3 Inter-module Communication

The modular nature of the system was partially influenced by the experience of working on the various diva systems in the past, as well as by the implementation of existing components and tools used in the system. The advantage of a modular system is that components can reside on multiple platforms, which allows extra computational power or system features not otherwise available. One drawback is that the communication between the modules may introduce bandwidth limitations and latency. The next section describes the implemented communication system, which takes into consideration the requirements of the system and the available resources.

4.3.1 Open Sound Control

For the various modules to communicate, Open Sound Control (osc) was used. osc is a network-based messaging protocol designed to address some of the limitations of the Musical Instrument Digital Interface (midi) standard, especially when dealing with new instruments implemented on increasingly powerful systems that require more flexible and descriptive control parameters. With a simple specification that sits on top of the User Datagram Protocol (udp), there are software libraries for osc on most existing platforms. The plain text nature of the protocol allows message specification in a human readable format. However, since osc requires a network connection, there is the added requirement for hardware (either Ethernet or Wifi) and software, which creates potential performance issues (especially on lower-powered devices). [16] provides a more detailed comparison of osc and midi. One extremely attractive feature of osc for the current implementation is the drastically reduced programming overhead without noticeable bandwidth and latency limitations. When running on a local machine the latency was less than 1 ms for each connection, and on a local wireless network, rarely above 10 ms.

4.3.2 Sender and Receivers

Since osc runs on top of udp, the standard socket protocol setup applies. Communication is done between senders (clients) and receivers (servers). Each receiver listens on a specified port, and a sender requires a destination port and Internet Protocol (ip) address. If the modules run on the same machine, the local loopback ip address (127.0.0.1) can be used.
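As a minimal illustration of this sender/receiver setup (not the project's actual message set), the sketch below uses the python-osc package on the loopback address; the address name and port number are placeholders.

```python
import threading
import time

from pythonosc.udp_client import SimpleUDPClient
from pythonosc.dispatcher import Dispatcher
from pythonosc.osc_server import BlockingOSCUDPServer

PORT = 7400  # arbitrary example port

def on_message(address, *args):
    # Print whatever arrives at the example address.
    print(address, args)

dispatcher = Dispatcher()
dispatcher.map("/example/activation", on_message)
server = BlockingOSCUDPServer(("127.0.0.1", PORT), dispatcher)  # receiver
threading.Thread(target=server.serve_forever, daemon=True).start()

client = SimpleUDPClient("127.0.0.1", PORT)                     # sender
client.send_message("/example/activation", [0.25, 0.75])
time.sleep(0.1)  # give the receiver a moment to handle the message
```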
Communication is done between senders (clients) and receivers (servers). Each receiver listens on a specified port, and a sender requires a port and an Internet Protocol (ip) address. If the modules run on the same machine, the local loopback ip address (127.0.0.1) can be used.

4.4 Gesture Input

4.4.1 Input Hardware

The force input hardware consists of a number of fsrs attached to a microcontroller. An fsr is made of a semi-conductive sandwich whose electrical resistance decreases as force is applied, so the applied force can be measured by placing the fsr in a voltage divider configuration, as shown in Figure 4.2a. The output voltage in such a configuration is given by Equation 4.1:

V_{out} = V_{in} \times \frac{R_2}{R_1 + R_2}    (4.1)

where the fsr is connected as R_2 for each sensor. The nominal value of the fsr is around 100 kΩ and drops to around 10 kΩ when first depressed. The resistance is roughly inversely proportional to the force applied, and reaches around 250 Ω when saturated. While it may be useful for future studies to identify the consequence of mapping different force profiles and to know the exact force applied, for now the main interest is to sense a monotonically increasing force for the mapping. With a V_{in} of 5 V, the output is close to 5 V when there is no applied force and close to 0 V when the sensor is fully saturated. Four identical sensors were used, and each V_{out} is connected to an analog input pin of the microcontroller as shown in Figure 4.2b.

For the microcontroller, the Arduino was chosen due to its availability for rapid prototyping with little development overhead and its easy-to-use IDE. While more constrained in terms of features compared to other microcontroller solutions, the Arduino provides sufficient capabilities and performance for the task at hand, and allows room for easy expansion if more inputs are required in the future. The open hardware and software platform, along with widespread retail availability, allows the system to be easily rebuilt elsewhere in the future.

Figure 4.2: Force sensor circuit. (a) FSR in a voltage divider; (b) FSR interface to the microcontroller.

The fsrs are mounted using clear tape and a layer of closed cell foam (for comfort) on top of a plastic enclosure that houses the microcontroller, as shown in Figure 4.3.

Figure 4.3: FSRs mounted on enclosure

4.4.2 Input Software

There are two pieces of software responsible for getting the force input values from the physical hardware to the computer software. First is the firmware that runs on the Arduino to control the hardware (the sender), and second is a Max/MSP patch (the receiver) to process the values. The system runs in a polled mode, and the rate is controlled by the receiver end.

Arduino Firmware

The Arduino firmware is responsible for setting up the hardware, reading the force inputs and sending the data to the computer. After a simple setup function to initialize the serial and input ports, the program enters a loop that constantly checks for a read command through the serial input. When a read command is received, the program enters a routine that samples the input pins and sends the result back to the computer. The measured voltage reading per port is a 10 bit value (based on the resolution of the hardware), and the values are assembled into a plain text ASCII string, separated by the space ' ' character and terminated with a newline.
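The following is a minimal sketch of this firmware loop, written in the Arduino dialect of C++. It is illustrative only: the analog pin assignment (A0 to A3) and the single-character poll command ('r') are assumptions, while the 10 bit readings, the space-separated ASCII output and the newline terminator follow the description above.

// Illustrative Arduino firmware sketch (not the original): poll-driven sampling
// of four force sensors, reported as one space-separated ASCII line per poll.
const int NUM_SENSORS = 4;
const int sensorPins[NUM_SENSORS] = {A0, A1, A2, A3};  // assumed pin assignment

void setup() {
  Serial.begin(115200);                 // USB-serial link to the host computer
}

void loop() {
  if (Serial.available() > 0 && Serial.read() == 'r') {   // assumed poll command
    for (int i = 0; i < NUM_SENSORS; i++) {
      Serial.print(analogRead(sensorPins[i]));            // 10 bit value, 0..1023
      Serial.print(i < NUM_SENSORS - 1 ? " " : "\n");     // space-separated, newline-terminated
    }
  }
}

Keeping the sensor end this simple means that all timing decisions stay on the receiver side, which matches the polled design described in the text.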
The string is sent to the computer via the Arduino's serial port, which is connected to an on-board Universal Serial Bus (usb)-serial adapter. The serial port operates at 115200 baud. Figure 4.4a shows the flow diagram of the microcontroller firmware.

Figure 4.4: Input system flow diagrams. (a) Sensor end (setup, then wait for a poll and send the readings); (b) receiver end (metro-driven poll, then read and parse the serial data).

Receiver Patch

Figure 4.4b shows the corresponding polling receiver flow diagram of the Max/MSP patch that sends the poll commands and parses the input values. Here, the setup function initializes the serial port by choosing the correct interface and baud rate, and starts a "metronome" object which issues events at a pre-set interval of 15 ms. This sampling rate corresponds to the existing hardware used in the diva system, as well as to the touch pad system used in the experiment (explained in later sections). Each time the metro event triggers, a poll command is issued to the hardware and the response is parsed. A scaling function also inverts the received value and maps it onto the range 0.0 to 1.0 (1023 maps to 0.0 and 0 maps to 1.0).

4.5 Force to Muscle Mapping

The four force inputs represent directions, and are used to control muscles based on their effect on the tongue body: front, back, up, and down. This input system allows opposing muscles to be activated at the same time. The mapping between the input sensor values and muscle activations was implemented as the Max/MSP patch shown in Figure 4.5.

Figure 4.5: Force input and mapping

4.6 Bio-Mechanical Model

The model was implemented in the ArtiSynth modelling environment [11]. A series of beams were constructed around an existing tongue model [39] to represent sections of the vocal tract around the tongue, and 22 marker points were placed at set intervals along the tract surface. The distances between these marker points and the tongue surface are computed in real time, which allows an effective cross sectional area function to be calculated. These area functions provide the main inputs required by the articulatory synthesizer in calculating the audio output.

Figure 4.6: The ArtiSynth vocal tract model

Figure 4.6 shows the model and parts of its Graphical User Interface (gui). The sliders in the "Tract Control" window allow interactive manipulation of the tract parameters (mostly muscle activations) for tuning, but in operation they are set via incoming osc messages. As the simulation runs, the resultant cross sectional areas are sent to the synthesis module.

4.7 Synthesizer

The synthesizer is from the jass library [37] and the implementation is described in [36]. The glottal source model used is based on [23]. The main addition made to the synthesizer was the implementation of an osc listener that allows the tube shape and glottal source parameters to be controlled in real time. The number of tube sections is set to be the same as the output of the bio-mechanical model, although a linear interpolation function was also implemented which allows a different number of sections to be entered. The synthesizer is also able to output numerically, on request, the current tube parameters for debugging and comparison. Figures 4.7a and 4.7b show the interface for the synthesizer. In the original application, the sliders in the User Interface (ui) allow the tube parameters to be modified interactively. In the background, the received osc messages from the bio-mechanical model set the tube width parameters and change the output sound in real time.
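As an illustration of the section-count conversion mentioned above, the sketch below resamples an area function onto a different number of tube sections by linear interpolation. It is a self-contained C++ sketch of the idea only, not the actual jass code; the function and variable names are invented for this example.

#include <vector>

// Resample an area function (one cross-sectional area per tube section)
// onto a different number of sections using linear interpolation.
// Illustrative sketch only; not the actual synthesizer implementation.
std::vector<double> resampleAreas(const std::vector<double>& in, int outSize) {
    if (in.empty() || outSize <= 0) return std::vector<double>();
    std::vector<double> out(outSize);
    const int inSize = static_cast<int>(in.size());
    for (int i = 0; i < outSize; ++i) {
        // Position of output section i in the input index space.
        double pos = (outSize == 1) ? 0.0
                     : static_cast<double>(i) * (inSize - 1) / (outSize - 1);
        int lo = static_cast<int>(pos);
        int hi = (lo + 1 < inSize) ? lo + 1 : lo;
        double frac = pos - lo;
        out[i] = (1.0 - frac) * in[lo] + frac * in[hi];
    }
    return out;
}

For example, the 22 cross sectional areas produced by the bio-mechanical model could be resampled in this way if the synthesizer were configured with a different number of tube sections.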
Figure 4.7: Jass synthesizer interface. (a) Vocal geometry parameters; (b) two-mass model and formants.

4.8 Integration and Output

Using osc, the various system components were connected as shown in Figure 4.8. Of note is the extra connection on port 12002 between the force mapping Max patch and the synthesizer, which sends initialization parameters to the synthesizer. In theory these parameters could be sent from the bio-mechanical model, but for the sake of development it was much more convenient for the tuning process to send these messages from Max/MSP.

Figure 4.8: UDP ports for OSC messages (input hardware → force-to-muscle mapping over usb serial; mapping → bio-mechanical model on port 7000; model → synthesizer on port 12000; mapping → synthesizer on port 12002).

Once the components were connected and the osc messaging system tested, the physical geometry of the bio-mechanical model was manually tuned to produce four positions that resembled target vowels when driven by four sets of saturated force inputs, as shown in Figure 4.9. When the calculated cross sections are received by the synthesizer, the spectral output shown in Figure 4.10 is observed. Since the actual geometry differs, it is not possible to compare the actual frequencies with others in the literature [31]. However, the relative positions of the formants and the trajectories when transitioning between the targets appear to be important for vowel identification, as suggested by studies described in [32]. Indeed, the actual targets for vowels are often not reached in successful vowel trajectory production and identification [9].

Figure 4.9: Tongue in target vowel positions. (a) i (up); (b) e (up+forward); (c) a (down); (d) u (up+back).

Figure 4.10: Output spectrum for various tongue positions. (a) i (up); (b) e (up+forward); (c) a (down); (d) u (up+back).

4.9 (Re)Implementation of Position/Kinematic Input

In order to provide a balanced comparison of just the input mapping, we cannot simply use the existing system, due to the different sounds of the synthesizers. In fact, a preliminary study was done using the existing synthesizer for the position input case, and there were significant differences in the perceived audio quality which affected the validity of the results. Therefore, the kinematic input for browsing the vowel space was reimplemented in the new system with the articulatory synthesizer.

Although the original diva system made use of a 3D position tracker for the control of vowel sounds (X-Y for vowel browsing and Z for pitch), we decided to use only 2D (X-Y) input, since we are only comparing the browsing of the vowel space. The position input was implemented on a touch screen, and the same Gaussian Radial Basis Function (rbf) based interpolation was applied to tube geometries (instead of formants). Four targets were laid out on the 2D surface, each corresponding to one of the target tube shapes represented geometrically in Figure 4.9, which produced the spectral output in Figure 4.10. The goal of the position/kinematic input system is to allow browsing between these tube shapes. For a given input coordinate (x, y) on the touch screen, the pixel distance between the touch point and the ith target position at (x_i, y_i) is calculated using the Euclidean distance (Equation 4.2). The standard Gaussian rbf (Equation 4.3) was applied for each target.
r_i = \sqrt{(x_i - x)^2 + (y_i - y)^2}    (4.2)

\phi_i(r_i) = e^{-(r_i)^2}    (4.3)

\bar{\phi}_i = \frac{\phi_i}{\sum_{j=0}^{N} \phi_j}    (4.4)

Then, the \phi_i values were normalized so that they sum to 1 (Equation 4.4), and the normalized values function as weights for calculating the geometry of the tube T, which is composed of a linear interpolation of the N target tube shapes (in this case, N = 4 for the target vowels), as given by Equation 4.5, where T_i(k) is the kth section of the ith target tube shape:

T(k) = \sum_{i=0}^{N} \bar{\phi}_i T_i(k)    (4.5)

The kinematic mapping layer was implemented in Max/MSP and, in a twist of certain irony, employed similar patching structures to sections of the original diva1.0 system. Figure 4.11a shows the sub-patch, rbf2d, that takes in two point coordinates (one for the input position, the other for the target) and calculates the rbf value from the distance between them. Figure 4.11b shows the rbf2d sub-patches being used together, with their outputs normalized.

Figure 4.11: Components of the RBF mapping patch. (a) RBF calculation; (b) normalized tube parameters.
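For readers who prefer code to patch diagrams, the following C++ sketch restates Equations 4.2 to 4.5: it computes a Gaussian weight for each target from its distance to the touch point, normalizes the weights, and blends the target tube shapes. This is an illustrative re-expression only (the actual mapping layer is the Max/MSP patch above); the names and in particular the width constant used to scale the pixel distances are assumptions, introduced so the example behaves sensibly on screen-sized coordinates.

#include <cmath>
#include <vector>

// Illustrative restatement of Equations 4.2-4.5.  Target positions, tube
// shapes and the RBF width are placeholder values, not taken from the thesis.
struct Target {
    double x, y;                      // target position on the touch surface (pixels)
    std::vector<double> tube;         // target tube shape (one area per section)
};

std::vector<double> blendTubeShapes(double x, double y,
                                    const std::vector<Target>& targets) {
    const double kWidth = 200.0;      // assumed Gaussian width, in pixels
    const size_t numSections = targets[0].tube.size();   // assumes >= 1 target

    // Equations 4.2 and 4.3: distance to each target and its Gaussian weight.
    std::vector<double> phi(targets.size());
    double sum = 0.0;
    for (size_t i = 0; i < targets.size(); ++i) {
        double ri = std::sqrt((targets[i].x - x) * (targets[i].x - x) +
                              (targets[i].y - y) * (targets[i].y - y));
        phi[i] = std::exp(-(ri / kWidth) * (ri / kWidth));
        sum += phi[i];
    }

    // Equations 4.4 and 4.5: normalize the weights and blend the target tubes.
    std::vector<double> tube(numSections, 0.0);
    for (size_t i = 0; i < targets.size(); ++i) {
        double weight = phi[i] / sum;
        for (size_t k = 0; k < numSections; ++k)
            tube[k] += weight * targets[i].tube[k];
    }
    return tube;
}

When the touch point sits exactly on a target, that target's normalized weight approaches 1 and the output tube converges to that target shape, which is the behaviour the vowel targets rely on.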
The physical input system was implemented as a simple openFrameworks iPad application that detected and sent a single touch location to the mapping system. Figure 4.12 shows the force and position input devices side by side.

Figure 4.12: Force and kinematic input interfaces

Chapter 5

Evaluation

This chapter describes the evaluation of the input and mapping of the new diva system compared with the existing one. The browsing of the vowel space set up in Chapter 4 is compared from both the performer and listener perspectives. A discussion follows the presentation of the results of the experiment.

5.1 Overview

The main goals of the evaluation are to compare the force and position input and mapping systems from the following perspectives:

• Performer: differences in usage (qualitative)
• Performer/Listener: intelligibility (quantitative)
• Listener: other characteristics of sound (qualitative)

5.2 Pilot Study

A pilot study was conducted to guide the implementation of the experiment. Three performers and four listeners were recruited for the pilot study. The experiment contained two separate parts. In the first part, each performer was introduced to the system and, after some practice, asked to generate a number of vowel sequences that were recorded and used for the second part, the listener evaluation. During the listener evaluation, the subjects were asked to identify audio samples produced in the first phase and to provide qualitative comparisons between samples produced using the different interfaces.

Through the pilot experiment, various issues were discovered that motivated modification and refinement of the evaluation procedure. One of the key issues in the pilot was the use of the existing system's synthesizer: a significant difference in output sound quality was noticed by listeners. There was a significant difference in the intelligibility results, but the effect may have been due to the sound difference. Another issue raised by the pilot study was the training of the performers. There was no strict metric for when a performer was ready to perform other than their personal response, so for the final implementation a small pass-fail test was employed before the subject could proceed to the recording portion of the experiment.

5.3 Experiment

The final evaluation consisted of two phases: the performer phase and the listener phase. Six performers and eight listeners were recruited. In the first phase, the performers used the systems to produce a number of "words" composed of the four vowels (Section 4.9), which were recorded for later playback. In the second phase, the listeners were provided with audio recordings of the samples and asked to identify them as well as to provide qualitative comparisons. The following sections provide more detail about each phase of the evaluation. The same headphones and computer were used for all the experiments at the same volume, to eliminate differences in sound quality due to audio hardware.

Table 5.1: Number of sample words

  Syllables   Combinations   # of words
  1           4              4
  2           4C2            12
  3           4 × 3 × 3      36
  Total                      52

5.3.1 Performer Experiment

Preparation

The performers were introduced to the system and given time to practice. The interfaces were presented in random order to eliminate potential learning effects. After the subjects had some time to familiarize themselves with an interface, they were asked to produce a few sample words to ensure that they were able to reach within 10% of the entire input range for each syllable in the word. The experiment only proceeded if the subject was able to fulfil this criterion.

Main Test

To provide a reasonable number of samples, words of up to three syllables were used, yielding 52 samples in total (Table 5.1). The 2 and 3 syllable words were chosen such that consecutive syllables were different, since we are more interested in the transitions between the vowel sounds. During the recording portion of the experiment, the order of the word list was randomized for each subject. As mentioned in the results from the pilot study, both the force input/bio-mechanical system and the existing position/kinematic one were connected to the same synthesizer, and the additional modifications implemented for the latter system are described in Section 4.9. After the entire list of words was recorded for one interface, the second interface was introduced and the process repeated.

After the samples were recorded for both interfaces, the subject was asked a number of preference questions based on the following:

• Ease of use
• Musical/Expressiveness
• Naturalness
• Fun/Enjoyment

5.3.2 Listener Experiment

The main part of the listener experiment was an identification test that played back samples recorded in the first phase of the experiment. 80 samples were randomly selected from the pool of samples recorded in the performer experiment. The two interfaces were represented equally (40 samples each). The listeners heard each sample only once and were asked to identify the word in the sample. In the second part of the listener experiment, 16 pairs of the same word (produced on both interfaces by one subject) were randomly selected from the entire pool of recordings. The order within each pair was randomized, and the pair was then played back to the listener for comparison. The metrics "sharp", "exciting", "natural", "speech-like" and "intelligible" were used.

5.3.3 Performer Evaluation Results

All the subjects were able to reach the within-10% input value for each target on both interfaces. After all the recordings were complete, the performers were asked a series of preference questions and the rationale for their choices. The following results were obtained:

Ease of Use

5 out of the 6 subjects thought the position input was easier to use. The 6th subject initially stated that the force input was more "intuitive" and hence "easier", but then retracted his statement and admitted that in terms of simplicity, the position controller was preferable.
Musical/Expressiveness

In this category, the results were evenly distributed between no preference, the position input and the force input. The two users who preferred the position input gave similar reasons: they were able to reach their desired targets more easily. The two users who chose the force input both cited the ability to vary the input and the many-to-many mapping as the main reasons. The remaining two users, who stated no preference, were not entirely sure of the reasons for their choices.

Naturalness

3 subjects preferred the force input, 2 the position input, and 1 was neutral. The subjects choosing the force input gave the reason that they were able to hear the difference in trajectories between the two input/mapping systems, and that the force/dynamic system bore a closer resemblance to natural speech. The subjects preferring the position/kinematic system felt that the input movements required were more natural.

Fun

All but one subject felt the force input was more fun due to its higher level of difficulty. The subject choosing the position input realized that non-linear movements in position space may potentially provide more interesting results. It appears that all the subjects associated "fun" with attempting to do something more difficult.

Table 5.2: Identification accuracy

  User      Position   Force
  1         65%        63%
  2         73%        68%
  3         70%        68%
  4         33%        35%
  5         48%        48%
  6         60%        45%
  7         70%        58%
  8         80%        78%
  average   62%        58%

5.3.4 Listener Evaluation Results

The identification task showed much closer accuracy rates compared to the pilot study (where the difference was 15% in favor of the force input system), suggesting that the output sounds are very similar (at least for novice listeners). For the qualitative descriptors (Table 5.3), there was no longer a significant perceived difference in the sound quality produced by the two input and mapping systems. This result suggests that the potential bias caused by the different synthesizers has been removed.

  (a) Final Experiment
                 Position   Force
  Sharp          48%        52%
  Exciting       53%        47%
  Natural        54%        46%
  Speech-like    55%        45%
  Intelligible   49%        51%

  (b) Pilot Experiment
                 Position   Force
  Sharp          98%        2%
  Exciting       77%        23%
  Natural        20%        80%
  Speech-like    27%        73%
  Intelligible   56%        44%

Table 5.3: Qualitative comparisons of the final and pilot experiments.

5.4 Summary of Results

From the above results, it appears that for vowel browsing, at least with inexperienced listeners, the force and position mappings do not sound that different. The intelligibility of the sound output is also not significantly different. However, there were strong responses from the performers regarding the two input interfaces. The effect of the differences in the underlying gesture-to-synthesis mapping was also apparent to some subjects. Most of the performers agreed that the force input was more difficult to use, but that it provided a greater level of input variability and, from certain perspectives, expressivity. The responses regarding the musical and expressive aspects were not as uniform, and this may be attributed to personal definitions of the terms.

Chapter 6

Conclusion

6.1 Contribution Summary

In this thesis we have documented the experience of working with a gesture controlled vocal synthesizer used for music performance. We identified and documented a number of issues and provided recommendations and guidelines for future work of a similar nature.
One of the issues we discovered that prompted an in-depth look at the gesture mapping and synthesis was the sound quality: more specifically, a trade-off between controllability and voice quality. Based on that, we explored a new synthesis system based on a more representative bio-mechanical model of the vocal tract. During the first-pass implementation and integration of the new system components, two input and mapping strategies were evaluated. The evaluation results suggest that, for the browsing of a vowel space involving inexperienced performers and listeners, there is no significant difference in the intelligibility of the sound produced between the two input and mapping strategies. However, from the performer's perspective, the force input system was generally identified as being more capable of producing expressive output due to its more complex mapping, and some performers noticed the stronger coupling between the input forces and natural speech trajectories.

The system and evaluation described in this thesis provide an initial but significant starting point for the future development of gesture controlled vocal synthesis systems. The following section describes potential next steps based on the results obtained thus far.

6.2 Suggested Future Work

6.2.1 System Additions

While the synthesizer itself is capable of producing a wide variety of sounds, there is currently no mapping implemented for many of the synthesizer parameters (pitch and voicing, for example). While some exploration has been done in controlling the two-mass model in the synthesizer using a hybrid interface [43] to provide rudimentary control of pitch and volume, the relationships between the parameters in the glottal source model and the output pitch and volume are not linear.

The vowel space implemented so far is quite limited: not only does it not cover the entire vowel space of the English vocabulary, the actual target configurations of the vowels are also not optimal. This is a result of limitations in the current vocal tract model, which does not account for the actual physical tube shape. A fully dynamic, anatomically correct bio-mechanical model already exists, but its computation time is an issue. The currently implemented system represents a compromise between an accurate model and fast computation time.

6.2.2 Input Mapping Strategies

Given the choices of force or position on the input side, and dynamic or kinematic representations in the mapping layer, there are 4 possible combinations of mapping strategies:

1. Force-Dynamic: force input / dynamic model mapping layer
2. Force-Kinematic: force input / kinematic model mapping layer
3. Position-Dynamic: position input / dynamic model mapping layer
4. Position-Kinematic: position input / kinematic model mapping layer

The evaluation described in this thesis contains the first and fourth of the four possible schemes. These were the most straightforward choices and were explored first, but it would be useful to evaluate the second and third methods as well to cover the full spectrum.

It should also be mentioned that the above discussion only deals with the browsing of the vowel space. When consonants and plosives are implemented, a larger variety of input parameters will be required. It is not clear at this point what the optimal input interface and mapping scheme will look like.
Ultimately, the addition of vocal features results in an increase in the amount of input bandwidth required. Working with musicians in the existing diva project has shown that there are definite limits on user input bandwidth, and that there is a trade-off between the level of expressive control and usability.

6.2.3 Musical Evaluation

While an attempt was made to evaluate the expressive nature of the system, the scope of the musical evaluation was quite narrow due to the system limitations mentioned in Section 6.2.1. As more components of the system relevant in the context of musical expression are implemented (such as the input and mapping responsible for controlling pitch, volume, vocal effort, an extended vocabulary, etc.), appropriate testing procedures should be developed to evaluate their effectiveness. [26] suggests a list of common "musical tasks" that can be used. Additionally, a major challenge in evaluating musical interfaces in general is the amount of training required to reach proficiency and the availability of resources to do so.

6.3 Final Thoughts

The synthesis and control of the human voice provides a rich platform for nimes, and in turn, the requirements of such an instrument offer considerable challenges in terms of technical demands and the refinement of knowledge across many fields. While there are still questions left unanswered at this point, one of the fundamental goals of the system is quite clear: to build an expressive voice instrument - an instrument that will project the emotive intentions of a musician to the audience through a set of skills developed with learning and experience. In the end, the proof of the pudding is in the performance.

Bibliography

[1] P. Birkholz, D. Jackel, and K. Kroger. Construction and control of a three-dimensional vocal tract model. In 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, 2006. → pages viii, 11

[2] B. Buxton. Artists and the art of the luthier. SIGGRAPH Comput. Graph., 31(1):10–11, Feb. 1997. → pages 26

[3] P. Cook. Principles for designing computer music controllers. In Proceedings of the 2001 conference on New Interfaces for Musical Expression (NIME2001), pages 3–6, 2001. → pages 26

[4] P. Cook. Re-Designing Principles for Computer Music Controllers: A Case Study of SqueezeVox Maggie. In Proceedings of the International Conference on New Interfaces for Musical Expression, volume 11, pages 218–221, 2009. → pages 27

[5] N. d'Alessandro and T. Dutoit. HandSketch bi-manual controller: investigation on expressive control issues of an augmented tablet. In Proceedings of the 2007 Conference on New Interfaces for Musical Expression (NIME07), pages 78–81, New York, New York, USA, 2007. → pages 17

[6] N. d'Alessandro and T. Dutoit. RAMCESS / HandSketch: A Multi-Representation Framework for Realtime and Expressive Singing Synthesis. In Eighth Annual Conference of the International Speech Communication Association, 2007. → pages 17

[7] N. d'Alessandro, B. Pritchard, J. Wang, and S. Fels. Ubiquitous Voice Synthesis: Interactive Manipulation of Speech and Singing on Mobile Distributed Platforms. In Proceedings of the 2011 annual conference extended abstracts on Human factors in computing systems, pages 335–340, 2011. → pages iii

[8] N. d'Alessandro, J. Wang, B. Pritchard, and S. Fels. Bringing Bio-Mechanical Modelling of the OPAL Complex as a Mapping Layer for Performative Voice Synthesis. In International Seminar on Speech Production (ISSP2011), 2011. → pages iii

[9] P. Divenyi.
Perception of complete and incomplete formant transitions in vowels. The Journal of the Acoustical Society of America, 126(3):1427–39, Sept. 2009. → pages 42 [10] H. Dudley and R. Riesz. A synthetic speaker. Journal of the Franklin Institute, 2(2), 1939. → pages 13 [11] S. Fels, J. Lloyd, K. Van Den Doel, F. Vogt, I. Stavness, and E. Vatikiotis-Bateson. Developing Physically-Based, Dynamic Vocal Tract Models using ArtiSynth. In International Seminar on Speech Production, volume 6, pages 419–426, Ubatuba, Brazil, 2006. Citeseer. → pages 39 [12] S. Fels, R. Pritchard, and A. Lenters. Fortouch: A wearable digital ventriloquized actor. In New Interfaces for Musical Expression (NIME2009), pages 274–275, 2009. → pages 22 [13] S. S. Fels and G. E. Hinton. Glove-Talk: a neural network interface between a data-glove and a speech synthesizer. IEEE transactions on neural networks / a publication of the IEEE Neural Networks Council, 4(1):2–8, Jan. 1993. → pages 21 [14] S. S. Fels and G. E. Hinton. Glove-TalkII-a neural-network interface which maps gestures to parallel formant speech synthesizer controls. IEEE transactions on neural networks / a publication of the IEEE Neural Networks Council, 9(1):205–12, 1998. → pages 21 [15] N. C. for Voice and Speech. Voice production tutorials, Aug. 2011. http://www.ncvs.org/ncvs/tutorials/voiceprod/tutorial/graphing.html. → pages viii, 6 [16] A. Fraietta. Open Sound Control: Constraints and Limitations. Proceedings of the Conference on New Interfaces for Musical Expression (NIME2008), 2008. → pages 34 [17] K. Fukui, K. Nishikawa, and T. Kuwae. Development of a new human-like talking robot for human vocal mimicry. In 2005 IEEE International Conference on Robotics and Automation (ICRA 2005), number April, pages 1437–1442, 2005. → pages 10 60  [18] M. Hoffman. Feature-based synthesis: mapping acoustic and perceptual features onto synthesis parameters. In Proceedings of the International Computer Music Conference, 2006. → pages 19 [19] M. Hoffman and P. R. Cook. Real-time feature-based synthesis for live musical performance. Proceedings of the 7th international conference on New interfaces for musical expression - NIME ’07, page 309, 2007. → pages 19 [20] J. Holmes, I. Mattingly, and J. Shearme. speech synthesis by rule. Language and Speech, 7(3):127, 1964. → pages 14 [21] A. Hunt and R. Kirk. Multiple Media Interfaces for Music Therapy. Multimedia, IEEE, pages 50–58, 2004. → pages viii, 19, 29 [22] A. Hunt, M. Wanderley, and R. Kirk. Towards a model for instrumental mapping in expert musical interaction. In International Computer Music Conference, 2000. → pages 20, 30 [23] K. Ishizaka and J. Flanagan. Synthesis of voiced sounds from a two-mass model of the vocal cords. Bell Systems Technical Journal, 51(6), 1972. → pages 12, 40 [24] B. Kr¨ oger and P. Birkholz. A gesture-based concept for speech movement control in articulatory speech synthesis. Verbal and Nonverbal Communication Behaviours, pages 174–189, 2007. → pages 13 [25] B. Kr¨ oger and P. Birkholz. Articulatory synthesis of speech and singing: State of the art and suggestions for future research. Multimodal Signals: Cognitive and Algorithmic Issues, pages 306–319, 2009. → pages 11 [26] N. Orio and N. Schnell. Input devices for musical expression: borrowing tools from HCI. interfaces for musical expression, 2001. → pages 19, 57 [27] B. Pritchard. Performance What Does A Body Know ? In Proceedings of the 2011 annual conference extended abstracts on Human factors in computing systems, pages 2403–2407, 2011. 
→ pages iii  61  [28] B. Pritchard and S. Fels. GRASSP : Gesturally-Realized Audio , Speech and Song Performance. In Proceedings of the 2006 conference on New Interfaces for Musical Expression (NIME2006), pages 272–276, Paris, France, 2006. → pages 22 [29] A. E. Rosenberg. Effect of glottal pulse shape on the quality of natural vowels. The Journal of the Acoustical Society of America, 49(2):Suppl 2:583+, Feb. 1971. → pages 12 [30] D. Schwarz, G. Beller, and B. Verbrugghe. Real-time corpus-based concatenative synthesis with catart. In Proc. of the Int. Conf. on Digital Audio Effects (DAFx-06), number September, pages 279–282, 2006. → pages 16 [31] B. H. Story, I. R. Titze, and E. a. Hoffman. Vocal tract area functions from magnetic resonance imaging. The Journal of the Acoustical Society of America, 100(1):537–54, July 1996. → pages 42 [32] W. Strange. Evolving theories of vowel perception. The Journal of the Acoustical Society of America, (May 2012), 1989. → pages 42 [33] I. Titze. Principles of voice production. Prentice Hall, 1994. → pages 5 [34] H. Traunmller. History of speech synthesis, 1770 - 1970, Sept. 2000. http://www2.ling.su.se/staff/hartmut/kemplne.htm. → pages viii, 9, 10, 14 [35] O. Vallis, J. Hochenbaum, and A. Kapur. A Shift Towards Iterative and Open-Source Design for Musical Interfaces. In Proceedings of the 2010 conference on New Interfaces (NIME2010), number Nime, 2010. → pages 26 [36] K. van den Doel and U. Ascher. Real-Time Numerical Solution of Webster’s Equation on A Nonuniform Grid. IEEE Transactions on Audio, Speech, and Language Processing, 16(6):1163–1172, Aug. 2008. → pages 40 [37] K. van Den Doel and D. K. Pai. Jass: A java audio synthesis system for programmers. In Proceedings of the 2001 International conference on Auditory Display, 2001. → pages 40 [38] R. Vertegaal and T. Ungvary. Towards a musician’s cockpit: Transducers, feedback and musical function. In International Computer Music Conference (ICMC1996), number criterion 2, 1996. → pages 19 62  [39] F. Vogt, J. Lloyd, S. Buchaillard, P. Perrier, M. Chabanas, Y. Payan, and S. Fels. Efficient 3d finite element modeling of a muscle-activated tongue. Biomedical Simulation, (Figure 1):19–28, 2006. → pages 39 [40] W. R. von Kempelen. Mechanismus der menschlichen Sprache nebst Beschreibungeiner sprechenden Maschine. Mit einer Einleitung von Herbert E. Brekle und Wolfgang Wild. Stuttgart-Bad Cannstatt F. Frommann, 1970. → pages 9 [41] M. Wanderley, J. Viollet, F. Isart, and X. Rodet. On the choice of transducer technologies for specific musical functions. In Proceedings of the 2000 International Computer Music Conference (NIME2000), pages 244–247, 2000. → pages 19 [42] J. Wang, N. d’Alessandro, S. Fels, and R. Pritchard. Investigation of Gesture Controlled Articulatory Vocal Synthesizer using a Bio-Mechanical Mapping Layer. In Proceedings of the 2012 conference on New interfaces for Musical Expression (NIME2012), 2012. → pages iii [43] J. Wang, N. d’Alessandro, B. Pritchard, and S. Fels. SQUEEZY: Extending a Multi-touch Screen with Force Sensing Objects for Controlling Articulatory Synthesis. In Proceedings of the 2011 International Computer Music Conference (NIME2011). → pages iii, 56  63  Appendix A  Consent Forms and Sample Questionnaires  64  THE UNIVERSITY OF BRITISH COLUMBIA Media and Graphics Interdisciplinary Centre FSC 3640 – 2424 Main Mall Vancouver, BC V6T 1Z4 20 September, 2011  Consent Form Gesture-based Articulatory Speech Synthesizer Principal Investigator Dr. 
Sidney Fels, Associate Professor, Department of Electrical and Computer Engineering, University of British Columbia, 604-822-5338 Co-Investigators Johnty Wang, Master’s Candidate, Department of Electrical and Computer Engineering, University of British Columbia, 604-822-9248 This research is to be used as material in a thesis, which is a publically available document. Your identity will remain confidential and the information collection during the study will be used in an anonymous way. Purpose You are being invited to take part in this research study that involves a gesture controlled speech synthesizer that allows you to speak using hand gestures. You will either be asked to use the system to make sounds, or Study Procedures This study will take between 30 minutes and 1 hour. You will be either a “Performer” or a “Listener” As a “Performer”: Get introduced to the speech synthesis system and try it out Practice various words until you can reach a certain level of accuracy Perform a series of words Share your thoughts and provide feedback through a short questionnaire As a “Listener”: Listen to recorded audio samples Write down what you hear Share your thoughts and provide feedback through a short questionnaire  65 Version: 25 April, 2012  Page 1 of 4  Confidentiality Your identity will be kept strictly confidential and will not be known to anybody except the interviewer. In order to assure this confidentiality, any information that may identify you as an individual will not be written on any data collection sheets. Instead, your consent form will be linked to your data collection sheets using an arbitrary number identifier. Furthermore, consent forms and data collection sheets will be stored in two different locked cabinets. Any computerized files will be stored on password protected internal servers at the Media and Graphics Interdisciplinary Center that is not accessible over the Internet. Remuneration/Compensation Each participant will be receive an honorarium in the amount of $10. Contact Information About the Study If you have any questions or require further information about the project you may contact Johnty Wang at johnty@ece.ubc.ca or by phone at 604-822-9248. Contact for Information About the Rights of Research Subjects If you have any concerns about your treatment or rights as a research subject, you may contact the Research Subject Information Line in the UBC Office of Research Services at 604-8228598 or if long distance, email to RSIL@ors.ubc.ca. Consent We intend for your participation in this project to be pleasant and stress-free. Your participation is entirely voluntary and you may refuse to participate or withdraw from the study at any time. Your signature below indicates that you have received a copy of this consent form for your own records. Your signature indicates that you consent to participate in this study.  ______________________________________________ Participant’s Signature  ______________________ Date  ______________________________________________ Participant’s Printed Name  66 Version: 25 April, 2012  Page 2 of 4  Sample Listener Questions Note: This document contains sample segments of the questionnaires to be used in the experiment. The actual forms will be longer/shorter depending on number of trails, but the following document provides all the possible material that may appear in the questionnaires/experiment forms. In order: 1.) 
Listener Identification Form  The played back sample may contain up to syllables, and the listener is asked to circle the ones they hear. 2.) Listener Qualitative Comparison Form Two samples are played back to back and the listener is asked to compare the two based on the provided quality/metric.  1.) Questionnaire: Listening/Comparative:  1.  i e a u  i e a u  i e a u  i e a u  2.  i e a u  i e a u  i e a u  i e a u  3.  i e a u  i e a u  i e a u  i e a u  67 Version: 25 April, 2012  Page 3 of 4  Questionnaire: Listening/Comparative: Which sample sounded more SPEECHLIKE? 1. First Second 2. First Second 3. First Second 4. First Second 5. First Second 6. First Second 7. First Second 8. First Second  Which sample sounded SHARPER 1. 2. 3. 4. 5. 6. 7. 8.  First First First First First First First First  Second Second Second Second Second Second Second Second  Which sample sounded more INTELLIGIBLE? 1. First Second 2. First Second 3. First Second 4. First Second 5. First Second 6. First Second 7. First Second 8. First Second  Which sample sounded more EXCITING? 1. First Second 2. First Second 3. First Second 4. First Second 5. First Second 6. First Second 7. First Second 8. First Second Which sample sounded more NATURAL? 1. First Second 2. First Second 3. First Second 4. First Second 5. First Second 6. First Second 7. First Second 8. First Second  68 Version: 25 April, 2012  Page 4 of 4  
