SPEECH RECOGNITION IN A HARSH ENVIRONMENT

Susan Pullman
B.Sc., Wilfrid Laurier University

A THESIS SUBMITTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF MASTER OF APPLIED SCIENCE in THE FACULTY OF GRADUATE STUDIES, DEPARTMENT OF ELECTRICAL ENGINEERING

We accept this thesis as conforming to the required standard

THE UNIVERSITY OF BRITISH COLUMBIA
December 1988
(c) Susan Pullman, 1988

In presenting this thesis in partial fulfilment of the requirements for an advanced degree at the University of British Columbia, I agree that the Library shall make it freely available for reference and study. I further agree that permission for extensive copying of this thesis for scholarly purposes may be granted by the head of my department or by his or her representatives. It is understood that copying or publication of this thesis for financial gain shall not be allowed without my written permission.

Department of Electrical Engineering
The University of British Columbia
Vancouver, Canada

Abstract

Speech recognition is a rapidly expanding field with many useful applications in man-machine interfacing. One of the main benefits of speech control is the flexibility and ease of use it allows an operator for any number of specific applications. Speech recognition units (SRUs) currently achieve a high level of accuracy for user-dependent, pretrained, isolated word recognition. However, if uncontrollable noise is added to the speech input, recognition degrades rapidly. If the application requires a vast set of control words to be used by many operators, there can also be inconsistencies in recognition. The specific application of this study is the secondary control (i.e., non-critical control) of heavy machinery (in particular a Caterpillar tractor) using an operator-speech interface. The inherent problem of this application is the environmental background noise due to the tractor. It is also important that a robust vocabulary is selected, so that no misrecognition occurs between critical control words. In order to add speech input for control of machines in a harsh environment there are two considerations:

1. The reduction of noise from the input speech signal.

2. The selection of a robust vocabulary dependent upon the specific operator and the specific SRU.

This study investigates many different types of noise reduction filters, including traditional Wiener, Power Spectral Subtraction and Gaussian filters. The results show that the best types of noise reduction filters are adaptive optimization filters which use two input signals, or the Power Spectral Subtraction (PSS) filter. It is possible to reduce the noise to a level within the range of the SRU's capacity for noise. An algorithm for selecting an accurate vocabulary is also proposed. This algorithm determines weaknesses for the specific SRU, vocabulary and speaker, and selects the control words around those weaknesses. Testing of this algorithm showed that it was possible to achieve close to 98% recognition and 0% misrecognition.
Table of Contents

Abstract
List of Tables
List of Figures
Acknowledgements

1 Introduction

2 Robust Word Selection Package
  2.1 Control By Speech
  2.2 Algorithm For Selecting A Robust Vocabulary
  2.3 Distance Measurements
  2.4 Testing The Algorithm
    2.4.1 Results

3 End Point Detector
  3.1 Results

4 Noise Reduction Filters
  4.1 Characteristics of the Signals
    4.1.1 Speech
    4.1.2 Characteristics of Noise
    4.1.3 Categories of Filters Investigated
  4.2 Noise Reduction Filters Based on PSE of the Noise
    4.2.1 Power Spectral Subtraction
    4.2.2 Wiener Noise Filter
    4.2.3 Implementation and Testing
  4.3 Adaptive Noise Cancelling Filters
    4.3.1 LMS Adaptive Filters
    4.3.2 Recursive Least Squares Adaptive Noise Reduction Filter
  4.4 Enhancement Filters Based On A Simple Model of Speech
    4.4.1 Linear Predictor Corrector (LPC) Method
    4.4.2 Gaussian Smoothing Filtering
    4.4.3 Implementation
  4.5 Data Acquisition and Testing of Filters
    4.5.1 Correlation Between Microphones

5 Testing of Noise Filters
  5.1 Evaluation
  5.2 Data Set
  5.3 Improving Recognition By Manipulation of VTR6050
  5.4 Filter Test Results

6 Discussion
  6.1 Vocabulary Selection
  6.2 End Point Detector
  6.3 Filters
    6.3.1 Analysis of the Estimate of the Power Spectrum
    6.3.2 Wiener Filter
    6.3.3 Power Spectral Subtraction
    6.3.4 Gaussian Smoothing
    6.3.5 Linear Predictor Corrector
    6.3.6 Adaptive Filtering
    6.3.7 LMS
    6.3.8 RLS
  6.4 Comparison of Filters

7 Conclusions and Recommendations

References

A Previous Work in Speech Recognition at UBC
  A.1 Recognition Results for the VOTAN VTR6050
    A.1.1 Confusion Matrices for VTR6050 for All Subjects
  A.2 Results for NEC SR100
    A.2.1 Combined Confusion Matrices for All Subjects for SR100

B Operation of the VOTAN VTR6050
  B.1 Additional Features
  B.2 Memory Space
  B.3 Configuration for the Operation of VTR6050

C Results for Vocabulary Selection
  C.1 Vocabulary Selection Results - No Distances Set
  C.2 Vocabulary Selection Results - Distance Measurements Used

D Results of Filtering Tests
  D.1 Results For Noise Mode 1
    D.1.1 Results When Trained With One Template Only
    D.1.2 Results When Trained With Two Templates
  D.2 Results For Noise Mode 2
    D.2.1 Results When Trained With One Template Only
    D.2.2 Results When Trained With Two Templates

List of Tables

2.1 Confusion Matrix
2.2 Distance Matrix
2.3 Results of Vocabulary Selection
4.4 Algorithm To Implement the LMS Filter
4.5 Algorithm To Implement the RLS Filter
5.6 Recognition When Noise Is Added At Different Gains
5.7 Results of Filtering With Mode 1 Noise
5.8 Results of Filtering With Mode 2 Noise
5.9 SNR Results for Filtering Methods
6.10 Test 1: Results of Selection When No Distance Score Is Used
6.11 Test 2: Results of Selection When Minimum Distance Score Is Used
6.12 Comparison Between Filters
A.13 Vowels and Diphthong Phonemes
A.14 Semi-Vowel and Consonant Phonemes
A.15 Statistics for Test 1 - Vowels Said Normally
A.16 Statistics for Test 2 - Vowels With Mike Moved
A.17 Statistics for Test 3 - Vowels Said Slowly
A.18 Statistics for Test 4 - Vowels Said Quickly
A.19 Statistics for Test 5 - Vowels with Interrogative Intonation
A.20 Statistics for Test 6 - Consonants (Group 1)
A.21 Statistics for Test 7 - Consonants (Group 2)
A.22 Confusion Matrix for Test 1 - Vowels Said Normally
A.23 Confusion Matrix for Test 2 - Vowels With Mike Moved
A.24 Confusion Matrix for Test 3 - Vowels Said Slowly
A.25 Confusion Matrix for Test 4 - Vowels Said Quickly
A.26 Confusion Matrix for Test 5 - Vowels With Interrogative Intonation
A.27 Confusion Matrix for Test 6 - Consonants (Group 1)
A.28 Confusion Matrix for Test 7 - Consonants (Group 2)
A.29 Statistics for Test 1 - Vowels Said Normally
A.30 Statistics for Test 2 - Vowels With Mike Moved
A.31 Statistics for Test 3 - Vowels Said Slowly
A.32 Statistics for Test 4 - Vowels Said Quickly
A.33 Statistics for Test 5 - Vowels with Interrogative Intonation
A.34 Statistics for Test 6 - Consonants (Group 1)
A.35 Statistics for Test 7 - Consonants (Group 2)
A.36 Confusion Matrix for Test 1 - Vowels Said Normally
A.37 Confusion Matrix for Test 2 - Vowels With Mike Moved
A.38 Confusion Matrix for Test 3 - Vowels Said Slowly
A.39 Confusion Matrix for Test 4 - Vowels Said Quickly
A.40 Confusion Matrix for Test 5 - Vowels With Interrogative Intonation
A.41 Confusion Matrix for Test 6 - Consonants (Group 1)
A.42 Confusion Matrix for Test 7 - Consonants (Group 2)

List of Figures

1.1 Caterpillar Excavator
2.2 Control Words for the Caterpillar Tractor
2.3 Expanded Set of Words for the Caterpillar Tractor
3.4 Absolute Value Score Threshold with Z-score of 2.0
3.5 AVS Separation of Speech from Background Noise
3.6 ZC Separation of Noise and Speech
3.7 End Of Word Found From AVS Threshold
3.8 End Of Word Found From ZC Threshold
4.9 Ensemble PS of Noise 1
4.10 Ensemble PS of Noise 2
4.11 Short Time PS of Noise 1
4.12 Filters Based on PSE of Noise
4.13 PSS Filter
4.14 Wiener Filter Frequency Weights
4.15 Wiener Filter
4.16 Filter With PSE #2 of Noise
4.17 Filter With PSE #3 of Noise
4.18 General Form of an Adaptive Filter
4.19 Adaptive Linear Combiner
4.20 Gaussian Filter Impulse Response
4.21 Signal filtered with a Gaussian of Size 4
4.22 Signal filtered with a Gaussian of Size 8
4.23 Signal filtered with a Gaussian of Size 16
4.24 Data Acquisition Hardware for Two Microphones
4.25 Typical Frequency Response of Both Microphones
4.26 Digitized Microphone Signals
4.27 Canonic Form of Homomorphic System
4.28 D_* System
4.29 Wrapped and Unwrapped Phase
4.30 Cepstrum of Mike 2
4.31 Linear filter to remove reverberation
4.32 Inverse System D_*^{-1}
B.33 Configuration for the Operation of VTR6050

Acknowledgements

I would like to thank my supervisor, Dr. Ito, for his support and guidance. I would also like to thank Rial Frenette for his help with the data acquisition and Henry Law for his assistance with the testing stage of the research. As well, I would like to express my gratitude to the volunteers who tested the vocabulary selection and filtering methods.
Lastly, I would like to thank my parents for their constant encouragement and support.

Chapter 1

Introduction

The robotics group at the University of British Columbia (UBC) is currently working on a project to automate an excavator. A diagram of the excavator used in this project is shown in Figure 1.1. The purpose of this project is to improve the flexibility, safety, and ease of operation of the excavator. It is desirable to have speech as one of the man-machine interfaces, since speech allows another degree of freedom for the operator, in that his/her hands are free to perform other tasks.

Commercially available Speech Recognition Units (SRUs) can be very accurate in "ideal" conditions, that is, noise-free environments with a small pretrained vocabulary for a specific user. However, for control of an excavator, speech recognition must be accurate in a "harsh", "critical" environment. Harsh here means that there is a lot of environmental background noise and many users; "critical" means that one misrecognized word by an SRU could have harmful or undesirable effects.

Figure 1.1: Caterpillar Excavator

In order for speech control to be feasible in an excavator environment, a front-end processor is necessary. Two front-end processors for the SRU were investigated. The first was a noise reduction filter; the second, a robust vocabulary selection process for the specific SRU and speaker. The noise reduction filter was required to ensure that the background noise didn't affect the operation of the SRU. The vocabulary selection process was needed to reduce the possibility of misidentifying words. The two processors together would ideally obtain 100% word recognition.

The research outlined in this thesis produced a set of rules for selection of a robust vocabulary (Chapter 2). The method investigated determined a "best" subset of words, given a number of words each with multiple synonyms, and used a heuristic search to determine this subset. The second stage of this research investigated different types of noise reduction filters (Chapter 5), including Wiener, Power Spectral Subtraction (PSS), enhancement filters and adaptive filters. The results suggest that Adaptive Noise Cancelling (ANC) filters (much like Widrow's) or the Power Spectral Subtraction method obtain the best, most consistent results when dealing with a non-stationary broadband noise signal like the one associated with the operation of an excavator.

Given the constraints of the problem - namely that the signal to be removed is a broadband non-stationary signal, and the signal we wish to keep intact, speech, is a complicated non-periodic signal - a method to accomplish this task had to be found. Methods of reducing noise from speech which have been previously investigated can be categorized into the following three groups:

1. Algorithms based on the short-time spectral amplitude estimate of the signal, requiring prior knowledge of the noise signal. Methods falling into this category include Power Spectral Subtraction filtering [6] and Wiener filtering [9].

2. Algorithms that exploit the periodicity of voiced speech. This category includes methods such as two-dimensional spectral smoothing and spectrum amplitude transformation [3], comb filtering [15] and adaptive noise cancellers [35][36].
3. Algorithms based on a source-system model of speech production, which use speech analysis followed by speech synthesis to enhance the speech. Methods falling into this category are a multi-pulse excited linear prediction system [22] and Linear Predictor Coefficient methods [26].

Other work done in the area of noise reduction from speech includes noise-immune systems which use multiple sensors, such as noise-cancelling pressure-gradient microphones and accelerometers which measure vibrations [32]. There has also been work done specifically in the area of speech recognition units for a noisy environment [18]. Since the SRU in that work bases its calculations on the assumption that there is some noise (i.e., it directly manipulates distance measurements or Linear Predictor Coefficients to compensate for noise), the compensation is inherent to the recognition process and therefore is not feasible as a pre-processor algorithm. Neither of these two approaches was investigated, due to the constraints of our solution: the noise-reduced signal is sent to the SRU, which operates as a black box, so the signal must be "cleaned up" before recognition is attempted, since the SRU parameters cannot be changed.

Since most of the filtering methods assume that the noise is of some known form (specifically white Gaussian), it was decided to look at methods from each of the three categories mentioned above, but with variations in order to compensate for the changing nature of the noise. From the first class, Wiener filters and Power Spectral Subtraction methods were investigated. From the second class, two types of adaptive filters were examined, namely Least Mean Squares (LMS) and Recursive Least Squares (RLS). The filters analyzed from the third category were a Linear Predictor Corrector (LPC) method and a Gaussian filter.

Little past work has been done in the area of vocabulary selection. The research in this area seems to be restricted to the classification or selection of a particular word from a very large vocabulary [17]. Previous work is more concerned with preclassification (based on LPC, energy, spectral distances, etc.) and fast searching routines.

Chapter 2

Robust Word Selection Package

Currently, commercially available SRUs have not reached the stage at which they are 100% accurate. When selecting a vocabulary for a specific application it is helpful to know where the SRU's weaknesses lie; specifically, a method or set of rules for selecting an appropriate vocabulary would be convenient. The question arises as to what level of speech should be used to form these rules. The possible choices are phonetic components, word or phrase components, or sentences.

Phonemes are the basic building blocks of any language. Within the English language there are approximately 40 phonemes, or linguistic sounds. For a detailed explanation of human speech production and the characteristics of the speech signal, see [26]. One problem with basing a word selection process upon phonemes is that when they are placed together to form words, their structure tends to change slightly, depending upon the combination of phonemes. Coarticulation and slurring are prime examples of changing phoneme sounds. An additional problem is that people tend to pronounce sounds differently; what might be an easily recognizable phoneme said by one speaker may be constantly misrecognized when uttered by another user. This could be due to accents or any kind of speech impediment of the speaker.
Another problem depends upon what type of processing the SRU does on the speech; what one SRU has no difficulty with could be a major problem for another SRU. Similarly, words that do not sound at all alike to human perception may sound very similar to an SRU.

A phoneme-based set of words for testing purposes (like the one designed by Hans Wasmeier [33]) can be a good indication of the accuracy of any particular SRU, in that it rigorously tests all possible sounds the unit might be exposed to. However, in our case it proved to be a poor test when used for selection of appropriate application words, due to problems arising when phonemes are joined to form words. Appendix A contains the results of testing two different speech recognition units, the NEC SR100 and the VOTAN VTR6050, with a phoneme-based vocabulary. As seen from the results, there was little consistency in the misrecognition of phoneme pairs across users, SRUs or test modes. The only pair of phonemes that was consistently confused, for each user and both SRUs, is the IY-I pair.

Since different words are not made up of completely different phonemes, rules based upon the location of the same sound in different words would have to be devised. Other factors that have an effect on recognition include duration, pitch and volume. Even if it were possible to come up with a vast set of problem phoneme pairs and rules, it would be difficult to select an application vocabulary given these variables.

Words, or short phrases, are the natural level for man-machine communication. Human factors tests [39] have shown that people tend to be more comfortable using "buzz" or computer-type words when interacting with a machine, as opposed to a natural language. The algorithm developed in this thesis for selecting a vocabulary is based upon words or short phrases. Sentences or longer phrases were avoided, since it tends to be hard for an operator to pronounce these consistently.

2.1 Control By Speech

To use speech as an operator interface for control of any application it is necessary to know the types of commands and options that will be needed. The method proposed in this thesis requires that the set of possible control words be organized into a suitable tree structure. This tree structure can be defined as having three different components: levels, branches, and choices. Figure 2.2 shows the preliminary set of control words for the Caterpillar tractor project.

The first level contains all of the basic tasks required for the specific application. Each subsequent level contains options to the connected task at the previous level; an example of a level is PATH and LOCATION, at level 2. A branch is defined as the path of connected options for a specific task. Progressing along a branch is equivalent to adding to the details of the command; an example of a branch would be MEMORIZE - PATH - START. Each new level in the branch allows the operator to specify more exactly the operation to be performed. A choice is any one of the options at a specific level. At level one the possible choices are MEMORIZE, GOTO, TELL POSITION, SPEED, CAMERA, and MODE.

There are two basic stages to implementing speech control. The first stage is to set up a control structure like the one shown in Figure 2.2.
When initializing the tree structure of commands, the number of choices per level is unconstrained and the number of levels per branch is also variable. This tree structure allows a great deal of flexibility, since more branches or more choices at each level can be added. The second stage of implementing the speech control is to use the control structure to manipulate the robot. At this stage each level constrains the possible choices allowed the user, and the number of levels (or details of the command) is also constrained by the predefined control structure of stage one.

The manipulation of the robot using the SRU proceeds as follows. Upon initialization, all choices at level 1 are loaded as templates into the SRU's voice card memory. The operator says the task he/she would like performed. The SRU recognizes the word, swaps the next level of control words (those words that are connected to the word recognized at the first level) into the voice card memory, and waits for the operator to enunciate one of the choices at the second level. This process continues until the last level in a branch is reached, at which point control is passed back to the first level.

The following provides an explanation of the control structure used in the excavator project. Upon initialization, the operator can tell the excavator to MEMORIZE the motion or location of the bucket; to GOTO a previously memorized location or motion of the bucket; to change the SPEED of response of the joystick or of convergence of the end point of the actuator (i.e., the bucket or grapple); to move the CAMERA; or to change the MODE of operation of the excavator to either position or velocity.

An example of a path of control that could occur with this structure is as follows. If the operator's first speech command is MEMORIZE, then the SRU recognizes this word, proceeds along the branch to the next level (namely PATH and LOCATION) and waits for the operator to say one of these words. If LOCATION is selected, the task is performed and control is passed back to level one. If PATH is selected, the SRU waits for the operator to say START or STOP before performing the corresponding task and passing control back to the first level. Once a PATH or LOCATION is memorized, the operator can command the excavator to GOTO the memorized PATH or LOCATION. (A sketch of one possible in-memory representation of this tree follows Figure 2.2.)

Level 1          Level 2               Level 3
Memorize         Path                  Start, Stop
                 Location
Goto             Memorized Location
                 Memorized Path
Tell Position    Bucket
                 Stick
                 Boom
Speed            Path                  Quicker, Slower, Minimum, Medium, Most
                 Arm                   Quicker, Slower, Minimum, Medium, Most
Camera           Up, Down, Left, Right, On, Off
Mode             Speed, Position

Figure 2.2: Control Words for the Caterpillar Tractor
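Section 2.4 notes that this tree was implemented with linked-list structures. A minimal C sketch of one such representation is given below; the field and function names are illustrative, not taken from the thesis code.

    #include <stdlib.h>
    #include <string.h>

    /* One node of the command tree: a choice at some level.  "sibling"
       chains the choices available at the same level, "child" points to
       the options at the next level of this branch, and "synonym" chains
       the alternative words for the same control option (Section 2.2). */
    typedef struct node {
        char         word[32];   /* word or short phrase for the SRU */
        struct node *synonym;
        struct node *sibling;
        struct node *child;
    } Node;

    static Node *make_node(const char *word)
    {
        Node *n = calloc(1, sizeof *n);
        if (n) strncpy(n->word, word, sizeof n->word - 1);
        return n;
    }

    /* Build the MEMORIZE branch of Figure 2.2:
       MEMORIZE -> { PATH -> { START, STOP }, LOCATION }. */
    static Node *build_memorize_branch(void)
    {
        Node *memorize = make_node("memorize");
        Node *path     = make_node("path");

        memorize->child      = path;
        path->sibling        = make_node("location");
        path->child          = make_node("start");
        path->child->sibling = make_node("stop");
        return memorize;
    }

With this layout, the templates loaded into the SRU's voice card at any moment are simply the sibling chain (and synonyms) of the current node's children; recognizing a word moves the current pointer to the matching child, and reaching a node with no children passes control back to level one.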
2.2 Algorithm For Selecting A Robust Vocabulary

Given an initial set of control words, like the one defined in the previous section, it is essential that accurate recognition of this set is obtained by the SRU. The first step to ensuring a robust vocabulary is to select a number of synonyms for each word in the control structure. As can be seen in Figure 2.3, three synonyms were chosen for each control word for the excavator. The synonyms can be words or phrases, but it is best to keep the number of words per phrase small (less than three).

Memorize (Remember, Record)
Goto (Repeat, Send Bucket To)
  Memorized Location (Point, Position)
  Memorized Path (Course, Track)
Tell Position (Determine Position, Give Position)
  Bucket (Endpoint, Grapple)
  Stick (Arm, Link 1)
  Boom (Shoulder, Link 2)
Speed
  Path (Handcontroller, Joystick)
    Quicker (Faster, Increase)
    Slower (Retard, Decrease)
    Minimum (Slowest, Least)
    Medium (Average, Intermediate)
    Most (Maximum, Fastest)
  Arm
    Quicker (Faster, Increase)
    Slower (Retard, Decrease)
    Minimum (Slowest, Least)
    Medium (Average, Intermediate)
    Most (Maximum, Fastest)
Camera (Optical Device, Photograph)
  Up (Elevate, Higher)
  Down (Descend, Lower)
  Left (West, Leftward)
  Right (East, Right side)
  On (Lock onto, Lock)
  Off (Unlock, Take off)
Mode (State, Condition)
  Speed (Velocity, Rate)
  Position (Point, Location)

Figure 2.3: Expanded Set of Words for the Caterpillar Tractor (synonyms in parentheses; indentation shows levels)

To select a robust vocabulary from the expanded set of control words, the process follows a heuristic approach. A software package first trains and then tests all the choices at any one particular level, for each possible user, with the designated SRU. The purpose of the training and testing process is to determine any words that would be a high risk (i.e., a high rate of misidentification, a low recognition rate or a high rejection rate). The end result should be a control structure with only one word (the optimum word) for each control option.

The algorithm for selecting the optimum set of control words proceeds as follows. A confusion matrix is initialized based upon the results of the user saying each of the words at the current level in the control structure five times. For example, if the application requires only four words (0-3) at the current level and it is possible to produce two synonyms (A and B) for each control word, then a potential confusion matrix could be:

Said    Recognized as:
        0A   0B   1A   1B   2A   2B   3A   3B   Reject
0A     0.7  0.0  0.0  0.0  0.1  0.0  0.0  0.1  0.1
0B     0.0  0.4  0.0  0.0  0.2  0.0  0.0  0.0  0.4
1A     0.0  0.0  0.9  0.1  0.0  0.0  0.0  0.0  0.0
1B     0.0  0.2  0.0  0.8  0.0  0.0  0.0  0.0  0.0
2A     0.0  0.0  0.1  0.0  0.9  0.0  0.0  0.0  0.0
2B     0.0  0.0  0.0  0.0  0.0  0.8  0.0  0.1  0.1
3A     0.0  0.1  0.0  0.0  0.1  0.0  0.8  0.0  0.0
3B     0.1  0.0  0.0  0.0  0.0  0.0  0.0  0.9  0.0

Table 2.1: Confusion Matrix

Table 2.1 is read as follows: the word 0A was recognized as 0A 70% of the time, as 2A 10% of the time, as 3B 10% of the time, and was rejected (not recognized as anything) 10% of the time.

The strategy for selecting the optimum subset of control words has two stages. The first stage determines all possible subsets of words with no conflicts. Conflicts could be: misidentification between words; a high rejection rate for a specific word; or a low recognition rate. The second stage is to select the optimum subset from the subsets determined at stage one, based upon a weighting factor. The selection strategy is illustrated below using the previous confusion matrix as an example.

To determine possible subsets at the current level, start at word 0, alternative A, and divide the words into branches. Add words to the same branch only if there is no misrecognition with another word already in that branch. For example, starting with word 0A, select a choice for word one that is not misrecognized with 0A, which in this case is 1A.
To determine if there is any misrecognition between these two words, look in column 1, row 3 and row 1, column 3 to ensure that the percentage is zero. Continue on to the next word to find an appropriate alternative. A suitable choice for word 2 would be any word that is not misrecognized as word 0A or word 1A. The first branch is as follows:

0A - 1A - 2B

Misrecognizing word 0A as word 2A is just as significant as misrecognizing word 2A as word 0A; therefore these two words should never be placed in the same solution set. To make implementation of the word selection package easier, the confusion matrix can be reduced to half its size by adding any percentage value in row X, column Y to row Y, column X, where Y > X. A lower triangular matrix is created, making it unnecessary to look at the upper triangle of the matrix to determine conflict (misrecognized) pairs. For example, to ensure that 0A is never confused with 1A it is now only necessary to make sure that column 1, row 3 is zero. The rejection column must be saved, since the matrix has no corresponding rejection row.

If at any point in the selection process there is no appropriate alternative for a word, then the branch being formed is terminated, and no possible subset of the vocabulary has been found along it. (In the given example, 0A - 1B - 2A - / (no match for word three) is such a case.)

The rejection rate is an important factor. If a word is constantly rejected (greater than a 25% rejection rate), it is a good indication that the word would be a risk to use in the vocabulary. Similarly, if the recognition rate of a word is low (less than 50%), even if there are no conflicts with other words in the branch, it should be assumed that this is not a reliable word. In the example given, 0B is such a word (40% rejection and 40% recognition), and hence no branch is formed using this word.

No significance is given to two synonyms of the same control word being misrecognized (or confused) by the SRU. Obviously, two synonyms of the same word will never be used together, and even if they were, they have the same meaning, so no harm would come of misrecognition.

In the example given, the only possible solutions are:

0A - 1A - 2B - 3A
0A - 1B - 2B - 3A

To determine which of these two solutions is the optimum, weights must be assigned to each, with a weight of 1.0 being the maximum obtainable. The weight is based upon the average recognition rate of the words in the vocabulary subset. For example, the rating for the solution subset 0A - 1A - 2B - 3A would be 0.8 ((0.7 + 0.9 + 0.8 + 0.8)/4), while the subset 0A - 1B - 2B - 3A would have a rating of 0.775. Therefore the optimum subset to select for the application vocabulary in this example would be 0A - 1A - 2B - 3A, since it has the maximum weight value of the two.
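The two-stage search described above is straightforward to implement. The sketch below, in C (the language the selection package was written in, per Section 2.4), forms branches over the folded lower-triangular confusion matrix and keeps the subset with the highest average recognition rate. The data layout, names and in-code thresholds are illustrative, not the thesis code.

    #define NWORDS 4                  /* control words 0..3            */
    #define NSYN   2                  /* synonyms A,B per control word */
    #define NC     (NWORDS * NSYN)    /* total choices                 */

    /* conf[i][j] (j <= i) is the folded misrecognition rate between
       choices i and j (upper triangle added onto the lower); conf[i][i]
       is the recognition rate and rej[i] the rejection rate, both
       filled in from the five-repetition test. */
    static double conf[NC][NC];
    static double rej[NC];

    static double best_weight = 0.0;
    static int    best[NWORDS], pick[NWORDS];

    /* Two choices conflict if either was ever recognized as the other.
       Synonyms of the same control word are exempt. */
    static int conflict(int a, int b)
    {
        if (a / NSYN == b / NSYN) return 0;
        return (a > b ? conf[a][b] : conf[b][a]) > 0.0;
    }

    /* Depth-first branch formation: pick one reliable, conflict-free
       synonym for each control word in turn; a branch with no viable
       choice for some word simply terminates. */
    static void search(int word, double sum)
    {
        if (word == NWORDS) {
            double w = sum / NWORDS;        /* average recognition rate */
            if (w > best_weight) {
                best_weight = w;
                for (int i = 0; i < NWORDS; i++) best[i] = pick[i];
            }
            return;
        }
        for (int s = 0; s < NSYN; s++) {
            int c = word * NSYN + s;
            if (conf[c][c] < 0.50 || rej[c] > 0.25)   /* unreliable word */
                continue;
            int ok = 1;
            for (int i = 0; i < word && ok; i++)
                ok = !conflict(c, pick[i]);
            if (!ok) continue;
            pick[word] = c;
            search(word + 1, sum + conf[c][c]);
        }
    }

Calling search(0, 0.0) after the matrix is filled leaves the optimum subset in best[]. Ties at the same weight could instead be collected and offered to the user, as the testing program of Section 2.4 does.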
2.3 Distance Measurements

When an SRU is used as a speech control interface for any type of application, whether it is the control of an excavator or placing orders over the telephone, it is critical that no misrecognition occur. It is important to reject words that do not match well to any of the templates in memory; in this way the user, or operator, can repeat the word until it is certain that a good match has been obtained. Most speech recognition units calculate a distance score for each spoken word based on how well the spoken word matches the templates in memory. The SRU used for testing the vocabulary selection process (the VOTAN VTR6050) has such a score, which can be used to further decrease the rate of misrecognition of the selected vocabulary.

To approach 0% misrecognition between words in the vocabulary, the distance score was recorded as the testing was being done. For each recognized word the minimum distance score was recorded and stored in a separate confusion-type matrix. If there was no confusion between a pair, the value of the distance was set to a default distance score of 50. For each of the possible solution subsets, a minimum distance score is determined, based upon the minimum distance score of all the words in the subset. It is then possible, upon installation of the optimum subset, to reject any word with a distance score greater than this minimum, thereby decreasing misrecognition. Generally, the minimum distance score occurs for the word said (i.e., word recognized = word said), thereby setting up a very rigorous test for recognition of the selected vocabulary: each word the operator says must match as well as the best match in the initial tests or else it will be rejected.

       0A   0B   1A   1B   2A   2B   3A   3B
0A     25   50   50   50   32   50   50   28
0B     50   27   50   50   30   50   50   50
1A     50   50   23   34   50   50   50   50
1B     50   36   50   31   50   50   50   50
2A     50   50   27   50   26   50   50   50
2B     50   50   50   50   50   27   50   30
3A     50   28   50   50   32   50   25   50
3B     30   50   50   50   50   50   50   24

Table 2.2: Distance Matrix

Table 2.2 shows the distance matrix corresponding to the confusion matrix of the example in Section 2.2. From this example it is easy to see that the minimum distance score for the optimum subset 0A - 1A - 2B - 3A should be set to 23 upon installation of this vocabulary.
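In code, the installed threshold and the run-time rejection test are one comparison each. The C sketch below uses illustrative names (the VTR6050 command interface itself is not shown) and assumes dist[][] holds the recorded scores, with 50 as the no-confusion default.

    #define DEFAULT_DIST 50   /* VTR6050 default distance score */

    /* Minimum distance score over all pairs in the selected subset. */
    int subset_min_distance(const int *subset, int n, const int dist[][8])
    {
        int min = DEFAULT_DIST;
        for (int i = 0; i < n; i++)
            for (int j = 0; j < n; j++)
                if (dist[subset[i]][subset[j]] < min)
                    min = dist[subset[i]][subset[j]];
        return min;
    }

    /* At run time a recognized word is kept only if its score is at
       least as good (small) as the best match seen during testing. */
    int accept_word(int score, int min_dist)
    {
        return score <= min_dist;
    }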
2.4 Testing The Algorithm

To test the word selection algorithm, the VOTAN VTR6050 speech recognition device was used. The preliminary set of words, along with their alternatives, used for the application vocabulary was shown in Figure 2.3. The testing of the word selection algorithm was performed with nine subjects providing voice input directly to the SRU on cue. The VOTAN was controlled through an RS232 port connected to the host computer, the HP9050. The programs to control the VTR6050 and implement the word selection algorithm were written in the C language. Programs were written which perform the following functions:

1. Allow the user to set up a new tree structure of control words, with an unlimited number of branches of control words, or use the previously defined set. It was possible to set up this tree with the use of linked-list structures.

2. Train the VTR6050 for each level of the control structure. The training is equivalent to creating reference templates for use in recognition.

3. Test each level and branch of the tree separately.

4. Select the optimum subset of words for each level separately, based on the previously defined algorithm.

5. Retest the selected optimum subsets separately to confirm an increase in recognition and decrease in misrecognition.

The program that performed these tasks required little user input once the appropriate control structure was initialized (which could be done ahead of time). The program first prompts the subject to train the SRU with each word and each of its synonyms at the current level. The SRU was trained with only one template per word. The next step was to test the current set of words. The program prompted the subject five times for each word in order to obtain the confusion matrix. The algorithm then determined possible subsets and printed them on the screen, sorted in order of decreasing weight value. Sometimes there were many subsets with weight 1.0; in this case the user was allowed to select which of these subsets he/she would prefer. Finally, the selected optimum subset was verified, using only the words in the selected subset as possible reference templates for recognition and testing each of these words five more times. This training - testing - selection - retesting procedure was repeated for each level in the tree of control words.

Each subject completed this process on two different occasions. Due to the availability of some of the subjects, four new users had to be added for the second test, but five subjects completed both testing sessions. At the first testing session a word was rejected, in both the initial testing and the retesting of the optimum subset, if its distance score was greater than the default value for the VTR6050, which is 50. For the second session, a word was rejected in the retesting stage if its distance score was greater than the minimum distance score of the optimum subset.

2.4.1 Results

The first time the tests were performed the distance score was always set at its default value of 50. The results show an average increase in recognition of 5.2%, from 91.5% to 96.7%, over all branches and all users. There was a decrease of 2.21% in misrecognition, from 3.00% to 0.79%, and a decrease of 3.07% in rejection, from 5.56% to 2.49%. As can be seen, the VTR6050 gives very good results even without a selection process (91.5% recognition), and the word selection package still achieves a significant improvement.

On the second testing of the vocabulary selection process, the minimum distance calculation was added in an attempt to drive the misrecognition to 0%. The results show a recognition gain of 2.39%, from 95.08% to 97.47%, with misrecognition decreased to 0.25% from 1.52%, and rejection decreased by 1.16%, from 3.43% to 2.27%.

                 % Recognized      % Misrecognized     % Rejected
Distance Score   Before   After    Before   After      Before   After
Default          91.5     96.7     3.00     0.79       5.56     2.49
Minimum          95.1     97.5     1.52     0.25       3.43     2.27

Table 2.3: Results of Vocabulary Selection ("Before" and "After" refer to the selection process)

Chapter 3

End Point Detector

The end point detector (EPD) is an algorithm which determines where a word starts and stops, using a cumulative absolute value threshold and a zero crossing rate threshold. If there were no noise this task would be easy; however, the task becomes harder as the signal-to-noise ratio decreases. A possible use of this algorithm is to chop out the surrounding background noise, leaving only the word; the resulting signal could be sent to the SRU for recognition. This application would only be useful if the SNR was large enough that little distortion of the speech occurred. The EPD's real value is that it allows an estimate of the background noise to be made for filtering purposes, as will be discussed in the next chapter.

The EPD works under the assumptions that voiced speech sounds (or vowels) and plosives have a significantly higher absolute value content than noise, and that fricative or unvoiced sounds have significantly lower zero crossing rates than the background noise.
The short-time absolute value score is defined as:

    AVS_n = \sum_{m=n-N+1}^{n} |x(m)|    (3.1)

This can also be written as a windowed value, to emphasize that only part of the absolute value of the signal is examined at a time:

    AVS_n = \sum_{m=-\infty}^{\infty} |x(m)\, w(n-m)|    (3.2)

where

    w(n) = \begin{cases} 1 & 0 \le n \le N-1 \\ 0 & \text{otherwise} \end{cases}

An absolute value calculation was used instead of an energy calculation so that the results would not be as greatly affected by a short spike of high energy.

The short-time zero-crossing score is defined as:

    ZC_n = \sum_{m=-\infty}^{\infty} |\,\mathrm{sgn}[x(m)] - \mathrm{sgn}[x(m-1)]\,|\; w(n-m)    (3.3)

where

    \mathrm{sgn}[x(m)] = \begin{cases} 1 & x(m) \ge 0 \\ -1 & x(m) < 0 \end{cases}

Within the first 100 ms of the signal the EPD determines the statistics of the absolute value score and the zero-crossing rate of the background noise; it is assumed that there is no speech in this interval. Within this time interval it is possible to calculate 2000 overlapped short-time absolute value scores, each of 1 ms duration. Similarly, 2000 zero crossing rates can be determined.

The absolute value score threshold is based upon the mean and standard deviation determined over the first 100 ms, and is given as:

    AVS_{Thres} = \overline{AVS_n} + 2.0\, \sigma_{AVS_n}    (3.4)

It was assumed that the short-time absolute value score has a Gaussian distribution, so the population z-score is given as:

    z = \frac{AVS_n - \overline{AVS_n}}{\sigma_{AVS_n}}    (3.5)

The z-score was chosen to be 2.0, which corresponds to a percentage of 95.44%. The significance of equation (3.4) is that the absolute value score threshold has been set at the 95.44% point on the Gaussian curve of the absolute value score distribution of the background noise.

Figure 3.4: Absolute Value Score Threshold with Z-score of 2.0

Since the absolute value score in the speech signal should be significantly higher than the absolute value score of the noise, the signal has crossed this bound when the 4.56% (z-score = -2.0) point on the absolute value score distribution curve of the signal passes the 95.44% mark on the absolute value score distribution of the background noise.

Figure 3.5: AVS Separation of Speech from Background Noise

By using absolute value score distributions (instead of a straight absolute value threshold) it can be assured that no single spike of background noise will trigger the threshold level. Once the statistics of the background noise have been determined, the absolute value score distribution of the signal is averaged over 10 ms intervals, with short-time absolute value score calculations done every 1 ms; 200 samples of the short-time absolute value score go into the estimate of the statistics.

If the word spoken starts with a voiced sound, the absolute value score threshold alone will determine the start of the word. However, if the word starts with a fricative or unvoiced sound, absolute value score calculations alone are not enough, and zero crossing calculations need to be considered. If the word begins with a plosive sound, a spike of high energy will occur, and the threshold will be triggered at this point if the plosive is followed by a vowel sound. In the English language it is usually the case that a plosive is followed by a vowel sound; therefore the plosive will be detected as the start of the word.
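Equations (3.1) and (3.3) translate directly into code. A small C sketch follows, assuming 16-bit samples; at the 20 kHz sampling rate used for the recordings in Chapter 4, N = 20 gives the 1 ms windows described above (the caller must ensure n >= N so the window stays inside the buffer).

    #include <math.h>

    /* Short-time absolute value score, equation (3.1): the sum of
       |x(m)| over the N samples ending at n. */
    double short_time_avs(const short *x, int n, int N)
    {
        double sum = 0.0;
        for (int m = n - N + 1; m <= n; m++)
            sum += fabs((double)x[m]);
        return sum;
    }

    /* Short-time zero-crossing score, equation (3.3): each sign change
       contributes |sgn[x(m)] - sgn[x(m-1)]| = 2 to the sum. */
    int short_time_zc(const short *x, int n, int N)
    {
        int sum = 0;
        for (int m = n - N + 1; m <= n; m++)
            if ((x[m] >= 0) != (x[m - 1] >= 0))
                sum += 2;
        return sum;
    }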
The threshold for the zero crossing rate is determined in a similar manner to the threshold for the absolute value score. It is given as:

    ZC_{Thres} = \overline{ZC_n} - 1.0\, \sigma_{ZC_n}    (3.6)

By experimentation it was found that the zero crossing rate does not change quite as dramatically as the absolute value score. This is because fricatives are very similar in structure to noise, so there will not be a lot of change between the zero crossing rates. A z-score of -1.0 corresponds to only 31.74%. The point at which 68.26% of the distribution of the zero crossing rate of the signal drops below the 31.74% point of the distribution of the zero crossing rate of the background noise is the start of the fricative; see Figure 3.6.

Figure 3.6: ZC Separation of Noise and Speech

The zero crossing rate fluctuates a lot, even just within the background-noise part of the signal. To be sure that the part of the signal that has just triggered the ZC threshold is actually a fricative, and not some change due to the noise, the ZC distribution must not go back above the ZC threshold more than two times. The third time that the zero crossing rate goes above the ZC threshold, it is assumed that what initially triggered it was not the beginning of a fricative but some change due to noise. In this case the position of the beginning of a fricative is reset, and the process of finding the drop in ZC rate starts over.

If the position of the absolute value score increase is found but no ZC threshold has been triggered, it can be assumed that the word begins with a voiced sound and that the beginning of the word is the position at which it crossed that threshold. If a ZC threshold is found but no absolute value score threshold, then the process must continue looking for the absolute value score threshold. The reason is that ZC thresholds are less reliable than absolute value score thresholds; a zero crossing threshold with no absolute value score threshold only means that the background noise has unstable statistics. If both ZC and absolute value score thresholds are found, then the beginning of the word is the earlier location of the two.

To find the location of the end of the word the same process is applied, but in reverse. The process now tries to find the point at which the 95.44% point of the absolute value score distribution of the signal drops below the absolute value score threshold of the noise.

Figure 3.7: End Of Word Found From AVS Threshold

Similarly, the point at which the 31.74% point of the zero crossing rate distribution of the signal goes above the ZC threshold, and stays above this threshold for at least three consecutive 10 ms intervals, is the end of the fricative. At the end of the word, the point at which the absolute value score drops below the threshold is found first. Once that point is found, we search ahead for the point at which the ZC rate increases.

Figure 3.8: End Of Word Found From ZC Threshold

The position of this threshold can occur only up to a maximum distance of three 10 ms intervals after the initial ZC threshold is determined. The reason is simple: if the word ends in a fricative, it will immediately follow the previous phoneme (either voiced, unvoiced, or fricative), so the algorithm should determine only the point at which the ZC rate rises initially, with allowances for slight fluctuations.
If the algorithm were allowed to find other ZC thresholds a long distance away from the end of the word, it might find a ZC threshold which is noise and not a correct end point. Therefore, a ZC threshold will always be found at the end of the word, but it will be in the same location as the absolute value score threshold if the word ends with a voiced sound. If the word ends in a fricative, we allow for only slight fluctuations in the ZC rate after the end of the absolute value score drop.

3.1 Results

The End Point Detector was tested on a number of recorded speech files. The accuracy was determined by cutting out the part of the signal which the algorithm designates as the word, padding zeroes onto either end of the signal, and writing this new signal to a file for subsequent listening tests. It was found that out of 160 speech files the EPD failed only twelve times, yielding an error rate of 7.5%.
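For concreteness, the start-of-word decision rules of this chapter might be sketched in C as follows. The thresholds come from the noise statistics of the first 100 ms (equations (3.4) and (3.6)); avs[] and zc[] are assumed to hold, for each 10 ms interval, the 4.56% (z = -2.0) point of the signal's AVS distribution and the 68.26% (z = -1.0) point of its ZC distribution. This is an illustrative sketch, not the thesis implementation.

    /* Returns the 10 ms interval at which the word starts, or -1 if
       no start is found. */
    int find_word_start(const double *avs, const double *zc, int nint,
                        double avs_thresh, double zc_thresh)
    {
        int zc_start = -1;   /* candidate fricative start          */
        int zc_rises = 0;    /* excursions back above ZC threshold */

        for (int i = 0; i < nint; i++) {
            /* AVS crossing: voiced (or plosive) onset.  If a fricative
               start was already seen, the word begins there instead. */
            if (avs[i] > avs_thresh)
                return (zc_start >= 0) ? zc_start : i;

            if (zc[i] < zc_thresh) {
                if (zc_start < 0) zc_start = i;   /* possible fricative */
            } else if (zc_start >= 0 && ++zc_rises >= 3) {
                zc_start = -1;                    /* noise; start over  */
                zc_rises = 0;
            }
        }
        return -1;   /* a ZC drop alone is not sufficient evidence */
    }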
Chapter 4

Noise Reduction Filters

4.1 Characteristics of the Signals

One of the major impediments to speech recognition is the effect of background noise on accuracy. So that no ill effects occur in speech recognition due to the noisy environment, the noise should be reduced from the signal before it is sent to the SRU for identification. Ultimately, the criterion for enhancement or noise reduction relates to the accuracy of recognition obtained by the SRU. It would be easier to suppress the noise if something were known about both of the signals involved, namely the speech and the background noise.

4.1.1 Speech

The speech part of the signal should be kept intact. To do this it is important to know some of the features of a speech signal. Speech sounds can be divided into three general categories:

1. Voiced Sounds. Voiced sounds are created by relaxed oscillation of the vocal cords, producing quasi-periodic pulses. An example of a voiced sound is the phoneme "oo" as in boot.

2. Fricatives or Unvoiced Sounds. Fricatives are created by forming a constriction at some point in the vocal tract and forcing air through the constriction, producing a broad-spectrum noise source. An example of a fricative is the phoneme "th" as in that.

3. Plosive Sounds. Plosive sounds are created by completely closing the vocal tract, building up pressure and releasing it quickly, thereby producing a short burst of noise-like sound. An example of a plosive is "b" as in bet.

The frequency content of each phoneme of speech is different; therefore, from one phoneme to another, the properties of speech are changing. Since the speed of enunciation of a word is limited by the physiological aspects of the vocal tract, the properties of the signal change relatively slowly, and speech can be modeled as the response of a slowly varying linear system. If a short-time sample of speech is windowed, it can be processed as a semi-periodic wave. However, over the complete signal (word, phrase, or sentence) the spectral components of speech are changing, and therefore some consideration must be given to the non-stationary nature of speech.

The vowel segments of the signal tend to have well-defined frequency components, or formant frequencies as they are generally called. The fricative components do not have well-defined frequency components; fricative sounds produce a static turbulent sound much like a noise signal, and it is therefore hard to separate these two signals.

As well as different sounds producing different frequency components, different speakers produce varying spectra. For example, speech tends to be in the frequency range of 0 to 5000 Hz; however, for male speakers this range can be as low as 2000 Hz, and for children as high as 7000 Hz. When processing a speech signal it is important to cover the full range of frequency components.

4.1.2 Characteristics of Noise

The noise components of the signal should be removed, or at least reduced. In order to separate the noise from the speech, it is desirable to know some of the features of the noise and how the noise is being produced. The noise in the speech signal was due mainly to machine sounds: engine sounds, noise from the hydraulics, and clanging sounds due to the movement of joints, namely the bucket, boom and stick. There are also external environmental sounds due to the movement of the excavator in its surroundings - for example, the bucket banging into surfaces, digging holes, lifting objects, etc. The external environmental noises are more variable than the machine sounds.

For analysis purposes, noise samples were recorded on site in the cab of the excavator. Since the excavator was tethered to an offboard computer it was not possible to do any digging or other auxiliary activities, so the noise recorded was strictly machine noise. Two different types of noise were recorded:

Noise 1: The noise produced when the excavator is stationary but in full-power idle.

Noise 2: The noise created by random motion of the branches of the excavator and some movement of the cab.

A power spectrum estimate of each noise type was calculated as one means of characterizing the noise signal. Since the power spectrum was an average over many noise records, any highly transient components of the signal were averaged out; this explains why the power spectra for the two types of noise, shown in Figures 4.9 and 4.10, are similar. These two ensemble power spectra can be compared to a power spectrum of noise calculated over a single 12.8 ms interval, as shown in Figure 4.11. An accurate short-time estimate of the noise components is preferable, due to the constantly changing environmental noise.

Figure 4.9: Ensemble PS of Noise 1

Another factor to consider is the physical constraints (or effects) on the noise signal. Since all recorded or input signals were collected in a small enclosed environment (inside the cab of the excavator), it was expected that the signal would be reverberated; it is very possible that the signal at the microphone is an overlapped and reflected version of the actual signal. There is no single noise source: the microphone is recording a single reference template from a diffuse noise field. The signal-to-noise ratio was also used to characterize the noise. The average SNR for all noisy speech files was found to be 12 dB.

Figure 4.10: Ensemble PS of Noise 2
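As a point of reference, the 12 dB figure is an average-power ratio. A simple way of computing such a figure is sketched below in C; the thesis does not state exactly how its value was derived, so the method here is an assumption.

    #include <math.h>

    /* SNR in dB between a speech segment (e.g., located by the EPD of
       Chapter 3) and a noise-only segment of the same recording. */
    double snr_db(const short *speech, int ns, const short *noise, int nn)
    {
        double ps = 0.0, pn = 0.0;
        for (int i = 0; i < ns; i++) ps += (double)speech[i] * speech[i];
        for (int i = 0; i < nn; i++) pn += (double)noise[i] * noise[i];
        return 10.0 * log10((ps / ns) / (pn / nn));
    }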
4.1.3 Categories of Filters Investigated

Based upon what was known or could be determined about either of the signals, a filter had to be designed that would separate the two. It was possible to obtain both the noisy speech signal and a reference signal of the noise from separate microphones. There are a number of ways to obtain an estimate of the power spectrum of the noise, as will be further explained in Section 4.2. If an estimate of the power spectrum of the noise is known, then there is the potential of removing the noise components from the speech components. The first class of filters studied incorporates the estimated noise spectrum into the filtering; this class includes Wiener and PSS filtering.

Figure 4.11: Short Time PS of Noise 1

Given a model of speech, the noise could be filtered based upon any deviation from this model. The second class of filters explores the possibility of modeling the speech as a slowly varying signal whose successive sample points are highly correlated, and the noise components as any values deviating from this model. The two filters in this class were the Linear Predictor Corrector (LPC) method and the Gaussian smoothing filter.

If an estimate of both of the signals (speech and noise) were available, it should be very easy to separate the components. The third class of filters studied obtains an estimate of the noise and an estimate of the speech-plus-noise signal simultaneously, from two different microphones. Even though a "clean" speech signal is not available, this class of filters is able to remove the noise by adaptively adjusting the filter weights to produce an optimum noise-reduced output signal. The filters in this class assume that there is enough periodicity in speech that the filter can adjust its weights to the signal. The two adaptive methods modified and tested for this application were LMS and RLS.

Generally, the criterion for evaluation of a filtering process is the improved intelligibility or quality of the filtered speech. For this application, the evaluation of the filter ultimately lies in the improved recognition by the speech recognition unit. The SRU used (VOTAN VTR6050) recognizes words based upon spectral analysis, so the main consideration for filtering is to remove the spectral components of the noise from the signal without greatly affecting the speech's spectral components. For this application it is possible to sacrifice some of the quality of the speech if the recognition rate is increased.

4.2 Noise Reduction Filters Based on PSE of the Noise

If it is possible to obtain an estimate of the frequency components of the noise in the signal, then this knowledge can be used to remove the noise components while leaving the speech components unaltered. It is essential that an accurate estimate of the noise signal be acquired, since any inaccuracies could: 1) increase the amount of noise in the filtered output signal instead of reducing it; or 2) distort the speech signal by removing some of the speech components.

In order to be able to separate the two signals with a linear filter, it must be assumed that the speech and noise are uncorrelated and additive. The problem is further simplified if it is assumed that the speech is a slowly varying signal, stationary over short time intervals and characterized by a set of resonances. The general form of the noise reduction filters based upon knowledge of the power spectrum of the noise is shown in Figure 4.12.

Figure 4.12: Filters Based on PSE of Noise

It is essential that a good representation of the magnitude of the short-time spectrum be obtained; the phase, however, is relatively unimportant. Enhancing only the short-time spectral amplitude leads to a zero-phase filter: the phase of the filtered speech is identical to that of the original noisy speech.

The filter weights of H(ω) are based upon the values estimated for the power spectrum of the noise. Two possibilities were examined for the filter. The first was to subtract the power spectrum of the noise from the power spectrum of the noisy speech; this method is called Power Spectral Subtraction (PSS). The second approach was to factor the frequency components of the noise out of the noisy speech; this method is generally referred to as Wiener filtering. Since both of these filters are magnitude-only, the phase information is passed unchanged.

As mentioned, it is essential that an accurate estimate of the power spectrum of the noise be derived. The question arises as to how this estimate can be obtained. In our case there were three possibilities, numbered below for further reference:

1. Noise estimated during the silent part of the signal. This estimate makes use of the end-point detector, described in Chapter 3, to determine the beginning and ending of the word.

2. Noise estimated a priori from previously recorded noise signals.

3. Noise estimated from a second microphone input while the operator is speaking into the first microphone.

The first estimate has the advantage that it gives a good short-term estimate of the noise just before the word is spoken. Since the noise is recorded through the same microphone as the signal, there is no need to worry about reverberation effects causing the estimated magnitude of the noise to be mismatched to the magnitude of the noise in the signal. A problem with estimating the noise in this way arises if the background noise changes while the operator is speaking; if this occurs, the estimate will be inaccurate. Since there was a good chance that the noise components were non-stationary, it was expected that method 1 might not give a good estimate.

The second method is an ensemble average; it is therefore less accurate over a short time interval, but is not as greatly affected as method 1 by inaccurate transient frequency components. This estimate is good at determining stationary noise components, but it has problems estimating the temporal noise components.
Enhancing only the short-time spectral amplitude leads to a zero phase filter. The phase of the filtered speech is identical to that of the original noisy speech. The filter weights of H(u>) are based upon the values estimated for the power spectrum of the noise. Two possibilities were examined for the filter. The first was the method of subtracting the amount of the power spectrum of the noise from the power spectrum of the noisy speech.  This method is called Power Spectrum Subtraction  (PSS). The second approach was to factor out the frequency components of the noise from the noisy speech. This method is generally referred to as Wienerfiltering.Since  Chapter 4. Noise Reduction Filters  39  both of these filters axe magnitude only, the phase information is passed unchanged. As mentioned, it is essential that an accurate estimate of the power spectrum of the noise is derived. The question arises as to how this estimate can be obtained. In our case there were three possibilities, numbered below for further reference. 1. Noise estimated during silent part of the signal. This estimate makes use of the end-point detector, described in Chapter 3, to determine the beginning and ending of the word. 2. Noise estimated a priori from previously recorded noise signals. 3. Noise estimated from a second microphone input while operator is speaking into thefirstmicrophone. The first estimate has the advantage that it gives a good short term estimate of the noise just before the word is spoken. Since the noise is recorded through the same microphone as the signal there is no need to worry about effects of reverberation causing the estimate of the magnitude of the noise to be mis-matched to the magnitude of the noise in the signal. A problem with estimating the noise in this way arises if the background noise changes during the time that the operator is speaking. If this occurs then the estimate will be inaccurate. Since there was a good chance that the noise components were non-stationary it was expected that method 1 might not be a good estimate. The second method is an ensemble average, therefore it is less accurate over a shorttime interval, but is not as greatly effected as method one is by inaccurate transient frequency components. This estimate is good at determining stationary noise components however, it will have problems estimating the temporal noise components. The third method of estimating the noise components of the signal continuously from a second microphone, has an advantage over the previous two methods because  Chapter 4. Noise Reduction Filters  40  it can track changes in the noise. The disadvantage of this noise estimate is matching the magnitude and phase between the two microphones. Since the two microphones are separated by a distance of about 50 cm, the magnitude of the estimate of the noise could be larger or smaller then the magnitude of the estimate of the noise in the signal. There is the added complexity of matching magnitude between the two estimates of the power spectrum with this method. For the first method of estimating noise (ie. at the beginning of the signal), the number of sections (K) was set equal to INT (number of samples before the beginning of word/256). The noise was estimated in sections of 512 (each section has 256 data points and 256 zeroes) sample points for K sections before the start of the word. The second estimate of the noise was determined from an ensemble average of prerecorded noise files. There were 29952 (N) sample points for each record. 
Each record was divided into 117 sections of 256 points each, zero padded with 256 points for an FFT size of 512. Since the sampling rate was 20 kHz, the resultant power spectral estimate was calculated at equally spaced frequencies of 39.0625 Hz. For each noise type there were 80 files; therefore the average was estimated over a total of 9360 (80 × 117) sections of noise data.

The third method, estimating the noise continuously, calculated the noise spectrum for each 256 sample points. Each section was padded with 256 zeroes. The power spectrum of the noise was re-estimated for each section of the signal from the second microphone.

Due to the magnitude difference between the microphones, the third estimate of the power spectrum had to determine a scaling factor equal to the magnitude difference. The scale factor was determined by calculating the sum of the magnitude of the power spectrum at all frequency components for both microphones during the first section of the signal (time = 0). For example, the magnitude of microphone number 1 is equal to Σ_{k=0}^{M−1} |X_{m1}(ω_k)|². The scale value was equal to the magnitude of microphone 1 divided by the magnitude of microphone 2. For each successive filtered section of data, the magnitude of the power spectrum of the noise over that window is multiplied by the scaling value. This scaling factor adjusts the magnitude to be similar between the two microphones.

Note that for all methods of noise power spectrum estimation, the section size used was 256 data points, windowed with a Hamming window and padded with 256 zeroes due to the overlap and add method. Since a 512-point FFT gives a maximum resolution of 39.0625 Hz, each sample of the power spectrum is a smoothed version of the actual resolution.

4.2.1 Power Spectral Subtraction

Power Spectral Subtraction (PSS) makes an estimate of the power spectrum of the noise and subtracts this value from the short time power density spectrum of the signal. Consider:

x(t) = s(t) + n(t)    (4.7)

where x(t) is the noisy speech, s(t) is the speech and n(t) is the noise at time t. Hence:

P_x(ω) = P_s(ω) + P_n(ω)    (4.8)

where P_x(ω), P_s(ω) and P_n(ω) are the power density spectrums of x(t), s(t) and n(t) respectively. If P_n(ω) is known or can be determined, then it is possible to subtract that value from P_x(ω) and obtain a noise free estimate P_s(ω) of the power spectrum of the speech.

This method only works if speech is a stationary signal, which it is not. However, speech can be assumed stationary over short time intervals. The PSS method is therefore derived using windowed versions of s(t), n(t) and x(t), namely x_w(t), n_w(t) and s_w(t), with their Fourier transforms being X_w(ω), N_w(ω) and S_w(ω), respectively:

x_w(t) = s_w(t) + n_w(t)    (4.9)

and:

X_w(ω) = S_w(ω) + N_w(ω)    (4.10)

The power spectrum of the signal is calculated as:

|X_w(ω)|² = |S_w(ω)|² + |N_w(ω)|² + S*_w(ω)N_w(ω) + N*_w(ω)S_w(ω)    (4.11)

A close approximation to equation (4.11) is:

|X_w(ω)|² = |S_w(ω)|² + E[|N_w(ω)|²] + E[S*_w(ω)N_w(ω)] + E[N*_w(ω)S_w(ω)]    (4.12)

The expected value terms ( E[·] ) are required since it is not possible to determine exact values for these quantities. If n(t) has zero mean and is uncorrelated with s(t), as assumed in the previous discussion, then E[S*_w(ω)N_w(ω)] and E[N*_w(ω)S_w(ω)] are zero.
The speech is estimated as:

|Ŝ_w(ω)|² = |X_w(ω)|² − E[|N_w(ω)|²]    (4.13)

where both quantities |X_w(ω)|² and E[|N_w(ω)|²] are easily obtained. There is a chance that |X_w(ω)|² − E[|N_w(ω)|²] could be less than zero, since the noise spectrum estimate allows for a margin of error. In this case the estimate of the speech power spectrum is set to zero.

In order to determine the estimate of s_w(t), the phase information from the original signal X_w(ω) is retained, and the magnitude information is obtained from |Ŝ_w(ω)|. Hence:

Ŝ_w(ω) = |Ŝ_w(ω)| e^{j arg X_w(ω)}    (4.14)

and:

ŝ_w(t) = F⁻¹[Ŝ_w(ω)]    (4.15)

Figure 4.13 shows a block diagram of the Power Spectral Subtraction method. The values that are known or can be calculated are |X_w(ω)|² and E[|N_w(ω)|²]. As mentioned in the last section, the estimate of the noise power spectrum was determined in three ways.

Figure 4.13: PSS Filter

4.2.2 Wiener Noise Filter

Wiener filtering is a method of frequency weighting. An optimum fixed filter is determined based upon an estimate of the noise signal. The signals x(t), s(t) and n(t) must be windowed due to the nonperiodic nature of speech. The estimate of the speech signal is obtained by filtering the input signal x(t) with the optimum filter H(ω):

Ŝ(ω) = H(ω)X(ω)    (4.16)

The optimum filter H(ω) is derived based on Wiener's optimum noise filtering method, which is briefly outlined as follows.

Consider the noisy signal as comprised of speech and noise such that x(t) = s(t) + n(t). s(t) and n(t) are uncorrelated stationary random processes, and their power density spectrums are P_s(ω) and P_n(ω), respectively. Therefore P_x(ω) = P_s(ω) + P_n(ω). Wiener's approach uses a linear estimator of s(t) which minimizes the mean square error. The noncausal Wiener filter frequency response is given as:

H(ω) = P_s(ω) / (P_s(ω) + P_n(ω))    (4.17)

Since speech is not stationary and the spectrum P_s(ω) cannot be assumed known, the filter must be derived based on an estimate of the signals over a short time window:

H_w(ω) = E[|S_w(ω)|²] / (E[|S_w(ω)|²] + E[|N_w(ω)|²])    (4.18)

Once again, the estimate of the noise N_w(ω) is obtained in one of the three ways mentioned in Section 4.2. Assuming that E[|S_w(ω)|²] has no zero values, equation (4.18) may be rewritten as:

H_w(ω) = 1 / (1 + E[|N_w(ω)|²] / E[|S_w(ω)|²])    (4.19)

Since E[|S_w(ω)|²] was unknown, it was assumed to be broadband and fairly evenly distributed (which can only be true in the long term case). Hence the estimate of |S_w(ω)|² for all frequencies was set equal to 1. The optimum filter transfer function then becomes:

H_w(ω) = 1 / (1 + E[|N_w(ω)|²])    (4.20)

and the estimate of the speech can be found using equation (4.16). The Wiener filter is a zero phase filter, and hence the phase of the output signal is the same as the phase of the input signal:

ŝ_w(t) = F⁻¹[ |Ŝ_w(ω)| e^{j arg X_w(ω)} ]    (4.21)

The weights of a typical Wiener filter, based upon an estimate of the power spectrum of the noise during the silent part of the signal, are shown in Figure 4.14.

Figure 4.14: Wiener Filter Frequency Weights
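To make the two rules concrete, the following C fragment applies the PSS rule of equation (4.13) and the Wiener rule of equation (4.20) to one windowed section. It is a minimal sketch rather than the thesis implementation: it assumes the 512-point FFT has already been taken, so the inputs are simply arrays holding |X_w(ω)|² and E[|N_w(ω)|²] for each frequency bin, and the array names and bin count are invented for the example.

    #include <math.h>

    #define NBINS 257   /* bins 0..256 of a 512-point FFT (one side) */

    /* PSS rule (4.13): subtract the noise power spectrum estimate,
       setting any negative result to zero. */
    void pss_frame(const double px[], const double pn[], double ps[])
    {
        for (int k = 0; k < NBINS; k++) {
            double d = px[k] - pn[k];
            ps[k] = (d > 0.0) ? d : 0.0;
        }
    }

    /* Wiener rule (4.20): gain H = 1/(1 + Pn); since |S| = H|X|,
       the power spectrum is scaled by H squared. */
    void wiener_frame(const double px[], const double pn[], double ps[])
    {
        for (int k = 0; k < NBINS; k++) {
            double h = 1.0 / (1.0 + pn[k]);
            ps[k] = h * h * px[k];
        }
    }

In both cases the square root of ps[k] gives the enhanced magnitude, which is recombined with the phase of X_w(ω) before the inverse FFT, as in equations (4.14) and (4.21).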
Figure 4.15: Wiener Filter

An intuitive feel for this filter is that components of the input X(ω) in ranges where the noise frequency components are large are reduced by a large amount in the output estimate of the speech, whereas noise frequency components that have small values (≪ 1) hardly change the value of that frequency in the output speech estimate.

4.2.3 Implementation and Testing

Since the noise was estimated in three different ways, three different implementations of the Wiener and PSS filters were needed.

If the noise is estimated during the silent part of the signal, the end point detector is used to determine the point at which the speech starts. Then an estimate of the noise spectrum before this point is made, and the values of the filter weights are stored until the end of the word is detected. During the time that the word is being spoken, the filter weights are fixed. As soon as the end of the word is detected, the method starts determining a new estimate of the power spectrum of the noise. Figure 4.16 shows the block diagram for this filtering method.

If the noise estimate is based on an ensemble average calculated from some prior knowledge of the noise, then the filter values are simply stored in memory; hence the filter is fixed. Figure 4.12 shows the block diagram for this filtering method.

Figure 4.16: Filter With PSE # 2 of Noise

When the power spectrum of the noise is estimated from a second microphone input, the filter weights must be continuously updated. The filter weights and the power spectrum of the noisy speech were recalculated every M (256) samples. Figure 4.17 shows the set up for this method.

Implementation of all filters used the overlap and add method to process the continuous speech signal by parts, followed by an FFT routine to convert the signal to the frequency domain so that linear filtering can be applied. A Hamming window was selected to window the speech before processing. It is essential that the speech signal is windowed: if a finite stretch of speech is isolated and Fourier transformed, without regard to the pitch of the speech, then the periodicity assumptions of the FFT will be violated, and discontinuities at the edges of the isolated speech signal will cause considerable distortion of the speech spectrum. Distortion can be minimized by using a smooth window, hence the Hamming window was chosen.

Figure 4.17: Filter With PSE # 3 of Noise

Welch's method [26] was used to estimate the noise power spectrum in all cases. Briefly, the derivation for Welch's method is as follows. Each complete noise record (or signal), consisting of N points, is divided into K sections of M samples each. Therefore, K power spectrums are calculated for each noise record. Let:

X_w^(i)(ω) = Σ_{n=0}^{M−1} x^(i)(n) w(n) e^{−jωn},  i = 1, 2, …, K    (4.22)

And:

U = (1/M) Σ_{n=0}^{M−1} w²(n)    (4.23)

Then the spectral estimate is:

B_x(ω_k) = (1/(K M U)) Σ_{i=1}^{K} |X_w^(i)(ω_k)|²,  k = 0, 1, …, M−1    (4.24)

To estimate the power spectrum using FFTs, calculate:

X_w^(i)(k) = Σ_{n=0}^{M−1} x^(i)(n) w(n) e^{−j2πkn/M},  k = 0, 1, …, M−1    (4.25)

for the K sections. The FFTs are computed at equally spaced frequencies ω_k = (2π/M)k. The value |X_w^(i)(k)|² is calculated for each section, the results are summed over all K sections and averaged by dividing by K M U. The result is an estimate of the smoothed power spectrum of the signal.
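The following C sketch shows the shape of this computation for one noise record. It is an illustration rather than the thesis code: to keep it self-contained, a direct O(M²) DFT stands in for the FFT routine, there is no zero padding, and the section length is deliberately tiny.

    #include <math.h>

    #ifndef M_PI
    #define M_PI 3.14159265358979323846
    #endif

    #define M 8   /* section length (256 in the thesis) */

    /* Welch estimate over K = nsamples/M sections of x[], Hamming
       windowed, accumulated into psd[0..M-1]. */
    void welch_psd(const double x[], int nsamples, double psd[])
    {
        double w[M], U = 0.0;
        int K = nsamples / M;

        for (int n = 0; n < M; n++) {            /* window and U, eq. (4.23) */
            w[n] = 0.54 - 0.46 * cos(2.0 * M_PI * n / (M - 1));
            U += w[n] * w[n] / M;
        }
        for (int k = 0; k < M; k++) psd[k] = 0.0;

        for (int i = 0; i < K; i++) {            /* one DFT per section, eq. (4.25) */
            for (int k = 0; k < M; k++) {
                double re = 0.0, im = 0.0;
                for (int n = 0; n < M; n++) {
                    double v = x[i * M + n] * w[n];
                    re += v * cos(2.0 * M_PI * k * n / M);
                    im -= v * sin(2.0 * M_PI * k * n / M);
                }
                psd[k] += re * re + im * im;     /* accumulate |X|^2 */
            }
        }
        for (int k = 0; k < M; k++) psd[k] /= K * M * U;   /* eq. (4.24) */
    }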
4.3 Adaptive Noise Cancelling Filters

Adaptive noise cancelling filters attempt to estimate a signal which has been corrupted with additive noise by adjusting their filter weights. An adaptive filter differs from a fixed filter in that it automatically adjusts its own impulse response. The basic structure of an adaptive noise cancelling filter is shown in Figure 4.18.

Figure 4.18: General Form of an Adaptive Filter

There are two categories of adaptive filters, (1) stochastic and (2) exact.

1. Stochastic filters are derived using a statistical error measure but implemented using the exact data available. The statistical measurement is the ensemble average of the squared prediction error. The optimum weights of the filter are W*_N = R_NN⁻¹ P_N, where:

• W_N is the filter weight vector of N values.
• R_NN is the autocorrelation matrix of the input signal (n₁).
• P_N is the cross-correlation vector between the desired signal (s + n₀) and n₁.

The solution to this equation is found using a gradient measurement to estimate the value R⁻¹P; no actual computation of R⁻¹ is ever done. For complete details on the derivation of this equation see [35], [36]. An important feature of this filter is that the optimum weights (W*_N) should converge to a single value. For this filter to be effective, the noise signal must be stationary, or varying slowly enough that the filter can track the changing statistics.

In particular, this study examined the LMS (Least Mean Squares) adaptive filter, which uses the approach of steepest descent for solving the normal equations. The LMS algorithm was first derived by Widrow and Hoff in 1959 at Stanford University. The first adaptive noise cancelling system using the LMS algorithm was also designed at Stanford University, by two students, in 1965 [36].

2. An exact adaptive filter predicts the noise reduced output signal based upon error measurements derived from the exact data signals acquired. It turns out that the weights of the filter can be found from the equation:

W_N(n) = R_NN⁻¹(n) P_N(n)

where the value in brackets (n) signifies the weights, correlation, or autocorrelation at the exact time n (or sample number n). This method re-estimates the inverse autocorrelation matrix at each time interval; the filter is therefore able to follow the actual data signals and any transient components of them. The family of exact methods that determine the inverse correlation matrix in order to predict the weights of the filter are known as RLS (Recursive Least Squares) filters. A convenient feature of the RLS is a parameter called the "forgetting factor". This parameter is a data weighting factor which allows recent data to be weighted more heavily in the calculations.

Initial work in the area of exact adaptive filters can be attributed to Kalman, who in 1960 developed a state space stochastic filter. RLS filters have been used for such applications as channel equalization, system identification and speech analysis. More information on the RLS adaptive filter can be found in [1].

4.3.1 LMS Adaptive Filters

This section briefly outlines the LMS algorithm, first derived by Widrow, but adapted for our work.
Referring back to Figure 4.18 it can be seen that the adaptive filter changes based on the feedback error value. The adaptive filter is attempting to minimize this error by producing an output z = s + n₀ − y that is a best fit, in the least squares sense, to the signal s.

Assume that s, n₀, n₁ and y are statistically stationary and have zero means. No knowledge of the characteristics of s, n₀ or n₁ is required. However, it is assumed that s and n₀ or n₁ are uncorrelated, and that n₀ and n₁ are correlated, but in some unknown way. For our purposes the signal s is the speech signal, n₀ is the noise in the input signal and n₁ is the reference noise recorded by the second microphone.

The adaptive filter must transform n₁ to n₀ in order to produce an output z that is the best estimate of s. Adjustment of the filter is based on the filter output z, which serves as an error signal for the adaptive process: the filter adjusts itself to minimize the output error. To obtain maximum performance the output signal should be s, in which case the minimum error output would be s, with n₀ − y = 0.

The key equations for the LMS filter are as follows. The output of the filter is defined as:

z = e = s + n₀ − y    (4.26)

The output of the adaptive filter, y_j, is computed as the sum of N filter weights times N input noise components. Figure 4.19 shows the diagram of the adaptive linear combiner. The combiner is linear only in the sense that the weights and the input components are multiplied and summed; the weights, however, are not fixed. They change with the characteristics of the inputs, and therefore an adaptive filter is in fact non-linear.

The output of the adaptive linear combiner is equal to the inner product of W (the filter weights) and X_j (the reference noise signal):

y_j = X_jᵀ W = Wᵀ X_j    (4.27)

The error component can now be rewritten as:

e_j = d_j − X_jᵀ W    (4.28)

where d_j is the desired signal or training signal. For the problem of reducing noise from speech, the desired signal is (s + n₀), since this is the only signal available; d_j is defined as (s + n₀)_j, which is a scalar quantity.

Adjusting the weights of the W vector is necessary to minimize the error. When the minimum error is obtained, the filter weights have converged to a single optimum value. The gradient method is used to minimize the error term: by setting the gradient to zero to determine the minimum, W* = R⁻¹P is obtained.

The LMS algorithm finds close approximations to the optimum weight vector without measuring the correlation vector and without any inverse matrix calculations. It does this by using an iterative steepest descent method, where the next weight is equal to the last weight plus the change due to the negative gradient:

W_{j+1} = W_j − μ∇_j    (4.29)

The value μ is a stability factor and ∇_j is the true gradient at the jth iteration. It can be found that an estimate of the true gradient is:

∇̂_j = −2e_j X_j    (4.30)

Substituting this equation back into (4.29) yields:

W_{j+1} = W_j + 2μe_j X_j    (4.31)

The LMS adaptive noise cancelling filter was implemented in software using the programming language C. The algorithm is outlined in Table 4.4.

Initialize:
    w₀(0) = 1
    w_n(0) = 1/N  (1 ≤ n < N)
Operation:
    j = j + 1
    Acquire d_j, X_j
    Calculate new output value: y_j = Σ_{n=0}^{N−1} x_{j−n} w_n
    Form prediction error: e_j = d_j − y_j
    Update LMS predictor: W_{j+1} = W_j + 2μ e_j X_j

Table 4.4: Algorithm To Implement the LMS Filter
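A minimal C rendering of Table 4.4 is given below. It is an illustrative sketch, not the thesis program: the tap count, the step size μ, and the synthetic signals in main() are all invented for the example. In the real system d would come from the speech-plus-noise microphone and x from the reference noise microphone.

    #include <stdio.h>
    #include <math.h>

    #define N  8        /* number of filter taps (assumed for the example) */
    #define MU 0.01     /* step size mu (assumed) */

    int main(void)
    {
        double w[N], x[N] = {0};   /* weights and reference-noise delay line */
        int n, j;

        w[0] = 1.0;                             /* initialize as in Table 4.4 */
        for (n = 1; n < N; n++) w[n] = 1.0 / N;

        for (j = 0; j < 1000; j++) {
            /* Synthetic data: a slow "speech" sinusoid plus correlated noise. */
            double noise = sin(0.9 * j);              /* n1: reference noise */
            double d = sin(0.03 * j) + 0.8 * noise;   /* d = s + n0          */

            for (n = N - 1; n > 0; n--) x[n] = x[n - 1];
            x[0] = noise;

            double y = 0.0;                     /* y_j = sum of x_{j-n} w_n   */
            for (n = 0; n < N; n++) y += w[n] * x[n];

            double e = d - y;                   /* z = e = s + n0 - y         */
            for (n = 0; n < N; n++)             /* W_{j+1} = W_j + 2 mu e X_j */
                w[n] += 2.0 * MU * e * x[n];

            if (j % 200 == 0) printf("j=%4d  output z=%f\n", j, e);
        }
        return 0;
    }

As the weights converge, y cancels the n₀ component and the error output z approaches the speech term s.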
4.3.2 Recursive Least Squares Adaptive Noise Reduction Filter

The Recursive Least Squares (RLS) prediction filter is based on error measurements derived from the exact data acquired. In comparison, LMS filtering is based on a statistical error measure, the long-term accumulated squared error of the actual data. To derive the optimum weight values for the RLS method, the actual data vector [x(1) ⋯ x(n)]ᵀ is used in the predictor. Therefore, a new criterion is recomputed and optimized at every time iteration n. The RLS filter produces results that are exactly optimum for the acquired data, not statistically optimum for a class of data, as with the LMS method. This suggests that the RLS filter requires intensive computation, which is true if it is computed directly. However, fast Kalman filter and vector space algorithms exist which can significantly reduce the complexity.

Briefly, the derivation and notation for the exact error measurement is as follows. W_N(n) is the N length weight vector of the filter at time n. The error is:

e(i|n) = d(i) − X_Nᵀ(i) W_N(n)    (4.32)

e(i|n) is the error at the output of the filter at the ith sample, given the N sample input data vector X_N(i), the desired signal d(i) at time i and the N optimum filter weights W_N(n). The RLS method attempts to find a set of filter weights W_N(n) such that the cumulative squared error measurement:

ε(n) = Σ_{i=1}^{n} λ^{n−i} e²(i|n)    (4.33)

is minimized.

An important feature of the last equation is the variable λ. This quantity is known as the "forgetting factor", a data-weighting factor. λ acts like a window, causing recent data to have more of an emphasis in the error analysis. As λ decreases, the window size decreases, and the cumulative error function is then based only on recent data points. If the data is stationary, λ can be set equal to 1 for good results; in this case the RLS method obtains results very similar to the LMS method. If the statistics of the data signal are changing, λ should be set to a value less than 1. Normally this value is in the range 0.95 to 0.9995, depending upon how quickly the data is changing.

Equation (4.34) shows that the optimum filter weights at time n can be obtained with a time recursion plus a gain term due to the exact error measurement obtained from the actual data acquired. Hence, the error measurement changes at each time iteration n, and the filter weights are a function of the current data and of the window size set by λ:

W_N(n) = W_N(n − 1) + g_N(n) e(n|n − 1)    (4.34)

The RLS adaptive noise cancelling filter was implemented in software, using the programming language C. The algorithm is given in Table 4.5.

Initialize:
    W_N(0) = X_N(0) = 0
    C_NN(0) = δ I_NN  (δ ≫ 1)
Operation:
    n = n + 1
    Acquire d(n), X_N(n)
    Form prediction error: e(n|n−1) = d(n) − X_Nᵀ(n) W_N(n−1)
    Calculate new gain vector:
        ρ(n) = X_Nᵀ(n) C_NN(n−1) X_N(n)
        g_N(n) = C_NN(n−1) X_N(n) / (λ + ρ(n))
    Update RLS predictor: W_N(n) = W_N(n−1) + g_N(n) e(n|n−1)
    Update matrix inverse: C_NN(n) = (1/λ) [ C_NN(n−1) − g_N(n) X_Nᵀ(n) C_NN(n−1) ]

Table 4.5: Algorithm To Implement the RLS Filter
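The C sketch below fills in Table 4.5 for a small filter. As with the LMS example, it is only a sketch under assumed parameters — the tap count, λ, δ and the synthetic signals are invented — and the O(N²) matrix update is written out directly rather than through a fast algorithm.

    #include <stdio.h>
    #include <math.h>

    #define N      4        /* filter length (assumed for the example) */
    #define LAMBDA 0.995    /* forgetting factor lambda */
    #define DELTA  1000.0   /* large initial value: C(0) = delta * I  */

    int main(void)
    {
        double w[N] = {0}, x[N] = {0};   /* W_N(0) = X_N(0) = 0 */
        double C[N][N] = {{0}};          /* inverse correlation matrix C_NN */
        int i, j, n;

        for (i = 0; i < N; i++) C[i][i] = DELTA;

        for (n = 0; n < 1000; n++) {
            double noise = sin(0.9 * n);              /* n1: reference noise */
            double d = sin(0.03 * n) + 0.8 * noise;   /* d = s + n0          */
            double u[N], rho = 0.0, g[N], e;

            for (i = N - 1; i > 0; i--) x[i] = x[i - 1];
            x[0] = noise;

            e = d;                                    /* e(n|n-1) = d - X'W  */
            for (i = 0; i < N; i++) e -= x[i] * w[i];

            for (i = 0; i < N; i++) {                 /* u = C(n-1) X        */
                u[i] = 0.0;
                for (j = 0; j < N; j++) u[i] += C[i][j] * x[j];
            }
            for (i = 0; i < N; i++) rho += x[i] * u[i];            /* rho = X'u */
            for (i = 0; i < N; i++) g[i] = u[i] / (LAMBDA + rho);  /* gain      */

            for (i = 0; i < N; i++) w[i] += g[i] * e; /* W(n) = W(n-1) + g e */

            /* C(n) = (1/lambda)[C(n-1) - g X'C(n-1)]; since C is symmetric,
               X'C(n-1) equals u', so the outer product is g[i]*u[j]. */
            for (i = 0; i < N; i++)
                for (j = 0; j < N; j++)
                    C[i][j] = (C[i][j] - g[i] * u[j]) / LAMBDA;

            if (n % 200 == 0) printf("n=%4d  output z=%f\n", n, e);
        }
        return 0;
    }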
4.4 Enhancement Filters Based On A Simple Model of Speech

This class of noise filters requires very little prior information; however, some basic assumptions are made about the speech signal. The first assumption is that speech is periodic over short time periods, whereas noise has a vicissitudinous nature. The second assumption is that speech is slowly varying over a few data samples, and therefore adjacent speech samples are highly correlated. This class of filters attempts to track the input signal s + n₀, removing or averaging out any quick changes.

The first method investigated was a linear predictor corrector method, described in detail in Section 4.4.1. The predictor uses the information from the last n samples to determine if the current sample is correct. If the current sample is out of the range of the last n samples, then it is assumed that the new data sample contains noise. The corrector uses an Nth order Taylor series expansion to estimate the proper sample value. This method removes any value that is out of range and estimates a correct value based on previous data.

The second method is a common Gaussian convolution routine, described in detail in Section 4.4.2. This method redetermines the current sample based upon a weighted average of the windowed noisy input data. It is a weighted average method which removes any quick changes in the data and produces a "smoothed" version of the input signal.

The advantage of this class of filters is its simplicity. No second microphone input is needed. No scaling or estimation of the noise spectrum needs to be done. Also, the methods can be applied in the time domain, thereby avoiding the windowing effects that can be a problem with the spectrum estimation filters. Of course, with this simplicity comes a gross underestimation of the complexity of the signal.

4.4.1 Linear Predictor Corrector (LPC) Method

The Linear Predictor Corrector attempts to track the incoming signal, re-estimating by extrapolation any point that is out of range. The premise of this method is to follow the signal very closely, re-estimating only the samples that appear incorrect. As the amount of noise in the signal increases, it becomes harder for this method to accurately track the "proper signal".

LPC is basically a two stage method. The first stage determines if the next sample is an acceptable value. The predictor is based upon the range of the absolute value of the first derivative of the last N samples; N in our case was set to four. Since it is assumed that adjacent samples are highly correlated, the first derivative should change slowly. If the absolute value of the first derivative of the current sample is greater than the maximum value of the last four first derivatives, then the current sample is assumed incorrect.

The first derivative is a measurement of the slope. Since speech is modeled as a linear system, it is valid to assume that between correlated samples of speech the slope varies slowly. The amount by which the signal can change is restricted by targeting data samples outside the current slope range as incorrect.
By using the absolute value of all first derivatives, a range of plus or minus the maximum slope is allowed for the current sample. Instead of finding the minimum and maximum separately, this approach necessarily allows the slope to change direction. The absolute value of the first derivative acts as an error range for the current sample. The error range can be decreased by setting the maximum value of the first derivative to the second (or third) largest value (in sorted order) in the range of the first derivatives. Lowering this range forces the output signal to be less varying, but the output then does not necessarily follow the incoming signal as closely.

The second stage - the corrector - is based upon a Taylor series expansion. The order of the Taylor series is estimated by determining the Nth derivative of the numerical data that is equal to zero. Determining the order of a polynomial equation is done by finding the derivative that is equal to zero; the order of the polynomial is then the order of that derivative minus 1. To find or predict the order of the numerical data is slightly more difficult, since there is some noise involved with the actual data.

Denote the noisy speech input as y(n), where n is the sample number. The output (predicted) noise free speech is denoted s(n). To determine the order of the Taylor series needed to predict sample n, find the numerical derivative of s(n) at which the actual value approaches zero and hovers around it. Numerically the first derivative is:

s′(n) = s(n − 1) − s(n − 2)

The second derivative is:

s″(n) = s′(n) − s′(n − 1)    (4.35)
      = s(n − 1) − s(n − 2) − [s(n − 2) − s(n − 3)]
      = s(n − 1) − 2s(n − 2) + s(n − 3)

And so on. Since there is noise associated with the actual data, very rarely will the derivative be exactly zero. Therefore it is necessary to check subsequent derivatives to determine if they are also close to zero.

Once the order of the last m sample points has been determined, Taylor series expansion is used to correct the current sample, s(n). Taylor's series is defined as:

f(x) = Σ_{k=0}^{n} [f^(k)(x₀)/k!] (x − x₀)^k + [f^(n+1)(ξ)/(n+1)!] (x − x₀)^(n+1)    (4.36)

The second term is the error term E_{n+1}. Therefore:

f(x) = Σ_{k=0}^{n} [f^(k)(x₀)/k!] (x − x₀)^k + E_{n+1}    (4.37)

But in this case we wish to determine the next sample point a distance (or time) h ahead, where h is the spacing between sample points and in this case is constant. Therefore, by replacing x by x+h and x₀ by x, the proper form is obtained, namely:

f(x + h) = Σ_{k=0}^{n} [f^(k)(x)/k!] h^k + E_{n+1}    (4.38)

This equation can be interpreted as follows: at the next time interval, x+h, an estimate of the function can be obtained by summing the derivatives of the current function (at location x) times powers of the interval. Since this is an extrapolation method, there is no need to look at subsequent samples; all predicted values can be determined from previous samples. However, extrapolation tends to be less accurate than interpolation, due to the uncertainty of the order of the equation.

There are a few points to consider with this method. The first is the assumption that the waveform being tracked, namely speech, can be modelled as a continuous linear Nth order polynomial equation. This assumption is true for voiced speech, but does not hold as well for unvoiced or plosive sounds. Due to this assumption the corrected output waveform s(n) has been forced to be continuous or close to continuous.
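The following C sketch illustrates the predict-and-correct step. It is a simplification of the method just described, not the thesis routine: rather than detecting the polynomial order on the fly, it uses a fixed second order Taylor extrapolation (with the spacing h absorbed into the backward differences), and the out-of-range test follows the four-sample slope rule described above.

    #include <stdio.h>
    #include <math.h>

    /* Track signal y[] of length len into s[]; a sample whose slope jumps
       outside the range of the last four slopes is replaced by a Taylor
       extrapolation from the corrected samples before it. */
    void lpc_track(const double y[], double s[], int len)
    {
        int n, k;
        for (n = 0; n < len && n < 5; n++) s[n] = y[n];

        for (n = 5; n < len; n++) {
            double maxd = 0.0, dcur, d1, d2;
            for (k = 1; k <= 4; k++) {           /* max |first derivative| */
                double d = fabs(s[n - k] - s[n - k - 1]);
                if (d > maxd) maxd = d;
            }
            dcur = fabs(y[n] - s[n - 1]);

            if (dcur <= maxd) {
                s[n] = y[n];                     /* sample accepted */
            } else {                             /* corrector, eq. (4.38): */
                d1 = s[n - 1] - s[n - 2];                    /* f' h      */
                d2 = s[n - 1] - 2.0 * s[n - 2] + s[n - 3];   /* f'' h^2   */
                s[n] = s[n - 1] + d1 + 0.5 * d2;
            }
        }
    }

    int main(void)
    {
        double y[64], s[64];
        int n;
        for (n = 0; n < 64; n++)                 /* sine wave with one spike */
            y[n] = sin(0.2 * n) + (n == 30 ? 0.8 : 0.0);
        lpc_track(y, s, 64);
        printf("y[30]=%6.3f  corrected s[30]=%6.3f\n", y[30], s[30]);
        return 0;
    }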
The predicted order of the Taylor series expansion could be out by a few orders due to error in the signal. When determining the order of the signal at time n, the Nth derivative is not always exactly zero; therefore an error range of ±5 (out of a possible maximum resolution of 4096) is allowed.

It has been assumed that the order for the Taylor series expansion is equal to the predicted order of the output polynomial waveform s(n). If the predicted order of the polynomial waveform s(n) is correct, then this assumption will hold. To validate this claim take the case where f(x) is an actual curve of order 3, say f(x) = x³ + x² + x + 1. Then any derivative of f(x) greater than f³(x) will be zero. For real numerical data there could be small error terms associated with the values of the derivatives greater than order 3. Taylor's Theorem states that f(x) = Σ_{k=0}^{n} [f^(k)(x₀)/k!] (x − x₀)^k + E_{n+1}; therefore any derivative of f(x) greater than f^n(x), where n is the order of the curve, should be zero, and hence the summation need only be done up to the Nth derivative.

Another point that must be considered is the validity of using Taylor's series to extrapolate to a point outside the closed interval I. This method tries to predict the value x+h outside of the interval I using Taylor's theorem when, in fact, Taylor's theorem states that h should be a value such that x+h is in I. In the present circumstances this extension of the theory should not be a problem, for two reasons. The first is that the value of h is very small (200 microseconds); therefore the value of x+h can be considered to be at the boundary. The second reason that Taylor's series can still be applied is that sequential points are so close together that they should be correlated; hence (noise free) data will have similar characteristics (ie. slope, curvature, etc.).

4.4.2 Gaussian Smoothing Filtering

The Gaussian filter can be considered as a weighted averaging function. Its purpose is to reduce sensitivity to noise. The impulse response of a Gaussian filter is shown in Figure 4.20. Upon convolution of any signal with the Gaussian function, the center point in the convolution has the most weight; surrounding points in the signal figure less significantly into the weighted output average. The effect of Gaussian convolution is to retain the slowly varying characteristics of the signal and reduce the quick changes, which is somewhat like a least squares fit. The Gaussian function is defined as:

f(x) = exp(−x²/2σ²)    (4.39)

The Gaussian probability density function is defined as:

p(x) = [1/(σ√(2π))] exp(−(x − x̄)²/2σ²)    (4.40)

where σ² is the variance of the curve (relating to its width) and x̄ is the point about which the curve is centered.

Figure 4.20: Gaussian Filter Impulse Response

The Fourier transform of a Gaussian is another Gaussian, but with a different variance (or width): a broad Gaussian in the time domain produces a narrow Gaussian in the frequency domain, and vice versa. In the frequency domain, Gaussian filtering can be thought of as a frequency enhancement filter, where the frequency being enhanced is dependent upon the width of the Gaussian. In effect a Gaussian filter tends to "blur" or smooth the signal. If the Gaussian is large, it tends to enhance gross features of the signal, and if it is small, it enhances finer features. Therefore, smoothing of a signal amounts to isolating magnitude changes at a particular scale, and hence filtering signal noise at different degrees of coarseness.
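In the time domain the filter is simply a convolution with a sampled Gaussian. The C sketch below builds such a kernel for a chosen variance and applies it. The kernel length and variance here are placeholders (the sizes actually used are discussed next), and the weights are normalized to sum to one so that the local average is preserved — a convenience the sketch adds.

    #include <math.h>

    #define KLEN 9          /* kernel length (placeholder) */
    #define VAR  4.0        /* Gaussian variance (placeholder) */

    /* Smooth x[] of length len into y[] by convolution with a Gaussian,
       f(x) = exp(-x^2 / 2 var) as in equation (4.39). */
    void gauss_smooth(const double x[], double y[], int len)
    {
        double g[KLEN], sum = 0.0;
        int half = KLEN / 2, n, i;

        for (i = 0; i < KLEN; i++) {
            double d = i - half;
            g[i] = exp(-d * d / (2.0 * VAR));
            sum += g[i];
        }
        for (i = 0; i < KLEN; i++) g[i] /= sum;   /* unity-gain weights */

        for (n = 0; n < len; n++) {
            y[n] = 0.0;
            for (i = 0; i < KLEN; i++) {
                int m = n + i - half;
                if (m >= 0 && m < len) y[n] += g[i] * x[m];
            }
        }
    }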
If the noise in the signal (ie. excavator noise) had a Gaussian frequency response, then the Gaussian filter would work well as a noise reduction filter. However, the specific noise in this problem is broadband; therefore the best that can be done is to filter certain components of the noise.

Different sizes of Gaussians were chosen for filtering the noise. In particular, filter sizes of 4, 8, 16 and 32 points were chosen, with variances of 4, 16, 64 and 256 respectively, to ensure that the Gaussian response was spread over the complete range of the filter. It was assumed that the major frequency components of the noise were in the range of 0 to 500 Hz, hence the choice of Gaussians. (The sampling rate equals 20 kHz; therefore 40 samples in the frequency domain equal 500 Hz.) It should be noted that there is a significant speech component in this range.

4.4.3 Implementation

Since the size of the Gaussian was small, it was applied as a convolution in the time domain, as opposed to a multiplication in the frequency domain. The processing time was not greatly affected by this step, and any windowing effects associated with frequency domain analysis were avoided. Figures 4.21, 4.22 and 4.23 show plots of the signal before and after filtering with Gaussians of size 4, 8 and 16 respectively. Notice the smoothing effect each of the filters has on the data.

Figure 4.21: Signal filtered with a Gaussian of Size 4

Figure 4.22: Signal filtered with a Gaussian of Size 8

Figure 4.23: Signal filtered with a Gaussian of Size 16

4.5 Data Acquisition and Testing of Filters

It was necessary to acquire two channels of data simultaneously in order to test the two microphone input filters (the RLS and LMS methods, and the PSS and Wiener methods that estimate the power spectrum of the noise from a second microphone). The data was acquired on site. The complete data acquisition hardware is shown in Figure 4.24.

Figure 4.24: Data Acquisition Hardware for Two Microphones

Two microphones, placed inside the cab of the excavator, were connected to two channels of a low-pass filter at 9 kHz, then sampled by an A/D converter on the IRONICS work station. These two channels were sampled every two microseconds and stored in buffers. However, the data was only stored in the data file every fifty microseconds (ie. at a sampling rate of 20 kHz), so the time delay between the two samples is insignificant.

In order to prompt the speaker to say the next word, a light inside the cab of the excavator was turned on. After about 1.5 seconds this light would shut off, at which point the speaker should be done saying the word. A playback feature was included so that the accuracy of the speech could be checked by listening to the data through a headset connected to the D/A and a LPF.

The subject was allowed 1.5 seconds to say each word. With a sampling rate of 20 kHz, each data file contained 29952 samples times 2 channels, for a total of 59904 samples.
The A/D has a resolution of 12 bits; therefore each data file contains 119808 bytes. Each data file contained alternating samples of noisy voice and noise data. These data files were stored on tape cartridges for transfer to the HP9050, where the filtering was done.

The placement of the microphones is a key consideration. Microphone 2, containing a reference pattern of the noise, must be located in such a way that it obtains a version of the noise well correlated with the noise in microphone 1. It should be placed far enough away from mike 1 that no speech is picked up by this microphone. In particular, it is important to be aware of any barrier which would cause reflection and interference (constructive or destructive) of n₂ differently than n₁. It was decided that the best location for mike 2 was attached to the roof of the cab, directly above the first microphone, which is attached to the operator's headset.

The frequency response of each microphone must be the same. A Shure close talking noise reduction SM10A microphone was already available; therefore a second SM10A was purchased. The response of both these microphones is shown in Figure 4.25.

Figure 4.25: Typical Frequency Response of Both Microphones (at 8 mm [5/16 in.], 50 to 15,000 Hz)

4.5.1 Correlation Between Microphones

Analysis of the data in the lab determined that there was indeed some type of reverberant effect between the two microphones, since the noise in the two microphones has different patterns, as can be seen in Figure 4.26. Both signals were acquired in a small enclosed environment (ie. the cab of the excavator) which acts as a reverberator. Due to the placement of the microphones, the reverberation effect for each microphone will be different. The reverberant effect should be reduced so that the outputs of the two microphones are correlated.

Figure 4.26: Digitized Microphone Signals

A reverberation can be thought of as a distortion of a signal causing the resultant signal to be the sum of a number of overlapped, delayed and decayed replicas. If s(n) is the signal, then the reverberant signal x(n) would be:

x(n) = s(n) + Σ_k a_k s(n − n_k)    (4.41)

which can be represented as x(n) = s(n) ⊛ p(n), where p(n) can be thought of as an impulse train:

p(n) = δ(n) + Σ_{k=1}^{M} a_k δ(n − n_k)    (4.42)

But within any type of reverberant environment the reverberation time (the time at which another delayed version is added to the signal) may not be constant. As well, there is a loss factor or decay involved with the reverberant signal. Therefore, the train of impulses may not be equally spaced, and the a_k will likely be decreasing.

The most commonly used method to separate signals that are convolved is homomorphic deconvolution; for a more detailed explanation of this process see reference [20]. The basic premise of homomorphic processing is to take two convolved signals and reduce them to additive signals, so that basic linear filtering can be performed. The canonic representation of such a system is shown in Figure 4.27.

Figure 4.27: Canonic Form of Homomorphic System

To reduce convolution to addition, the first step is to convert from the time domain to the frequency domain, thus reducing convolution to multiplication.
Multiplication can then be reduced to addition by taking the complex log of the Fourier transform of the input signal x(n). The D® system can be represented as shown in Figure 4.28.  Figure 4.28: D® System The signal i, at the output is sometimes termed the complex cepstrum. If x{n) is a speech signal then the cepstrum x(n), will be a signal representing the separated glottal and pitch components of speech. The glottal components will occupy the low end of the time scale whereas the pitch components will occupy the high end. If the signal is a reverberated noise signal then the cepstrum should contain impulses at the time that the echoes or reverberations occur. To illustrate, consider the signal y(n) which  73  Chapter 4. Noise Reduction Filters  is a reverberated version of x(n), namely y(n) = x(n) ®p(n). p(n) is the impulse train p(n) = S(n) + E t l l  1  -  then y(n) can be written as: M-l  y(n) = x(n) + £ a x(n - n ) Jb=l fc  t  (4.43)  The complex cepstrum of y(n) is: y(n) = x(n) + p(n)  (4.44)  =  \og(X(z)P(z))  (4.45)  =  log(X(*)) + log(P(*))  Since: Y(z)  Then: y(n)  =  F - ^ l o g X ^ J + F-^logP^)]  y(n)  = x(n)+p(n)  (4.46)  In this form it is possible to use a linear frequency invariantfilterto separate the signals x(n) and p(n). Characteristic of the system D$ must be considered. Thefirstand most important is that the complex log must be unique so that if X(z) = S(z)P(z) then: log[X{z)} = log[S(z)P(z)]  (4.47)  = log[S(z)}+log[P(z))  The first problem arises here. Since X(z) can be represented as: X(z) =  X (e' )+jX (e^) u  R  I  (4.48)  Chapter 4. Noise Reduction Filters  74  or equivalently as: X(z) = \X(e>»)\e s M iaz  (4.49)  x  then: X(z)  =  log[\X{e! )\er"* W]  =  u  (. )  x  4  50  log\X{e^)\+jaigX[z)  therefore: X (z)  =  log\X{e>")\  Xi{z)  =  argX(z)  R  (4.51)  XR(Z) is continuous and unique as long as X(z) has no zeroes on the unit circle. Xj(z) however, is generally noncontinuous and not unique since the value of aigXj(z) is usually determined by computer routines which return values in the range —7r to 7r. In order to remedy this fact the phase component of the z transform must be unwrapped to produce a continuous unique value for the phase curve. This is achieved by adding an appropriate multiple of 2n to the original value of A R G X ( c ) . ,w  Other important characteristics of the complex cepstrum are that X(z) must be a valid z-transform, and z(n) must be real. Therefore Xjt{t? ) must be an odd function u  of u. As well, X(e ) must be a periodic function of u with period 2n. ,u>  The complex cepstrum should give us some indication of the reverberant effect. To estimate where the reverberation will occur in the signal note that sound travels at a speed of approximately 331 m/sec depending upon the humidity. If the shortest distance of the microphone from any reverberant surface is 1 meter, then the time it takes for the original signal to return to the microphone is 3 ms. Of course, the noise  Chapter 4. Noise Reduction  75  Filters  tr tr «  TT -  Tr w  Figure 4.29: Wrapped and Unwrapped Phase source does not originate at the microphone so we must take into consideration the interference of the signal before the signal arrives at the microphone. To determine the reverberation based on a model of the cab and placement of the microphones would be a very difficult problem, since there are so many reflecting surfaces. 
The method used to determine where the reverberations occurred, with microphone two attached to the roof of the cab and microphone one worn by the operator, is as follows. Correlation between the impulse of the cepstrum (the lowest eight samples) and the rest of the cepstrum was performed for each microphone input. In ideal echoing conditions, spikes would be seen at the locations of the reverberations. Our signal is slightly more complicated, but it appears that there are spikes in microphone two approximately every 20 samples (which equals 1 ms). A difference was also calculated between the cepstrums of the two microphones; large differences occurred between the cepstrums at approximately 20 sample intervals. This suggests that microphone two has reverberant components at intervals of 20 samples which microphone one does not have.

If it is possible to adjust the noise signal so that it matches the noise in the speech signal, then the reverberation components that occur in one signal and not the other will be removed, but the reverberation components that occur in both will be left unfiltered.

Figure 4.30: Cepstrum of Mike 2

Microphone 2's cepstrum was filtered with a comb filter containing stop bands every 20 samples, each with a width of 5 samples. The filter l(n) (shown in Figure 4.31) was designed to remove the reverberant effect while leaving the speech signal undistorted. Microphone 1's cepstrum was left unfiltered.

Figure 4.31: Linear filter to remove reverberation

To recover the signal y(n) from ŷ(n) the inverse system D⊛⁻¹ is used; the D⊛⁻¹ system is shown in Figure 4.32. This system reconstructs the speech signal to its original form, minus the effects of reverberation. EXP is the complex exponential; this function has no uniqueness problem like the complex log. Also, y(n) and ŷ(n) are stable sequences, since both x(n) and x̂(n) are assumed to be stable. Since the complex cepstrum separates the glottal and pitch period components of speech, the inverse system D⊛⁻¹ basically reconstructs the speech by convolving the two components.

Figure 4.32: Inverse System D⊛⁻¹

Chapter 5

Testing of Noise Filters

5.1 Evaluation

The noise reduction filters were evaluated by three methods. A listening test was used as an initial estimate of how well a filter performs. Listening tests can be used to determine if the reconstructed filtered speech retains a "natural" sound and if the word sounds less noisy.

The second method of evaluation was the signal to noise ratio of the filtered speech as compared to the signal to noise ratio of the noisy speech. It is desirable that the level of the noise decrease and therefore that the SNR increase. The signal to noise ratio is given by the formula:

SNR = σ_s² / σ_n²    (5.52)

The following derivation shows how the SNR was determined, given that only σ²_{s+n} and σ_n² can be calculated. The only signals available for evaluation were s+n and n; therefore the measurements of σ²_{s+n} and σ_n² were determined with:

σ²_{s+n} = (1/(n−1)) Σ [ (s+n) − (s̄+n̄) ]²
         = (1/(n−1)) [ Σ (s − s̄)² + Σ (n − n̄)² + 2 Σ (s − s̄)(n − n̄) ]
         = σ_s² + σ_n² + (2/(n−1)) Σ (s − s̄)(n − n̄)    (5.53), (5.54)

But s̄ ≈ n̄ ≈ 0, since the D.C. offset has been removed. Therefore:

σ²_{s+n} = σ_s² + σ_n² + (2/(n−1)) Σ s n    (5.55)

Also, it is assumed that s and n are uncorrelated, therefore Σ s n ≈ 0. So:

σ²_{s+n} = σ_s² + σ_n²    (5.56)

Thus it is possible to calculate the SNR as:

SNR = σ_s²/σ_n² = σ²_{s+n}/σ_n² − 1    (5.57)

In decibels this is 10 log₁₀( σ²_{s+n}/σ_n² − 1 ).
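In C this measurement is only a few lines. The sketch below assumes the boundary between the noise-only segment and the speech-plus-noise segment has already been found (by the end point detector, as described next); the array and index names, and the synthetic test signal, are illustrative.

    #include <math.h>
    #include <stdio.h>

    /* Sample variance of x[a..b-1] with the mean (D.C. offset) removed. */
    double variance(const double x[], int a, int b)
    {
        double mean = 0.0, var = 0.0;
        int i;
        for (i = a; i < b; i++) mean += x[i];
        mean /= (b - a);
        for (i = a; i < b; i++) var += (x[i] - mean) * (x[i] - mean);
        return var / (b - a - 1);
    }

    /* SNR in dB, equation (5.57): 10 log10( var(s+n)/var(n) - 1 ). */
    double snr_db(const double x[], int word_start, int word_end)
    {
        double vn = variance(x, 0, word_start);          /* noise only   */
        double vsn = variance(x, word_start, word_end);  /* speech+noise */
        return 10.0 * log10(vsn / vn - 1.0);
    }

    int main(void)
    {
        double x[2000];
        int i;
        for (i = 0; i < 2000; i++) {
            double noise = 0.1 * sin(1.3 * i);
            x[i] = noise + (i >= 1000 ? sin(0.05 * i) : 0.0);
        }
        printf("SNR = %.1f dB\n", snr_db(x, 1000, 2000));
        return 0;
    }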
In order to find the part of the input data file that contains only noise and the part that contains speech plus noise, the end point detector was used. Once the boundaries were found, it was easy to calculate the variance within the noise-only segment of the file, and again within the signal-plus-noise segment. It was found that the average SNR is around 12 dB for signals recorded on site.

The SRU was tested with different levels of noise, using simulated noisy speech, to determine how much the recognition degrades with noise. The simulated version was obtained by adding the noise signal, at different gain levels, to the noise free speech files.

Noise Gain | % Recognition, Noise Type 1 | % Recognition, Noise Type 2 | Estimated SNR
No noise   |   —  | 90.6 | 31.7 dB
1          | 83.8 | 83.8 | 22.3 dB
2          | 78.8 | 81.3 | 16.6 dB
3          | 50.0 | 76.3 | 13.2 dB
4          | 47.5 | 53.8 | 10.7 dB

Table 5.6: Recognition When Noise Is Added At Different Gains

The true test of a noise reduction filter is the SRU's recognition rate. Even if the first two tests, listening and SNR, obtain poor results, it is possible that recognition by the SRU may increase. In the final analysis, therefore, what really counts is the increase in recognition.

There are a few reasons why the first two evaluation methods may obtain poor results while the SRU still achieves increased recognition rates. Certain noise reduction filters, such as PSS, can sometimes distort the speech components by making them sound flat. The resultant speech has an unnatural sound but still contains all the main speech components, minus the frequency components attributed to the noise; therefore the SRU is able to obtain accurate results. Similarly, the filtered speech can sometimes sound less noisy to human perception but obtain bad results on the SRU. Some methods, such as Gaussian smoothing, reduce the static sound of the noise but also dull or smooth the speech; the SRU then gets poor results, since it seems to depend upon the variable nature of speech to make a recognition. For other filters, such as the LMS filter, the SNR may remain about the same, but the filtered speech has enhanced fricative and plosive components.

5.2 Data Set

The data used to test the filters was collected on site in the cab of the caterpillar tractor. The operation of the tractor while recording the data was limited to two modes, since the data acquisition required the tractor to be tethered to an offboard computer. The conditions in which the data sets were recorded shall be labeled for further reference:

Mode 1: tractor in full power idle but not moving.
Mode 2: tractor moving the boom in some random fashion.

It is best for mode 2 to be random, not repetitious, in order to best test the transient nature of the filters. It was realized that this is perhaps not the most rigorous test of the filters; however, at this stage of the caterpillar tractor project it was the best that could be done, and it should give some indication of the effectiveness of the filters.

In order to compare recognition results of the SRU with noise filtered data to recognition results for noise free data, it was required that one of the original subjects be used. In our case the only available choice at the time was speaker EC.
The words used to test the filtering methods were the vowel sounds used in the phoneme based SRU tests (see Appendix A). The reasons for using these words were:

1. Noise free versions of the phonemes were readily available on tape in digitized form.

2. In order to have a basis for comparison of recognition results by the SRU between filtered speech and ideal conditions, the same set of words had to be used.

3. In order for the noise filters to be effective they must be able to remove noise from all possible phonemes.

4. Only vowels were used, and not the consonants, since consonants obtained poor recognition results even in ideal conditions. It would be difficult to determine any degradation due to noise, or improvement due to filtering, for consonant sounds. Also, a full test of filtering all phonemes would require too much time.

To obtain enough data to do some statistical analysis, it was decided that the subject would repeat each of the 16 words 5 times for each of the two modes of operation mentioned above, for a total of 160 data files. For the noise filters that need only one signal input (s+n), the data from Channel 1 can be used. For the noise filters that need an a priori ensemble average (Power Spectral Subtraction and Wiener filter with type 2 noise estimates), all the data collected over Channel 2 (the noise signal) can be used.

5.3 Improving Recognition By Manipulation of VTR6050

In addition to filtering the noise there are other methods for increasing the recognition rate of the SRU. Since the noise is changing, it is preferable to train the SRU with noise free speech and test it with filtered speech: we do not want any noise components in the reference template that will not be the same in all cases. The combination used for testing the filtered speech was therefore to train the SRU with one template of noise free speech and then test the recognition with the filtered speech.

It is possible to store more than one reference template in the SRU for each word. By training the SRU with two templates the recognition rate will increase. The first template contains a pattern of the noise free speech; the second template contains a reference pattern of the filtered speech. In this way, if the filtering process removes any part of the speech component (due to noise components in the same frequency range), then the SRU will have a pattern of these changes. As well, people tend to change the way in which they talk in a noisy environment: the pitch and duration are altered. It is advisable that a filtered version of the noisy speech be used as a template, so that any such changes in the speech will be represented.

5.4 Filter Test Results

The recognition rate of the SRU was only 42.5% for data sets recorded in Mode 1 type conditions. This rate of recognition was obtained by testing the SRU with the noisy speech (no filtering at all), trained with noise free speech. For the same test of the SRU when the data set was recorded in Mode 2 type noise, the recognition rate was only 37.5%. When a second template was trained, using the same type of noisy speech as the SRU was tested with, the recognition increased to 71.4% for Mode 1 noise and 70.0% for Mode 2 noise.

From previous tests it was determined that in an ideal (noise free) environment, using the same set of words, the same SRU and the same subject EC, an average recognition of 90.6%, misrecognition of 9.4% and a rejection rate of 0.00% was obtained.
These results were the best obtained for this particular user. In order to achieve these results, preprocessor routines of volume and time normalization had to be applied to the speech. (See Appendix A for more information.) The goal of the filtering methods is to reach this rate of recognition; anything higher would be desirable but possibly not obtainable.

A summary of the results of all the filtering methods is given in Tables 5.7 and 5.8; the complete results are shown in Appendix D. The first column lists the type of filtering applied to the noisy speech. The second column shows how the noise was estimated: for those filters requiring a power spectral estimate of the noise, the number refers to the noise estimate as defined in Section 4.2; for the LMS method the number refers to the value of μ, and for the RLS method to the value of λ. The third column, labeled "Trained with 1 template", gives the results obtained when the SRU was trained with only one noise free template. The fourth column, labeled "Trained with 2 templates", gives the results obtained when the SRU was trained with one template of noise free speech plus one template of the speech filtered with the same filtering method being tested.

The recognition results can be compared to the SNR for the best filtering methods (PSS, LMS and RLS). It can be seen that the amount of increase (or decrease) of the SNR does not necessarily correspond to the increase in recognition. For example, the recognition seems to be best when the noise is estimated a priori, followed by noise estimated during silence, then noise estimated from the second microphone; however, the maximum SNR increase occurs when the noise is estimated from the beginning of the signal. According to the SNR rating, the PSS method with noise estimated during silence obtains the greatest improvement, yet the best recognition is obtained for the PSS method when the noise is estimated a priori or from the second microphone.

Filter Type | Noise Estimate | % Recognized (1 template) | % Correct Runners Up (1 template) | % Recognized (2 templates) | % Correct Runners Up (2 templates)
None     | —       | 42.5 | 21.5 | 71.4 | 14.3
Wiener   | 1       | 20.0 | 10.0 | 45.0 | 23.8
Wiener   | 2       | 32.5 | 13.8 | 76.5 | 13.8
Wiener   | 3       | 12.5 | 8.75 | 14.1 | 10.9
P.S.S.   | 1       | 42.5 | 18.8 | 83.8 | 12.5
P.S.S.   | 2       | 53.8 | 15.0 | 87.5 | 8.75
P.S.S.   | 3       | 52.5 | 17.5 | 87.5 | 6.25
Gaussian | Var 4   | 36.7 | 23.4 | 73.0 | 12.5
Gaussian | Var 16  | 9.81 | 20.3 | 75.8 | 6.25
Gaussian | Var 64  | 3.75 | 17.2 | 70.3 | 12.5
Gaussian | Var 256 | 6.6  | 15.6 | 25.1 | 17.2
LPC      | —       | 48.6 | 19.3 | 60.1 | 24.6
LMS      | 0.005   | 51.3 | 18.8 | 80.0 | 13.8
RLS      | 0.995   | 50.0 | 16.3 | 80.0 | 15.0

Table 5.7: Results of Filtering With Mode 1 Noise
Filter Type | Noise Estimate | % Recognized (1 template) | % Correct Runners Up (1 template) | % Recognized (2 templates) | % Correct Runners Up (2 templates)
None     | —       | 37.5 | 15.0 | 70.0 | 18.8
Wiener   | 1       | 21.3 | 11.3 | 67.5 | 18.8
Wiener   | 2       | 51.3 | 8.75 | 80.0 | 15.0
Wiener   | 3       | 7.50 | 5.00 | 37.5 | 7.50
P.S.S.   | 1       | 55.0 | 7.50 | 77.5 | 17.5
P.S.S.   | 2       | 46.8 | 10.0 | 81.3 | 7.50
P.S.S.   | 3       | 41.3 | 16.3 | 73.8 | 15.0
Gaussian | Var 4   | 36.6 | 23.4 | 71.2 | 7.81
Gaussian | Var 16  | 9.81 | 20.3 | 63.3 | 10.9
Gaussian | Var 64  | 3.69 | 17.2 | 47.8 | 12.5
Gaussian | Var 256 | 9.25 | 15.6 | 64.6 | 17.2
LPC      | —       | 47.5 | 18.8 | 53.8 | 22.5
LMS      | 0.005   | 46.3 | 15.0 | 76.3 | 16.3
RLS      | 0.995   | 43.8 | 11.3 | 77.5 | 18.8

Table 5.8: Results of Filtering With Mode 2 Noise

Mode | SNR Before Filtering | PSS 1 | PSS 2 | PSS 3 | LMS  | RLS
1    | 9.815                | 11.55 | 9.175 | 10.03 | 10.54 | 9.975
2    | 13.50                | 18.28 | 13.56 | 13.60 | 13.82 | 13.58

Table 5.9: SNR Results for Filtering Methods

Chapter 6

Discussion

6.1 Vocabulary Selection

Table 6.10 shows the results for all 9 subjects when testing and retesting of the selected subset used the default distance score of 50.

SPEAKER | % Recognized (Before / After) | % Misrecognized (Before / After) | % Rejected (Before / After)
DG      | 97.3 / 98.6 | 0.000 / 0.40  | 2.71 / 1.00
HL      | 91.4 / 96.9 | 4.11 / 1.60   | 4.53 / 1.53
ML      | 91.4 / 95.7 | 4.36 / 0.730  | 4.29 / 3.60
RH      | 93.4 / 98.0 | 0.84 / 0.000  | 5.76 / 2.00
KC      | 93.5 / 97.6 | 4.65 / 1.40   | 1.82 / 1.00
WL      | 89.4 / 98.6 | 2.29 / 1.00   | 8.36 / 0.40
FN      | 89.0 / 93.5 | 3.24 / 0.400  | 7.71 / 6.07
PN      | 87.7 / 98.1 | 1.60 / 1.60   | 10.7 / 0.330
SD      | 90.0 / 93.5 | 5.87 / 0.000  | 4.13 / 6.47
AVERAGE | 91.5 / 96.7 | 3.00 / 0.790  | 5.56 / 2.49

Table 6.10: Test 1: Results of Selection When No Distance Score Is Used

The before column refers to the recognition rate for all of the words (all application words and their synonyms). The after column refers to the recognition rate after selection of the optimum subset has occurred. Test 1 was performed with a default distance score of 50 in both cases (before and after selection). Test 2 was performed using a default distance score of 50 for the initial tests, but this was replaced by the minimum distance score of the optimum subset for the retesting of the optimum subset (the after results).
The reason the tests were performed live was because the SRU often requires a second pronunciation of the word being tested. If the recognition was  Chapter 6. Discussion  89  "bad" (ie. the word has to be said louder or quieter) then a clearer pronunciation of the word was required. Since the tests were performed in the robotics lab there was often a small amount of background noises (printers, people coming and going, etc) that would cause the SRU to request a repronunciation of the word. As you can see from the results, the average recognition for all subjects increased on the second test. I believe that this is somewhat due to the subjects improving with practice. However, not all subjects were able to repeat the procedure the second time. The new subjects still obtained very low misrecognition rates. When investigating possible criterion for selecting a vocabulary of application words, it was initially thought that a phonetic based set of rules would be possible. As it turns out, it was difficult to come up with a fixed set of rules based upon misrecognition between phonetic components. The phonetic based tests on the SRU produced little, if any, consistencies across misrecognized pairs, for specific users or specific SRUs. There were a few problems recognizing certain phonetic sounds, such as IY and IE, but in general there were not enough consistencies between misrecognized pairs to obtain a complete set of rules. As well, when there were any changes to the pronunciation of the phonetic set of words, such as volume, intonation etc., then the misrecognition between phonemes was effected. The method proposed by this research, has a number of advantages. The first and most important is that the method is equally effective for any speech recognition unit and is not dependent upon the speaker. The reason being that a test is first performed to determine what words are recognized for the specific user and by the specific SRU. Based upon the results an accurate vocabulary can be selected. Since the method is word based there is no need to worry about the effects of phonemes placed together to form words.  Things such as coarticulation between  phonemes, or having the same phoneme in approximately the same location in two  Chapter 6. Discussion  90  different words, need not be considered. This method requires no knowledge by the user. Since the method allows the user to select any convenient set of words (along with their synonyms) for his/her application, and then automatically selects the best subset, any end user (with no knowledge of phonetics or speech recognition) can use this method. The method requires that each user train and test the pre-selected set of application words. Therefore, this approach makes it possible for users with accents or speech impediments to get equally good recognition rates. A disadvantage of the method would be that for certain applications it may be difficult to determine more than one word for a certain command. An inconvenience of the method is that each operator is required to test the set of application words. Therefore there is a preprocessing time involved (which could be up to an hour, depending upon the size of the vocabulary) before a usable set of words is produced.  6.2  E n d Point Detector  The results show that the EPD is about 92.5% accurate (it failed 12 times out of 160 tests). The result is based upon the number of times it could not find the beginning or end of the word. 
However, if a detection occured at all then the correct boundaries were found. The boundary detection was verified by writing the signal between the end points to a file, setting surrounding points to silence (zero), and listening to the resulting files. On occasion, part of a trailing consonant would be cut short (if it continued on for a long time after the threshold for the AVS was found). The values of the thresholds for the ZC rate and the AVS are dependent upon the SNR of the speech files. If the SNR is very small the end point detector is not as accurate, which is only reasonable since more noise makes it harder to differentiate the  Chapter 6. Discussion  91  word from the noise. In general the EPD is dependable. If on occasion it misses the boundaries of the word the consequence is that the speaker will have to repeat the word.  6.3  Filters  6.3.1  Analysis of the Estimate of the Power Spectrum  For the filters that required a knowledge of the power spectrum of the noise, namely the Wiener filter and the Power Spectral Subtraction filter, three methods were used to determine the power spectrum. The first method utilized the EPD to determine the noise power spectrum in the part of the signal not containing the speech. This method is effective if the noise stays the same during the time the noise is estimated and the word is spoken. The advantage of this method is that the noise and speech are recorded through the same microphone therefore, there is no problem with magnitude matching between microphones. The disadvantage of the method is any problem that might occur if the noise changes during the time that the word is being spoken. In this case the estimate of the noise becomes inaccurate and the filtering is not effective. Also, the method is very dependent upon the accuracy of the EPD. The second method, using a prior estimate of the noise, is good at getting rid of any stationary noise such as the idling sound. However, the method misses any clanging transient noise. Since this method uses afixedfilter(either Wiener or PSS) the weights of the filter must be determined before hand. In our case two noise environments were considered, as mentioned in Section 5.2. Therefore, a disadvantage of this method is that the SRU has to determine what noise is in the background and then select the filter based upon the background noise.  Chapter 6. Discussion  92  Another point to consider when estimating the PS of the noise a priori is the magnitude of the estimate. The ensemble average tends to give a fairly accurate estimate of the magnitude. However, problems can occur when the average magnitude of the PS is too small for the current speech window, so that the filtering method doesn't remove enough of the noise components. Alternatively, the magnitude could be too large and therefore distort the speech slightly by removing too much of the frequency components of the speech. The third method, of estimating the PS continuously from a second microphone input, has the problem of matching the magnitude between microphones. The method suggested for estimating the magnitude difference by calculating a scaling value in the first window of the signal is not a very sensitive method. As the results indicate, the filtering is very sensitive to the magnitude of the estimate. The magnitude difference between signals appears to change, therefore the second microphone estimate does not always obtain good results due to sensitivity of magnitude adjustments.  
6.3.2  Wiener Filter  Since Wiener filtering is a magnitude only filter (zero phase), it is sensitive to the magnitude gain between the noise estimate power spectrum and the noisy speech power spectrum. If the magnitude of the estimate of the noise spectrum is not exactly matched to the magnitude of the noise in the noisy speech thenfilteringcould remove significant components of the speech. One of the weaknesses of the Wiener filter is that it requires some knowledge of the signal being filtered. In this case the signal is speech, which is a complicated non-stationary broadband signal. The assumption made about the power spectrum of speech (ie. P,(w) = 1 for all u ) over simplifies the signal and possibly leads to less accurate results for this method.  Chapter 6. Discussion  93  Wiener filtering assumes that speech is a stationary signal and that the mean squared error is a good error criterion to use. However, it has been shown that the mean square error, which is the error criterion on which Wiener Filtering is based, is not strongly correlated with perception and therefore not a particularly effective error criterion to apply to speech processing.  6.3.3  Power Spectral Subtraction  The PSS method is also a magnitude only filter and hence is sensitive to the magnitude gains between the noise estimate and the actual signal. The PSS has an advantage over the Wiener filter in that it does not require an initial estimate of the signal being filtered. Instead, the PSS method, assumes that the filtered output speech is equal to the power spectrum of the speech in the input signal. This can be shown by starting with the Wiener filter, namely: P {u) t  =  (6.58)  H{u)Y{u)  P.(w) P.(w) +P„(w) (6.59) By setting P,(w) equal to P,(w) then:  A P,(w) + P „ ( w )  P.(«) + P»(W) = P„H P.(w)  =  P„(w)-P„(u;) (6.61)  This is still an estimate of the speech components, but it is slightly more accurate then the estimate made for the Wiener filter.  Chapter 6. Discussion  94  The major problem with the last two methods, frequency weighting (Wiener) and frequency subtraction (PSS), for enhancement for speech recognition is that even though they may increase the SNR they tend to smear or blur some of the frequencies components. One of the problems with F F T methods is that they average frequencies over the intervals (in our case each interval was 39.06 Hz) and therefore may not have high enough resolution.  6.3.4  Gaussian Smoothing  As the name suggests, this filter smooths speech to such an extent that the filtered speech becomes dull or is "blurred". This effect is not desirable for speech recognition since the SRU seems to depend somewhat upon the changing nature of speech to make a recognition. This can be verified by examining the effect of recognition as the Gaussian size becomes larger. As the Gaussian size increases the filtered speech becomes more smoothed or dulled. Since speech components are being averaged together the recognition rate decreases dramatically. We can conclude that the ideal noise reduction filter for speech recognition should remove all of the noise but not effect the speech components. It is difficult to both filter noise and enhance the speech components using a Wiener, PSS or Gaussian, since the speech components overlap the noise components.  6.3.5  Linear Predictor Corrector  The best recognition obtained with the LPC filtering method was only 60.1%. Tests show that the LPC method is very good at tracking the signal, however it is not as good at removing the noise. 
The model of the speech used for the LPC method perhaps is not complex enough to determine in all cases, the difference between the noise and the speech. The more specifically we attempt to model the speech the more potential there  Chapter 6. Discussion  95  is for removing it from the background noise at the same time however, the system becomes more sensitive to the inaccuracies or deviations from the model. In order to improve this filtering method more work could be done to obtain a better, more complicted model of speech.  6.3.6  Adaptive Filtering  Both the LMS and the RLS noise reduction filters are very dependent upon obtaining an accurate reference of the noise. The noise in the second microphone may be a scaled or time delayed version of the noise in the first microphone and accurate results will still be obtained. However, in our case, we had to contend with reverberation between signals due to the closed environment that both signals were collected in. The suggested method for reducing these effects, namely homomorphic processing, works fairly well. A disadvantage of using homomorphic processing to remove reverberation between microphones is that it is very dependent upon the positioning of the microphones. The position of the reverberation in the cepstrum will depend upon how far from a reverberating surface the microphone is. Therefore, upon initial installation the system will have to be calibrated and the second microphone should befixedin a given position. In our case, there was a significant component of reverberation at intervals of 20 samples in microphone 2 that did not appear in microphone 1. Twenty sample points (with a sampling rate of 20 kHz) corresponds to 33 centimeters if we assume that sound travels at 330 m/sec. This indicates that the roof of the cab is causing the main reverberation in the second microphone. The second disadvantage of this method is that the operator is wearing the first microphone and since his head is not stationary, it is not possible to remove all differences between microphones. The initial results however, have shown that a lot of the differences between the two microphones can be removed.  Chapter 6. Discussion  96  Our adaptivefilteringmethods assume that there are no speech components in the second microphone. If there are any speech components in the second microphone then some of the speech will be removed in the filtered output. Previous tests of noise reductionfiltershave mainly been in the cockpits offighterplanes. The primary microphone can be placed inside the pilot's helmet thereby isolating any speech from the second microphone, which is attached to the outside of the helmet. This solution is not feasible for our purposes since the operator does not wear any kind of inclosed helmet. In our tests some speech was detected in the second microphone, but not enough to drastically effect the filtered speech.  6.3.7 L M S The Least Mean Square Adaptive noise reduction filter has the advantage that it is very fast. For each sample point the LMS method does 2N multiplication and N - l additions and subtractions, where N is the number of weights in the filter. In our case 16 filter weights were used. A disadvantage of the LMS filter is that it takes longer to converge on the correct filter weights. Because of this slow convergence, the LMS is not good at removing non-stationary noise, such as the clanging noises that might be encountered. 
The LMS method bases it weights upon the statistics for a group of noise signals not the exact data obtained at time n, therefore the transient noise sounds will not be filtered out. In our limited tests however, the LMS still produced quite good results (80.0 % for noise Mode 1 and 76.25% for noise Mode 2). Based on experimentation it was found that the best value for /x, the convergence factor, was 0.005. At any higher value the filter tends to become unstable. At any lower value the convergence of thefilterweights takes too long and therefore, not much of the noise is reduced.  Chapter 6.  Discussion  97  6.3.8 RLS The Recursive Least Squares Adaptive noise reduction filter has the advantage that the filter weights can converge very quickly and therefore remove any clanging, changing non-stationary sounds. As the results indicate RLSfilteringobtained slightly worse recognition for Mode 1 noise (full power idle) but slightly better results for Mode 2 noise as compared to the LMS method. This can be attributed to the ability of the RLS filter to quickly adapt to the changing noise signal. Since RLS filtering bases its statistics upon the exact data obtained, the method is more dependent on an exact second microphone input. The pattern of noise received from the second microphone (strictly noise) is not exactly the same as the noise in the first microphone. When initial tests were performed on the RLS and LMS method using simulated noisy speech (ie. speech added to noise recorded separately), the RLS filtering method obtained very clean noise free speech, whereas the LMS filter had a residue of noise left in the output filtered speech. Therefore, we can conclude that the accuracy of the RLS method is directly effect by the accuracy of the reference template of noise. The number of operations for each sample point that is filtered is very high. The RLS method requires 4iV + 4N multiplication or divisions and SN + N — 1 additions 2  2  or subtractions. As with LMSfilteringthe implemented RLS filter has 16filterweights.  6.4  Comparison of Filters  Chapter 6. Discussion  Characteristics Wiener Requires An Yes A Priori PSE of Speech ?  No  PSS  Gaussian No  No  IPC  Requires An A Priori PSE of Noise? No. of Mikes No. of  Yes  Yes  No  No  1/2  1/2  1  1  2 FFTs  2 FFTs  Depends on  Depends upon  Operations  2 x for filter  2 x for filter  Size of Window  2 x for window 1 +  2x for window 1+  -either 2,4,8  Yes  Yes  Yes  order of extrapolation and whether sample needs to be corrected Yes  Frequency  Frequency  Time  Time  Yes  No  Yes  Yes  Assumes Speech is  Assumes that speech is highly correlated between samples  Works Better For Stationary Noise? Implemented in Frequency or Time Domain Requires An A Priori Guess of the Characteristics of Speech?  Estimate of the PS of speech  or 16  slowly varying system with harmonics  Chapter 6. Discussion  Wiener Characteristics YesRequires An A Priori PS of noise Guess of the Noise?  PSS Yes-  Gaussian Yes-  LPC Yes-  PS of noise  Noise is transient more quickly changing then speech  Noise is any derivation from the model of speech  Problems With Reverberation  No/Yes  No  No  No/Yes  100  Chapter 6. Discussion  Requires A Priori PSE of Noise? No. of Mikes No. of Operations  No  No  No  2  2  4N  3N  A  Works Better For Stationary Noise? Operations in Frequency  LMS  RLS  Characteristics Requires A No Priori PSE of Speech?  a  + 4JV  +N -  2N xor-r N-l  +or-  Time  Yes - over short time. No -over long time. 
Time  No  No  Requires An A Priori Guess of the Noise?  No  Problems With Reverberation  Yes  Yes -assumes noise is stationary over short time intervals Yes  or Time Domain? Requires An A Priori Guess of the Characteristics of Speech?  No  1  xor -5+or-  Table 6.12: Comparison Between Filters  Chapter 7 Conclusions and Recommendations  To analyze the possibility of using a Speech Recognition Unit in a noisy environment this thesis studied methods of noise reduction and robust vocabulary selection. The use of SRUs until this time has been in close to ideal conditions. The addition of loud clanging background noise significantly degrades the operation of the SRU. There were two problems to overcome to make automated speech recognition feasible. The first problem is the background noise; the second is the inherent inadequacies of the particular SRU used. Findings show that it is possible to increase recognition rates by using the vocabulary selection process outlined in this thesis. An acceptable level of recognition is achieved if the operator can use speech as the controller without experiencing distractions due to misrecognition of words. This is accomplished if recognition is in the range of 97-100% and if misrecognition is 0%. The final results for the vocabulary selection process achieved 97.5% recognition and 0.25% misrecognition. This process is an unique approach to increasing recognition. No previous work has been done to determine ways to increase the recognition of a given SRU by selecting appropriate application words. The advantage of the vocabulary selection process is that it works equally well for any SRU or speaker since it takes a heuristic approach. The results indicate that the three most promising noise reduction filters are PSS, RLS and LMS. Even though the PSS method produces the best results for our tests it is believed that the RLS filter will achieve the best results as the noise degrades the 101  Chapter 7. Conclusions and Recommendations  102  speech further due to increased complexities of the noise environment. The PSS filter was applied to noisy speech recorded in a real environment with a non-stationary broadband noise source. The results indicate that byfilteringthe noisy speech with the PSS method it is possible to increase the recognition almost to the level of accuracy obtained by the SRU in ideal conditions (87.5% as compared to 90.25%). Previously the adaptivefilteringmethods (RLS and LMS) have been applied only to simulated noisy data. However, our tests of the adaptive filters used data recorded in a real environment, therefore the second microphone contains a pattern of a reverberant diffuse noise field. It was necessary to apply homomorphic processing to equalize the two channels of noise data. The homomorphic method was able to remove some of the reverberations in the reference input not contained in the primary input. The results show an increase of 8.6% for both the RLS and LMS filter over the noisy speech if the reverberations in the reference input are first removed. Homomorphic processing is some what successful in our case since the second microphone is in a fixed position and there is a known enclosure. However, it is clear that the differences between the microphones are not due to a single reverberation. Further research should be undertaken to determine a more flexible noise channel equalizer. Possible areas to explore include magnitude and phase matched microphone pairs or multiple noise sensors. 
A feature included in some of the filters - the end point detector - proved to be a reliable method for determining where the speech starts and stops. The improvement to this EPD over previously designed EPDs is the method of determining the AVS and ZC threshold. Both are based upon a Gaussian distribution score, therefore the thresholds depend upon windows of data and not just single sample points. It is less likely that a short spike of background noise will trigger this EPD. Further work needs to be done in the area of hardware implementation of the noise  Chapter 7. Conclusions and Recommendations  103  reduction filter. For eventual implementation of speech recognition in the excavator, noise reduction must be done in real time. Therefore, hardware filters are necessary. This research has shown that it is indeed possible to use speech as an operator interface in a noisy environment if the two proposed preprocessors are employed. Filtering with a PSS filter overcomes the problem of degradation of the speech due to environmental noises. The vocabulary selection process transcends any inadequacies of the SRU. Therefore, these two processors together, will make speech recognition feasible for use in an excavator environment.  References  [1] S.Thomas Alexander, Adaptive Signal Processing, Springer-Verlag Inc., New York, 1986. [2] S.T. Alexander, "Adaptive Reduction of Interfering Speaker Noise Using the Least Mean Squares Algorithm", ICASSP '85, vol. 2, pp. 728-731. [3] T. Ariki, K. Kajimoto, T . Sakai, "Acoustic Noise Reduction by Two Dimensional Spectral Smoothing and Spectral Amplitude Transformation", ICASSP '86, vol. 1, pp. 97-100. [4] Janet M . Baker, David F. Pinto, "Optimum and Suboptimum Strategies For Automatic Speech Recognition In Noise, and The Effect of Adaptation On Performance", IEEE 1986 ASSP, Vol. 1, pp 745-748, 1986. [5] R. Billi, G. Mossia, F.Nesti, "Word Preselection for Large Vocabulary Speech Recognition", ICASSP 1986, vol 1, pp 65-69. [6] Steven F. Boll, "Suppresion of Acousitc Noise in Speech Using Spectral Subtraction", IEEE Trans. Acoust., Speech and Signal Processing, Vol ASSP-27, No. 2, pp.113-120, April 1979. [7] P. Darlington, Wheeler, Powell, "Adaptive Noise Reduction in Aircraft Communication Systems", ICASSP '85. [8] G.A. Powell, P. Darlington, P.D. Wheeler, "Practical Adaptive Noise Reduction In  104  References  105  The Aircraft Cockpit Environment", Proceedings ICASSP '87IEEE 1987 International Conference on Accoustics, Speech and Signal Processing, Vol 1, pp 173-176, April 1987. [9] Johannes P.C. DeWeerd, "Facts and Fancies About A Posteriori "Wiener" Filtering", IEEE Transactions on Biomedical Engineering, Vol. BME-28 No. 3, March 1981 pp 252-257. [10] Y . Ephrain, D. Malsh, "Combined Enhancement and Adaptive Transform Coding of Noisy Speech", IEE Proceedings F - Communications Radar and Signal Processing, Vol. 133, pp 81-86, 1986. [11] C.W.K. Gritton, D.W. Lim, "Echo Cancellation Algorithms", IEEE ASS Magazine, April 1984, pp 30-37. [12] Conrad J . Hermond Jr., Engineering Acoustics and Noise Control, Prentice-Hall Inc., Englewood Cliffs, New Jersey, 1983. [13] William A. Harrison, Jae. S. Lim, Elliot Singer, "Adaptive Noise Cancellation in a Fighter Cockpit Environment", Proc. 1984 IEEE International Conference on Acoustics, Speech and Signal Processing, March 1984, p.l8A.4.1 - 18A.4.4. [14] J.W. Kim, C.K. 
Un, "Enhancement of Noisy Speech by Backward/Forward Adaptive Digital Filtering", Proceedings of 1984 IEEE International Conference ASSP, April 1984, pp. 89-92. [15] Jae. S. Lim, Alan V . Oppenhiem, "Enhancement and Bandwidth Compression of Noisy Speech", Proceedings of the IEEE, Vol. 67, No. 12, December 1979, pp 1586-1604. [16] Jae. S. Lim "Speech Enhancement", ICASSP '86 vol. 4, pp 3135-3142.  References  106  [17] Stephen E . Levinson, "Structural Methods in Automatic Speech Recognition", Proceedings of the IEEE, November 1985. [18] B. Patrick Landell, Robert E . Wohlford, Lawerence G. Bakler, "Improved Speech Recognition in Noise", IEEE 1986 ASSP, Vol. 1, pp 749-751. [19] Steven L. Martin, "Wave of Advances Carry DSPs to New Horizons", Computer Design, September 15,1987, pp 69-83. [20] Alan V . Oppenheim, Ronald W. Schafer, Digital Signal Processing, Prentice-Hall Inc., Englewood Cliffs, New Jersey, 1975. [21] A.V. Oppenheim, J.S. Lim, "The Importance of Phase in Signals", Proc. IEEE, vol. 69, pp. 529-544, May 1981. [22] K.K. Palival, "Speech Enhancement Using Multi-Pulse Exicited Linear Prediction System", ICASSP '86, Tokyo. [23] K.K. Paliwal, Anjan Basu, "A Speech Enhancement Method Based On Kalman Filtering", Proceedings ICASSP '87 IEEE 1987 International Conference on Accoustics, Speech and Signal Processing, Vol 1, pp 177-180, April 1987. [24] J.M. Pardo, "On the Determination of Speech Boundaries: A Tool for Providing Anchor Time Points In Speech Recognition", ICASSP '86, pp. 2267, vol. 3. [25] D.B. Pisoni, R.H. Bernacki, "Some Acoustic-Phonetic Correlates of Speech Produced in Noise", ICASSP '85, Vol. 4, pp. 1581-1584. [26] L.R. Rabiner, and R.W. Schafer, Digital Processing of Speech Signals, PrenticeHall Inc., Englewood Cliffs, New Jersey, 1978.  References  107  [27] L.R. Rabiner, M.R. Sambur, "An Algorithm for Determining the Endpoints of Isolated Utteranaces", Bell Syst. J., Vol. 54., No. 1, pp. 297-315, February 1975. [28] Jeffrey J . Rodriquez, Jae S. Lim, "Adaptive Noise Reduction In Aircraft Communication Systems", Proceedings ICASSP '87 IEEE 1987 International Conference on Accoustics, Speech and Signal Processing, Vol 1, pp 169-172, April 1987. [29] Ronald W. Schafer, "Echo Removal By Discrete Generalized Linear Filtering", Technical Report 466, Research Laboratory of Electronics, MIT, Cambridge, Massachusetts, February 28, 1969. [30] Sondhi and Berkley, "Silencing Echoes on Telephones", Proceedings of the IEEE, Vol. 68, No. 8, August 1980, pp 948-963. [31] Steven Tretter, "Estimating the Frequency of a Noisy Sinusoid by Linear Regression", Trans, on Info. Theory, Vol. 31, pp. 832-835, 1985. [32] Viswanathan, Henry and Deri, "Noise Immune Speech Transduction Using Multiple Sensors", ICASSP '85, pp. 712-715. [33] Hans Wasmeier, Preprocessing Algorithms, M.A.Sc. Thesis, Department of Electrical Engineering, University of British Columbia, Vancouver, B.C., 1986. [34] R.G. White, J.G. Walker, Noise and Vibration, John Wiley and Sons Inc., Rexdale, Ontario, 1982. [35] Bernard Widrow, Samuel D. Stearns, Adaptive Signal Processing, Prentice-Hall Inc., Englewood Cliffs, New Jersey, Alan V . Oppenheim, Series Editor, 1983. [36] Bernard Widrow, et al., "Adaptive Noise Cancelling: Principles and Applications", Proceedings of the IEEE, Vol. 63, No. 12, December 1975, pp 1692-1716.  References  108  [37] I.H. Witten, Principles of Computer Speech, Academic Press, Toronto, 1982. 
[38] "Signal Processing for a Cocktail Party Effect", The Journal of the Acoustical Society of America, vol. 50, pp. 656-660 August 1971. [39] ASI Robotics Symposium, Canada Trade and Convention Centre, Vancouver, BC, Canada, February 25,26, 1988.  Appendix A  Previous Work in Speech Recognition at U B C  Previous work in the area of speech recognition at UBC was completed by Hans Wasmeier. His work involved setting up test and preprocessing algorithms to evaluate and improve speech recognition units. The words used to evaluate the filtering methods were taken from this work. The vocabulary was created by embedding each phoneme in the English language in a carrier word consisting of two static phonemes. For the vowels the test words were of the form consonant-vowel-consonant. The consonant sounds were of two forms: consonant-vowel-consonant and vowel-consonant-vowel. The following tables list the test words selected.  109  Appendix A. Previous Work in Speech Recognition at UBC  Phoneme Type Vowels  Sub-Group Reference Number Front  Mid  Back Diphthongs  0 1 2 3 4 5 6 7 8 9  10 11 12 13 14 15  Symbol for Example Phoneme IY I E AE A ER UH OW OO u Al OI AU EI ou JU  beat bit bet bat hot bird but bought boot foot buy boy how bay boat you  Table A.13: Vowels and Diphthong Phonemes  110  Test Word beam bim bem bam bomb berm bum balm boom buum bime boym baum bame boam bume  Appendix A. Previous Work in Speech Recognition at UBC  Phoneme Type  Sub-Group Reference Number  SemiVowels  Liquids  Glides Consonants Nasals  Stops (Voiced)  Stops (Unvoiced)  Whisper Affricative Fricatives (Voiced)  Fricatives (Unvoiced)  111  Symbol for Example Test Word Test Word (Group 1) (Group 2) Phoneme  0  W  wit  wem  awa  1 2 3 4 5 6 7  L R Y M N NG B  let rent you met net sing bet  lem rem yem mem nem ngem bem  ala ara aya ama ana anga aba  8 9 10  D G P  debt get pet  dem gem pern  ada aga apa  11 12 13  T K H  ten kit hat  tern kem hem  ata aka aha  14 15 16  DZH TSH V  judge church vat  gem chem vem  adja acha ava  17 18 19 20  TH Z ZH F  that zoo azure fat  them zem jhem fern  a-the aza ajha afa  21 22 23  THE S SH  thing sat shut  them sem shem  atha asa asha  Table A.14: Semi-Vowel and Consonant Phonemes  Appendix  A. Previous  Work in Speech Recognition  at UBC  112  As well as testing the different phonemes, there are a number of ways that each word can be said. The following variations in pronunciation were tested: Test 1. Vowels said normally Test 2. Vowels with mike moved Test 3. Vowels said slowly Test 4. Vowels said quickly Test 5. Vowels said with interrogative intonation Test 6. Consonants (Group l) Test 7. Consonants (Group 2) In order to improve the recognition rate of the SRU, a number of preprocessing algorithms were tested. The preprocessing algorithms were: Mode 0: Normal speech - unprocessed Mode 1: Time Normalized Mode 2: The Use of Two Reference Templates Mode 3: Nonlinear Volume Normalization Complete recognition results are listed in the following tables for all changes in variation and preprocessing algorithms and for both of the SRUs ( NEC SR100 and V O T A N VTR6050). The confusion matrices are also listed in Tables A.24 to A.30 and A.38 to A.44. Note that these confusion matrices aren't given in percentages but as the number of times that the word was confused.  Appendix A. 
Previous Work in Speech Recognition at UBC  A.l  113  Recognition Results for the VOTAN VTR6050  SUBJECT  MODE  DR  0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3  EC  HW  JR  RF  AVERAGE  %  CORRECT 76.6 90.6 90.6 93.8 73.4 85.9 87.5 90.6 71.9 82.8 84.4 85.9 85.9 82.8 92.2 89.1 90.6 93.8 93.8 95.3 79.7 87.2 89.7 90.9  % MISRECOGNIZED 12.5 9.38 7.81 6.25 20.3 14.1 10.9 9.38 15.6 12.5 10.9 12.5 14.1 17.2 7.81 10.9 7.81 6.25 6.25 4.69 14.1 11.9 8.75 8.75  %  REJECTED 10.9 0.00 1.56 0.00 6.25 0.00 1.56 0.00 12.5 4.69 4.69 1.56 0.00 0.00 0.00 0.00 1.56 0.00 0.00 0.00 6.25 0.938 1.56 0.312  Table A.15: Statistics for Test 1 - Vowels Said Normally  % CORRECT RUNNERS UP 7.81 9.38 4.69 6.25 15.6 9.38 10.9 7.81 10.9 6.25 6.25 7.81 9.38 12.5 6.25 7.81 6.25 6.25 4.69 1.56 10.0 8.75 6.56 6.25  Appendix A. Previous Work in Speech Recognition at UBC  MODE % CORRECT  SUBJECT DR  EC  HW  JR  RF  AVERAGE  0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3  81.3 89.6 91.7 93.8 59.4 64.1 70.3 70.3 65.6 79.7 76.6 70.3 79.7 84.4 81.3 82.8 92.2 87.5 93.8 87.5 75.6 80.6 82.2 80.9  114  % MISRE- % REJECTED % CORRECT RUNNERS UP COGNIZED 6.25 9.38 9.37 0.000 8.33 10.42 8.33 0.000 8.33 0.000 6.25 6.25 6.25 20.3 34.4 18.8 34.4 1.5 0.000 15.6 29.7 1.56 14.1 28.1 17.2 14.1 20.3 17.2 20.3 0.000 20.3 0.000 23.4 21.9 1.56 28.1 7.81 17.2 3.13 9.38 15.6 0.000 1.56 10.9 17.2 10.9 0.000 17.2 1.56 3.12 4.69 0.000 12.5 12.5 4.67 6.25 0.000 0.000 6.25 12.5 10.6 17.2 7.2 0.329 13.5 19.1 0.329 12.2 17.4 11.8 18.4 0.658  Table A.16: Statistics for Test 2 - Vowels With Mike Moved  Appendix A. Previous Work in Speech Recognition at UBC  SUBJECT DR  EC  HW  JR  RF  AVERAGE  MODE % CORRECT 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3  84.4 89.1 90.6 90.6 65.6 78.1 78.1 78.1 65.6 70.3 73.4 67.2 79.7 84.4 84.4 85.9 87.5 95.3 92.2 89.1 76.6 83.4 83.8 82.2  115  % MISRE- % REJECTED % CORRECT RUNNERS UP COGNIZED 6 . 2 0 7.81 9.40 1.56 6.25 9.38 7.81 1.56 7.81 1.56 7.81 7.81 29.7 4.67 20.3 0 . 0 0 0 21.9 15.6 21.9 0.000 15.6 0.000 21.9 12.5 10.9 15.6 23.4 25.0 4.69 18.8 21.9 4.69 12.5 26.6 6.25 15.6 3.13 17.2 14.1 0.000 15.6 14.1 15.6 0.000 15.6 0.000 14.1 12.5 9.40 3.13 7.81 0.000 4.69 4.69 7.81 0.000 3.13 10.9 0.000 7.81 17.8 5.63 13.1 15.3 1.25 11.9 1.25 10.9 15.0 16.3 1.56 11.3  Table A.17: Statistics for Test 3 - Vowels Said Slowly  Appendix A. Previous Work in Speech Recognition at UBC  SUBJECT  DR  EC  HW  JR  RF  AVERAGE  MODE  0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3  %  CORRECT  64.1 92.2 87.5 95.3 75.0 75.0 81.3 78.1 65.6 71.9 81.3 82.8 70.3 90.6 90.6 87.5 79.7 89.1 92.2 92.2 70.9 83.8 86.6 87.2  % MISRECOGNIZED  17.2 7.81 12.5 4.67 18.8 25.0 18.8 21.9 28.1 26.6 17.2 14.1 23.4 9.38 9.38 12.5 17.2 10.9 7.81 7.81 20.9 15.9 13.1 12.2  %  REJECTED  116  % CORRECT RUNNERS UP  18.8 0.000 0.000 0.000 6.25 0.000 0.000 0.000 6.25 1.56 1.64 3.13 6.25 0.000 0.000 0.000 3.13 0.000 0.000 0.000 8.13 0.312 0.312 0.625  Table A.18: Statistics for Test 4 - Vowels Said Quickly  10.9 7.81 12.5 3.13 14.1 21.9 12.5 18.8 14.1 21.9 10.9 9.38 14.1 6.25 3.13 9.38 12.5 9.38 1.56 1.56 13.1 13.4 8.13 8.44  Appendix A. 
Previous Work in Speech Recognition at UBC  SUBJECT DR  EC  HW  JR  RF  AVERAGE  MODE % CORRECT 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3  70.3 79.7 81.3 84.4 62.5 81.3 75.0 68.8 57.8 60.9 65.6 67.2 76.6 85.9 79.7 79.7 81.3 87.5 85.9 81.3 69.7 79.1 77.5 76.3  117  % MISRE- % REJECTED % CORRECT RUNNERS UP COGNIZED 21.9 7.81 20.3 20.3 0.000 17.2 18.8 18.8 0.000 0.000 15.6 15.6 34.4 3.13 20.3 0.000 15.6 18.8 17.2 25.0 0.000 0.000 23.5 31.3 9.42 15.6 32.8 6.25 17.2 32.8 6.25 17.2 28.1 20.3 26.6 6.25 17.2 6.25 9.38 0.000 10.9 14.1 0.000 12.5 20.3 0.000 12.5 20.3 15.6 3.12 9.38 7.81 12.5 0.000 0.000 7.81 14.1 18.8 0.000 12.5 24.4 5.94 15.0 19.7 1.25 13.8 1.25 14.7 21.3 1.25 16.9 22.5  Table A.19: Statistics for Test 5 - Vowels with Interrogative Intonation  Appendix A. Previous Work in Speech Recognition at UBC  SUBJECT DR  EC  HW  JR  RF  AVERAGE  MODE % CORRECT 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3  59.4 61.5 70.8 71.9 57.3 57.3 74.0 77.1 57.3 55.2 67.7 70.8 59.4 52.1 68.8 69.8 59.4 57.3 74.0 81.2 58.5 42.8 71.0 74.2  % MISRE- % REJECTED COGNIZED 36.5 4.17 0.000 38.5 0.000 29.2 0.000 28.1 41.7 1.04 42.7 0.000 26.0 0.000 22.9 0.000 37.5 5.21 41.7 3.13 28.1 4.17 27.1 2.08 38.5 2.08 47.9 0.000 31.2 0.000 30.2 0.000 40.6 0.000 42.7 0.000 26.0 0.000 18.8 0.000 39.0 2.50 0.556 57.7 28.1 0.833 25.4 0.417  118  % CORRECT RUNNERS UP 13.5 13.5 8.33 10.4 14.6 10.4 10.4 9.42 18.8 13.5 12.5 10.4 12.5 15.6 12.5 12.5 14.6 16.7 10.4 5.23 14.8 18.6 10.8 9.58  Table A.20: Statistics for Test 6 - Consonants (Group 1)  119  Appendix A. Previous Work in Speech Recognition at UBC  SUBJECT DR  EC  HW  JR  RF  AVERAGE  MODE % CORRECT 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3  65.6 76.0 89.6 92.7 66.7 74.0 85.4 84.4 79.2 83.3 93.8 86.5 76.0 84.4 91.7 89.6 61.5 67.7 86.5 87.5 69.8 69.4 89.4 88.1  % MISRE- % REJECTED COGNIZED 3.12 31.3 0.000 23.9 0.000 10.4 0.000 7.29 0.000 33.3 0.000 26.0 0.000 14.6 0.000 15.6 18.8 2.08 0.000 16.7 0.000 6.2 13.5 0.000 20.8 3.13 0.000 15.6 0.000 8.33 0.000 10.4 35.4 3.13 0.000 32.3 0.000 13.5 12.5 0.000 2.29 27.9 30.6 0.000 0.000 10.6 11.9 0.000  % CORRECT RUNNERS UP 13.5 10.4 3.13 4.17 18.8 12.5 8.33 8.33 8.33 6.25 3.13 7.29 9.38 11.5 6.25 9.38 11.5 10.4 7.29 5.21 12.3 13.6 5.63 6.88  Table A.21: Statistics for Test 7 - Consonants (Group 2)  Appendix A. Previous Work in Speech Recognition at UBC  A.1.1  Confusion Matrices for VTR6050 for All Subjects  Appendix A. Previous Work in Speech Recognition at UBC  Word 0 No. 0 4 3 1 2 S  1  2  4 12  1  S  4  5  Was Recognized AsWord No. 6 7 8 9 10 11 12  IS  Rej  5 20 2 15 3 1 19  4  5 6 7 8 9 10 11 12 IS  18 2 . 20 5 15 , 4 14  •  1  1  1  1  •  •  .  .  •  •  •  2 1  15 2 3 13 . 20 . 2 18  1  •  3  1  •  .  18  15  .  2 1  18  1  H  15  •  •  1 16  Table A.22: Confusion Matrix for Test 1 - Vowels Said Normally  Word No. 0 0 3 3 1 2 S  1  2  S  Was Recognized AsWord No. 6 7 8 9 10 11 12  •  19 5 13  •  •  •  1  4  16 13 2 2  •  14 1 1  15  •  2 1 19 15  •  •  •  •  •  1  2 12 . 20 2 17  1  U  15 1 1  19 4 6 4  3  Rej  l 20 1  •  15  4  15  •  IS  2 12  4  5 6 7 8 9 10 11 12 IS  4  5  •  2  •  3  •  •  15  Table A.23: Confusion Matrix for Test 2 - Vowels With Mike Moved  2  Appendix A. Previous Work in Speech Recognition at UBC  Word No. 0 0 1 1 6 2 S  1  2  S  5 10  .  4  5  Was Recognized AsWord No. 9 10 11 12 IS 6 7 8  14 15 Rej 14 3 l  20 3 17  4  20  5 6 7 8 9 10 11 12 IS  •  U  2  •  .  •  8 5  18 . . 12  . . . .  1 13 14  3  1  .  1  4 14 4 1 13 20 1 3  ,  . . 16  2 2 2 2 . .  17  2  .  
17 3  15  1  17  •  Table A.24: Confusion Matrix for Test 3 - Vowels Said Slowly  Word No. 0 1 0 7 3 1 7 9 2 S  4  •  2  15  4  IS  16 2 13 5 1 19 16  5 6 7 8 9 10 11 12 IS  14  S  Wae Recognized AsWord No. 7 8 9 10 11 12 5 6  1  14 15 Rej 10 . 2 2 4 1  2  20 4 4 3  12  1  .  13  1  1  .  10  1 1 3 3  4  .  2  2  4  .  8 11 19 3  .  1  .  1  2  .  2 3 . 16 . 16  1 16  •  •  .  14  Table A.25: Confusion Matrix for Test 4 - Vowels Said Quickly  1  123  Appendix A. Previous Work in Speech Recognition at UBC  Word No. 0 1 2 S  0  1  2  3 7  2  2  10  1  IS  14 15  Rej 13 3  4 •  5 6 7 8 9 10 11 12 IS 15  4  Was Recognized As Word No. 9 10 11 12 6 7 8 5  19  4  U  S  10 1  1  5 19 3  17  . 11 7  19 ,  . 7 ,  2  1  .  1 10 3 1  1 •  14 4  .  •  •  i  ••  1 9 19 3  •  .  1  3 4  •  1 1  16  1  . 20  3  1  15  1 1  •  1  1  1  •  •  16  •  Table A.26: Confusion Matrix for Test 5 - Vowels With Interrogative Intonation  124  Appendix A. Previous Work in Speech Recognition at UBC  Word Was Recognized A« Word No. No. 0 1 B 8 4 5 6 7 8 9 10 11 IB 18 14 15 16 17 18 19 20 21 0 1 66 1 911 2 2 . 13 8 1 . 17 4  5 6 7 8 9 10 11 12 18  16  22 28 Rej . 1 1  . 3 . 14 . . 2  2  1  .  12 1 1  1  1  1  1  .  . 13 . 1  .10  4  1  1 . 3 10  1  3 5  15  .  11  2  ,  2  3  1  1  .  15  1  •  .  1  3  .  1  11  1  14  ,  .  3  ,  11  15 16 17 18 19 20 21 22 28  2  1  1  1  1  1 2 2  3  .  3  .  12  4  .  3  16  .  1  5  5  1  .  3  9  1  .  2  3  8  .  2  2 2  1  3  ,  1  18  .  . .  3 .  1  1  .  10  1  1  3 .  1  1  .  3  9  2  1 .  •  1  .  2  4  9  Table A.27 Confusion Matrix for Test 6 - Consonants (Group l)  Appendix A. Previous Work in Speech Recognition at UBC  125  Word Was Recognized As Word No. 0 1 2 8 4 5 6 7 8 9 10 11 12 IS 14 15 16 17 18 19 20 21 22 28 Rej No. 8 0 10 2 1 6 13 1 . . 18 1 1 2 . . . 19 1 S  4  5 6 7 8 9 10 11 12 IS U  15 16 17 18 19 20 21 22 2S  . . . . 19 19 . . . . 16 2 . 1 . . . . . . . 19 . . 11 . 11 1 2 13 3. . .1 1 • 3 . .1 1. 1. 1 . .3. . .1  1 . 1 1  . 1 . . 12 . 1 1 . .  3 3.  . .1  . . . .  . 2 1 1 1  . 1  . . . 3  2 1 . 1 . . 2 . . . 2 12 . . 15 . . . . . .  . . 15 . .  1 . . . . . 15 4 6 12  2 3 1 3  .  •  1 1  1 2  3 1 2 1 1 1 •  1 1  12 2 1 1 . 16 3 1 3 10 1 1 1 . . . . 1 2 17 . . 2 1 • 9 1 2 . 1 2 12 1 . . 1 . . 1 . . 1 . • 2 3 • • 2 10 .  Table A.28: Confusion Matrix for Test 7 - Consonants (Group 2)  Appendix A. Previous Work in Speech Recognition at UBC  A.2  Results for N E C SR100  126  Appendix A. Previous Work in Speech Recognition at UBC  SUBJECT DR  EC  HW  JR  RF  AVERAGE  MODE % CORRECT 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3  79.7 81.3 83.3 85.4 64.1 87.5 81.3 87.5 92.2 93.8 93.8 97.9 79.7 85.9 93.8 91.7 87.5 92.2 89.6 87.5 80.6 88.1 88.3 90.0  127  % MISRE- % REJECTED % CORRECT RUNNERS UP COGNIZED 7.81 4.69 12.5 15.6 3.13 4.69 12.5 16.7 0.000 14.6 0.000 10.4 4.69 9.38 31.3 9.38 10.9 1.56 2.08 14.6 16.7 10.4 2.08 10.4 6.25 1.56 4.69 6.25 0.000 3.13 0.000 4.17 6.25 2.08 0.000 0.000 0.000 18.8 20.3 0.000 7.81 14.1 6.25 0.000 2.08 0.000 6.25 8.33 12.5 0.000 6.25 1.56 4.69 6.25 6.25 8.33 2.08 2.08 4.17 10.4 16.6 2.80 8.75 1.25 5.94 10.6 0.833 7.92 10.8 0.833 6.25 9.17  Table A.29: Statistics for Test 1 - Vowels Said Normally  Appendix A. 
Previous Work in Speech Recognition at UBC  SUBJECT  MODE  % CORRECT  DR  0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3  85.0 81.3 82.5 91.3 43.8 62.5 71.3 68.8 47.5 48.8 53.8 55.0 67.5 66.3 85.0 88.8 72.5 81.3 83.8 83.8 63.3 68.0 75.3 77.5  EC  HW  JR  RF  AVERAGE  128  % MISRE- % REJECTED % CORRECT RUNNERS UP COGNIZED 2.50 6.25 12.5 10.0 15.0 3.75 8.75 13.8 3.75 0.000 8.75 8.67 12.5 32.5 23.7 11.3 22.5 15.0 6.25 17.5 11.2 10.0 21.2 10.0 5.00 13.8 38.7 3.75 11.2 40.0 7.50 13.7 32.5 30.0 8.75 15.0 8.75 16.3 16.2 20.0 6.25 13.7 6.25 5.00 8.75 5.00 8.75 2.50 6.25 15.0 21.3 7.50 13.7 5.00 8.75 0.000 16.2 0 . 0 0 0 6 .25 16.2 17.5 9.50 19.3 7.75 15.3 16.7 7.25 14.0 10.7 7.75 8.50 14.0  Table A.30: Statistics for Test 2 - Vowels With Mike Moved  Appendix A. Previous Work in Speech Recognition at UBC  SUBJECT DR  EC  HW  JR  MODE % CORRECT  AVERAGE  % MISRE- % REJECTED % CORRECT COGNIZED RUNNERS UP  0 1  13.8  3.75  82.5  1.25  73.8  11.3  15.0  7.50  2 3  77.5 95.0  10.0 5.00  12.5 0.000  7.50 5.00  0  1.25  2.50  96.3  0.000  1  71.3  28.7  0.000  16.3  2  72.5  27.5  0.000  12.5  3  77.5  22.5  0.000  8.75  0 1  2.50  85.0  0.000  97.5  12.5 2.50  0.000  0.000  2 3  97.5 96.3  2.50 3.75  0.000 0.000  2.50 3.75  0  12.5  17.5  70.0  0.000  1 2  86.3  13.7 11.2  0.000  10.0  0.000  10.0  11.2  0.000  10.0  71.3 0.000  1.25 5.00  0.000 0.000  8.75 6.25  81.0 3.00 2.50 0.000  0.500 7.75 8.25 6.50  3 RF  129  84.8 88.8  0 1  20.0  8.75  91.3  8.75  2 3  87.5 88.8  0 1 2 3  10.0 84.0 84.8  12.5 11.2 9.00 13.0 12.7 10.5  89.5  Table A.31: Statistics for Test 3 - Vowels Said Slowly  Appendix A. Previous Work in Speech Recognition at UBC  SUBJECT DR  EC  HW  JR  RF  AVERAGE  MODE % CORRECT 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3  41.3 90.0 91.3 91.3 38.8 68.8 68.8 70.0 23.8 83.8 92.5 92.5 55.0 87.5 90.0 86.3 55.0 91.3 90.0 88.8 42.8 84.3 86.5 85.8  130  % MISRE- % REJECTED % CORRECT RUNNERS UP COGNIZED 31.2 2.50 27.5 0.000 8.75 10.0 0.000 6.25 8.75 6.25 0.000 8.75 7.50 60.0 1.25 15.0 31.3 0.000 0.000 16.3 31.2 30.0 0.000 12.5 22.5 3.75 53.7 16.2 0.000 6.25 7.50 0.000 5.00 0 . 0 0 0 7.50 7.50 11.2 11.3 33.8 0.000 7.50 12.5 10.0 0.000 5.00 0.000 8.75 13.7 40.0 5.00 7.50 8.75 0.000 6.25 0.000 6.25 10.0 7.50 11.2 0.000 14.3 6.50 43.0 0.000 8.75 15.7 0.000 7.75 13.5 8.50 14.3 0.000  Table A.32: Statistics for Test 4 - Vowels Said Quickly  Appendix A. Previous Work in Speech Recognition at UBC  SUBJECT DR  EC  HW  JR  RF  AVERAGE  MODE % CORRECT 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3  70.0 77.5 78.8 96.3 36.3 62.5 72.5 67.5 78.8 90.0 92.5 91.3 63.8 81.3 92.5 91.3 72.5 68.7 73.8 76.3 64.3 76.0 82.0 84.5  131  % MISRE- % REJECTED % CORRECT RUNNERS UP COGNIZED 10.0 15.0 15.0 5.00 13.8 8.75 8.75 10.0 11.2 3.75 0.000 3.75 16.3 33.7 3.75 3.75 16.3 33.8 2.50 12.5 25.0 17.5 30.0 2.50 7.50 3.75 13.7 10.0 0.000 6.25 7.50 7.50 0.000 8.75 0.000 7.50 7.50 25.0 11.2 3.75 7.50 15.0 0.000 6.25 1.25 5.00 8.75 0.000 5.00 10.0 22.5 8.75 16.3 22.5 2.50 7.50 23.8 2.50 3.75 21.2 8.50 23.7 12.0 10.3 18.0 6.00 7.25 14.5 3.50 1.00 7.50 14.5  Table A.33: Statistics for Test 5 - Vowels with Interrogative Intonation  Appendix A. 
Previous Work in Speech Recognition at UBC  SUBJECT DR  EC  HW  JR  RF  AVERAGE  MODE % CORRECT 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3  45.8 61.5 56.9 69.4 37.5 46.9 48.6 52.8 56.3 65.6 72.2 79.2 59.4 58.3 76.4 76.4 54.2 47.9 61.1 56.9 50.6 56.0 63.1 66.9  132  % MISRE- % REJECTED % CORRECT RUNNERS UP COGNIZED 4.17 8.33 50.0 11.5 1.04 37.5 1.39 26.4 41.7 0.000 19.4 30.6 12.5 7.29 55.2 15.6 47.9 5.21 15.3 2.78 48.6 18.1 43.1 4.17 21.9 0.000 43.8 17.7 1.04 33.3 0.000 18.1 27.8 8.33 20.8 0.000 9.38 0.000 40.6 1.04 17.7 40.6 6 .94 0.000 23.6 1.39 6.94 22.2 9.38 0.000 45.8 15.6 2.08 50.0 0.000 9.72 38.9 15.3 0.000 43.1 12.3 2.29 47.1 2.08 15.6 41.9 15.3 36.1 0.833 13.6 1.11 31.9  Table A.34: Statistics for Test 6 - Consonants (Group 1)  Appendix A. Previous Work in Speech Recognition at UBC  SUBJECT  MODE  DR  0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3 0 1 2 3  EC  HW  JR  RF  AVERAGE  %  CORRECT  72.9 74.0 86.1 84.7 42.7 54.2 68.1 83.3 77.1 76.0 83.3 87.5 77.1 80.2 83.3 90.3 45.8 43.8 54.2 59.7 63.1 65.6 75.0 81.1  133  % MISRECOGNIZED  % REJECTED  % CORRECT RUNNERS UP  27.1 26.0 13.9 15.3 55.2 45.8 31.9 16.7 22.9 24.0 16.7 12.5 21.9 18.8 16.7 9.7 54.2 55.2 45.8 40.3 36.3 33.9 25.0 18.9  0.000 0.000 0.000 0.000 2.08 0.000 0.000 0.000 0.000 0.000 0.000 0.000 1.04 1.04 0.000 0.000 0.000 1.04 0.000 0.000 0.625 0.417 0.000 0.000  14.6 15.6 5.56 8.33 19.8 12.5 12.5 11.1 18.8 14.6 9.72 8.33 15.6 13.5 8.33 8.33 18.8 15.6 22.2 13.9 17.5 16.0 11.7 10.0  Table A.35: Statistics for Test 7 - Consonants (Group 2)  Appendix A. Previous Work in Speech Recognition at UBC  A.2.1  Combined Confusion Matrices for All Subjects for SR100  134  Appendix A. Previous Work in Speech Recognition at UBC  Word No. 0 0 15 1 2 8  1 •  12  2  1 4 17  8  4  135  Was Recognized AsWord No. 5 6 7 8 9 10 11 12 18  U  3 1 1  2 19  4  15 Rej  12  2  4  2  4  15 2  11  .  l 14 1  1  3 13 ,  1  •  2  1  •  •  15  1  17  1  17  15  Table A.36: Confusion Matrix for Test 1 - Vowels Said Normally  Word No. 0 0 10 1 2 8  1  2  13 1  1 13 2  4  5 6 7 8 9 10 11 12 18  14  15  8  4  Was Recognized AsWord No. 5 6 7 8 9 10 11 12 18  1  15 .  9  5  1  •  •  •  1  .  15 1  11 5 1 2 1  6  .  1  1  •  .  ,  3 4  3  1 13 13 4 2  1 9 7 12  2 . 1  1  2 3 2 2 1  .  15 1  U  .  •  17 17  4 4  1 3  .  1 1 1 4  19  5 6 7 8 9 10 11 12 18  1 3  14 15 Rej 2 7 2 4 2 4 3 2 5 6 6 9 4 6 7 6 1 3 4 12 . 12 7  Table A.37: Confusion Matrix for Test 2 - Vowels With Mike Moved  1 3  Appendix A. Previous Work in Speech Recognition at UBC  Word No. 0 1  £ 8  4 5 6 7 8  136  Was Recognized As Word No.  0  6 . . . . .. ..  10 11 12 18  14  15  4 5 6 7 8 9 10 11 1£ 18 14 15 Rej . 1 14 7 1 2 10 1 9 1 9 . . 5 2 . . . 13 . . . 6 . . 4 . . . . 3 . . . 7 7 13 33 .. . . . . 1 1 . . 5 5 1 1 . . 22 . . . 8 11 . .. .. 22 .. . . 66 . . . . . . . 11 99 . 11 4 1 5 10 6 14 7 . . . 13 9 . . . 11 12 . . 8 9 . 11 3 17 1  £  8  Table A.38: Confusion Matrix for Test 3 - Vowels Said Slowly  Word No. 0 0 8 1 £ 8  1  6 15  £  2 16 3  4  5 6 7 8 9 10 11 1£ 18 U  15  8  4  Was Recognized As Word No. 7 8 9 10 11 1£ 5 6  15  18 14  15 Rej  4 2 1  1 8  11 15 l 7  15 4  2  7 1 10  1  2  •  . 6 , .  . 1 . 1 8 8 . 15  2 3 8 1  14 1  4  .  2  5 12  1  1  2 13  .  16  Table A.39: Confusion Matrix for Test 4 - Vowels With Quickly  2 1 3 1 5 3 2 2 2 5 4 . 2 2 1  Appendix A. Previous Work in Speech Recognition at UBC  Word 0 No. 0 13 1 2 S  U  15  2  S  4  IS  U  15  16  1 17  2 19 . 17  5  .  1 1  3  .  • •  1 2  •  1 19  6  1  •  ,  10 11 6 3 10  3 1  ,  3  1  •  1  2  3  1  3  •  •  •  .  
l l  16 1 17 14 16  1 1  Table A.40 Confusion Matrix for Test 5  .  Rej  6 3 1  1  10  4 5 6 7 8 9 10 11 12 IS  1  Was Recognized 8A Word No. 5 6 7 8 9 10 11 12  137  13 17  2 3 3 1 1 2 1 2  Vowels With Interrogative Intonation  138  Appendix A. Previous Work in Speech Recognition at UBC  Was Recognized 8A Word No. Word 0 1 2 8 4 5 6 7 8 9 No. 10 11 12 IS 14 15 16 17 18 19 20 21 22 . 15 . 2 . .• • • 0 . 1 . 15 . 2 . • * • 1 . 1 2 16 . . • • • 2 • . . . 17 1. 2 . S 4 5 6 7 8 9 10 11 12 18  14 15 16 17 18 19 20 21 22 28  . 1 . 1 9 . 1 11 1 . 1 . 2 5 7 1 1. . 3 14 .1 . . 1 . . 2. . 82 1 . . . 2 . . 1 19 2 1 1 12 • . . .3. • • ... 12 • • . . , 6 • . • 2 . 2 . 4 , • > . . 11 2 1 • • . 1 •  .  .  ,  ,  . . 41 . . . 11 . . . . .  • •  . . 11 .  1  1 2 • •  1 1  1 ,  5 12 3 10 1 1 12 . . 1 . 1 • . 1 . 1 . 2 . • 1  . . 11 . .  .  .  • •  •  •  .  2 . 1 .  . 3 . 2 . 1  .  . 1  . .  . 1 . 2  . .  . 2 . 1  .  1 .  1 3 . . 12 5 6 8  28 Rej  1 1 1 1 . 2 3 2 1 1 11 . . 1 7 5 7  . . 8 3 . 1 1 2 7 . , 1 13 . 4 2 2 4 . 1 , 1 3 . . 6 • 4 . 1 5 • •  . •  •  3 . . 1 . . .  . 3 1 2 . 1  9 . 3 2 8 .  Table A.41: Confusion Matrix for Test 6 - Consonants (Group 1)  Appendix A. Previous Work in Speech Recognition at UBC  139  8 Word No. Word Was Recognized A 0 1 2 8 4 5 6 7 8 9 1 0 11 12 18 14 15 16 1 7 18 1 9 20 21 22 28 Rej No. 2 0 18 3 . 1 .12 1 3 . . 1 1 1 2 . . 18 . 8 . . . 20  4  5 6 7 8 9 10 11 12 18 14  15 16 17 18 19 20 21 22 28  1 . . .  13 2 . . . . 15 1 . . 1 14 . . 2  2  2  . .  4  6 . 1 6  . . . 1 . . . . 11 1 . . . . 1 . . 1 . . 13 . 1 . . 15 . . . 1 1 . 11 5 . 2 12 . . 16 2 . . . . . . . 1 1  Jt t \ . • • •  •  •  •  •  •  •  •  •  •  • • X *  •  .  1  •  2  1  • • •  4 1  1 3  . . 1  1  .  .  1 1 . .  .  1  •  . .  . .  .  .  •  •  •  •  •  •  •  .  .  1  1 3  •  1 1 1  *  9 1  6  .  3  9  1  1 1  7  •  1 3  . .  2  6  •  2  1  .  11  .  2 3 1 1 .  7 9  ,  2  •  11 8 4 15  >  2  1 4  1 .  6  .  1  11 .  . 9  1 1  .  10  7 14  •  •  Table A.42: Confusion Matrix for Test 7 - Consonants (Group 2)  Appendix B  Operation of the V O T A N VTR6050  The VOTAN speech recognition unit is a speaker dependent, isolated word recognition unit. The unit must be pretrained with all possible words for all possible users. The pretrained words are stored as templates in the memory. Recognition is accomplished by spectral template matching. The V O T A N boasts connected word recognition, although limited tests of our own, have shown that at best it can recognize only every second word in a sentence, depending upon the size of the sentence and the length of pauses between words.  B.l  Additional Features  The V O T A N VTR6050 has some extra features. The first is a floppy disk drive which can be used to store additional templates. The advantages of the floppy disk is that each user can train the SRU before hand, store his/her set of words on their own floppy disk and then later use their pre-trained set of application words to initialize the system. An other feature of the VTR6050 is its ability to interface with the telephone system. There are built in functions which allow the SRU to monitor the telephone and answer it after a given number of rings.  B.2  Memory Space  The VOTAN VTR6050 has the following amount of memory:  140  Appendix B. 
Operation of the VOTAN VTR6050  V T R System Memory  500  Kbytes  Floppy Disk Memory  760  Kbytes  22  Kbytes  1282  Kbytes  Voice Card Total + Host Memory  141  Unlimited  Each template requires 200 - 250 bytes of memory, therefore 1282 Kbytes can hold approximately 5250 - 6500 words. The Voice Card can hold only one set of words at a time. The 22 Kbytes of memory on the Voice Card corresponds to 75 seconds of speech. All recognition takes place on the Voice Card. There is a maximum of 150 words per set if each word is trained with only one template, however the number of words decrease if more than one template is trained per word (eg. if there are 2 templates/word then the maximum number of words/set is 75). The V O T A N can swap sets between the V T R System Memory and the Voice Card instantaneously, between the Host memory and the Voice Card (time depends on the maximum baud rate of 9600 for the RS232 connector), or between the floppy and the Voice Card.  B.3  Configuration for the Operation of VTR6050  The configuration for the operation of the V O T A N VTR6050 is shown in the Figure B.33. The test of the word selection method was done live, therefore the speech went direct to the VTR6050. The HP9050 acted as the host computer controlling the VTR6050 by an RS232 connector to the HPD3 bus. The subject was prompted from the HP9050 for the appropriate word. Recognition data was returned to the HP9050 from the  142  Appendix B. Operation of the VOTAN VTR6050 analog speech/ f I Herd  Qyw< f  O O O O • e r  O O O C l e c  e  micPophtne.  FILTER  . I  j1  Ono/eo,  I  speech  1  UUU|  5a\ PDP-11  POP-M  VTiieet?  Hf>90fO  HPIOSO  recognition  Processed sp*^ SPeecH  Figure B.33: Configuration for the Operation of VTR6050 VTR6050 by the RS232 connector. For thefilteringtest, the data was acquired as explained in Section 5.4 and transferred by tape to the HP9050. Thefilteringwas done on the HP9050 in software. To test the recognition rate, thefilteredspeech was sent to the framegrabber board of the PDP-11 via the HPIB bus. The digital speech was sent to the D/A port of the PDP-11 at a rate of 20000 samples per second. The D/A output was connected to a Khonite low passfilterwith a cut off of 9 KHz. The analog speech was then input direct to the VTR6050. The VOTAN SRU was controlled by the HP9050 and the VTR6050 returned the recognition data back to HP9050 ( by the RS232 port to the HPIB bus). The HP9050 kept track of the recognition results and did all analysis of the data.  Appendix C Results for Vocabulary Selection  The following tables show the complete results for each subject. Column 3 gives the results that were obtained before any selection had taken place. Column 4 show the results obtained after an optimum subset was selected. The level number refers to the levels of the specific application subset that was used and is defined as follows: Go  Begin  End  Quit  Course  Track  Location  Position  Remember  Record  Goto  Repeat  Send Bucket To  Tell Position  Determine Position Give Position  Speed  Velocity  Rate  Camera  Optical Device  Photograph  Mode  State  Condition  Level 1 Start Stop Level 2 Path Point Level 3 Memorize  Level 4 Memorized Location Point  Position  Course  Track  Bucket  Grapple  Stick  Arm  Link 1  Boom  Shoulder  Link 2  Faster  Increase  Memorized Path Level 5 Endpoint  Level 6 Quicker  143  144  Appendix C. 
Results for Vocabulary Selection  Slower  Retard  Decrease  Minimum  Slowest  Least  Medium  Average  Intermediate  Most  Maximum  Faster  Level 7 Handcontroler  Path  Joystick  Arm  Joints  Bucket  Faster  Increase  Slower  Retard  Decrease  Minimum  Slowest  Least  Medium  Average  Intermediate  Most  Maximum  Faster  Elevate  Higher  Down  Descend  Lower  Left  West  Leftward  Right  East  Right side  On  Lock Onto  Lock  Off  Unlock  Take Off  Speed  Rate  Position  Location  Level 8 Quicker  Level 9 Up  Level 10 Velocity Point  C.l  Vocabulary Selection Results - No Distances Set  Appendix C. Results for Vocabulary Selection  Subject Level  DG  FN  Before Selection 4fter Selection % Mis% Mis% % % % RecognizedRecognizedRejected RecognizedRecognizedRejected  1  100.00  0.00  0.00  100.00  0.00  0.00  2 3  100.00 100.00  0.00 0.00  100.00 97.78  0.00 2.22  100.00 100.00 100.00 93.33  0.00 0.00 0.00  0.00 0.00  4 5  0.00 0.00 0.00 0.00  6 7  97.78 83.33  2.67 16.67  96.00 100.00  100.00  0.00  100.00  4.00 0.00 0.00  0.00 0.00  8  0.00 0.00 0.00  9 10  0.00 0.00  5.56  100.00  0.00  96.67 100.00  0.00 0.00  3.33 0.00  Average  97.33  0.00  2.71  98.60  0.40  1.0  1  93.33  6.67  0.00  100.00  0.00  0.00  2 3  96.67 80.00  0.00 7.78  100.00 93.33  0.00 0.00  4 5  86.67 88.89  0.00 4.44  3.33 12.22 13.33 6.67  70.00 100.00  0.00 0.00  0.00 6.67 30.00 0.00  6 7 8 9 10  86.67 86.67 96.00 78.89 96.67 89.04  0.00 0.00  13.33 13.33 2.67 8.89 3.33  96.00 90.00  0.00 0.00  96.00 100.00 90.00 93.53  4.00 0.00 0.00 0.40  100.00 100.00 96.67 100.00 100.00 92.00 100.00 80.00 100.00  0.00 0.00 0.00 0.00 0.00 8.00 0.00 8.00 0.00  0.00 0.00 0.00 0.00 12.00 0.00  100.00 96.87  0.00 1.60  0.00 1.53  Average HL  145  1 2 3 4 5 6 7 8 9 10 Average  94.44  1.33 12.22 0.00 3.24 3.33  7.71  96.67 96.67 94.44 100.00 95.56 84.00 100.00 84.00 85.56  3.33 2.22 0.00 2.22 10.67 0.00 9.33 10.00  0.00 0.00 3.33 0.00 2.22 5.33 0.00 6.67 4.44  76.67 91.36  0.00 4.11  23.33 4.53  0.00  0.00 6.67  0.00  4.00 10.00 0.00 0.00 10.00 6.07 0.00 0.00 3.33  Appendix C. Results for Vocabulary Selection  Subject  KC  ML  % Mis% Mis% % % % Recognized Recognized Rejected Recognized Recognized Rejected 0.00 0.00 0.00 100.00 93.33 6.67  2 3  100.00 96.67  0.00 1.11  0.00 2.22  90.00 100.00  0.00 0.00  10.00 0.00  4 5 6 7  100.00 91.11 92.00 100.00  0.00 0.00  0.00 8.89  73.33  0.00 0.00 2.67  100.00  0.00 0.00 4.00 0.00 0.00  0.00 0.00 0.00 0.00  8  8.00 0.00 24.00  100.00 100.00 96.00 100.00  9  88.89  6.67  4.44  90.00  10.00  0.00  10  100.00  0.00  0.00  100.00  0.00  0.00  1.40  1.00 0.00 0.00 0.00 0.00 6.67 16.00 0.00  Average  93.53  4.65  1.82  97.60  1  93.33 100.00 85.56 100.00 93.33 88.00 86.67  6.67 0.00 7.78 0.00 0.00  0.00  100.00  0.00  0.00 6.67 0.00 6.67  10.67 3.33  1.33 10.00  100.00 100.00 100.00 93.33 84.00 100.00  0.00 0.00 0.00 0.00 0.00 0.00  2.67 2.22 13.33  96.00 83.33 100.00  4.00 3.33  2 3 4 5 6 7 8 9 10  PN  After Selection  Before Selection  Level  1  146  0.00  86.67  10.67 4.44 0.00  0.00  0.00 13.33 0.00  Average  91.36  4.36  4.29  95.67  0.73  3.60  1  100.00  0.00 0.00 0.00 3.33  100.00 100.00 100.00  0.00 0.00  0.00 0.00 0.00  4  96.67 98.89 16.67  0.00 3.33 1.11  0.00  2 3 5  95.56  0.00  80.00 4.44  100.00 100.00  0.00 0.00  0.00 0.00  6 7  98.67 86.67  1.33 0.00  0.00 13.33  84.00 100.00  16.00 0.00  0.00 0.00  8 9  89.33 94.44 100.00 87.69  8.00 3.33 0.00 1.60  2.67 2.22 0.00 10.71  100.00 96.67 100.00 98.07  0.00 0.00 0.00 1.60  0.00 3.33 0.00 0.33  10 Average  86.67 93.33  Appendix C. 
C.2  Vocabulary Selection Results - Distance Measurements Used

Subject  Level   Before Selection                             After Selection
                 % Recognized  % Mis-Recognized  % Rejected   % Recognized  % Mis-Recognized  % Rejected

DG
1 2 3 4 5 6 7 8 9 10 Average
100.00 0.00 50.00 50.00 0.00 0.00
100.00 0.00 100.00 0.00 0.00 0.00
0.00 96.67 3.33 100.00 0.00 0.00
100.00 0.00 0.00 0.00 100.00 0.00
100.00 0.00 93.33 0.00 0.00 6.67
92.00 5.33 2.67 100.00 0.00 0.00
100.00 0.00 0.00 0.00 100.00 0.00
98.67 0.00 1.33 92.00 0.00 8.00
97.78 0.00 2.22 100.00 0.00 0.00
96.67 3.33 0.00 100.00 0.00 0.00
98.18 1.09 0.73 93.53 6.47 0.00

DH
1 2 3 4 5 6 7 8 9 10 Average
100.00 100.00 100.00 100.00 95.56 100.00 100.00 100.00 97.78 100.00 99.33
0.00 0.00 0.00 0.00 2.22 0.00 0.00 0.00 0.00 0.00 0.22
0.00 0.00 0.00 0.00 2.22 0.00 0.00 0.00 2.22 0.00 0.44
100.00 100.00 100.00 100.00 100.00 100.00 100.00 96.00 100.00 100.00 99.60
0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
0.00 0.00 0.00 0.00 0.00 0.00 0.00 4.00 0.00 0.00 0.40

GG
1 2 3 4 5 6 7 8 9 10 Average
76.67 100.00 65.56 100.00 95.56 72.00 100.00 94.67 90.00 100.00 89.45
0.00 0.00 7.78 0.00 0.00 5.33 0.00 5.33 2.22 0.00 2.07
23.33 0.00 26.67 0.00 4.44 22.67 0.00 0.00 7.78 0.00 8.49
90.00 100.00 100.00 100.00 100.00 92.00 90.00 100.00 100.00 100.00 97.20
0.00 0.00 0.00 0.00 0.00 4.00 0.00 0.00 0.00 0.00 0.40
10.00 0.00 0.00 0.00 0.00 4.00 10.00 0.00 0.00 0.00 2.40
Subject  Level   Before Selection                             After Selection
                 % Recognized  % Mis-Recognized  % Rejected   % Recognized  % Mis-Recognized  % Rejected

HL
1 2 3  96.67 100.00 98.89  3.33 0.00 1.11  0.00 0.00 0.00  100.00 100.00 100.00
4 5  90.00 95.56  0.00 0.00  10.00 4.44
6  84.00 100.00  5.33  10.67  100.00 100.00 92.00  0.00 0.00 0.00 0.00 0.00 0.00  0.00 5.33
7 8  0.00  100.00  0.00  8.00 0.00  9.34  88.00  0.00  12.00
9  92.22  4.44 3.33  3.33 20.00  96.67  0.00
10  76.67  90.00  0.00  3.33 10.00
Average  91.93  2.29  5.78  96.67  0.00  3.33

KC
1  100.00  0.00  0.00  100.00  0.00  0.00
2 3 4 5  100.00 100.00 100.00 100.00  0.00 0.00 0.00 0.00  0.00 0.00 0.00 0.00  100.00 100.00 100.00 100.00  0.00 0.00 0.00 0.00  0.00 0.00 0.00 0.00
6 7  94.67 96.67  4.00 0.00  1.33 3.33  100.00 100.00  0.00 0.00  0.00 0.00
8 9 10  97.33 93.33 96.67  2.67 1.11 3.33  0.00 5.56 0.00  92.00 100.00 100.00  8.00 0.00 0.00  0.00 0.00 0.00
Average  97.87  1.11  1.02  99.20  0.80  0.00

ML
1 2  96.67 93.33 100.00  3.33  0.00  90.00 100.00 100.00  0.00 0.00 0.00 0.00 0.00  10.00
3 4  100.00 97.78  0.00 0.00  0.00 0.00 0.00 2.22  0.00 0.00 0.00  6.67 0.00  100.00 100.00  0.00 0.00  85.33
5 6 7 8 9 10  85.33 90.67 96.67 86.67  12.00 0.00 8.00 0.00 0.00 3.00  2.67 3.33 1.33 3.33  96.00 100.00 96.00 93.33 100.00  4.00 0.00 13.33 2.62  0.00 3.33 0.00 0.73  0.00 0.00 0.00 0.00  0.00 0.00 0.00 0.00 0.00 0.00 4.00 3.33 0.00 1.73
Average  94.38  97.53

RH
1 2  83.33 0.00 16.67 100.00 0.00 0.00  100.00 90.00  0.00 0.00 0.00 2.22
3 4 5  100.00 100.00 97.33  0.00 0.00 2.67  0.00 0.00  83.33 90.67  0.00 8.00  7.78  0.00 16.67 1.33 0.00  100.00 96.67  0.00 0.00  0.00  100.00 100.00 100.00  0.00 0.00 0.00  0.00 0.00
6 7 8  90.00 96.00  0.00 0.00  10.00 4.00  93.33  3.33  3.33  3.33  0.00
9 10  100.00 96.67  0.00 0.00  3.33  90.00  0.00  10.00
Average  94.13  1.29  4.58  96.60  0.33  3.07

RS
1  100.00  0.00  0.00  100.00  0.00  0.00
2 3 4 5 6 7  100.00 81.11 80.00 100.00 98.67  0.00 7.78 0.00 0.00 0.00 0.00  0.00 11.11 20.00 0.00 1.33 0.00  100.00 96.67  0.00 0.00 0.00 0.00 0.00 0.00  0.00 0.00 3.33 0.00 0.00 0.00
8 9 10  100.00 95.56 86.67  0.00 0.00 3.33  0.00 4.44 10.00 2.22  96.00 96.67 100.00  0.00 0.00 0.00  0.00 4.00 3.33 0.00
Average  94.20  3.58  98.93  0.00  1.07

SP
1 2  100.00 83.33 98.89 100.00 100.00 98.67  0.00  90.00 100.00 100.00 100.00 100.00  0.00  10.00
3 4 5 6 7 8 9 10  0.00 1.11 0.00 0.00 0.00  0.00 0.00 0.00 0.00  90.00 98.67 96.67  3.33 0.00 3.33  0.00 0.00 0.00 0.00  0.00 0.78  100.00 100.00 100.00 100.00 90.00 98.00  0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00  96.67 96.29  0.00 16.67 0.00 0.00 0.00 1.33 6.67 1.60 0.00 4.00 3.03  0.00 0.00  10.00 2.00
Average  96.29  3.03  98.00  0.00  2.00
Appendix D

Results of Filtering Tests

D.1  Results For Noise Mode 1

D.1.1  Results When Trained With One Template Only

Filtering Method (PS)   Attempt   % Recognized   % Mis-Recognized   % Correct Runners Up

No filtering (-), Wiener (1), Wiener (2), Wiener (3), PSS (1), PSS (2), PSS (3); attempts 1 2 3 4 5 Ave for each method:
31.25 50.00 50.00 62.50 43.75 37.50 18.75 42.50 31.25 12.50 12.50 31.25 12.50 20.00 31.25 18.75 56.25 31.25 25.00 32.50 12.50 12.50 18.75 6.25 12.50 12.50 31.25 37.50 43.75 37.50 62.50 42.50 62.50 62.50 56.25 37.50 50.00 53.75 50.00 68.75 50.00 50.00 43.75 52.50
37.50 56.25 62.50 71.25 67.50 68.75 77.50 77.50 68.75 87.50 80.00 68.75 81.25 43.75 68.75 75.00 67.50 87.50 87.50 81.25 93.75 87.50 87.50 68.75 62.50 56.25 62.50 37.50 57.50 37.50 37.50 43.75 62.50 50.00 46.25 50.00 31.25 50.00 50.00 56.25 47.50
6.25 18.75 31.25 18.75 21.25 12.50 6.25 18.75 6.25 6.25 10.00 18.75 18.75 0.00 18.75 12.50 13.75 12.50 0.00 6.25 18.75 6.25 8.75 37.50 12.50 18.75 12.50 12.50 18.75 6.25 12.50 18.75 18.75 18.75 15.00 25.00 12.50 18.75 12.50 18.75 17.50

LMS (-), RLS (-), LPC (-) with attempts 1 2 3 4 5 Ave; Gaussian (4), Gaussian (16), Gaussian (64), Gaussian (256) with attempts 1 2 3 4 Ave:
56.25 25.00 43.75 18.75 56.25 43.75 50.00 50.00 12.50 50.00 50.00 25.00 43.75 12.50 56.25 18.75 51.25 48.75 31.25 50.00 50.00 62.50 47.50 18.75 12.50 50.00 50.00 6.25 56.25 43.75 31.25 68.75 12.50 16.25 50.00 50.00 50.00 50.00 18.75 61.54 38.46 15.38 18.75 50.00 50.00 43.75 56.25 31.25 37.50 62.50 12.50 19.33 48.56 61.34 35.50 25.00 64.50 31.25 18.75 68.75 40.00 25.00 60.00 25.00 40.00 60.00 36.69 63.31 23.44 9.25 31.25 90.75 12.50 9.25 90.75 11.50 25.00 88.50 9.25 12.50 90.75 9.813 20.31 90.19 12.50 5.250 94.75 18.75 5.250 94.75 1.500 98.50 31.25 2.750 97.25 6.25 17.19 3.750 96.25 12.50 11.00 89.00 12.50 6.500 93.50 25.00 8.750 91.25 12.50 11.00 89.00 15.63 6.563 93.44
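The PSS rows above refer to the Power Spectral Subtraction filter; the numbers 1 - 3 index parameter settings whose values are given in the body of the thesis, not here. As a point of reference, the basic operation can be sketched in a few lines of Python. This is a generic single-channel magnitude-squared subtraction with half-wave rectification and the noisy phase retained, not the exact implementation run on the HP9050:

    import numpy as np

    def pss_frame(noisy, noise_psd, alpha=1.0):
        # Power spectral subtraction on one windowed frame.
        # noisy     : time-domain samples of speech + noise
        # noise_psd : estimate of |N(f)|^2 (e.g. averaged over pauses)
        # alpha     : over-subtraction factor (a tunable parameter)
        spec = np.fft.rfft(noisy)
        power = np.abs(spec) ** 2 - alpha * noise_psd
        power = np.maximum(power, 0.0)        # half-wave rectification
        mag = np.sqrt(power)
        phase = np.angle(spec)                # keep the noisy phase
        return np.fft.irfft(mag * np.exp(1j * phase), n=len(noisy))

In practice the frames would be windowed and overlap-added, and the noise power spectrum re-estimated during the pauses between words.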
D.1.2  Results When Trained With Two Templates

Filtering Method (PS)   Attempt   % Recognized   % Mis-Recognized   % Correct Runners Up

No filtering (-), Ave only; Wiener (1), Wiener (2), Wiener (3), PSS (1), PSS (2), PSS (3) with attempts 1 2 3 4 5 Ave:
Ave  71.4  28.60  14.30
43.75 43.75 50.00 37.50 50.00 45.00 81.25 56.25 81.25 68.75 93.75 76.25 6.25 18.75 18.75 12.50 14.06 93.75 75.00 87.50 93.75 68.75 83.75 100.00 87.50 81.25 93.75 75.00 87.50 87.50 87.50 81.25 100.00 81.25 87.50
56.25 56.25 50.00 62.50 50.00 55.00 18.75 43.75 18.75 31.25 6.25 23.75 93.75 81.25 81.25 87.50 85.94 6.25 25.00 12.50 6.25 31.25 16.25 0.00 12.50 18.75 6.25 25.00 12.50 12.50 12.50 18.75 0.00 18.75 12.50
25.00 25.00 31.25 6.25 31.25 23.75 12.50 12.50 12.50 25.00 6.25 13.75 6.25 6.25 12.50 18.75 10.94 6.25 18.75 12.50 0.00 25.00 12.50 0.00 12.50 6.25 6.25 18.75 8.75 6.25 6.25 6.25 0.00 12.50 6.250

LMS (-), RLS (-), LPC (-) with attempts 1 2 3 4 5 Ave; Gaussian (4), Gaussian (16), Gaussian (64), Gaussian (256) with attempts 1 2 3 4 Ave:
81.25 81.25 81.25 81.25 75.00 80.00 87.50 68.75 75.00 81.25 87.50 80.00 75.00 69.23 50.00 62.50 43.75 60.10 78.25 67.75 73.00 73.00 73.00 79.75 74.50 74.50 74.50 75.81 76.75 71.50 61.25 71.50 70.25 42.00 22.75 13.00 22.75 25.13
18.75 18.75 18.75 18.75 25.00 20.00 12.50 31.25 25.00 18.75 12.50 20.00 25.00 30.77 50.00 37.50 56.25 39.90 21.75 32.25 27.00 27.00 27.00 20.25 25.50 25.50 25.50 24.19 23.25 28.50 38.75 28.50 29.75 58.00 77.25 87.00 77.25 74.88
18.75 18.75 6.25 6.25 18.75 13.75 12.50 25.00 12.50 12.50 12.50 15.00 25.00 23.08 31.25 25.00 18.75 24.62 6.25 18.75 12.50 12.50 12.50 6.250 6.250 6.250 6.250 6.250 6.250 6.25 25.00 12.50 12.50 0.00 18.75 25.00 25.00 17.19
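The LMS and RLS rows are the two-input adaptive cancellers, which use a second microphone picking up mainly the machine noise as a reference. A minimal LMS update is sketched below, again in Python and only as a generic illustration; the filter length and step size here are arbitrary placeholders, not the values used in these tests:

    import numpy as np

    def lms_cancel(primary, reference, taps=32, mu=0.01):
        # Two-input LMS noise canceller.
        # primary   : speech + noise at the operator's microphone
        # reference : noise-only signal from a second microphone
        # Returns the error signal e, i.e. the noise-reduced speech.
        w = np.zeros(taps)
        e = np.zeros(len(primary))
        for n in range(taps, len(primary)):
            x = reference[n - taps:n][::-1]   # most recent sample first
            y = w @ x                         # adaptive noise estimate
            e[n] = primary[n] - y             # speech estimate
            w += 2 * mu * e[n] * x            # LMS weight update
        return e

The RLS variant replaces this gradient update with a recursive least-squares one, converging faster at a higher cost per sample.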
D.2  Results For Noise Mode 2

D.2.1  Results When Trained With One Template Only

Filtering Method (PS)   Attempt   % Recognized   % Mis-Recognized   % Correct Runners Up

No filtering (-), Wiener (1), Wiener (2), Wiener (3), PSS (1), PSS (2), PSS (3); attempts 1 2 3 4 5 Ave for each method:
25.00 75.00 0.00 37.50 50.00 43.75 31.25 37.50 18.75 25.00 31.25 18.75 12.50 21.25 50.00 56.25 56.25 50.00 43.75 51.25 6.25 12.50 0.00 6.25 12.50 7.50 50.00 62.50 56.25 50.00 56.25 55.00 46.67 43.75 62.50 50.00 31.25 46.83 37.50 43.75 43.75 56.25 25.00 41.25
62.50 50.00 56.25 68.75 62.50 81.25 75.00 68.75 81.25 87.50 78.75 50.00 43.75 43.75 50.00 56.25 48.75 93.75 87.50 100.00 93.75 87.50 82.50 50.00 37.50 43.75 50.00 43.75 45.00 53.33 56.25 37.50 50.00 68.75 53.12 62.50 56.25 56.25 43.75 75.00 58.75
18.75 25.00 25.00 6.25 15.00 18.75 6.25 6.25 12.50 12.50 11.25 12.50 6.25 18.75 6.25 0.00 8.75 0.00 0.00 6.25 6.25 12.50 5.00 12.50 0.00 12.50 6.25 6.25 7.50 12.50 12.50 0.00 18.75 6.25 10.00 25.00 6.25 25.00 12.50 12.50 16.25

LMS (-), RLS (-), LPC (-) with attempts 1 2 3 4 5 Ave; Gaussian (4), Gaussian (16), Gaussian (64), Gaussian (256) with attempts 1 2 3 4 Ave:
6.25 43.75 56.25 12.50 56.25 43.75 18.75 37.50 62.50 25.00 56.25 43.75 12.50 62.50 37.50 15.00 53.75 46.25 12.50 37.50 62.50 68.75 18.75 31.25 32.25 0.00 68.75 12.50 56.25 43.75 12.50 25.00 75.00 11.25 43.75 56.25 12.50 50.00 50.00 25.00 50.00 50.00 43.75 25.00 56.25 18.75 50.00 50.00 68.75 12.50 31.25 52.50 18.75 47.50 25.00 64.50 35.50 18.75 69.00 31.00 25.00 40.00 60.00 25.00 40.00 60.00 36.63 63.38 23.44 90.75 31.25 9.25 12.50 9.25 90.75 25.00 11.50 88.50 12.50 9.25 90.75 9.813 90.19 20.31 12.50 94.75 5.25 94.75 18.75 5.25 31.25 98.50 1.50 97.25 6.25 2.75 17.19 3.688 96.31 12.50 89.00 11.00 12.50 6.50 93.50 25.00 8.75 91.25 12.50 10.75 89.25 15.63 9.250 90.75

D.2.2  Results When Trained With Two Templates

Filtering Method (PS)   Attempt   % Recognized   % Mis-Recognized   % Correct Runners Up

No filtering (-), Wiener (1), Wiener (2), Wiener (3), PSS (1), PSS (2), PSS (3); attempts 1 2 3 4 5 Ave for each method:
6.25 93.75 6.25 75.00 62.50 68.75 50.00 70.00 81.25 62.50 62.50 62.50 68.75 67.50 93.75 87.50 75.00 81.25 62.50 80.00 93.75 25.00 18.75 18.75 31.25 37.50 93.75 87.50 75.00 62.50 68.75 77.50 100.00 87.50 81.25 75.00 62.50 81.25 87.50 87.50 75.00 68.75 50.00 73.75
25.00 37.50 31.25 50.00 30.00 18.75 37.50 37.50 37.50 31.25 32.50 6.25 12.50 25.00 18.75 37.50 20.00 6.25 75.00 81.25 81.25 68.75 72.50 6.25 12.50 25.00 37.50 31.25 22.50 0.00 12.50 18.75 25.00 37.50 18.75 12.50 12.50 25.00 31.25 50.00 26.25
12.50 25.00 18.75 31.25 18.75 12.50 12.50 25.00 31.25 12.50 18.75 6.25 6.25 18.75 18.75 25.00 15.00 0.00 6.25 6.25 12.50 12.50 7.50 6.25 12.50 18.75 25.00 25.00 17.50 0.00 12.50 6.25 6.25 12.50 7.50 12.50 12.50 12.50 18.75 18.75 15.00

LMS (-), RLS (-), LPC (-) with attempts 1 2 3 4 5 Ave; Gaussian (4), Gaussian (16), Gaussian (64), Gaussian (256) with attempts 1 2 3 4 Ave:
6.25 93.75 6.25 81.25 18.75 18.75 75.00 25.00 12.50 68.75 31.25 18.75 62.50 37.50 25.00 76.25 23.75 16.25 93.75 6.25 6.25 81.25 18.75 18.75 62.50 37.50 25.00 81.25 18.75 18.75 68.75 31.25 25.00 77.50 22.50 18.75 37.50 62.50 18.75 68.75 31.25 18.75 62.50 31.50 25.00 62.50 37.50 25.00 37.50 62.50 25.00 53.75 46.25 22.50 81.25 18.75 0.00 61.00 39.00 18.75 71.25 28.75 6.25 71.25 28.75 6.25 71.19 28.81 7.813 70.75 29.25 0.00 60.75 39.25 12.50 65.75 34.25 6.25 55.75 44.25 25.00 63.25 36.75 10.94 51.25 48.75 12.50 32.75 67.25 12.50 51.25 48.75 18.75 56.00 44.00 6.25 52.19 47.81 12.50 77.50 22.50 6.25 62.00 38.00 18.75 56.75 43.25 25.00 62.00 38.00 18.75 64.56 35.44 17.19
