- Library Home /
- Search Collections /
- Open Collections /
- Browse Collections /
- UBC Theses and Dissertations /
- A model for auditory lateralization in non-stationary...
Open Collections
UBC Theses and Dissertations
UBC Theses and Dissertations
A model for auditory lateralization in non-stationary multi-source environments Shu, Zhengjin
Abstract
The problem addressed in this thesis is the determination of the positions of sound sources in non-stationary multi-source environments. This problem is approached by developing models that mimic the processing of sounds by the auditory system. It is well known that in the localization process the auditory system utilizes interaural intensity and time differences (lID and ITD) and the interaural envelope delay (lED). However, the way such cues are estimated and organized by the auditory system in non-stationary multi-source situations is not known. It is argued in this thesis that the auditory localization process can be divided into three serial processing stages: decomposition, localization, and integration (DLI). Specifically, the signals detected by the two ears are first decomposed into their spectro-temporal distributions as represented in the neural activities of the auditory nerve fibers. Short-time spatial attributes, in terms of the localization cues, are then determined from energy concentrations in these distributions. A spatial scene of acoustic events is finally built by integrating the energy concentrations according to their spatial attributes. A unique DLI model is proposed in which short-time cue estimation is realized by a process of pattern recognition and comparison using neural networks, and the spatial scene is represented by short-time cue distributions. Three implementations of the DLI model, which model separate auditory pathways responsible for three different types of cue sensitivity (11D, lTD. and lED) observed in the auditory system, are developed, and their performance in estimating short-time cue distributions are studied by computer simulation. It is shown that there are unexplored patterns in the neural signals carried by the auditory nerve fibers that are important for auditory localization. These patterns are shown to contain good indications of interaural differences, and can be used to obtain robust short-time cue estimates by training neural networks that have relatively simple structures. Furthermore, while such networks can be trained using the simplest types of inputs, they show the ability to generalize and perform well with more complex stimuli. It is demonstrated that the model works well in noise and in non-stationary multi-source situations. The same model structure can be trained to estimate different localization cues, suggesting that the underlying structure of the different pathways responsible for different types of cue sensitivities in the auditory system may not necessarily be different. The receptive connection patterns of the hidden neurons in the model indicate that the spectro-temporal response properties of binaural neurons in the auditory system may play an important role in auditory localization, and that excitatory and inhibitory inputs to the binaural neurons play equally important roles in localization.
Item Metadata
Title |
A model for auditory lateralization in non-stationary multi-source environments
|
Creator | |
Publisher |
University of British Columbia
|
Date Issued |
1995
|
Description |
The problem addressed in this thesis is the determination of the positions of sound
sources in non-stationary multi-source environments. This problem is approached by
developing models that mimic the processing of sounds by the auditory system. It is
well known that in the localization process the auditory system utilizes interaural
intensity and time differences (lID and ITD) and the interaural envelope delay (lED).
However, the way such cues are estimated and organized by the auditory system in
non-stationary multi-source situations is not known.
It is argued in this thesis that the auditory localization process can be divided into
three serial processing stages: decomposition, localization, and integration (DLI).
Specifically, the signals detected by the two ears are first decomposed into their
spectro-temporal distributions as represented in the neural activities of the auditory
nerve fibers. Short-time spatial attributes, in terms of the localization cues, are then
determined from energy concentrations in these distributions. A spatial scene of
acoustic events is finally built by integrating the energy concentrations according to
their spatial attributes.
A unique DLI model is proposed in which short-time cue estimation is realized by
a process of pattern recognition and comparison using neural networks, and the
spatial scene is represented by short-time cue distributions. Three implementations of
the DLI model, which model separate auditory pathways responsible for three
different types of cue sensitivity (11D, lTD. and lED) observed in the auditory system,
are developed, and their performance in estimating short-time cue distributions are
studied by computer simulation.
It is shown that there are unexplored patterns in the neural signals carried by the
auditory nerve fibers that are important for auditory localization. These patterns are
shown to contain good indications of interaural differences, and can be used to obtain
robust short-time cue estimates by training neural networks that have relatively simple structures. Furthermore, while such networks can be trained using the
simplest types of inputs, they show the ability to generalize and perform well with
more complex stimuli. It is demonstrated that the model works well in noise and in
non-stationary multi-source situations.
The same model structure can be trained to estimate different localization cues,
suggesting that the underlying structure of the different pathways responsible for
different types of cue sensitivities in the auditory system may not necessarily be
different. The receptive connection patterns of the hidden neurons in the model
indicate that the spectro-temporal response properties of binaural neurons in the
auditory system may play an important role in auditory localization, and that
excitatory and inhibitory inputs to the binaural neurons play equally important roles
in localization.
|
Extent |
4239914 bytes
|
Genre | |
Type | |
File Format |
application/pdf
|
Language |
eng
|
Date Available |
2009-04-22
|
Provider |
Vancouver : University of British Columbia Library
|
Rights |
For non-commercial purposes only, such as research, private study and education. Additional conditions apply, see Terms of Use https://open.library.ubc.ca/terms_of_use.
|
DOI |
10.14288/1.0065019
|
URI | |
Degree | |
Program | |
Affiliation | |
Degree Grantor |
University of British Columbia
|
Graduation Date |
1995-11
|
Campus | |
Scholarly Level |
Graduate
|
Aggregated Source Repository |
DSpace
|
Item Media
Item Citations and Data
Rights
For non-commercial purposes only, such as research, private study and education. Additional conditions apply, see Terms of Use https://open.library.ubc.ca/terms_of_use.