Multiclass object recognition inspired by the ventral visual pathway

UBC Theses and Dissertations

Featured Collection

UBC Theses and Dissertations

Multiclass object recognition inspired by the ventral visual pathway Mutch, James Vincent

Abstract

We describe a biologically-inspired system for classifying objects in still images. Our system learns to identify the class (car, person, etc.) of a previously-unseen instance of an object. As the primate visual system still outperforms computer vision systems on this task by a wide margin, we base our work on a model of the ventral visual pathway, thought to be primarily responsible for object recognition in cortex. Our model modifies that of Serre, Wolf, and Poggio, which hierarchically builds up feature selectivity and invariance to position and scale in a manner analogous to that of visual areas V1, V2, V4, and IT. As in that work, we first apply Gabor filters at all positions and scales; selectivity and invariance are then built up by alternating template matching and max pooling operations. We refine the approach in several biologically plausible ways, using simple versions of sparsification and lateral inhibition. We demonstrate the value of retaining some position and scale information above the intermediate feature level. Using feature selection we arrive at a model that performs better with fewer features. Our final model is tested on the Caltech 101 object categories and the UIUC car localization task, in both cases achieving state-of-the-art performance. The results strengthen the case for using this type of model in computer vision.

Item Metadata

Title	Multiclass object recognition inspired by the ventral visual pathway
Creator	Mutch, James Vincent
Publisher	University of British Columbia
Date Issued	2006
Description	We describe a biologically-inspired system for classifying objects in still images. Our system learns to identify the class (car, person, etc.) of a previously-unseen instance of an object. As the primate visual system still outperforms computer vision systems on this task by a wide margin, we base our work on a model of the ventral visual pathway, thought to be primarily responsible for object recognition in cortex. Our model modifies that of Serre, Wolf, and Poggio, which hierarchically builds up feature selectivity and invariance to position and scale in a manner analogous to that of visual areas V1, V2, V4, and IT. As in that work, we first apply Gabor filters at all positions and scales; selectivity and invariance are then built up by alternating template matching and max pooling operations. We refine the approach in several biologically plausible ways, using simple versions of sparsification and lateral inhibition. We demonstrate the value of retaining some position and scale information above the intermediate feature level. Using feature selection we arrive at a model that performs better with fewer features. Our final model is tested on the Caltech 101 object categories and the UIUC car localization task, in both cases achieving state-of-the-art performance. The results strengthen the case for using this type of model in computer vision.
Genre	Thesis/Dissertation
Type	Text
Language	eng
Date Available	2010-01-12
Provider	Vancouver : University of British Columbia Library
Rights	For non-commercial purposes only, such as research, private study and education. Additional conditions apply, see Terms of Use https://open.library.ubc.ca/terms_of_use.
DOI	10.14288/1.0051724
URI	http://hdl.handle.net/2429/18090
Degree	Master of Science - MSc
Program	Computer Science
Affiliation	Science, Faculty of; Computer Science, Department of
Degree Grantor	University of British Columbia
Graduation Date	2006-11
Campus	UBCV
Scholarly Level	Graduate
Aggregated Source Repository	DSpace

Item Media

ubc_2006-0594.pdf -- 12.63MB

Item Citations and Data

Rights

For non-commercial purposes only, such as research, private study and education. Additional conditions apply, see Terms of Use https://open.library.ubc.ca/terms_of_use.

Open Collections

UBC Theses and Dissertations

Multiclass object recognition inspired by the ventral visual pathway Mutch, James Vincent

Abstract

Item Metadata

Item Media

Item Citations and Data

Rights