Solving for relative orientation and depth

UBC Theses and Dissertations

Featured Collection

UBC Theses and Dissertations

Solving for relative orientation and depth McReynolds, Daniel Peter

Abstract

A translating and rotating camera acquires images of a static scene from disparate viewpoints. Given a minimum number of corresponding image tokens from two or more images, the rotation and translation (referred to as the relative orientation) of the camera, and the relative depths of the tokens, can be computed between the views. Image tokens, which can be points or lines, are deemed corresponding if they arise from the same physical entity. The process of determining corresponding tokens is assumed to be known. This thesis poses the problem of solving for relative orientation and depth. The solution to this problem has applications in building object or scene models from multiple views and in passive navigation. A minimum of five corresponding pairs of points, measured from two perspective projection images, are required. In the case of lines, the measurements of six corresponding sets of lines, from a minimum of three images, are necessary for a unique solution. The algorithm simultaneously determines the rotation and translation of the camera between the images and the three dimensional position of the matched scene tokens. The translation and depth can only be determined up to a global scale factor. It is assumed that the motion of the camera or object is rigid. The equations that describe the motion and scene structure are nonlinear in the camera model parameters. Newton's method is used to perform the nonlinear parameter estimation. The camera model is based on the collinearity condition, well known in the area of photogrammetry. This condition states that the focal point, image point and object point must all lie on the same line under perspective projection. This approach differs from other similar approaches in the choice of model parameters and in the formulation of the collinearity condition. The formulation for line correspondences turns out to be an easy extension of the point formulation. To improve the range of convergence and the robustness of the algorithm, the Levenberg-Marquardt extension for discrete least squares is applied to the over-determined system. Results from a Monte Carlo simulation with noise-free images indicate a range of convergence for rotation of approximately plus or minus sixty degrees. Simulations with noisy images (image measurements perturbed by plus or minus three pixels) yield a range of convergence for rotation of approximately plus or minus forty degrees. The range of convergence in translation is in the order of twice the length of the translation vector in any direction parallel to the image plane. For translation orthogonal to the image plane, the range of convergence is approximately eighty percent of the length of the translation vector. Simulations with the same noisy images generated for the rotation tests, indicate that the range of convergence for translation is not appreciably affected by these noise levels. Tests with real images, in which a 3D model of an object is derived from multiple views with human assistance in determining line correspondences, yield results that agree reasonably well with the noisy image simulations.

Item Metadata

Title	Solving for relative orientation and depth
Creator	McReynolds, Daniel Peter
Publisher	University of British Columbia
Date Issued	1988
Description	A translating and rotating camera acquires images of a static scene from disparate viewpoints. Given a minimum number of corresponding image tokens from two or more images, the rotation and translation (referred to as the relative orientation) of the camera, and the relative depths of the tokens, can be computed between the views. Image tokens, which can be points or lines, are deemed corresponding if they arise from the same physical entity. The process of determining corresponding tokens is assumed to be known. This thesis poses the problem of solving for relative orientation and depth. The solution to this problem has applications in building object or scene models from multiple views and in passive navigation. A minimum of five corresponding pairs of points, measured from two perspective projection images, are required. In the case of lines, the measurements of six corresponding sets of lines, from a minimum of three images, are necessary for a unique solution. The algorithm simultaneously determines the rotation and translation of the camera between the images and the three dimensional position of the matched scene tokens. The translation and depth can only be determined up to a global scale factor. It is assumed that the motion of the camera or object is rigid. The equations that describe the motion and scene structure are nonlinear in the camera model parameters. Newton's method is used to perform the nonlinear parameter estimation. The camera model is based on the collinearity condition, well known in the area of photogrammetry. This condition states that the focal point, image point and object point must all lie on the same line under perspective projection. This approach differs from other similar approaches in the choice of model parameters and in the formulation of the collinearity condition. The formulation for line correspondences turns out to be an easy extension of the point formulation. To improve the range of convergence and the robustness of the algorithm, the Levenberg-Marquardt extension for discrete least squares is applied to the over-determined system. Results from a Monte Carlo simulation with noise-free images indicate a range of convergence for rotation of approximately plus or minus sixty degrees. Simulations with noisy images (image measurements perturbed by plus or minus three pixels) yield a range of convergence for rotation of approximately plus or minus forty degrees. The range of convergence in translation is in the order of twice the length of the translation vector in any direction parallel to the image plane. For translation orthogonal to the image plane, the range of convergence is approximately eighty percent of the length of the translation vector. Simulations with the same noisy images generated for the rotation tests, indicate that the range of convergence for translation is not appreciably affected by these noise levels. Tests with real images, in which a 3D model of an object is derived from multiple views with human assistance in determining line correspondences, yield results that agree reasonably well with the noisy image simulations.
Genre	Thesis/Dissertation
Type	Text
Language	eng
Date Available	2010-08-31
Provider	Vancouver : University of British Columbia Library
Rights	For non-commercial purposes only, such as research, private study and education. Additional conditions apply, see Terms of Use https://open.library.ubc.ca/terms_of_use.
DOI	10.14288/1.0051262
URI	http://hdl.handle.net/2429/28023
Degree (Theses)	Master of Science - MSc
Program (Theses)	Computer Science
Affiliation	Science, Faculty of; Computer Science, Department of
Degree Grantor	University of British Columbia
Campus	UBCV
Scholarly Level	Graduate
Aggregated Source Repository	DSpace

Item Media

UBC_1988_A6_7 M38.pdf -- 7.37MB

Item Citations and Data

Rights

For non-commercial purposes only, such as research, private study and education. Additional conditions apply, see Terms of Use https://open.library.ubc.ca/terms_of_use.

Open Collections

UBC Theses and Dissertations

Solving for relative orientation and depth McReynolds, Daniel Peter

Abstract

Item Metadata

Item Media

Item Citations and Data

Rights