Beyond traditional single object tracking

UBC Theses and Dissertations

Beyond traditional single object tracking Abdelaziz, Omar

Abstract

Single object tracking is a crucial yet challenging task in computer vision. In this task, a model is given the appearance of an arbitrary object in a sequence of frames and is required to track that object through all frames of the sequence. Traditionally, discriminative correlation filters and Siamese convolutional networks have dominated the field, but they struggle in challenging scenarios such as occlusion and long-term tracking. More recently, techniques backed by advances in machine learning theory, such as masked image modelling, generative adversarial networks, variational autoencoders, and diffusion models, are increasingly being leveraged in single object tracking to provide more customized representations that capture the unique characteristics of the target object. This thesis explores these recent advancements through three novel contributions. First, we comprehensively survey emerging trends in single object tracking, including sequence models, generative models, and self-supervised learning, and propose a novel categorization of single object tracking methods. We further compare existing approaches and identify promising directions for future research. The second contribution focuses on the critical role of bounding box regression within single object tracking. We argue that leveraging the receptive field of convolutional networks is essential for accurate object localization, and we introduce two novel bounding box regression networks that outperform existing methods on benchmark datasets. The third contribution addresses the persistent performance gap between training and test data in single object tracking, particularly for transformer-based trackers. We propose the Deformable Masking Tracker (DMTrack), which injects deformable convolution into the Vision Transformer (ViT) architecture. DMTrack improves the robustness of attentional features, leading to a significant performance boost of up to 2% across seven tracking benchmarks. Overall, this thesis contributes to the field of single object tracking by offering a comprehensive analysis of novel techniques, demonstrating the importance of bounding box regression, and proposing a novel solution to bridge the performance gap between training and test data. These findings pave the way for further advancements in robust and generalizable single object tracking algorithms.

Item Metadata

Title	Beyond traditional single object tracking
Creator	Abdelaziz, Omar
Supervisor	Shehata, Mohamed S.; Abdelpakey, Mohamed H.
Publisher	University of British Columbia
Date Issued	2024
Genre	Thesis/Dissertation
Type	Text
Language	eng
Date Available	2024-06-20
Provider	Vancouver : University of British Columbia Library
Rights	Attribution-NonCommercial-NoDerivatives 4.0 International
DOI	10.14288/1.0444000
URI	http://hdl.handle.net/2429/88494
Degree	Master of Science - MSc
Program	Computer Science
Affiliation	Science, Irving K. Barber Faculty of (Okanagan); Computer Science, Mathematics, Physics and Statistics, Department of (Okanagan)
Degree Grantor	University of British Columbia
Graduation Date	2024-09
Campus	UBCO
Scholarly Level	Graduate
Rights URI	http://creativecommons.org/licenses/by-nc-nd/4.0/
Aggregated Source Repository	DSpace
