UBC Theses and Dissertations

Convolutional neural network compression via tensor decomposition

Faltyn, Mateusz

Abstract

Computer vision models, such as image and video classifiers, are increasingly prevalent in Internet-of-Things systems. Since the advent of the AlexNet model in 2012, convolutional neural networks have proven highly effective at many computer vision tasks. However, their high computational and storage costs hinder the wider adoption of computer vision models on smaller Internet-of-Things devices such as mobile phones and embedded systems. As larger neural network models drive up hardware costs, industry and academia have jointly tackled the problem of compressing convolutional neural networks. Compression via tensor decomposition has been shown to reduce the memory and storage a device needs to perform computer vision tasks successfully. In this work, we first review the preliminaries of tensor decomposition and define the four major types of tensor decompositions and their associated decomposition algorithms in Chapter 1. We then introduce the building blocks of neural networks and describe convolutional neural networks in Chapter 2. Finally, in Chapter 3, we survey tensor decomposition approaches to convolutional neural network compression and present the results of two experiments on the PyTorch-TedNet CIFAR10 and CIFAR100 model benchmarks.
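To give a rough sense of the compression idea the abstract describes (this is an illustrative sketch, not the thesis's own implementation), the following PyTorch snippet applies a Tucker-2 style factorization to a single Conv2d layer, replacing it with a 1×1, k×k, 1×1 convolution sequence. The function name, the ranks r_in and r_out, and the layer sizes are hypothetical choices for illustration; the TedNet CIFAR10/CIFAR100 benchmarks mentioned above are not reproduced here.

```python
# Illustrative sketch (not from the thesis): Tucker-2 compression of one
# Conv2d layer in PyTorch. Ranks r_in and r_out are hypothetical choices;
# in practice they would come from a rank-selection heuristic or tuning.
import torch
import torch.nn as nn


def tucker2_compress_conv(conv: nn.Conv2d, r_in: int, r_out: int) -> nn.Sequential:
    """Replace `conv` with a 1x1 -> kxk -> 1x1 sequence obtained from a
    Tucker-2 factorization of its weight tensor (out_ch x in_ch x k x k)."""
    W = conv.weight.data                                  # (K, C, k, k)
    K, C, kh, kw = W.shape

    # Factor matrices from truncated SVDs of the mode-0 and mode-1 unfoldings.
    U_out, _, _ = torch.linalg.svd(W.reshape(K, -1), full_matrices=False)
    U_out = U_out[:, :r_out]                              # (K, r_out)
    U_in, _, _ = torch.linalg.svd(W.permute(1, 0, 2, 3).reshape(C, -1),
                                  full_matrices=False)
    U_in = U_in[:, :r_in]                                 # (C, r_in)

    # Core tensor: contract the weights with both factor matrices.
    core = torch.einsum('kchw,kr,cs->rshw', W, U_out, U_in)  # (r_out, r_in, k, k)

    first = nn.Conv2d(C, r_in, kernel_size=1, bias=False)
    middle = nn.Conv2d(r_in, r_out, kernel_size=(kh, kw),
                       stride=conv.stride, padding=conv.padding, bias=False)
    last = nn.Conv2d(r_out, K, kernel_size=1, bias=conv.bias is not None)

    first.weight.data = U_in.t().reshape(r_in, C, 1, 1)   # project input channels
    middle.weight.data = core                             # small spatial convolution
    last.weight.data = U_out.reshape(K, r_out, 1, 1)      # restore output channels
    if conv.bias is not None:
        last.bias.data = conv.bias.data.clone()

    return nn.Sequential(first, middle, last)


if __name__ == "__main__":
    conv = nn.Conv2d(64, 128, kernel_size=3, padding=1)
    compressed = tucker2_compress_conv(conv, r_in=16, r_out=32)
    x = torch.randn(1, 64, 32, 32)
    print(conv(x).shape, compressed(x).shape)             # both: (1, 128, 32, 32)
```

In this sketch the parameter count of the layer drops from K·C·k² to C·r_in + r_in·r_out·k² + r_out·K, which is the source of the memory and storage savings the abstract refers to, at the cost of an approximation error controlled by the chosen ranks.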

Rights

Attribution 4.0 International