- Library Home /
- Search Collections /
- Open Collections /
- Browse Collections /
- UBC Theses and Dissertations /
- Video-based human fall detection in indoor spaces for...
Open Collections
UBC Theses and Dissertations
UBC Theses and Dissertations
Video-based human fall detection in indoor spaces for health monitoring Shojaei Hashemi, Anahita
Abstract
Remote health monitoring is evolving from a luxury to a necessity in residential buildings and smart homes. As the early detection of fall incidents can prevent mortality and alleviate morbidity, automatic fall detection systems are crucial for remote health monitoring. Providing rich content at an affordable cost, video cameras have become prevalent for fall detection. Most video-based fall detection models use classic machine learning, relying on handcrafted that limit their performance and generalization capability. In this thesis, we propose three novel video-based fall detection methods to solve the mentioned weaknesses, while considering privacy concerns associated with video data. Using deep learning, our models automatically learn the most discriminative features without resorting to definitions and assumptions. The first method is among the early video-based fall detection algorithms that employ deep learning. Consisting of a long short-term memory (LSTM) neural network, it works with depth cameras and uses the three-dimensional (3D) locations of major body joints to address privacy concerns. The second method is designed to operate with RGB cameras, targeting applications that require the identity of the fallen person to provide personalized medical help. It is comprised of a two-dimensional (2D) convolutional neural network (CNN) that extracts features from the frames and feeds them to an LSTM neural network. To address privacy concerns, edge computing is essential for models working with RGB videos to avoid video transmission. In designing our third method, we paid specific attention to this aspect by keeping the computational complexity of the model so low that it can be implemented on cameras built-in boards without losing its real-time performance. Spatial features extracted from the frames by a lightweight, yet powerful 2D CNN are sequentially processed by a transformer. Evaluations prove the competing performance of our models with regards to the state-of-the-art methods. We also present an embedded system prototype we have built to verify the capability of our RGB models to perform in real time on limited computational resource. Finally, we introduce the UBC-DML Fall Dataset, a large comprehensive video dataset we have captured to address the shortcomings in the current public datasets for fall detection.
Item Metadata
Title |
Video-based human fall detection in indoor spaces for health monitoring
|
Creator | |
Supervisor | |
Publisher |
University of British Columbia
|
Date Issued |
2023
|
Description |
Remote health monitoring is evolving from a luxury to a necessity in residential buildings and smart homes. As the early detection of fall incidents can prevent mortality and alleviate morbidity, automatic fall detection systems are crucial for remote health monitoring. Providing rich content at an affordable cost, video cameras have become prevalent for fall detection. Most video-based fall detection models use classic machine learning, relying on handcrafted that limit their performance and generalization capability.
In this thesis, we propose three novel video-based fall detection methods to solve the mentioned weaknesses, while considering privacy concerns associated with video data. Using deep learning, our models automatically learn the most discriminative features without resorting to definitions and assumptions. The first method is among the early video-based fall detection algorithms that employ deep learning. Consisting of a long short-term memory (LSTM) neural network, it works with depth cameras and uses the three-dimensional (3D) locations of major body joints to address privacy concerns. The second method is designed to operate with RGB cameras, targeting applications that require the identity of the fallen person to provide personalized medical help. It is comprised of a two-dimensional (2D) convolutional neural network (CNN) that extracts features from the frames and feeds them to an LSTM neural network. To address privacy concerns, edge computing is essential for models working with RGB videos to avoid video transmission. In designing our third method, we paid specific attention to this aspect by keeping the computational complexity of the model so low that it can be implemented on cameras built-in boards without losing its real-time performance. Spatial features extracted from the frames by a lightweight, yet powerful 2D CNN are sequentially processed by a transformer. Evaluations prove the competing performance of our models with regards to the state-of-the-art methods.
We also present an embedded system prototype we have built to verify the capability of our RGB models to perform in real time on limited computational resource. Finally, we introduce the UBC-DML Fall Dataset, a large comprehensive video dataset we have captured to address the shortcomings in the current public datasets for fall detection.
|
Genre | |
Type | |
Language |
eng
|
Date Available |
2025-04-30
|
Provider |
Vancouver : University of British Columbia Library
|
Rights |
Attribution-NonCommercial-NoDerivatives 4.0 International
|
DOI |
10.14288/1.0431598
|
URI | |
Degree | |
Program | |
Affiliation | |
Degree Grantor |
University of British Columbia
|
Graduation Date |
2023-11
|
Campus | |
Scholarly Level |
Graduate
|
Rights URI | |
Aggregated Source Repository |
DSpace
|
Item Media
Item Citations and Data
Rights
Attribution-NonCommercial-NoDerivatives 4.0 International