Enhancing sewer asset management using machine learning algorithms

UBC Theses and Dissertations

Featured Collection

UBC Theses and Dissertations

Enhancing sewer asset management using machine learning algorithms Seng, Vannary

Abstract

Closed-circuit television (CCTV) is widely employed for assessing defects in sewers. This thesis: i) develops a number of deep convolutional neural network (CNN)-based image classification models for identifying sewer defects in CCTV videos, ii) develops a number of Machine Learning (ML)-based models for predicting the likely occurrence of specific defects in sewer networks, as well as identifying the factors that contribute to these defects, and iii) analyses the model performances with data from multiple utilities. The image classification models are trained and tested using images extracted from CCTV inspection videos for three utilities. They are analysed for serving two purposes: conducting a quality assurance and quality control (QA/QC) assessment of manual inspection reports of previously inspected videos, and identifying defects in images from videos that have not yet been manually inspected. Models are developed based on three pre-trained CNN backbone architectures: MobileNet V3 Large, ResNet50, and ResNet101; two model configurations: one- and two-stage models; and grouped classes. Two-stage models result in higher performance compared with alternative approaches. For QA/QC, on average, the ResNet models outperform MobileNet V3 Large where model accuracies for Utilities A, B, and C, are 0.95, 0.70, and 0.98, respectively. For recognizing defects in videos not previously viewed, the optimal model for each of the three utilities does not perform well. However, all models possess the ability to distinguish between non-defective and defective images with accuracies higher than 0.50, where the Utility A model achieves an accuracy of 0.80. Multiple decision tree-based ML models that predict the likely occurrence of infiltration and structural defects are developed using data from CCTV inspections coupled with additional pipe information and Geographical Information System files for two utilities. The performance of the models is assessed using the area under the Receiver Operating Characteristics (AUC-ROC) and Precision Recall (AUC-PR) curves. LightGBM-based models with cost-sensitive learning show the best performance overall when predicting infiltration and structural defects. The best performing model achieves an AUC-ROC of 0.79 and an AUC-PR of 0.62. For these utilities, an application of SHapley Additive exPlanations (SHAP) shows that the most important factors are “pipe location” and “pipe age”.

Item Metadata

Title	Enhancing sewer asset management using machine learning algorithms
Creator	Seng, Vannary
Supervisor	Lence, Barbara J.
Publisher	University of British Columbia
Date Issued	2024
Description	Closed-circuit television (CCTV) is widely employed for assessing defects in sewers. This thesis: i) develops a number of deep convolutional neural network (CNN)-based image classification models for identifying sewer defects in CCTV videos, ii) develops a number of Machine Learning (ML)-based models for predicting the likely occurrence of specific defects in sewer networks, as well as identifying the factors that contribute to these defects, and iii) analyses the model performances with data from multiple utilities. The image classification models are trained and tested using images extracted from CCTV inspection videos for three utilities. They are analysed for serving two purposes: conducting a quality assurance and quality control (QA/QC) assessment of manual inspection reports of previously inspected videos, and identifying defects in images from videos that have not yet been manually inspected. Models are developed based on three pre-trained CNN backbone architectures: MobileNet V3 Large, ResNet50, and ResNet101; two model configurations: one- and two-stage models; and grouped classes. Two-stage models result in higher performance compared with alternative approaches. For QA/QC, on average, the ResNet models outperform MobileNet V3 Large where model accuracies for Utilities A, B, and C, are 0.95, 0.70, and 0.98, respectively. For recognizing defects in videos not previously viewed, the optimal model for each of the three utilities does not perform well. However, all models possess the ability to distinguish between non-defective and defective images with accuracies higher than 0.50, where the Utility A model achieves an accuracy of 0.80. Multiple decision tree-based ML models that predict the likely occurrence of infiltration and structural defects are developed using data from CCTV inspections coupled with additional pipe information and Geographical Information System files for two utilities. The performance of the models is assessed using the area under the Receiver Operating Characteristics (AUC-ROC) and Precision Recall (AUC-PR) curves. LightGBM-based models with cost-sensitive learning show the best performance overall when predicting infiltration and structural defects. The best performing model achieves an AUC-ROC of 0.79 and an AUC-PR of 0.62. For these utilities, an application of SHapley Additive exPlanations (SHAP) shows that the most important factors are “pipe location” and “pipe age”.
Genre	Thesis/Dissertation
Type	Text
Language	eng
Date Available	2024-04-10
Provider	Vancouver : University of British Columbia Library
Rights	Attribution-NonCommercial-NoDerivatives 4.0 International
DOI	10.14288/1.0441279
URI	http://hdl.handle.net/2429/87751
Degree	Master of Applied Science - MASc
Program	Civil Engineering
Affiliation	Applied Science, Faculty of; Civil Engineering, Department of
Degree Grantor	University of British Columbia
Graduation Date	2024-05
Campus	UBCV
Scholarly Level	Graduate
Rights URI	http://creativecommons.org/licenses/by-nc-nd/4.0/
Aggregated Source Repository	DSpace

Open Collections

UBC Theses and Dissertations

UBC Theses and Dissertations

Enhancing sewer asset management using machine learning algorithms Seng, Vannary

Abstract

Item Metadata

Item Media

Item Citations and Data

Rights