- Library Home /
- Search Collections /
- Open Collections /
- Browse Collections /
- UBC Theses and Dissertations /
- Using Wasserstein distance and conditionality for stable...
Open Collections
UBC Theses and Dissertations
UBC Theses and Dissertations
Using Wasserstein distance and conditionality for stable and selective data augmentation in pavement crack analysis Shahrestani, Afshin
Abstract
Pavement crack detection and treatment is one of the vital tasks in road infrastructure management. Failure to do so in a proactive manner increases maintenance costs and results in a drop in the structural integrity of the pavement. Deep learning techniques promise to automate pavement crack detection, aiding proactive highway maintenance. This approach reduces costs, time, and human bias in defect analysis. One barrier to large-scale adoption of deep learning techniques in this area is the scarcity of high-quality, and balanced training datasets. Inherent data imbalances, where certain crack types predominate over others, lead to detection bias and overfitting in machine learning models. This is exacerbated by variations in crack types, skewing the model training towards more frequently occurring cracks, thereby diminishing efficacy in identifying less common types of cracks. Data augmentation is one of the possible solutions to these issues. Mode Collapse, training instability, and lack of agency in data generation were identified as the main challenges associated with previous pavement crack augmentation methods. To overcome the challenges, the thesis implements two Generative Adversarial Network (GAN) architectures based on the models Wasserstein GAN (WGAN), and Conditional WGAN (C-WGAN) developed in previous literature. These models aim to remedy the shortcomings of previous data augmentation methods used in pavement crack analysis. The research utilizes annotated images from the Crack500 and CrackForest datasets, categorized into transverse, longitudinal, block, and alligator crack types. These images are used as the training data for both the GANs and a baseline classifier, aimed at measuring the impact of synthetic crack images in augmented datasets. The findings demonstrate the effectiveness of WGAN and C-WGAN in addressing the limitations of previous augmentation methods, generating high-quality, diverse synthetic images. The GAN-augmented models achieved an average classification score improvement of 5\% over the baseline. C-WGAN offered the advantage of user-specific image generation. Both models proved to be effective data augmentation tools for pavement crack datasets, each with distinct advantages and potential trade-offs. This research contributes to the field by providing robust GAN-based solutions for enhancing pavement crack detection and classification.
Item Metadata
Title |
Using Wasserstein distance and conditionality for stable and selective data augmentation in pavement crack analysis
|
Creator | |
Supervisor | |
Publisher |
University of British Columbia
|
Date Issued |
2024
|
Description |
Pavement crack detection and treatment is one of the vital tasks in road infrastructure management. Failure to do so in a proactive manner increases maintenance costs and results in a drop in the structural integrity of the pavement. Deep learning techniques promise to automate pavement crack detection, aiding proactive highway maintenance. This approach reduces costs, time, and human bias in defect analysis. One barrier to large-scale adoption of deep learning techniques in this area is the scarcity of high-quality, and balanced training datasets. Inherent data imbalances, where certain crack types predominate over others, lead to detection bias and overfitting in machine learning models. This is exacerbated by variations in crack types, skewing the model training towards more frequently occurring cracks, thereby diminishing efficacy in identifying less common types of cracks. Data augmentation is one of the possible solutions to these issues.
Mode Collapse, training instability, and lack of agency in data generation were identified as the main challenges associated with previous pavement crack augmentation methods. To overcome the challenges, the thesis implements two Generative Adversarial Network (GAN) architectures based on the models Wasserstein GAN (WGAN), and Conditional WGAN (C-WGAN) developed in previous literature. These models aim to remedy the shortcomings of previous data augmentation methods used in pavement crack analysis.
The research utilizes annotated images from the Crack500 and CrackForest datasets, categorized into transverse, longitudinal, block, and alligator crack types. These images are used as the training data for both the GANs and a baseline classifier, aimed at measuring the impact of synthetic crack images in augmented datasets. The findings demonstrate the effectiveness of WGAN and C-WGAN in addressing the limitations of previous augmentation methods, generating high-quality, diverse synthetic images. The GAN-augmented models achieved an average classification score improvement of 5\% over the baseline. C-WGAN offered the advantage of user-specific image generation. Both models proved to be effective data augmentation tools for pavement crack datasets, each with distinct advantages and potential trade-offs. This research contributes to the field by providing robust GAN-based solutions for enhancing pavement crack detection and classification.
|
Genre | |
Type | |
Language |
eng
|
Date Available |
2024-10-01
|
Provider |
Vancouver : University of British Columbia Library
|
Rights |
Attribution-NoDerivatives 4.0 International
|
DOI |
10.14288/1.0445467
|
URI | |
Degree | |
Program | |
Affiliation | |
Degree Grantor |
University of British Columbia
|
Graduation Date |
2024-11
|
Campus | |
Scholarly Level |
Graduate
|
Rights URI | |
Aggregated Source Repository |
DSpace
|
Item Media
Item Citations and Data
Rights
Attribution-NoDerivatives 4.0 International