The Effectiveness of Image Augmentation in Deep Learning Networks for Detecting COVID-19 : A Geometric Transformation Perspective

UBC Faculty Research and Publications

The Effectiveness of Image Augmentation in Deep Learning Networks for Detecting COVID-19 : A Geometric Transformation Perspective Elgendi, Mohamed; Nasir, Muhammad Umer; Tang, Qunfeng; Smith, David; Grenier, John-Paul; Batte, Catherine; Spieler, Bradley; Leslie, William Donald; Menon, Carlo; Fletcher, Richard Ribbon; Howard, Newton; Ward, Rabab Kreidieh; Parker, William; Nicolaou, Savvas

Abstract

Chest X-ray imaging technology used for the early detection and screening of COVID-19 pneumonia is both accessible worldwide and affordable compared to other non-invasive technologies. Additionally, deep learning methods have recently shown remarkable results in detecting COVID-19 on chest X-rays, making it a promising screening technology for COVID-19. Deep learning relies on a large amount of data to avoid overfitting. While overfitting can result in perfect modeling on the original training dataset, on a new testing dataset it can fail to achieve high accuracy. In the image processing field, an image augmentation step (i.e., adding more training data) is often used to reduce overfitting on the training dataset, and improve prediction accuracy on the testing dataset. In this paper, we examined the impact of geometric augmentations as implemented in several recent publications for detecting COVID-19. We compared the performance of 17 deep learning algorithms with and without different geometric augmentations. We empirically examined the influence of augmentation with respect to detection accuracy, dataset diversity, augmentation methodology, and network size. Contrary to expectation, our results show that the removal of recently used geometrical augmentation steps actually improved the Matthews correlation coefficient (MCC) of 17 models. The MCC without augmentation (MCC = 0.51) outperformed four recent geometrical augmentations (MCC = 0.47 for Data Augmentation 1, MCC = 0.44 for Data Augmentation 2, MCC = 0.48 for Data Augmentation 3, and MCC = 0.49 for Data Augmentation 4). When we retrained a recently published deep learning without augmentation on the same dataset, the detection accuracy significantly increased, with a χ 2 McNemar′s statistic = 163.2 and a p-value of 2.23 × 10−37. This is an interesting finding that may improve current deep learning algorithms using geometrical augmentations for detecting COVID-19. We also provide clinical perspectives on geometric augmentation to consider regarding the development of a robust COVID-19 X-ray-based detector.

Item Metadata

Title	The Effectiveness of Image Augmentation in Deep Learning Networks for Detecting COVID-19 : A Geometric Transformation Perspective
Creator	Elgendi, Mohamed; Nasir, Muhammad Umer; Tang, Qunfeng; Smith, David; Grenier, John-Paul; Batte, Catherine; Spieler, Bradley; Leslie, William Donald; Menon, Carlo; Fletcher, Richard Ribbon; Howard, Newton; Ward, Rabab Kreidieh; Parker, William; Nicolaou, Savvas
Publisher	Frontiers
Date Issued	2021-03-01
Description	Chest X-ray imaging technology used for the early detection and screening of COVID-19 pneumonia is both accessible worldwide and affordable compared to other non-invasive technologies. Additionally, deep learning methods have recently shown remarkable results in detecting COVID-19 on chest X-rays, making it a promising screening technology for COVID-19. Deep learning relies on a large amount of data to avoid overfitting. While overfitting can result in perfect modeling on the original training dataset, on a new testing dataset it can fail to achieve high accuracy. In the image processing field, an image augmentation step (i.e., adding more training data) is often used to reduce overfitting on the training dataset, and improve prediction accuracy on the testing dataset. In this paper, we examined the impact of geometric augmentations as implemented in several recent publications for detecting COVID-19. We compared the performance of 17 deep learning algorithms with and without different geometric augmentations. We empirically examined the influence of augmentation with respect to detection accuracy, dataset diversity, augmentation methodology, and network size. Contrary to expectation, our results show that the removal of recently used geometrical augmentation steps actually improved the Matthews correlation coefficient (MCC) of 17 models. The MCC without augmentation (MCC = 0.51) outperformed four recent geometrical augmentations (MCC = 0.47 for Data Augmentation 1, MCC = 0.44 for Data Augmentation 2, MCC = 0.48 for Data Augmentation 3, and MCC = 0.49 for Data Augmentation 4). When we retrained a recently published deep learning without augmentation on the same dataset, the detection accuracy significantly increased, with a χ 2 McNemar′s statistic = 163.2 and a p-value of 2.23 × 10−37. This is an interesting finding that may improve current deep learning algorithms using geometrical augmentations for detecting COVID-19. We also provide clinical perspectives on geometric augmentation to consider regarding the development of a robust COVID-19 X-ray-based detector.
Subject	radiology; corona virus; transfer learning; data augmentation; chest X-ray; digital health; machine learning; artificial intelligence
Genre	Article
Type	Text
Language	eng
Date Available	2021-06-02
Provider	Vancouver : University of British Columbia Library
Rights	Attribution 4.0 International
DOI	10.14288/1.0398233
URI	http://hdl.handle.net/2429/78511
Affiliation	Applied Science, Faculty of; Medicine, Faculty of; Non UBC; Electrical and Computer Engineering, Department of; Radiology, Department of
Citation	Elgendi M, Nasir MU, Tang Q, Smith D, Grenier J-P, Batte C, Spieler B, Leslie WD, Menon C, Fletcher RR, Howard N, Ward R, Parker W and Nicolaou S (2021) The Effectiveness of Image Augmentation in Deep Learning Networks for Detecting COVID-19: A Geometric Transformation Perspective. Front. Med. 8:629134.
Publisher DOI	10.3389/fmed.2021.629134
Peer Review Status	Reviewed
Scholarly Level	Faculty; Researcher
Rights URI	http://creativecommons.org/licenses/by/4.0/
Aggregated Source Repository	DSpace

Open Collections

UBC Faculty Research and Publications

Abstract

Item Metadata

Item Media

Item Citations and Data

Rights