Identifying the Key Components in ResNet-50 for Diabetic Retinopathy Grading from Fundus Images : A Systematic Investigation

UBC Faculty Research and Publications

Identifying the Key Components in ResNet-50 for Diabetic Retinopathy Grading from Fundus Images : A Systematic Investigation Huang, Yijin; Lin, Li; Cheng, Pujin; Lyu, Junyan; Tam, Roger; Tang, Xiaoying

Abstract

Although deep learning-based diabetic retinopathy (DR) classification methods typically benefit from well-designed architectures of convolutional neural networks, the training setting also has a non-negligible impact on prediction performance. The training setting includes various interdependent components, such as an objective function, a data sampling strategy, and a data augmentation approach. To identify the key components in a standard deep learning framework (ResNet-50) for DR grading, we systematically analyze the impact of several major components. Extensive experiments are conducted on a publicly available dataset EyePACS. We demonstrate that (1) the DR grading framework is sensitive to input resolution, objective function, and composition of data augmentation; (2) using mean square error as the loss function can effectively improve the performance with respect to a task-specific evaluation metric, namely the quadratically weighted Kappa; (3) utilizing eye pairs boosts the performance of DR grading and; (4) using data resampling to address the problem of imbalanced data distribution in EyePACS hurts the performance. Based on these observations and an optimal combination of the investigated components, our framework, without any specialized network design, achieves a state-of-the-art result (0.8631 for Kappa) on the EyePACS test set (a total of 42,670 fundus images) with only image-level labels. We also examine the proposed training practices on other fundus datasets and other network architectures to evaluate their generalizability. Our codes and pre-trained model are available online.

Item Metadata

Title	Identifying the Key Components in ResNet-50 for Diabetic Retinopathy Grading from Fundus Images : A Systematic Investigation
Creator	Huang, Yijin; Lin, Li; Cheng, Pujin; Lyu, Junyan; Tam, Roger; Tang, Xiaoying
Publisher	Multidisciplinary Digital Publishing Institute
Date Issued	2023-05-09
Description	Although deep learning-based diabetic retinopathy (DR) classification methods typically benefit from well-designed architectures of convolutional neural networks, the training setting also has a non-negligible impact on prediction performance. The training setting includes various interdependent components, such as an objective function, a data sampling strategy, and a data augmentation approach. To identify the key components in a standard deep learning framework (ResNet-50) for DR grading, we systematically analyze the impact of several major components. Extensive experiments are conducted on a publicly available dataset EyePACS. We demonstrate that (1) the DR grading framework is sensitive to input resolution, objective function, and composition of data augmentation; (2) using mean square error as the loss function can effectively improve the performance with respect to a task-specific evaluation metric, namely the quadratically weighted Kappa; (3) utilizing eye pairs boosts the performance of DR grading and; (4) using data resampling to address the problem of imbalanced data distribution in EyePACS hurts the performance. Based on these observations and an optimal combination of the investigated components, our framework, without any specialized network design, achieves a state-of-the-art result (0.8631 for Kappa) on the EyePACS test set (a total of 42,670 fundus images) with only image-level labels. We also examine the proposed training practices on other fundus datasets and other network architectures to evaluate their generalizability. Our codes and pre-trained model are available online.
Subject	diabetic retinopathy; classification; training setting
Genre	Article
Type	Text
Language	eng
Date Available	2023-11-14
Provider	Vancouver : University of British Columbia Library
Rights	CC BY 4.0
DOI	10.14288/1.0437652
URI	http://hdl.handle.net/2429/86514
Affiliation	Applied Science, Faculty of; Non UBC; Biomedical Engineering, School of
Citation	Diagnostics 13 (10): 1664 (2023)
Publisher DOI	10.3390/diagnostics13101664
Peer Review Status	Reviewed
Scholarly Level	Faculty; Graduate
Rights URI	https://creativecommons.org/licenses/by/4.0/
Aggregated Source Repository	DSpace

Open Collections

UBC Faculty Research and Publications