Generalizable deep learning models for epithelial ovarian carcinoma classification

UBC Theses and Dissertations

Generalizable deep learning models for epithelial ovarian carcinoma classification Boschman, Jeffrey

Abstract

Ovarian carcinoma is the deadliest cancer of the female reproductive system in North America. There are five major histological subtypes (histotypes), which require different treatments. Pathologists diagnose these histotypes by examining hematoxylin and eosin (H&E)-stained whole slide images (WSIs) of tissue. However, histotype diagnosis is not simple, with poor interobserver agreement between general pathologists (Cohen’s kappa 0.54-0.67). We hypothesize that the latest machine learning (ML)-based image classification models may be able to recognize ovarian carcinoma histotype sufficiently well that they could aid pathologists in diagnosis. However, the color variation of H&E-stained tissues, especially those from different centers/hospitals, is a longstanding challenge for applications of artificial intelligence (AI) in digital pathology. First, we investigate eight color normalization algorithms as a preprocessing step for AI-based classification. Using multiple datasets of different cancer types, reference images, and cross-validation splits, we show that color normalization significantly improves the classification accuracy of WSIs when the train and test data are from separate institutions (ovarian cancer: 0.25 AUC increase, p = 1.6e-05; pleural cancer: 0.21 AUC increase, p = 1.4e-10). Furthermore, we introduce a novel augmentation strategy that mixes color-normalized images from three easily accessible algorithms and consistently improves the diagnosis of test images from external institutions. Second, we train four different deep convolutional neural networks to automatically classify H&E-stained images of epithelial ovarian carcinoma using the largest training dataset to date (948 slides corresponding to 485 patients). Performance is assessed on an independent test set of 60 patients from another institution. The best-performing model achieves a mean diagnostic concordance of 80.97% (Cohen’s kappa 0.7547). In addition, in 4 of 8 cases from the external dataset that were misclassified by the ML model, two expert subspecialty pathologists rendered diagnoses, based on blind review of the WSIs, that agree with the ML classification rather than the integrated reference diagnosis. Our results indicate that color normalization can reliably improve AI-based diagnosis of WSIs sourced from multiple centers, and specifically that an ML-based ovarian carcinoma classifier is ready for clinical validation studies as an adjunct for informing histotype diagnosis, thereby supporting histotype-specific ovarian cancer treatment and accordingly reducing the deadliness of this disease.
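
The abstract reports agreement as Cohen’s kappa, both between pathologists and between the model and the reference diagnosis. As a reading aid, the following is a minimal sketch of how that statistic is computed from two sets of labels. It is not the thesis code: the function name `cohens_kappa`, the histotype abbreviations, and the eight toy cases are illustrative assumptions and do not come from the thesis data.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two equal-length sequences of categorical labels."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("label sequences must be non-empty and equal in length")
    n = len(rater_a)

    # Observed agreement: fraction of cases where both raters give the same label.
    p_observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n

    # Chance agreement: for each label, the product of the two raters'
    # marginal frequencies, summed over all labels.
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    p_chance = sum(counts_a[label] * counts_b[label] for label in counts_a) / (n * n)

    if p_chance == 1.0:
        # Both raters used one identical label for every case.
        return 1.0
    return (p_observed - p_chance) / (1.0 - p_chance)


# Toy example (made-up labels, not thesis results): a reference diagnosis
# and a model prediction for eight hypothetical cases.
reference = ["HGSC", "HGSC", "CCC", "ENOC", "MUC", "LGSC", "HGSC", "CCC"]
predicted = ["HGSC", "HGSC", "CCC", "ENOC", "MUC", "HGSC", "HGSC", "ENOC"]
kappa = cohens_kappa(reference, predicted)
print(f"kappa = {kappa:.4f}")
```

Kappa discounts the agreement expected by chance from each rater's label frequencies, which is why it is lower than raw concordance when one histotype dominates the dataset.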

Item Metadata

Title	Generalizable deep learning models for epithelial ovarian carcinoma classification
Creator	Boschman, Jeffrey
Supervisor	Bashashati, Ali
Publisher	University of British Columbia
Date Issued	2022
Genre	Thesis/Dissertation
Type	Text
Language	eng
Date Available	2022-08-03
Provider	Vancouver : University of British Columbia Library
Rights	Attribution-NonCommercial-NoDerivatives 4.0 International
DOI	10.14288/1.0416550
URI	http://hdl.handle.net/2429/82237
Degree	Master of Applied Science - MASc
Program	Biomedical Engineering
Affiliation	Applied Science, Faculty of; Biomedical Engineering, School of
Degree Grantor	University of British Columbia
Graduation Date	2022-11
Campus	UBCV
Scholarly Level	Graduate
Rights URI	http://creativecommons.org/licenses/by-nc-nd/4.0/
Aggregated Source Repository	DSpace
