Predicting postoperative surgical site infection with administrative data: a random forests algorithm

UBC Faculty Research and Publications

Petrosyan, Yelena; Thavorn, Kednapa; Smith, Glenys; Maclure, Malcolm; Preston, Roanne; van Walraven, Carl; Forster, Alan J.

Abstract

Background: Because primary data collection can be time-consuming and expensive, surgical site infections (SSIs) could ideally be monitored using routinely collected administrative data. We derived and internally validated efficient machine learning algorithms to identify SSIs within 30 days after surgery from health administrative data.

Methods: All patients enrolled in the National Surgical Quality Improvement Program from the Ottawa Hospital were linked to administrative datasets in Ontario, Canada. Machine learning approaches, including a Random Forests algorithm and high-performance logistic regression, were used to derive parsimonious models to predict SSI status. Finally, a risk score methodology was used to transform the final models into a risk score system. The SSI risk models were validated in the validation datasets.

Results: Of 14,351 patients, 795 (5.5%) had an SSI. First, separate predictive models were built for three distinct administrative datasets. The final model, including hospitalization diagnostic, physician diagnostic and procedure codes, demonstrated excellent discrimination (C statistic, 0.91; 95% CI, 0.90–0.92) and calibration (Hosmer-Lemeshow χ2 statistic, 4.531; p = 0.402).

Conclusion: We demonstrated that health administrative data can be effectively used to identify SSIs. Machine learning algorithms showed a high degree of accuracy in predicting postoperative SSIs and can integrate and utilize a large amount of administrative data. External validation of this model is required before it can be routinely used to identify SSIs.
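For readers unfamiliar with the two performance measures named in the abstract, the following is a minimal Python sketch of how a C statistic (discrimination) and a Hosmer-Lemeshow chi-square (calibration) can be computed from predicted probabilities. It is not the authors' code and does not reproduce their results: the data are simulated, and the function names (c_statistic, hosmer_lemeshow, simulate) and the choice of ten risk groups are assumptions made for illustration only.

```python
import math
import random


def c_statistic(y_true, y_prob):
    """Share of (case, non-case) pairs in which the case has the higher
    predicted probability; tied pairs count as one half."""
    pos = [p for y, p in zip(y_true, y_prob) if y == 1]
    neg = [p for y, p in zip(y_true, y_prob) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one case and one non-case")
    concordant = 0.0
    for p in pos:
        for q in neg:
            if p > q:
                concordant += 1.0
            elif p == q:
                concordant += 0.5
    return concordant / (len(pos) * len(neg))


def hosmer_lemeshow(y_true, y_prob, groups=10):
    """Hosmer-Lemeshow chi-square over risk-ordered groups of near-equal size."""
    pairs = sorted(zip(y_prob, y_true))
    n = len(pairs)
    chi2 = 0.0
    for g in range(groups):
        chunk = pairs[g * n // groups:(g + 1) * n // groups]
        if not chunk:
            continue
        observed = sum(y for _, y in chunk)   # observed events in the group
        expected = sum(p for p, _ in chunk)   # sum of predicted probabilities
        # n_g * mean_p * (1 - mean_p), written in terms of the expected count
        denom = expected * (1.0 - expected / len(chunk))
        if denom > 0:
            chi2 += (observed - expected) ** 2 / denom
    return chi2


def simulate(n=2000, seed=0):
    """Simulated outcomes drawn from known probabilities (illustration only)."""
    rng = random.Random(seed)
    y_true, y_prob = [], []
    for _ in range(n):
        x = rng.gauss(0.0, 1.0)
        p = 1.0 / (1.0 + math.exp(-(-3.0 + 1.5 * x)))
        y_true.append(1 if rng.random() < p else 0)
        y_prob.append(p)
    return y_true, y_prob


if __name__ == "__main__":
    y, p = simulate()
    print("C statistic:", round(c_statistic(y, p), 3))
    print("Hosmer-Lemeshow chi-square:", round(hosmer_lemeshow(y, p), 3))
```

A C statistic near 1 means cases are almost always ranked above non-cases, while a small Hosmer-Lemeshow chi-square means observed and expected event counts agree within each risk group. Turning the chi-square into a p-value requires a chi-square distribution function, which is left out here to keep the sketch dependency-free.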

Item Metadata

Title	Predicting postoperative surgical site infection with administrative data: a random forests algorithm
Creator	Petrosyan, Yelena; Thavorn, Kednapa; Smith, Glenys; Maclure, Malcolm; Preston, Roanne; van Walraven, Carl; Forster, Alan J.
Publisher	BioMed Central
Date Issued	2021-08-28
Subject	Surgical site infection; Administrative data; Machine learning; Random forests; Data mining; Predictive modeling
Genre	Article
Type	Text
Language	eng
Date Available	2021-10-04
Provider	Vancouver : University of British Columbia Library
Rights	Attribution 4.0 International (CC BY 4.0)
DOI	10.14288/1.0402417
URI	http://hdl.handle.net/2429/79879
Affiliation	Medicine, Faculty of; Non UBC; Anesthesiology, Pharmacology and Therapeutics, Department of
Citation	BMC Medical Research Methodology. 2021 Aug 28;21(1):179
Publisher DOI	10.1186/s12874-021-01369-9
Peer Review Status	Reviewed
Scholarly Level	Faculty; Researcher
Copyright Holder	The Author(s)
Rights URI	http://creativecommons.org/licenses/by/4.0/
Aggregated Source Repository	DSpace
