Wide range of applications for machine-learning prediction models in orthopedic surgical outcome: a systematic review

Authors

  • Paul T Ogink Department of Orthopedic Surgery, University Medical Center Utrecht – Utrecht University, Utrecht, The Netherlands
  • Olivier Q Groot Department of Orthopedic Surgery, Orthopedic Oncology Service, Massachusetts General Hospital – Harvard Medical School, Boston, USA
  • Aditya V Karhade Department of Orthopedic Surgery, Orthopedic Oncology Service, Massachusetts General Hospital – Harvard Medical School, Boston, USA
  • Michiel E R Bongers Department of Orthopedic Surgery, Orthopedic Oncology Service, Massachusetts General Hospital – Harvard Medical School, Boston, USA
  • F Cumhur Oner Department of Orthopedic Surgery, University Medical Center Utrecht – Utrecht University, Utrecht, The Netherlands
  • Jorrit-Jan Verlaan Department of Orthopedic Surgery, University Medical Center Utrecht – Utrecht University, Utrecht, The Netherlands
  • Joseph H Schwab Department of Orthopedic Surgery, Orthopedic Oncology Service, Massachusetts General Hospital – Harvard Medical School, Boston, USA

DOI:

https://doi.org/10.1080/17453674.2021.1932928

Abstract

Background and purpose — Advancements in software and hardware have enabled the rise of clinical prediction models based on machine learning (ML) in orthopedic sur- gery. Given their growing popularity and their likely imple- mentation in clinical practice we evaluated which outcomes these new models have focused on and what methodologies are being employed.

Material and methods — We performed a systematic search in PubMed, Embase, and Cochrane Library for studies published up to June 18, 2020. Studies reporting on non-ML prediction models or non-orthopedic outcomes were excluded. After screening 7,138 studies, 59 studies reporting on 77 pre- diction models were included. We extracted data regarding outcome, study design, and reported performance metrics.

Results — Of the 77 identified ML prediction models the most commonly reported outcome domain was medi- cal management (17/77). Spinal surgery was the most com- monly involved orthopedic subspecialty (28/77). The most frequently employed algorithm was neural networks (42/77). Median size of datasets was 5,507 (IQR 635–26,364). The median area under the curve (AUC) was 0.80 (IQR 0.73– 0.86). Calibration was reported for 26 of the models and 14 provided decision-curve analysis.

Interpretation — ML prediction models have been developed for a wide variety of topics in orthopedics. Topics regarding medical management were the most commonly studied. Heterogeneity between studies is based on study size, algorithm, and time-point of outcome. Calibration and decision-curve analysis were generally poorly reported.

Downloads

Download data is not yet available.

Downloads

Additional Files

Published

2021-06-10

How to Cite

Ogink, P. T., Groot, O. Q. ., Karhade, A. V., Bongers, M. E. R., Oner, F. C., Verlaan, J.-J., & Schwab, J. H. (2021). Wide range of applications for machine-learning prediction models in orthopedic surgical outcome: a systematic review. Acta Orthopaedica, 92(5), 526–531. https://doi.org/10.1080/17453674.2021.1932928