Using machine learning to predict internal rotation after anatomic and reverse total shoulder arthroplasty

Vikas Kumar, KenSci, Seattle, WA, USA.
Bradley S. Schoch, Department of Orthopedic Surgery, Mayo Clinic, Jacksonville, FL, USA.
Christine Allen, KenSci, Seattle, WA, USA.
Steve Overman, KenSci, Seattle, WA, USA; University of Washington School of Medicine, Seattle, WA, USA.
Ankur Teredesai, University of Washington School of Medicine, Seattle, WA, USA.
William Aibinder, Department of Orthopaedic Surgery and Rehabilitation Medicine, Downstate Health Sciences University, Brooklyn, NY, USA.
Moby Parsons, The Knee Hip and Shoulder Center, Portsmouth, NH, USA.
Jonathan Watling, Maine Health, Portland, ME, USA.
Jiawei Kevin Ko, Orthopedic Physician Associates, Swedish Orthopedic Institute, Seattle, WA, USA.
Bruno Gobbato, IDOMED, Jaraguá do Sul, Brazil.
Thomas Throckmorton, Department of Orthopaedic Surgery, University of Tennessee-Campbell Clinic, Memphis, TN, USA.
Howard Routman, Atlantis Orthopedics, Palm Beach Gardens, FL, USA.
Christopher Roche, Exactech Inc., Gainesville, FL, USA. Electronic address: Chris.Roche@exac.com.

Abstract

BACKGROUND: Improvement in internal rotation (IR) after anatomic (aTSA) and reverse (rTSA) total shoulder arthroplasty is difficult to predict, with rTSA patients experiencing greater variability and more limited IR improvements than aTSA patients. The purpose of this study is to quantify and compare the IR score for aTSA and rTSA patients and create supervised machine learning that predicts IR after aTSA and rTSA at multiple postoperative time points. METHODS: Clinical data from 2270 aTSA and 4198 rTSA patients were analyzed using 3 supervised machine learning techniques to create predictive models for internal rotation as measured by the IR score at 6 postoperative time points. Predictions were performed using the full input feature set and 2 minimal input feature sets. The mean absolute error (MAE) quantified the difference between actual and predicted IR scores for each model at each time point. The predictive accuracy of the XGBoost algorithm was also quantified by its ability to distinguish which patients would achieve clinical improvement greater than the minimal clinically important difference (MCID) and substantial clinical benefit (SCB) patient satisfaction thresholds for IR score at 2-3 years after surgery. RESULTS: rTSA patients had significantly lower mean IR scores and significantly less mean IR score improvement than aTSA patients at each postoperative time point. Both aTSA and rTSA patients experienced significant improvements in their ability to perform activities of daily living (ADLs); however, aTSA patients were significantly more likely to perform these ADLs. Using a minimal feature set of preoperative inputs, our machine learning algorithms had equivalent accuracy when predicting IR score for both aTSA (0.92-1.18 MAE) and rTSA (1.03-1.25 MAE) from 3 months to >5 years after surgery. Furthermore, these predictive algorithms identified with 90% accuracy for aTSA and 85% accuracy for rTSA which patients will achieve MCID IR score improvement and predicted with 85% accuracy for aTSA patients and 77% accuracy for rTSA which patients will achieve SCB IR score improvement at 2-3 years after surgery. DISCUSSION: Our machine learning study demonstrates that active internal rotation can be accurately predicted after aTSA and rTSA at multiple postoperative time points using a minimal feature set of preoperative inputs. These predictive algorithms accurately identified which patients will, and will not, achieve clinical improvement in IR score that exceeds the MCID and SCB patient satisfaction thresholds.