Predicting total knee replacement in knee osteoarthritis using a machine learning-guided approach in patients of the Osteoarthritis Initiative (OAI)

Loading...
Thumbnail Image

Identifiers

Publication date

Authors

Oreiro Villar, Natividad
Vázquez-García, Jorge
Morano Torres, Antonio
Balboa-Barreiro, Vanesa
Rodríguez-Valle, Isabel
Relaño, Sara
Veronese, Nicola
de Andrés, María C.
Rego-Pérez, I.

Advisors

Other responsabilities

Journal Title

Bibliographic citation

Blanco FJ, Oreiro N, Vázquez-García J, et al. Predicting total knee replacement in knee osteoarthritis using a machine learning-guided approach in patients of the Osteoarthritis Initiative (OAI). RMD Open 2026;12:e006476.

Type of academic work

Academic degree

Abstract

[Abstract] Objective To develop a pragmatic model to predict total knee replacement (TKR) in knee osteoarthritis using non-imaging clinical, genetic and lifestyle data with machine learning (ML)-guided feature selection. Methods We analysed 3790 Osteoarthritis Initiative participants. Nested ML feature selection on the training set identified 15 informative variables. Classifiers were benchmarked, then a multivariable logistic regression was fit on the full cohort. Performance was summarised by discrimination (area under the curve (AUC) with 95% CI) and calibration (Brier score). To assess the incremental value of genetics, we refit an otherwise identical clinical model excluding the Polygenic Risk Score (PRS) and compared specificity at fixed sensitivities using Bonferroni-adjusted McNemar tests. A prespecified analysis examined performance by baseline Kellgren-Lawrence (KL) grade (KL 0–1 vs KL ≥2). Results On the test set, classifier AUCs ranged 0.716–0.748, with Elastic Net and XGBoost performing best. The final logistic model fit on the full cohort achieved AUC 0.765 (95% CI 0.736 to 0.793) with acceptable calibration (Brier 0.097). Performance remained robust by disease stage, with higher discrimination in pre-radiographic knees (KL 0–1: AUC 0.827) and moderate discrimination in KL ≥2 (AUC 0.720); decile plots indicated broadly aligned observed versus predicted risks. PRS added modest, statistically significant gains in specificity at several fixed sensitivities without materially changing AUC. Conclusions We present a pragmatic, non-imaging, ML-informed model that predicts TKR with clinically acceptable discrimination and calibration using routinely collected data. This framework provides a practical basis for individualised risk stratification and decision support without reliance on imaging.

Description

Keywords

Rights

Attribution-NonCommercial 4.0 International
Attribution-NonCommercial 4.0 International

Except where otherwise noted, this item's license is described as Attribution-NonCommercial 4.0 International