Predicting total knee replacement in knee osteoarthritis using a machine learning-guided approach in patients of the Osteoarthritis Initiative (OAI)

Blanco García, Francisco J; Oreiro Villar, Natividad; Vázquez-García, Jorge; Morano Torres, Antonio; Balboa-Barreiro, Vanesa; Rodríguez-Valle, Isabel; Relaño, Sara; Veronese, Nicola; de Andrés, María C.; Rego-Pérez, I.

Use this link to cite:

https://hdl.handle.net/2183/47471

Predicting total knee replacement in knee osteoarthritis using a machine learning-guided approach in patients of the Osteoarthritis Initiative (OAI)

Files

Blanco_Predicting_2026.pdf (705.48 KB)

Blanco_Predicting_2026_Suppl_1.pdf (296.15 KB)

Blanco_Predicting_2026_Suppl_2.pdf (369.49 KB)

Blanco_Predicting_2026_Suppl_3.pdf (144.93 KB)

Blanco_Predicting_2026_Suppl_4.pdf (136.58 KB)

Identifiers

URI: https://hdl.handle.net/2183/47471

DOI: 10.1136/rmdopen-2025-006476

Publication date

2026-02-10

Authors

Blanco García, Francisco J

Oreiro Villar, Natividad

Vázquez-García, Jorge

Morano Torres, Antonio

Balboa-Barreiro, Vanesa

Rodríguez-Valle, Isabel

Relaño, Sara

Veronese, Nicola

de Andrés, María C.

Rego-Pérez, I.

Bibliographic citation

Blanco FJ, Oreiro N, Vázquez-García J, et al. Predicting total knee replacement in knee osteoarthritis using a machine learning-guided approach in patients of the Osteoarthritis Initiative (OAI). RMD Open 2026;12:e006476.

Abstract

[Abstract] Objective To develop a pragmatic model to predict total knee replacement (TKR) in knee osteoarthritis using non-imaging clinical, genetic and lifestyle data with machine learning (ML)-guided feature selection. Methods We analysed 3790 Osteoarthritis Initiative participants. Nested ML feature selection on the training set identified 15 informative variables. Classifiers were benchmarked, then a multivariable logistic regression was fit on the full cohort. Performance was summarised by discrimination (area under the curve (AUC) with 95% CI) and calibration (Brier score). To assess the incremental value of genetics, we refit an otherwise identical clinical model excluding the Polygenic Risk Score (PRS) and compared specificity at fixed sensitivities using Bonferroni-adjusted McNemar tests. A prespecified analysis examined performance by baseline Kellgren-Lawrence (KL) grade (KL 0–1 vs KL ≥2). Results On the test set, classifier AUCs ranged 0.716–0.748, with Elastic Net and XGBoost performing best. The final logistic model fit on the full cohort achieved AUC 0.765 (95% CI 0.736 to 0.793) with acceptable calibration (Brier 0.097). Performance remained robust by disease stage, with higher discrimination in pre-radiographic knees (KL 0–1: AUC 0.827) and moderate discrimination in KL ≥2 (AUC 0.720); decile plots indicated broadly aligned observed versus predicted risks. PRS added modest, statistically significant gains in specificity at several fixed sensitivities without materially changing AUC. Conclusions We present a pragmatic, non-imaging, ML-informed model that predicts TKR with clinically acceptable discrimination and calibration using routinely collected data. This framework provides a practical basis for individualised risk stratification and decision support without reliance on imaging.

Editor version

https://doi.org/10.1136/rmdopen-2025-006476

Rights

Attribution-NonCommercial 4.0 International

Collections

Investigación (FFISIO)

Full item page

Except where otherwise noted, this item's license is described as Attribution-NonCommercial 4.0 International

Predicting total knee replacement in knee osteoarthritis using a machine learning-guided approach in patients of the Osteoarthritis Initiative (OAI)

Files

Identifiers

Publication date

Authors

Advisors

Other responsabilities

Journal Title

Bibliographic citation

Type of academic work

Academic degree

Abstract

Description

Keywords

Editor version

Rights

Collections