Using Artificial Intelligence for Detecting Diabetic Foot Osteomyelitis: Validation of Deep Learning Model for Plain Radiograph Interpretation

Objective: To develop and validate a ResNet-50-based deep learning model for automatic detection of osteomyelitis (DFO) in plain radiographs of patients with diabetic foot ulcers (DFUs). Research Design and Methods: This retrospective study included 168 patients with type one or type two diabetes an...

Full description

Saved in:
Bibliographic Details
Main Authors: Francisco Javier Álvaro-Afonso, Aroa Tardáguila-García, Mateo López-Moral, Irene Sanz-Corbalán, Esther García-Morales, José Luis Lázaro-Martínez
Format: Article
Language:English
Published: MDPI AG 2025-08-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/15/15/8583
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Objective: To develop and validate a ResNet-50-based deep learning model for automatic detection of osteomyelitis (DFO) in plain radiographs of patients with diabetic foot ulcers (DFUs). Research Design and Methods: This retrospective study included 168 patients with type one or type two diabetes and clinical suspicion of DFO confirmed via a surgical bone biopsy. An experienced clinician and a pretrained ResNet-50 model independently interpreted the radiographs. The model was developed using Python-based frameworks with ChatGPT assistance for coding. The diagnostic performance was assessed against the histopathological findings, calculating sensitivity, specificity, the positive predictive value (PPV), the negative predictive value (NPV), and the likelihood ratios. Agreement between the AI model and the clinician was evaluated using Cohen’s kappa coefficient. Results: The AI model demonstrated high sensitivity (92.8%) and PPV (0.97), but low-level specificity (4.4%). The clinician showed 90.2% sensitivity and 37.8% specificity. The Cohen’s kappa coefficient between the AI model and the clinician was −0.105 (<i>p</i> = 0.117), indicating weak agreement. Both the methods tended to classify many cases as DFO-positive, with 81.5% agreement in the positive cases. Conclusions: This study demonstrates the potential of IA to support the radiographic diagnosis of DFO using a ResNet-50-based deep learning model. AI-assisted radiographic interpretation could enhance early DFO detection, particularly in high-prevalence settings. However, further validation is necessary to improve its specificity and assess its utility in primary care.
ISSN:2076-3417