Session: (0306–0336) Osteoarthritis – Clinical Poster I
0327: Structural Severity Assessment of Hand Osteoarthritis Using Radiomic Analysis: Correlation with the Kellgren-Lawrence Score – Results from the DIGICOD Cohort
Northwestern university Chicago, Illinois, United States
Disclosure(s): No financial relationships with ineligible companies to disclose
Background/Purpose: Radiomics involves the high-throughput extraction of quantitative data from medical imaging. This study aimed to develop a radiomic analysis pipeline applied to hand radiographs and evaluate its ability to classify joints according to the Kellgren-Lawrence (KL) score, a grading system for hand osteoarthritis severity. The goal was to assess whether radiomic features could enable automated and standardized KL classification, complementing expert visual assessment. Methods: The study was based on 436 hand radiographs from the DIGICOD cohort, a prospective study of patients with hand osteoarthritis according to American College of Rheumatology criteria. After excluding 30 radiographs not found in PACS and 25 with artifacts or surgical hardware, 381 radiographs were analyzed, representing 10,531 joints (trapeziometacarpal [TMC], metacarpophalangeal [MCP], proximal interphalangeal [PIP], and distal interphalangeal [DIP] joints). KL grading was performed by an experienced radiologist at baseline.
Radiomic analysis involved several steps: (1) joint segmentation using an algorithm developed for the study. Manual segmentations from 20 patients served as ground truth to train a U-Net model with iterative training and correction; (2) extraction of 179 radiomic features (4 shape, 45 intensity histogram, and 130 texture features); (3) exclusion of texture features with relative variation < 0.05 after pixel shuffling (deemed non-informative); (4) dimensionality reduction by removing highly correlated features (Spearman > 0.9), resulting in 86 features. A random forest model with stratified cross-validation selected the 20 most relevant features for predicting KL scores. Classification performance was evaluated by comparing predicted vs. expert-assigned KL scores, using area under the curve (AUC), accuracy, and balanced accuracy. Accuracy reflects the proportion of correctly classified joints, while balanced accuracy adjusts for class imbalance. Results: The radiomic model achieved good classification performance for KL grading. Differentiation between affected joints (KL 2–4) and normal or borderline joints (KL 0–1) yielded an AUC of 0.81 and accuracy of 0.74. Discrimination between non/mildly affected (KL 0–2) and severely affected joints (KL 3–4) achieved an AUC of 0.83 and accuracy of 0.68. Considering all KL classes, the overall AUC was 0.76, with balanced accuracy of 0.37. Joint-type specific analysis (affected vs. normal/borderline) showed lower performance, with AUCs of 0.67, 0.71, and 0.49 and balanced accuracies of 0.34, 0.35, and 0.39 for MCP, PIP, and DIP joints, respectively. A downsampling strategy was applied to mitigate class imbalance, slightly improving overall classification, especially for the most frequent classes. Conclusion: This study is the first to evaluate radiomic analysis in hand osteoarthritis. Radiomics enabled, to some extent, automated reproduction of KL classification. However, performance varied depending on joint type and severity. While this approach shows promise as a complementary tool for assessing hand osteoarthritis, external validation on an independent cohort is required before clinical application.