Risk Stratification of Indeterminate Thyroid Nodules

Abstract and Introduction

Abstract

Background: Indeterminate thyroid nodules (Bethesda III) are challenging to characterize without diagnostic surgery. Auxiliary strategies including molecular analysis, machine learning models, and ultrasound grading with Thyroid Imaging, Reporting and Data System (TI-RADS) can help to triage accordingly, but further refinement is needed to prevent unnecessary surgeries and increase positive predictive values.

Design: Retrospective review of 88 patients with Bethesda III nodules who had diagnostic surgery with final pathological diagnosis.

Measurements: Each nodule was retrospectively scored through TI-RADS. Two deep learning models were tested, one previously developed and trained on another data set, mainly containing determinate cases and then validated on our data set while the other one trained and tested on our data set (indeterminate cases).

Results: The mean TI-RADS score was 3 for benign and 4 for malignant nodules (p= .0022). Radiological high risk (TI-RADS 4,5) and low risk (TI-RADS 2,3) categories were established. The PPV for the high radiological risk category in those with >10 mm nodules was 85% (CI: 70%–93%). The NPV for low radiological risk in patients >60 years (mean age was 100% (CI: 83%–100%). The area under the curve (AUC) value of our novel classifier was 0.75 (CI: 0.62–0.84) and differed significantly from the chance-level (

Table 1. Performance metric of the deep learning models for the classification
Metric	ThyNet	ResNet-50 trained/tested on our data
Sensitivity
All	0.50 (CI: 0.34–0.66)	0.82 (CI: 0.76–0.89)
TI-RADS 2	0.50 (CI: 0.09–0.91)	1.00 (CI: 0.32–1.00)
TI-RADS 3	0.60 (CI: 0.23–0.88)	0.83 (CI: 0.68–0.99)
TI-RADS 4	0.39 (CI: 0.20–0.61)	0.82 (CI: 0.73–0.92)
TI-RADS 5	0.67 (CI: 0.35–0.88)	0.78 (CI: 0.64–0.92)
Specificity
All	0.74 (CI: 0.61–0.84)	0.59 (CI: 0.53–0.66)
TI-RADS 2	0.6 (CI: 0.23–0.88)	0.80 (CI: 0.62–0.98)
TI-RADS 3	0.76 (CI: 0.57–0.88)	0.64 (CI: 0.54–0.74)
TI-RADS 4	0.74 (CI: 0.51–0.88)	0.47 (CI: 0.36–0.59)
TI-RADS 5	0.80 (CI: 0.38–0.96)	0.60 (CI: 0.38–0.82)
Positive predictive value
All	0.55 (CI: 0.46–0.64)	0.56 (CI: 0.49–0.63)
TIRADS 2	0.33 (CI: 0.06–0.61)	0.67 (CI: 0.39–0.94)
TIRADS 3	0.33 (CI: 0.18–0.49)	0.36 (CI: 0.23–0.49)
TI-RADS 4	0.58 (CI: 0.44–0.73)	0.88 (CI: 0.81–0.94)
TI-RADS 5	0.86 (CI: 0.72–0.99)	0.78 (CI: 0.64–0.92)
Negative predictive value
All	0.7 (CI: 0.64–0.76)	0.84 (CI: 0.78–0.90)
TI-RADS 2	0.75 (CI: 0.53–0.97)	1.00 (CI: 0.56–1.00)
TI-RADS 3	0.9 (CI: 0.84–0.97)	0.94 (CI: 0.88–1.00)
TI-RADS 4	0.58 (CI: 0.48–0.68)	0.82 (CI: 0.71–0.93)
TI-RADS 5	0.57 (CI: 0.38–0.76)	0.6 (CI: 0.38–0.82)

Table 1. Performance metric of the deep learning models for the classification

Metric

ThyNet

ResNet-50 trained/tested on our data

Sensitivity

All

0.50 (CI: 0.34–0.66)

0.82 (CI: 0.76–0.89)

TI-RADS 2

0.50 (CI: 0.09–0.91)

1.00 (CI: 0.32–1.00)

TI-RADS 3

0.60 (CI: 0.23–0.88)

0.83 (CI: 0.68–0.99)

TI-RADS 4

0.39 (CI: 0.20–0.61)

0.82 (CI: 0.73–0.92)

TI-RADS 5

0.67 (CI: 0.35–0.88)

0.78 (CI: 0.64–0.92)

Specificity

All

0.74 (CI: 0.61–0.84)

0.59 (CI: 0.53–0.66)

TI-RADS 2

0.6 (CI: 0.23–0.88)

0.80 (CI: 0.62–0.98)

TI-RADS 3

0.76 (CI: 0.57–0.88)

0.64 (CI: 0.54–0.74)

TI-RADS 4

0.74 (CI: 0.51–0.88)

0.47 (CI: 0.36–0.59)

TI-RADS 5

0.80 (CI: 0.38–0.96)

0.60 (CI: 0.38–0.82)

Positive predictive value

All

0.55 (CI: 0.46–0.64)

0.56 (CI: 0.49–0.63)

TIRADS 2

0.33 (CI: 0.06–0.61)

0.67 (CI: 0.39–0.94)

TIRADS 3

0.33 (CI: 0.18–0.49)

0.36 (CI: 0.23–0.49)

TI-RADS 4

0.58 (CI: 0.44–0.73)

0.88 (CI: 0.81–0.94)

TI-RADS 5

0.86 (CI: 0.72–0.99)

0.78 (CI: 0.64–0.92)

Negative predictive value

All

0.7 (CI: 0.64–0.76)

0.84 (CI: 0.78–0.90)

TI-RADS 2

0.75 (CI: 0.53–0.97)

1.00 (CI: 0.56–1.00)

TI-RADS 3

0.9 (CI: 0.84–0.97)

0.94 (CI: 0.88–1.00)

TI-RADS 4

0.58 (CI: 0.48–0.68)

0.82 (CI: 0.71–0.93)

TI-RADS 5

0.57 (CI: 0.38–0.76)

0.6 (CI: 0.38–0.82)

Risk Stratification of Indeterminate Thyroid Nodules Using Ultrasound and Machine Learning Algorithms

Abstract and Introduction

Abstract

Most Popular Articles

Risk Stratification of Indeterminate Thyroid Nodules Using Ultrasound and Machine Learning Algorithms

Abstract and Introduction

Abstract

Tables

References

Authors and Disclosures

Authors and Disclosures

Most Popular Articles

Email This

Feedback