Deep learning dose prediction for IMRT of esophageal cancer: The effect of data quality and quantity on model performance
Organs at Risk
Esophageal Neoplasms
retrospective study
Esophageal cancer
dependent variable
Article
experimental study
03 medical and health sciences
Deep Learning
0302 clinical medicine
data quality
Humans
human
IMRT
Retrospective Studies
Radiotherapy
Radiotherapy Planning, Computer-Assisted
deep learning
radiation dose reduction
Deep learning
Radiotherapy Dosage
prediction
intensity modulated radiation therapy
radiation beam
Data Accuracy
3. Good health
radiation dose fractionation
planning target volume
esophagus cancer
Radiotherapy, Intensity-Modulated
Automatic planning
DOI:
10.1016/j.ejmp.2021.02.026
Publication Date:
2021-03-11T05:51:06Z
AUTHORS (7)
ABSTRACT
To investigate the effect of data quality and quantity on the performance of deep learning (DL) models, for dose prediction of intensity-modulated radiotherapy (IMRT) of esophageal cancer.Two databases were used: a variable database (VarDB) with 56 clinical cases extracted retrospectively, including user-dependent variability in delineation and planning, different machines and beam configurations; and a homogenized database (HomDB), created to reduce this variability by re-contouring and re-planning all patients with a fixed class-solution protocol. Experiment 1 analysed the user-dependent variability, using 26 patients planned with the same machine and beam setup (E26-VarDB versus E26-HomDB). Experiment 2 increased the training set by groups of 10 patients (E16, E26, E36, E46, and E56) for both databases. Model evaluation metrics were the mean absolute error (MAE) for selected dose-volume metrics and the global MAE for all body voxels.For Experiment 1, E26-HomDB reduced the MAE for the considered dose-volume metrics compared to E26-VarDB (e.g. reduction of 0.2 Gy for D95-PTV, 1.2 Gy for Dmean-heart or 3.3% for V5-lungs). For Experiment 2, increasing the database size slightly improved performance for HomDB models (e.g. decrease in global MAE of 0.13 Gy for E56-HomDB versus E26-HomDB), but increased the error for the VarDB models (e.g. increase in global MAE of 0.20 Gy for E56-VarDB versus E26-VarDB).A small database may suffice to obtain good DL prediction performance, provided that homogenous training data is used. Data variability reduces the performance of DL models, which is further pronounced when increasing the training set.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (63)
CITATIONS (45)
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....