4.6 Article

A comparison of methods for fully automatic segmentation of tumors and involved nodes in PET/CT of head and neck cancers

期刊

PHYSICS IN MEDICINE AND BIOLOGY
卷 66, 期 6, 页码 -

出版社

IOP Publishing Ltd
DOI: 10.1088/1361-6560/abe553

关键词

head and neck cancer; PET; CT; gross tumor volume; automatic segmentation; thresholding; machine learning; deep learning

资金

  1. Norwegian Cancer Society [160907-2014, 182672-2016]

向作者/读者索取更多资源

Target volume delineation in radiotherapy is crucial for delivering the right dose while minimizing side effects. A study comparing PET thresholding, machine learning, and CNN methods for automatic tumor volume segmentation in head and neck cancer found that a CNN model using multimodality PET/CT input outperformed other methods in terms of accuracy and target coverage.
Target volume delineation is a vital but time-consuming and challenging part of radiotherapy, where the goal is to deliver sufficient dose to the target while reducing risks of side effects. For head and neck cancer (HNC) this is complicated by the complex anatomy of the head and neck region and the proximity of target volumes to organs at risk. The purpose of this study was to compare and evaluate conventional PET thresholding methods, six classical machine learning algorithms and a 2D U-Net convolutional neural network (CNN) for automatic gross tumor volume (GTV) segmentation of HNC in PET/CT images. For the latter two approaches the impact of single versus multimodality input on segmentation quality was also assessed. 197 patients were included in the study. The cohort was split into training and test sets (157 and 40 patients, respectively). Five-fold cross-validation was used on the training set for model comparison and selection. Manual GTV delineations represented the ground truth. Tresholding, classical machine learning and CNN segmentation models were ranked separately according to the cross-validation Sorensen-Dice similarity coefficient (Dice). PET thresholding gave a maximum mean Dice of 0.62, whereas classical machine learning resulted in maximum mean Dice scores of 0.24 (CT) and 0.66 (PET; PET/CT). CNN models obtained maximum mean Dice scores of 0.66 (CT), 0.68 (PET) and 0.74 (PET/CT). The difference in cross-validation Dice between multimodality PET/CT and single modality CNN models was significant (p <= 0.0001). The top-ranked PET/CT-based CNN model outperformed the best-performing thresholding and classical machine learning models, giving significantly better segmentations in terms of cross-validation and test set Dice, true positive rate, positive predictive value and surface distance-based metrics (p <= 0.0001). Thus, deep learning based on multimodality PET/CT input resulted in superior target coverage and less inclusion of surrounding normal tissue.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据