LRUNet:轻量级脑肿瘤快速语义分割网络
摘 要
目的 针对目前基于深度学习的脑肿瘤分割算法参数量大、计算复杂和快速性差的问题,提出了一种超轻量级快速语义分割网络LRUNet (lightweight rapid UNet),在保证分割精度提升的同时,极大地减少了网络的参数量与计算量,达到快速分割的效果。方法 LRUNet网络结构基于UNet,将3D-UNet的通道数减少为原来的1/4,减少原先3D-UNet过多的参数量;将UNet网络中除最后一层外的所有传统卷积变为深度可分离卷积,深度可分离卷积以牺牲极少精度,大大减少网络参数量,实现网络的轻量级;使用空间—通道压缩和激发模块(spatial and channel squeeze&excitation block,scSE),该模块能够放大特征图中对模型有利的参数的权重,缩小对模型不利参数的权重,提升网络分割的精度。结果 在BraTS 2018(Brain Tumor Segmentation Challenge 2018)数据集上的在线验证结果显示,该模型在全肿瘤、核心区肿瘤和增强区肿瘤分割的平均Dice系数分别为0.893 6、0.804 6和0.787 2。LRUNet与同为轻量级网络的S3D-UNet相比Dice有所提升,但是,参数量仅为S3D-UNet的1/4,FLOPs (floating point operations per second)仅为1/2。结论 与3D-UNet、S3D-UNet和3D-ESPNet等算法相比,LRUNet算法不仅保证精度得到提升,而且极大地减少网络中计算的参数量与计算成本消耗,同时网络模型的预测速度得到很大提升,使得快速语义分割在3维医学图像领域成为可能。
关键词
LRUNet: a lightweight rapid semantic segmentation network for brain tumors
He Kanghui, Xiao Zhiyong(School of Artificial Intelligence and Computer Science, Jiangnan University, Wuxi 214122, China) Abstract
Objective The brain tumor has been divided into primary and secondary tumors types. Glioma has been divided into lower glial tumors and higher glial tumors. Magnetic resonance imaging(MRI) has been a vital diagnostic tool for brain tumor analysis, detection and surgical planning. Accurate segmentation of brain tumors has been crucial for diagnosis and treatment planning. Manual segmentation has required senior doctors to spend a lot of time to complete nowadays. Automatic brain tumor segmentation has been applied instead manual segmentation further. The intensified profile of the tumor area has overlapped significantly with a healthy portion. Method This research has bridged the gap between the efficiency and accuracy of 3D MRI brain tumor segmentation models. A light-weighted rapid semantic segmentation network called LRUNet has been demonstrated. LRUNet has improved the segmentation accuracy and achieved the effect of lightweight, high precision and rapid semantic segmentation in comparison with the existing network. The amount of parameters of these networks has been deleted compared with the algorithms in order to achieve the lightweight effects. At the beginning, the number of channels in the existed 3D-UNet has deducted by four times in each output layer to reduce the number of network parameters dramatically. Next, existed 3D convolution has been excluded and deep separable convolution has been applied to 3D convolution to reduce the number of network parameters on the premise of maintaining accuracy greatly. At last, the convolution-based feature map has not been beneficial to the model entirely. The weight of parameters based on space and channel compression & excitation module has been strengthened to improve the model in the feature map, to reduce the weight of redundant parameters and to improve the performance of the model. Based on 3D-UNet, the number of channels has been reduced 4 times via each convolution. The network becomes more trainable because fewer channels lead to fewer parameters. Three dimensional depth separable convolutions have de-composed the standard convolution into deep convolution and point convolution of 1×1×1. A standard convolutional layer has been integrated to filter and merge into one output. Deep separable convolution has divided the convolution into two layers for filtering and merging each. The effect of this factorization has greatly reduced computation and model size. The application of deep separable convolution has made the network lightweight to realize fast semantic segmentation. The accuracy of the network has not still been improved. The space and channel compression & excitation module have generated a tensor to represent the importance of the feature map in space or channel direction via compressing and exciting the feature map in space or channel direction. The enhancement of important channels or spatial points has been facilitated. The neglect of unimportant channels or spatial points has been weakened. The space and channel compression & excitation module have yielded the network to remain lightweight under no circumstances of increasing the number of arguments. In addition, the accuracy of the network and the training accuracy of the model have been improved simultaneously. First, the tumors contained in the previously given segmentation map have been synthesized to make larger tumor's training area. Second, the best model of intersection over union(IOU) in the validation set has been the optimal parameters. Thirdly, binary cross entropy(BCE) Dice loss has been adopted as the loss function to solve the class imbalance of the foreground and background of the data set itself. Finally, the predicted results have been submitted online to ensure the fairness of the algorithm. Result The model has been tested in the Brain Tumor Segmentation Challenge 2018(BraTS 2018) online validation experiment. The average Dice coefficients of tumor segmentation in whole tumor, core tumor and enhanced tumor region have reached 0.893 6, 0.804 6 and 0.787 2 respectively. Compared with 3D-UNet, S3D-UNET, 3D-ESPNET and other algorithms, LRUNet has not only assured the improvement of accuracy, but also greatly reduced the consumption of computational parameters and computational costs in the network. Conclusion A new light-weighted UNet network with only 0.97 MB parameters has been developed to 31 GB floating point operations per second(FLOPs) approximately. The number of parameters has been acquired only 1/16 of the 3D-UNet and the FLOPs have reached 1/52 of the 3D-UNet. The illustrated verification has demonstrated that the great advantages in both performance and number of network parameters have been leaked out based on calculated algorithm (note:the segmentation results have been closest to the true tag). The lightweight and efficient nature of the network has been beneficial to the large-scale 3D medical data sets processing.
Keywords
3D image processing fully convolutional network magnetic resonance imaging(MRI) rapid semantic segmentation UNet
|