Current Issue Cover
多尺度分割先验迭代重加权低秩恢复显著性检测

张荣国1, 郑维佳1, 赵建1, 胡静1, 刘小君2(1.太原科技大学计算机科学与技术学院, 太原 030024;2.合肥工业大学机械工程学院, 合肥 230009)

摘 要
目的 现有显著性检测方法大多只关注显著目标的中心信息,使得算法只能得到中心清晰、边缘模糊的显著目标,丢失了一些重要的边界信息,而使用核范数约束进行低秩矩阵恢复,运算过程冗余。为解决以上问题,本文提出一种无监督迭代重加权最小二乘低秩恢复算法,用于图像视觉显著性检测。方法 将图像分为细中粗3种尺度的分割,从细粒度和粗粒度先验的融合中得到分割先验信息;将融合后的分割先验信息通过迭代重加权最小二乘法求解平滑低秩矩阵恢复,生成粗略显著图;使用中粒度分割先验对粗略显著图进行平滑,生成最终的视觉显著图。结果 实验在MSRA10K(Microsoft Research Asia 10K)、SOD(salient object detection dataset)和ECSSD(extended complex scene saliency dataset)数据集上进行测试,并与现有的11种算法进行对比。结果表明,本文算法可生成边界清晰的显著图。在MSRA10K数据集上,本文算法实现了最高的AUC(area under ROC(receiver operating characteristic)curve)和F-measure值,MAE(mean absolute error)值仅次于SMD(structured matrix decomposition)算法和RBD(robust back ground detection)算法,AUC和F-measure值比次优算法RPCA(robust principal component analysis)分别提高了3.9%和12.3%;在SOD数据集上,综合AUC、F-measure和MAE值来看,本文算法优于除SMD算法以外的其他算法,AUC值仅次于SMD算法、SC(smoothness constraint)算法和GBVS(graph-based visual salieney)算法,F-measure值低于最优算法SMD 2.6%;在ECSSD数据集上,本文算法实现了最高的F-measure值75.5%,AUC值略低于最优算法SC 1%,MAE值略低于最优算法HCNs(hierarchical co-salient object detection via color names)2%。结论 实验结果表明,本文算法能从前景复杂或背景复杂的显著图像中更准确地检测出边界清晰的显著目标。
关键词
Iterative reweighted low-rank recovery salient object detection with multiscale segmentation prior

Zhang Rongguo1, Zheng Weijia1, Zhao Jian1, Hu Jing1, Liu Xiaojun2(1.School of Computer Science and Technology, Taiyuan University of Science and Technology, Taiyuan 030024, China;2.School of Mechanical Engineering, Hefei University of Technology, Hefei 230009, China)

Abstract
Objective Various existing saliency detection methods have been widely used in saliency images processing. Current saliency detection algorithm can only get the significant target with clear center and blurred edges, losing some important boundary information. The existing saliency detection algorithm based on low-rank matrix recovery has required the use of kernel norms to constrain the low-rank matrix. An unsupervised low-rank matrix restoration iterative re-weighted least squares method has been proposed based on multi-scale segmentation prior information for image saliency detection. Method First, the input image has been divided into three levels of granularity:fine-grained, medium-granular and coarse-grained. Fine-grained segmentation can divide an image into multiple superpixels. Medium-grain size segmentation can also segment the image but produce fewer regions. Coarse-grain size segmentation can maximize the separation of significant objects from the background but the image may be under-segmented. The segmentation prior information has been obtained from the fusion of fine-grained and coarse-grained priors. Next, the fused segmentation prior information has been obtained. A coarse significant map has been generated based on iterative re-weighted least squares method. At last, the fused significant map has been post-smoothed via using a medium-grained segmentation prior. The final visual saliency map has been acquired. Result The experiment has used the three datasets of Microsoft Research Asia 10K (MSRA10K), salient object detection dataset (SOD) and extended complex scene saliency dataset (ECSSD) for testing with comparison of the existing eleven algorithms. The demonstrated algorithm can generate significant target accuracy and clear boundaries of the significant graph. The MSRA10K dataset has contained images of various salient objects of different sizes but only one salient object in each image. The highest area under receiver operating characteristic (ROC) curve value and F-measure value on the MSRA10K dataset have been achieved among them. The mean absolute error (MAE) value has been second only to the structured matrix decomposition (SMD) algorithm and robust back ground detection (RBD) algorithm. The area under ROC curve (AUC) value and F-measure value have been improved by 3.9% and 12.3% respectively compared with the suboptimal algorithm robust principal component analysis (RPCA). A simplified priori functionality and no supervision, and even has implemented the hierarchical fusion of hierarchical co-salient object detection via color names (HCNs) algorithm and the exploiting color name (HCN) algorithm. It is hard to choose the appropriate ratio to suppress the background without the size of the object like the frequency-based frequency-tuned (FT) algorithm. The RPCA algorithm has not considered the spatial structure of the image. The outline of the salient target that can be detected in the MSRA10K dataset with a salient target single. The SOD dataset has contained images of multiple salient objects (independent or adjacent). In the SOD dataset, the algorithm of this paper is superior to other algorithms except SMD algorithm in terms of AUC value, F-measure value and MAE value. The AUC value is second only to the SMD algorithm, smoothness constraint (SC) algorithm and graph-based visual saliency (GBVS) algorithm. The F-measure value is lower than the best algorithm SMD 2.6%. The algorithm in this paper is superior to other algorithms except SMD algorithm in terms of AUC value, F-measure value and MAE value. The algorithm is effective in the case of multiple salient targets. However, the performance of saliency filters (SF), segmentation driven low-rank matrix recovery (SLR), robust back ground detection (RBD) is greatly reduced. Just as the SLR algorithm introduces a segmentation prior first, making it more sensitive to the number of significant targets in the image. As for SF due to its dependence on contrast, its AUC value and F-measure value decrease sharply as the significant target increases. The SF algorithm has divided the salient targets that are not significantly contrasted with the background into the background due to multiple salient targets in the two contrast metrics. The RBD algorithm has relied on the boundary connectivity of the background to segment the image caused poorly performance when detecting multiple salient target images. The ECSSD dataset has contained background complex images as well as significant targets of varying sizes. On the ECSSD dataset, the highest F-measure value of 75.5% has been achieved. The AUC value has been slightly lower than the optimal algorithm SC 1%, while the MAE value is slightly lower than the optimal algorithm HCNs 2%. The highest F-measure values and where the AUC and MAE values have been slightly lower than those of the SC algorithm and the HCNs algorithm. The SC algorithm with deformation smoothness constraint is only slightly inferior to the ECSSD dataset. The SMD algorithm has performed well on the MSRA10K dataset and the SOD dataset and moderated on the ECSSD dataset. The tree structure method of capturing image information is not applicable to images with complex backgrounds. Method such as GBVS, foreground-background segmentation (FBS), SF, and RBD have relied on methods like visual saliency, foreground enhancement, background enhancement and the use of a tree structure. Cues like suppression, contrast bias and center bias have not maintained good performance. Conclusion A multi-scale saliency detection algorithm has been demonstrated. First, a coarse saliency map by iterative reweighted least squares method based on the prior of multiscale segmentation has been generated. Second, the coarse saliency map by iterative reweighted least squares has been fused. Finally, the qualified saliency map by smooth fusion of the significant maps with medium granularity has been obtained. The algorithm with the latest methods on three public datasets of MSRA10K, SOD, and ECSSD has been verified. Significant targets have been achieved based on accurate, clear boundary saliency segmentation results. The algorithm has presented more robust.
Keywords

订阅号|日报