Current Issue Cover
分形理论引导的图像临界差异感知阈值估计

郭嘉骏, 姜求平, 邵枫(宁波大学信息科学与工程学院, 宁波 315211)

摘 要
目的 图像的临界差异(just noticeable difference,JND)阈值估计对提升图像压缩比以及信息隐藏效率具有重要意义。亮度适应性和空域掩蔽效应是决定JND阈值大小的两大核心因素。现有的空域掩蔽模型主要考虑对比度掩蔽和纹理掩蔽两方面。然而,当前采用的纹理掩蔽模型不能有效地描述与纹理粗糙度相关的掩蔽效应对图像JND阈值的影响。对此,本文提出一种基于分形理论的JND阈值估计模型。方法 首先,考虑到人眼视觉系统对具有粗糙表面的图像内容变化具有较低的分辨能力,通过经典的分形理论来计算图像局部区域的分形维数,并以此作为对纹理粗糙度的度量,并在此基础上提出一种新的基于纹理粗糙度的纹理掩蔽模型。然后,将提出的纹理掩蔽模型与传统的亮度适应性相结合估计得到初步的JND阈值。最后,考虑到人眼的视觉注意机制,进一步考虑图像内容的视觉显著性,对JND阈值进行感知一致性修正,估计得到最终的JND阈值。结果 选取4种相关方法进行对比,结果表明,在注入相同甚至更多噪声的情况下,相较于对比方法中的最优结果,本文方法的平均VSI (visual saliency-induced index)和平均MOS (mean opinion score)在LIVE (Laboratory for Image&Video Engineering)图像库上分别提高了0.001 7和50%,在TID2013(tampere image database 2013)图像库上分别提高了0.001 9和40%,在CSIQ (categorical subjective image quality)图像库上分别提高了0.001 3和9.1%,在基于VVC (versatile video coding)的JND图像库上分别提高了0.000 3和54.5%。此外,作为另一典型应用,开展了感知冗余去除实验。实验结果表明,在保持视觉质量的前提下,经过本文JND模型平滑处理后的图像,其JPEG压缩图像相比于原图直接JPEG压缩得到的图像能节省12.5%的字节数。结论 本文提出的基于分形维数的纹理粗糙度能够有效刻画纹理掩蔽效应,构建的纹理掩蔽效应与传统的空域掩蔽效应相结合能够大幅提升图像JND阈值估计的准确性和可靠性。
关键词
Fractal-guided JND threshold estimation for natural images

Guo Jiajun, Jiang Qiuping, Shao Feng(School of Information Science and Engineering, Ningbo University, Ningbo 315211, China)

Abstract
Objective The perception of visual information of human visual system has distorted multi-sensitivities due to multiple categories. Current researches have shown that the human visual system is restricted of the changes or distortions in image perception under some threshold circumstances. This kind of threshold is referred to the just noticeable difference (JND) threshold. JND threshold estimation of images is of great significance for many perceptual image processing applications like the optimized image compression and information hiding. Generally, luminance adaptation and spatial masking effect can be as the key factors to identify the JND threshold. The existing spatial masking models have mainly been evolved on two aspects of contrast masking and texture masking. However, the current texture masking model cannot effectively clarify the influence of JND-threshold-oriented texture roughness. Meanwhile, human visual system has lower sensitivities for perceiving the rough-surfaced changes or differences. First, our research is focused on fractal dimension based (FD-based) description using quantitative-oriented image texture roughness in terms of classic fractal theory. A new FD-based texture masking model is demonstrated as well. Regular higher texture-roughness-based targeted areas have stronger texture masking effect due to the lower texture-roughness-derived weaker texture masking effect. Next, our newly texture is developed an improved spatial masking estimation function based on the integration of a masking model and traditional contrast masking. Finally, the improved spatial mask estimation function is combined with luminance adaptability function for the final JND profile. Considering the visual attention mechanism of the human visual system, we also refine the JND threshold using the classical graph based visual saliency (GBVS) method in visual saliency map. Method First, we get the luminance contrast masking of the original image. Then, we calculate the FD of each block with size 8×8 by the widely-adopted differential box-counting (DBC) method and get a FD map of the original image. Third, the new spatial masking map is obtained via combining the luminance contrast map with the proposed FD map. Forth, we also apply the luminance adaptation map to interpret the influence of luminance on JND. Based on the luminance adaptation map and the new spatial masking map, a direct-multiplication-derived coarse JND map can be obtained by fusing the two maps together. Fifth, considering the visual attention mechanism of human visual system, we refine the coarse JND map using visual saliency. To characterize the influence of visual saliency on JND, we compute the visual saliency map based on the GBVS method and a sigmoid-like control function is formulated. Finally, we can obtain the final JND map by fusing the coarse JND map and the multiplication-based sigmoid-like control function. Result Our proposed JND profile has the highest visual saliency-induced index(VSI) and the highest mean opinion score(MOS) when injecting the more equivalent noises. Specifically, our model has a 0.001 7 higher average VSI score than the second-best model on Laboratory for Image & Video Engineering(LIVE), and the MOS is 50% higher. On TID2013, the average VSI of our model is 0.001 9 higher than the second-best model, and the MOS is also 40% higher. On categorical subjective image quality(CSIQ), the average VSI of our model is 0.001 3 higher than the second-best model, and the MOS is 9.1% higher as well. In addition, we also conduct the perceptual redundancy removal experiments to demonstrate the application capability of the proposed JND profile. The experimental results shown that our JND model can save 12.5% bytes on JPEG compression. Conclusion Our FD-based JND model can estimate the JND threshold effectively on the texture region. The fractal-based roughness on local areas, we can propose a new texture masking effect to estimate the JND threshold on these areas. To tackle more distorted issues, our model can inject more noise in the areas with high roughness and less noise in the smooth areas.
Keywords

订阅号|日报