Image Classification with a Kernel Based on the Context of Visual Words

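WANG Yushi1, GAO Wen2 (1. School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001; 2. School of Electronic Engineering and Computer Science, Peking University, Beijing 100871)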

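Abstract
Coding local features into visual words is currently very popular in image analysis. Building on ordinary visual words, we propose a new kernel function that fuses the multi-level context of words. The design captures the following information: 1) multi-level word histograms; 2) multi-level "phrase" histograms; 3) the classes of the contexts of words (and phrases). The kernel is then applied in a support vector machine to classify images. The method performs very well on public test sets such as the Corel image database. It is also compared on a complex problem of high practical value: distinguishing adult images from swimsuit images. Here its recognition accuracy is about 7% higher than that of the classical method. Experimental results show that combining kernel-based measures with a multi-level description of visual words can improve image classification performance.
Keywords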
Kernel-based Image Classification Using the Context of Visual Words

WANG Yushi1, GAO Wen2 (1. School of Computer Science and Technology, Harbin Institute of Technology, Harbin 150001; 2. School of Electronic Engineering and Computer Science, Peking University, Beijing 100871)

Abstract
In the recent image analysis literature, coding local features into visual words has become very popular. We propose a novel kernel that fuses the multi-level contexts of visual words. Besides the histogram pyramid of words, the kernel incorporates the histogram pyramid of visual phrases (local co-occurrence patterns of words) and the context classes of those words and phrases. Support vector machines using this kernel are then trained to perform image classification. The method performs well on public test sets such as the Corel dataset. It is also tested on a challenging practical problem, discriminating pornographic images from bikini images, where its classification accuracy is about 7% higher than that of the baseline method. Experimental results demonstrate that image classification performance can be improved by integrating kernel-based measurements with the multi-level representation of visual words. In future work, more compact and efficient representations of context should be investigated.
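The abstract does not give the kernel's formula, so the following is only a minimal sketch of how a kernel over several histogram pyramids might be assembled. It assumes histogram intersection as the per-level similarity and a non-negative weighted sum across levels and channels. The channel names ("words", "phrases"), the weights, and all function names are hypothetical and do not come from the paper, and the context-class component is not modelled.

```python
from typing import Dict, List, Sequence

Pyramid = List[Sequence[float]]   # one histogram per pyramid level
Image = Dict[str, Pyramid]        # channel name -> histogram pyramid


def intersection(h1: Sequence[float], h2: Sequence[float]) -> float:
    """Histogram intersection: sum of bin-wise minima."""
    if len(h1) != len(h2):
        raise ValueError("histograms must have the same number of bins")
    return sum(min(a, b) for a, b in zip(h1, h2))


def pyramid_kernel(p1: Pyramid, p2: Pyramid, weights: Sequence[float]) -> float:
    """Weighted sum of per-level histogram intersections."""
    if not (len(p1) == len(p2) == len(weights)):
        raise ValueError("pyramids and weights must have the same depth")
    return sum(w * intersection(a, b) for w, a, b in zip(weights, p1, p2))


def combined_kernel(x: Image, y: Image,
                    level_weights: Dict[str, Sequence[float]]) -> float:
    """Sum the pyramid kernels of every channel (assumed: words, phrases)."""
    return sum(pyramid_kernel(x[c], y[c], w) for c, w in level_weights.items())


def gram_matrix(images: List[Image],
                level_weights: Dict[str, Sequence[float]]) -> List[List[float]]:
    """Pairwise kernel values, the form a precomputed-kernel SVM expects."""
    return [[combined_kernel(x, y, level_weights) for y in images]
            for x in images]


# Toy data: two images, a two-level word pyramid and a one-level phrase pyramid.
image_a: Image = {"words": [[2.0, 1.0], [1.0, 1.0, 1.0, 0.0]],
                  "phrases": [[1.0, 0.0]]}
image_b: Image = {"words": [[1.0, 2.0], [0.0, 1.0, 1.0, 1.0]],
                  "phrases": [[0.0, 1.0]]}
toy_weights = {"words": [0.5, 0.5], "phrases": [1.0]}
toy_gram = gram_matrix([image_a, image_b], toy_weights)
```

A Gram matrix of this shape could be passed to any SVM implementation that accepts a precomputed kernel. The per-level and per-channel weights would need to be chosen by validation, since the abstract does not state them.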
Keywords
