Current Issue Cover
融合多尺度码本的全局编码图像分类

董振宇, 赵杰煜, 祝军(宁波大学信息科学与工程学院, 宁波 315211)

摘 要
目的 词袋模型在图像分类领域中的分类效果主要受限于局部特征的量化误差.针对这一点,提出一种融合多尺度码本的全局编码图像分类方法,有效减少特征量化误差.方法 通过使用多尺度特征密集采样,构建多尺度码本,使码本具备一种层次结构,通过充分利用图像特征的流形结构,计算码本全局信息,实现全局编码.通过本文方法得到的编码系数比较平滑和准确.最后使用多路径方法,分别将不同尺度的特征表示进行级联,得到最终的图像特征表示.这种特征表示具备了一定程度上的尺度不变性.结果 在UIUC-8和Caltech-101两个常用的标准图像数据集上进行测试,分类准确率分别达到88.0%和83.2%.结论 实验结果表明,相比于基于固定尺度码本的局部编码方法,本文方法在分类识别率方面有了显著提升.
关键词
Image classification based on global coding combined with multi-scale codebook

Dong Zhenyu, Zhao Jieyu, Zhu Jun(College of Information Science and Engineering, Ningbo University, Ningbo 315211, China)

Abstract
Objective The performance of the Bag-of-Words model in the field of image classification is limited mainly by the quantization error of the local feature. To reduce the quantization error of the local feature effectively, an image classification method based on global coding combined with multi-scale codebook is proposed. Method A global coding is implemented by utilizing fully the manifold structure of the image features and by computing the global information of the codebook. The coding coefficients obtained by the method are relatively smooth and accurate. Furthermore, a multi-path method is designed to integrate all feature representations to describe the image. To a certain extent, this method can achieve the scale invariance of feature representations. Conclusion Several experiments are conducted on two commonly used benchmark data sets,namely, UIUC-8 and Catltech-101, and the average classification accuracy rates reach up to 88.0% and 83.2%, respectively. Result Experimental results show that the proposed method improves the performance significantly compared with the fixed-scale locality-constrained coding methods.
Keywords

订阅号|日报