Online integrative template-based model representation for visual object tracking

Yasin Musa; Zhao Chunxia

Published: 2015-08-27
Abstract views: 3858
Full-text downloads: 575
DOI: 10.11834/jig.20150907
2015 | Volume 20 | Number 9

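Yasin Musa^1,2, Zhao Chunxia^1 (1. School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, China; 2. School of Mechanical Engineering, Xinjiang University, Urumqi 830046, China)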

Abstract

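Objective In visual object tracking, the target is often affected by complex interference arising from the target itself or from the scene, which makes it very challenging to capture the target of interest correctly. In particular, the template data used by a tracker are obtained mainly through online learning, and their reliability directly affects the accuracy of the appearance model representation of candidate samples. To address target template learning and candidate appearance model representation, this paper adopts a more effective template organization strategy and a more accurate model representation technique, and proposes a novel visual object tracking algorithm. Method In the tracking framework, the appearance model representation of a candidate sample is formulated as a linear regression problem composed of a set of integrative templates and a minimum reconstruction error. First, classical incremental principal component analysis is used to learn a set of low-dimensional subspace basis vectors (positive templates) from high-dimensional online data, and special negative samples are drawn online in real time according to the previous tracking result to extend the target template data. The newly organized template basis vectors, together with independent and identically distributed Gaussian-Laplacian mixed noise, are then used to fit the candidate appearance model linearly. Finally, the maximum likelihood between the candidate sample and the true target is estimated, so that the tracker can accurately capture the true target state at every time step. Result Experiments on several widely recognized test video sequences show that, in target template learning and candidate appearance model representation, the proposed algorithm reflects the complex changes of the target state in video scenes more accurately and effectively than comparable methods, and that it copes well with model degradation and tracking drift under various uncertain interference factors. Compared with several excellent algorithms of the same kind, it achieves the same or even higher tracking accuracy. Conclusion The proposed algorithm learns fairly accurate target templates online and updates them periodically, so that the tracker adapts well to changes in visual information caused by intrinsic or extrinsic factors (pose, illumination, occlusion, scale, background clutter, motion blur, etc.) and stays in its best state throughout. The appearance model representation of candidate samples therefore becomes more reliable and accurate, which yields more robust performance.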

Keywords

online learning; integrative template; model representation; visual object tracking

Online integrative template-based model representation for visual object tracking

Yasin Musa^1,2, Zhao Chunxia^1 (1. School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, China; 2. School of Mechanical Engineering, Xinjiang University, Urumqi 830046, China)

Abstract

Objective Visual object tracking is a process that continuously infers the state of a target in unconstrained scenes. It is commonly formulated as a search (or classification) problem that identifies the candidate that best matches the target template as the tracking result. The target template is maintained over time and updated online once the tracking result is available. Before tracking at the current time step, a set of candidates is sampled around the state of the target at the previous time step. Both the target template and the candidates are represented by an appearance model. A target search strategy is then employed to find the candidate that best matches the template. Although several excellent visual tracking methods exist, the area remains an active research topic because of unresolved challenges in both template learning and appearance modeling. From the point of view of appearance modeling, extracting representative templates from online data is the core problem and plays a key role in complex scenes where the target state changes significantly over time. Method In the proposed tracking framework, several low-dimensional basis vectors, called positive templates, are learned from high-dimensional online data by using the online PCA algorithm. Several negative templates are then sampled according to the last tracking result. The object templates are organized by combining the positive and negative templates, and each target candidate is represented by the online learned integrative templates with additive Gaussian-Laplacian noise. Finally, the maximum likelihood between the target candidate and the real object is estimated. Thus, the tracker can capture accurate information on the real object in each frame. A template update strategy is used to refresh the object templates during tracking.
Result Compared with a positive-only template learning approach, the online integrative templates capture more comprehensive information on the target object, because the online negative template expansion produces strong discrimination between target candidates and background data. In other words, positive templates help the tracker find the most likely target, while negative templates explicitly represent the background data and help the tracker avoid drifting. Thus, the tracker retains the ability to identify the most likely target candidate. Extensive experiments are conducted to validate the new algorithm. The tracker can learn fairly accurate object templates and update them at fixed intervals, adapt well to variations caused by intrinsic or extrinsic factors (pose, illumination, occlusion, scaling, background clutter, motion blur, etc.), and maintain favorable performance. Conclusion Although template learning with online PCA is a widely used feature extraction method for computer vision problems (e.g., visual object tracking) and the learned templates contain representative information on the target object, these templates alone are not representative enough and need to be enhanced with additional information on the object to adapt well to uncertain, complex variations. In this paper, two core issues in visual object tracking (online template learning and appearance modeling) are studied. Detailed descriptions of an efficient template organization strategy and an accurate model representation technique are provided, and a novel visual object tracking framework is proposed. The proposed algorithm automatically extracts useful integrative templates of the object from online data and updates them. Hence, the model representation exhibits strong robustness and improved tracking accuracy.
Experiments on many challenging image sequences demonstrate that the proposed method achieves comparable or even better results than several state-of-the-art tracking algorithms.
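
The representation step described under Method can be illustrated with a small sketch. It is not the authors' implementation, and the abstract does not give the optimization details, so several assumptions are made here: the Gaussian-Laplacian noise is modeled as a dense Gaussian residual plus a sparse Laplacian error `e`, the fit minimizes `0.5*||y - A z - e||^2 + lam*||e||_1` by alternating least squares and soft-thresholding, and the likelihood is taken as `exp(-cost)`. The names `represent`, `likelihood`, `lam`, and `scale` are hypothetical, the templates are random stand-ins rather than the output of incremental PCA, and the way negative-template coefficients enter the likelihood is left out because the abstract does not state it.

```python
import numpy as np


def soft_threshold(x, lam):
    """Element-wise shrinkage: closed-form minimiser of 0.5*(x - e)^2 + lam*|e|."""
    return np.sign(x) * np.maximum(np.abs(x) - lam, 0.0)


def represent(y, templates, lam=0.1, n_iter=100):
    """Fit y ~ templates @ z + e (assumed model, see lead-in).

    The Gaussian part of the noise is the dense residual, the Laplacian
    part is the sparse vector e (e.g. occluded pixels).
    """
    pinv = np.linalg.pinv(templates)  # reused in every least-squares step
    e = np.zeros_like(y)
    for _ in range(n_iter):
        z = pinv @ (y - e)                           # coefficients given e
        e = soft_threshold(y - templates @ z, lam)   # sparse error given z
    residual = y - templates @ z - e
    cost = 0.5 * residual @ residual + lam * np.abs(e).sum()
    return z, e, cost


def likelihood(cost, scale=1.0):
    """Assumed observation likelihood: higher for lower representation cost."""
    return np.exp(-cost / scale)


# Stand-in data: 5 orthonormal positive templates and 3 negative templates.
rng = np.random.default_rng(0)
d = 100
positive, _ = np.linalg.qr(rng.standard_normal((d, 5)))
negative = rng.standard_normal((d, 3))
negative /= np.linalg.norm(negative, axis=0)
templates = np.hstack([positive, negative])  # integrative template set

# A candidate lying in the positive subspace, with 5 "occluded" pixels.
candidate = positive @ np.array([2.0, -1.0, 0.5, 0.0, 1.0])
candidate[:5] += 5.0

z, e, cost = represent(candidate, templates)
print("pixels with largest sparse error:", np.sort(np.argsort(-np.abs(e))[:5]))
print("likelihood:", likelihood(cost))
```

In a tracker, each sampled candidate would be scored this way and the one with the highest likelihood kept as the tracking result; the sparse term lets occluded pixels be explained by `e` instead of distorting the template coefficients.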

Keywords

online learning; integrative template; model representation; visual object tracking
