Current Issue Cover
加权间隔结构化支持向量机目标跟踪算法

江少杰1,2, 宁纪锋1,2, 李云松2(1.西北农林科技大学信息工程学院, 杨凌 712100;2.西安电子科技大学综合业务网理论及关键技术国家重点实验室, 西安 710071)

摘 要
目的 目标在跟踪过程中,各种因素的干扰使得跟踪结果存在不确定性。因此,将跟踪过程中所提取样本的可靠性融入跟踪模型中,有助于克服低可靠性样本对跟踪算法的影响。为此,基于最近的结构化支持向量机(SSVM)跟踪算法,提出一种包含样本置信度的加权间隔结构化支持向量机跟踪模型(WMSSVM),以增强SSVM跟踪算法性能。方法 首先,基于打分和位置重合率估计样本可靠性;其次,建立WMSSVM模型处理具有不同置信度的跟踪样本训练问题,并采用对偶坐标下降优化算法求解跟踪模型。结果 在包含100个视频的OTB100跟踪数据集上进行测试,提出的WMSSVM跟踪器与基准跟踪器Scale-DLSSVM相比,在精准度和成功率两个指标上分别提高了1%和2%。与最近的跟踪算法相比,提出的方法也表现出更好的性能。结论 本文首次将样本的可靠性融入结构化支持向量机跟踪模型,并提出一种基于加权间隔的结构化支持向量机跟踪模型及其优化求解方法,在包含100个视频序列的跟踪数据集上验证了提出方法的有效性,本文提出的算法能够适应复杂场景下的跟踪任务,并在背景混杂、目标形变、遮挡、运动模糊、目标出界、快速位移等类别的视频中表现出优异的性能。
关键词
Object tracking algorithm via weighted margin structured support vector machine

Jiang Shaojie1,2, Ning Jifeng1,2, Li Yunsong2(1.College of Information, Northwest A & F University, Yangling 712100, China;2.The State Key Laboratory of Integrated Service Networks, Xidian University, Xi'an 710071, China)

Abstract
Objective The distraction of various factors during the object-tracking process makes tracking states unpredictable.For example,when a tracked object is occluded,ordinary trackers are likely to experience model drift problems.This problem occurs when the target is temporarily influenced by other objects or illumination conditions.The training samples extracted from these frames are not as reliable as those extracted from normal scenes.If these samples are fed into the model-updating process,then the tracking model deviates due to these unreliable samples,and the decision boundary between target samples and background samples becomes blurry.Consequently,the discriminability of the tracking model is degraded,which in turn,causes the model drift problem.If we can tag the training samples with different confidence levels and use the tagged samples to train the tracking model,then the tracker is expected to be tolerant of unexpected scenes.Method We base our work on the weighted margin support vector machine (WMSVM) classifier and the recently proposed structured support vector machine (SSVM) tracking algorithm.A weighted margin SSVM tracking model (WMSSVM) is proposed,which can consider the confidence values of training samples,thereby enabling the SSVM tracking algorithm to adapt to different scenes.First,sample confidence is estimated according to the overlap rate of the score range of the tracker and the predicted target positions of the current and last frames.This method can reliably reflect the confidence variation status through the entire tracking process.Second,a WMSSVM tracking model is built to train the samples of different confidence levels,such that these samples can have different influences on the decision boundary of the trained tracker.Accordingly,the WMSSVM model can adaptively adjust to different scenes encountered during tracking.Finally,we demonstrate that the tracking model can be solved by using a dual coordinate descent algorithm.Result We evaluate our tracker on an object tracking benchmark,namely,OTB-100,which contains 100 challenging video sequences.Compared with the base tracker dual linear structured support vector machine (DLSSVM),the proposed WMSSVM presents an increase of 1% in one-pass evaluation (OPE) precision and 2% in OPE overlap.It indicates that the proposed WMSSVM model performs better than the original SSVM model used by scale-DLSSVM in OPE precision and OPE success.We then determine that the tracking algorithm that uses the WMSSVM model is upgraded by 3.4% in background clutter scene,3.4% in deformation scene,1.4% in fast-motion scene,3.4% in motion blur scene,3.6% in occlusion scene,and 4.3% in out-of-view scene.The aforementioned scenes are typical difficult scenes that exist in a tracking area.The proposed tracker performs worse than the original scale-DLSSVM tracker in a sequence called Shaking (its OPE precision score is reduced from 0.459 to 0.056).The reason for this result is the drastic illumination variation that occurs at approximately the 60th frame,which lasts for approximately nine frames,and influences nearly the entire camera scene.This condition is difficult for the tracker to deal with.The proposed WMSSVM tracker also shows promising results compared with other state-of-the-art tracking algorithms.Although our tracker scores lower than the hierarchical convolutional features for visual tracking (HCFT) in OPE precision,two observations are made.First,the OPE precision evaluation metric utilizes a fixed distance of 20 pixels between the predicted target position and the ground truth position,whereas the OPE success metric uses the area under curve as the metric score.Thus,the better performance of WMSSVM in OPE success is more convincing.Second,HCFT uses a high-dimensional feature from an off-line neural network trained from a large object detection dataset,which is time-consuming.By contrast,the proposed WMSSVM uses only an online-extracted features consisting of Lab color feature and local rank transformation feature.Conclusion In this study,the confidence of samples is first incorporated into the SSVM tracking model,and then the WMSSVM tracking model and its optimization method are proposed.The effectiveness of the proposed method is validated on a tracking benchmark dataset with 100 sequences.The proposed method can track objects in complicated scenes,and it exhibits remarkable performance in videos with background clutter,deformation,occlusion,motion blur,and fast motion.
Keywords

订阅号|日报