Robust Object Tracking via Adaptive Fusion of Multiple Correlation Filters

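Chen Qianru1,2, Liu Risheng1,2, Fan Xin1,2, Li Haojie1,2 (1. DUT-RU International School of Information Science & Engineering, Dalian University of Technology, Dalian 116621; 2. Key Laboratory for Ubiquitous Network and Service Software of Liaoning Province, Dalian 116621)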

Abstract
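Objective Object tracking remains challenging because a target in a complex scene may undergo pose variation, occlusion, and background clutter. Discriminative correlation filter methods are now applied widely and successfully to object tracking. The standard correlation filter method obtains a large number of training samples through cyclic shifts and uses the fast Fourier transform to accelerate solving for the filter, which gives it good real-time performance and robustness, but the negative training samples produced by boundary shifts degrade tracking performance. Spatially regularized correlation filter tracking introduces a spatial weight function that strengthens the filter over the target region. It enlarges the target search area but also increases computation time, and in scenes where the target deforms irregularly or the background resembles the target it also strengthens the background filter, which leads to tracking failure. To address these problems, a method that adaptively fuses multiple correlation filters is proposed.
Method The alternating direction method of multipliers is used to transform the unconstrained correlation filter problem into two subproblems of a constrained problem, and a different correlation filter method solves each subproblem. The standard correlation filter method first locates the target coarsely, and the spatially regularized correlation filter tracking method then relocates it, fine-tuning both the target position and the filter template and improving tracking performance.
Result The proposed algorithm is compared with several current mainstream trackers on the 100 videos of the OTB-2015 dataset, using center location error and bounding-box overlap rate as evaluation criteria. The algorithm handles multi-scale variation, pose variation, and background clutter well, and achieves the best tracking results on videos such as CarScale, Freeman4, and Girl. Over the 100 videos its average center location error is 28.55 pixels and its average bounding-box overlap rate is 61%. Compared with methods that use hand-crafted features, it performs better than the other algorithms on both measures. Compared with a correlation filter method that uses deep features, its average center location error is 6 pixels higher, but its average bounding-box overlap rate is 4% higher.
Conclusion Extensive experimental results show that the proposed algorithm achieves good accuracy and robustness when the target's appearance changes, for example through pose or scale variation.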
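The cyclic-shift training and Fourier-domain solution that the abstract attributes to the standard correlation filter can be illustrated with a minimal 1-D sketch. This is not the fused tracker proposed in the paper: it assumes a single-channel signal, a Gaussian desired response peaked at index 0, and a ridge parameter `lam`, and every helper name (`fft`, `ifft`, `gaussian_target`, `train_filter`, `detect`) is hypothetical. A small radix-2 FFT is included only to keep the example free of third-party dependencies.

```python
import cmath
import math


def fft(x, inverse=False):
    """Recursive radix-2 FFT; len(x) must be a power of two (unnormalized)."""
    n = len(x)
    if n == 1:
        return list(x)
    sign = 1 if inverse else -1
    even = fft(x[0::2], inverse)
    odd = fft(x[1::2], inverse)
    out = [0j] * n
    for k in range(n // 2):
        t = cmath.exp(sign * 2j * math.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + n // 2] = even[k] - t
    return out


def ifft(X):
    """Inverse FFT with the 1/n normalization applied once at the end."""
    n = len(X)
    return [v / n for v in fft(X, inverse=True)]


def gaussian_target(n, sigma=1.0):
    """Desired response: a Gaussian peak at index 0 with cyclic wrap-around."""
    return [math.exp(-0.5 * (min(i, n - i) / sigma) ** 2) for i in range(n)]


def train_filter(x, y, lam=1e-2):
    """Ridge regression over all cyclic shifts of x, solved per frequency bin."""
    X, Y = fft(x), fft(y)
    return [Xk.conjugate() * Yk / (Xk.conjugate() * Xk + lam)
            for Xk, Yk in zip(X, Y)]


def detect(H, z):
    """Return the cyclic shift at which the filter response to z peaks."""
    Z = fft(z)
    response = ifft([Hk * Zk for Hk, Zk in zip(H, Z)])
    return max(range(len(z)), key=lambda i: response[i].real)


if __name__ == "__main__":
    n = 16
    x = [math.sin(0.9 * i) + 0.5 * math.cos(2.3 * i + 1.0) for i in range(n)]
    H = train_filter(x, gaussian_target(n))
    shifted = x[-3:] + x[:-3]  # the same pattern moved 3 samples to the right
    print(detect(H, shifted))
```

Because every cyclic shift of `x` is implicitly a training sample, the per-bin division replaces a full linear solve. The wrap-around samples are also the source of the boundary effects that the abstract says spatial regularization is meant to mitigate.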
Keywords
Multi-correlation filters method for robust visual tracking

Chen Qianru1,2, Liu Risheng1,2, Fan Xin1,2, Li Haojie1,2 (1. DUT-RU International School of Information Science & Engineering, Dalian University of Technology, Dalian 116621, China; 2. Key Laboratory for Ubiquitous Network and Service Software of Liaoning Province, Dalian 116621, China)

Abstract
Objective Visual object tracking remains challenging because of target pose variation, occlusion, and background clutter in complex scenes. Discriminative correlation filter methods have recently been applied widely and successfully to visual tracking. The standard correlation filter method obtains many training samples by cyclic shifts and solves for the filter with the fast Fourier transform, which gives it good real-time performance and robustness. However, the negative training samples caused by boundary shifts degrade tracking performance. Trackers based on spatially regularized correlation filters strengthen the filter over the target area by introducing a spatial weight function, which makes the difference between positive and negative samples more pronounced. The target search area is enlarged, but computation time also increases. In addition, in complex scenes where the target deforms irregularly or the background resembles the target, the background filter is also strengthened, which leads to tracking failure.
Method To address these problems, this paper proposes a method that adaptively fuses multiple correlation filters. The alternating direction method of multipliers (ADMM) transforms the unconstrained correlation filter tracking problem into two subproblems with constraints, and each subproblem is solved by a different correlation filter method. Standard correlation filters first locate the target coarsely, and spatially regularized correlation filters then relocate it, adjusting the target position to improve tracking performance.
Result The algorithm is evaluated on the 100 videos of the OTB-2015 benchmark dataset and compared with other state-of-the-art trackers, using center location error and bounding-box overlap rate as evaluation criteria. The algorithm handles pose variation, scale variation, and occlusion, and achieves the best results on CarScale, Freeman4, Girl, and other videos. Over the 100 videos, the average center location error is 28.55 pixels and the average bounding-box overlap rate is 61%. The algorithm outperforms the compared methods that use hand-crafted features. Compared with a correlation filter method that uses deep features such as CNN features, its average center location error is 6 pixels higher, but its average bounding-box overlap rate is 4% higher.
Conclusion Extensive experimental results show that the algorithm achieves good accuracy and robustness under appearance changes such as pose and scale variation.
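The two evaluation criteria named in the Result part can be sketched as follows. The abstract does not spell out their formulas, so this assumes the conventions commonly used with OTB-style benchmarks: boxes are `(x, y, width, height)` tuples, center location error is the Euclidean distance between box centers in pixels, and overlap rate is intersection over union. The function names are hypothetical.

```python
import math


def center_error(box_a, box_b):
    """Euclidean distance in pixels between the centers of two boxes."""
    ax, ay, aw, ah = box_a
    bx, by, bw, bh = box_b
    return math.hypot((ax + aw / 2) - (bx + bw / 2),
                      (ay + ah / 2) - (by + bh / 2))


def overlap_rate(box_a, box_b):
    """Intersection over union of two axis-aligned (x, y, w, h) boxes."""
    ax, ay, aw, ah = box_a
    bx, by, bw, bh = box_b
    # Width and height of the intersection, clamped at zero when disjoint.
    iw = max(0.0, min(ax + aw, bx + bw) - max(ax, bx))
    ih = max(0.0, min(ay + ah, by + bh) - max(ay, by))
    inter = iw * ih
    union = aw * ah + bw * bh - inter
    return inter / union if union > 0 else 0.0


if __name__ == "__main__":
    predicted = (3, 4, 10, 10)
    ground_truth = (0, 0, 10, 10)
    print(center_error(predicted, ground_truth))
    print(overlap_rate(predicted, ground_truth))
```

Averaging these per-frame values over a video, and then over all videos, would give summary figures of the kind reported above. A lower center location error and a higher overlap rate both indicate better tracking, which is why the two measures can disagree when one tracker is compared with another.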
Keywords
