Understanding and Diagnosing Visual Tracking Systems
Several benchmark datasets for visual tracking research have been proposed in recent years. Despite their usefulness, whether they are sufficient for understanding and diagnosing the strengths and weaknesses of different trackers remains questionable.
近年来,人们提出了几种用于视觉跟踪研究的基准数据集。尽管它们很有用,但它们是否足以理解和诊断不同跟踪器的优缺点仍然值得怀疑。
To address this issue, we propose a framework by breaking a tracker down into five constituent parts, namely, motion model, feature extractor, observation model, model updater, and ensemble post-processor.
为了解决这个问题,作者提出了一个框架,将跟踪器分解为五个组成部分,即运动模型、特征提取器、观察模型、模型更新器和集成后处理器。
We then conduct ablative experiments on each component to study how it affects the overall result. Surprisingly, our findings are discrepant with some common beliefs in the visual tracking research community. We find that the feature extractor plays the most import