Oliver Williams, Andrew Blake, and Roberto Cipolla
This paper addresses the problem of applying powerful pattern recognition algorithms based on kernels to efficient visual tracking. Recently Avidan has shown that object recognizers using kernel-SVMs can be elegantly adapted to localization by means of spatial perturbation of the SVM, using optic flow. Whereas Avidan’s SVM applies to each frame of a video independently of other frames, the benefits of temporal fusion of data are well known. This issue is addressed here by using a fully probabilistic ‘Relevance Vector Machine’ (RVM) to generate observations with Gaussian distributions that can be fused over time. To improve performance further, rather than adapting a recognizer, we build a localizer directly using the regression form of the RVM. A classification SVM is used in tandem, for object verification, and this provides the capability of automatic initialization and recovery.fl The approach is demonstrated in real-time face and vehicle tracking systems. The ‘sparsity’ of the RVMs means that only a fraction of CPU time is required to track at frame rate. Tracker output is demonstrated in a camera management task in which zoom and pan are controlled in response to speaker/vehicle position and orientation, over an extended period. The advantages of temporal fusion in this system are demonstrated.
|Published in||Proceedings of the Ninth IEEE International Conference on Computer Vision (ICCV 2003)|
|Publisher||IEEE Computer Society|
Copyright © 2007 IEEE. Reprinted from IEEE Computer Society. This material is posted here with permission of the IEEE. Internal or personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution must be obtained from the IEEE by writing to firstname.lastname@example.org. By choosing to view this document, you agree to all provisions of the copyright laws protecting it.
Oliver Williams. Bayesian Learning for Efficient Visual Inference, September 2005.
Oliver Williams, Andrew Blake, and Roberto Cipolla. Sparse Bayesian Regression for Efficient Visual Tracking, IEEE Transactions on Pattern Analysis and Machine Intelligence, IEEE Computer Society, August 2005.