![]() |
Sparse Bayesian Learning and the Relevance Vector Machine |
![]() |
Note: Mike Tipping has left Microsoft Research. However, as this is an established URL for the Relevance Vector Machine and since this work was done while Mike was at MSR, this page and its associated material will remain. A possibly more up-to-date page can be found at www.relevancevector.com .
Sparse Bayesian learning is the application of Bayesian automatic relevance determination (ARD) to models linear in their parameters. The "relevance vector machine" is a special case of the technique, applied to linear kernel models of the same form as the popular "support vector machine".
Slides from my lectures at the 2003 Tübingen "Machine Learning Summer School" are now available in ".ps.gz" format:
A comprehensive paper on sparse Bayesian learning from the Journal of Machine Learning Research:
|
|
Sparse Bayesian learning and the relevance vector machine.
Journal of Machine Learning Research 1, 211–244. [Abstract]
[Available from JMLR]
There are a couple of (very minor) typos in the above [corrections here] |
Two early conference publications on the Relevance Vector Machine:
|
|
The Relevance Vector Machine. In S. A. Solla, T. K. Leen, and K.-R. Müller (Eds.), Advances in Neural Information Processing Systems 12, pp. 652658. Cambridge, Mass: MIT Press. [Abstract] [gzipped PostScript] |
|
|
Variational relevance vector machines. In Proceedings of the 16th Conference in Uncertainty in Artificial Intelligence. Accepted to appear. [Abstract] [gzipped PostScript] |
Exploiting the sparse Bayes methodology to realise "sparse kernel PCA":
|
|
Tipping, M. E. (2001). Sparse kernel principal component analysis. In Advances in Neural Information Processing Systems 13. MIT Press. [Abstract] [gzipped PostScript] |
Robust sparse Bayesian regression:
|
|
Faul, A. and M. E. Tipping (2001). A variational approach to robust regression. In G. Dorffner, H. Bischof, and K. Hornik (Eds.), Proceedings of ICANN'01, pp. 95102. Springer. [Abstract] [gzipped PostScript] |
Some theoretical analysis of marginal likelihood optimisation and sparsity:
|
|
Faul, A. and M. E. Tipping (2002). Analysis of Sparse Bayesian Learning. In Proceedings of NIPS*01 [gzipped PostScript] |
An accelerated learning algorithm:
|
|
Tipping, M. E. and A. Faul (2003). Fast marginal likelihood maximisation for sparse Bayesian models. In Proceedings of Artificial Intelligence and Statistics '03 [gzipped PostScript] |
Some simple Matlab code to implement sparse Bayesian regression and classification models (e.g. like the RVM) is now available: [gzipped tar file]
Note that the software is supplied subject to an "end-user license agreement" (included in the package) which is standard for all freely-distributed Microsoft code. For a preview: click here
Machine Learning and Perception—Machine Learning—Relevance Vector Machine