Neil Lawrence, Matthias Seeger, and Ralf Herbrich
January 2003
We present a framework for sparse Gaussian process (GP) methods which uses forward selection with criteria based on information-theoretical principles, previously suggested for active learning. In contrast to most previous work on sparse GPs, our goal is not only to learn sparse predictors (which can be evaluated in O(d) rather than O(n), d \ll n, n the number of training points), but also to perform training under strong restrictions on time and memory requirements. The scaling of our method is at most O(n d^2), and in large real-world classification experiments we show that it can match prediction performance of the popular support vector machine (SVM), yet it requires only a fraction of the training time. In contrast to the SVM, our approximation produces estimates of predictive probabilities (`error bars'), allows for Bayesian model selection and is less complex in implementation.
![]() PostScript file |
In: Advances in Neural Information Processing Systems 15
Publisher: MIT Press
All copyrights reserved by MIT Press 2003.
| Type: | Inproceedings |
| Pages: | 625-632 |