Sparse Spectral Sampling Gaussian Processes

Miguel Lazaro-Gredilla; Joaquin Quiñonero Candela

Sparse Spectral Sampling Gaussian Processes

Miguel Lazaro-Gredilla ,
Joaquin Quiñonero Candela

MSR-TR-2007-152 | November 2007

Download BibTex

In this work we introduce Power Spectral Density Sampling (PSDS), a new method for building a reduced rank approximation to a stationary kernel matrix. We apply this method in the framework of Gaussian Processes (GP). The resulting model can be trained in an online fashion: As each training sample arrives, we can include it in the model using only O(r²) time and storage, where r is the selected number of spectral samples used in the approximation. For n training points, the overall cost is O(nr²) time and O(r²) storage. The resulting GP is properly defined and model hyper-parameters can be selected by evidence maximization. Though it is especially well-suited for low dimensional problems, it can reach and even outperform other (batch) state-of-the-art sparse GP methods on higher dimensional datasets if allowed to work in batch mode, learning the locations of the spectral samples. We check this possibility on some large datasets.