Analysis of Perceptron-Based Active Learning

  • Sanjoy Dasgupta ,
  • Adam Tauman Kalai ,
  • Claire Monteleoni

18th Annual Conference on Learning Theory, COLT 2005, Bertinoro, Italy, June 27-30, 2005 |

Published by Springer Berlin Heidelberg

Publication

We start by showing that in an active learning setting, the Perceptron algorithm needs Ω(1ϵ2)">Ω(1ϵ2)labels to learn linear separators within generalization error ε. We then present a simple selective sampling algorithm for this problem, which combines a modification of the perceptron update with an adaptive filtering rule for deciding which points to query. For data distributed uniformly over the unit sphere, we show that our algorithm reaches generalization error ε after asking for just O~(dlog1ϵ)">O~(dlog1ϵ) labels. This exponential improvement over the usual sample complexity of supervised learning has previously been demonstrated only for the computationally more complex query-by-committee algorithm.