Mathematical Programming for Data Mining: Formulations and Challenges

  • Paul S. Bradley
  • Usama M. Fayyad
  • Olvi L. Mangasarian

MSR-TR-98-04

This paper is intended to serve as an overview of a rapidly emerging research and applications area. In addition to providing a general overview and motivating the importance of data mining problems within the area of knowledge discovery in databases, we aim to list some of the pressing research challenges and to outline opportunities for contributions by the optimization research communities. Towards these goals, we include formulations of the basic categories of data mining methods as optimization problems. We also provide examples of successful mathematical programming approaches to some data mining problems.