Share this page
Share this page E-mail this page Print this page RSS feeds
Home > Publications > Unbiased Assessment of Learning Algorithms
Unbiased Assessment of Learning Algorithms

In order to rank the performance of machine learning algorithms, many researchs conduct experiments on benchmark datasets. Since most learning algorithms have domain-specific parameters, it is a popular custom to adapt these parameters to obtain a minimal error rate on the test set. The same rate is used to rank the algorithm which causes an optimistic bias. We quantify this bias, showing in particular that an algorithm with more parameters will probably be ranked higher than an equally good algorithm with fewer parameters. We demonstrate this result, showing the number of parameters and trials required in order to pretend to outperform C4.5 or FOIL, respectively, for various benchmark problems. We then describe how unbiased ranking experiments should be conducted.

scheher97.ps
PostScript file

In: Proceedings of the International Joint Conference on Artificial Intelligence

Publisher: Morgan Kaufmann Publishers
All copyrights reserved by Morgan Kaufmann Publishers 1997.

Details

Type: Inproceedings
Pages: 798–803