Candidate Talk: Model Compression

Accurate models often are complex models. For example, ensembles often contain 100’s or 1000’s of base-level classifiers. This complexity makes ensembles more difficult to store, more expensive to execute, and also harder to interpret. Ultimately, this complexity restricts their use in applications where test sets are very large (e.g. web search and image/video recognition), where storage is at a premium (e.g. cell phones and digital cameras), and where computational power is limited (e.g. hearing aids and Mars rovers). In this talk I’ll present Model Compression, a method for compressing large, complex models into smaller, faster models without sacrificing the accuracy of the original model. To help motivate model compression, I’ll summarize our prior work on Ensemble Selection, a method for generating very complex ensembles that usually outperform bagging, boosting, random forests, stacking, and Bayesian averaging. With model compression, we can train classifiers that are nearly as accurate as ensemble selection, but which are more than 1000 times smaller and faster. I’ll also present a new algorithm for density estimation that we developed to make model compression more effective in some applications.

Speaker Details

Rich Caruana is an Assistant Professor in Computer Science at Cornell University. He received his Ph.D. at CMU in 1997 where he worked with Tom Mitchell and Herb Simon. Before joining the faculty at Cornell in 2001 was on the faculty at the Medical School at UCLA, and at CMU’s Center for Learning and Discovery (CALD). Rich’s research is in machine learning and data mining, and in the application of these to areas such medical informatics, ecology, and computer architecture design. He is perhaps best known for his work in inductive transfer, semi-supervised clustering, and optimizing learning for different performance criteria. Rich likes to mix algorithm development with applications work to insure that the new methods he develops actually work well in practice.

Date:
Speakers:
Rich Caruana
Affiliation:
Cornell University