![]() |
s u m i t * b a s u | |
|
background
|
I'm Sumit Basu, a Researcher in the Machine Learning Department at Microsoft Research, Redmond. My research focus is on developing interactive, machine-learning based power tools to assist users in understanding and extracting answers from complex data - teaching material/textbooks, computer systems, sensory signals like speech or music, scientific data, document collections, or the web. These power tools sometimes work by observing a user as they perform a task, then assisting them in their efforts once it understands what's going on; in other cases (as in teaching) they provide inputs to the user and adaptively refine their strategy based on what works best. The interactive aspect comes from having humans in a tight loop with the learning algorithm: instead of getting a big batch of labeled data, interactive learning tasks involve a delicate dance between the human and the algorithm to achieve sufficient performance with a minimum of operator effort. If you're a bright graduate student interested in such problems and curious about internship opportunities, drop me a line! A brief history: I received my BS (1995), MEng (1997), and PhD (2002) all from MIT in Electrical Engineering and Computer Science. I did my graduate work at the Media Lab with Professor Alex Pentland. My doctoral thesis, "Conversational Scene Analysis," examined how machine learning and signal processing techniques could be used to understand the structure of conversational interactions from auditory signals without recognizing words. The common thread through all of my work to date has been the combination of human interaction and machine learning; fortunately there are an endless array of application areas of this ilk, especially if one is flexible in one's definition of interaction. | |
| spotlight |
I just posted the
paper and
slides from our presentation on "Bilinear Logistic Regression for
Factored Diagnosis Problems" at SLAML 2011, including a helpful tutorial on
how to use appropriate statistical tests to determine whether learned
parameters are significant and how to estimate false detection rates in the
absence of ground truth labels. | |
|
current projects |
Teaching with Machine Learning: using machine learning to help humans of all ages learn better. Sho: a powerful interactive environment for scientific computing and prototyping based on IronPython. Find out more and download it here. Also check out this code for getting real-time skeleton data from Kinect in Sho. Songsmith: a songwriting tool that takes melodies and helps develop accompaniments for them: based on this research with Dan Morris, it's now a product (with much help from the MSR Advanced Development Team). Check it out and download the trial here. It's also now free to many educational institutions via MSDN Academic Alliance and the Innovative Teachers' Network. StickySorter: a tool for doing affinity diagramming and other flavors of information organization I developed with Julie Guinn and Office Labs: you can download it here. Music Analysis/Synthesis: using machine learning to help users understand, manipulate, and create music Systems and Machine Learning: using machine learning to address problems in computer systems Conversational Scene Analysis: seeking structure and content from conversational patterns
| |
|
recent papers |
Sumit Basu, John Dunagan, Kevin Duh, and Kiran-Kumar Munuswamy-Reddy. "Bilinear Logistic Regression for Factored Diagnosis Problems." Operating Systems Review, 45(3):31-38. December, 2011. Also presented at the SLAML workshop at SOSP 2011. [paper] [slides] Steven M. Drucker, Danyel Fisher, and Sumit Basu. "Helping Users Sort Faster with Adaptive Machine Learning Recommendations." In Proceedings of INTERACT 2011. September, 2011. Sumit Basu, Danyel Fisher, Steven M. Drucker, and Hao Lu. "Assisting Users with Clustering Tasks by Combining Metric Learning and Classification." In Proceedings of AAAI 2010. July, 2010. Andrew Guillory, Sumit Basu, and Dan Morris. "User-Specific Learning for Recognizing a Singer's Intended Pitch." In Proceedings of AAAI 2010. July, 2010. Eric Nichols, Dan Morris, Sumit Basu, and Christopher
Raphael. "Relationships Between Lyrics and Melody in Popular Music."
In Proceedings of ISMIR 2009. October, 2009. Eric Nichols, Dan Morris, and Sumit Basu. "Data-Driven Exploration of Musical Chord Sequences." In Proceedings of Intelligent User Interfaces 2009 (IUI '09). January, 2009. Michael Gamon, Sumit Basu, Dmitriy Belenko, Danyel Fisher, Matthew Hurst, Arnd Christian König. "BLEWS - Using Blogs to Provide Context for News Articles." In Proceedings of the International Conference on Weblogs and Social Media (ICWSM). April, 2008. [paper] [video demo] Dan Morris, Ian Simon, and Sumit Basu. "Exposing Parameters of a Trained Dynamic Model for Interactive Music Creation." In Proceedings of AAAI 2008. June, 2008. Ian Simon, Dan Morris, and Sumit Basu. "MySong: Automatic Accompaniment Generation for Vocal Melodies." In Proceedings of Computer-Human Interaction 2008 (CHI '08). April, 2008. Sumit Basu, Surabhi Gupta, Milind Mahajan, Patrick Nguyen, and John C. Platt. "Scalable Summaries of Spoken Conversations." In Proceedings of Intelligent User Interfaces 2008 (IUI '08). January, 2008. [slides] Sumit Basu, John Dunagan, and Greg Smith. "Why Did My PC Suddenly Slow Down?" In Proceedings of the Systems and Machine Learning Workshop 2007. April, 2007. Ashish Kapoor, Eric Horvitz, and Sumit Basu. "Selective Supervision: Guiding Supervised Learning with Decision-Theoretic Active Learning." In Proceedings of the Int'l. Joint Conf. on AI (IJCAI '07). January, 2007. Karthik Gopalratnam, Sumit Basu, John Dunagan, and Helen Wang. "Automatically Extracting Fields from Unknown Network Protocols." In Proceedings of the Systems and Machine Learning Workshop 2006 (SysML'06). June, 2006. Sumit Basu. "Acoustic Echo Cancellation in a Channel with Rapidly Varying Gain." In Proceedings of the the Int'l Conf. on Multimedia and Expo 2006 (ICME'06). July, 2006. Jay Stokes, John Platt, and Sumit Basu. "Speaker Identification Using a Microphone Array and a Joint HMM with Speech Spectrum and Angle of Arrival." In Proceedings of the the Int'l Conf. on Multimedia and Expo 2006 (ICME'06). July, 2006. Nebojsa Jojic, Sumit Basu, and Nemanja Petrovic. "Home Video Browsing and Consumption Through Exploration of a Learned Generative Model." (Video Submission). In Proceedings of CVPR'06. June 2006. Nemanja Petrovic, Aleksandar Ivanovic, Nebojsa Jojic, Sumit Basu, and Thomas Huang. "Recursive Estimation of Generative Models of Video." In Proceedings of CVPR'06. June 2006. Ian Simon, Sumit Basu, David Salesin, and Maneesh Agrawala. "Audio Analogies: Creating New Music from an Existing Performance by Concatenative Synthesis." In Proceedings of the Int'l Conf. on Comp. Music 2005. August, 2005. Ashish Kapoor and Sumit Basu. "The Audio Epitome: A New Representation for Modeling and Classifying Auditory Phenomena." In Proceedings of ICASSP 2005. May, 2005. Tanzeem Choudhury and Sumit Basu. "Modeling Conversational Dynamics as a Mixed-Memory Markov Process." In Proceedings of NIPS 2004. December, 2004. Sumit Basu. "Mixing with Mozart." In Proceedings of the Int'l. Comp. Music Conf. (ICMC) 2004. Miami. November, 2004. T. Paek, M. Agrawala, S. Basu, S. Drucker, T. Kristjansson, R. Logan, K. Toyama & A. Wilson. Toward universal mobile interaction for shared displays. Proceedings of Computer Supported Cooperative Work (CSCW), 2004, pp. 266-269. Nebojsa Jojic, Sumit Basu, Nemanja Petrovic, Brendan Frey, and Thomas Huang. "Joint design of Data Analysis Algorithms and User Interface for Video Applications." In Proceedings of the Machine Learning Meets the User Interface Workshop (MLUI) at NIPS 2003. Vancouver, BC. December, 2003. Tanzeem Choudhury, Brian Clarkson, Sumit Basu, and Alex Pentland. "Learning Communities: Connectivity and Dynamics of Interacting Agents." Proceedings of the International Joint Conference on Neural Networks (IJCNN'03), Special Session on on Autonomous Mental Development. July 2003. Sumit Basu, "A Linked-HMM Model for Voicing and Speech Detection." Appears in the Proceedings of the IEEE Conf. on Acoustics, Speech, and Signal Processing ( ICASSP 2003). Hong Kong. May, 2003. For older papers, please check here.
| |
|
community activities |
Senior PC Member, Intelligent User Interfaces 2009 (IUI'09), 2010, 2011. PC Member, AAAI 2011 NECTAR Track Co-Chair (with Ashish Kapoor),
Workshop on
Analysis and Design of Algorithms for Interactive Machine Learning
(ADA-IML'09) at NIPS 2009. Co-Chair (with Archana Ganapathi, Emre Kiciman, and Fei Sha), MLSys'07: Workshop on Statistical Learning Techniques for Systems Problems at NIPS 2007. Publicity Chair, NIPS 2007. You can see the poster I designed for the conference here. PC Member, Systems and Machine Learning Workshop, 2007 (SysML 2007). at NSDI 2007. For fall quarter 2007, Emre Kiciman and I taught a graduate course on Systems Applications of Machine Learning (cse599n) at the University of Washington. Other quarters, I co-taught the Markovia Seminar (cse590mv) on Machine Learning with Tanzeem Choudhury at the University of Washington.
|
|
|
hobbies |
singing/songwriting, guitar, music of almost any
kind, sewing, drawing, and poetry.
| |
|
contact |
s u m i t b @ m i c r o s o f t . c o m, one microsoft way, redmond, wa 98052 | |