The Interactive Visual Media Group develops state-of-the-art algorithms for digital photography and video. Our goal is to provide innovative experiences and tools for such media, for both consumers and visual media professionals.
Overview
The mission of the Interactive Visual Media Group is to develop state-of-the-art algorithms for digital photography and video with the goal of providing innovative experiences and tools for such media. Our target audience includes both consumers and visual media professionals. We develop experimental technology to analyze, retrieve, and manipulate media in order to create compelling new content or organize existing content. This expands the way in which people can interact with their personal media by providing novel photo and video creation and integration tools, as well as engaging viewing/playback experiences. At the same time, we try to impact Microsoft’s current and future offerings in these areas.
Projects
Some of the activities of our group are summarized at the following pages:
Areas of Research
Our core strengths lie in the areas of computer vision, image processing, and statistical signal processing, namely:
- Image and video enhancement techniques
- High-accuracy correspondence algorithms
- 3D reconstruction, image-based modeling and rendering
- Object recognition and image search technology
Downloads
All Publications
Click here for a list of all our publications.
- Dilip Krishnan, Raanan Fattal, and Richard Szeliski, Efficient Preconditioning of Laplacian Matrices for Computer Graphics, in ACM Transactions on Graphics (Proc. SIGGRAPH 2013), vol. 32, no. 4, ACM SIGGRAPH, July 2013
- J. Yan, S. Lin, S.B. Kang, and X. Tang, Learning the change for automatic image cropping, in IEEE Conference on Computer Vision and Pattern Recognition, IEEE, June 2013
- A. Joulin and S.B. Kang, Recovering stereo pairs from anaglyphs, in IEEE Conference on Computer Vision and Pattern Recognition, IEEE, June 2013
- J.-B. Huang, J. Kopf, N. Ahuja, and S.B. Kang, Transformation guided image completion, in IEEE International Conference on Computational Photography, IEEE, April 2013
- Kalyan Sunkavalli, Neel Joshi, Sing Bing Kang, Michael F. Cohen, and Hanspeter Pfister, Video snapshots: Creating high-quality images from video clips, in IEEE Transactions on Visualization and Computer Graphics, IEEE Computer Society, November 2012
- Sungju Hwang, Ashish Kapoor, and Sing Bing Kang, Context-based automatic local image enhancement, in European Conference on Computer Vision, Springer Verlag, October 2012
- Kevin Karsch, Ce Liu, and Sing Bing Kang, Depth extraction from video using non-parametric sampling, in European Conference on Computer Vision, Springer Verlag, October 2012
- Yekeun Jeong, David Nistér, Drew Steedly, Richard Szeliski, and In-So Kweon, Pushing the Envelope of Modern Methods for Bundle Adjustment, in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 34, no. 8, pp. 1605-1617, IEEE Computer Society, August 2012
- Hyon Lim, Sudipta Sinha, Michael Cohen, and Matt Uyttendaele, Real-time image-based 6-DOF localization in large-scale environments , in Computer Vision and Pattern Recognition (CVPR), IEEE Computer Society, 21 June 2012
- Neel Joshi, Sisil Mehta, Steven Drucker, Eric Stollnitz, Hugues Hoppe, Matt Uyttendaele, and Michael Cohen, Cliplets: Juxtaposing Still and Dynamic Imagery, no. MSR-TR-2012-52, 15 May 2012
Group Alumni
- Anandan (MSR India)
- Kentaro Toyama (MSR India)
- Zhengyou Zhang (MSR Redmond)
- Harry Shum (Live Search)
- Antonio Criminisi (MSR Cambridge)
- Sumit Basu (MSR Redmond)
- Nebojsa Jojic (MSR Redmond)
- Chuck Jacobs (MSR Redmond)
- David Salesin (University of Washington and Adobe)
- Steve Seitz (University of Washington)
- Shai Avidan (Adobe)
- Phil Torr (Oxford Brookes University)
- Ying Shan (Microsoft)
- Yaron Caspi (Weizmann)
- Chris Pal (University of Rochester)
- Matthew Brown (University of British Columbia)










