The Interactive Visual Media Group develops state-of-the-art algorithms for digital photography and video. Our goal is to provide innovative experiences and tools for such media, for both consumers and visual media professionals.
The Interactive Visual Media group conducts state-of-the-art research in a variety of topics related to computer vision, computer graphics and computational photography. Our goal is to develop new applications for manipulating, reasoning about and communicating with visual media. Core areas of research include:
- 3D reconstruction and image segmentation
- object recognition
- image and video enhancement
Each year we hire exceptional PhD students for summer internships. Offers are generally made in December through March for the following summer. For more information please visit our intern webpage.
- Blink - burst photography for Windows Phone
- Image Composite Editor (ICE) - panorama stitcher for Windows desktop
- Cliplets - cinemagraph creator for Windows 8 and Windows desktop
- HD View - panorama viewer for web browsers running on Windows
- HD View Utilities - command-line panorama generation tools for Windows
- Photosynth Export Plug-in for Photoshop - panorama export plug-in for Photoshop
- JPEG XR Export Plug-in for Photoshop - image file format plug-in for Photoshop
- Microsoft COCO (Common Objects in Context) - image recognition and segmentation dataset
- 3D Video Data - images, depth maps, and calibration parameters
- Easy Camera Calibration Tool - flexible calibration tool for Windows
Other groups at Microsoft Research also conducting research in computer vision and computer graphics include:
Click here for a list of all our publications (for the most up-to-date lists, please see the individual members' webpages).
- Hyon Lim and Sudipta N. Sinha, Monocular Localization of a moving person onboard a Quadrotor MAV, in 2014 IEEE International Conference on Robotics and Automation, IEEE – Institute of Electrical and Electronics Engineers, 27 May 2015.
- Anitha Kannan and Simon Baker, Identifying Presentation Styles in Online Educational Videos, no. MSR-TR-2014-141, 6 November 2014.
- T.Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Dollár, and C. L. Zitnick, Microsoft COCO: Common Objects in Context, in ECCV, European Conference on Computer Vision, September 2014.
- C. Lawrence Zitnick and Piotr Dollár, Edge Boxes: Locating Object Proposals from Edges, in ECCV, European Conference on Computer Vision, September 2014.
- Johannes Kopf, Michael Cohen, and Richard Szeliski, First-person Hyperlapse Videos, in ACM Transactions on Graphics (Proc. SIGGRAPH 2014), vol. 33, no. 4, ACM – Association for Computing Machinery, August 2014.
- Jaesik Park, Sudipta N. Sinha, Yasuyuki Matsushita, Yu-Wing Tai, and In So Kweon, Calibrating a Non-isotropic Near Point Light Source using a Plane, in CVPR, Computer Vision and Pattern Recognition, 21 June 2014.
- Sudipta Sinha, Daniel Scharstein, and Richard Szeliski, Efficient High-Resolution Stereo Matching using Local Plane Sweeps, in CVPR, Computer Vision and Pattern Recognition, 21 June 2014.
- Bharath Hariharan, C. Lawrence Zitnick, and Piotr Dollár, Detecting Objects using Deformation Dictionaries, in CVPR, Computer Vision and Pattern Recognition, June 2014.
- Neel Joshi and C. Lawrence Zitnick, Micro-Baseline Stereo, no. MSR-TR-2014-73, 22 May 2014.
- Piotr Dollár, Ron Appel, Serge Belongie, and Pietro Perona, Fast Feature Pyramids for Object Detection, in PAMI, Pattern Analysis and Machine Intelligence, April 2014.
- Anandan (MSR India)
- Kentaro Toyama (MSR India)
- Zhengyou Zhang (MSR Redmond)
- Harry Shum (Live Search)
- Antonio Criminisi (MSR Cambridge)
- Sumit Basu (MSR Redmond)
- Nebojsa Jojic (MSR Redmond)
- Chuck Jacobs (MSR Redmond)
- David Salesin (University of Washington and Adobe)
- Steve Seitz (University of Washington)
- Shai Avidan (Adobe)
- Phil Torr (Oxford Brookes University)
- Ying Shan (Microsoft)
- Yaron Caspi (Weizmann)
- Chris Pal (University of Rochester)
- Matthew Brown (University of British Columbia)
- Simon Baker (NVIDIA)
- Piotr Dollár (Facebook)
- Wolf Kienzle (MSR Redmond)