Share on Facebook Tweet on Twitter Share on LinkedIn Share by email
Computer Vision

Teaching computers to understand the visual world

The goal of computer vision is to make computers efficiently perceive, process, and understand visual data such as images and videos. The ultimate goal is for computers to emulate the striking perceptual capability of human eyes and brains-or even to surpass and assist the human in certain ways.

Within Microsoft Research, our computer-vision research include investigations into:

  • Imaging and Photogrammetry, including high-resolution cameras, radiometric calibration, photometric stereo, 3-D imaging and video, 3-D scene reconstruction from images and video, and image and video enhancement.
  • Pattern Recognition and Statistical Learning, including data clustering and classification, manifold learning, and high-dimensional geometry and statistics.
  • Object Detection and Recognition, including face detection, alignment, and tagging; video-based face recognition; and sparsity-based robust face recognition. We also investigate general object-class recognition and advanced medical-image analysis.
  • Image and Video Editing and Enhancement, including denoising and deblurring, novel representations for images and video, techniques for content-aware edits such as in-painting, and object removal. 

J. Margeta, A.Criminisi, D.C.Lee, and N.Ayache, Recognizing Cardiac Magnetic Resonance Acquisition Planes using Finetuned Convolutional Neural Networks, in To appear in Computer Methods in Biomechanics and Biomedical Engineering, December 2015.

Hao Fang, Saurabh Gupta, Forrest Iandola, Rupesh Srivastava, Li Deng, Piotr Dollar, Jianfeng Gao, Xiaodong He, Margaret Mitchell, John Platt, Lawrence Zitnick, and Geoffrey Zweig, From Captions to Visual Concepts and Back, in The proceedings of CVPR, IEEE – Institute of Electrical and Electronics Engineers, June 2015.

Toby Sharp, Cem Keskin, Duncan Robertson, Jonathan Taylor, Jamie Shotton, David Kim, Christoph Rhemann, Ido Leichter, Alon Vinnikov, Yichen Wei, Daniel Freedman, Pushmeet Kohli, Eyal Krupka, Andrew Fitzgibbon, and Shahram Izadi, Accurate, Robust, and Flexible Real-time Hand Tracking, CHI, April 2015.

Zhicheng Yan, Hao Zhang, Baoyuan Wang, Sylvain Paris, and Yizhou Yu, Automatic Photo Adjustment Using Deep Learning, ACM Transaction on Graphics, March 2015.

More publications ...