The Multimedia, Interaction, and Communication (MIC) group extends the state of the art of multimedia technologies involving audio, visual, haptic, and other natural signals, comprising acquisition, representation, analysis, compression, transmission, synthesis, and rendering. We apply our expertise in computer vision, acoustics, multimedia signal processing, and information coding to improve people's experience in interacting with each other and with machines.
The applications the MIC group has been working on are immersive human-human telecommunications, human-robot interaction, augmented reality, multimedia retrieval, etc.
The MIC group is the successor of the Communication and Collaboration Systems group. Please visit http://research.microsoft.com/ccs for our earlier work.
The MIC group is managed by Zhengyou Zhang.
We are accepting intern applications. Please visit mic-intern2013 for details.
- Akansel Cosgun, Dinei A. Florencio, and Henrik I. Christensen, Autonomous Person Following for Telepresence Robots, in 2013 IEEE International Conference on Robotics and Automation (ICRA 2013) , IEEE, May 2013
- Cha Zhang and Dinei Florencio, Analyzing the Optimality of Predictive Transform Coding Using Graph-Based Models, in IEEE Signal Processing Letters, IEEE, January 2013
- C. Zhang, Q. Cai, P. Chou, Z. Zhang, and R. Martin-Brualla, Viewport: A Fully Distributed Immersive Teleconferencing System with Infrared Dot Pattern, in IEEE MultiMedia, vol. 20, no. 1, pp. 17-27, IEEE Computer Society, 2013
- Gang Yu, Junsong Yuan, and Zicheng Liu, Predicting Human Activities using Spatio-Temporal Structure of Interest Points, in Multimedia (ACMMM), ACM, 29 October 2012
- Jiang Wang, Zicheng Liu, Jan Chorowski, Zhuoyuan Chen, and Ying Wu, Robust 3D Action Recognition with Random Occupancy Patterns, in 12th European Conference on Computer Vision (ECCV), Springer, 7 October 2012
- Gang Yu, Junsong Yuan, and Zicheng Liu, Propagative Hough Voting for Human Activity Recognition, in 12th European Conference on Computer Vision (ECCV), Springer, 7 October 2012
- Douglas Macharet and Dinei Florencio, A Collaborative Control System for Telepresence Robots, in 2012 IEEE/RSJ Int. Conf. on Intelligent Robots and Systems (IROS’12), IEEE, October 2012
- Flavio Ribeiro, Dinei Florencio, Philip Chou, and Zhengyou Zhang, Auditory Augmented Reality: Object Sonification for the Visually Impaired, in MMSP, IEEE, September 2012
- Shujie Liu, Philip A. Chou, Cha Zhang, Zhengyou Zhang, and Chang Wen Chen, Virtual View Reconstruction Using Temporal Information, in Int'l Conference on Multimedia and Expo (ICME), IEEE, July 2012
- flavio Ribeiro, Dinei Florencio, Demba Ba, and Cha Zhang, Geometrically Constrained Room Modeling with Compact Microphone Arrays, in IEEE Transactions on Audio, Speech, and Language Processing, IEEE, July 2012
- Jiang Wang, Zicheng Liu, Ying Wu, and Junsong Yuan, Mining Actionlet Ensemble for Action Recognition with Depth Cameras, IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), 18 June 2012
- Yingli Tian, Liangliang Cao, Zicheng Liu, and Zhengyou Zhang, Hierarchical Filtered Motion for Action Recognition in Crowded Videos, in IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS—PART C, May 2012
- Ismael Daribo, Dinei Florencio, and Gene Cheung, Arbitrarily Shaped Sub-Block Motion Prediction in Texture Map Compression using Depth Information, IEEE, May 2012
- Zhengyou Zhang, Microsoft Kinect Sensor and Its Effect, in IEEE MultiMedia, vol. 19, no. 2, pp. 4-12, IEEE Computer Society, April 2012
- Cha Zhang, Qin Cai, Philip Chou, Zhengyou Zhang, and Ricardo Martin-Brualla, Viewport: A Fully Distributed Immersive Teleconferencing System with Infrared Dot Pattern, no. MSR-TR-2012-60, 1 April 2012
- Jianfeng Wang, Cha Zhang, Wenwu Zhu, Zhengyou Zhang, Zixiang Xiong, and Philip A. Chou, 3D Scene Reconstruction by Multiple Structured-Light Based Commodity Depth Cameras, in Int'l Conf. on Acoustics, Speech, and Signal Processing, IEEE, March 2012
- Dinei Florencio and Cormac Herley, Is Everything We Know About Password Stealing Wrong?, in IEEE Security and Privacy magazine, 2012
- Ziqing Mao, Dinei Florencio, and Cormac Herley, Painless Migration from Passwords to Two Factor Authentication, in WIFS, IEEE SPS, 29 November 2011
- Gang Yu, Junsong Yuan, and Zicheng Liu, Real-time Human Action Search using Random Forest based Hough Voting, in Multimeda (ACMMM), ACM, 28 November 2011
- Sanjeev Mehrotra, Wei-ge Chen, and Zhengyou Zhang, Interpolation of Combined Head and Room Impulse Response for Audio Spatialization, in Proceedings of MMSP, IEEE, October 2011
- Myung-Suk Song, Cha Zhang, Dinei Florencio, and Hong-Goo Kang, An Interactive 3-D Audio System With Loudspeakers, in IEEE Transactions on Multimedia, IEEE, October 2011
- Sanjeev Mehrotra, Zhengyou Zhang, Qin Cai, Cha Zhang, and Philip A. Chou, Low-Complexity, Near-Lossless Coding of Depth Maps from Kinect-Like Depth Cameras, in Proceedings of MMSP, IEEE, October 2011
- Flavio Ribeiro and Dinei Florencio, Region of Interest Determination Using Human Computation, IEEE SPS, October 2011
- Shu Shi and Zhengyou Zhang, ViewMark: An Interactive Videoconferencing System for Mobile Devices, in International Workshop on Multimedia Signal Processing (MMSP), IEEE, October 2011
- Flavio Ribeiro, Dinei Florencio, and Vitor Nascimento, Crowdsourcing Subjective Image Quality Evaluation, in ICIP, IEEE SPS, September 2011
- Sanjeev Mehrotra, Wei-ge Chen, Zhengyou Zhang, and Philip A. Chou, Realistic audio in immersive video conferencing , in Int'l Conf. on Multimedia and Expo (ICME), IEEE, July 2011
- Gang Yu, Junsong Yuan, and Zicheng Liu, Unsupervised Random Forest Indexing for Fast Action Search, IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), 20 June 2011
- Dinei Florencio and Cormac Herley, Where Do All the Attacks Go?, no. MSR-TR-2011-74, June 2011
- Dinei Florencio and Cormac Herley, Sex, Lies and Cyber-crime Surveys, no. MSR-TR-2011-75, June 2011
- Sasa Junuzovic, Kori Inkpen, Rajesh Hegde, and Zhengyou Zhang, Towards ideal window layouts for multi-party, gaze-aware desktop videoconferencing, in Graphics Interface 2011, Canadian Human-Computer Communications Society, 25 May 2011
- Dinei Florencio and Li-wei He, Enhanced Adaptive Playout Scheduling and Loss Concealment Techniques for Voice over IP Networks, in ISCAS 2011, Institute of Electrical and Electronics Engineers, Inc., May 2011
- Sasa Junuzovic, Kori Inkpen, Rajesh Hegde, Zhengyou Zhang, John Tang, and Christopher Brooks, What did I miss? In-Meeting Review using Multimodal Accelerated Instant Replay (AIR) Conferencing, in CHI 2011, ACM Conference on Computer-Human Interaction, May 2011
- Flávio Ribeiro, Dinei Florencio, Cha Zhang, and Michael Seltzer, CROWDMOS: An Approach for Crowdsourcing Mean Opinion Score Studies, in ICASSP, IEEE, May 2011
- Kori Inkpen, Sasa Junuzovic, Asta Roseway, John Tang, and Zhengyou Zhang, (demo) AIRMobile: Accelerated Instant Replay for In-Meeting Review by Mobile Users, in CSCW 2011, ACM Conference on Computer Supported Cooperative Work, March 2011
- Junsong Yuan, Zicheng Liu, and Ying Wu, Discriminative Video Pattern Search for Efficient Action Detection, in IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE (TPAMI), 2011
- Cha Zhang, Dinei Florencio, and Zhengyou Zhang, Improving Immersive Experiences in Telecommunication with Motion Parallax, in IEEE Signal Processing Magazine, Institute of Electrical and Electronics Engineers, Inc., January 2011
- Gang Yu, Norberto A. Goussies, Junsong Yuan, and Zicheng Liu, Fast Action Detection via Discriminative Random Forest Voting and Top-K Subvolume Search, in IEEE Transactions on Multimedia, 2011
- Myung-Suk Song, Cha Zhang, Dinei Florencio, and Hing-Goo Kang, Enhanced Binaural Loudspeaker Audio System with Room Modeling, in MMSP, IEEE, October 2010
- Flavio Ribeiro, Cha Zhang, Dinei Florencio, and Demba Ba, Using Reverberation to Improve Range and Elevation Discrimination for Small Array Sound Source Localization, in IEEE Transactions on Audio, Speech, and Language Processing, Institute of Electrical and Electronics Engineers, Inc., September 2010
- Flavio Ribeiro, Demba Ba, Cha Zhang, and Dinei Florencio, Turning Enemies into Friends: Using Reflections to Improve Sound Source Localization, in ICME, IEEE, July 2010
- Cha Zhang and Dinei Florencio, Joint Tracking and Multiview Video Compression, in VCIP, SPIE, July 2010
- Srenivas Varadarajan, Lina Karam, and Dinei Florencio, Background Subtraction Using Spatio-Temporal Continuities, in EUVIP, IEEE, July 2010
- Myung-Suk Song, Cha Zhang, Dinei Florencio, and Hong-Goo Kang, Personal 3D Audio System with Loudspeakers, in ICME (Hot3D), IEEE, July 2010
- Liangliang Cao, Zicheng Liu, and Thomas Huang, Cross-dataset Action Detection, IEEE International Conference on Computer Vision and Pattern Recognition (CVPR), 13 June 2010
- Cha Zhang and Zhengyou Zhang, A Survey of Recent Advances in Face Detection, no. MSR-TR-2010-66, June 2010
- Qin Cai, A. Sankaranarayanan, Q. Zhang, Zhengyou Zhang, and Zicheng Liu, Real Time Head Pose Tracking from Multiple Cameras with a Generic Model, in IEEE Workshop on Analysis and Modeling of Faces and Gestures in conjunction with CVPR 2010, IEEE, June 2010
- Zhengyou Zhang, Estimating Projective Transformation Matrix (Collineation, Homography), no. MSR-TR-2010-63, 31 May 2010
- Kori Inkpen, Rajesh Hegde, Mary Czerwinski, and Zhengyou Zhang, Exploring Spatialized Audio & Video for Distributed Conversations, in CSCW 2010, Association for Computing Machinery, Inc., February 2010
- Cormac Herley and Dinei Florencio, Phishing and Money Mules, in WIFS, Institute of Electrical and Electronics Engineers, Inc., 2010
- Kori Inkpen, Rajesh Hegde, Sasa Junuzovic, John Tang, and Zhengyou Zhang, (poster) AIR Conferencing: Accelerated Instant Replay for In-Meeting Multimodal Review, in MM 2010, ACM Multimedia, 2010
- Demba Ba, Flavio Ribeiro, Cha Zhang, and Dinei Florencio, L1 Regularized Room Modeling with Compact Microphone Arrays, in ICASPP, IEEE, 2010
