|
|
Media Communication Group
Devices such as PCs, Pocket PCs, Smartphones, and embedded devices are taking on a greater and greater role in everyday computing and the lives of individuals. At the same time, the growth in bandwidth of wired and wireless networks and local area and wide area networks is facilitating greater communication across platforms. At Microsoft Research Asia, the Media Communication Group is developing applications and services to advance communication, collaboration, and entertainment among various platforms and under different network conditions.
Researchers in the Media Communication Group leverage their expertise in audio/video compression, image processing, networking, user interface, and database management to develop practical applications and services for both consumers and information workers. The group’s research is used across the realm of communications applications, including video broadcast, instant messaging, video conferencing, multi-view video, and peer-to-peer video streaming. The group’s innovations have made an impact on a number of Microsoft products and technologies.
Primary Contact: Jiang Li
| 
Cai, Hua | 
Hu, Yusuo | 
Fan | 
Jiang | 
Qingwei | 
Lou, Jian-Guang | 
Luo, Chong | | Affiliate Members
| 
Tim | 
TANG, Jian | | |
My Internet TV Station
With the explosive growth of digital video, more and more users are switching from video consumers to creators. The “My Internet TV Station” (MyTV) project helps users establish their own Internet TV stations. By using peer-to-peer (P2P) video broadcasting technology, content owners can broadcast live scenes via cameras or stored video files to a large audience using a normal Internet connection. Through the Internet, anyone can deliver content, share life experiences, attract customers, or even do business with others. By bringing the power of TV broadcasting to normal Internet users, this service will create a new range of opportunities for both business and entertainment.
Lecture Broadcasting System
Lecture Broadcasting System (LBS) is a live video broadcasting system based on P2P streaming technology. It includes:
- Channel-based program scheduling and management by seamless cooperation with Outlook;
- Real-time video/audio capture and sharing;
- Stream transmission based on P2P technology;
- Real-time PowerPoint slides capture and uploading;
- Browser-based synchronous playback of video/audio and PowerPoint slides at the client side.
The software kit of Lecture Broadcasting System, including lecture program manage software, video/audio capturing and uploading software, PowerPoint slides capturing software, and IE plug-in for stream media playback at client side, can also serve as a general solution for constructing lecture live broadcasting or distance education systems.
Real-time Interactive Multi-view Video System
Pre-captured movies and broadcast television (TV), live digital television, video-on-demand (VoD), streaming video over the Internet, and improvements in video technology have had a significant impact over the years by consistently delivering better quality pictures, more programming, and greater flexibility. Despite progress in these fields, traditional video formats are limited to one video stream for an event at any time and one viewing direction, as selected by program editors. Today’s technology places the viewer in a passive mode, having to accept the angle or views presented by a program/event’s editor.
The rapid development of electronic and computing technologies has allowed the Media Communication Group to expand the realm of possibilities with the development of a Real-time Interactive Multi-view Video System and a completely new video format. This new system and video format allow for more proactive participation by viewers, giving them the freedom to choose a viewing angle and enjoy special visual effects. It also delivers a new viewing experience and a high degree of user interactivity. The system is widely applicable to movies, advertisements, educational videos, sports games, and event broadcasts.
Microsoft
Portrait
Microsoft Portrait is a research prototype for mobile video communication. It supports .NET Messenger Service, Session Initiation Protocol and Internet Locator Service on PCs, Pocket PCs, Handheld PCs, and Smartphones. It runs on local area networks, dialup networks, and even wireless networks with bandwidths as low as 9.6 kilobits/second. Microsoft Portrait delivers portrait-like video if users are in low bandwidths, and displays full-color video if users are in broadband. In low bandwidths, portrait video possesses clearer shapes, smoother motion, shorter latency, and much cheaper computational costs than conventional video technologies. Research in Microsoft Portrait is also pursuing presence notification, chat/voice/video functions at any time and anywhere on any device.
Instant Voice Communication
Instant voice communication is receiving renewed interests in the cellular world. Known as push-to-talk (PTT), this half-duplex service provides a lightweight complement to the telephone communication. PTT over cellular (PoC) was first introduced by US operator Nextel. Now, it is provided by almost all the dominant mobile operators, and there are undergoing standardization efforts in Open Mobile Alliance (OMA).
PTT can coexist with the full-duplex telephone service not only because of its cost effectiveness, but it offers increased privacy and requires reduced interactional commitment in comparison to telephone. Similar reasons have boosted the success of the instant messaging (IM) service. While the dominant IM services are based on text, we see the potential of voice IM for the communication between both end consumers and information workers.
In this project, we evaluate the possibility of providing PTT service over IP network, and seek to find a unified solution that could bridge the PTT service between cellular and IP networks. We have designed and implemented an asynchronous record-and-forward voice messaging tool as well as an on-line multiparty PTT system. The prototype of the former one was transferred MSN Messenger team. They released it as Voice Clip feature in version 7.5.
-
A Novel Model-Based Rate-Control Method for Portrait Video Coding
Keman Yu, Jiang Li, Cuizhu Shi, and Shipeng Li, A Novel Model-Based Rate-Control Method for Portrait Video Coding, IEEE Transactions on Circuits and Systems for Video Technology, Vol. 15, No. 12, December 2005, pages: 1545-1556.
-
A Real-Time Interactive Multi-View Video System
Jian-Guang Lou, Hua Cai, Jiang Li, A Real-Time
Interactive Multi-View Video System, The 13th ACM International
Conference on Multimedia (ACMMM 2005), Singapore, 6-11 November 2005,
pages: 161-170. -
Embedded Image Coding With Context Partitioning and Quantization
Hua Cai, Xing San, and Jiang Li, Embedded
Image Coding With Context Partitioning and Quantization, 2005 IEEE
International Conference on Image Processing (ICIP 2005), Genova, Italy,
11-14 September 2005. - Lossless Image Compression With Tree Coding of Magnitude Levels
Hua Cai and Jiang Li, Lossless Image Compression With Tree Coding of Magnitude
Levels, 2005 IEEE International Conference on Multimedia & Expo (ICME 2005),
Amsterdam, The Netherlands, 6-8 July 2005. - A Framework for Fine-granular Computational-complexity Scalable Motion Estimation
Zhi Yang, Hua Cai, and Jiang Li, A Framework for Fine-granular
Computational-complexity Scalable Motion Estimation, The 2005 IEEE International
Symposium on Circuits and Systems (ISCAS 2005), Kobe, Japan, 23-26 May, 2005. -
An Effective Variable Block-Size Early Termination Algorithm for H.264 Video Coding
Libo Yang, Keman Yu, Jiang Li, and
Shipeng Li. An Effective Variable Block-Size Early Termination
Algorithm for H.264 Video Coding, IEEE Transactions on Circuits and
Systems for Video Technology, Vol. 15, No. 6, June 2005. Pages: 784-788. -
A Low Complexity Motion Compensated Frame Interpolation Method
Jiefu Zhai, Keman Yu, Jiang Li, and Shipeng Li,
A Low Complexity Motion Compensated Frame Interpolation Method, The 2005
IEEE International Symposium on Circuits and Systems (ISCAS 2005), Kobe,
Japan, 23-26 May 2005. -
Prediction-based Directional Fractional Pixel Motion Estimation for H.264 Video Coding
Libo Yang, Keman Yu, Jiang Li, and
Shipeng Li, Prediction-based Directional Fractional Pixel Motion
Estimation for H.264 Video Coding, 2005 IEEE International Conference on
Acoustics, Speech, and Signal Processing (ICASSP 2005), Philadelphia, PA,
USA, 18-23 March, 2005. -
DigiMetro - An Application-Level Multicast System for Multi-party Video Conferencing
Chong Luo, Jiang Li, and Shipeng Li, DigiMetro
- An Application-Level Multicast System for Multi-party Video
Conferencing, 2004 IEEE Global Telecommunication Conference (Globecom 2004),
Dallas, Texas, 29 November - 3 December 2004. -
DigiParty- A Decentralized Multi-party Video Conferencing System
Ling Chen, Chong Luo, Jiang Li, and Shipeng Li, DigiParty -
A Decentralized Multi-party Video Conferencing System, 2004 IEEE
International Conference on Multimedia & Expo (ICME 2004), Taipei,
Taiwan, 27 - 30 June 2004.
|