Workshop Program

 

 

WEDNESDAY, OCTOBER 4, 2006

 

8:00                 Opening

Speaker:          Li Deng (Microsoft Research, USA)

Location:         Crystal Ballroom

 

8:15                 Keynote:        The Role of Signal Processing in the Multimedia Communications

            Revolution

Speaker:          Lawrence Rabiner (Rutgers Univ., USA)

Location:         Crystal Ballroom

 

9:15                 Overview:      Managing Spoken Documents

Speaker:          Mari Ostendorf  (Univ. of Washington, USA)

Location:         Crystal Ballroom

 

10:00               Coffee Break

 

10:30               Oral Session 1: Robust Sensing and Video Coding 

Session Chair: Fernando Pereira (Instituto Superior Tecnico, Portugal)

Location:         Crystal Ballroom

 

10:30               Distributed Sensing of Noisy Signals by Thresholding of Redundant Expansions 

Karin Schnass, Pierre Vandergheynst, Pascal Frossard (EPFL, Switzerland)

 

10:50               Distributed Coding of Random Dot Stereograms with Unsupervised Learning of Disparity 

David Varodayan, Aditya Mavlankar, Markus Flierl, Bernd Girod (Stanford Univ., USA)

 

11:10               Robust Decoding of H.264 Encoded Video Transmitted over Wireless Channels  

Galina Sabeva, Salma Ben Jamaa, Michel Kieffer, Pierre Duhamel (LSS-CNRS-Supelec-Univ. Paris-Sud, France)

 

 

11:30               Adaptive Multi-path Prediction for Error Resilient H.264 Coding 

Xiaosong Zhou, C.-C. Jay Kuo (Univ. of Southern California, USA)

 

12:00               Lunch Break

 

13:30               Overview:      Facial Image Synthesis Analysis and Recognition

Speaker:          Demetri Terzopoulos (Univ. of California at Los Angeles, USA)

Location:         Crystal Ballroom

 

14:15               Oral Session 2: Face, Person and Action Recognition 

Session Chair:  Gerasimos Potamianos (IBM Research, USA)

Location:          Crystal Ballroom

 

14:15               Classification-Specific Feature Sampling for Face Recognition 

Effrosyni Kokiopoulou, Pascal Frossard (EPFL, Switzerland)

 

14:35               Lipreading Using Profile Versus Frontal Views 

Patrick Lucey (Queensland Univ. of Technology, Australia), Gerasimos Potamianos (IBM Research, USA)

 

14:55               Person Recognition Based on Head and Mouth Dynamics

Usman Saeed, Federico Matta, Jean-Luc Dugelay (Eurecom Inst., France)

 

15:15               Compressed Domain Real-Time Action Recognition 

Chuohao Yeo, Parvez Ahammad, Kannan Ramchandran, S. Shankar Sastry (Univ. of California Berkeley, USA)

 

15:35               Coffee Break

 

16:00               Oral (Special) Session 3: Multimodal Sensing and Data Fusion for Interactive Media 

Organizers:      Gang Qian, Harvey Thornburg, Andreas Spanias (Arizona State Univ.,USA)

Location:          Crystal Ballroom

 

16:00               Learning Indirect Acquisition of Instrumental Gestures using Direct Sensors 

George Tzanetakis, Adam Kapur Adam Tindale (Univ. of Victoria, Canada)

 

16:20               Joint Segmentation and Temporal Structure Inference for Partially Observed Event Sequences 

Harvey Thornburg, Dilip Swaminathan, Todd Ingalls (Arizona State Univ.); Randal Leistikow (Stanford Univ., USA)

 

16:40               Exploring the Virtual Reed Parameter Space Using Haptic Feedback

Tamara Smyth, Thomas N. Smyth, Arthur E. Kirkpatrick (Simon Fraser Univ., Canada)

 

 

 

Poster Sessions 

 

9:15 – 17:30   Poster Session:  Speech and Audio Processing

Session Chair:    C.C. Jay-Kuo (Univ. of Southern California, USA)

Location:             Palm Court

 

A Qualified ITU-T G.729EV Codec Candidate for Hierarchical Speech and Audio Coding

Bernd Geiser, Peter Jax, Peter Vary (RWTH Aachen Univ., Germany); Herve Taddei,            Martin Gartner (Siemens AG, Germany); Stefan Schandl (Siemens AG, Austria)

 

An Efficient Codebook for the SCELP Low Delay Audio Codec

Hauke Kruger, Peter Vary (RWTH Aachen Univ., Germany)

 

An Enhancement Layer for ACELP Coder

Mickael De Meuleneire (RWTH Aachen Univ., Germany), Martin Gartner (Siemens AG, Germany), Stefan Schandl (Siemens AG, Austria), Herve Taddei  (Siemens AG, Germany)

 

On New Audio Codec Specifications

Imre Varga (Siemens AG, Germany)

 

Conversion of MP3 to AAC in the Compressed Domain 

Koichi Takagi, Satoshi Miyaji, Shigeyuki Sakazawa, Yasuhiro Takishima (KDDI R&D Labs. Inc., Japan)

 

Blind Separation of Speech with a Switched Sparsity and Temporal Criteria

Daniel Smith, Ian Burnett (Univ. of Wollongong, Australia) 

 

Bandwidth-Efficient Mixed Pseudo Analogue-Digital Speech and Audio Transmission

Carsten Hoelper, Peter Vary (RWTH Aachen Univ., Germany)

 

Bandwidth Extension of Audio Based on Partial Loudness Criteria

Visar Berisha, Andreas Spanias (Arizona State Univ., USA)

 

Time Scale Modification for 3G-Telephony Video

Michele Covell (Google, Inc., USA), Sumit Roy (Rhythm NewMedia, USA); Bo Shen, Frederic Huve (Hewlett-Packard, USA)

 

Optimizing Voice-over-IP Speech Quality Using Path Diversity       

Mohamed Ghanassi, Peter Kabal (McGill Univ., Canada)

 

 

 

9:15 - 17:30    Poster Session:  Smart Cameras, Vision, and Graphics 

Session Chair:    Panos Nasiopoulos (Univ. of British Columbia, Canada) 

Location:             Palm Court

 

Image-Adapted Voxelization in Multicamera Settings

Jordi Salvador, Josep R. Casas (Technical Univ. of Catalonia, Spain)

 

An Efficient Compression Scheme for Colour Filter Array Video Sequences

Colin Doutre, Panos Nasiopoulos (Univ. of British Columbia, Canada)

High Performance Low Cost Video Analysis Core for Smart Camera Chips in Distributed Surveillance Network

Weo-Kai Chan, Shao-Yi Chien (National Taiwan Univ., Taiwan)

 

TwinFaces Seamless Textures for Rendering Head Models

Silvina Ferradal, Juan Carlos Gomez (Univ. Nacional de Rosario, Argentina)

 

Segmentation of Epipolar-Plane Image Volumes with Occlusion and Disocclusion Competition

Jesse Berent, Pier Luigi Dragotti (Imperial College, United Kingdom)

 

Region-Based Stereo Panorama Disparity Adjusting

Chiao Wang, A. A. Sawchuk (Univ. of Southern California, USA)

 

3D Shape Reconstruction of Moving Object By Tracking the Sparse Singular Points

Hossein Ebrahimnezhad, Hassan Ghassemian (Tarbiat Modares Univ., Iran)

 

Shadow Removal via Flash/Nonflash Illumination

Cheng Lu, Mark S. Drew (Simon Fraser Univ., Canada); Graham D. Finlayson (Univ. of East Anglia, United Kingdom)

 

 

9:15 - 17:30    Poster Session:  Feature Extraction, Fusion and Classification

Session Chair:     Sleiman Azar (Univ. of Liege, Belgium)

Location:             Palm Court

 

Contourlet Domain Feature Extraction for Image Content Authentication

Ali Bouzidi, Nadia Baaziz (Univ. du Quebec en Outaouais, Canada)

 

Audio-Visual Feature Extraction for Semi-Automatic Annotation of Meetings

Marian Kepesi, Michael Neffe, Tuan Van Pham, Michael Grabner, Helmut Grabner, Andreas Juffinger (Graz Univ. of Technology, Austria)

 

Adaptive Feature Selection for Speech Music Classification

A.R. Abu-El-Quran, R. A. Goubran, A. D. C. Chan (Carleton Univ., Canada)

 

A Fusion Method of Geometric and Topological Features for Boundary-based Shape Matching and Retrieval

Minh-Son Dao, Raffaele De Amicis (GraphiTech, Italy)

 

Music Genre Classification Using Text Categorization Method 

Kai Chen, Sheng Gao, Yongwei Zhu, Qibin Sun (Inst. for Infocomm Research, Singapore)

             

A Music Summarization Scheme using Tempo Tracking and Two Stage Clustering

Sangho Kim, Sungtak Kim, Suk-bong Kwon, Hoirin Kim (Information and Communications Univ., Korea)

 

Fault-Tolerant Music Search by New Ranking Order Algorithm

Wolfgang Theimer, Andree Ross (Nokia Research Center, Germany)

 

 

9:15 - 17:30    Poster Session:  Person and Activity Recognition

Session Chair:     Sleiman Azar (Univ. of Liege, Belgium) 

Location:             Palm Court

 

Hierarchical Models for Activity Recognition

Amarnag Subramanya, Alvin Raj, Jeff Bilmes, Dieter Fox (Univ. of Washington, USA)

 

Using Recognition of Emotions in Speech to Better Understand Brand Slogans

Yun-Maw Cheng, Yue-Sun Kuo (Academia Sinica, Taiwan), Jun-Heng Yeh, Yu-Te Chen, Tsang-Long Pao (Tatung Univ., Taiwan); Charles S. Chien (Feng Chia Univ., Taiwan)

 

Real Time Audio-Visual Person Tracking

Fotios Talantzis, Aristodemos Pnevmatikakis, Lazaros C. Polymenakos (Athens Information Technology, Greece)

 

 

THURSDAY, OCTOBER 5, 2006

 

8:15                 Keynote:        Getting Internet Video Ready for Prime Time

Speaker:          B. Girod (Stanford Univ., USA)

Location:         Crystal Ballroom

 

9:15                 Overview:      Advances in Scalable Video Compression

Speaker:          Jens-Rainer Ohm (RWTH Aachen Univ., Germany)

Location:         Crystal Ballroom

 

10:00               Coffee Break

 

10:30               Oral Session 4: Scalable and Interactive Coding and Transmission of

 Animation/Graphics 

Session Chair: Markus Flierl (Stanford Univ., USA)

Location:          Crystal Ballroom

 

10:30               Gradient Intra Prediction for Coding of Computer Animated Videos

Xiang Li, Norbert Oertel (Siemens Corporate Technology, Germany), Andre Kaup (Univ. Erlangen-Nuremberg, Germany)

 

10:50               4-D Scalable Multi-View Video Coding Using Disparity Compensated View Filtering and Motion Compensated Temporal Filtering

Jens-Uwe Garbas, Ulrich Fecker, Tobias Troeger, Andre Kaup (Univ. Erlangen-Nuremberg, Germany)

 

 

11:10               Server policies for interactive transmission of 3-D scenes

Pietro Zanuttigh, Nicola Brusco (Univ. of Padova, Italy); David Taubman (Univ. of New South Wales, Australia), Guido Maria Cortelazzo (Univ. of Padova, Italy)

 

11:30               On-demand Segmentation and Proxy Buffer Provisioning for Scalable and Interactive Video Streaming Scheme

Md.H. Kabir, Gholamali C. Shoja, Eric G. Manning (Univ. of Victoria, Canada)

 

 

12:00               Lunch Break

 

13:30               Overview:      Smart Surveillance: Advanced Video Analytics and Middleware for

Security and Retail Applications

Speaker:          Andrew Senior (IBM Research, USA)

Location:         Crystal Ballroom

 

14:15               Oral Session 5: Quantization and Learning           

                        Session Chair:  Christine Guillemot (INRIA/IRISA, France)

Location:          Crystal Ballroom

 

14:15               Adaptive Quantization for Matching Pursuit

Alireza Shoa, Shahram Shirani (McMaster Univ., Canada)

 

14:35               Compressing the Laplacian Pyramid

Gagan Rath, Christine Guillemot (IRISA-INRIA, France)

 

14:55               A Novel Learning Method for Hidden Markov Models in Speech and Audio Processing

Xiaodong He, Li Deng (Microsoft Research, USA); Wu Chou (Avaya Labs Research, USA)

 

15:15               Boosting-Based Multimodal Speaker Detection for Distributed Meetings

Cha Zhang (Microsoft Research, USA), Pei Yin (Georgia Inst. of Technology, USA), Yong Rui (Microsoft Research, USA), Ross Cutler (Microsoft Corporation, USA), Paul Viola (Microsoft Research, USA)

 

15:35               Coffee Break

 

16:00               Panel 1:          Audio-Visual Bimodal Fusion

Moderator:      Helen Meng (The Chinese Univ. of Hong Kong, Hong Kong, China) and Gerasimos Potamianos (IBM Research, USA)

Location:         Crystal Ballroom

 

 

 

 

 

Poster Sessions 

 

9:15 – 17:30   Poster Session: Robust Video Coding, Security and Visual Quality

Session Chair: Anthony Vetro (Mitsubishi Research Labs, USA)  

Location:         Palm Court

 

Joint Data Partition and Rate-Distortion Optimized Mode Selection for H.264 Error-Resilient Coding

Yuan Zhang (Communication Univ. of China, China), Wen Gao (Chinese Academy of Sciences, China), Debin Zhao (Inst. of Technology Harbin, China)

 

A Novel Fast Error-Resilient Video Coding Scheme for H.264

Jiajun Bu, Linjian Mo, Genfu Shao, Zhi Yang, Chun Chen (Zhejiang Univ., China)

 

Hybrid Distributed Video Coding Using SCA Codes

Emin Martinian, Anthony Vetro, Jonathan S. Yedidia (Mitsubishi Electric Research Labs, USA); Joao Ascenso (Instituto Superior de Engenharia de Lisboa, Portugal); Ashish Khisti, Dmitry Malioutov (Massachusetts Inst. of Technology, USA)

 

Low-Complexity Wyner-Ziv Video Coding Based on Robust Media Hashing

Li-Wei Kang, Chun-Shien Lu (Academia Sinica, Taiwan)

 

Asymptotic Error-Correcting Performance of Joint Source-Channel Schemes based on Arithmetic Coding

Salma Ben-Jamaa (LSS-CNRS-Supelec-Univ. Paris-Sud, France), Claudio Weidmann (Telecommunications Research Center Vienna, Austria), Michel Kieffer (LSS-CNRS-Supelec –Univ. Paris-Sud, France)

 

A Bit Allocation Method for Smoothing Temporal Picture Quality of Wavelet Video Coders

Chin-Wen Luo, Jiann-Jone Chen (National Taiwan Univ. of Science and Technology, Taiwan)

 

A 5-band Temporal Lifting Scheme for Video Surveillance

Maria Trocan, Christophe Tillier, Beatrice Pesquet-Popescu (GET-ENST, France), Mihaela van der Schaar (Univ. of California Los Angeles, USA)

 

Multiple Description Coding for MJPEG2000 over Congested 802.11e Wireless LANs

Enrico Baccaglini (Politecnico di Torino, Italy), Xin Ji, Gregory Lenoir, Antoine Dejonghe (InterUniv. Microelectronics Center Leuven, Belgium)

 

Robust Index Assignment for MDSQ Encoder Over Noisy Channels

Rui Ma, Fabrice Labeau (McGill Univ., Canada)

 

Spatio-Temporal Concealment in H.264/AVC Video Coding by 3-D Selective Extrapolation

Katrin Meisinger, Sandra Martin, Andre Kaup (Univ. of Erlangen-Nuremberg, Germany)

 

 

Spatio-Bi-Temporal Error Concealment in Block-Based Video Decoding Systems

Markus Friebe, Andre Kaup (Univ. of Erlangen-Nuremberg, Germany)

 

Modelling H.264/AVC sensivitity for error protection in wireless transmissions

Cyril Bergeron, Catherine Lamy-Bergot (THALES Land and Joint Systems, France)

 

Efficient Error Control for Wireless Video Multicast

Ivan Bajic (Simon Fraser Univ., Canada)

 

JPEG Steganalysis Using Empirical Transition Matrix in Block DCT Domain

Dongdong Fu, Yun W. Shi, Dekun Zou (New Jersey Inst. of Technology, USA), Guorong Xuan (Tongji Univ., China)

 

Progressive Randomization for Steganalysis

Anderson Rocha, Siome Gondenstein (Universidade Estadual de Campinas, Brasil)

 

 

Objective Human Visual System Based Video Quality Assessment Metric for Low Bit-Rate Video Communication Systems

David Chih-Che Lin, Paul M. Chau (Univ. of California San Diego, USA)

 

Video Quality Assessment from the Perspective of a Network Service Provider

M. Gumagalli, Rosa Lancini (CEFRIEL-Politecnico di Milano, Italy), Stefano Tubaro (Politecnico di Milano, Italy)

 

 

9:15 – 17:30   Poster Session: Image Enhancement, Segmentation and Interpolation 

Session Chair:   Ivan Bajic (Simon Fraser Univ., Canada)

Location:           Palm Court

 

Local Contrast Enhancement Using 2-Dimensional Recursive Filters

Tarik Arici, Salih Dikbas, Yucel Altunbasak (Georgia Inst. of Technology, USA)

 

An Edge-Preserving Super-Precision for Simultaneous Enhancement of Spatial and Grayscale Resolutions

Hiroshi Hasegawa (Nagoya Univ., Japan), Toshinori Ohtsuka, Isao Yamada, Kohichi Sakaniwa (Tokyo Inst. of Technology, Japan)

 

Fast Image/Video Contrast Enhancement Based on WTHE

Qing Wang, Rabab Ward (Univ. of British Columbia, Canada)

 

An Efficient Bottom-Up Image Segmentation Method Based on Region Growing, Region Competition and the Mumford Shah Functional

Yongsheng Pan, J. Douglas Birdwell, Seddik M. Djouadi (Univ. of Tennessee Knoxville, USA)

 

Efficient Implementation of the Chan-Vese Models Without Solving PDEs

Yongsheng Pan, J. Douglas Birdwell, Seddik M. Djouadi (Univ. of Tennessee Knoxville, USA)

 

An Edge-based Image Interpolation Approach Using Symmetric Biorthogonal Wavelet Transform

Weinzhong Su, Rabab K. Ward (Univ. of British Columbia, Canada)

 

Wavelet Image Interpolation Using Approximate Modeling of Exponential Decay

Sang Soo Kim, Il Kyu Eom, Yoo Shin Kim (Pusan National Univ., South Korea)

 

 

9:15 – 17:30   Poster Session: Software and Hardware Implementations

Session Chair:    Ivan Bajic (Simon Fraser Univ., Canada