|
|
Speech Technology Home
Publications
Here is a list of articles about us
- T. Bishop.
Show and tell at Microsoft's annual research fest
(Seattle PI, 2004).
- D. Barker.
Microsoft Research
Spawns a New Era in Speech Technology
(PC AI Magazine, 2003).
- M. Kanellos.
Talking Computers Nearing Reality
(CNET News.com, 2003).
- M. Brooks.
No one understands me as well as my PC
(New Scientist, 2003).
Here is a list of books
- L. Deng.
DYNAMIC SPEECH MODELS --- Theory, Algorithm, and Application,
Morgan & Claypool Publishers, LaPorte CO, USA, 2006.
- L. Deng and D. O'Shaugnessy.
Speech Processing - A Dynamic and Optimization-Oriented Approach,
Marcel Dekker, 2003.
- X. Huang, A. Acero and H. Hon.
Spoken Language Processing. Prentice Hall, 2001.
- Z. Zhang and S. Ma.
Computer Vision: Fundamentals of Computational Theory and Algorithms (in Chinese).
Chinese Academy of Sciences, 1998 (1st edition), 2003 (2nd edition).
- G. Xu and Z. Zhang.
Epipolar Geometry in Stereo, Motion and Object Recognition: A Unified Approach.
Kluwer Academic Publishers, 1996.
- A. Acero.
Acoustical and Environmental Robustness in Automatic Speech Recognition.
Kluwer Academic Publishers, 1993.
- Z. Zhang and O. Faugeras.
3D Dynamic Scene Analysis: A Stereo Based Approach.
Springer-Verlag, (out-of-print) 1992.
Here is a list of book chapters
- L. Deng and H. Sheikhzadeh.
"Use of an Integrated Neural-Network and Cochlear Model for the Study of Speech Encoding in the Auditory System",
in S. Greenberg and W. Ainsworth (eds.),
Listening to Speech: An Auditory Perspective,
Lawrence Erlbaum Associates, 2006.
- B. Raj, M. Seltzer and M. Reyes-Gomez.
"Speech Recognizer-based Maximum Likelihood Beamforming",
in P. Divenyi, ed.,
Speech Separation by Humans and Machines,
Kluwer Academic Publishers, 2004.
- Z. Zhang.
"Camera Calibration",
in G. Medioni and S.B. Kang, eds.,
Emerging Topics in Computer Vision,
Prentice Hall PTR, 2004.
- Z. Liu and B. Guo.
"Face Synthesis,"
in Li, Z.; Jain, K. (eds.)
Handbook of Face Recognition
Springer, New York, 2004.
- C. Avendano, L. Deng, H. Hermansky, and B. Gold.
"The Analysis and Representation of Speech,"
in Greenberg, S.; Ainsworth, W.A.; Popper, A.N.; Fay, R.R. (eds.)
Speech Processing in the Auditory System
Springer, New York, 2004, pp. 63-100.
- L. Deng.
"Switching Dynamic System Models for Speech Articulation and Acoustics,"
in M. Johnson, M. Ostendorf, S. Khudanpur, and R. Rosenfeld (eds.)
Mathematical Foundations of Speech and Language Processing
Springer, York, March 2004, pp.115-134.
- B. Zhang, Z. Liu, and B. Guo.
"Photo-Realistic Conversation Agent."
in D. Zhang, M. Kamel, and G. Baciu, eds.
Image and Graphics Technologies,
Kluwer Academic Publishers, 2003. Norwell, MA.
- L. Deng.
"Articulatory Features and Associated Production Models in Statistical Speech Recognition," in K. Ponting
ed.
Computational Models of Speech Pattern Processing
(NATO ASI Series), Springer, 1999, pp. 214-224.
- L. Deng.
"Computational Models for Auditory Speech Processing," in K. Ponting (ed.)
Computational Models of Speech Pattern Processing (NATO ASI Series),
Springer, 1999, pp. 67-77.
- L. Deng.
"Computational Models for Speech Production," in K. Ponting (ed.)
Computational Models of Speech Pattern Processing
(NATO ASI Series), Springer, 1999, pp. 199-213.
- X. Huang, A. Acero, F. Alleva, M. Hwang, L. Jiang and M. Mahajan.
"From Sphinx-II to Whisper: Making Speech Recognition Usable" in C. Lee,
F. Soong and K. Paliwal eds.
Automatic Speech and Speaker Recognition, Advanced Topics,
Kluwer Academic Publishers, Norwell, MA, 1996.
- R. Stern, A. Acero, F. Liu and Y. Ohshima.
"Signal Processing for Robust Speech Recognition."
in C. Lee, F. Soong and K. Paliwal eds.
Automatic Speech and Speaker Recognition, Advanced Topics,
Kluwer Academic Publishers, Norwell, MA, 1996.
- A. Acero.
"The Role of Phoneticians in Speech Technology" in G. Bloothooft, V. Hazan, D. Huber and J.Llisterri,
eds. European Studies in Phonetics and Speech Communication,
OTS Publications. August 1995.
Here is a list of publications on noise robust speech recognition
- I. Tashev, J. Droppo, and A. Acero.
Suppression Rule for Speech Recognition Friendly Noise Suppressors,
in Proc. of the 2006 Int. Conference Digital Signal Processing and Applications (DSPA). Moscow, Russia, Mar. 2006.
- M. Seltzer and A. Acero.
An EM Algorithm for Training Wideband Acoustic Models from Mixed-Bandwidth Training Data,
in Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding. Puerto Rico, Dec, 2005.
- L. Deng, J. Wu, J. Droppo, and A. Acero.
Analysis and Comparison of Two Speech Feature Extraction/Compensation Algorithms,
in IEEE Signal Processing Letters. Volume: 12 Issue: 6, Jun 2005. pp. 477-480.
- L. Deng, J. Droppo, and A. Acero.
Dynamic Compensation of HMM Variances Using the Feature Enhancement Uncertainty Computed From a Parametric Model of Speech Distortion,
in IEEE Trans. on Speech and Audio Processing. Volume: 13 Issue: 3, May 2005. pp. 412-421.
- M. Seltzer and A. Acero.
Training Wideband Acoustic Models using Mixed-Bandwidth Training Data via Feature Bandwidth Extension
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Philadelphia, Mar, 2005.
- M. Seltzer, B. Raj, and R. Stern.
Likelihood-Maximizing Beamforming for Robust Hands-Free Speech Recognition,
in IEEE Trans. on Speech and Audio Processing. Volume: 12 Issue: 5, Sep 2004. pp. 489-498.
- B. Raj, M. Seltzer, and R. Stern.
Reconstruction of Missing Features for Robust Speech Recognition,
in Speech Communication, Elsevier. Volume: 43 Issue: 4, Sep 2004. pp. 275-296.
- M. Seltzer, B. Raj, and R. Stern.
A Bayesian Classifier for Spectrographic Mask Estimation for Missing Feature Speech Recognition,
in Speech Communication, Elsevier. Volume: 43 Issue: 4, Sep 2004. pp. 379-393.
- Z. Zhang, Z. Liu, M. Sinclair, A. Acero, L. Deng, J. Droppo, X. Huang, and Yanli Zheng.
Multi-Sensory Microphones for Robust Speech Detection, Enhancement and Recognition
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Montreal, May, 2004.
- J. Droppo and A. Acero.
Noise Robust Speech Recognition with a Switching Linear Dynamic Model
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Montreal, May, 2004.
- M. Seltzer and R. Stern.
Parameter Sharing in Subband Likelihood-Maximizing Beamforming for Speech Recognition Using Microphone Arrays
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Montreal, May, 2004.
- L. Deng, J. Droppo, and A. Acero.
Estimating Cepstrum of Speech Under the Presence of Noise Using a Joint Prior of Static and Dynamic Features,
in IEEE Trans. on Speech and Audio Processing. Volume: 12 Issue: 3 , May 2004. pp. 218-233.
- L. Deng, J. Droppo, and A. Acero.
Enhancement of log Mel Power Spectra of Speech using a Phase-Sensitive Model of the Acoustic Environment and Sequential
Estimation of the Corrupting Noise,
in IEEE Trans. on Speech and Audio Processing. Volume: 12 Issue: 2 , Mar 2004. pp. 133-143.
- J. Wu, J. Droppo, L. Deng and A. Acero.
A Noise-Robust ASR Front-End Using Wiener Filter Constructed from MMSE Estimation of Clean Speech and Noise,
in Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding. Virgin Islands, Dec, 2003.
- L. Deng, J. Droppo, and A. Acero.
Recursive Estimation of Nonstationary Noise using Iterative Stochastic Approximation for Robust Speech Recognition,
in IEEE Trans. on Speech and Audio Processing. Volume: 11 Issue: 6 , Nov 2003. pp. 568-580.
- M. Seltzer, J. Droppo, and A. Acero.
A Harmonic-Model-Based Front End for Robust Speech Recognition,
in Proc. of the Eurospeech Conference. Geneva, Switzerland, Sep, 2003.
- J. Droppo, L. Deng and A. Acero.
A Comparison of Three Non-Linear Observation Models for Noisy Speech Features,
in Proc. of the Eurospeech Conference. Geneva, Switzerland, Sep, 2003.
- J. Xin, Y. Qi, and L. Deng.
Time Domain Computation of a Nonlinear Nonlocal Cochlear Model with Applications to Multitone Interactions in Hearing,
in Communications in Mathematical Sciences. Volume: 1 Issue: 2, pp. 211-227.
- L. Deng, J. Droppo and A. Acero.
Incremental Bayes Learning with Prior Evolution for Tracking Non-Stationary Noise Statistics from Noisy Speech Data,
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Hong Kong, April 2003.
- J. Droppo, L. Deng, and A. Acero.
Evaluation of SPLICE on the Aurora 2 and 3 Tasks,
in Proc. Int. Conf. on Spoken Language Processing. Denver, Colorado, Sep, 2002.
- J. Droppo, A. Acero, and L. Deng.
A Nonlinear Observation Model for Removing Noise from Corrupted Speech Log Mel-Spectral Energies,
in Proc. Int. Conf. on Spoken Language Processing. Denver, Colorado, Sep, 2002.
- L. Deng, J. Droppo, and A. Acero.
Exploiting Variances in Robust Feature Extraction Based on a Parametric Model of Speech Distortion,
in Proc. Int. Conf. on Spoken Language Processing. Denver, Colorado, Sep, 2002.
- L. Deng, J. Droppo, and A. Acero.
Log-Domain Speech Feature Enhancement Using Sequential MAP Noise Estimation and a Phase-sensitive Model of the Acoustic Environment,
in Proc. Int. Conf. on Spoken Language Processing. Denver, Colorado, Sep, 2002.
- Y. Xiang, Y. Hua, S. An, A. Acero.
Separating Colored Signals Distorted by Convolutive Channels Using Diagonal Constrained Decorrelation,
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Orlando, Florida, May, 2002.
- J. Droppo, L. Deng and A. Acero.
Uncertainty Decoding with SPLICE for Noise Robust Speech Recognition,
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Orlando, Florida, May, 2002.
- L. Deng, J. Droppo and A. Acero.
A Bayesian Approach to Speech Feature Enhancement using the Dynamic Cepstral Prior,
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Orlando, Florida, May, 2002.
- L. Deng, J. Droppo and A. Acero.
Recursive Noise Estimation Using Iterative Stochastic Approximation for Stereo-based Robust Speech Recognition,
in Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding. Madonna di Campiglio, Italy, Dec, 2001.
- T. Kristjansson, B. Frey and L. Deng.
Joint Estimation of Noise and Channel Distortion in a Generalized EM Framework,
in Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding. Madonna di Campiglio, Italy, Dec, 2001.
- J. Droppo, A. Acero and L. Deng.
Evaluation of the SPLICE Algorithm on the Aurora2 Database,
in Proc. of the Eurospeech Conference. Aalborg, Denmark, Sep, 2001.
- H. Attias, L. Deng, A. Acero and J. Platt.
A New Method for Speech Denoising and Robust Speech Recognition Using Probabilistic Models for Clean Speech and for Noise,
in Proc. of the Eurospeech Conference. Aalborg, Denmark, Sep, 2001.
- B. Frey, L. Deng, A. Acero and T. Kristjansson.
ALGONQUIN: Iterating Laplace's Method to Remove Multiple Types of Acoustic Distortion for Robust Speech Recognition,
in Proc. of the Eurospeech Conference. Aalborg, Denmark, Sep, 2001.
- L. Deng, A. Acero, L. Jiang, J. Droppo and X. Huang
High-Performance Robust Speech Recognition Using Stereo Training Data,
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Salt Lake City, Utah, May, 2001.
- J. Droppo, A. Acero and L. Deng.
Efficient Online Acoustic Environment Estimation for FCDCN in a Continuous Speech Recognition System,
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Salt Lake City, Utah, May, 2001.
- T. Kristjansson, B. Frey, L. Deng and A. Acero.
Towards Non-Stationary Model-Based Noise Adaptation for Large Vocabulary Speech Recognition
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Salt Lake City, Utah, May, 2001.
- Y. Xiang, Y. Hua, S. An and A. Acero.
Experimental Investigation of Delayed Instantaneous Demixer for Speech Enhancement
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Salt Lake City, Utah, May, 2001.
- H. Attias, J. Platt, A. Acero and L. Deng.
Speech Denoising and Dereverberation Using Probabilistic Models,
in NIPS, Denver, Nov. 2000.
- A. Acero, L. Deng, T. Kristjansson and J. Zhang.
HMM Adaptation Using Vector Taylor Series for Noisy Speech Recognition,
in Proc. Int. Conf. on Spoken Language Processing. Beijing, China, Oct, 2000.
- A. Acero, S. Altschuler and L. Wu.
Speech/Noise Separation Using Two Microphones and a VQ Model of Speech Signals,
in Proc. Int. Conf. on Spoken Language Processing. Beijing, China, Oct, 2000.
- L. Deng, A. Acero, M. Plumpe and X. Huang.
Large-Vocabulary Speech Recognition under Adverse Acoustic Environments,
in Proc. Int. Conf. on Spoken Language Processing. Beijing, China, Oct, 2000.
- X. Shen and L. Deng.
A Dynamic System Approach to Speech Enhancement Using the H-inf Filtering Algorithm,
in IEEE Trans. on Speech and Audio Processing. Volume: 7 Issue: 4 , July 1999, pp. 391-399.
- H. Sameti, H. Sheikhzadeh, L. Deng and R. Brennan.
HMM-based Strategies for Enhancement of Speech Signals Embedded in Nonstationary Noise,
in IEEE Trans. on Speech and Audio Processing. Volume: 6 Issue: 5 , Jan. 1998, pp. 445-455.
- A. Acero and X. Huang.
Augmented Cepstral Normalization for Robust Speech Recognition,
in Proc. of the IEEE Workshop on Automatic Speech Recognition. Snowbird, UT. Dec 1995.
and publications on acoustic modeling for speech recognition:
- X. He, L. Deng, and W. Chou.
A Novel Learning Method for Hidden Markov Models in Speech and Audio Processing,
in Proc. IEEE Workshop on Multimedia Signal Processing, Victoria, BC, Oct. 2006.
- D. Yu, L. Deng, and A. Acero.
A Lattice Search Technique for Long-contextual-span Hidden Trajectory Model of Speech,
in Speech Communication, Elsevier. Volume: 48 Issue: 9, Sep 2006. pp. 1214-1226.
- X. Li, L. Deng, D. Yu and A. Acero.
A Time-Synchronous Phonetic Decoder for a Long-Contextual-Span Hidden Trajectory Model,
in Proc. of the Interspeech Conference. Pittsburgh, Sep, 2006.
- D. Yu, L. Deng, X. He and A. Acero.
Use of Incrementally Regulated Discriminative Margins in MCE Training for Speech Recognition,
in Proc. of the Interspeech Conference. Pittsburgh, Sep, 2006.
- L. Deng, D. Yu, and A. Acero.
Structured Speech Modeling,
in IEEE Trans. on Audio, Speech and Language Processing. Volume: 14 Issue: 5, Sep 2006. pp. 1492- 1504.
- M. Mahajan, A. Gunawardana and A. Acero.
Training Algorithms for Hidden Conditional Random Fields
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Toulouse, May, 2006.
- J. Droppo and A. Acero.
Joint Discriminative Front End and Back End Training for Improved Speech Recognition Accuracy
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Toulouse, May, 2006.
- L. Deng, A. Acero, and I. Bazzi.
Tracking Vocal Tract Resonances Using a Quantized Nonlinear Function Embedded in a Temporal Constraint,
in IEEE Trans. on Audio, Speech and Language Processing. Volume: 14 Issue: 2, Mar 2006. pp. 425-434.
- L. Deng, D. Yu, and A. Acero.
A Bidirectional Target Filtering Model of Speech Coarticulation: two-stage Implementation for Phonetic Recognition,
in IEEE Trans. on Audio, Speech and Language Processing. Volume: 14 Issue: 1, Jan 2006. pp. 256-265.
- L. Deng, D. Yu, X. Li, and A. Acero.
A Long-Contextual-Span Model of Resonance Dynamics for Speech Recognition: Parameter Learning and Recognizer Evaluation,
in Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding. Puerto Rico, Dec, 2005.
- J. Droppo, M. Mahajan, A. Gunawardana, and A. Acero.
How to Train a Discriminative Front End with Stochastic Gradient Descent and Maximum Mutual Information,
in Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding. Puerto Rico, Dec, 2005.
- J. Droppo and A. Acero.
Maximum Mutual Information SPLICE Transform for Seen and Unseen Conditions,
in Proc. of the Interspeech Conference. Lisbon, Portugal, Sep, 2005.
- A. Gunawardana, M. Mahajan, A. Acero, and J. Platt.
Hidden Conditional Random Fields for Phone Classification,
in Proc. of the Interspeech Conference. Lisbon, Portugal, Sep, 2005.
- L. Deng, D. Yu, and A. Acero.
Learning Statistically Characterized Resonance Targets in a Hidden Trajectory Model of Speech Coarticulation and Reduction,
in Proc. of the Interspeech Conference. Lisbon, Portugal, Sep, 2005.
- D. Yu, L. Deng, and A. Acero.
Evaluation of a Long-Contextual-Span Hidden Trajectory Model and Phonetic Recognizer Using A* Lattice Search,
in Proc. of the Interspeech Conference. Lisbon, Portugal, Sep, 2005.
- L. Deng, X. Li, D. Yu, and A. Acero.
A Hidden Trajectory Model with Bi-directional Target-Filtering: Cascaded vs. Integrated Implementation for Phonetic Recognition
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Philadelphia, Mar, 2005.
- L. Deng, X. Li, D. Yu, and A. Acero.
Novel Acoustic Modeling with Structured Hidden Dynamics for Speech Coarticulation and Reduction,
in Proc. of the DARPA RT04 Workshop. Palisades, New York, Nov 2004.
- L. Deng, D. Yu, and A. Acero.
A Quantitative Model for Formant Dynamics and Contextually Assimilated Reduction in Fluent Speech,
in Proc. Int. Conf. on Spoken Language Processing. Jeju, South Korea, Oct, 2004.
- R. Togneri and L. Deng.
Use of Neural Network Mapping and Extended Kalman Filter to Recover Vocal Tract Resonances from the MFCC Parameters of Speech,
in Proc. Int. Conf. on Spoken Language Processing. Jeju, South Korea, Oct, 2004.
- L. Lee, H. Attias, L. Deng, and P. Fieguth.
A Multimodal Variational Approach to Learning and Inference in Switching State Space Models
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Montreal, May, 2004.
- L. Deng, L. Lee, H. Attias, and A. Acero.
A Structured Speech Model with Continuous Hidden Dynamics and Prediction-Residual Training for Tracking Vocal Tract Resonances
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Montreal, May, 2004.
- L. Deng and X. Huang.
Challenges in Adopting Speech Recognition,
in Communications of the ACM, Vol. 47, No. 1, Jan 2004, pp. 69-75.
- J. Ma and L. Deng.
Target-Directed Mixture Dynamic Models for Spontaneous Speech Recognition,
in IEEE Trans. on Speech and Audio Processing. Volume: 12 Issue: 1 , Jan 2004.
- J. Ma and L. Deng.
A Mixed-Level Switching Dynamic System for Continuous Speech Recognition,
in Computer, Speech and Language. Vol. 18, 2004, pp. 49-65.
- R. Togneri and L. Deng.
Joint State and Parameter Estimation for a Target-Directed Nonlinear Dynamic System Model,
in IEEE Trans. on Signal Processing. Vol. 51, No. 12, December 2003, pp. 3061-3070.
- J. Ma and L. Deng.
Efficient Decoding Strategies for Conversational Speech Recognition Using a Constrained Nonlinear State-Space Model,
in IEEE Trans. on Speech and Audio Processing. Volume: 11 Issue: 6 , Nov 2003.
- A. Gunawardana and A. Acero.
Adapting Acoustic Models to New Domains and Conditions Using Untranscribed Data,
in Proc. of the Eurospeech Conference. Geneva, Switzerland, Sep, 2003.
- L. Deng, I. Bazzi and A. Acero.
Tracking Vocal Tract Resonances Using an Analytical Nonlinear Predictor and a Target-guided Temporal Constraint,
in Proc. of the Eurospeech Conference. Geneva, Switzerland, Sep, 2003.
- Y. Deng, M. Mahajan and A. Acero.
Estimating Speech Recognition Error Rate without Acoustic Test Data,
in Proc. of the Eurospeech Conference. Geneva, Switzerland, Sep, 2003.
- D. Yu, K. Wang, M. Mahajan, P. Mau and A. Acero.
Improved Name Recognition With User Modeling,
in Proc. of the Eurospeech Conference. Geneva, Switzerland, Sep, 2003.
- L. Lee, H. Attias and L. Deng.
Variational Inference and Learning for Segmental Switching State Space Models of Hidden Speech Dynamics,
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Hong Kong, Apr, 2003.
- F. Seide, J. Zhou and L. Deng.
Coarticulation Modeling by Embedding a Target-Directed Hidden Trajectory Model into HMM - MAP Decoding and Evaluation,
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Hong Kong, Apr, 2003.
- J. Zhou, F. Seide and L. Deng.
Coarticulation Modeling by Embedding a Target-Directed Hidden Trajectory Model into HMM - Model and Training,
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Hong Kong, Apr, 2003.
- I. Bazzi and J. Glass.
A Multi-Class Approach for Modelling Out-of-Vocabulary Words,
in Proc. Int. Conf. on Spoken Language Processing. Denver, Colorado, Sep, 2002.
- C. Chelba and R. Morton.
Mutual Information Phone Clustering for Decision Tree Induction,
in Proc. Int. Conf. on Spoken Language Processing. Denver, Colorado, Sep, 2002.
- A. Gunawardana and W. Byrne.
Discriminative Speaker Adaptation with Conditional Maximum Likelihood Linear Regression,
in Proc. of the Eurospeech Conference. Aalborg, Denmark, Sep, 2001.
- A. Gunawardana and W. Byrne.
Convergence of DLLR Rapid Speaker Adaptation Algorithms,
in Proc. of ISCA-ITR Workshop on Adaptation Methods for Speech Recognition. Sophia-Antipolis, France, Aug. 2001.
- R. Chengalvarayan and L. Deng.
A Maximum a Posteriori Approach to Speaker Adaptation Using the Trended Hidden Markov model,
in IEEE Trans. on Speech and Audio Processing. Volume: 9 Issue: 5 , July 2001, pp. 549-557.
- R. Togneri and L. Deng.
An EKF-Based Algorithm for Learning Statistical Hidden Dynamic Model Parameters for Phonetic Recognition,
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Salt Lake City, Utah, May, 2001.
- L. Lee, P. Fieguth and L. Deng.
A Functional Articulatory Dynamic Model for Speech Production,
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Salt Lake City, Utah, May, 2001.
- J. Ma and L. Deng.
A Path-Stack Algorithm for Optimizing Dynamic Regimes in a Statistical Hidden Dynamical Model of Speech,
in Computer, Speech and Language. Academic Press, 2000.
- L. Deng and J. Ma.
Spontaneous Speech Recognition Using a Statistical Coarticulatory Model for the Vocal Tract Resonance Dynamics,
in Journal of the Acoustical Society of America, 2000.
- J. Sun, X. Jing and L. Deng.
Data-driven Model Construction for Continuous Speech Recognition Using Overlapping Articulatory Features,
in Proc. of the Int. Conf. on Spoken Language Processing. Beijing, China, Oct, 2000.
- H. Jiang and L. Deng.
A Robust Training Strategy Against Straneous Acoustic Variations for Spontaneous Speech Recognition,
in Proc. of the Int. Conf. on Spoken Language Processing. Beijing, China, Oct, 2000.
- L. Deng.
Switching Dynamic System Models for Speech Articulation and Acoustics,
in Proc. of the IMA Workshop, Sep, 2000.
- M. Richardson, M, Hwang, A. Acero and X. Huang.
Improvements on Speech Recognition for Fast Talkers,
in Proc. of the Eurospeech Conference. Budapest, Sep 1999.
- R. Chengalvarayan and L. Deng.
Speech Trajectory Discrimination using the Minimum Classification Error Learning,
in IEEE Trans. on Speech and Audio Processing. Volume: 6 Issue: 6 ,
Nov. 1998, pp.505-515.
- H. Sheikhzadeh and L. Deng.
Speech Analysis and Recognition using Interval Statistics Generated from a Composite Auditory Model,
in IEEE Trans. on Speech and Audio Processing. Volume: 6 Issue: 1 , Jan. 1998, pp. 90-94.
- A. Acero and X. Huang.
Speaker and Gender Normalization for Continuous-Density Hidden Markov Models, in
Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing.
Atlanta, GA. May 1996.
- X. Huang, A. Acero, F. Alleva, M. Y. Hwang, L. Jiang and M. Mahajan.
Microsoft Windows Highly Intelligent Speech Recognizer: Whisper
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Detroit, MI. May 1995.
and publications on statistical language modeling:
- D. Yu, M. Mahajan, P. Mau, and A. Acero.
Maximum Entropy Based Generic Filter for Language Model Adaptation
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Philadelphia, Mar, 2005.
- C. Chelba and A. Acero.
Adaptation of Maximum Entropy Capitalizer: Little Data Can Help a Lot,
in Proc. of EMNLP. Barcelona, Spain, Jul. 2004.
- J. G. Kahn, M. Ostendorf and C. Chelba.
Parsing Conversational Speech Using Enhanced Segmentation,
in Proc. of HLT/NAACL. Boston, MA, May 2004.
- C. Chelba and A. Acero.
Conditional ML Estimation Using Rational Function Growth Transform,
in Snowbird Learning Workshop. Utah, Apr. 2004.
- A. Acero, Y. Wang and K. Wang.
A Semantically Structured Language Model,
in Special Workshop in Maui (SWIM), Jan 2004.
- C. Chelba and A. Acero.
Discriminative Training of N-gram Classifiers for Speech and Text Routing,
in Proc. of the Eurospeech Conference. Geneva, Switzerland, Sep, 2003.
- C. Chelba, M. Mahajan and A. Acero.
Speech Utterance Classification,
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Hong Kong, Apr, 2003.
- C. Chelba and P. Xu.
Richer Syntactic Dependencies for Structured Language Modeling,
in Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding. Madonna di Campiglio, Italy, Dec, 2001.
- C. Chelba. and M. Mahajan.
Information Extraction Using the Structured Language Model,
in Proc. of the Int. Conf. on Empirical Methods in Natural Language Processing. Pittsburgh, PA, June 2001.
- J. Goodman.
Classes for Fast Maximum Entropy Training
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Salt Lake City, Utah, May, 2001.
- C. Chelba.
Portability of Syntactic Structure for Language Modeling,
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Salt Lake City, Utah, May, 2001.
- J. Goodman and J. Gao.
Language Model Size Reduction by Pruning and Clustering,
in Proc. Int. Conf. on Spoken Language Processing. Beijing, China, Oct, 2000.
- J. Goodman.
Putting it all Together: Language Model Combination
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Istanbul, Turkey, Utah, June, 2000.
- F. Jelinek and C. Chelba.
Putting Language into Language Modeling,
in Proc. of the Eurospeech Conference. Budapest, Hungary, Sep, 1999.
- C. Chelba and F. Jelinek.
Recognition Performance of a Structured Language Model,
in Proc. of the Eurospeech Conference. Budapest, Hungary, Sep, 1999.
- M. Mahajan, D. Beeferman and X. Huang.
Improved Topic-Dependent Language Modeling Using Information Retrieval Techniques,
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Phoenix, Mar., 1999.
- C. Chelba.
Refinement of a Structured Language Model,
in Proc. of the Int. Conf. on Advances in Pattern Recognition. 1998.
- C. Chelba.
Exploiting Syntactic Structure for Language Modeling,
in Proc. of the Association for Computational Linguistics. Stanford, CA, Aug, 1998.
- C. Chelba, D. Engle, F. Jelinek, V. Jimenez, S. Khudanpur, L. Mangu, H.
Printz, E. Ristad, R. Rosenfeld, A. Stolcke and D. Wu.
Structure and Performance of a Dependency Language Model,
in Proc. of the Eurospeech Conference. Rhodes, Greece, Sep, 1997.
- C. Chelba.
A Structured Language Model,
in Proc. of the European Association for Computational Linguistics. Madrid, July, 1997.
and publications on spoken language systems:
- D. Yu, Y. Ju, and A. Acero.
An Effective and Efficient Utterance Verification Technology Using Word N-gram Filler Models,
in Proc. of the Interspeech Conference. Pittsburgh, Sep, 2006.
- Y. Ju, Y. Wang, and A. Acero.
Call Analysis with Classification Using Speech and Non-Speech Features,
in Proc. of the Interspeech Conference. Pittsburgh, Sep, 2006.
- Y. Wang and A. Acero.
Discriminative Models for Spoken Language Understanding,
in Proc. of the Interspeech Conference. Pittsburgh, Sep, 2006.
- Y. Wang, J. Lee and A. Acero.
Speech Utterance Classification Model Training without Manual Transcriptions
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Toulouse, May, 2006.
- D. Yu, Y. Ju, Y. Wang, and A. Acero.
N-Gram Based Filler Model for Robust Grammar Authoring
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Toulouse, May, 2006.
- J. Silva, C. Chelba and A. Acero.
Pruning Analysis for the Position Specific Posterior Lattices for Spoken Document Search
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Toulouse, May, 2006.
- A. Acero.
Building Voice User Interfaces,
in MSDN Magazine. Feb. 2006.
- L. Deng and D. Yu.
A Speech-Centric Perspective for Human Computer Interface -- A Case Study,
in Journal of VLSI Signal Processing Systems (Special issue on Multimedia Signal Processing),
Vol. 41, N. 3, Nov 2005, pp. 255-269.
- Y. Wang, L. Deng, and A. Acero.
Spoken Language Understanding,
in IEEE Signal Processing Magazine. Volume: 22 Issue: 5, Sep. 2005, pp. 16-31.
- D. Yu and A. Acero.
Semiautomatic Improvements of System-Initiative Spoken Dialog Applications Using Interactive Clustering,
in IEEE Trans. on Speech and Audio Processing. Volume: 13 Issue: 5 , Sep. 2005, pp. 661-671.
- C. Chelba and A. Acero.
Indexing Uncertainty for Spoken Document Search,
in Proc. of the Interspeech Conference. Lisbon, Portugal, Sep, 2005.
- Y. Wang and A. Acero.
SGStudio: Rapid Semantic Grammar Development for Spoken Language Understanding,
in Proc. of the Interspeech Conference. Lisbon, Portugal, Sep, 2005.
- C. Chelba and A. Acero.
SPEECH OGLE: Indexing Uncertainty for Spoken Document Search
in Proc. of the Association for Computational Linguistics. Ann Arbor, June, 2005.
- C. Chelba and A. Acero.
Position Specific Posterior Lattices for Indexing Speech
in Proc. of the Association for Computational Linguistics. Ann Arbor, June, 2005.
- X. Li, A. Gunawardana, and A. Acero.
Unsupervised Semantic Intent Discovery from Call Log Acoustics
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Philadelphia, Mar, 2005.
- D. Ollason, Y. Ju, S. Bhatia, D. Herron and J. Liu.
MS Connect: A fully featured auto-attendant. System Design, Implementation and Performance,
in Proc. Int. Conf. on Spoken Language Processing. Jeju, South Korea, Oct, 2004.
- Y. Wang and Y. Ju.
Creating Speech Recognition Grammars from Regular Expressions for Alphanumeric Concepts,
in Proc. Int. Conf. on Spoken Language Processing. Jeju, South Korea, Oct, 2004.
- K. Wang.
Spoken Language Interface in ECMA/ISO Telecommunication Standards,
in Proc. Int. Conf. on Spoken Language Processing. Jeju, South Korea, Oct, 2004.
- D. Yu, M. Hwang, P. Mau, A. Acero and L. Deng.
Unsupervised Learning from Users’ Error Correction in Speech Dictation,
in Proc. Int. Conf. on Spoken Language Processing. Jeju, South Korea, Oct, 2004.
- L. Deng and X. Huang.
Forum: Author Response to 'For Voice Interfaces, Hold the SALT'
in Communications of the ACM. Vol. 47, No. 7, July 2004, pp. 11-13.
- K. Wang.
A Detection Based Approach to Robust Speech Understanding
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Montreal, May, 2004.
- L. Deng, Y. Wang, K. Wang, A. Acero, H. Hon, J. Droppo, C. Boulis, D. Jacoby, M. Mahajan, C. Chelba, and X. Huang.
Speech and Language Processing for Multimodal
Human-Computer Interaction (invited),
in Journal of VLSI Signal Processing Systems (Special issue on Real-World Speech Processing),
Vol. 36, No. 2, February 2004, pp. 161-187.
- Y. Wang, A. Acero, and C. Chelba.
Is Word Error Rate a Good Indicator for Spoken Language Understanding Accuracy,
in Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding. Virgin Islands, Dec, 2003.
- K. Wang.
Semantic Synchronous Understanding for Robust Spoken Language Applications,
in Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding. Virgin Islands, Dec, 2003.
- L. Deng, K. Wang, A. Acero, H. Hon, J. Droppo, C. Boulis, Y. Wang, D. Jacoby, M. Mahajan, C. Chelba, and X.D.Huang.
Distributed Speech Processing in MiPad's Multimodal User Interface,
in IEEE Trans. on Speech and Audio Processing. Volume: 10 Issue: 8 , Nov 2002, pp. 605-619.
- K. Wang.
A Study of Semantics Synchronous Understanding on Speech Interface Design,
in Proc. of the UIST Conference. Vancouver, Canada, Nov, 2003.
- K. Wang.
Semantic Object Synchronous Understanding in SALT for Highly Interactive User Interface,
in Proc. of the Eurospeech Conference. Geneva, Switzerland, Sep, 2003.
- Y. Wang and A. Acero.
Combination of CFG and N-gram Modeling in Semantic Grammar Learning,
in Proc. of the Eurospeech Conference. Geneva, Switzerland, Sep, 2003.
- Y. Wang and A. Acero.
Concept Acquisition in Example Based Grammar Authoring,
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Hong Kong, Apr, 2003.
- Y. Wang, A. Acero, C. Chelba, B. Frey, and L. Wong.
Combination of Statistical and Rule-based Approaches for Spoken Language Understanding,
in Proc. Int. Conf. on Spoken Language Processing. Denver, Colorado, Sep, 2002.
- K. Wang.
SALT: A Spoken Language Interface for Web-based Multimodal Dialog Systems,
in Proc. Int. Conf. on Spoken Language Processing. Denver, Colorado, Sep, 2002.
- Y. Wang, A. Acero.
Evaluation of Spoken Language Grammar Learning in the ATIS Domain,
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Orlando, Florida, May, 2002.
- Y. Wang and A. Acero.
Grammar Learning for Spoken Language Understanding,
in Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding. Madonna di Campiglio, Italy, Dec, 2001.
- K. Wang.
Natural Language Enabled Web Applications,
in Proc. of the 1st NLP and XML Workshop. Tokyo, Nov 2001.
- Y. Wang.
Robust Language Understanding in MiPAD,
in Proc. of the Eurospeech Conference. Aalborg, Denmark, Sep, 2001.
- X. Huang, A. Acero, C. Chelba, L. Deng, J. Droppo, D. Duchene, J. Goodman,
H. Hon, D. Jacoby, L. Jiang, R. Loynd, M. Mahajan, P. Mau, S. Meredith, S.
Mughal, S. Neto, M. Plumpe, K. Stery,. G. Venolia, K. Wang, Y. Wang.
MIPAD: A Multimodal Interaction Prototype,
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Salt Lake City, Utah, May, 2001.
- X. Huang, A. Acero, C. Chelba, L. Deng, D. Duchene, J. Goodman, H. Hon, D.
Jacoby, L. Jiang, R. Loynd, M. Mahajan, P. Mau, S. Meredith, S. Mughal, S. Neto,
M. Plumpe, K. Wang, Y. Wang.
MIPAD: A Next Generation PDA Prototype,
in Proc. of the Int. Conf. on Spoken Language Processing. Beijing, China, Oct, 2000.
- K. Wang.
Implementation of a Multimodal Dialog System Using Extended Markup Languages,
in Proc. of the Int. Conf. on Spoken Language Processing. Beijing, China, Oct, 2000.
- K. Wang.
A Plan-Based Dialog System with Probabilistic Inferences,
in Proc. of the Int. Conf. on Spoken Language Processing. Beijing, China, Oct, 2000.
- Y. Wang, M. Mahajan and X. Huang.
A Unified Context-Free Grammar And N-Gram Model for Spoken Language Processing,
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Istanbul, Turkey, June, 2000.
- Y. Rui and A. Gupta, and A. Acero.
Automatically Extracting Highlights for TV Baseball Programs,
in ACM Multimedia, pp. 105-115, 2000.
- Y. Wang.
A Robust Parser for Spoken Language Understanding,
in Proc. of the Eurospeech Conference. Budapest, Hungary, Sep, 1999.
- K. Wang.
An Event Driven Model for Dialog Systems,
in Proc. of the Int. Conf. on Spoken Language Processing. Sydney, Australia. Dec 1998.
and publications on speech synthesis:
- A. Acero.
Formant Analysis and Synthesis using Hidden Markov Models,
Proc. of the Eurospeech Conference. Budapest, Sep 1999.
- A. Acero.
A Mixed-Excitation Frequency Domain Model for Time-Scale Pitch-Scale Modification of Speech,
in Proc. of the Int. Conf. on Spoken Language Processing. Sydney, Australia. Dec 1998.
- M. Plumpe and S. Meredith.
Which is More Important in a Concatenative Text-to-Speech System - Pitch, Duration, or Spectral Discontinuity?,
in Proc. of the Third Speech Synthesis Workshop, Jenolan Caves, Australia. Nov 1998.
- M. Plumpe, A. Acero, H. Hon and X. Huang.
HMM-Based Smoothing for Concatenative Speech Synthesis,
in Proc. of the Int. Conf. on Spoken Language Processing. Sydney, Australia. Dec 1998.
- H. Hon, A. Acero, X. Huang, J. Liu and M. Plumpe.
Automatic Generation of Synthesis Units for Trainable Text-to-Speech Systems,
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Seattle, WA. May 1998.
- A. Acero.
Source-Filter Models for Time-Scale Pitch-Scale Modification of Speech,
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Seattle, USA. May 1998.
- X. Huang, A. Acero, H. Hon, Y. Ju, J. Liu, S. Meredith, M. Plumpe.
Recent Improvements on Microsofts Trainable Text-to-Speech System: Whistler,
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Munich, Germany. Apr. 1997.
- X. Huang, A. Acero, J. Adcock, H. Hon, J. Goldsmith, and J. Liu.
Whistler: A Trainable Text-to-Speech System,
in Proc. of the Int. Conf. on Spoken Language Processing. Philadelphia, PA. October 1996.
and publications on speech analysis:
- L. Deng, X. Cui., R. Pruvenok, J. Huang, S. Momen, Y. Chen and A. Alwan.
A Database of Vocal Tract Resonance Trajectories for Research in Speech Processing
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Toulouse, May, 2006.
- I. Bazzi, L. Deng and A. Acero.
An Expectation Maximization Approach for Formant Tracking Using a Parameter-Free Nonlinear Predictor,
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Hong Kong, Apr, 2003.
- J. Droppo and L. Atlas.
Distance Metrics for Discrete Time-Frequency Representations,
in Proc. of the Int. Workshop on DSP. Hunt, Texas, Oct, 2000.
- M. D. Plumpe, T. F. Quatieri, and D. A. Reynolds.
Modeling of the glottal flow derivative waveform with application to speaker identification,
in IEEE Trans. on Speech Audio Processing, vol. 7, no. 5, pp. 569–585, Sept 1999.
- J. Droppo and A. Acero.
Maximum a Posteriori Pitch Tracking,
in Proc. of the Int. Conf. on Spoken Language Processing. Sydney, Australia. Dec 1998.
and publications on speech enhancement
- M. Seltzer and R. Stern.
Subband Likelihood-Maximizing Beamforming for Speech Recognition in Reverberant Environments,
in IEEE Trans. on Audio, Speech and Language Processing. Volume: 14 Issue: 6, Nov 2006. pp. 2109-2121.
- I. Tashev and A. Acero.
Microphone Array Post-Processor Using Instantaneous Direction of Arrival,
in Int. Workshop on Acoustic, Echo and Noise Control (IWAENC). , Paris, France, Sep, 2006.
- A. Subramanya, M. Seltzer, and A. Acero.
Automatic Removal of Typed Keystrokes from Speech Signals,
in Proc. of the Interspeech Conference. Pittsburgh, Sep, 2006.
- Z. Liu, M. Seltzer, A. Acero, I. Tashev, Z. Zhang, and M. Sinclair.
A Compact Multi-Sensor Headset for Hands-Free Communication,
in Proc. of the Workshop on Applications of Signal Processing to Audio and Acoustics. New Paltz, NY, USA, Oct. 2005.
- A. Subramanya, Z. Zhang, Z. Liu, J. Droppo, and A. Acero.
A Graphical Model for Multi-Sensory Speech Processing in Air-and-Bone Conductive Microphones,
in Proc. of the Interspeech Conference. Lisbon, Portugal, Sep, 2005.
- M. Seltzer, A. Acero, and J. Droppo.
Robust Bandwidth Extension of Noise-corrupted Narrowband Speech,
in Proc. of the Interspeech Conference. Lisbon, Portugal, Sep, 2005.
- I. Tashev.
Beamformer Sensitivity to Microphone Manufacturing Tolerances,
in Proc. of the Nineteenth Int. Conference Systems for Automation of Engineering and Research (SAER). St. Konstantin Resort, Bulgaria, Sep. 2005.
- I. Tashev, M. Seltzer, and A. Acero.
Microphone Array for Headset with Spatial Noise Suppressor,
in Proc. of the Ninth Int. Workshop on Acoustic, Echo and Noise Control (IWAENC). Eindhoven, The Netherlands, Sep. 2005.
- I. Tashev and H. Malvar.
A New Beamformer Design Algorithm for Microphone Arrays
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Philadelphia, Mar, 2005.
- I. Tashev and D. Allred.
Reverberation Reduction for Better Speech Recognition
in Joint Workshop on Hands-Free Speech Communication and Microphone Arrays (HSCMA). Rutgers, NJ, Mar, 2005.
- Z. Liu, A. Subramanya, Z. Zhang, J. Droppo, and A. Acero.
Leakage Model and Teeth Clack Removal for Air- and Bone-Conductive Integrated Microphones
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Philadelphia, Mar, 2005.
- J. Hershey, T. Kristjansson and Z. Zhang.
Model-Based Fusion of Bone and Air Sensors for Speech Enhancement and Robust Speech Recognition,
in Proc. ISCA Tutorial and Research Workshop on Statistical and Perceptual Audio Processing. Jeju, South Korea, Oct, 2004.
- Z. Liu, Z. Zhang, A. Acero, J. Droppo and X. Huang.
Direct Filtering for Air- and Bone-Conductive Microphones,
in Proc. IEEE Int. Workshop on Multimedia Signal Processing. Siena, Italy. Sep, 2004.
- L. Deng, Z. Liu, Z. Zhang, and A. Acero.
Nonlinear Information Fusion in Multi-Sensor Processing - Extracting and Exploiting Hidden Dynamics of Speech Captured by a Bone-Conductive Microphone,
in Proc. IEEE Int. Workshop on Multimedia Signal Processing. Siena, Italy. Sep, 2004.
- I. Tashev.
Gain Self-Calibration Procedure for Microphone Arrays,
in Proc. of the Int. Conf. for Multimedia and Expo (ICME). Taipei, Taiwan, Jun. 2004.
- T. Kristjansson, H. Attias, and J. Hershey.
Single Microphone Source Separation Using High Resolution Signal Reconstruction
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Montreal, May, 2004.
- Y. Zheng, Z. Liu, Z. Zhang, M. Sinclair, J. Droppo, L. Deng, A. Acero and X. Huang.
Air and Bone-Conductive Integrated Microphones for Robust Speech Detection and Enhancement,
in Proc. of the IEEE Workshop on Automatic Speech Recognition and Understanding. Virgin Islands, Dec, 2003.
- I. Tashev.
Improving Meetings with Microphone Array Algorithms,
in Proc. of the Machine Learning Meets the User Interface Workshop, Neural Information Processing Systems (NIPS). Whistler, Canada, Dec. 2003.
and publications on vision and audiovisual processing:
- Y. Rui, E. Rudolph, L. He, R. Malvar, M. Cohen, and I. Tashev.
PING: A Group-to-Individual Distributed Meeting System,
in Int. Conference Multimedia and Expo ICME’06. Toronto, Canada, July 2006.
- Y. Rui, Z. Liu, S. Kallin, G. Janke and C. Paya.
Characters or Faces: A User Study on Ease of Use for HIPs
in Proc. Int. Conf. on Human Interactive Proofs. Bethlelem, PA, May, 2005.
- G. Guo, C. Dyer, and Z. Zhang.
Linear Combination Representation for Outlier Detection in Motion Tracking
in Proc. Int. Conf. on Computer Vision and Pattern Recognition. June 20-25, 2005.
- L. He and Z. Zhang.
Real-Time Whiteboard Capture and Processing Using a Video Camera for Teleconferencing
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Philadelphia, Mar, 2005.
- Z. Zhang and L. He.
Remote Collaboration on Physical Whiteboards
in Proc. Fifth Pacific-Rim Conference on Multimedia, Nov.30-Dec.3, 2004.
- Z. Zhang.
Response to Dialog: Object Detection and Object Variance in Autonomous Mental Development
in the Newsletter of the Autonomous Mental Development Technical Committee, IEEE Computational Intelligence Society.
Vol. 1, No. 2, pp. 5-6, October, 2004.
- H. Zhou, Z. Zhang, and T. Huang.
Visual Echo Cancellation in a Projector-Camera-Whiteboard System
in Proc. Int. Conf. on Image Processing.
Vol. 5, pp. 2885--2888, Oct.24--27, 2004, Singapore.
- R. Yang and Z. Zhang.
Eye Gaze Correction With Stereovision for Video-Teleconferencing
in IEEE Trans. on Pattern Analysis and Machine Intelligence. Vol.26, No.7, pages 956--960, 2004
- Y. Rui and Z. Liu.
ARTiFACIAL: Automated Reverse Turing test using FACIAL features
ACM Multimedia Systems Journal (Springer). Vol 9, No. 6, pp. 493 - 502, June 2004.
- C. Wu, C. Liu, H.-Y. Shum, Y.-Q. Xu, and Z. Zhang.
Automatic Eyeglasses Removal from Face Images
in IEEE Trans. on Pattern Analysis and Machine Intelligence. Vol.26, No.3, pages 322–336, 2004.
- Z. Zhang.
Camera Calibration With One-Dimensional Objects
in IEEE Trans. on Pattern Analysis and Machine Intelligence. Vol.26, No.7, pages 892—899, 2004
- Z. Liu, Z. Zhang, and Y. Shan.
Image-Based Surface Detail Transfer
IEEE Computer Graphics and Applications. Vol.24, No.3, pages 30–35, 2004.
- Z. Zhang, Z. Liu, D. Adler, M. F. Cohen, E. Hanson, and Y. Shan.
Robust and Rapid Generation of Animated Faces from Video Images: A Model-Based Modeling Approach
Int. Journal of Computer Vision. Vol.58, No.2, pages 93—119, 2004.
- J. Hershey, H. Attias, N. Jojic, and T. Kristjansson.
Audio-Visual Graphical Models for Speech Processing
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Montreal, May, 2004.
- Z. Zhang and L. He.
Notetaking with a Camera: Whiteboard Scanning and Image Enhancement
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Montreal, May, 2004.
- H. Xiao, J. Weng, and Z. Zhang.
Office Presence Detection Using Multimodal Context Information
in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing. Montreal, May, 2004.
- Z. Wen, Z. Liu, M. Cohen, J. Li, K. Zheng, and T. Huang.
Low Bit-Rate Video Streaming for Face-To-Face Teleconference
in Proc. of the IEEE Int. Conf. on Multimedia and Expo. Taipei, Jun, 2004.
- Z. Zhang and Y. Shan.
Incremental Motion Estimation through Modified Bundle Adjustment
in Proc. Int. Conference on Image Processing (ICIP). Vol.II, pp.343–346, September 14-17, 2003, Barcelona, Spain
- R. Cutler, Y. Rui, A. Gupta, J. Cadiz, I. Tashev, L. He, A. Colburn, Z. Zhang, Z. Liu, and S. Silverberg.
Distributed Meetings: A Meeting Capture and Broadcasting System,
in Proc. of the MACM Multimedia 2002. Nice, France, Dec. 2002.
Last updated: Jan 27, 2006
|