Research interests
- Multichannel audio signals processing
- Algorithms for arrays of transducers (microphones, speakers, antennas)
- Processing of signals for enhancement, de-noising, de-reverberation, etc.
- Statistical processing of audio, biological, radio signals
Background
Education
- Several short specializations in the area of eLearning in France (ENIC, ENSEA) and Italy (Bologna University) (1995-1996).
- Ph.D. in Computer Science, 1990, Technical University of Sofia, Bulgaria.
- Master's Degree in Electronics, 1984, Technical University of Sofia, Bulgaria.
Career
- Microsoft Corporation, Redmond, USA: various positions in COM+ and Application Center teams, 1998-2001. Joined Microsoft Research in 2001.
- Technical University of Sofia, Bulgaria: Assistant Professor, 1989-1998, Faculty of Electronic Engineering and Technology (lectures and labs for "Data and Signal Processing" and "Real time systems programming").
- Technical University of Sofia, Bulgaria: Researcher, 1986-1989, Research and Development Department
- Technical University of Sofia, Bulgaria: R&D engineer, 1984-1986, Research and Development Department
Professional Activities
- Senior Member of IEEE since 2006.
- Member of AES, serving as member of the AES Pacific Northwest Committee since 2006.
- Tutorial "Defeating ambient noise - practical approaches" during ICASSP 2006 in Toulouse, France.
- Served as member of the Technical Program Committee of ICME 2002 and ICME 2003, session chair in ICME 2003.
- Co-founder in 1986 and organizational secretary for 12 years of the International Conference Systems for Automation of Engineering and Research (SAER), now serving as member of the International Organizing Committee. Now it is renamed to International Conference on Information Technologies INFOTECH.
- Member of the TC of International Workshop on Acoustics, Echo and Noise Control (IWAENC) since 2006. One of the organizers of IWAENC 2008 in Seattle.
- Member of the TC of Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) since 2006.
- Member of IEEE Signal Processing Society Technical Committee on Audio and Electroacoustics since 2009.
- Reviewer for most of the signal and audio processing journals.
Selected projects
Here are some of the projects I enjoyed during my career in no particular order.
Microphone Array Project
Providing a hands free sound capture in modern computers for the needs of real-time communication and speech recognition (voice commands and dictation) is a technologically difficult and challenging task. Designing a real-time microphone array processing algorithm for optimal noise suppression, besides being an interesting research problem, provides additional challenges when the time comes for productizing of the software and hardware. After building the first prototypes of USB linear four element array for office use and circular eight element array for the center of the conference room table started the difficult transition from "microphone array algorithms" to "algorithms for manufactureable microphone arrays". The whole process includes working with product teams inside Microsoft and evangelization of this technology to laptop, tablet and computer monitor manufacturers outside Microsoft by publishing white papers and giving talks. The project is in very advanced phase for shipping as integrated in Windows Vista microphone array support. Article about the project is posted here, more details can be found on the Microphone Array Project web page.
Control system for Universal Laser Ranging System ULIS-630
In the pre-GPS era the way to measure precisely the geographical coordinates of given point was by watching satellites and measure the distance to them using laser ranging systems. Universal Laser Ranging System ULIS-630 was a large international project with participation of research and academic organizations from Eastern Europe (Academy of Sciences of the ex-Soviet Union in Moscow, Lithuanian Technical University in Riga, Bulgarian Academy of Sciences in Sofia, Moscow Electro-technical Institute, Technical University of Sofia, others). The optical system is with so called horizontal mount and consists of two paired telescopes: transmitting (Galileo type, focal length of 12 meters, green laser with 1 J energy of the 1ns long pulse) and receiving (Cassegrain type, 630 mm main mirror diameter, 11.5 meters focal length and photo-multiplier tube as receiver). It is controlled by a distributed computer system for tracking the visible trajectory of the satellite, firing the laser, collecting and processing the results. My work here become the core of my Ph.D. thesis. The system itself, besides everything else, offered decent view of planets and celestial objects (see picture of Jupiter here).
Application Center 2000 Pre-Flight Checks
Microsoft Application Center is a web and component clusters management software. Targeting middle range web sites it offers set of innovative features. The "Application Center 2000 Pre-Flight Checks" was designed to improve out-of-the-box experience of the IT personnel, but become the unofficial tutorial of the product. I enjoyed finding the easiest way to demonstrate the product features with minimum additional software. Parts of this document were used in the Application Center Resource Kit.
Distributed Meetings System
This was my first project in Microsoft Research. It was a big, complex system for meetings capture and recording. The capturing devices were 360 degrees RingCam, eight element circular microphone array, overview camera, whiteboard camera, etc. The project combined technologies, designed in the Collaboration, Communication and Multimedia Team and included deployment of ten of these systems in various conference rooms in Microsoft to study the users behavior and to get some feedback. See our ACM paper for more details.
Dereverberation Project
The project started with simple goal to reduce the word error rate for speech recognition purposes for distances of up to 1.5 meters. This was the summer project of my intern Daniel Allred, Ph.D. student in GeorgiaTech. During his the second internship in Microsoft Research the project was extended and re-scoped to cover some perceptual scenarios as well. See his final presentation here. More details can be found in the Dereverberation project page.
Speaker Array Project
This is pure research project for now. With my colleagues Jasha Droppo and Mike Seltzer, we decided to see what can be reused from our experience in beamforming design for microphone arrays. The loudspeaker array consists of sixteen inexpensive speakers and has linear geometry. The project was demonstrated during Microsoft Research TechFest 2007 as "Personal Audio Space" and definitely had the "Wow!" effect among the visitors in our booth. We demonstrated focusing the sound in given area and dual beam mode when you hear one music channel in one place and a second music channel in another. The attending journalists liked the demo and it was widely published in the press: WIRED Blog Network, Seattle PI, MIT Technology Review, MSR web site, many others, in different languages and from different countries. Currently we are exploring various scenarios and potential applications for this technology. See further the project page here.
Selected presentations
- "Commute UX: Telephone Dialog System for Location-based Services". Presentation during SIGdial Workshop on Disclosure and Dialogue 2007, Antwerp, Belgium, September 2007. [PDF]
- "Defeating Ambient Noise: Practical Approaches for Noise Reduction and Suppression". Tutorial during IEEE International Conference of Acoustic, Speech and Signal Processing ICASSP 2006, Toulouse, France, May 2006. [Slides, References]
- "Microphone Array for Headset with Spatial Noise Suppressor". Presentation during Ninth International Workshop on Acoustic, Echo and Noise Control IWAENC 2005, Eindhoven, The Netherlands, September 2005. [PDF]
- "Microphone Array Support in Windows Longhorn". Presentation during Windows Hardware Experience Conference WinHEC 2005, Seattle, USA, May 2005. The same talk given during WinHEC 2005, Taipei, Taiwan, June 2005. [PDF]
- "Microphone Array Project in Microsoft Research: Approach and Results". Presentation in National Taiwan University and Academia Sinica, July 2004, Taipei, Taiwan. [PDF]
- "Improving Meetings with Microphone Array Algorithms". Presentation during Machine Learning Meets the User Interface Workshop, Neural Information Processing Systems NIPS 2003, Whistler, Canada, December 2003. [PDF]
- "Application Center 2000 Interoperability and Reliability". Presentation during Application Center 2000 Airlift, Redmond, USA, May 2001. [PDF]
2009
- Ivan Tashev, Michael L. Seltzer, and Yun-Cheng Ju, Speech and sound for in-car infotainment systems, in Automotive User Interfaces and Interactive Vehicular Applications (AutomotiveUI 2009), Association for Computing Machinery, Inc., Essen, Germany, 22 September 2009
- Ivan Tashev, Michael Seltzer, Yun-Cheng Ju, Ye-Yi Wang, and Alex Acero, Commute UX: Voice Enabled In-car Infotainment System, in Mobile HCI '09: Workshop on Speech in Mobile and Pervasive Environments (SiMPE), Association for Computing Machinery, Inc., Bonn, Germany, 15 September 2009
- Yun-Cheng Ju, Michael Seltzer, and Ivan Tashev, Improving Perceived Accuracy for In-Car Media Search, International Speech Communication Association, September 2009
- Ivan Tashev, Andrew Lovitt, and Alex Acero, Unified Framework for Single Channel Speech Enhancement, in 2009 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing, IEEE, Victoria B.C., Canada, 24 August 2009
- Ivan Tashev, Sound Capture and Processing: Practical Approaches, pp. 388, Wiley, July 2009
- Young-In Song, Ye-Yi Wang, Yun-Cheng Ju, Mike Seltzer, Ivan Tashev, and Alex Acero, Voice Search of Structured Media Data, in International Conference on Acoustics, Speech and Signal Processing, Institute of Electrical and Electornic Engineers, Inc., Taipei, Taiwan, April 2009
2008
- Michael Seltzer and Ivan Tashev, A Log-MMSE Adaptive Filter Using a non-Linear Spatial Filter, in Proceedings of International Workshop on Acoustic, Echo and Noise Control IWAENC 2008, Seattle, USA, September 2008
- Ivan Tashev and Michael Seltzer, Data Driven Beamformer Design for Binaural Headset, in Proceedings of International Workshop on Acoustic, Echo and Noise Control IWAENC 2008, Seattle, USA, September 2008
- Ivan Tashev, Slavy Mihov, Tyler Gleghorn, and Alex Acero, Sound Capture System and Spatial Filter for Small Devices, in Proceedings of Interspeech 2008, International Speech Communication Association, Brisbane, Australia, September 2008
- Dong Yu, Li Deng, Jasha Droppo, Jian Wu, Yifan Gong, and Alex Acero, Robust speech recognition using cepstral minimum-mean-square-error noise suppressor, in IEEE Trans. Audio, Speech, and Language Processing, vol. 16, no. 5, Institute of Electrical and Electronics Engineers, Inc., July 2008
- Slavy Mihov, Tyler Gleghorn, and Ivan Tashev, Enhanced Sound Capture System for Small Devices, in Proceedings of XLIII International Scientific Conference on Information, Communication, and Energy Systems and Technologies ICEST 2008, Nis, Serbia, June 2008
- Ivan Tashev, Jasha Droppo, Michael Seltzer, and Alex Acero, Robust Design of Wideband Loudspeaker Arrays, in Proc. of International Conference on Audio, Speech and Signal Processing, Institute of Electrical and Electronics Engineers, Inc., Las Vegas, USA, April 2008
- Nilesh Madhu, Ivan Tashev, and Alex Acero, An EM-based Probabilistic Approach for Acoustic Echo Suppression, in Proceedings of International Conference on Audio, Speech and Signal Processing ICASSP 2008, Institute of Electrical and Electronics Engineers, Inc., Institute of Electrical and Electronics Engineers, Inc., Las Vegas, USA, April 2008
- Luis Buera, Jasha Droppo, and Alex Acero, Speech Enhancement using a Pitch Predictive Model, in Proc. of the Int. Conf. on Acoustics, Speech, and Signal Processing, Institute of Electrical and Electronics Engineers, Inc., April 2008
2007
- Ivan Tashev, Michael Seltzer, Y. C. Ju, Dong Yu, and Alex Acero, Commute UX: Telephone Dialog System for Location-based Services, in Proceedings of SIGdial Workshop on Disclosure and Dialogue 2007, Antwerp, Belgium, September 2007
- Michael Seltzer, Y. C. Ju, Ivan Tashev, and Alex Acero, Robust Location Understanding in Spoken Dialog Systems Using Intersections, in Proceedings of Interspeech 2007, Antwerp, Belgium, August 2007
- Byung-Jun Yoon, Ivan Tashev, and Alex Acero, Robust Adaptive Beamforming Algorithm Using Instantaneous Direction of Arrival with Enhanced Noise Suppression Capability, in Proceedings of International Conference on Audio, Speech and Signal Processing ICASSP 2007, Honolulu, USA, April 2007
- Ivan Tashev and Henrique Malvar, Stationary-tones Interference Cancellation Using Adaptive Tracking, in International Conference on Acoustics, Speech and Signal Processing, Institute of Electrical and Electronics Engineers, Inc., Honolulu, USA, April 2007
- Michael Seltzer, Ivan Tashev, and Alex Acero, Microphone Array Post-Filter Using Incremental Bayes Learning to Track the Spatial Distribution of Speech and Noise, in Proceedings of International Conference on Audio, Speech and Signal Processing ICASSP 2007, Honolulu, USA, April 2007
2006
- Ivan Tashev and Alex Acero, Microphone Array Post-Processor Using Instantaneous Direction of Arrival, in Proceedings of International Workshop on Acoustic, Echo and Noise Control IWAENC 2006, Paris, France, September 2006
- Yong Rui, Eric Rudolph, Li-wei He, Rico Malvar, Michael Cohen, and Ivan Tashev, PING: A Group-to-Individual Distributed Meeting System, in Proceedings of International Conference Multimedia and Expo, Institute of Electrical and Electronics Engineers, Inc., Toronto, Canada, July 2006
- Ivan Tashev, Jasha Droppo, and Alex Acero, Suppression Rule for Speech Recognition Friendly Noise Suppressors, in Proceedings of Eight International Conference Digital Signal Processing and Applications DSPA’06, Moscow, Russia, March 2006
2005
- Zicheng Liu, Michael Seltzer, Alex Acero, Ivan Tashev, Zhengyou Zhang, and Mike Sinclair, A Compact Multi-Sensor Headset for Hands-Free Communication, in Proceedings of Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, USA, October 2005
- Ivan Tashev, Beamformer Sensitivity to Microphone Manufacturing Tolerances, in Proceedings of Nineteenth International Conference Systems for Automation of Engineering and Research SAER 2005, St. Konstantin Resort, Bulgaria, September 2005
- Ivan Tashev, Michael Seltzer, and Alex Acero, Microphone Array for Headset with Spatial Noise Suppressor, in Proceedings of Ninth International Workshop on Acoustic, Echo and Noise Control IWAENC 2005, Eindhoven, The Netherlands, September 2005
- Ivan Tashev and Henrique Malvar, A new beamformer design algorithm for microphone arrays, in International Conference of Acoustic, Speech and Signal Processing, Institute of Electrical and Electronics Engineers, Inc., Philadelphia, USA, March 2005
- Ivan Tashev and Daniel Allred, Reverberation reduction for improved speech recognition, in Proceedings of Hands-Free Communication and Microphone Arrays, Piscataway, USA, March 2005
2004
- Ivan Tashev, Gain Self-Calibration Procedure for Microphone Arrays, in Proceedings of International Conference for Multimedia and Expo ICME 2004, Taipei, Taiwan, June 2004
2003
- Ivan Tashev, Improving Meetings with Microphone Array Algorithms, in Machine Learning Meets the User Interface Workshop, Neural Information Processing Systems NIPS 2003, Whistler, Canada, December 2003
2002
- Ross Cutler, Yong Rui, Anoop Gupta, JJ Cadiz, Ivan Tashev, Li-wei He, Alex Colburn, Zhengyou Zhang, Zicheng Liu, and Steve Silverberg, Distributed Meetings: A Meeting Capture and Broadcasting System, in Proceedings of ACM Multimedia 2002, Nice, France, December 2002
For the full list of publications click here.
Contact info
E-mail: ivantash--at--microsoft--dot--com
Postal Mail: Microsoft Corporation, One Microsoft Way, Redmond WA, 98052-6399, USA
Tel: +1 (425) 706-1667
Fax: +1 (425) 936-7329 (This is the main Microsoft FAX number so make sure to send documents to my attention)



