Imed Zitouni

Imed Zitouni
PRINCIPAL RESEARCHER
.

I am a member of the Relevance and Measurement team of Microsoft that I joined in October 2012. I work in improving Bing’s quality by providing better metrics. My current research interest is in the area of information retrieval (IR) focusing on the use of statistics and machine learning techniques to develop web scale offline and online metrics for search engines. I am also interested in using Natural Language Processing (NLP) technologies to add a layer of semantics and understanding to search engines. I am a believer that next generation search engines will be based on dialog and language understanding.

Prior to joining Microsoft, I worked as a research member for IBM Research in the Multilingual NLP group for almost a decade, where I served as team-lead in several NLP projects. During this time, I was again focusing on NLP, IR, machine translation, speech-recognition, language modeling and machine learning. I was key member of several government projects including the GALE (Global Autonomous Language Exploitation) program. Prior to IBM, I was a research member of Bell Laboratories, Lucent Technologies, for almost half dozen years working on language modeling, speech recognition, spoken dialog systems and speech understanding. During the last few years at Bell Labs, I was also in charge of the speech and natural language call routing activities leading a small team of very talented researchers. Prior to Bell Labs, I experiment the startup experience at DIALOCA in Paris, France, working on e-mail steering and language modeling. I also served as temporary assistant professor at the University of Nancy 1, France. I received my M.Sc. and Ph.D. with the highest-honors from the University-of-Nancy1 France. In 1995, I obtained a MEng degree in computer science from ENSI in Tunisia.

I am a senior member of IEEE, served as a member of the IEEE Speech and Language Processing Technical Committee (99-11), the Information Officer of the ACL SIG on Semitic-Languages, associate editor of TALIP ACM journal and a member of ISCA and ACL. I served as chair and reviewing-committee-member of several conferences and journals. I am the author/co-author of more than 80 papers in international conferences and journals.

My recent book is “Multilingual Natural Language Processing Application: from Theory to Practice”, by Prentice Hall.

Publications

Publications

Book

  • I. Zitouni and D. Bikel. Multilingual Natural Language Processing: from Theory to Practice. Prentice Hall. 2012

 

Book Chapters

  • Xiaoqiang Luo, Imed Zitouni. Entity Detection and Tracking. Multi-Lingual Natural Language Processing, Prentice Hall, 2012
  • D. Bikel, V. Castelli, R. Florian, X. Luo, S. McCarley, T. Ward and I. Zitouni. “Snippets: Using Heuristics to Bootstrap a Machine Learning Approach.” Chapter 4 on Distillation. Handbook of Natural Language Processing and Machine Translation – DARPA Global Autonomous Language Exploration. Editors: Joseph Olive, Caitlin Christianson and John McCary. Publisher: Springer. 2011. ISBN 978-1-4419-7712-0.
  • Imed Zitouni & Xiaoqiang Luo & Radu Florian. A Statistical Model for Arabic Mention Detection and Chaining. Arabic Computational Linguistics. Chapter 9. ISBN: 9781575865430 March 2010.
  • Jeff Sorensen & Imed Zitouni. Finite State Based Arabic Word Segmentation. Arabic Computational Linguistics. Chapter 5. ISBN: 9781575865430 March 2010.
  • Q. Zhou and I. Zitouni. Arabic Dialectal Speech Recognition in Mobile Communication Services. Speech Recognition. Edited by: France Mihelic and Janez Zibert, 2008, ISBN 978-953-7619-29-9. Publisher: IN-TECH
  • I. Zitouni, Linearly Interpolated Hierarchical n-gram Language Models for Speech Recognition Engines. Robust Speech Recognition and Understanding, Edited by M. Grimm and K. Kroschel, 2007. ISBN 978-3-902613-08-0

 

Journal

  • Imed Zitouni. Introduction to Arabic Natural Language Processing. Computational Linguistics Journal, V. 37 N. 3. 2011.
  • Smita Vemulapalli, Xiaoqiang Luo, John F. Pitrelli, Imed Zitouni . Using Bagging and Boosting Techniques for Improving Coreference Resolution. Informatica 1(34), 111-118, 2010
  • Imed Zitouni and Radu Florian. Cross Language Information Propagation for Arabic Mention Detection. Journal of ACM Transactions on Asian Language Information Processing. December 09.
  • Yassine Benajiba and Imed Zitouni. Morphology Based Segmentation Combination for Arabic Mention Detection. ACM Transactions on Asian Language Information Processing. December 09.
  • I. Zitouni, X. Luo and R. Florian. A Cascaded Approach to Mention Detection and Chaining in Arabic. IEEE Transactions on Audio, Speech and Language Processing. Volume: 17, p 935-944. July 2009.
  • I. Zitouni and R. Sarikaya. Arabic Diacritic Restoration Approach Based on Maximum Entropy Models. Computer Speech and Language Journal, June 2008.
  • I. Zitouni. Constrained Minimization and Discriminative Training for Natural Language Call Routing. IEEE Transactions on Audio, Speech and Language Processing. January 2008.
  • I. Zitouni. Backoff Hierarchical Class N-gram Language Models: Effectiveness to Model Unseen Events in Speech Recognition. Journal of Computer Speech and Language, Academic Press. January 2007.
  • I. Zitouni, Q. Zhou, M. Lee, P. Danielsen. Media Services in SIP Networks. Bell-Labs Technical Journal. 2004.
  • I. Zitouni, H. Kuo, C. Lee. Boosting and Combination of Classifiers for Natural Language Call Routing Systems. Speech Communication Journal 41, 2003.
  • I. Zitouni, K. Smaili, J-P. Haton. Statistical Language Modelling Based on Variable Length Sequences. Journal of Computer Speech and Language, Academic Press, Volume 7, Issue 1, January 2003.
  • I. Zitouni. A Hierarchical Language Model Based on Variable-Length Class Sequences: The MC approach. IEEE Transactions on Speech and Audio Processing, March (2002).
  • I. Zitouni. Modélisation statistique du langage utilisant des séquences de longueur variable. Revue francophone internationale en Sciences Cognitives In Cognito, 19 (2001).
  • I. Zitouni. Modélisation du langage pour les systèmes de reconnaissance de la parole: application à MAUD. Revue francophone internationale en Sciences Cognitives In Cognito, 20 (2001).
  • F. Bimbot, M. El-Beze, S. Igounet, M. Jardino, K. Smaïli, I. Zitouni. An Alternative Scheme for Perplexity Estimation and its Assessment for the Evaluation of Language Models. Journal of Computer Speech and Language, Academic Press. January 2001

 

International Conference and Workshop

  • Imed Zitouni. Cross Language Information Propagation for Better Semantic Knowledge and Models: Case of Mention Detection. ALECSO Workshop of experts on morphological analyzers for the Arabic Language. Tunis, Tunisia, August 2011
  • Imed Zitouni. Statistical Approach to Restore Arabic Diacritics. ALECSO Workshop of experts on morphological analyzers for the Arabic Language. Damascus, Syria, April 2011
  • Lamia Hadrich Belguith, Imed Zitouni, Nouha Chaâben Kammoun et Chafik Aloulou, Necessary Steps toward Syntactic analyzer for Arabic "المراحل الأساسية لبناء محلل نحوي للغة العربية". ALECSO Workshop of experts on morphological analyzers for the Arabic Language. Damascus, Syria, April 2011
  • Yassine Benajiba and Imed Zitouni. Using Parallel Corpora to Enhance Mention Detection. EMNLP’10. October 9-11, 2010 — MIT, Massachusetts, USA.
  • Radu Florian, John Pitrelli, Salim Roukos and Imed Zitouni. Improving Mention Detection Robustness to Noisy Input. EMNLP’10. October 9-11, 2010 — MIT, Massachusetts, USA.
  • Ahmad Emami, Hong-Kwang J. Kuo, Imed Zitouni and Lidia Mangu. Augmented Context Features for Arabic Speech Recognition. Interspeech’10. Makuhari, Japan. September 26-30, 2010
  • Yassine Benajiba begin_of_the_skype_highlightingend_of_the_skype_highlighting, Imed Zitouni, Mona Diab and Paolo Rosso. Arabic Named Entity Recognition: Using Features Extracted from Noisy Data. Proceedings of the ACL’10.Uppsala, Sweden. July 11-16, 2010
  • Yassine Benajiba and Imed Zitouni. Arabic Word Segmentation for Better Unit of Analysis. LREC, Malta. May 2010
  • Yassine Benajiba and Imed Zitouni. Arabic Mention Detection: Toward Better Unit of Analysis. Proceedings of the NAACL HLT, Los Angelos, CA. June 2-4, 2010
  • Hong-Kwang Jeff Kuo, Lidia Mangu, Ahmad Emami, Imed Zitouni. Morphological And Syntactic Features for Arabic Speech Recognition. ICASSP, Dallas, TX, March 14-19, 2010
  • Leiming R Qian and Imed Zitouni, Following Global Events with IBM Translingual Automatic Language Exploration System (TALES). IEEE Speech and Language Processing Technical Committee Newsletter Spring 2010. http://www.signalprocessingsociety.org/technical-committees/list/sl-tc/spl-nl/2010-04/ibm-tales/
  • Hong-Kwang Jeff Kuo, Lidia Mangu, Ahmad Emami, Imed Zitouni, Young-Suk Lee. Syntactic Features for Arabic Speech Recognition. IEEE workshop on Automatic Speech Recognition and Understanding (ASRU), 13-17, 2009. Murano, Italy. (Best paper award)
  • Smita Vemulapalli, Xiaoqiang Luo, John F. Pitrelli and Imed Zitouni. Classifier Combination Techniques Applied to Coreference Resolution. Proceedings of the NAACL HLT Student Research Workshop and Doctoral Consortium, pages 1–6, Boulder, Colorado, June 2009
  • Smita Vemulapalli, Xiaoqiang Luo, John F. Pitrelli and Imed Zitouni. Using Bagging and Boosting Techniques for Improving Coreference Resolution. 10th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing). March 1-7, 2009. Mexico City, Mexico
  • I.Zitouni and R.Florian, Mention Detection Crossing the Language Barrier, EMNLP’08. October 25-27, 2008 — Waikiki, Honolulu, Hawaii
  • F.Huang, A.Emami and I. Zitouni, When Harry Met Harri: Cross-lingual Name Spelling Normalization, EMNLP’08. October 25-27, 2008 — Waikiki, Honolulu, Hawaii
  • Emami, I. Zitouni, L. Mangu, Rich Morphology Based N-Gram Language Models for Arabic. InterSpeech’08. Brisbane, Australia September 22-26, 2008
  • Zitouni and Q. Zhou. Hierarchical Linear Discounting Class n-gram Language Models: A Multilevel Class Hierarchy Approach. ICASSP 2008, Las Vegas, 2008.
  • Ruhi Sarikaya, Ossama Emam, Imed Zitouni and Yuqing Gao. Maximum Entropy Modeling for Diacritization of Arabic Text. InterSpeech06, September, Pittsburg, PA, USA
  • Radu Florian, Hongyan Jing, Nanda Kambhatla, Imed Zitouni, Factorizing Complex Models: A Case Study in Mention Detection. COLING / ACL 2006. July 2006, Sydney Australia.
  • Imed Zitouni, Jeffrey S. Sorensen, Ruhi Sarikaya, Maximum Entropy Based Restoration of Arabic Diacritics. COLING / ACL 2006. July 2006, Sydney Australia.
  • Xiaoqiang Luo and Imed Zitouni, Multi-Lingual Coreference Resolution with Syntactic Features. Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing (HLT/EMNLP 05). October 2005, Vancouver, British Columbia, Canada.
  • Imed Zitouni, Jeffrey Sorensen, Xiaoqiang Luo and Radu Florian, The Impact of Morphological Stemming on Arabic Mention Detection and Coreference Resolution, Computational Approaches to Semitic Languages, 43rd Annual Meeting of the Association of Computational Linguistics (ACL05). June 2005, Ann Arbor, Michigan, USA.
  • Imed Zitouni, Hui Jiang, Qiru Zhou. Discriminative Training and Support Vector Machine for Natural Language Call Routing, InterSpeech’05. September 2005, Lisboa, Portugal.
  • M. Lee, I. Zitouni. Prediction-based packet loss concealment for Voice Over IP: A statistical n-gram approach. IEEE Globecom 2004 - Signal Processing for Communications. November 2004, Dallas, Texas, USA.
  • Zitouni, M.Lee, H. Jiang. Constrained Minimization Technique for Topic Identification Using Discriminative Training and Support Vector Machines. International Conference on Spoken Language Processing (ICSLP 2004). October 2004, Jeju Island, Korea.
  • M. Lee, I. Zitouni, Q. Zhou. On A N-Gram Model Approach for Packet Loss Concealment. International Conference on Spoken Language Processing (ICSLP 2004). October 2004, Jeju Island, Korea.
  • P. Liu, H. Jiang, I.Zitouni. Discriminative Training of Naïve Bayes Classifiers for natural Language Call Routing. International Conference on Spoken Language Processing (ICSLP 2004). October 2004, Jeju Island, Korea.
  • Zitouni, H-K. Kuo. Effectiveness of the Backoff Hierarchical Class N-Gram Language Models to Model Unseen Events in Speech Recognition. December 2003, Proceedings IEEE ASRU, 2003, St. Thomas, USA
  • Zitouni, Q. Zhou, Q.P. Li. A Hierarchical Approach for Better Estimation of Unseen Event Likelihood in Speech Recognition. International Conference on Natural Language Processing and Knowledge Engineering, NLP-KE’03. October 2003, Beijing, China.
  • Q. Zhou, I. Zitouni, Q.P. Li. Bell Labs Connected Digit Databases for Telephone Speech Recognition. International Conference on Natural Language Processing and Knowledge Engineering, NLP-KE’03. October 2003, Beijing, China.
  • Zitouni, O. Siohan, C. Lee. Hierarchical Class n-gram Language Models: Toward Better Estimation of Unseen Events in Speech Recognition. Eurospeech 2003, September 2003. Geneva, Switzerland.
  • D. Iskra, I. Zitouni, R. Siemund. Validation and Quality of Spoken Language Resources: Orientel - Collection of Speech Ressources for the Mediterranean and Middle East Countries. COCOSDA Workshop 2003. August 2003. Geneva, Switzerland.
  • H. Kuo, C. Lee, I. Zitouni, E. Fosler-Lussier Minimum Verification Error Training for Topic Verification. International Conference on Acoustic Speech and Signal Processing (ICASSP 2003). Hong-Kong.
  • Zitouni, H. Kuo, O. Siohan. Backoff Hierarchical Class n-gram Language Modelling for Automatic Speech Recognition Systems. International Conference on Spoken Language Processing (ICSLP 2002). Denver, Colorado, USA.
  • H. Kuo, C. Lee, I. Zitouni, E. Fosler-Lussier, A. Egbert. Discriminative Training for Call Classification and Routing. International Conference on Spoken Language Processing (ICSLP 2002). Denver, Colorado, USA.
  • Zitouni, J. Olive, D. Iskra, K. Choukri, O. Emam, O. Gedge, E. Maragoudakis, H. Tropf, A. Moreno, A.N. Rodriguez, B. Heuft, R. Siemund. Orientel: Speech-Based Interactive Communication Applications for the Mediterranean and the Middle East. International Conference on Spoken Language Processing (ICSLP 2002). Denver, Colorado, USA.
  • Zitouni, H. Kuo, C. Lee. Combination of Boosting and Discriminative Training Techniques for Natural Language Call Steering Systems. ICASSP 2002. Orlando, Florida, USA.
  • R. Argiles-Solsona, J. Fosler-Lussier, H.J. Kuo, A. Potamianos, I. Zitouni. Adaptive Language Models for Spoken Dialogue Systems. ICASSP 2002.
  • R. Siemund (Philips), B. Heuft (Philips), K. Choukri (ELDA), O. Emam (IBM), E. Maragoudakis (Univ. of Patras), H. Tropf (Siemens), O. Gedge (NSC), S. Shammass (NSC), A. Moreno (UPC), A.N. Rodriguez (UPC), I. Zitouni (Lucent Technologies), S. Samuels (Lucent Technologies), D. Iskra (SPEX). OrienTel - Multilingual access to interactive communication services for the Mediterranean and the Middle East. International Conference on Language Resources and Evaluation (LREC 2002). Las Palmas, Spain.
  • R. Siemund (Philips), B. Heuft (Philips), K. Choukri (ELDA), O. Emam (IBM), E. Maragoudakis (Univ. of Patras), H. Tropf (Siemens), O. Gedge (NSC), S. Shammass (NSC), A. Moreno (UPC), A.N. Rodriguez (UPC), I. Zitouni (Lucent Technologies), D. Iskra (SPEX). OrienTel – Arabic speech resources for the IT market. Arabic Workshop at LREC 2002.
  • Zitouni, H-K. J. Kuo, C-H. Lee. Natural Language Call Routing: Towards Combination and Boosting of Classifiers. Automatic Speech Recognition and Understanding Workshop (ASRU 2001). December 2001. Madonna di Campiglio, Trento, Italy.
  • B. Bigi, A. Brun, J.P. Haton, K. Smaïli, I. Zitouni. Comparative Study of Topic Identification on Newspaper and E-mail. String Processing and Information Retrieval (Spire). November 2001. Laguna de San Rafael, Chile.
  • B. Bigi, A. Brun, J.P. Haton, K. Smaïli, I. Zitouni. Dynamic Topic Identification: Towards Combination of Methods. RANLP (Recent Advances in NLP). September 2001.Tzigov Chark, Bulgaria.
  • Zitouni, K. Smaïli, J.P. Haton. Statistical Language Model based on a Hierarchical Approach: MCnv. European Conference on Speech Communication and Technology. September 2001. AALBORG, Denmark.
  • Zitouni, K. Smaïli, J.P. Haton. Beyond the Conventional Statistical Language Models: the Variable-Length Sequences Approach. The 6th International Conference on Spoken Language Processing. October 16-20 2000. Beijing, CHINA.
  • Zitouni, K. Smaïli, J-P. Haton. Variable-Length Class Sequences Based on a Hierarchical Approach : MCnv. The 4th Word Multiconference on Systemics, Sybernitics and Informatics. July 2000, Orlando Florida (USA).
  • Zitouni, K. Smaïli. Vers une meilleure modélisation du langage : la prise en compte des séquences dans les modèles statistiques. XXIIIèmes Journées d'Etude sur la Parole JEP'2000. Juin 2000, Aussois France.
  • K. Smaïli, I. Zitouni, J.P. Haton. Towards a Better Collaboration Between n-class and n-gram Language Models. International Workshop on Speech and Communication. October 1999, Moscou Russia
  • I. Zitouni, J-F. Mari, K. Smaïli, J-P. Haton. Variable-Length Sequence Language Model for Large Vocabulary Continuous Dictation Machine. European Conference on Speech Communication and Technology. September 1999. Budapest, Hungary.
  • K. Smaïli, A. Brun, I. Zitouni, J-P. Haton. Automatic and Manual Clustering for Large Vocabulary Speech Recognition: A Comparative Study. European Conference on Speech Communication and Technology. September 1999. Budapest, Hungary.
  • I. Zitouni. A Language Modeling Based on a Hierarchical Approach: Mnv. The 7th Australian International Speech Science and Technology Conference. November 1998. Sydney, Australia.
  • I. Zitouni, K. Smaïli, J-P. Haton, S. Deligne, F. Bimbot. A Comparative Study Between Polyclass and Multiclass Models. International Conference on Spoken Language Processing (ICSLP 1998). Sydney, Australia.
  • I. Zitouni, K. Smaïli, J-P. Haton. Variable-Length Class Sequences Based on a Hierarchical Approach: MC. International Workshop on Speech and Communication. October 1998, St Petersburg Russia.
  • M. Jardino, F. Bimbot, S. Igounet, K. Smaïli, I. Zitouni, M. El-Beze. A First Evaluation Compaign for Language Models. International Conference on Language Resources and Evaluation. May 1998, Grenade Spain.
  • D. Fohr, J-P. Haton, J-F. Mari, K. Smaïli, I. Zitouni. Towards an Oral Interface of Data Entry: The MAUD System. 3rd European Research Consortium for Informatics and Mathematics Workshop on "User Interface for All". November 1997. Strasbourg, France.
  • K. Smaïli, I. Zitouni, F. Charpillet, J-P. Haton. A Hybrid Language Model for a Continuous Dictation Prototype. European Conference on Speech Communication and Technology. September 1997. Rhodes, Greece.
  • D. Fohr, J-P. Haton, J-F. Mari, K. Smaïli, I. Zitouni. MAUD: Un prototype de machine à dicter vocale. Journées scientifique et technique du réseau Francophone de L’Ingénierie de la Langue de l’AUPELF-UREF. Avril 1997, Avignon France.
  • I. Zitouni, K. Smaïli. Apport d’une grammaire d’unification dans un système de dictée automatique. Deuxièmes journées jeunes chercheurs en parole. Novembre 1997, La Rochelle France.
  • D. Fohr, J-P. Haton, J-F. Mari, K. Smaïli, I. Zitouni. Traitements lexicaux autour de MAUD. Séminaire GDR-PRC lexique et communication parlée. Octobre 1996, Toulouse France.

E-mail: Imed Zitouni

U.S.Mail: Microsoft Corporation, izitouni, One Microsoft Way, Redmond WA, 98052-6399, USA
Tel: (425) 706-9575