Share on Facebook Tweet on Twitter Share on LinkedIn Share by email
A Kumaran

Director - Applied Sciences

As the head of Applied Sciences group in Microsoft Research India, I coordinate all collaborative research activities between the Online Services Division's Ad Centre and Microsoft Research India.  We are actively engaged in several research projects - Keyword Suggestions Platform, Bid Traffic Estimation, Smart Pricing, Privacy Preserving Environments, etc. - that has potential product impact on Online Advertising area of Ad Centre's product suite.

As a part of my research in the area of Multilingual Systems, I am exploring multilingual technologies that involve transparent handling of information in multiple languages simultaneously.  My research interests include Machine Translation and Transliteration, Cross-lingual Information Access and creation of multilingual data for research.

I am actively involved in exploring crowdsourcing as a methodology for creation of language data through a collaborative research project - WikiBABEL - with Wikimedia Foundation.  WikiBABEL explored creation of multilingual Wikipedia content and produce as a by-product parallel data for Machine Translation research.  In Oct 2010, we released an open source content creation tool for Wikipedia - WikiBhasha - as a MediaWiki Extension for open source developers, and as a user-gadget for Wikipedia content creators.  We are actively engaged with Wikipedia communities around the world, for studying the adoption of WikiBhasha and its potential for data creation. 

As a part of Multilingual Systems area in Microsoft Research India, we collaborate with many researchers in India and around the world. 

Along with other researchers in the Multilingual Systems research area, I championed the creation of Pan Indian POS linguistic annotation standards that was adopted by the Bureau of Indian Standards.  I have co-organized the Named Entities WorkShop (NEWS) series focused on Named Entities in multilingual corpora; these workshops were run as a part of ACL conferences (in 2009, 2010, 2012 and 2015) and IJCNLP conferences (in 2009, 2011 and 2015).  The primary outcome of these workshops were the baselining of  the quality of Named Entities Transliteration in about a dozen languages from around the world (العربية, বাংলা, 中文, English, עברית, हिन्दी, 日本語, ಕನ್ನಡ, 한국어, Русский, தமிழ் and ไทย).  The workshop series provided standard datasets in about dozen language pairs, evaluation metrics, and quantified the state of the art in names transliteration.

 

Publications

    2015

    2014

    2013

    2012

    2011

    2010

    2009

    2008

    2007

    2006

    2005

     

    I joined Microsoft Research India in July 2005, and am currently the Director of Applied Sciences groups, and heading the Multilingual Systems Research group.

     

    I did my PhD in Indian Institute of Science, Bangalore, India. I have a Bachelors degree from College of Engineering, Chennai, India and a Masters degree from
    Rutgers University, New Jersey, USA.

     

    My prior work experience includes
    ~5 years in Bell Communications Research and ~10 years in Oracle Corporation.  I served as an Adjunct Professor at Anna University, Chennai 1997-99.