A Kumaran

Director - Applied Sciences & Head - Multilingual Systems Research

As the head of Applied Sciences group in Microsoft Research India, I coordinate all collaborative research activities between the Online Services Division's Ad Centre and Microsoft Research India.  We are actively engaged in several research projects - Keyword Suggestions Platform, Bid Traffic Estimation, Smart Pricing, Privacy Preserving Environments, etc. - that has potential product impact on Online Advertising area of Ad Centre's product suite.

As a part of my research in the area of Multilingual Systems, I am interested in all multilingual technologies that involve transparent handling of information in multiple langauges simultaneously.  My research interests include Machine Translation and Transliteration, Crosslingual Information Access and creation of multilingual data for research.

I am actively involved in exploring crowdsourcing as a methodology for creation of language data through a collaborative research project - WikiBABEL - with Wikimedia Foundation.  WikiBABEL explored creation of multilingual Wikipedia content and produce as a by-product parallel data for Machine Translation research.  In Oct 2010, we released an open source content creation tool for Wikipedia - WikiBhasha - as a MediaWiki Extension for open source developers, and as a user-gadget for Wikipedia content creators.  We are actively engaged with Wikipedia communities around the world, for stydying the adoption of WikiBhasha and its potential for data creation. 

As a part of Multilingual Systems area in Microsoft Research India, we collaborate with many researchers in India and around the world. 

Along with other researchers in the Multilingual Systems research area, I championed the creation of Pan Indian POS linguistic annotation standards that was adopted by the Bureau of Indian Standards.  I have organized the Named Entities WorkShop (NEWS) focused on Named Entities in multilingual corpora, and the Crosslingual Information Access (CLIA) workshop series.  The NEWS workshops, co-located with the ACL (in 2009, 2010 and 2012) and IJCNLP (in 2011) conferences, baselines the state of the art Machine Transliteration & Mining in about a dozen languages from around the world (العربية, বাংলা, 中文, English, עברית, हिन्दी, 日本語, ಕನ್ನಡ, 한국어, Русский, தமிழ் and ไทย).  The CLIA workshops, co-located with IJCNLP/NAACL/COLING Conferences (in 2008, 2009, 2010 and 2011) focus primarily on the emerging area of cross-language technologies and systems.

 

Publications

    2013

    2012

    2011

    2010

    2009

    2008

    2007

    2006

    2005

     

    I joined Microsoft Research India in July 2005, and am currently the Director of Applied Sciences groups, and heading the Multilingual Systems Research group.

     

    I did my PhD in Indian Institute of Science, Bangalore, India. I have a Bachelors degree from College of Engineering, Chennai, India and a Masters degree from
    Rutgers University, New Jersey, USA.

     

    My prior work experience includes
    ~5 years in Bell Communications Research and ~10 years in Oracle Corporation.  I served as an Adjunct Professor at Anna University, Chennai 1997-99.