Monojit Choudhury

Monojit Choudhury

Research Interests

My research interests broadly span two areas and more specifically their intersection: Computational linguistics and Complex Networks. I study language and related cognitive phenomena from the perspective of a dynamic and adaptive complex system. Complex network is a very promising and powerful tool in modeling complex systems, and hence, I model and study linguistic networks at various levels of linguistic organizations (e.g., sounds, words, meaning, and so on). Recently, I have been fascinated by the fact that web search queries are evolving and becoming more complex both syntactically and semantically; I am investigating their sturctural properties, which leads to me to believe that they represent an evolving protolanguage that will soon crystallize into a full-fledged language with its own structural properties.

I am also interested in computational linguistics and natural language processing for resource-poor languages, including unsupervised and semisupervised machine learning techniques for building tools, and innovative techniques for language resource creation.



Parth Gupta, Kalika Bali, Rafael E. Banchs, Monojit Choudhury, and Paolo Rosso, Query Expansion for Mixed-Script Information Retrieval, ACM – Association for Computing Machinery, July 2014


Sai Sumanth Miryala, Ranjita Bhagwan, Monojit Choudhury, and Kalika Bali, Automatically Identifying Vocal Expressions for Music Transcription, in 2013 International Society of Music Information Retrieval, November 2013

Monojit Choudhury, Ranjita Bhagwan, and Kalika Bali, The use of Melodic Scales in Bollywood Music: An Empirical Study, in Proceedings of the 14th International Society for Music Information Retrieval Conference, International Society for Music Information Retrieval, November 2013

Rohan Ramanath, Monojit Choudhury, and Kalika Bali, Entailment: An Effective Metric for Comparing and Evaluating Hierarchical and Non-hierarchical Annotation Schemes, in Proceedings of LAW VII and ID, Association for Computational Linguistics, July 2013

Rohan Ramanath, Monojit Choudhury, Kalika Bali, and Rishiaj Saha Roy, Crowd Prefers the Middle Path: A New IAA Metric for Crowdsourcing Reveals Turker Biases in Query Segmentation, in Proceedings of ACL, Association for Computational Linguistics, July 2013

Rishiraj Saha Roy, Anusha Suresh, Niloy Ganguly, and Monojit Choudhury, Place value: Word position shifts vital to search dynamics, in Proceedings of WWW (Companion Volume), WWW Conference 2013, 2013

Animesh Mukherjee, Monojit Choudhury, Fernando Peruani, Niloy Ganguly, and Bivas Mitra, Dynamics On and Of Complex Networks - II: Applications to Time-Varying Dynamical Systems, Birkhäuser , 2013


Rishiraj Saha Roy, Niloy Ganguly, Monojit Choudhury, and Srivatsan Laxman, An IR-based Evaluation Framework for Web Search Query Segmentation, in SIGIR 2012, ACM, August 2012

K Saravanan, Monojit Choudhury, Raghavendra Udupa, and A Kumaran, An Empirical Study of the Occurrence and Co-Occurrence of Named Entities in Natural Language Corpora, in In Proceedings of LREC 2012, European Language Resources Association, May 2012


Umair Z Ahmed, Kalika Bali, Monojit Choudhury, and Sowmya V. B, Challenges in Designing Input Method Editors for Indian Languages: The Role of Word-Origin and Context, in Proceedings of IJCNLP Workshop on Advances in Text Input Methods , Association for Computational Linguistics, November 2011

All publications...

Current Activities:

We are organizing the Second Workshop on Advances in Text Input Methods (WTIM -2) in conjunction with Coling 2012. Submissions invited (Deadline: September 30th, 2012)