The Web is described as a distributed, large-scale, volatile, unstructured, heterogeneous, and hidden information source, which poses big challenges to the management of Web data. The mission of Web Data Management (WDM) Group is to develop systems and algorithms to address these challenges and thus make Web data management as effective as a database system, and as flexible as an information retrieval system.
- Shuming Shi, Huibin Zhang, Xiaojie Yuan, and Ji-Rong Wen, Corpus-based Semantic Class Mining: Distributional vs. Pattern-Based Approaches, in Proceedings of COLING 2010, August 2010.
- Huibin Zhang, Mingjie Zhu, Shuming Shi, and Ji-Rong Wen, Employing Topic Models for Pattern-based Semantic Class Discovery, in Proceedings of the Annual Meeting of the Association for Computational Linguistics (ACL'09), ACL/SIGPARSE, August 2009.
- Mingjie Zhu, Shuming Shi, Zaiqing Nie, and Ji-Rong Wen, Aggregation-Aware Top-k Computation for Full-Text Search, no. MSR-TR-2009-47, April 2009.