Dongdong Zhang

LEAD RESEARCHER

Dr. Dongdong Zhang is a researcher in the Natural Language Computing group at Microsoft Research Asia, Beijing, China. He received his Ph.D. in December 2005 from the Department of Computer Science at Harbin Institute of Technology, under the supervision of Prof. Jianzhong Li. Before that, he received his B.S. and M.S. degrees from the same department in 1999 and 2001, respectively.

 

Dongdong's research interests include natural language processing, machine translation, and machine learning. He currently works on the research and development of advanced statistical machine translation (SMT) systems, as well as related fundamental NLP problems, models, algorithms, and innovations.

  • EMAIL: dozhang [AT] microsoft [DOT] com

Publications

#: Students I mentored at MSRA.

2013

  • Lei Cui#, Xilun Chen#, Dongdong Zhang, Shujie Liu, Mu Li, and Ming Zhou, Multi-domain Adaptation for SMT Using Multi-task Learning, EMNLP, October 2013
  • Dongdong Zhang, Shuangzhi Wu#, Nan Yang, Mu Li. Punctuation Prediction with Transition-based Parsing. ACL, August 2013.
  • Lei Cui#, Dongdong Zhang, Shujie Liu, Mu Li, and Ming Zhou, Bilingual Data Cleaning for SMT using Graph-based Random Walk, ACL, August 2013
  • Lei Cui#, Dongdong Zhang, Shujie Liu, Mu Li, and Ming Zhou, Collective Corpus Weighting and Phrase Scoring for SMT using Graph-based Random Walk, NLP-CC, November 2013

2012

  • Seung-Wook Lee#, Dongdong Zhang, Mu Li, Ming Zhou, Hae-Chang Rim: Translation Model Size Reduction for Hierarchical Phrase-based Statistical Machine Translation. ACL (2) 2012: 291-295
  • Nan Yang, Mu Li, Dongdong Zhang, Nenghai Yu: A Ranking-based Approach to Word Reordering for Statistical Machine Translation. ACL (1) 2012: 912-920
  • Yang Feng#, Dongdong Zhang, Mu Li, Qun Liu: Hierarchical Chunk-to-String Translation. ACL (1) 2012: 950-958
  • Yang Feng#, Dongdong Zhang, Qun Liu. Prepositional Phrase Reordering for Hierarchical Phrase-Based Translation. Journal of Chinese Information Processing. 2012: 26(1).

2011

  • Lei Cui#, Dongdong Zhang, Mu Li and Ming Zhou. Function Word Generation in Statistical Machine Translation Systems. Machine Translation Summit XIII, September 2011

2010

  • Lei Cui#, Dongdong Zhang, Mu Li, Ming Zhou, Tiejun Zhao: A Joint Rule Selection Model for Hierarchical Phrase-Based Translation. ACL (Short Papers) 2010: 6-11
  • Lei Cui#, Dongdong Zhang, Mu Li, Ming Zhou, Tiejun Zhao: Hybrid Decoding: Decoding with Partial Hypotheses Combination over Multiple SMT Systems. COLING (Posters) 2010: 214-222
  • Nan Duan, Mu Li, Dongdong Zhang, Ming Zhou: Mixture Model-based Minimum Bayes Risk Decoding using Multiple Machine Translation Systems. COLING 2010: 313-321
  • Mu Li, Yinggong Zhao, Dongdong Zhang, Ming Zhou: Adaptive Development Data Selection for Log-linear Model in Statistical Machine Translation. COLING 2010: 662-670

2009

  • Mu Li, Nan Duan, Dongdong Zhang, Chi-Ho Li, Ming Zhou: Collaborative Decoding: Partial Hypothesis Re-ranking Using Translation Consensus between Decoders. ACL/IJCNLP 2009: 585-592
  • Tong Xiao, Mu Li, Dongdong Zhang, Jingbo Zhu, Ming Zhou: Better Synchronous Binarization for Machine Translation. EMNLP 2009: 362-370
  • Dongdong Zhang, Chi-Ho Li, Nan Duan, Shujie Liu, Mu Li, and Ming Zhou, The Evaluation Technical Report of Chinese-to-English Machine Translation System from Microsoft Research Asia, CWMT, 2009

2008

  • Dongdong Zhang, Mu Li, Nan Duan#, Chi-Ho Li, Ming Zhou: Measure Word Generation for English-Chinese SMT Systems. ACL 2008: 89-96
  • Ming Zhou, Bo Wang, Shujie Liu, Mu Li, Dongdong Zhang, Tiejun Zhao: Diagnostic Evaluation of Machine Translation Systems Using Automatically Constructed Linguistic Check-Points. COLING 2008: 1121-1128
  • Xiaodong He, Jianfeng Gao, Chris Quirk, Patrick Nguyen, Arul Menezes, Robert Moore, Kristina Toutanova, Mei Yang, Bill Dolan, Mu Li, Chi-Ho Li, Dongdong Zhang, Long Jiang, and Ming Zhou, The MSR-MSRA MT System for NIST Open Machine Translation 2008 Evaluation, in The 2008 NIST Open Machine Translation Evaluation Workshop, 2008

2007

  • Chi-Ho Li, Minghui Li, Dongdong Zhang, Mu Li, Ming Zhou, Yi Guan: A Probabilistic Approach to Syntax-based Reordering for Statistical Machine Translation. ACL 2007
  • Dongdong Zhang, Mu Li, Chi-Ho Li, Ming Zhou: Phrase Reordering Model Integrating Syntactic Knowledge for SMT. EMNLP-CoNLL 2007: 533-540

2006

  • Dongdong Zhang, Jianzhong Li, Kimutai Kimeli, Weiping Wang. Sliding Window based Multi-Join Algorithms over Distributed Data Streams. The 22nd IEEE International Conference on Data Engineering (ICDE), 2006.
  • Weiping Wang, Jianzhong Li, Dongdong Zhang, Longjiang Guo. An Algorithm for Continuous J-A Queries Processing over Data Streams based on Sliding Windows. Journal of Software, 2006.

2005

  • Dongdong Zhang, Jianzhong Li, Weiping Wang, Longjiang Guo, Chunyu Ai. Processing Frequent Items over Distributed Data Streams. Web Technologies Research and Development 7th Asia-Pacific Web Conference (APWEB), Shanghai, China, 2005. Lecture Notes in Computer Science 3399 Springer 2005: 523-529.
  • Dongdong Zhang, Jianzhong Li, Weiping Wang, Longjiang Guo, Chunyu Ai. Reducing Communication Overhead over Distributed Data Streams by Filtering Frequent Items. Journal of Digital Information Management. 2005, Vol. 3, No. 2.
  • Dongdong Zhang, Jianzhong Li, Weiping Wang, Longjiang Guo. Algorithms for Storing and Aggregating Historical Streaming Data. Journal of Software. 2005.
  • Jianzhong Li, Longjiang Guo, Dongdong Zhang, Weiping Zhang. Processing Algorithms for Predictive Aggregate Queries over Data Streams. Journal of Software, 2005, Vol. 16, No. 7, pp: 1252-1261.
  • Dongdong Zhang, Jianzhong Li, Weiping Wang, Longjiang Guo. Distributed Compound-Data Streams Processing. Journal of Computer Research and Development. 2004, Vol. 41, No. 10, pp: 1780-1785.

2004

  • Jianzhong Li, Dongdong Zhang. Algorithms for Dynamically Adjusting the Sizes of Sliding Windows. Journal of Software, 2004, Vol. 15, No. 12, pp: 1800-1814.
  • Dongdong Zhang, Jianzhong Li, Weiping Wang, Jinbao Li, Longjiang Guo. Processing Distributed Compound-Data Streams. The East-European Conference on Advances in Databases and Information Systems (ADBIS), Budapest, Hungary, 2004. (local proceedings)
  • Longjiang Guo, Jianzhong Li, Weiping Wang, Dongdong Zhang. Predictive Continuous Aggregate Queries over Data Streams, Journal of Computer Research and Development, 2004, Vol. 41, No. 10, pp: 1690-1695.
  • Weiping Wang, Jianzhong Li, Dongdong Zhang, Longjiang Guo. Processing Sliding Window Join Aggregate in Continuous Queries Over Data Streams. The East-European Conference on Advances in Databases and Information Systems (ADBIS). Budapest, Hungary, 2004. Lecture Notes in Computer Science 3255 Springer 2004: 348-363.
  • Weiping Wang, Jianzhong Li, Xu Wang, Dongdong Zhang, Longjiang Guo. Evaluating Stream and Disk Join in Continuous Queries. Grid and Cooperative Computing: Third International Conference (GCC), Wuhan, China. Lecture Notes in Computer Science 3251 Springer, 2004: 823-826.
  • Dongdong Zhang, Jianzhong Li, Zhaogong Zhang, Weiping Wang, Longjiang Guo. Dynamic Adjustment of Sliding Windows over Data Streams. Advances in Web-Age Information Management: 5th International Conference (WAIM), Dalian, China, 2004. Lecture Notes in Computer Science 3129 Springer 2004: 24-33.
  • Weiping Wang, Jianzhong Li, Dongdong Zhang, Longjiang Guo. Periodically Updated Sliding Window Join Algorithm Over Data Streams. Journal of Harbin Institute of Technology, 2004, Vol. 36, No. 10.
  • Weiping Wang, Jianzhong Li, Dongdong Zhang, Longjiang Guo, Xu Wang. A Parallel Method for Processing Continuous Queries on Data Streams, Journal of Computer Research and Development, 2004, Vol. 41, No. 10 (Suppl.), pp: 603-609.

2003

  • Dongdong Zhang, Jianzhong Li, Weiping Wang, Longjiang Guo. Algorithms for Aggregating History of Time-Series Data Streams. Computer Science, 2003, Vol. 30, No. 10 (Suppl. A), pp: 291-295.
  • Jianzhong Li, Dongdong Zhang, Yanqiu Zhang. Join Algorithm Based on Tertiary Storage. Journal of Software. 2003, Vol. 14, No. 5, pp: 947-954.
  • Weiping Wang, Jianzhong Li, Dongdong Zhang, Longjiang Guo. Research of Timestamp-based Sliding Window Join Algorithm Over Data Stream. Computer Science, 2003, Vol. 30, No. 10 (Suppl. A), pp: 174-177.

2002

  • Dongdong Zhang, Jianzhong Li, Hong Gao. Aggregate Index Tree: An Approach for Range Sum Queries. Computer Science, 2002, Vol. 29, No. 8 (Suppl. A), pp: 132-134. (Best paper in NDBC'02)

2001

  • Dongdong Zhang, Jianzhong Li, Yanqiu Zhang. Join Algorithms Based on Tertiary Storage. Computer Science, 2001, Vol. 28, No. 8 (Suppl. A).

 

Honors & Awards

  • No. 1 Place, Chinese-to-English track, 2009 CWMT MT Evaluation.
  • No. 1 Place, Chinese-to-English track, 2008 CWMT MT Evaluation.
  • No. 1 Place, English-to-Chinese common data track, 2008 NIST MT Evaluation (Over BLEU-4 normalized and BLEU-4 Word Segmented Evaluation).
  • No. 1 Place, Chinese-to-English common data track, 2008 NIST MT Evaluation.
  • Distinguished Paper in Journals of China Association for Science and Technology, 2006. Jianzhong Li, Dongdong Zhang. Algorithms for Dynamically Adjusting the Sizes of Sliding Windows.
  • Second Prize of the State Scientific and Technology Progress Award. Jianzhong Li, Jinbao Li, Hong Gao, Longjiang Guo, Zhaogong Zhang, Yibo Song, Jizhou Luo, Dongdong Zhang, Weiping Wang, Baoliang Liu. Parallel Database System over Computer Cluster, 2004.

Students

  • Shuihua Li
  • Qingchun Ma
  • Xin Wang
  • Linlin Gao
  • Lin Liu
  • Danran Chen
  • Shaosheng Cao
  • Longfei Bai
  • Shuangzhi Wu
  • Xilun Chen
  • Seung-Wook Lee
  • Yang Feng
  • Lei Cui
  • Nan Duan
  • Hong Sun
  • Chao Ma
  • Lin Xu