Share on Facebook Tweet on Twitter Share on LinkedIn Share by email
Bolin Ding

Bolin Ding
RESEARCHER
.

Contact Information

Microsoft Corporation
One Microsoft Way
Redmond, WA 98052
Tel: +1 (425) 704-9176
Email: bolind@microsoft.com

I am a Researcher in the Data Management, Exploration and Mining (DMX) group at Microsoft Research. Before joining Microsoft, I completed my Ph.D. in Computer Science at University of Illinois at Urbana-Champaign under the supervision of Prof. Jiawei Han.

Research Interests

My research goals and interests span different aspects of large-scale data management, including search and explore structured data, database systems, data mining algorithms and applications, and data privacy. I'm particularly interested in (randomized or approximation) algorithms which have performance guarantees in theory, and are effective and robust in practice as well.

  • Search structured data: inventing new search models / interfaces to help people explore structured (text) data more easily, and developing efficient algorithms and index structures to support them
  • Data mining: developing data mining algorithms for various applications
  • Query processing: i) faster algorithms for building block components, and ii) progress estimation in query processing
  • Data privacy: reducing the amount of noise while providing (mathematically) rigours and flexible privacy guarantees
  • Graphs in databases

Selected Publications (the three most recent per topic)

Search Structured Data: Models and Algorithms

Fotis Psallidas, Bolin Ding, Kaushik Chakrabarti, and Surajit Chaudhuri, S4: Top-k Spreadsheet-Style Search for Query Discovery, in Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD 2015), ACM – Association for Computing Machinery, June 2015.

Mohan Yang, Bolin Ding, Surajit Chaudhuri, and Kaushik Chakrabarti, Finding Patterns in a Knowledge Base using Keywords to Compose Table Answers, in Proceedings of the VLDB Endowment, the 41st International Conference on Very Large Data Bases (VLDB 2015), vol. 7, ACM – Association for Computing Machinery, September 2014.

Yanyan Shen, Kaushik Chakrabarti, Surajit Chaudhuri, Bolin Ding, and Lev Novik, Discovering Queries based on Example Tuples, in Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD 2014), ACM – Association for Computing Machinery, June 2014.

More on Search Structured Data

Data Mining: Algorithms and Applications

Herodotos Herodotou, Bolin Ding, Shobana Balakrishnan, Geoff Outhred, and Percy Fitter, Scalable Near Real-Time Failure Localization of Data Center Networks, in in Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2014), ACM – Association for Computing Machinery, August 2014.

Zhenhui Li, Bolin Ding, Fei Wu, Tobias Kin Hou Lei, Roland Kays, and Margaret C. Crofoot, Attraction and Avoidance Detection from Movements, in Proceedings of the VLDB Endowment, the 40th International Conference on Very Large Data Bases (VLDB 2014), vol. 7, no. 3, VLDB – Very Large Data Bases, September 2013.

Zhenhui Li, Jiawei Han, Bolin Ding, and Roland Kays, Mining periodic behaviors of object movements for animal and biological sustainability studies, in Journal of Data Mining and Knowledge Discovery (DMKD), Springer, March 2012.

More on Data Mining

Query Processing

Arnd Christian König, Bolin Ding, Surajit Chaudhuri, and Vivek R. Narasayya, A Statistical Approach Towards Robust Progress Estimation, in Proceedings of the VLDB Endowment, the 38th International Conference on Very Large Data Bases (VLDB 2012), vol. 5, no. 4, pp. 382-393, Very Large Data Bases Endowment Inc., 2012.

Bolin Ding and Arnd Christian König, Fast Set Intersection in Memory, in Proceedings of the VLDB Endowment, the 37th International Conference on Very Large Data Bases (VLDB 2011), vol. 4, no. 4, pp. 255-266, Very Large Data Bases Endowment Inc., 2011.

Wenfei Fan, Jeffrey Xu Yu, Jianzhong Li, Bolin Ding, and Lu Qin, Query translation from XPath to SQL in the presence of recursive DTDs, in The VLDB Journal, vol. 18, no. 4, pp. 857-883, Springer, 2009.

More on Query Processing

Data Privacy

Xi He, Ashwin Machanavajjhala, and Bolin Ding, Blowfish Privacy: Tuning Privacy-Utility Trade-offs using Policies, in Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD 2014), ACM – Association for Computing Machinery, June 2014.

Bolin Ding, Marianne Winslett, Jiawei Han, and Zhenhui Li, Differentially private data cubes: optimizing noise sources and consistency, in Proceedings of the ACM SIGMOD International Conference on Management of Data (SIGMOD 2011), ACM SIGMOD, 2011.

More on Data Privacy

Graphs (in Databases)

Ruoming Jin, Lin Liu, Bolin Ding, and Haixun Wang, Distance-Constraint Reachability Computation in Uncertain Graphs, in Proceedings of the VLDB Endowment, the 37th International Conference on Very Large Data Bases (VLDB 2011), vol. 4, no. 9, pp. 551-562, Very Large Data Bases Endowment Inc., 2011.

Lu Su, Bolin Ding, Yong Yang, Tarek F. Abdelzaher, Guohong Cao, and Jennifer C. Hou, oCast: Optimal Multicast Routing Protocol for Wireless Sensor Networks, in Proceedings of the 17th IEEE International Conference on Network Protocols (ICNP 2009), IEEE Computer Society, 2009.

Lu Qin, Jeffrey Xu Yu, Bolin Ding, and Yoshiharu Ishikawa, Monitoring Aggregate k-NN Objects in Road Networks, in Proceedings of the 20th International Conference on Scientific and Statistical Database Management (SSDBM 2008), Springer, 2008.

More on Graphs in Databases

Professional Activities

Program Committee Memberships:

  • Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD): 2015, 2014, 2013
  • The 1st International Workshop on Privacy-Preserving Data Publication and Analysis (PrivDB 2013), in conjunction with the 29th IEEE International Conference on Data Engineering (ICDE 2013)

Reviewer for Journals:

ACM Transactions on Database Systems, IEEE Transactions on Knowledge and Data Engineering, ACM Transactions on Knowledge Discovery from Data, Theoretical Computer Science, Pattern Recognition, Information Sciences, Knowledge and Information Systems