Database
The charter of the Microsoft Research Database Group is to increase the usefulness of database systems to users by creating, extending, and applying database technology. To that end, we consult with the database product groups at Microsoft and take part in exploratory research projects. We are located in Redmond, Washington, which is in the greater Seattle area.
People
Selected Professional Activities
We actively participate in the database research community. Current activities include the following conference roles, editorships, and board memberships:
Major Conferences
- VLDB 2012 (4 on program committee)
- SIGMOD 2012 (2 on program committee)
- ICDE 2012 (3 on program committee)
- VLDB 2011 (1 general chair, 1 web chair, 2 on program committee)
- ICDE 2011 (3 on program committee)
- VLDB 2010 (2 on program committee)
- SIGMOD 2010 (1 program committee group leader)
- ICDE 2010 (4 on program committee)
Editorships and Board Memberships
- Distributed and Parallel Databases Journal: editorial board (2 members)
- Bulletin of the IEEE TC on Data Engineering: editor-in-chief
- International Conference on Data Engineering: Steering Committee (2 members)
- Information Systems: Editorial Advisory Board
- SIGMOD Anthology: 2 associate editors
- VLDB Endowment: Board of Trustees member
- Computing Research Association (CRA): Treasurer, Board of Directors member
- IEEE-CS Technical Committee on Data Engineering: chair
Projects
- Big Data: Businesses today operate on the monitor-mine-manage (M3) cycle: they monitor and archive large amounts of data, which they mine to derive insights such as models. The models are used during the manage phase to add value to the business, e.g., by scoring the models with real-time data. This project looks at the broad area of platforms and applications for big data analytics, from a database-oriented perspective, in the context of achieving a frictionless M3 cycle.
- Streams: The stream processing project, also called CEDR (Complex Event Detection and Response), proposes novel architectures, processing techniques, and applications to support time-oriented queries over temporal and real-time data streams. CEDR shipped in 2010 as Microsoft StreamInsight, a commercial stream processing system. Our current focus areas include scale-out, high availability, query optimization, and new platforms for stream processing.
- Database Security and Compliance: Much of the sensitive data in any organization is stored in a database system, so there is a natural need to manage the security of that information. Our goal in this project is to develop tools to manage database security.
- Deuteronomy: Traditionally, a DBMS kernel has recovery, concurrency control, and access method code tightly bound together. We factor the kernel into a transactional component (TC) that knows about transactions and their “logical” concurrency control and undo/redo recovery, and a data component (DC) that knows about the access methods and supports a record-oriented interface with atomic operations. The interaction of the components is governed by a contract or covenant.
- Phoenix Application Recovery: The Phoenix goal is to improve application availability and the robustness of error handling. The project exploits database recovery techniques to enable applications to survive system crashes. Two prototype systems have been built. Phoenix/ODBC provides persistent database sessions across database system failures. Phoenix/App provides persistent middle-tier applications across application server failures.
- Immortal DB: The Immortal DB project began in the fall of 2002. Its goal is to provide the infrastructure for saving and indexing all prior states of a database. Foundational work for this effort has been published on indexing versions and on choosing timestamps. We have built a prototype system that includes indexing, version compression, and recovery from bad user transactions. A major goal has been to provide performance close to that of an unversioned database.
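As a rough illustration of the kernel factoring described under Deuteronomy, the following toy sketch separates a transactional component from a data component. It is not the project's code: every class and method name is hypothetical, the "access method" is a plain in-memory dictionary, locking is a simple exclusive lock per key, and only undo is modeled (redo logging and crash recovery are left out). A value of None is assumed to mean "no record".

```python
"""Toy sketch of a kernel split into a transactional component (TC)
and a data component (DC). All class and method names are hypothetical."""


class DataComponent:
    """Owns the access method (here a plain dict) and exposes atomic
    record-oriented operations. It knows nothing about transactions."""

    def __init__(self):
        self._records = {}

    def read(self, key):
        return self._records.get(key)

    def write(self, key, value):
        # Each call is treated as atomic; writing None deletes the record.
        if value is None:
            self._records.pop(key, None)
        else:
            self._records[key] = value


class TransactionalComponent:
    """Owns transactions: logical locking and an undo log. It reaches
    the data only through the DC's record interface."""

    def __init__(self, dc):
        self._dc = dc
        self._locks = {}   # key -> id of the transaction holding the lock
        self._undo = {}    # transaction id -> list of (key, previous value)
        self._next_id = 1

    def begin(self):
        txn = self._next_id
        self._next_id += 1
        self._undo[txn] = []
        return txn

    def _lock(self, txn, key):
        # Exclusive logical lock on the record key, not on pages or slots.
        holder = self._locks.setdefault(key, txn)
        if holder != txn:
            raise RuntimeError(f"key {key!r} is locked by transaction {holder}")

    def read(self, txn, key):
        self._lock(txn, key)
        return self._dc.read(key)

    def write(self, txn, key, value):
        self._lock(txn, key)
        # Record the old value first so that abort can restore it.
        self._undo[txn].append((key, self._dc.read(key)))
        self._dc.write(key, value)

    def commit(self, txn):
        del self._undo[txn]
        self._release(txn)

    def abort(self, txn):
        # Undo in reverse order, using only logical DC operations.
        for key, old in reversed(self._undo.pop(txn)):
            self._dc.write(key, old)
        self._release(txn)

    def _release(self, txn):
        self._locks = {k: t for k, t in self._locks.items() if t != txn}
```

In this sketch the only "contract" between the two components is that each DC operation is atomic and addressed by record key, which is what lets the TC lock and undo at the logical level without knowing how records are stored.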
Recent Publications
- Badrish Chandramouli, Joris Claessens, Suman Nath, Ivo Santos, and Wenchao Zhou, RACE: Real-time Applications over Cloud-Edge, in ACM SIGMOD International Conference on Management of Data (SIGMOD 2012), ACM SIGMOD, May 2012
- David Lomet, Alan Fekete, Rui Wang, and Peter Ward, Multi-Version Concurrency via Timestamp Range Conflict Management, in ICDE, IEEE Computer Society, April 2012
- Badrish Chandramouli, Jonathan Goldstein, and Songyun Duan, Temporal Analytics on Big Data for Web Advertising, in 28th International Conference on Data Engineering (ICDE '12), IEEE, April 2012
- Justin J. Levandoski, Mohamed Sarwat, Ahmed Eldawy, and Mohamed F. Mokbel, LARS: A Location-Aware Recommender System, in 28th International Conference on Data Engineering (ICDE 2012), IEEE, April 2012
- Badrish Chandramouli, David Maier, and Jonathan Goldstein, Physically Independent Stream Merging, in 28th International Conference on Data Engineering (ICDE '12), IEEE, April 2012
- Umar Farooq Minhas, David Lomet, and Chandramohan A. Thekkath, Chimera: Data Sharing Flexibility, Shared Nothing Simplicity, in IDEAS, Springer Verlag, 21 September 2011
- Justin J. Levandoski, Michael D. Ekstrand, Michael J. Ludwig, Ahmed Eldawy, Mohamed F. Mokbel, and John T. Riedl, RecBench: Benchmarks for Evaluating Performance of Recommender System Architectures, in VLDB, Very Large Data Bases Endowment Inc., September 2011
- David Lomet, Kostas Tzoumas, and Michael Zwilling, Implementing Performance Competitive Logical Recovery, Very Large Data Bases Endowment Inc., 29 August 2011
- David Lomet, Transactions: From Local Atomicity to Atomicity in the Cloud, in Lecture Notes on Computer Science 6875, Springer Verlag, August 2011
- Mohamed Ali, Badrish Chandramouli, Jonathan Fay, Curtis Wong, Steven Drucker, and Balan S. Raman, Online Visualization of Geospatial Stream Data using the WorldWide Telescope, in International Conference on Very Large Databases (VLDB), Very Large Data Bases Endowment Inc., August 2011
- Raghav Kaushik and Ravi Ramamurthy, Efficient Auditing For Complex SQL queries, in SIGMOD, ACM, June 2011
- Badrish Chandramouli, Justin J. Levandoski, Ahmed Eldawy, and Mohamed Mokbel, StreamRec: A Real-Time Recommender System, in ACM SIGMOD International Conference on Management of Data (SIGMOD 2011), ACM SIGMOD, June 2011
- Philip A. Bernstein, Istvan Cseri, Nishant Dani, Nigel Ellis, Ajay Kallan, Gopal Kakivaya, David B. Lomet, Ramesh Manne, Lev Novik, and Tomas Talius, Adapting Microsoft SQL Server for Cloud Computing, in ICDE, IEEE Computer Society, 11 April 2011
- Mohamed Ali, Badrish Chandramouli, Jonathan Goldstein, and Roman Schindlauer, The Extensibility Framework in Microsoft StreamInsight, in 27th International Conference on Data Engineering, IEEE, April 2011
- Badrish Chandramouli, Jonathan Goldstein, Roger Barga, Mirek Riedewald, and Ivo Santos, Accurate Latency Estimation in a Distributed Event Processing System, in 27th International Conference on Data Engineering (ICDE '11), IEEE, April 2011
- Rui Wang, Betty Salzberg, and David B. Lomet, Log-Based Middleware Server Recovery with Transaction Support, in The VLDB Journal, Very Large Data Bases Endowment Inc., April 2011
- Justin J. Levandoski, David Lomet, Mohamed F. Mokbel, and Kevin Keliang Zhao, Deuteronomy: Transaction Support for Cloud Data, in Conference on Innovative Data Systems Research (CIDR), www.cidrdb.org, 12 January 2011
- Arvind Arasu, Raghav Kaushik, and Jian Li, Data Generation using Declarative Constraints, in SIGMOD, ACM, 2011
- Surajit Chaudhuri, Raghav Kaushik, and Ravi Ramamurthy, Database Access Control & Privacy: Is There A Common Ground?, in Conference on Innovative Data Systems Research (CIDR), January 2011
Interns
A number of interns visit the Database Group each summer. Some of our more recent interns are listed below:
- Spyros Blanas, University of Wisconsin
- Yupeng Fu, University of California - San Diego
- Hamid Mousavi, University of California - Los Angeles
- Aditya Parameswaran, Stanford University
- Radu Stoica, EPFL
- Khai Tran, University of Wisconsin
- Kostas Tzoumas, Aarhus University
- Di Wang, Worcester Polytechnic Institute
- Kevin Zhao, University of California - San Diego
- Wenchao Zhou, University of Pennsylvania
A more complete intern list is here.
Visiting Researchers
- Suad Alagic (2000-2001), University of Southern Maine
- Paolo Atzeni (2003), University of Rome 3
- Christoph Freytag (2002), Humboldt University
- Wolfgang Lehner (2004), Dresden University of Technology
- David Maier (2007, 2010), Portland State University
- Rene Miller (2001, 2002), University of Toronto
- Mohamed Mokbel (2008, 2010), University of Minnesota
- Elizabeth O'Neil (2001), University of Massachusetts, Boston
- Patrick O'Neil (2001), University of Massachusetts, Boston
- Lemonia Ragia (2003), University of Aachen
- Erhard Rahm (2000), University of Leipzig
- Betty Salzberg (1996, 2002), Northeastern University
- Gerhard Weikum (1997, 2000, 2001), University of Saarland



