Database
The charter of the Microsoft Research Database Group is to increase the usefulness of database systems to users by creating, extending, and applying database technology. To that end, we consult with the database product groups at Microsoft and take part in exploratory research projects. We are located in Redmond, Washington, which is in the greater Seattle area.
People
Jonathan Goldstein
Selected Professional Activities
We actively participate in the database research community. Current activities include the following Conferences activities, and Editorships and Board Memberships:
Major Conferences
- VLDB 2012
(4 on program committee) - SIGMOD 2012
(2 on program committee) - ICDE 2012
(3 on program committee) - VLDB 2011
(1 general chair, 1 web chair, 2 on program committee) - ICDE 2011
(3 on program committee) - VLDB 2010
(2 on program committee) - SIGMOD 2010
(1 program committee group leader) - ICDE 2010
(4 on program committee)
Editorships and Board Memberships
- Distributed and Parallel Databases Journal: editorial board (2 members)
- Bulletin of the IEEE TC on Data Engineering: editor-in-chief
- International Conference on Data Engineering: Steering Committee (2 members)
- Information Systems: Editorial Advisory Board
- Sigmod Anthology: 2 associate editors
- VLDB Endowment: Board of Trustees member
- Computing Research Ass'n (CRA): Treasurer, Board of Directors member
- IEEE-CS Technical Committee on Data Engineering: chair
Projects
- Column store technologyColumn store technology can provide very substantial performance improvements on data warehousing workloads. This project investigated how to integrate columnar storage into SQL Server. The solution adopted was to add a new index type, columnstore index, that stores data column wise instead of row wise. Columnstore indexes first shipped in SQL Server 2012 and significant enhancements will be included in the next release.
- Main-Memory DatabasesCurrent database systems were designed assuming that data resides on disk. This assumption is no longer valid; main memories have become sufficiently large that most OLTP databases can reside entirely in memory. In this project we investigate what architectural changes and new techniques are required to realize the potential for great performance improvements offered by storing data in main memory.
- Big DataBusinesses today operate on the monitor-mine-manage (M3) cycle: they monitor and archive large amounts of data, which they mine to derive insights such as models. The models are used during the manage phase to add value to the business, e.g., by scoring the models with real-time data. This project looks at the broad area of platforms and applications for big data analytics, from a database-oriented perspective, in the context of achieving a frictionless M3 cycle.
- StreamsThe stream processing project - also called CEDR for Complex Event Detection and Response - proposes novel architectures, processing techniques, and applications to support time-oriented queries over temporal and real-time data streams. CEDR shipped in 2010 as Microsoft StreamInsight - a commercial stream processing system. Our currect focus areas include scale-out, high-availability, query optimization, and new platforms for stream processing.
- Database Security and ComplianceMuch of the sensitive data in any organization is stored in a database system. There is a natural need to manage the security of the sensitive information. Our goal in this project is to develop tools to manage database security.
- DeuteronomyTraditionally, a DBMS kernel has recovery, concurrency control and access method code tightly bound together. We factor the kernel into a transactional component (TC) that knows about transactions and their “logical” concurrency control and undo/redo recovery, and a data component (DC) that knows about the access methods and supports a record oriented interface with atomic operations. The interaction of the components is governed by a contract or covenant.
- Phoenix Application RecoveryThe Phoenix goal is to improve application availability and error handling robustness. The project exploits database recovery techniques for enabling applications to survive system crashes. Two prototype systems have been built. Phoenix/ODBC provides persistent database sessions across database system failures. Phoenix/App provides persistent middle-tier applications across application server failures.
- Immortal DBThe Immortal DB project began in the fall of 2002. This project's goal is to provide the infrastructure for saving and indexing all prior states of a database. Foundational work for this effort has been published: on indexing versions, and on choosing timestamps. We have built a prototype system that includes indexing, version compression, and bad user transaction recovery. A major goal has been to provide performance close to that of an unversioned database.
- Model ManagementThe goal of model management is to develop a generic infrastructure that offers an order-of-magnitude productivity improvement to builders of model-driven applications, such as database tools, application design tools, message translators, and customizable commercial applications.
Recent Publications
- Cristian Diaconu, Craig Freedman, Erik Ismert, Per-Ake Larson, Pravin Mittal, Ryan Stonecipher, Nitin Verma, and Mike Zwilling, Hekaton: SQL Server’s Memory-Optimized OLTP Engine, ACM International Conference on Management of Data, 22 June 2013
- Mike Barnett, Badrish Chandramouli, Robert DeLine, Steven Drucker, Danyel Fisher, Jonathan Goldstein, Patrick Morrison, and John Platt, Stat! - An Interactive Analytics Environment for Big Data, in ACM SIGMOD International Conference on Management of Data (SIGMOD 2013), ACM SIGMOD, June 2013
- Daniel Fabbri, Ravi Ramamurthy, and Raghav Kaushik, SELECT Triggers For Data Auditing, in IEEE International Conference on Data Engineering (ICDE), International Conference on Data Engineering, 9 April 2013
- Justin J. Levandoski, Per-Ake Larson, and Radu Stoica, Identifying Hot and Cold Data in Main-Memory Databases, in 2013 IEEE 29th International Conference on Data Engineering (ICDE), International Conference on Data Engineering, 8 April 2013
- Justin Levandoski, David Lomet, and Sudipta Sengupta, The Bw-Tree: A B-tree for New Hardware, in 2013 IEEE 29th International Conference on Data Engineering (ICDE), International Conference on Data Engineering, 8 April 2013
- Arvind Arasu, Ken Eguro, Raghav Kaushik, and Ravi Ramamurthy, Querying Encrypted Data (Tutorial), in ICDE, , April 2013
- Arvind Arasu, Spyros Blanas, Ken Eguro, Raghav Kaushik, Donald Kossmann, Ravi Ramamurthy, and Ramaratnam Venkatesan, Orthogonal Security With Cipherbase, in 6th Biennial Conference on Innovative Data Systems Research (CIDR'13), , 8 January 2013
- Prasang Upadhyaya, Nick Anderson, Magdalena Balazinska, Bill Howe, Raghav Kaushik, Ravi Ramamurthy, and Dan Suciu, Stop That Query! The Need for Managing Data Use, in 6th Biennial Conference on Innovative Data Systems Research (CIDR ’13), , 8 January 2013
- Yupeng Fu, Raghav Kaushik, and Ravi Ramamurthy, On Scaling Up Sensitive Data Auditing, Very Large Data Bases Endowment Inc., 1 January 2013
- Mr.XEtAl, Secure HPTS requires Secure Hardware, 2013
- Badrish Chandramouli, Joris Claessens, Suman Nath, Ivo Santos, and Wenchao Zhou, RACE: Real-time Applications over Cloud-Edge, in ACM SIGMOD International Conference on Management of Data (SIGMOD 2012), ACM SIGMOD, May 2012
- Justin J. Levandoski, Mohamed Sarwat, Ahmed Eldawy, and Mohamed F. Mokbel, LARS: A Location-Aware Recommender System, in 28th International Conference on Data Engineering (ICDE 2012), IEEE, April 2012
- Badrish Chandramouli, David Maier, and Jonathan Goldstein, Physically Independent Stream Merging, in 28th International Conference on Data Engineering (ICDE '12), IEEE, April 2012
- Badrish Chandramouli, Jonathan Goldstein, and Songyun Duan, Temporal Analytics on Big Data for Web Advertising, in 28th International Conference on Data Engineering (ICDE '12), IEEE, April 2012
- David Lomet, Alan Fekete, Rui Wang, and Peter Ward, Multi-Version Concurrency via Timestamp Range Conflict Management, in ICDE, IEEE Computer Society, April 2012
- Umar Farooq Minhas, David Lomet, and Chandramohan A. Thekkath, Chimera: Data Sharing Flexibility, Shared Nothing Simplicity, in IDEAS, Springer Verlag, 21 September 2011
- Justin J. Levandoski, Michael D. Ekstrand, Michael J. Ludwig, Ahmed Eldawy, Mohamed F. Mokbel, and John T. Riedl, RecBench: Benchmarks for Evaluating Performance of Recommender System Architectures, in VLDB, Very Large Data Bases Endowment Inc., September 2011
- David Lomet, Kostas Tzoumas, and Michael Zwilling, Implementing Performance Competitive Logical Recovery, Very Large Data Bases Endowment Inc., 29 August 2011
- Mohamed Ali, Badrish Chandramouli, Jonathan Fay, Curtis Wong, Steven Drucker, and Balan S. Raman, Online Visualization of Geospatial Stream Data using the WorldWide Telescope, in International Conference on Very Large Databases (VLDB), Very Large Data Bases Endowment Inc., August 2011
- David Lomet, Transactions: From Local Atomicity to Atomicity in the Cloud, in Lecture Notes on Computer Science 6875, Springer Verlag, August 2011
Interns
A number of interns visit the Database Group each summer. Some of our more recent interns are listed below:
- Spyros Blanas
University of Wisconsin - Yupeng Fu
University of California - San Diego - Hamid Mousavi
University of California - Los Angeles - Aditya Parameswaran
Stanford University - Radu Stoica
EPFL - Khai Tran
University of Wisconsin - Kostas Tzoumas
Aarhus University - Di Wang
Worcester Polytechnic Institute - Kevin Zhao
University of California - San Diego - Wenchao Zhou
University of Pennsylvania
A more complete intern list is here.
Visiting Researchers
- Suad Alagic (2000-2001)
University of Southern Maine - Paolo Atzeni (2003)
University of Rome 3 - Christoph Freytag (2002)
Humboldt University - Wolfgang Lehner (2004)
Dresden University of Technology - David Maier (2007, 2010)
Portland State University - Rene Miller (2001, 2002)
University of Toronto - Mohamed Mokbel (2008, 2010)
University of Minnesotoa - Elizabeth O'Neil (2001)
University of Massachusetts, Boston - Patrick O'Neil (2001)
University of Massachusetts, Boston - Lemonia Ragia (2003)
University of Aachen - Erhard Rahm (2000)
University of Leipzig - Betty Salzberg (1996, 2002)
Northeastern University - Gerhard Weikum (1997, 2000, 2001)
University of Saarland
