I am a member of the Cloud and Information Services Lab at Microsoft.
My interests are in storage and compute infrastructure for processing massive datasets. In the past, I have designed/implemented/deployed Kosmos distributed filesystem (KFS) to manage PB's of storage. KFS is currently deployed on a cluster of over 1000 nodes. I have designed/implemented Sailfish, a compute infrastructure which improves handling of intermediate data (i.e., "shuffle" phase in a Map-Reduce computation). Sailfish is based on the observation that the bandwidth within a datacenter will increase substantially in the next few years (viz., 10Gbps between pairs of nodes will be commonplace). We leverage such an expected increase to do network data aggregation and thereby increase disk subsystem performance when transporting intermediate data in a Map-Reduce computation. Our results show that Sailfish can improve job completion times at scale by 20% to 5x.
Both KFS and Sailfish have been released as open-source projects.
At CISL, I am working on building Hadoop related services on Windows Azure.
- Sriram Rao, Benjamin Reed, and Adam Silberstein, HotROD: Managing Grid Storage With On-Demand Replication, Workshop on Data Management in the Cloud (DMC'13), April 2013
- Ganesh Ananthanarayanan, Christopher Douglas, Raghu Ramakrishnan, Sriram Rao, and Ion Stoica, True Elasticity in Multi-Tenant Clusters through Amoeba, in ACM Symposium on Cloud Computing, October 2012
- Sriram Rao, Raghu Ramakrishnan, Adam Silberstein, Mike Ovsiannikov, and Damian Reeves, Sailfish: A Framework For Large Scale Data Processing, in ACM Symposium on Cloud Computing, October 2012
- Jianjun Chen, Chris Douglas, Michi Mutsuzaki, Patrick Quaid, Raghu Ramakrishnan, Sriram Rao, and Russell Sears, Walnut: a unified cloud object store, in SIGMOD Conference, May 2012