Sriram Rao

PRINCIPAL SCIENTIST LEAD

I am a member of the Cloud and Information Services Lab at Microsoft.

My interests are in storage and compute infrastructure for processing massive datasets. In the past, I designed, implemented, and deployed the Kosmos distributed filesystem (KFS) to manage petabytes of storage. KFS is currently deployed on a cluster of over 1000 nodes. I also designed and implemented Sailfish, a compute infrastructure that improves the handling of intermediate data (i.e., the "shuffle" phase of a Map-Reduce computation). Sailfish is based on the observation that bandwidth within a datacenter will increase substantially in the next few years (namely, 10 Gbps between pairs of nodes will be commonplace). We leverage this expected increase to aggregate data over the network and thereby improve disk subsystem performance when transporting intermediate data in a Map-Reduce computation. Our results show that Sailfish can improve job completion times at scale by 20% to 5x.

Both KFS and Sailfish have been released as open-source projects.

At CISL, I am working on building Hadoop-related services on Windows Azure.

Publications