Srikanth Kandula

Networking Research Group

My interests are in building and analyzing networked systems. Of late, I have worked on data center networks and services. I completed my PhD in Computer Science from MIT in 2008.

srikanth at microsoft dotcom
(425) 538 5407
One Microsoft Way, Redmond, WA 98052

My work falls into two main bins: improving big data analytics and improving network resource allocation in datacenters.
My blog.

Recent Papers (All)

Calendaring for Wide Area Networks
Srikanth Kandula, Ishai Menache, Roy Schwartz, Spandana Babbula

Multi-Resource Packing for Cluster Schedulers
Robert Grandl, Ganesh Ananthanarayanan, Srikanth Kandula, Sriram Rao, Aditya Akella

Traffic Engineering with Forward Fault Correction
Hongqiang Liu, Srikanth Kandula, Ratul Mahajan, Ming Zhang, David Gelernter

Dionysus: Dynamic Scheduling of Network Updates
X. Jin, H. Liu, R. Gandhi, S. Kandula, R. Mahajan, J. Rexford, R. Wattenhofer, M. Zhang

Speeding up Distributed Request-Response Workflows
MSR, Bing, UIUC, Steklov Math Inst.

We show how to improve the tail latency of datacenter services which are built as workflows over many components by appropriately allocating additional resources across the various stages in the workflow. Also, even a small amount of incompleteness (i.e., returning partial results) can improve latency by a lot if used well.

Leveraging Endpoint Flexibility in Data-Intensive Clusters
UC Berkeley, MSR

Some network traffic in DCs does not care about where it is sent modulo some constraints. We consider using such traffic to better balance network load.

Achieving High Utilization with Software-Driven WAN

We show that given responsive networks and responsive applications adapting who gets to send how much, when, and along which network paths can improve network utilization without losing out on business priorities.

Virtualizing Traffic Shapers for Practical Resource Allocation
MSR, UC Berkeley

Can we make a switch with a few (4-8) traffic shapers (WFQs, Priority Queues) mimic the shaping behavior of a switch that has 100s-1000s of shapers?

Act for Affordable Data Care

Is there a market of data breach insurance, i.e., one that risk assesses users and businesses, incents better behavior from both and, pays out damages to recover from accidental data breaches?

Re-optimizing Data Parallel Computing
MSR, Bing, UCBerkeley
This ships in Bing's Cosmos clusters since December 2011.

How would execution plans for jobs in big data clusters change if given additional information about properties of the user code, data and how the code and data interact? Can we extract such properties at scale?

PACMan: Coordinated Memory Caching for Parallel Jobs
Ganesh Anantharanayanan, Ali Ghodsi, Andrew Wang, Dhruba Borthakur, Srikanth Kandula, Scott Shenker, Ion Stoica

How to build an input cache spanning a cluster of machines to speed-up parallel executions? Hint: Not LRU.

Recurring Job Optimizations in Scope
Bing, MSR
Industrial Track Paper

This describe how we gather statistics about code and data and use them to improve execution plans in SCOPE.

Jockey: Guaranteed Job Latency in Data Parallel Clusters
MSR, Brown, Bing

How to schedule data-parallel jobs given per-job deadlines?

Augmenting Data Center Networks with Multi-Gigabit Wireless Links
MSR, Univ. of Washington

How to use dynamically available links to relieve congestion off the network core? How to build such links using 60GHz devices?

Scarlett: Coping with Skewed Content Popularity in MapReduce Clusters
UCBerkeley, MSR, Bing

Popularlity skew in input datasets slows down jobs, and a fix.

Sharing the Datacenter Network
MSR, Cornell, Azure, Bing

Adversarial network control, i.e., despite the traffic profile used by selfish or misbehaving tenants, how to ensure desired network allocation across entities?

Profiling Network Performance for Multi-tier Data Center Applications
Minlan Yu, Albert Greenberg, Dave Maltz, Jennifer Rexford, Lihua Yuan, Srikanth Kandula, Changhoon Kim

The many ways in which a vanilla network config is unsuitable for datacenter applications.

SideCar: Building Programmable Datacenter Networks without Programmable Switches
Alan Shieh, Srikanth Kandula, Emin Gun Sirer.

Can we programmatically process a non-trivial fraction of packets in the middle of the network? Then what?

Reining in the Outliers in Map-Reduce Clusters using Mantri
Ganesh Ananthanarayanan, Srikanth Kandula, Albert Greenberg, Ion Stoica, Yi Lu, Bikas Saha, Ed Harris.
This ships in Cosmos servers since May 2010. Tech. Report: MSR-TR-2010-69. Slides.

CloudCmp: Comparing Public Cloud Providers
Ang Li, Xiaowei Yang, Srikanth Kandula, Ming Zhang
A First Look at Traffic on Smartphones
Hossein Falaki, Dimitrios Lymberopoulos, Ratul Mahajan, Srikanth Kandula, Deborah Estrin


Seawall: Performance Isolation in Cloud Datacenter Networks
Alan Shieh, Srikanth Kandula, Albert Greenberg, Changhoon Kim
Press: ZDNet
CloudCmp: Shopping for a Cloud Made Easy
Ang Li, Xiaowei Yang, Srikanth Kandula, Ming Zhang
Project Site. Press: MIT Tech. Review, ZDNet

Diversity in Smartphone Usage
Hossein Falaki, Ratul Mahajan, Srikanth Kandula, Dimitrios Lymberopoulos, Ramesh Govindan, Deborah Estrin

Trusted End Host Monitors For Securing Cloud Datacenters
Alan Shieh, Srikanth Kandula, Albert Greenberg

Flyways to De-Congest Data Center Networks
Srikanth Kandula, Jitu Padhye, Victor Bahl

Sampling Biases in Network Path Measurements and What to Do About It
Srikanth Kandula, Ratul Mahajan.
Software built as part of this work is available here.
The Nature of Datacenter Traffic: Measurements and Analysis
Srikanth Kandula, Sudipta Sengupta, Albert Greenberg, Parveen Patel, Ronnie Chaiken. Talk@IMC.

Detailed Diagnosis in Computer Networks.
Srikanth Kandula, Ratul Mahajan, Patrick Verkaik, Sharad Agarwal, Jitu Padhye, Victor Bahl.
A Scalable and Flexible Datacenter Network.
Albert Greenberg, Navendu Jain, James Hamilton, Srikanth Kandula, Changhoon Kim, Parantap Lahiri, Dave Maltz, Parveen Patel, Sudipta Sengupta

I've worked with some amazing interns at MSR.
IMC, NetDB (co-chair), HotCloud, SLAML, LADIS, MobiHoc
  • Flare: Splitting flowlets over multiple paths
  • wcAsync: An asynchronous web traffic generator
  • ospfOpt: Finding optimal weights for OSPF traffic engineering
  • Broom: Unbiasing Internet path measurements

Short Bio
Srikanth Kandula is a Researcher at Microsoft Research. His research interests span many aspects of networked systems including datacenters, network management, diagnosis, applied statistical inference and security. He has published over 15 papers in top-tier venues such as SIGCOMM, NSDI, and MobiSys. He is a winner of the NSDI best student paper award (2005). He obtained his Ph. D. from the Massachusetts Institute of Technology (2008).