Striving to deliver a highly reliable cloud network infrastructure

Research Goals

The NetWiser project spans several areas of data center networks, from designing new scalable network architectures, understanding network failures and developing techniques for improving availability of services hosted in the cloud.
  • Understand network failures in data centers: The aim is to characterize failures of network devices in data centers by analyzing failure incidents and correlating them with network traffic, estimating impact of failures, and deriving implications for designing future network architectures. [SIGCOMM 2011].
  • Design scalable and flexible data center network architectures VL2 is a new data center network architecture designed for hundreds of thousands of servers and built from commodity switches that enables high-bisection bandwidth between all communicating server pairs, agility in mapping any service to any server, and achieves graceful performance degradation under failures [SIGCOMM 2009, CACM 2011].


  • An Empirical Study of Intra-datacenter and Inter-datacenter Network Failures for Geo-distributed Services
    Rahul Potharaju and Navendu Jain
    To appear in ACM SIGMETRICS 2013 (Extended Abstract).
  • Juggling the Jigsaw: Towards Automated Problem Inference from Network Trouble Tickets
    Rahul Potharaju, Navendu Jain and Cristina Nita-Rotaru
    To appear in 10th USENIX Symposium on Network Design and Implementation (NSDI '13).
  • Understanding Network Failures in Data Centers: Measurement, Analysis, and Implications.
    Phillipa Gill, Navendu Jain, and Nachi Nagappan.
    Proceedings of the ACM Special Interest Group on Data Communications (SIGCOMM '11), Toronta, Canada, August 2011.
    [PDF] [Bibtex]
  • VL2: A Scalable and Flexible Data Center Network.
    Albert Greenberg, James Hamilton, Navendu Jain, Srikanth Kandula, Changhoon Kim, Parantap Lahiri, Dave Maltz, Praveen Patel, and Sudipta Sengupta.
    Communications of the ACM (CACM '11), Research highlights. (A conference version of this work appeared in the ACM Special Interest Group on Data Communications (SIGCOMM '09), Barcelona, Spain, August 2009.) [PDF] [PS] [Bibtex] [Project Page]