|
Networks are being deployed extensively in large corporations, small offices, and homes.
However, a significant number of ``pain points'' remain for end-users and
network administrators. To resolve complaints quickly and efficiently, network
administrators need tools that can assist them in detecting, isolating,
diagnosing, and correcting faults. Furthermore, such tools should also detect
network security breaches, possibly caused by innocent employees. The NetHealth project
is about detecting, infering, diagnosing, and recovering from user perceived performance
problems in enterprise networks.
Existing products do a reasonable job of presenting statistical data
from the network. However, they do not do a comprehensive job of gathering
and analyzing the data to establish the root cause of the problem. Furthermore, on the
wireless side, most products gather data from the
Access Points (APs) only and neglect the client-side view of the network. Some products
that monitor the network from the client's perspective require hardware sensors, which
can be expensive to deploy and maintain. Also, current solutions do not provide any
support for disconnected clients even though these are the ones that need the most help. On
the wired side, a number of researchers have come up with solutions for
diagnosing problems over WANs; however, most of those approaches are not
integrated to perform end-to-end inference and diagnostics.
Under the NetHealth umbrella, we are building algorithms and tools that
- allow generalist operators to diagnose end-to-end performance as “seen” by users
- produce near real-time and historical-analysis reports of end-to-end performance problems with networked services and components
- prioritize and raise alerts based on impact analysis on users from performance glitches/problems
- automatically resolve the problem or offer meaningful resolution strategies
- provide detailed analysis of wireless failures for mobile devices
- provide snapshots of the “health” of network elements and services
- compliment existing detailed networked diagnosis technologies
In contrast to traditional network-based and bolt-on approaches, NetHealth leverages clients and servers.
NetHealth agents on the end systems are positioned to harvest available application data, and infer
application-level dependencies, rather than reverse this information out from the network or from
summarized logs and alerts from computing and network elements, and associated management systems.
As a result, the NetHealth approach is well-suited for effective problem location and resolution,
and for bringing together the intelligence needed to support meaningful resilience and self-healing,
self-* capabilities.
- Sherlock (Previously called AND) - Enterprise Network Management via Analysis of Network Dependencies
-
DAIR - Enterprise wireless LAN Management Via Dense Array of Inexpensive Radios
-
SureMail - Notification system to make intended recipients aware of silent email loss
-
NetProfiler - Cooperative Network Monitoring & Diagnosis
- Larry Greenemeier,InformationWeek,
Inside Microsoft's Labs,
December 04 , 2006
- Gary Anthes, Computerworld,
The Future of E-mail,
June 12, 2006
- Gary Anthes, Computerworld,
Projects in the Microsoft Research labs,
June 5, 2006
- Joris Evers, The Industry Standard,
Microsoft Reseachers target worms,
March 4, 2005
- Paramvir Bahl, Ranveer Chandra, Albert Greenberg, Srikanth Kandula, David A. Maltz, Ming Zhang,
Towards Highly Reliable Enterprise Network Services Via Inference of Multi-level Dependencies,
Proceedings of ACM SIGCOMM 2007, Kyoto, Japan, September 2007
(pdf, 581 Kbytes)
- Sharad Agarwal, Venkata N. Padmanabhan, Dilip Joseph,
Addressing Email Loss with SureMail: Measurement, Design, and Evaluation,
Proceedings of the Anuual USENIX Technical conference, Santa Clara, CA, June 2007
- Ranveer Chandra, Hitendra Padhye, Alec Wolman, Brian Zill,
A location based Management System for Enterprise Wireless LANs,
Proceedings of USENIX NSDI 2007, Cambridge, Massachusetts, USA, April 2007
(pdf, 524 Kbytes)
- Venkata N. Padmanabhan, Narayanan S. Ramabhadran, Sharad Agarwal, Jitendra Padhye,
A Study of End-to-End Web Access Failures,
in Proceedings of CoNext 2006, Lisboa, Portugal, December 2006
- Paramvir Bahl, Paul Barham, Richard Black, Ranveer Chandra, Moises Goldszmidt, Rebecca Isaacs, Srikanth Kandula, Lun Li, John MacCormick, David A. Maltz, Richard Mortier, Mike Wawrzoniak, Ming Zhang,
Discovering Dependencies for Network Management,
HotNets-V, November 2006.
(pdf, 90 Kbytes)
- Ranveer Chandra, Venkata N. Padmanabhan and Ming Zhang,
WiFiProfiler: Cooperative Diagnosis in Wireless LANs,
Proceedings of ACM/USENIX MobiSys, Uppsala, Sweden, June 2006
(pdf, 355 Kbytes)
- Paramvir Bahl, Ranveer Chandra, Jitendra Padhye, Lenin Ravindranath, Manpreet Singh, Alec Wolman and Brian Zill,
Enhancing the Security of Corporate Wi-Fi Networks Using DAIR,
Proceedings of ACM/USENIX MobiSys, Uppsala, Sweden, June 2006.
(pdf, 348 Kbytes)
- Paramvir Bahl, Jitendra Padhye, Lenin Ravindranath, Manpreet Singh, Alec Wolman, Brian Zill,
DAIR: A Framework for Managing Enterprise Wireless Networks Using Desktop Infrastructure,
HotNets-IV, November 2005 (pdf, 64 Kbytes)
- Sharad Agarwal, Venkata N. Padmanabhan, Dalip A. Joseph,
SureMail: Notification Overlay for Email Reliability,
HotNets-IV, November 2005 (pdf, 150 Kbytes)
- Venkata N. Padmanabhan, Sriram Ramabhadran, and Jitendra Padhye,
Profiling Wide-Area Networks Using Peer Cooperation, IPTPS 2005,
(pdf, 62 Kbytes)
- Karthik Lakshminarayanan, Venkata N. Padmanabhan, Jitendra Padhye,
Bandwidth Estimation in Broadband Access Networks,
Proceedings of ACM Internet Measurment Conference 2004,
Sicily, Italy (pdf, 106 Kbytes)
previously published as
Microsoft Technical Report, MSR-TR-2004-44 May 2004
- John Dunagan, Nicholas J.A. Harvey, Michael B. Jones, Dejan Kostic, Marvin Theimer, and Alec Wolman,
FUSE: Lightweight Guaranteed Distributed Failure Notification
Proceedings of the Sixth Symposium on Operating Systems Design and Implementation (OSDI),
San Francisco, CA, December 2004 (pdf)
- Atul Adya, Paramvir Bahl, Ranveer Chandra, Lili Qiu,
Architecture
and Techniques for Diagnosing Faults in IEEE 802.11 Infrastructure Networks,
In Proceedings of MobiCom, September 2004,
(pdf, 259 KBytes)
- Lili Qiu, Paramvir Bahl, Ananth Rao, Lidong Zhou,
Fault
Detection, Isolation, and Diagnosis in Multi-hop Wireless Networks,
Microsoft Technical Report, MSR-TR-2004-11, December 2003
(pdf, 320 Kbytes)
|