Share on Facebook Tweet on Twitter Share on LinkedIn Share by email
PAD: Performance Anomaly Detection in Multi-Server Distributed Systems

Manjula Peiris, James H. Hill, Jorgen Thelin, Sergey Bykov, Gabriel Kliot, and Christian Konig

Abstract

Multi-server distributed systems are becoming increasingly popular with the emergence of cloud computing. These systems need to provide high throughput with low latency, which is a difficult task to achieve. Manual performance tuning and diagnosis of such systems, however, is hard as the amount of relevant performance diagnosis data is large. To help system developers with performance diagnosis, we have developed a tool called Performance Anomaly Detector (PAD). PAD combines user-driven navigation analysis with automatic correlation and comparative analysis techniques. The combination results in a powerful tool that can help find a number of performance anomalies. Based on our experience in applying PAD to the Orleans system, we discovered that PAD was able to reduce developer time and effort detecting anomalous performance cases and improve a developer’s ability to perform deeper analysis of such behaviors.

Details

Publication typeInproceedings
Published in7th IEEE International Conference on Cloud Computing (IEEE Cloud 2014)
> Publications > PAD: Performance Anomaly Detection in Multi-Server Distributed Systems