Artemis
Artemis is a modular application designed for analyzing and troubleshooting the performance of large clusters running datacenter services. Artemis is composed of four modules: (1) distributed log collection and extraction, (2) a database storing the extracted data, (3) an interactive visualization tool for exploring the data, and (4) a plug-in interface (and a set of sample plug-ins) allowing users to implement data analysis tools.
Publications
- Úlfar Erlingsson, Marcus Peinado, Simon Peter, and Mihai Budiu, Fay: Extensible Distributed Tracing from Kernels to Clusters, in ACM Symposium on Operating Systems Principles (SOSP), ACM, October 2011
- Mihai Budiu, User interfaces for exploring multi-dimensional data sets, no. MSR-TR-2010-67, 4 June 2010
- Moises Goldszmidt, Mihai Budiu, Yue Zhang, and Michael Pechuk, Toward Automatic Policy Refinement in Repair Services for Large Distributed Systems, in The 3rd ACM SIGOPS International Workshop on Large Scale Distributed Systems and Middleware, 17 September 2009
- Gabriela Cretu, Mihai Budiu, and Moises Goldszmidt, Hunting for problems with Artemis, in USENIX Workshop on the Analysis of System Logs (WASL), USENIX, December 2008
