Automated Known Problem Diagnosis with Event Traces

  • Chun Yuan ,
  • Ni Lao ,
  • Ji-Rong Wen ,
  • Jiwei Li ,
  • Zheng Zhang ,
  • Yi-Min Wang ,
  • Wei-Ying Ma

MSR-TR-2005-81 |

Publication

Computer problem diagnosis remains a serious challenge to users and support professionals. Traditional troubleshooting methods relying heavily on human intervention make the process inefficient and the results inaccurate even for solved problems, which contribute significantly to user’s dissatisfaction. We propose to use system behavior information such as system event traces to build correlations with solved problems, instead of using only vague text descriptions as in existing practices. The goal is to enable automatic identification of the root cause of a problem if it is a known one, which would further lead to its resolution. By applying statistical learning techniques to classifying system call sequences, we show our approach can achieve considerable accuracy of root cause recognition by studying four case examples.