Failure prediction of data centers using time series and Fault Tree Analysis

Conference proceedings article


Authors/Editors


Strategic Research Themes

No matching items found.


Publication Details

Author listChalermarrewong T., Achalakul T., See S.C.W.

Publication year2012

Start page794

End page799

Number of pages6

ISBN9780769549033

ISSN1521-9097

eISSN1521-9097

URLhttps://www.scopus.com/inward/record.uri?eid=2-s2.0-84874067812&doi=10.1109%2fICPADS.2012.129&partnerID=40&md5=84ea883e7e23219fcd6901d6d0cf8d62

LanguagesEnglish-Great Britain (EN-GB)


View in Web of Science | View on publisher site | View citing articles in Web of Science


Abstract

This paper proposes a framework for online failure prediction of data centers. A data center often has a high failure rate as it features a number of servers and components. Moreover, long running applications and intensive workloads are common in such facilities. Performance of the system depends on the availability of the machines, which can be easily compromised if failure cannot be handled gracefully. The main idea of this paper is to create an effective prediction model focusing on hardware failure. Accurate prediction may enhance the overall system performance. In this work, we employ two methods, namely, ARMA (Auto Regressive Moving Average) and Fault Tree Analysis. Experiments were then performed on a simulated cluster built based on Simics platform. The results show prediction accuracy of 97%, which is very high. We thus believe that our framework is practical and can be adapted to use in data centers in the future. ฉ 2012 IEEE.


Keywords

Fault managementFault Tree AnalysisPerformance enhancementTime series prediction


Last updated on 2023-13-10 at 07:35