Anomaly Detection in Large-Scale Monitoring Systems using a Language Model
Conference proceedings article
Authors/Editors
Strategic Research Themes
Publication Details
Author list: Supasate Vorathammathorn, Nopphakorn Subsa-ard, Tawan Thaepprasit, Phond Phunchongharn, Sansiri Tarnpradab
Publication year: 2025
URL: https://dl.acm.org/doi/10.1145/3718350.3718354
Abstract
Large-scale monitoring systems encounter significant challenges in detecting anomalies, which can disrupt operations and degrade overall system performance. This study proposes Anomaly Detection in Large-Scale Monitoring Systems using a Language Model (AD-LM), an innovative approach designed to address these challenges by employing a language model specifically tailored for anomaly detection. Through two main steps, AD-LM first utilizes BERTopic for topic modeling, clustering log entries into meaningful topics that uncover patterns indicative of anomalies. Then, the system employs various classification models, including tree-based, graph-based, and sequence-based approaches, to predict and diagnose failures. Extensive experiments were conducted on three real-world log datasets: Hadoop Distributed File System (HDFS), BlueGene/L (BGL) supercomputer system, and Thunderbird supercomputer system. The model achieved F1 scores of 0.998, 0.999, and 0.999, respectively, across these datasets, demonstrating its capability to significantly improve anomaly detection performance.
Keywords
No matching items found.