A Fine-Tuned BERT-Based Model for Individual Log Anomaly Detection in Operational Monitoring at Paranal Observatory
School authors:
author photo
Rodrigo Arnaldo Carrasco
External authors:
  • Andres H. Catalan ( Universidad Adolfo Ibanez , Directorate Programs Res & Dev DIPRIDA )
  • Gonzalo A. Ruz ( Universidad Adolfo Ibanez , Ctr Appl Ecol & Sustainabil CAPES , Millennium Nucleus Social Data Sci SODAS )
  • Juan P. Gil ( Paranal Observ )
Abstract:

In operational environments such as astronomical observatories, continuous monitoring of system logs is critical yet challenging due to the vast volume of data generated. Manual inspection of these logs is impractical, needing automated methods capable of accurate and scalable anomaly detection. This paper proposes a novel sentiment-aware anomaly detection framework, leveraging Bidirectional Encoder Representations from Transformers with In-Task Pre-Training and Fine-Tuning (BERT-ITPT-FiT), to classify textual logs based on their inherent sentiment polarity. Specifically tailored for the Paranal Observatory, our approach incorporates a computationally efficient, character-based preprocessing strategy that retains essential semantic elements, such as acronyms and technical terms, thereby eliminating the need for traditional parsing. This method efficiently processes a substantial dataset comprising 7,359,506 log entries from three VLTI instruments in an average runtime of only 114.23 seconds. Extensive experiments utilizing real operational data demonstrate that our architecture achieves an effective throughput of approximately 3,958 logs per second during the combined embedding generation and model training phases while consistently delivering an excellent classification performance. Besides, it significantly outperforms existing deep learning methods combined with conventional embeddings in both in- and cross-instrument scenarios, achieving F1-Scores exceeding 99.99% and 99.96%, respectively. This work emphasizes a critical balance between computational efficiency and classification performance, providing a robust and scalable solution for anomaly detection in high-stakes observatory operations.

UT WOS:001527215700013
Number of Citations 0
Type
Pages 117464-117478
ISSUE
Volume 13
Month of Publication
Year of Publication 2025
DOI https://doi.org/10.1109/ACCESS.2025.3586586
ISSN
ISBN