One-Class Classification and Cluster Ensembles for Anomaly Detection and Diagnosis in Multivariate Time Series Data

Authors Adil Mukhtar , Thomas Hirsch, Gerald Schweiger



PDF
Thumbnail PDF

File

OASIcs.DX.2024.14.pdf
  • Filesize: 1.3 MB
  • 19 pages

Document Identifiers

Author Details

Adil Mukhtar
  • Institute of Software Technology, Graz University of Technology, Austria
Thomas Hirsch
  • Institute of Software Technology, Graz University of Technology, Austria
Gerald Schweiger
  • Vienna University of Technology, Austria

Cite As Get BibTex

Adil Mukhtar, Thomas Hirsch, and Gerald Schweiger. One-Class Classification and Cluster Ensembles for Anomaly Detection and Diagnosis in Multivariate Time Series Data. In 35th International Conference on Principles of Diagnosis and Resilient Systems (DX 2024). Open Access Series in Informatics (OASIcs), Volume 125, pp. 14:1-14:19, Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2024) https://doi.org/10.4230/OASIcs.DX.2024.14

Abstract

Real-world automated systems such as building automation, power plants, and more have benefited from data-driven learning methodologies for anomaly detection and diagnosis. Typically, these methodologies heavily rely on prior knowledge related to abnormal operations, i.e., data points labeled as anomalies. However, in practice, such labelled data points are often unavailable which poses challenges in effective anomaly detection, particularly in diagnosis. In this paper, we propose One-class Classification Cluster ENsembles (OCCEN) anomaly detection and diagnosis approach for multivariate time series data. OCCEN utilizes one-class classification learning methods for anomaly detection followed by the decomposition of anomalies into multiple clusters. Then each cluster is treated as a binary classification problem and classifiers are trained to learn cluster representations. These trained models in combination with explainable AI models are used to generate a ranked list of diagnoses, i.e., features. Finally, we re-rank those features to account for temporal dependencies through the dynamic time-warping technique. The practical evaluation of OCCEN for air handling units (AHU) demonstrates its effectiveness in identifying faults. The framework consistently outperforms the baseline in fault diagnosis, as higher scores are observed for detection and diagnostic evaluation metrics, including F1 score, intersection over union, HitRate@k, and RootCause@k.

Subject Classification

ACM Subject Classification
  • Computing methodologies → Machine learning algorithms
  • Computing methodologies → Artificial intelligence
  • Computing methodologies → Model development and analysis
  • Hardware → Fault models and test metrics
Keywords
  • Anomaly Detection and Diagnosis
  • Machine Learning
  • Explainable AI
  • One-class Classification

Metrics

  • Access Statistics
  • Total Accesses (updated on a weekly basis)
    0
    PDF Downloads

References

  1. Anam Abid, Muhammad Tahir Khan, and Javaid Iqbal. A review on fault detection and diagnosis techniques: basics and beyond. Artificial Intelligence Review, 54:3639-3664, 2021. URL: https://doi.org/10.1007/S10462-020-09934-2.
  2. Mennatallah Amer, Markus Goldstein, and Slim Abdennadher. Enhancing one-class support vector machines for unsupervised anomaly detection. In Proceedings of the ACM SIGKDD workshop on outlier detection and description, pages 8-15, 2013. Google Scholar
  3. Yeonjin Bae, Saptarshi Bhattacharya, Borui Cui, Seungjae Lee, Yanfei Li, Liang Zhang, Piljae Im, Veronica Adetola, Draguna Vrabie, Matt Leach, et al. Sensor impacts on building and hvac controls: A critical review for building energy performance. Advances in Applied Energy, 4:100068, 2021. Google Scholar
  4. Anna M Bartkowiak. Anomaly, novelty, one-class classification: a comprehensive introduction. International Journal of Computer Information Systems and Industrial Management Applications, 3(1):61-71, 2011. Google Scholar
  5. A Beghi, R Brignoli, Luca Cecchinato, Gabriele Menegazzo, Mirco Rampazzo, and F Simmini. Data-driven fault detection and diagnosis for hvac water chillers. Control Engineering Practice, 53:79-91, 2016. Google Scholar
  6. Efrem Heri Budiarto, Adhistya Erna Permanasari, and Silmi Fauziati. Unsupervised anomaly detection using k-means, local outlier factor and one class svm. In 2019 5th international conference on science and technology (ICST), volume 1, pages 1-5. IEEE, 2019. Google Scholar
  7. Xuewu Dai and Zhiwei Gao. From model, signal to knowledge: A data-driven perspective of fault detection and diagnosis. IEEE Trans. Ind. Informatics, 9(4):2226-2238, 2013. URL: https://doi.org/10.1109/TII.2013.2243743.
  8. Laura Erhan, M Ndubuaku, Mario Di Mauro, Wei Song, Min Chen, Giancarlo Fortino, Ovidiu Bagdasar, and Antonio Liotta. Smart anomaly detection in sensor systems: A multi-perspective review. Information Fusion, 67:64-79, 2021. URL: https://doi.org/10.1016/J.INFFUS.2020.10.001.
  9. Astha Garg, Wenyu Zhang, Jules Samaran, Ramasamy Savitha, and Chuan-Sheng Foo. An evaluation of anomaly detection and diagnosis in multivariate time series. IEEE Transactions on Neural Networks and Learning Systems, 33(6):2508-2517, 2021. URL: https://doi.org/10.1109/TNNLS.2021.3105827.
  10. Zhiqiang Ge. Review on data-driven modeling and monitoring for plant-wide industrial processes. Chemometrics and Intelligent Laboratory Systems, pages 16-25, 2017. Google Scholar
  11. Jessica Granderson, Guanjing Lin, Yimin Chen, Armando Casillas, Jin Wen, Zhelun Chen, Piljae Im, Sen Huang, and Jiazhen Ling. A labeled dataset for building hvac systems operating in faulted and fault-free states. Scientific Data, 2023. Google Scholar
  12. H Burak Gunay and Zixiao Shi. Cluster analysis-based anomaly detection in building automation systems. Energy and Buildings, 228:110445, 2020. Google Scholar
  13. Fouzi Harrou, Abdelkader Dairi, Bilal Taghezouit, and Ying Sun. An unsupervised monitoring procedure for detecting anomalies in photovoltaic systems using a one-class support vector machine. Solar Energy, 179:48-58, 2019. Google Scholar
  14. Andreas Holzinger, Anna Saranti, Christoph Molnar, Przemyslaw Biecek, and Wojciech Samek. Explainable ai methods-a brief overview. In International workshop on extending explainable AI beyond deep models and classifiers, pages 13-38. Springer, 2022. Google Scholar
  15. Rolf Isermann. Fault-diagnosis applications: model-based condition monitoring: actuators, drives, machinery, plants, sensors, and fault-tolerant systems. Springer Science & Business Media, 2011. Google Scholar
  16. Neha Kant and Manish Mahajan. Time-series outlier detection using enhanced k-means in combination with pso algorithm. In Engineering Vibration, Communication and Information Processing: ICoEVCI 2018, India, pages 363-373. Springer, 2019. Google Scholar
  17. Shehroz S Khan and Michael G Madden. One-class classification: taxonomy of study and review of techniques. The Knowledge Engineering Review, 29(3):345-374, 2014. URL: https://doi.org/10.1017/S026988891300043X.
  18. Zhiling Lan, Ziming Zheng, and Yawei Li. Toward automated anomaly identification in large-scale systems. IEEE Transactions on Parallel and Distributed Systems, 21(2):174-187, 2009. URL: https://doi.org/10.1109/TPDS.2009.52.
  19. Jinbo Li, Hesam Izakian, Witold Pedrycz, and Iqbal Jamal. Clustering-based anomaly detection in multivariate time series data. Applied Soft Computing, 100:106919, 2021. URL: https://doi.org/10.1016/J.ASOC.2020.106919.
  20. Fei Tony Liu, Kai Ming Ting, and Zhi-Hua Zhou. Isolation forest. In 2008 eighth ieee international conference on data mining, pages 413-422. IEEE, 2008. URL: https://doi.org/10.1109/ICDM.2008.17.
  21. Hang Liu, Youyuan Wang, and WeiGen Chen. Anomaly detection for condition monitoring data using auxiliary feature vector and density-based clustering. IET Generation, Transmission & Distribution, 14(1):108-118, 2020. Google Scholar
  22. Scott M Lundberg and Su-In Lee. A unified approach to interpreting model predictions. Advances in neural information processing systems, 30, 2017. Google Scholar
  23. James MacQueen et al. Some methods for classification and analysis of multivariate observations. In Proceedings of the fifth Berkeley symposium on mathematical statistics and probability, volume 1, pages 281-297. Oakland, CA, USA, 1967. Google Scholar
  24. Luigi Martirano and Massimo Mitolo. Building automation and control systems (bacs): a review. In 2020 IEEE International Conference on Environment and Electrical Engineering and 2020 IEEE Industrial and Commercial Power Systems Europe (EEEIC/I&CPS Europe), pages 1-8, 2020. Google Scholar
  25. Walter Hugo Lopez Pinaya, Sandra Vieira, Rafael Garcia-Dias, and Andrea Mechelli. Autoencoders. In Machine learning, pages 193-208. Elsevier, 2020. Google Scholar
  26. Douglas A Reynolds et al. Gaussian mixture models. Encyclopedia of biometrics, 741(659-663), 2009. Google Scholar
  27. Marco Tulio Ribeiro, Sameer Singh, and Carlos Guestrin. " why should i trust you?" explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining, pages 1135-1144, 2016. Google Scholar
  28. Stan Salvador and Philip Chan. Toward accurate dynamic time warping in linear time and space. Intelligent Data Analysis, 11(5):561-580, 2007. URL: http://content.iospress.com/articles/intelligent-data-analysis/ida00303.
  29. Bernhard Schölkopf, John C Platt, John Shawe-Taylor, Alex J Smola, and Robert C Williamson. Estimating the support of a high-dimensional distribution. Neural computation, 13(7):1443-1471, 2001. URL: https://doi.org/10.1162/089976601750264965.
  30. Pavel Senin. Dynamic time warping algorithm review. Information and Computer Science Department University of Hawaii at Manoa Honolulu, USA, 855(1-23):40, 2008. Google Scholar
  31. Chuxu Zhang, Dongjin Song, Yuncong Chen, Xinyang Feng, Cristian Lumezanu, Wei Cheng, Jingchao Ni, Bo Zong, Haifeng Chen, and Nitesh V Chawla. A deep neural network for unsupervised anomaly detection and diagnosis in multivariate time series data. In Proceedings of the AAAI conference on artificial intelligence, volume 33, pages 1409-1416, 2019. URL: https://doi.org/10.1609/AAAI.V33I01.33011409.
Questions / Remarks / Feedback
X

Feedback for Dagstuhl Publishing


Thanks for your feedback!

Feedback submitted

Could not send message

Please try again later or send an E-mail