Siirry päänavigointiin Siirry hakuun Siirry pääsisältöön

LO2: Microservice API Anomaly Dataset of Logs and Metrics

Tutkimustuotos: Artikkeli kirjassa/raportissa/konferenssijulkaisussaKonferenssiartikkeliTieteellinenvertaisarvioitu

Abstrakti

Context. Microservice-based systems have gained significant attention over the past years. A critical factor for understanding and analyzing the behavior of these systems is the collection of monitoring data such as logs, metrics, and traces. These data modalities can be used for anomaly detection and root cause analysis of failures. In particular, multi-modal methods utilizing several types of this data at once have gained traction in the research community since these three modalities capture different dimensions of system behavior.Aim. We provide a dataset that supports research on anomaly detection and architectural degradation in microservice systems. We generate a comprehensive dataset of logs, metrics, and traces from a production microservice system to enable the exploration of multi-modal fusion methods that integrate multiple data modalities.Method. We dynamically tested the various APIs of the MS-based system, implementing the OAuth2.0 protocol using the Locust tool. For each execution of the prepared test suite, we collect logs and performance metrics for correct and erroneous calls with data labeled according to the error triggered during the call.Contributions. We collected approximately 657,000 individual log files, totaling over two billion log lines. In addition, we collected more than 45 million individual metric files that contain 485 unique metrics. We provide an initial analysis of logs, identify key metrics through PCA, and discuss challenges in collecting traces for this system. Moreover, we highlight the possibilities for making a more fine-grained version of the data set. This work advances anomaly detection in microservice systems using multiple data sources.
Alkuperäiskielienglanti
OtsikkoPROMISE '25: Proceedings of the 21st International Conference on Predictive Models and Data Analytics in Software Engineering
Sivumäärä10
KustantajaACM
Julkaisupäivä26 kesäk. 2025
Sivut1-10
ISBN (elektroninen)979-8-4007-1594-5
DOI - pysyväislinkit
TilaJulkaistu - 26 kesäk. 2025
OKM-julkaisutyyppiA4 Artikkeli konferenssijulkaisuussa
TapahtumaInternational Conference on Predictive Models and Data Analytics in Software Engineering - Trondheim, Norja
Kesto: 26 kesäk. 2025 → …
Konferenssinumero: 21

Tieteenalat

  • Software System Performance and Reliability
  • 113 Tietojenkäsittely- ja informaatiotieteet

Siteeraa tätä