Saved in:
Bibliographic Details
Main Authors: Zou, Xinkai, Jiang, Xuan, Huang, Ruikai, He, Haoze, Kapoor, Parv, Wu, Hongrui, Wang, Yibo, Sha, Jian, Shi, Xiongbo, Huang, Zixun, Zhao, Jinhua
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2508.01844
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909823554027520
author Zou, Xinkai
Jiang, Xuan
Huang, Ruikai
He, Haoze
Kapoor, Parv
Wu, Hongrui
Wang, Yibo
Sha, Jian
Shi, Xiongbo
Huang, Zixun
Zhao, Jinhua
author_facet Zou, Xinkai
Jiang, Xuan
Huang, Ruikai
He, Haoze
Kapoor, Parv
Wu, Hongrui
Wang, Yibo
Sha, Jian
Shi, Xiongbo
Huang, Zixun
Zhao, Jinhua
contents Anomaly detection in cloud environments remains both critical and challenging. Existing context-level benchmarks typically focus on either metrics or logs and often lack reliable annotation, while most detection methods emphasize point anomalies within a single modality, overlooking contextual signals and limiting real-world applicability. Constructing a benchmark for context anomalies that combines metrics and logs is inherently difficult: reproducing anomalous scenarios on real servers is often infeasible or potentially harmful, while generating synthetic data introduces the additional challenge of maintaining cross-modal consistency. We introduce CloudAnoBench, a large-scale benchmark for context anomalies in cloud environments, comprising 28 anomalous scenarios and 16 deceptive normal scenarios, with 1,252 labeled cases and roughly 200,000 log and metric entries. Compared with prior benchmarks, CloudAnoBench exhibits higher ambiguity and greater difficulty, on which both prior machine learning methods and vanilla LLM prompting perform poorly. To demonstrate its utility, we further propose CloudAnoAgent, an LLM-based agent enhanced by symbolic verification that integrates metrics and logs. This agent system achieves substantial improvements in both anomaly detection and scenario identification on CloudAnoBench, and shows strong generalization to existing datasets. Together, CloudAnoBench and CloudAnoAgent lay the groundwork for advancing context-aware anomaly detection in cloud systems. Project Page: https://jayzou3773.github.io/cloudanobench-agent/
format Preprint
id arxiv_https___arxiv_org_abs_2508_01844
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Towards Generalizable Context-aware Anomaly Detection: A Large-scale Benchmark in Cloud Environments
Zou, Xinkai
Jiang, Xuan
Huang, Ruikai
He, Haoze
Kapoor, Parv
Wu, Hongrui
Wang, Yibo
Sha, Jian
Shi, Xiongbo
Huang, Zixun
Zhao, Jinhua
Artificial Intelligence
Anomaly detection in cloud environments remains both critical and challenging. Existing context-level benchmarks typically focus on either metrics or logs and often lack reliable annotation, while most detection methods emphasize point anomalies within a single modality, overlooking contextual signals and limiting real-world applicability. Constructing a benchmark for context anomalies that combines metrics and logs is inherently difficult: reproducing anomalous scenarios on real servers is often infeasible or potentially harmful, while generating synthetic data introduces the additional challenge of maintaining cross-modal consistency. We introduce CloudAnoBench, a large-scale benchmark for context anomalies in cloud environments, comprising 28 anomalous scenarios and 16 deceptive normal scenarios, with 1,252 labeled cases and roughly 200,000 log and metric entries. Compared with prior benchmarks, CloudAnoBench exhibits higher ambiguity and greater difficulty, on which both prior machine learning methods and vanilla LLM prompting perform poorly. To demonstrate its utility, we further propose CloudAnoAgent, an LLM-based agent enhanced by symbolic verification that integrates metrics and logs. This agent system achieves substantial improvements in both anomaly detection and scenario identification on CloudAnoBench, and shows strong generalization to existing datasets. Together, CloudAnoBench and CloudAnoAgent lay the groundwork for advancing context-aware anomaly detection in cloud systems. Project Page: https://jayzou3773.github.io/cloudanobench-agent/
title Towards Generalizable Context-aware Anomaly Detection: A Large-scale Benchmark in Cloud Environments
topic Artificial Intelligence
url https://arxiv.org/abs/2508.01844