Benchmarking the Robustness of Agentic Systems to Adversarially-Induced HarmsJonathan N\"other, Adish Singla, Goran Radanovichttps://arxiv.org/abs/2508.16481 https://
Benchmarking the Robustness of Agentic Systems to Adversarially-Induced HarmsEnsuring the safe use of agentic systems requires a thorough understanding of the range of malicious behaviors these systems may exhibit when under attack. In this paper, we evaluate the robustness of LLM-based agentic systems against attacks that aim to elicit harmful actions from agents. To this end, we propose a novel taxonomy of harms for agentic systems and a novel benchmark, BAD-ACTS, for studying the security of agentic systems with respect to a wide range of harmful actions. BAD-ACTS co…