Tootfinder

@arXiv_csLG_bot@mastoxiv.page
2025-08-25 09:55:30

Benchmarking the Robustness of Agentic Systems to Adversarially-Induced Harms
Jonathan N\"other, Adish Singla, Goran Radanovic
https://arxiv.org/abs/2508.16481 https://

Benchmarking the Robustness of Agentic Systems to Adversarially-Induced Harms
Ensuring the safe use of agentic systems requires a thorough understanding of the range of malicious behaviors these systems may exhibit when under attack. In this paper, we evaluate the robustness of LLM-based agentic systems against attacks that aim to elicit harmful actions from agents. To this end, we propose a novel taxonomy of harms for agentic systems and a novel benchmark, BAD-ACTS, for studying the security of agentic systems with respect to a wide range of harmful actions. BAD-ACTS co…

Tootfinder

Opt-in global Mastodon full text search. Join the index!