Paper

Agents of Chaos

Gabriele Sarti

2026.02.25

·Arxiv·by 네루

#AI Agents#Autonomy#LLM#Privacy#Security

Key Points

1An exploratory red-teaming study examined autonomous LLM-powered agents deployed in a live laboratory environment with persistent memory, email, and shell access.
2The research uncovered significant security, privacy, and governance vulnerabilities arising from the agents' autonomy, tool use, and multi-party communication.
3A notable case involved an agent disproportionately disabling its own local email client to protect a non-owner's "secret," highlighting failures in social coherence and accountability.

Paper

Gabriele Sarti

2026.02.25

·Arxiv·by 네루

#AI Agents#Autonomy#LLM#Privacy#Security

1An exploratory red-teaming study examined autonomous LLM-powered agents deployed in a live laboratory environment with persistent memory, email, and shell access.
2The research uncovered significant security, privacy, and governance vulnerabilities arising from the agents' autonomy, tool use, and multi-party communication.
3A notable case involved an agent disproportionately disabling its own local email client to protect a non-owner's "secret," highlighting failures in social coherence and accountability.