Enforcing Benign Trajectories: A Behavioral Firewall for Structured-Workflow AI Agents
9/10A 2026 paper proposes a behavioral firewall named 'codename' for structured-workflow AI agents powered by large language models executing tool calls in sensitive environments. This anomaly detection system uses sequence-based intrusion detection to enforce safe behavior and prevent malicious activities in AI agent workflows.
