Anthropic's SHADE-Arena: Evaluating sabotage and monitoring in LLM agents

4 thoughtpeddler 0 6/17/2025, 8:01:10 AM anthropic.com ↗

Comments (0)

No comments yet