Show HN: Agent Arena – crowdsourced testbed for evaluating AI agents in the wild
2 tejpalv8 0 6/24/2025, 5:03:49 PM twitter.com ↗
We just launched Agent Arena -- a crowdsourced testbed for evaluating AI agents in the wild.
Think Chatbot Arena, but for agents.
It’s completely free to run matches. We cover the inference.
I always find myself debating whether to use 4o or o3, but now I just try both on Agent Arena!
Try it out: https://obl.dev/
No comments yet