Show HN: Agent Arena – crowdsourced testbed for evaluating AI agents in the wild

2 tejpalv8 0 6/24/2025, 5:03:49 PM twitter.com ↗
We just launched Agent Arena -- a crowdsourced testbed for evaluating AI agents in the wild.

Think Chatbot Arena, but for agents.

It’s completely free to run matches. We cover the inference.

I always find myself debating whether to use 4o or o3, but now I just try both on Agent Arena!

Try it out: https://obl.dev/

Comments (0)

No comments yet