Show HN: DesignArena – crowdsourced benchmark for AI-generated UI/UX
48 grace77 16 7/12/2025, 3:07:46 PM designarena.ai ↗
I’ve been using AI to generate some repetitive frontend (guilty), and while most outputs felt vibe-coded, some results were surprisingly good. So I cleaned it up and made a ranking game out of it with friends, and you can check it out here: https://www.designarena.ai/vote
/vote: Your prompt will be answered by four random, anonymous models. You pick the one you prefer and crown the winner, tournament-style.
/leaderboard: See the current winning models, as dictated by voter preferences.
/play: Iterate quickly by seeing four models respond to the same input and pressing space to regenerate the results you don’t lock-in.
We were especially impressed with the quality of DeepSeek and Grok, and variance between categories (To judge by the results so far, OpenAI is very good for game dev, but seems to suck everywhere else).
We’ve learned a lot, and are curious to hear your comments and questions. Excited to make this better!
But this could be a legitimate way to design apps in general if you could tell the models what you liked and didn't like.
To preserve the voter experience without introducing bias, our current approach waits for the slowest model within each binary comparison — so even if one model is faster, we don’t display until both are ready. You're right that this does introduce some bias for the two smallest models, and we'd love to hear suggestions for how to make this better!
As for the 5th request: we actually kick off one reserve model alongside the four randomly selected for the tournament. This backup isn’t shown unless one of the four fails — it’s not the fastest or lowest-latency model, just a randomly selected fallback to keep the system robust without skewing results.
Do launch on https://www.superlaun.ch for more traffic and exposure for your web app.