Show HN: Aisir – AI models deliberate and critique each other like a council

Comments (1)

esamust · 4h ago

Hi HN,

I'm esamust, the creator behind Aisir.

I built Aisir because I often found myself wanting multiple AI perspectives for complex coding/analysis tasks and was kind of frustrated by the blindspots of single models. So then the natural thing to try was, what if rather than just queueing each model in one go they actually talk to each other and work together to solve the problem.

Aisir uses a "council" approach where agents (currently Gemini, Claude Sonnet, o4-mini, WebSearcher (Gemini based)) deliberate on a query over a number of rounds. A moderator agent guides the discussion, chooses what to do next, identifies issues, and pushes for refinement before a final answer is synthesized. This is obviously overkill on simple queries but I would speculate (obviously biased) that it could beat price per token on the some very complicated queries vs like o1-pro as multiple models working together have a quicker way in token terms to find the right answer than a single model. This is based on my anecdotal experiments vs singular models.

So to explain, the difficult (and very limited) benchmarking I've done on it is using the epoch.ai FrontierMath examples to make sure the result would atleast not be worse than just using the best singular model but turns out it's more likely to answer correctly than any single model. This is slightly obvious in the sense that if a model can't answer one specific question, another one might be able to instead even if they don't talk to eachother. The next test would be to see if there are specific problems that none of the best models can answer individually but can be solved using this council method. Let me know if you find any by hand.

You can see the 'thinking' process unfold, showing each agent's contribution and the moderator's comments.

This is very much an experiment and currently free to use. Running these models is expensive over time, so if it gets traction, I might need to add limits/subscriptions later, but for now, I'm focused on seeing if the core idea is useful. I'd love to get your feedback, especially on: a) Does the multi-agent approach yield better results for you? b) What kinds of complex problems would this be most useful for? c) Any suggestions for improving? There's a lot of optimization I see with token spent etc but I want to see if this is interesting or valuable to anyone other than me.

Link to the tool: https://aisirai.com

Show HN: Create your own finetuned AI model using Google Sheets (promptrepo.com)

Show HN: ART – a new open-source RL framework for training agents (github.com)

Show HN: Kexa.io – Open-Source IT Security and Compliance Verification

Show HN: I built a fun AI tour guide into Google Street View (streetwhip.com)

Show HN: Jarvis-AI, an AI Agents network that kills admin work in big corporate (github.com)

Show HN: Beatsync – perfect audio sync across multiple devices (github.com)

Show HN: An MCP server for understanding AWS costs

Show HN: AgenticSeek – Self-hosted alternative to cloud-based AI tools (github.com)

Show HN: Typeconf – Dynamic Configs in TypeScript (github.com)

Show HN: An interactive demo of QR codes' error correction (qris.cool)

Show HN: The $300K DevinAI Secret is Now Open Source (github.com)

Show HN: Aisir – AI models deliberate and critique each other like a council (aisirai.com)

Show HN: A Chrome extension that will auto-reject non-essential cookies (blog.bymitch.com)

Show HN: I built a hardware processor that runs Python (runpyxl.com)

Show HN: Sim Studio – Open-Source Agent Workflow GUI (github.com)

Show HN: Web Tool to Create a Universal Database MCP Server (centralmind.ai)

Show HN: Prettier Email Headers (emailheaders.dev)

Show HN: Open-source sound effects and react library to spice up your website (reactsounds.com)

Show HN: Heart Rate Zones Plus – The first iOS app I developed (apps.apple.com)

Show HN: Flowcode – Turing-complete visual programming platform (app.getflowcode.io)

Show HN: Web-eval-agent – Let the coding agent debug itself (github.com)

Show HN: Daily Digest of the Least Popular Posts on Hacker News (leastpopular.io)

Show HN: A pure WebGL image editor with filters, crop and perspective correction (github.com)

Show HN: I486SX_soft_FPU – Software FPU Emulator for NetBSD 10 on 486SX (github.com)

Show HN: Built a API that returns your GitHub Contribution chart (github.com)

Show HN: CodeClarity – an open source source code analysis platform (codeclarity.io)

Show HN: Neurox – GPU Observability for AI Infra (github.com)

Show HN:I Open Sourced Deepwiki (github.com)

Show HN: Daily Jailbreak – Prompt Engineer's Wordle (vaultbreak.ai)

Show HN: A Common Lisp implementation in development, supports ASDF (savannah.nongnu.org)

Show HN: Autarkie – Instant grammar fuzzing using Rust macros (github.com)

Show HN: My self-written hobby OS is finally running on my vintage IBM ThinkPad (github.com)

Show HN: I created snapDOM to capture DOM nodes as images with exceptional speed (github.com)

Show HN: I made a web-based, free alternative to Screen Studio (screenrecorder.me)

Show HN: Tariff Calculator for Amazon (twitter.com)

Show HN: Generate discord timestamp that converts to each user's local timezone (discordtimestamp.cc)

Show HN: I built an AI that turns GitHub codebases into easy tutorials (github.com)

Show HN: Remote-Controlled IKEA Deathstar Lamp (gitlab.com)

Show HN: Rad Type - Can we make gamepad typing fast? (tyleo.com)

Show HN: Bhvr, a Bun and Hono and Vite and React Starter (bhvr.dev)

Show HN: GS-Calc – A modern spreadsheet with Python integration (citadel5.com)

Show HN: Neuro Tools, a collection of tools to help neurodivergent people (neurotools.app)

Show HN: POC to scrape and structure HTML into JSON for RAG (structured.pages.dev)

Show HN: Colanode, open-source and local-first Slack and Notion alternative (github.com)

Show HN: A Chrome extension to open a link without leaving your real footprints (chromewebstore.google.com)

Show HN: NanoAgent, zero-dependency 1k-LOC AI-agent runtime (github.com)

Show HN: I made an app to learn guitar scales (guitartonic.com)

Show HN: Discorss – RSS Feeds for Discord (discorss.fldr.zip)

Show HN: Rowboat – Open-source IDE for multi-agent systems (github.com)

Show HN: Auto-fix your GitHub PR issues with Proton for FREE (proton.codes)

Show HN: Aisir – AI models deliberate and critique each other like a council

Comments (1)