PolyThink: A Multi-Agent AI System to Eliminate Hallucinations

8 kuberwastaken 9 4/17/2025, 8:55:04 AM
Excited to announce PolyThink Alpha's early access! Our multi-agent AI system fights hallucinations with consensus-driven, accurate answers from multiple models. I'd love for you to join the waitlist at https://www.polyth.ink/ as I'm planning to randomly roll out invites starting May. Feedback will shape our final launch! I'd love thoughts and suggestions too! What would you like to see here?

Comments (9)

stereo · 1d ago
Isn’t this basically the Swiss cheese model? If your two input AIs hallucinate, or your consensus AI misunderstands the input, you will still have confabulations in the output?
kuberwastaken · 1h ago
From all my testing, this never really happened even once honestly, plus the judge model (that I've kept strictly a reasoning model) also evaluates individually before "judging" the consensus.
TheKelsbee · 20h ago
I have this same thought, and have tried similar approaches.

OP: Have you trained or fine tuned a model that specifically reasons the worker model inputs against the user input? Or is this basically just taking a model and turning the temperature down to near 0?

kuberwastaken · 1h ago
Low temperature, heavy prompting to answer in a structured way. Sadly can't fine train models since this is API based but the approach does work!
sks38317 · 1d ago
I’m genuinely interested in how you arrived at the concept of using AI as a method to treat hallucinations. What inspired that approach?
kuberwastaken · 1h ago
Honestly, personal use cases. I am a STEM student and deal with a lot of "hard" questions that are about 60% of the time miscalculated by LLMs, I used to manually paste in approaches from say ChatGPT to DeepSeek and now grok and asked them what do you think is better. I created this out of necessity to automate this then realized how cool it can be if it scales further haha
tough · 19h ago
not op but LLM as Judge is a thing https://arxiv.org/abs/2411.15594
consumer451 · 1d ago
Very interesting. Will this be available as a meta model via API, allowing use in the coding tool of my choice?
kuberwastaken · 1h ago
Eventually yes, that's the plan! It's extremely good with code too, especially with more vague requests, tends to take about 2-3 rounds but almost always gets a great approach.