Approach is analogous to Grok 4 Heavy: use multiple "reasoning" agents in parallel and then compare answers before coming back with a single response, taking ~30 minutes. Great results, though it would be more fair for the benchmark comparisons to be against Grok 4 Heavy rather than Grok 4 (the fast, single-agent model).
stingraycharles · 36m ago
Yeah the general “discovery” is that using the same reasoning compute effort, but spreading them over multiple different agents generally leads to better results.
It solves the “longer thinking leads to worse results” problem by approaching multiple paths of thinking in parallel, but just not think as long.
lynx97 · 28m ago
I am surprised such a simple approach has taken so long to be actually used. My first image description cli attempt did basically that: Use n to get several answers and another pass to summarize.
cinntaile · 25m ago
It's very resource intensive so maybe they had to wait until processes got more efficient? I can also imagine they would want to try and solve it in a... better way before doing this.
simianwords · 20m ago
I agree but I think its hard to get a sufficient increase in performance that would justify 3-4x increase in cost.
amatic · 46m ago
At the moment, Deep Think is only available with the ULTRA subscription ($250 per month).
siva7 · 8m ago
Is it available in EU? Someone can confirm?
stingraycharles · 22m ago
It’s not available through the API?
simianwords · 16m ago
Grok 4 heavy, o3 pro and Gemini Deep Think all are equivalent. I wonder how they compare?
It solves the “longer thinking leads to worse results” problem by approaching multiple paths of thinking in parallel, but just not think as long.
No comments yet
So if someone cool enough, they could actually give us a DeepThought model?
Please, let that happen.
Vendor-DeepThought-42B maybe?