One man cost American Airlines £21M using his lifetime first class air pass (aerotime.aero)

Approach is analogous to Grok 4 Heavy: use multiple "reasoning" agents in parallel and then compare answers before coming back with a single response, taking ~30 minutes. Great results, though it would be more fair for the benchmark comparisons to be against Grok 4 Heavy rather than Grok 4 (the fast, single-agent model).

stingraycharles · 36m ago

Yeah the general “discovery” is that using the same reasoning compute effort, but spreading them over multiple different agents generally leads to better results.

It solves the “longer thinking leads to worse results” problem by approaching multiple paths of thinking in parallel, but just not think as long.

lynx97 · 28m ago

I am surprised such a simple approach has taken so long to be actually used. My first image description cli attempt did basically that: Use n to get several answers and another pass to summarize.

cinntaile · 25m ago

It's very resource intensive so maybe they had to wait until processes got more efficient? I can also imagine they would want to try and solve it in a... better way before doing this.

simianwords · 20m ago

I agree but I think its hard to get a sufficient increase in performance that would justify 3-4x increase in cost.

amatic · 46m ago

At the moment, Deep Think is only available with the ULTRA subscription ($250 per month).

siva7 · 8m ago

Is it available in EU? Someone can confirm?

stingraycharles · 22m ago

It’s not available through the API?

simianwords · 16m ago

Grok 4 heavy, o3 pro and Gemini Deep Think all are equivalent. I wonder how they compare?

No comments yet

cmrdporcupine · 52m ago

Why can't this submission be replaced with the actually-public official link at https://blog.google/products/gemini/gemini-2-5-deep-think/ instead of the one hosted on the members only, controversial, X?

stingraycharles · 49m ago

It should be replaced indeed, but it’s only just reaching the front page, so it hasn’t been moderated yet.

simianwords · 44m ago

i can't find it in EU with the ultra subscription

lynx97 · 30m ago

Wait...

So if someone cool enough, they could actually give us a DeepThought model?

Please, let that happen.

Vendor-DeepThought-42B maybe?

One man cost American Airlines £21M using his lifetime first class air pass (aerotime.aero)

The Grand Encyclopedia of Eponymous Laws (secretorum.life)

Understanding Node.js Event Loop: The Heart of Asynchronous JavaScript (medium.com)

Google is indexing ChatGPT conversations, potentially exposing user data (fastcompany.com)

Public ChatGPT Queries Are Indexed by Google (techcrunch.com)

A.I. Researchers Are Negotiating $250M Pay Packages. Just Like NBA Stars (nytimes.com)

Motion – using frames difference online (hiddenmotion.priyavkaneria.com)

NixOS and Flakes Book (nixos-and-flakes.thiscute.world)

Style Guide to AI: Generate Websites with Your Exact Design Vibe (stylespark.dev)

A Smarter Docusign (doclair.io)

Show HN: HydraFlow – Type-safe ML experiment tracking with Hydra and MLflow (github.com)

CI in the Age of AI (dagger.io)

Rabbit Rabbit Rabbit (en.wikipedia.org)

Why LLMs Struggle with Text-to-SQL (selectstar.com)

MAME 0.279 (mamedev.org)

Functorizing Large Collections of Modules (inbox.vuxu.org)

The Thermodynamics of Trading (signalsandthreads.com)

Massachusetts to offer discounted electric rates to heat pump owners this winter (wbur.org)

Linus still uses an RX580 and ditches Apple Silicon for an Intel laptop (tomshardware.com)

Enterprise software giants weaponize AI to kill discounts and deepen lock-in (theregister.com)

Accessing the Kubernetes API from SQL Server 2025 (dbafromthecold.com)

The Garden Token Factory – Reinventing Token Launch Stack (taikai.network)

In California, an invasive mustard is destabilizing desert plant communities (news.mongabay.com)

Reddit wants to be a search engine now (theverge.com)

GPT-5 is already (ostensibly) available via API (old.reddit.com)

Show HN: I built a YouTube Thumbnail generator with your face (newhero.ai)

Meschers: Geometry Processing of Impossible Objects (dl.acm.org)

Harvard Business School Pricing Lab Tariff Tracker (pricinglab.org)

Record-breaking baby born from oldest ever embryo (news.sky.com)

AI Quiz Maker (minform.io)

Delusions of Grandeur Go South (paulkrugman.substack.com)

The Rule of Law Is Dead in the US (thenation.com)

AI that splits bills from a photo and voice (multilingual)

The Making of Amazon Prime (2019) (vox.com)

SuperClaude v3 – Advanced Development Framework for Claude Code (superclaude-org.github.io)

Ask HN: How does MCP tool calling work?

Portfolio Builder – Create Professional Portfolio in 2 Minutes with GitHub (portfoliomatic-builder.vercel.app)

Andrew Ng and Yann LeCun: US Is Losing AI Race Due to Closed Models (haebom.dev)

Ask HN: Which software companies hire people in Africa for remote work?

Tim Cook Has Now Been Apple's CEO for Longer Than Steve Jobs (macrumors.com)

What I have learned about startups from building my own

Live coding interviews measure stress, not coding skills (hadid.dev)

NASA intern loses job after profanity-laced tweet to Space Council fellow (the-independent.com)

Global ocean simulations examine tritium release from Fukushima (physicsworld.com)

Google knows who visited. Stripe knows who paid. I built the missing link (getboone.com)

Ask HN: Is manually discovering and configuring MCP servers the only way?

Study mode and spaced repetition. Feature requests, Monetization ideas? (app.polymax.ai)

U.S. hiring was weak in July, with 73,000 jobs added (wsj.com)

IRS chief says agency plans to end free filing program (cnbc.com)

Gemini Deep Think – the model that won IMO is available (twitter.com)

Gemini 2.5 Deep Think

Comments (13)