Ask HN: How expensive are LLMs to query, really?
5 points by teach on 5/9/2025, 7:58:32 PM | 3 comments
I'm starting to see things pop up from well-meaning people worried about the environmental cost of large language models. Just yesterday I saw a meme on social media claiming that "ChatGPT uses 1-3 bottles of water for cooling for every query you put into it."
This seems unlikely to me, but what is the truth?
I understand that _training_ an LLM is very, very expensive. (Although so is spinning up a fab for a new CPU.) But it seems to me that the incremental cost of querying a model should be relatively low.
I'd love to see your back-of-the-envelope calculations for how much water and especially how much electricity it takes to "answer a single query" from, say, ChatGPT, Claude-3.7-Sonnet or Gemini Flash. Bonus points if you compare it to watching five minutes of a YouTube video or doing a Google search.
Links to sources would also be appreciated.
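For concreteness, here's the shape of the estimate I have in mind, as a minimal Python sketch. Every input is a placeholder assumption (accelerator power draw, queries served per GPU-hour, datacenter PUE overhead), not a measurement; the only real reference point is the widely cited ~0.3 Wh per Google search figure from a 2009 Google blog post:

    # Back-of-the-envelope electricity per LLM query. All inputs are
    # illustrative assumptions, not measured values.
    def wh_per_query(gpu_power_w: float, queries_per_gpu_hour: float,
                     pue: float = 1.2) -> float:
        """Watt-hours attributable to one query, including datacenter
        overhead (PUE = power usage effectiveness)."""
        return gpu_power_w * pue / queries_per_gpu_hour

    # Hypothetical serving setup: one 700 W accelerator answering
    # 1,000 queries per hour thanks to batching.
    llm_wh = wh_per_query(gpu_power_w=700, queries_per_gpu_hour=1000)

    GOOGLE_SEARCH_WH = 0.3  # widely cited 2009 Google figure, for scale

    print(f"LLM query: ~{llm_wh:.2f} Wh")                        # ~0.84 Wh
    print(f"= {llm_wh / GOOGLE_SEARCH_WH:.1f} Google searches")  # ~2.8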
https://www.sustainabilitybynumbers.com/p/carbon-footprint-c...
https://andymasley.substack.com/p/a-cheat-sheet-for-conversa...
(discussion on lobste.rs - https://lobste.rs/s/bxixuu/cheat_sheet_for_why_using_chatgpt...)
(discussion on HN, 320 comments: https://news.ycombinator.com/item?id=42745847)
So... picking some numbers for a calculation: 4 answers per minute at 120 watts works out to about 0.5 watt-hours per answer, so ~200 responses would be enough to drain a typical ~100 Wh laptop battery (which is normally quite long-lasting).
How does that compare to the more common Nvidia GPUs? I don't know.
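As a sanity check on the arithmetic, and a sketch of what the same formula gives for a datacenter card, here's a minimal Python snippet. The 120 W and 4-answers-per-minute inputs are my guesses from above; the Nvidia-side power and throughput numbers are pure placeholders, since real serving throughput depends heavily on batch size:

    # Energy per answer from power draw and throughput.
    def wh_per_answer(power_w: float, answers_per_minute: float) -> float:
        # One answer takes 1/answers_per_minute minutes = that/60 hours.
        return power_w / (answers_per_minute * 60)

    local = wh_per_answer(power_w=120, answers_per_minute=4)
    print(f"local model: {local:.2f} Wh/answer")             # 0.50 Wh
    print(f"answers per 100 Wh battery: {100 / local:.0f}")  # 200

    # Hypothetical datacenter GPU: 350 W board power, but serving
    # 20 batched answers per minute -- assumed, not measured.
    dc = wh_per_answer(power_w=350, answers_per_minute=20)
    print(f"datacenter GPU (assumed): {dc:.2f} Wh/answer")   # ~0.29 Wh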