Ask HN: Will LLM API costs be negligible in a year?

1 point by changisaac · 4 comments · 8/9/2025, 10:57:06 PM

Hi HN. We’re managing costs at my startup and by far our largest spend is on calls to Anthropic, OpenAI, etc. We’ve considered things like spinning up our own open source model but decided it’s not worth it considering we don’t even have PMF yet.

Optimistically though, I see that token prices for LLMs have been going down a lot in the past few years. Do you think that, if this continues, it'll eventually become a negligible expense? Or do you think we will forever be gouged by these foundation model companies? (: Much like how cloud computing has gone (AWS, GCP, etc.)

Comments (4)

ben_w · 3h ago

Define "negligible".

You need to know how much LLM output you need to get your product working, before you even know what you're hoping for regarding a target cost per million tokens. When you do get PMF, can some of the work be offloaded to a smaller and cheaper model? Can you determine this division of labour yet?

Consider also that "computer" used to be a job title, that since then the cost of doing computations has reduced by a factor of at least 1e14, and yet that you're only asking this question at all because you're still compute limited.
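To make the "target cost per million tokens" point concrete, here is a rough back-of-the-envelope sketch in Python. Everything in it is an assumption for illustration: the `ModelPrice` type, the function names, the request volume, the token counts, and especially the prices, which are made-up placeholders and not any provider's actual rates. It only shows how the spend changes once some share of requests is routed to a smaller, cheaper model.

```python
from dataclasses import dataclass


@dataclass
class ModelPrice:
    """Hypothetical prices in USD per million tokens (not real provider rates)."""
    input_per_mtok: float
    output_per_mtok: float


def monthly_cost(requests: float, in_tokens: int, out_tokens: int,
                 price: ModelPrice) -> float:
    """Spend in USD for a number of requests with fixed average token counts."""
    total_in = requests * in_tokens
    total_out = requests * out_tokens
    return (total_in * price.input_per_mtok
            + total_out * price.output_per_mtok) / 1_000_000


def blended_cost(requests: float, in_tokens: int, out_tokens: int,
                 large: ModelPrice, small: ModelPrice,
                 small_share: float) -> float:
    """Spend when a fraction `small_share` of requests goes to the small model."""
    small_requests = requests * small_share
    large_requests = requests - small_requests
    return (monthly_cost(large_requests, in_tokens, out_tokens, large)
            + monthly_cost(small_requests, in_tokens, out_tokens, small))


# Assumed workload: 100k requests/month, 2,000 input and 500 output tokens each.
large = ModelPrice(input_per_mtok=3.0, output_per_mtok=15.0)   # placeholder
small = ModelPrice(input_per_mtok=0.25, output_per_mtok=1.25)  # placeholder

all_large = monthly_cost(100_000, 2_000, 500, large)
mixed = blended_cost(100_000, 2_000, 500, large, small, small_share=0.6)

print(f"all requests on the large model: ${all_large:,.2f}")
print(f"60% offloaded to the small model: ${mixed:,.2f}")
```

With these placeholder numbers the all-large case comes to about $1,350 a month and the 60/40 split to about $607.50. Whether either figure counts as "negligible" depends on revenue per request, which is the quantity you would need to pin down first.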

changisaac · 2h ago

> and yet that you're only asking this question at all because you're still compute limited.

Very good point.

musbemus · 3h ago

If they do start to become unsustainable you might see more companies moving to a BYOK or usage-based billing model. If they do that, I don't know if the use cases for AI would justify the cost for consumers (but perhaps so for businesses). There's been a ton of data center build-out, so I do think the cost reduction we've seen so far may continue, but at the expense of more performant models. Hard to tell right now, though.

codingdave · 2h ago

At some point AI providers will need to break down profit/token and price accordingly. Right now, they are losing money to gain market share. Also, AI consumers will need to get the expense of AI into their own profit calculations.

Hard to say how it will play out, aside from the fact that both sides are going to strive to maximize their own benefit. Time will tell how the actual numbers balance out.

This is one reason why it matters whether or not the AI bubble is all hype. There is a non-trivial chance that once people truly figure out the monetary value of AI's help on their processes and cut out all hype-based use cases... their spending limits to reach that value might not match what the providers need to run the platforms.