GPT-4 at $24.7 per million tokens vs Mixtral at $0.24 - that's a 100x cost difference! Even if routing gets it wrong 20% of the time, the economics still work. But the real question is how you measure 'performance' - user satisfaction doesn't always correlate with technical metrics.
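The economics claimed above can be sanity-checked with a tiny expected-cost calculation. This is a sketch using the comment's own numbers ($24.7/M tokens for GPT-4, $0.24/M for Mixtral); the 20% figure is treated as the fraction of traffic a router escalates to the expensive model.

```python
# Back-of-envelope check of the routing economics (numbers from the comment).
GPT4_COST = 24.7      # $ per million tokens
MIXTRAL_COST = 0.24   # $ per million tokens

def expected_cost(escalation_rate: float) -> float:
    """Blended cost per million tokens if `escalation_rate` of traffic
    goes to GPT-4 and the remainder goes to Mixtral."""
    return escalation_rate * GPT4_COST + (1 - escalation_rate) * MIXTRAL_COST

# Even sending 20% of requests to the expensive model stays far cheaper
# than sending everything there.
print(expected_cost(0.20))              # 5.132
print(GPT4_COST / expected_cost(0.20))  # ~4.8x cheaper overall
```

So even a router that escalates a fifth of all queries cuts the blended cost roughly fivefold versus using GPT-4 for everything, which is the point the comment is making.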
FINDarkside · 11m ago
It's trivial to get about the same score as GPT-4 at 1% of the cost by using my proprietary routing algorithm, which routes all requests to Gemini 2.5 Flash.
Keyframe · 32m ago
number of complaints / million tokens?
pqtyw · 16m ago
> GPT-4 at $24.7 per million tokens
While technically true, why would you want to use it when OpenAI itself provides a bunch of models that are many times cheaper and better?
QuadmasterXLII · 37m ago
The framing in the headline is interesting. As far as I recall, spending 4x more compute on a model to improve performance by 7% is the move that has worked over and over again up to this point. 101% of GPT-4 performance (potentially at any cost) is what I would expect an improved routing algorithm to achieve.
spoaceman7777 · 22m ago
Incredible that they are using contextual bandits, and named it:
Preference-prior Informed Linucb fOr adaptive rouTing (PILOT)
Rather than the much more obvious:
Preference-prior Informed Linucb For Adaptive Routing (PILFAR)
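For readers unfamiliar with the acronym's middle term: LinUCB is a standard contextual-bandit algorithm, and a router built on it would treat each candidate model as an arm. The sketch below is not the paper's PILOT method (which adds a preference prior); it is a minimal, dependency-free illustration of plain LinUCB routing with a 2-dimensional query feature vector. All names (`cheap_model`, `big_model`, the features) are hypothetical.

```python
# Minimal LinUCB-style router sketch (d = 2 context features, e.g.
# [bias, estimated_query_difficulty]). Each arm keeps a ridge-regression
# estimate of reward and adds an optimism bonus for exploration.
import math

D = 2  # context dimension

def mat_vec(A, x):
    return [sum(A[i][j] * x[j] for j in range(D)) for i in range(D)]

def inv2x2(A):
    det = A[0][0] * A[1][1] - A[0][1] * A[1][0]
    return [[ A[1][1] / det, -A[0][1] / det],
            [-A[1][0] / det,  A[0][0] / det]]

class LinUCBArm:
    def __init__(self):
        self.A = [[1.0, 0.0], [0.0, 1.0]]  # ridge matrix: I + sum(x x^T)
        self.b = [0.0, 0.0]                # sum of reward-weighted contexts

    def ucb(self, x, alpha=1.0):
        A_inv = inv2x2(self.A)
        theta = mat_vec(A_inv, self.b)               # ridge estimate of reward
        mean = sum(t * xi for t, xi in zip(theta, x))
        Ax = mat_vec(A_inv, x)
        width = math.sqrt(sum(xi * axi for xi, axi in zip(x, Ax)))
        return mean + alpha * width                  # optimism in the face of uncertainty

    def update(self, x, reward):
        for i in range(D):
            for j in range(D):
                self.A[i][j] += x[i] * x[j]
            self.b[i] += reward * x[i]

arms = {"cheap_model": LinUCBArm(), "big_model": LinUCBArm()}

def route(x):
    """Pick the arm (model) with the highest upper confidence bound."""
    return max(arms, key=lambda name: arms[name].ucb(x))
```

After each routed request, you would observe a quality score (e.g. from human preference data) and call `update` on the chosen arm, so the router learns which model wins for which kinds of queries.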
fny · 51m ago
Is there a reason human preference data is even needed? Don't LLMs already have a strong enough notion of question complexity to build a dataset for routing?
delichon · 43m ago
> a strong enough notion of question complexity
A.k.a. wisdom. No, LLMs don't have that. Me neither; I usually have to step into the rabbit holes in order to detect them.
jibal · 20m ago
LLMs don't have notions ... they are pattern matchers against a vast database of human text.
mhh__ · 8m ago
Please do a SELECT * from this database
andrewflnr · 48m ago
Is this really the frontier of LLM research? I guess we really aren't getting AGI any time soon, then. It makes me a little less worried about the future, honestly.
kenjackson · 41m ago
First, I don't think we will ever get to AGI. Not because we won't see huge advances, but because AGI is a moving, ambiguous target that we will never reach consensus on.
But why does this paper impact your thinking on it? It is about budget and recognizing that different LLMs have different cost structures. It's not really an attempt to improve LLM performance measured absolutely.
yahoozoo · 26s ago
That, and LLMs are seemingly plateauing. Earlier this year, it seemed like the big companies were releasing noticeable improvements every other week. People would joke that a few weeks is "an eternity" in AI... so what time span are we looking at now?
srekhi · 43m ago
I'm not following this either. You'd think this would have been the frontier back in 2023.
jibal · 19m ago
LLMs are not on the road to AGI, but there are plenty of dangers associated with them nonetheless.
guluarte · 34m ago
I'm starting to think that there will not be an 'AGI moment'; we will simply build smarter machines over time until we look back and realize we have 'AGI'. It would be like video calls in the '90s: everybody wanted them, now everybody hates them, lmao.