How does the US use water? (construction-physics.com)
225 points by juliangamble 1d ago 171 comments
Show HN: OS X Mavericks Forever (mavericksforever.com)
384 points by Wowfunhappy 4d ago 173 comments
Making LLMs Cheaper and Better via Performance-Efficiency Optimized Routing
64 omarsar 16 8/22/2025, 2:43:31 PM arxiv.org ↗
I've thought for a while that ensembling approaches would become the next stage of LLM development after CoT, since it provides yet another effective, independent axis for scaling laws. Great to see that perspective is taking off. The open weight community has an opportunity to take these ideas and run with them better than OpenAI has.
Yeah, the signals they get will improve things over time. You can do a lot of heavy lifting with embedding models nowadays, get "satisfaction" signals from chats, and adjust your router based on those. It will be weird at first, some people will complain, but at the end of the day, you don't need imo-gold levels of thinking to write a fitness plan that most likely the user won't even follow :)
Signal gathering is likely the driver of most of the subsidised model offerings we see today.
Also the paper has some pie chart crimes on page 6.