Show HN: Sleipner.ai – Cut Your LLM Costs by 40-70% (Private Beta)
Here's how it works:
Intelligent model routing: Automatically selects the smallest (cheapest) model that can effectively handle your prompt.
Prompt compression: Strips unnecessary filler, reducing tokens without changing meaning.
Semantic caching: Answers repeated or similar queries instantly, with no model call needed.
Real-time analytics: Detailed insights into routing decisions, costs, latency, and token usage.
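To make the semantic-caching idea concrete, here is a minimal sketch of the pattern (not Sleipner's actual implementation): embed each prompt, and if a new prompt is close enough to a cached one, return the cached answer instead of calling a model. The toy bag-of-words "embedding" and the 0.8 threshold are illustrative stand-ins; a real system would use a proper sentence-embedding model and a vector index.

```python
import math
from collections import Counter

def embed(text: str) -> Counter:
    # Toy bag-of-words "embedding" for illustration only; a production
    # semantic cache would use a learned sentence-embedding model.
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    # Cosine similarity between two sparse term-count vectors.
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

class SemanticCache:
    def __init__(self, threshold: float = 0.8):
        self.threshold = threshold   # similarity needed to count as a "hit"
        self.entries = []            # list of (embedding, answer) pairs

    def get(self, prompt: str):
        e = embed(prompt)
        for cached_e, answer in self.entries:
            if cosine(e, cached_e) >= self.threshold:
                return answer        # near-duplicate query: skip the model call
        return None                  # miss: caller falls through to the model

    def put(self, prompt: str, answer: str):
        self.entries.append((embed(prompt), answer))

cache = SemanticCache()
cache.put("what is the capital of france", "Paris")
print(cache.get("what is the capital of france?"))      # close paraphrase: hit
print(cache.get("how do I sort a list in python"))      # unrelated: miss
```

The cost win comes from the hit path: a cache lookup is microseconds and free, versus a paid, multi-hundred-millisecond model call.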
Early adopters are consistently seeing their total LLM spend drop by 40-70% while keeping response times under a second. Integration requires no prompt or SDK changes: just swap one base URL and add your existing API key.
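For a sense of what "swap one base URL" means in practice, here is a sketch of an OpenAI-style chat-completions request where only the base URL changes; the payload and the Authorization header stay exactly as they were. The `api.sleipner.ai/v1` endpoint shown is an assumed placeholder, not a documented URL, and the request is only built here, not sent.

```python
import json
import urllib.request

# Assumed endpoint for illustration; previously this would have been
# something like "https://api.openai.com/v1".
BASE_URL = "https://api.sleipner.ai/v1"

def build_chat_request(api_key: str, model: str, messages: list) -> urllib.request.Request:
    # Same body and headers as a direct provider call; only BASE_URL differs.
    body = json.dumps({"model": model, "messages": messages}).encode()
    return urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=body,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )

req = build_chat_request(
    "sk-...",  # your existing provider API key, unchanged
    "gpt-4o",
    [{"role": "user", "content": "Hello"}],
)
print(req.full_url)
```

With an official SDK the equivalent change is typically a single constructor argument (e.g. the `base_url` parameter in the OpenAI Python client) rather than hand-built requests.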
Our pricing is transparent and risk-free: you pay only 25% of the savings we deliver. If we don't save you money, you pay nothing.
We're looking for a few more teams for our private beta. If your AI costs are climbing faster than you'd like, let's talk.
More info: https://sleipner.ai
Feedback and questions very welcome!