“This telegram must be closely paraphrased before being communicated to anyone” (history.stackexchange.com)

Seeing the recent adjustment of Claude Code limits made me think about how the incentives of Anthropic and users are not necessarily aligned for Claude code. Anthropic probably errs on the side of sending too many tokens, while some users might be okay with risking lower quality output for the sake of getting more out of their Claude code plan. So, I developed VibeBooster, which is a proxy that intercepts and shorten the requests that Claude Code makes to the Anthropic API.

By using a cheaper (or even free) LLM, we can summarize, compress, or even outright delete some extraneous tokens that likely wouldn't help Claude Code's output. There are some pretty egregious wastes of your precious tokens that I found by inspecting the requests that were made, and VibeBooster can avoid that waste! From my casual testing, I've been able to save about *40%* of the input tokens sent to the Anthropic API via Claude Code on large messages. Check out the README.md if you want to see some examples of token savings.

The project is open source, and bring your own key. I think there's potential to further prompt tune this, and also flip the philosophy of the product by trying to improve the performance of Claude Code rather than trying to save tokens. I don't feel this is a pure utility; it's also a research project into how Claude Code works, and an art piece demonstrating the weird state LLM-powered products are at this point in time.

Please let me know any feedback or suggestions you might have, thanks!

Comments (0)

No comments yet

We should have the ability to run any code we want on hardware we own (hugotunius.se)

Cognitive load is what matters (github.com)

Ask HN: The government of my country blocked VPN access. What should I use?

Do the simplest thing that could possibly work (seangoedecke.com)

Next.js is infuriating (blog.meca.sh)

Making Minecraft Spherical (bowerbyte.com)

Google can keep its Chrome browser but will be barred from exclusive contracts (cnbc.com)

“This telegram must be closely paraphrased before being communicated to anyone” (history.stackexchange.com)

Updates to Consumer Terms and Privacy Policy (anthropic.com)

Google AI Overview made up an elaborate story about me (bsky.app)

Tesla said it didn't have key data in a fatal crash, then a hacker found it (washingtonpost.com)

Eternal Struggle (yoavg.github.io)

Notes on Managing ADHD (borretti.me)

Claude Code: Now in Beta in Zed (zed.dev)

Anthropic raises $13B Series F (anthropic.com)

MIT Study Finds AI Use Reprograms the Brain, Leading to Cognitive Decline (publichealthpolicyjournal.com)

Google: 'Your $1000 phone needs our permission to install apps now' [video] (youtube.com)

Bear is now source-available (herman.bearblog.dev)

Some users have noticed settings that let Meta analyze and retain phone photos (zdnet.com)

John Carmack's arguments against building a custom XR OS at Meta (twitter.com)

A staff engineer's journey with Claude Code (sanity.io)

Are OpenAI and Anthropic losing money on inference? (martinalderson.com)

Grok Code Fast 1 (x.ai)

Implementing a Foil Sticker Effect (4rknova.com)

An LLM is a lossy encyclopedia (simonwillison.net)

Claude Sonnet will ship in Xcode (developer.apple.com)

Where's the shovelware? Why AI coding claims don't add up (mikelovesrobots.substack.com)

We already live in social credit, we just don't call it that (thenexus.media)

Magic Lantern Is Back (magiclantern.fm)

Are we decentralized yet? (arewedecentralizedyet.online)

Six months into tariffs, businesses have no idea how to price anything (wsj.com)

The Little Book of Linear Algebra (github.com)

The web does not need gatekeepers: Cloudflare’s new “signed agents” pitch (positiveblue.substack.com)

The Synology End Game (lowendbox.com)

Aspects of modern HTML/CSS you may not be familiar with (lyra.horse)

Uncertain<T> (nshipster.com)

The staff ate it later (en.wikipedia.org)

AI adoption linked to 13% decline in jobs for young U.S. workers: study (cnbc.com)

If you have a Claude account, they're going to train on your data moving forward (old.reddit.com)

Jujutsu for everyone (jj-for-everyone.github.io)

Patrick Winston: How to Speak (2018) [video] (youtube.com)

%CPU utilization is a lie (brendanlong.com)

Some thoughts on LLMs and software development (martinfowler.com)

VibeVoice: A Frontier Open-Source Text-to-Speech Model (microsoft.github.io)

FreeDroidWarn (github.com)

Cloudflare Radar: AI Insights (radar.cloudflare.com)

Twitter Shadow Bans Turkish Presidential Candidate (utkusen.substack.com)

You don't want to hire "the best engineers" (otherbranch.com)

You Have to Feel It (mitchellh.com)

Essential Coding Theory [pdf] (cse.buffalo.edu)

Show HN: Get more out of your Claude Code plan with VibeBooster

Comments (0)