CCProxy – Use any AI model with Claude Code (90% cost reduction)

3 seedblocks 1 7/22/2025, 3:10:18 PM ccproxy.orchestre.dev ↗

Comments (1)

seedblocks · 7h ago
Hi HN! I've been using Claude Code daily and while it's the best coding agent I've used, the costs were killing me ($30+/day). So I built CCProxy - a simple proxy that lets Claude Code work with any AI model.

## What it does

CCProxy translates Claude Code's API calls to work with any provider. One environment variable change and suddenly Claude Code can use:

- Gemini's free tier (1M token context!) - Kimi K2 (100x cheaper than Claude, same SWE-bench score) - Qwen3 235B (free, beats GPT-4 on reasoning) - Any OpenRouter model (100+ options) - Even local models via Ollama

## Technical details

It's a Go proxy that: - Translates Anthropic's API format to each provider's format - Handles streaming responses correctly - Preserves Claude Code's tool calling capabilities - Routes requests based on context size and task type

The code is clean, well-tested, and MIT licensed.

## Real impact

My usage pattern: - Simple tasks → Free models (Gemini Flash, Qwen3) - Code generation → Kimi K2 ($0.15/M vs $15/M) - Complex reasoning → Keep Claude for these

Result: 90% cost reduction, same productivity.

## Why open source?

Tools this fundamental should be free. The AI development revolution shouldn't be gatekept by pricing.

GitHub: https://github.com/orchestre-dev/ccproxy Docs: https://ccproxy.orchestre.dev

Would love your feedback. What models would you want to use with Claude Code?