AdaptiQ Core – Optimize your LLM agents with RL and save 30% on tokens

adaptiq · 7/21/2025, 5:20:47 PM · github.com

Comments (2)

adaptiq · 6h ago
Thanks for checking out AdaptiQ!

Here’s what’s next in the coming days:

Today: AdaptiQ Core (what you see now)
– CLI-based prompt/agent optimizer
– Reinforcement Learning loop (offline Q-table)
– Token/cost/CO₂ tracking per run (sketch below)
– Markdown + badge reporting
– Works with CrewAI + OpenAI (GPT-4, GPT-3.5)
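
On the token/cost/CO₂ tracking: the per-run accounting is essentially bookkeeping over token counts. A back-of-the-envelope sketch (the prices and emissions factor below are placeholder assumptions, not our actual figures):

```python
def run_footprint(prompt_tokens: int, completion_tokens: int,
                  usd_per_1k_in: float = 0.01, usd_per_1k_out: float = 0.03,
                  grams_co2_per_1k: float = 0.2) -> dict:
    """Estimate cost and CO₂ for one agent run from its token counts.

    All rates here are placeholder assumptions; plug in your provider's
    pricing and your preferred emissions factor.
    """
    total = prompt_tokens + completion_tokens
    cost = (prompt_tokens / 1000 * usd_per_1k_in
            + completion_tokens / 1000 * usd_per_1k_out)
    return {"tokens": total,
            "usd": round(cost, 4),
            "g_co2": round(total / 1000 * grams_co2_per_1k, 4)}

print(run_footprint(900, 300))  # {'tokens': 1200, 'usd': 0.018, 'g_co2': 0.24}
```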

Next Week: AdaptiQ ACE – HTTP Proxy Edition
– Drop-in FinOps proxy for Claude, Gemini, GPT
– Rewrites prompts on-the-fly
– Tracks latency, compile pass, retries
– GitHub Action: block PR if cost/test fails
– RL reward = quality − β·tokens − γ·latency (sketch below)
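
To make that reward concrete, here is a rough sketch of how a run could be scored (the weights and example numbers are placeholders, not ACE's actual defaults):

```python
def reward(quality: float, tokens: int, latency_s: float,
           beta: float = 0.001, gamma: float = 0.1) -> float:
    """Reward = quality − β·tokens − γ·latency.

    quality:    task score in [0, 1] (e.g. tests passed, style check)
    tokens:     total prompt + completion tokens for the run
    latency_s:  end-to-end latency in seconds
    beta/gamma: illustrative weights, tune per project
    """
    return quality - beta * tokens - gamma * latency_s

# Example: a run that scores 0.9 on quality but burns 1,200 tokens in 4 s
print(reward(0.9, 1200, 4.0))  # 0.9 - 1.2 - 0.4 = -0.7
```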

What we’re solving:

> Agents fail silently, burn credits, drift from style guides.
> AdaptiQ gives your LLM prompts a feedback loop.

We’re building this in the open: roadmap, CLI, and future trace-spec are all public.

Questions? Feedback? Want support for LangChain / Autogen / Mistral? Let us know below – we’d love to expand!

Also: if you drop your prompt logs (token usage + outcome), we can pre-train the Q-table for your setup.

Cheers – Wassim / AdaptiQ team

adaptiq · 7h ago
Hi HN!

We just open-sourced [AdaptiQ Core](https://github.com/adaptiq-ai/adaptiq), a CLI tool that uses reinforcement learning (Q-learning) to optimize your LLM agents and reduce token usage, retries, and failed outputs.

It observes your agent runs, builds a local Q-table, and learns how to improve prompts/configs — all offline.
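
For the curious, the core of that loop is standard tabular Q-learning. A minimal sketch (the state/action encoding and hyperparameters below are illustrative placeholders, not our exact internals):

```python
import random
from collections import defaultdict

# Q[state][action] -> expected reward. Illustratively, a "state" could be a
# prompt configuration and an "action" a candidate edit to it.
Q = defaultdict(lambda: defaultdict(float))

ALPHA, GAMMA, EPSILON = 0.1, 0.9, 0.2  # learning rate, discount, exploration

def choose_action(state, actions):
    """Epsilon-greedy: mostly exploit the best-known edit, sometimes explore."""
    if random.random() < EPSILON or not Q[state]:
        return random.choice(actions)
    return max(Q[state], key=Q[state].get)

def update(state, action, reward, next_state):
    """One Q-learning update from a logged agent run."""
    best_next = max(Q[next_state].values(), default=0.0)
    Q[state][action] += ALPHA * (reward + GAMMA * best_next - Q[state][action])
```

Replaying logged runs through updates like this is what lets the table improve offline, with no extra API calls during training.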

---

*What it does:*

• Prompt & agent optimizer (crewAI-compatible)
• RL loop (offline Q-learning)
• Pre-run cost prediction
• FinOps reporting (token, $ and CO₂)
• Markdown reports + GitHub badge
• Works via CLI (`wizard`, `validate`, `report`)

---

*FinOps in Action*

• Token usage: –37%
• Retries: –60%
• Compile-pass rate: +15 pts

GitHub: https://github.com/adaptiq-ai/adaptiq