I'm a solo researcher (and second-year law student) building tools at the intersection of information theory and control systems for AI/ML. Inspired by Claude Shannon's work at Bell Labs, I created the Shannon Control Unit (SCU): cruise control for neural network training.
SCU senses the information ratio during training and auto-adjusts λ via a PI controller, so information is introduced steadily and efficiently.
The mechanism dynamically maintains a target Shannon Information Ratio, S = ParamBPT / (DataBPT + ParamBPT), where BPT is bits per token.
No more manual hyperparameter tuning: it self-regulates λ for stability under data drift and faster generalization.
Core formula: adjust λ via λ_new = λ · exp(-(Kp·error + Ki·I)), where error is the gap between the measured S and its target and I is the running integral of that error.
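In code, one controller step looks roughly like this. The gain values and the sign convention (error = target minus measurement, the usual PI setup) are my assumptions for illustration, not necessarily what ships in the repo:

```python
import math

def scu_step(lam, data_bpt, param_bpt, s_target, integral, kp=0.8, ki=0.1):
    """One SCU controller step (sketch; gains and sign convention are illustrative)."""
    s = param_bpt / (data_bpt + param_bpt)          # Shannon Information Ratio
    error = s_target - s                            # setpoint minus measurement (assumed convention)
    integral += error                               # accumulated integral term
    lam *= math.exp(-(kp * error + ki * integral))  # multiplicative PI update from the post
    return lam, integral
```

With this convention, when S runs above target, λ grows and squeezes ParamBPT back down (assuming λ weights the parameter-bits term in the objective).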
An ablation shows the adaptive PI controller outperforms a fixed λ by up to 1.8% BPT.
Validated on Llama-3.2:
- 1B: -15.6% perplexity (15.14 → 12.78), -6.2% BPT
- 3B: -12.6% perplexity (3.56 → 3.11), -10.6% BPT
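To make the "adaptive vs. fixed λ" comparison concrete, here's a self-contained toy simulation of the controller pulling S toward a target. The 5% target, the gains, and the toy response model (ParamBPT shrinking as λ grows) are all illustrative assumptions, not measured values:

```python
import math

S_TARGET = 0.05     # illustrative target ratio; the post doesn't state the value used
KP, KI = 5.0, 0.1   # illustrative gains
lam, integral = 1.0, 0.0
data_bpt = 3.5      # toy stand-in for measured data bits/token

for step in range(20):
    param_bpt = 0.4 / lam                   # toy response: a stronger penalty shrinks param bits
    s = param_bpt / (data_bpt + param_bpt)  # measured info-ratio
    error = S_TARGET - s
    integral += error
    lam *= math.exp(-(KP * error + KI * integral))
    print(f"step {step:2d}  S={s:.4f}  lambda={lam:.3f}")
```

A fixed-λ baseline is the same loop with the `lam *= ...` line deleted; S then just sits wherever the initial λ puts it.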
It's open-source under AGPL-3.0 (for those who want to build on it while sharing improvements back). Implemented as LoRA adapters via PEFT/Transformers; load them on Meta's base models.
Quick start:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

# Load the Llama-3.2-1B base model and tokenizer
tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-3.2-1B")
model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-3.2-1B")

# Apply the SCU-trained LoRA adapter on top of the base weights
model = PeftModel.from_pretrained(model, "hunterbown/shannon-control-unit")
```
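Once the adapter is loaded, a quick generation sanity check (the prompt here is arbitrary):

```python
# Continues from the quick start above
inputs = tokenizer("Information theory tells us that", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```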
Try the Colab demo: https://colab.research.google.com/github/Hmbown/shannon-cont...
HF space: https://huggingface.co/hunterbown/shannon-control-unit
X thread for more context: https://x.com/huntermbown/status/1963802419785039878
DMs are open for feedback or 7B+ scale partners; happy to offer a 2-week trial to replicate the results.
What do you think: does this generalize beyond 3B? Going from 1B to 3B meant re-discovering the new model's natural operating point, so I suspect each model has a natural equilibrium ratio at which it trains most efficiently with this method.