Show HN: Shannon Control Unit – Adaptive PI Control for LLM Training

Comments (1)

hunterbown · 7h ago

Hey HN,

I'm a solo researcher (and 2nd year law student) building tools at the intersection of information theory and control systems for AI/ML. Inspired by Claude Shannon's work at Bell Labs, I created the Shannon Control Unit (SCU): cruise control for neural network training.

SCU senses the info-ratio and auto-adjusts via PI control for steady, efficient introduction of information.

The mechanism dynamically maintains a target Shannon Information Ratio (S = ParamBPT / (DataBPT + ParamBPT)).

No more manual hyperparam tuning — it self-regulates λ for stability under data drift and faster generalization.

Core formula:Adjust λ via: λ_new = λ · exp(-(Kp·error + Ki·I))

Ablation shows adaptive PI outperforms fixed λ by up to 1.8% BPT. Validated on Llama-3.2:1B: -15.6% perplexity (15.14 → 12.78), -6.2% BPT 3B: -12.6% perplexity (3.56 → 3.11), -10.6% BPT

It's open-source under AGPL 3.0 (for those who want to build on it while sharing improvements). Implemented as LoRA adapters via PEFT/Transformers—load on Meta's base models.

Quick start: python from transformers import AutoModelForCausalLM, AutoTokenizer from peft import PeftModel

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-3.2-1B") model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-3.2-1B") model = PeftModel.from_pretrained(model, "hunterbown/shannon-control-unit")

Try the Colab demo: https://colab.research.google.com/github/Hmbown/shannon-cont... HF space: https://huggingface.co/hunterbown/shannon-control-unit

X thread for more context: https://x.com/huntermbown/status/1963802419785039878

DMs open for feedback or 7B+ scale partners—happy to offer a 2-week trial to replicate results.

What do you think: Does this generalize beyond 3B? Going from 1B to 3B required discovering the natural fit of the new model, so I suspect there could be a natural equilibrium where models train most efficiently using this method.

Started with a container restart, now hacking on a control tool (github.com)

Show HN: Desk-and-Bedside Glucose Monitor (github.com)

California AG to OpenAI: Harm to Children Will Not Be Tolerated (oag.ca.gov)

Anthropic to pay $1.5B to settle lawsuit over pirated training material (axios.com)

Show HN: Stroboscopic Instrument Tuner (github.com)

Great attorney (Mom and Pop shop) for incorporating Delaware C-corp

The Wotancraft Rider V2 Photography Sling Is Inspired by Cycling Bags (petapixel.com)

Introducing Speed Brain: helping web pages load 45% faster (blog.cloudflare.com)

Show HN: I built a public and open llms.txt endpoint for every domain (llms.page)

The Day I Kissed Comment Culture Goodbye (sustainableviews.substack.com)

Managing Multiple Claude Code Sessions Without Worktrees (gitbutler.ghost.io)

Russian spy drones over Germany: Why the Bundeswehr cannot shoot them down (euronews.com)

Show HN: ClodPod – Run Claude Code Safely in a macOS VM (github.com)

William James at CERN (1995) (bactra.org)

What Happened to WASD Keyboards? (discourse.codinghorror.com)

MileSan: Detecting μ-Architectural Leakage via Differential HW/SW Taint Tracking (comsec.ethz.ch)

A Greek Island's First Settlers Weren't Human (newlinesmag.com)

The lone unsolved problem from the ICPC 2025 world finals [pdf] (worldfinals.icpc.global)

Microsoft 365 Copilot pilot: DBT evaluation report (gov.uk)

Feds order Pennsylvania fossil-fuel plant to stay open another 90 days (power-eng.com)

Flookup: The OpenRefine Alternative for Google Sheets Users (getflookup.com)

The Boring Future of GenAI (blog.circuitsofimagination.com)

Anthropic reaches landmark $1.5B copyright deal with book authors (washingtonpost.com)

Ask HN: How do I manage auto-updates during testing on Windows?

Scale AI's former CTO launches AI agent that could solve big data's big problem (techcrunch.com)

A venture capitalist goes to extremes to punish her surrogate (wired.com)

Students face new cellphone restrictions in 17 states as school year begins (apnews.com)

X Design Notes: Unifying OCaml Modules and Values (blog.polybdenum.com)

Teen loneliness triggers 'reward seeking' behaviour (cam.ac.uk)

Ask HN: Why is everyone suddenly so interested in AI browsers? (techcrunch.com)

Ask HN: How don't use AI passively?

What Is Plus Times Plus? (Lambda Calculus Intro) [video] (youtube.com)

Disney Blacklists Artist Mulligan After Confirming He Used AI for Card Art (msn.com)

Little old lady from the south pacific vs. street scammers in Croati (odt.co.nz)

Let's rename the "vibecoding" tag to "LLMs" (lobste.rs)

How to Retire a Few Decades Early [video] (youtube.com)

Seedship [Text-Based Game] (philome.la)

The impact of sea level rise on the cities (thenextwavefutures.wordpress.com)

Do Language Models Agree with Human Perceptions of Suspense in Stories? (arxiv.org)

Show HN: Turn pose photos into editable animation code

LLM as Pair? (ronjeffries.com)

Zuckerberg caught on hot mic promoting fake investment figures to support Trump (bsky.app)

Multi-Level Marketing (en.wikipedia.org)

Anthropic Agrees to Pay $1.5B to Settle Lawsuit with Book Authors (nytimes.com)

How to (and how not to) fix color banding (blog.frost.kiwi)

Video games use LUTs and how you can too (blog.frost.kiwi)

Will solo founders be the new normal? (peignoir.medium.com)

Reflecting on Software Engineering Handbook (yusufaytas.com)

Anthropic to Pay $1.5B to Settle Author Copyright Claims (bloomberg.com)

Turn pose photos into editable animation code

Show HN: Shannon Control Unit – Adaptive PI Control for LLM Training

Comments (1)