Intercepting an LLM stream to transform every other token reveals surprising robustness

Shmungus · 6/9/2025, 12:18:35 AM · github.com ↗

Comments (1)

Shmungus · 9h ago
I was experimenting with OpenAI's streaming API and had a weird thought: what happens if you intercept and corrupt tokens as they're being generated, rather than after completion? Built a simple Python script that transforms every odd token in real-time - reversing characters, adding noise, uppercasing, etc. The results were unexpectedly interesting. LLMs maintain coherent meaning even with 50% of tokens corrupted. A sentence like "The quick brown fox jumps over the lazy dog" becomes "The kciuq brown xof jumps revo the yzal dog" but remains largely comprehensible. More surprisingly, the semantic degradation isn't linear. Technical explanations break down faster than creative writing. Mathematical content becomes nonsense immediately, while stories can handle significant corruption. This suggests something about how these models encode information - maybe redundancy is built deeper into the token relationships than we assumed. The tool is dead simple (100 lines of Python) but opens up some research questions I hadn't considered:

How much disruption can different model architectures handle? Does token position matter more than token content for meaning preservation? Could this be used for real-time LLM steering or interpretability research?
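For anyone who wants to try it, the core loop is roughly this. It's a simplified sketch rather than my exact script: the model name, prompt, and the specific transforms here are just placeholders, and it assumes the current openai Python package.

```python
import random

from openai import OpenAI

# Each transform mangles a token in a different way.
TRANSFORMS = [
    lambda t: t[::-1],    # reverse the characters
    lambda t: t.upper(),  # uppercase
    lambda t: t + "~",    # append noise
]

def corrupt(token: str) -> str:
    """Apply a random transform, preserving leading whitespace so
    word boundaries in the corrupted stream stay visible."""
    stripped = token.lstrip()
    prefix = token[: len(token) - len(stripped)]
    return prefix + random.choice(TRANSFORMS)(stripped)

client = OpenAI()  # reads OPENAI_API_KEY from the environment

stream = client.chat.completions.create(
    model="gpt-4o-mini",  # placeholder model
    messages=[{"role": "user", "content": "Tell me a short story."}],
    stream=True,
)

i = 0
for chunk in stream:
    if not chunk.choices or chunk.choices[0].delta.content is None:
        continue
    delta = chunk.choices[0].delta.content
    # Corrupt every odd-indexed chunk; pass even ones through untouched.
    print(corrupt(delta) if i % 2 else delta, end="", flush=True)
    i += 1
print()
```

One caveat: streamed deltas usually correspond to single tokens, but the API doesn't guarantee that, so "every other chunk" is only an approximation of "every other token".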

Not sure if this is useful to anyone else, but it's been a fun way to poke at how these systems actually work under the hood. The streaming interception approach might have applications beyond just corruption experiments.