Scary Cool Sad Goodbye 77 Up in Northern Michigan's Lynchian Underbelly (scarycoolsadgoodbye.substack.com)

- No installs, no signup, no hidden code — just copy-paste the file into any LLM chat window (GPT, Claude, Gemini, etc.). - +22.4% semantic accuracy, +42.1% reasoning success, and 3.6× more stability (benchmarked on GSM8K and Truthful-QA). - Features Semantic Tree Memory, Hallucination Shield, and fully exportable logic. - MIT Licensed, zero tracking, zero ads.

Why did I build this? I wanted to prove that advanced reasoning and memory could be made open, portable, and accessible to anyone — just with pure text, no software or setup.

A note: I'm from China, and English is not my first language. This post and the docs were partly assisted by AI, but I personally reviewed and approved every line of content. All ideas, design, and code are my own work. If anything is unclear or could be improved, I really welcome your feedback!

I'm the author, and happy to answer any questions or suggestions here!

Comments (6)

ultimateking · 14h ago

Really cool project! Quick questions:

1. How does TXT OS store its “Semantic Tree Memory” between sessions? 2. When `kbtest` detects a hallucination, what happens next? 3. Any idea of the speed impact on smaller models like LLaMA-2-13B?

Thanks for sharing—excited to try it out!

TXTOS · 14h ago

Semantic Tree Memory

We actually serialize the tree as a compact JSON-like structure right in the TXT file—each node gets a header like #NODE:id and indented subtrees. When you reload, TXT OS parses those markers back into your LLM’s memory map. No external DB needed—just plain text you can copy-paste between sessions.

--- When kbtest Fires

Internally it tracks our ΔS metric (semantic tension). Once ΔS crosses a preset threshold, kbtest prints a warning and automatically rolls you back to the last “safe” tree checkpoint. That means you lose only the bad branch, not your entire session. Think of it like an undo button for hallucinations.

--- Performance on LLaMA-2-13B

Benchmarks were on GPT-4, but on a 13B model you’ll see roughly a 10–15% token-generation slow-down thanks to the extra parsing and boundary checks. In practice that’s about +2 ms per token, which most folks find an acceptable trade-off for the added stability.

Hope that clears things up—let me know if you hit any weird edge cases!

brown2000 · 12h ago

interesting! Quick question:

does TXT OS work equally well with open-source models, or is it optimized more for models like GPT-4 or Claude?

TXTOS · 12h ago

Hey, good question!

I've actually tested TXT OS with about 10 different AIs already—you can check out the full rundown on my repo. Generally, ChatGPT, Grok, Claude, and Perplexity gave the smoothest and best experience. The others still work fine, but some, like Gemini, have minor quirks (Gemini randomly adds a weird parameter during initial setup, but it sorts itself out after the first step).

So, long story short, if you want a hassle-free experience, go with ChatGPT, Grok, Claude, or Perplexity!

yyhhooq · 9h ago

could you explain the four math a little bit? Why it can activate AI ?

TXTOS · 9h ago

Sure — it’s not about activating AI like magic, it’s about steering its reasoning process.

Each formula plays a role in making the LLM more stable, coherent, and logically self-aware:

• = I - G + mc² defines semantic residue — how far the current output strays from meaning. • BigBig(G) recombines context & error to steer output back toward intent. • BBCR detects collapse and triggers reset → rebirth (like fail-safe logic). • BBAM models attention decay — restoring continuity over multiple steps.

Together, this makes the LLM act less like autocomplete… and more like a self-guided reasoner.

Efficient Document Clustering (2008) (patents.google.com)

Postman to Bruno: A Weekend Migration That Transformed Our API Workflow (ashwch.com)

Fears 'stable' Patagonia glacier in irreversible decline (theguardian.com)

Track Aipac (trackaipac.com)

Generating (almost) equally-spaced points along a parabola (arjuns07.github.io)

Scary Cool Sad Goodbye 77 Up in Northern Michigan's Lynchian Underbelly (scarycoolsadgoodbye.substack.com)

Adventures in Data Corruption (oxide-and-friends.transistor.fm)

Advice on Building Voice AI in June 2025 (daily.co)

A chrome extension that detects malicious websites (cheztrap.com)

State of GPGPU and the JVM (youtube.com)

China Biotech's Advance Is Changing the Drug Pipeline (bloomberg.com)

Traditional Chinese Medicine Has Not Been Vindicated by Science (mcgill.ca)

Lt. Columbo (Peter Falk) Roasts Frank Sinatra (1978) [video] (youtube.com)

Defending Against Prompt Injection with a Few DefensiveTokens (arxiv.org)

Daily Notes Considered Harmful (literallythevoid.com)

Show HN: Shadow VCS quarantines AI generated commits before they break your repo (github.com)

Tim Cook Has Outpaced Jobs in Shareholder Value, but AI Era Exposes Weaknesses (fortune.com)

The human harbor: Navigating identity and meaning in the AI age (venturebeat.com)

Task Runner Census 2025 (aleyan.com)

OpenCut: The open-source CapCut alternative (github.com)

APKLab: Android Reverse-Engineering Workbench for VS Code (github.com)

La Scala Warns Opera Patrons: No Flip-Flops or Tank Tops Allowed (nytimes.com)

Cinema Digital Sound (CDS) (in70mm.com)

What birdsong and back ends can teach us about magic (digitalseams.com)

BOE Governor Bailey Warns Banks Against Issuing Own Stablecoins (bloomberg.com)

Show HN: A Browser-Only Dream Interpreter Using Symbol Logic and JavaScript (github.com)

Show HN: I made a free, simple open-source Stripe invoice generator (oneoffinvoice.com)

Are a few people ruining the internet for the rest of us? (theguardian.com)

Legalise AC (samdumitriu.com)

When Novels Mattered (nytimes.com)

Efficiency of a key enzyme in photosynthesis boosted (news.mit.edu)

How to put your phone down and get back into habit of reading books (2024) (theguardian.com)

Programming Language Theory has a public relations problem (happyfellow.bearblog.dev)

Ancient Neanderthal 'Fat Factory' Reveals How Advanced They Were (sciencealert.com)

No Code Is Dead (thenewstack.io)

Quality and Food Safety Consultant – Emilia Wardach (emiliawardach.com)

Show HN: Hawk – Pandas-like data analysis for JSON/YAML/CSV in CLI (github.com)

Autodesk Weighs Takeover of Engineering Software Firm PTC (finance.yahoo.com)

Show HN: Type-safe PostgreSQL helpers for Kysely – arrays, JSONB, and vector ops (github.com)

Show HN: Shepherd – Generating Synthetic Data with Claude and MCP (github.com)

Five companies now control over 90% of the restaurant food delivery market (marketsaintefficient.substack.com)

Ask HN: How much of OpenAI code is written by AI?

Hungary's oldest library is fighting to save books from beetle infestation (apnews.com)

Book Review: The Laws of Trading (astralcodexten.com)

Add Furigana to your Japanese homework based on your JLPT level (github.com)

Vertical tiny homes redefine compact living (foxnews.com)

Building an open source multi-modal AI assistant (getubo.com)

On the Cyclical Nature of Nostalgia (2023) (lithub.com)

Using Gemini and Claude for SQL Analytics – A Bake Off (benjaminwootton.com)

Hypercapitalism and the AI Talent Wars (blog.johnluttig.com)

Show HN: TXT OS – Open-Source AI Reasoning, One Plain-Text File at a Time

Comments (6)