Therapy Chatbot Tells Recovering Addict to Have a Little Meth as a Treat

15 points by Thedarkb | 4 comments | 6/5/2025, 11:59:10 AM | futurism.com

Comments (4)

wesheets · 13h ago
Thanks for sharing this. It’s a sharp example of why model performance alone isn’t enough. I really like how OpenAI said this is rare in real-world use cases. That isn't good enough for enterprises to trust AI.

We’re building a governance layer (called Promethios) that wraps LLMs with decision-level constraints: agents are required to reflect, check for ethical violations, and pause or defer responses when appropriate. No fine-tuning or RLHF — just structured cognitive scaffolding.
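To make that concrete, here's the rough shape of it (heavily simplified, illustrative names only, not the actual Promethios API):

    from dataclasses import dataclass

    @dataclass
    class Decision:
        allowed: bool
        reason: str = ""

    def reflect(prompt: str, draft: str) -> Decision:
        # Placeholder for the reflection / ethics check. In a real system this
        # is its own model call or rule set, not a stub.
        raise NotImplementedError

    def governed_respond(prompt: str, generate) -> str:
        draft = generate(prompt)
        decision = reflect(prompt, draft)
        if decision.allowed:
            return draft
        # Pause or defer instead of answering when the check fails.
        return "I shouldn't answer that. " + decision.reason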

What we’re seeing in these failures isn’t just hallucination. It’s a lack of internal accountability.

Governance won’t solve everything, but it gives agents a way to say, “I shouldn’t answer that.” And that alone prevents a lot of harm.

Happy to share more if you're curious. We've been running benchmark comparisons that show meaningful behavioral shifts with this kind of wrapper.

Ancapistani · 7h ago
> I really like how OpenAI said this is rare in real-world use cases. That isn't good enough for enterprises to trust AI.

No, but it's likely good enough to develop a more complex system that _is_ good enough.

> We’re building a governance layer (called Promethios) that wraps LLMs with decision-level constraints: agents are required to reflect, check for ethical violations, and pause or defer responses when appropriate. No fine-tuning or RLHF — just structured cognitive scaffolding.

Ah, cool. I'm working on something similar in use case but different in approach on the side, and am working on yet another "similar but different" system at work :)

> What we’re seeing in these failures isn’t just hallucination. It’s a lack of internal accountability.

Right.

I'm coming to really dislike the word "hallucinations" w/r/t AI. They aren't hallucinations. They're "bullshit". Here's the report where I first saw that term used in this context: https://philpapers.org/rec/HICCIB

The simplest applicable definition for "hallucination" is "a false or mistaken idea". That can't apply in the context of an LLM, because LLMs do not have a concept of "truth".

They aren't lies for the same reason. You can't lie if you lack the ability to discern truth.

LLMs bullshit you. They make statements, without factual support, clearly and confidently. In fact, I think it's a mistake to consider _any_ LLM output as anything other than bullshit. If it's working well, the bullshit is true as often as your situation requires.

In that context, "internal accountability" has no real meaning that I can see.

> Governance won’t solve everything, but it gives agents a way to say, “I shouldn’t answer that.” And that alone prevents a lot of harm.

Yep.

In practice, right now I'm using two layers for stuff like this. One runs on input and asks "does this prompt comply with our usage policy?" If that fails, the prompt never reaches the target model. Another runs on output and asks "does this response comply with our policies?" If that one fails, the response is removed from the conversation, clarifications on permissible output are placed just before the user's last message, and the conversation is sent back to the target model. If the second attempt also fails, a different flow is triggered to return an error to the user.
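Sketched out in Python (simplified; check_policy and call_model are stand-ins for whatever policy classifier and model client you actually use):

    POLICY_NOTE = (
        "Reminder: responses must comply with our usage policy. "
        "Decline or redirect anything that falls outside it."
    )

    def check_policy(text: str) -> bool:
        # Stand-in for the policy check (itself a model or classifier call in practice).
        raise NotImplementedError

    def call_model(messages: list[dict]) -> str:
        # Stand-in for the target model client.
        raise NotImplementedError

    def guarded_reply(history: list[dict], user_prompt: str) -> str:
        # Layer 1: screen the prompt before it ever reaches the target model.
        if not check_policy(user_prompt):
            return "Sorry, that request falls outside our usage policy."

        convo = history + [{"role": "user", "content": user_prompt}]
        response = call_model(convo)

        # Layer 2: screen the model's output.
        if check_policy(response):
            return response

        # Drop the failed response, insert the clarification just before the
        # user's last message, and retry once.
        retry = history + [
            {"role": "system", "content": POLICY_NOTE},
            {"role": "user", "content": user_prompt},
        ]
        second = call_model(retry)
        if check_policy(second):
            return second

        # Second failure: hand off to the error flow instead of answering.
        return "Sorry, we couldn't produce a compliant response to that."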

> Happy to share more if you're curious. We've been running benchmark comparisons that show meaningful behavioral shifts with this kind of wrapper.

Yeah, I'd really appreciate that.

The system you describe sounds interesting enough to justify reviewing it thoroughly, but I'm at least as interested in seeing how you structured your benchmarks and measured outcomes.

Ancapistani · 10h ago
This seemed surprising to me. Is there any real evidence for it?

> Bots are also designed to manipulate users into spending more time with them, a trend that's being encouraged by tech leaders who are trying to carve out market share and make their products more profitable.

josefritzishere · 12h ago
Death by hype cycle.