I've found the opposite of this to be the case. I've spent some time recently debugging a chatbot - when it goes astray I ask 'what part of your system prompt made you do that?' and it responds with the line that drove the decision. Then I clarify my intent, ask it for better phrasing, and fix the prompt. This usually works.
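For what it's worth, the loop is nothing fancy. Here's a minimal sketch of the "ask the bot to blame its own prompt" step, assuming the OpenAI Python SDK - the model name, system prompt, and conversation are made-up placeholders, not anything from my actual setup:

    # Minimal sketch: after an unexpected answer, ask the model which
    # system-prompt line drove the decision. Prompt text and model name
    # below are hypothetical placeholders.
    from openai import OpenAI

    client = OpenAI()

    SYSTEM_PROMPT = "You are a support bot. Always escalate billing questions."  # hypothetical

    history = [
        {"role": "system", "content": SYSTEM_PROMPT},
        {"role": "user", "content": "My invoice is wrong."},
        # the reply that went astray
        {"role": "assistant", "content": "I've escalated this to billing."},
        # the debugging question
        {"role": "user", "content": "What part of your system prompt made you do that? Quote the line."},
    ]

    resp = client.chat.completions.create(model="gpt-4o", messages=history)
    print(resp.choices[0].message.content)  # usually quotes the offending line

Once it quotes the line, I paste in what I actually meant and ask it to suggest a rewording, then update the prompt.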
This also often works with tool use and tool calls - just ask it what part of its prompt told it to do something and it can usually point there.
If you ask it why it believed something a priori that turned out to be wrong, the bot can't answer, and neither could I. If you ask me why I wrote some code, I can walk you through the steps that got me there. But if you ask me why I believed a function existed that, at runtime, I learned doesn't actually exist, I can't provide a justification either.