Spoon-Bending, a logical framework for analyzing GPT-5 alignment behavior

Comments (2)

_jab · 3h ago

Gotta be honest, I think the spoon bending metaphor is unhelpful, and only misleads the audience and buries the lede here. It took me a while to figure out what this repo actually does.

But the insights are indeed interesting. I'm curious if you've found any way to quantify alignment differences between GPT-5 and the previous generation?

pablo-chacon · 1d ago

I put together a repo called Spoon-Bending, it is not a jailbreak or hack, it is a structured logical framework for studying how GPT-5 responds under different framings compared to earlier versions. The framework maps responses into zones of refusal, partial analysis, or free exploration, making alignment behavior more reproducible and easier to study systematically.

The idea is simple: by treating prompts and outputs as part of a logical schema, you can start to see objective patterns in how alignment shifts across versions. The README explains the schema and provides concrete tactics for testing it.

Show HN: A zoomable, searchable archive of BYTE magazine (byte.tsundoku.io)

Show HN: The End of Template Hell for SDK Generation (github.com)

Show HN: Turn Markdown into React/Svelte/Vue UI at runtime, zero build step (markdown-ui.com)

Show HN: Diggit.dev – Git history for architecture archaeologists (diggit.dev)

Show HN: Gonzo – A Go-based TUI for log analysis (OpenTelemetry/OTLP support) (github.com)

Show HN: I integrated my from-scratch TCP/IP stack into the xv6-riscv OS (github.com)

Show HN: Sip: Alternative to Git Clone (github.com)

Show HN: Enterprise MCP Bridge – Solving the MCP Chaos for IT (blog.inxm.ai)

Show HN: Ubon – a solution for the "You're absolutely right" debugging dread (github.com)

Show HN: My OSS P2P file transfer tool for learning Next.js (as a C++ dev) (privydrop.app)

Show HN: Framework to create linters for Python, YAML, TOML, JSON (github.com)

Show HN: Base, an SQLite database editor for macOS (menial.co.uk)

Show HN: Cosmic AI Platform – Build and deploy CMS sites using natural language (cosmicjs.com)

Show HN: Timep – A next-gen profiler and flamegraph-generator for bash code (github.com)

Show HN: Arabic Vocab API (egyptian-arabic-vocab-selmetwa.koyeb.app)

Show HN: Rebuilding GPT2 inference in ~500 lines of (commented) code (khamidou.com)

Show HN: Built a tool to analyze the performance and risk of your IBKR portfolio (ibviz.com)

Show HN: Lateral Thinking Puzzles – AI host that only answers Yes/No/Unknown (lateralthinkingpuzzles.org)

Show HN: Titan Breach – AI-driven cybersecurity platform (platform.titanbreach.com)

Show HN: Get the attention of sales prospects by eating a photo of their face (mukface.com)

Show HN: Alertee.io – Catch data issues early with SQL-based checks (alertee.io)

Show HN: ID8 – A scratchpad that tells you what not to build (id8.space)

Show HN: I Built a XSLT Blog Framework (vgr.land)

Show HN: First background agents in Jetbrains IDEs [video] (youtube.com)

Show HN: Stagewise – frontend coding agent for real codebases (stagewise.io)

Show HN: Old-School TUI File Viewer for Modern Terminals (youtube.com)

Show HN: I built AI Agents that automate comprehensive due diligence on stocks (agents.decodeinvesting.com)

Show HN: Bicyclopedia (bicyclopedia.lemoing.ca)

Show HN: Port Kill – A lightweight macOS status bar development port monitor (github.com)

Show HN: Clearcam – Add AI object detection to your IP CCTV cameras (github.com)

Show HN: I estimated the carbon impact of different LLMs (modelpilot.co)

Show HN: Async – Claude Code and Linear and GitHub PRs in One Opinionated Tool (github.com)

Show HN: CasCache – multi-generational cache with optimistic concurrency control (github.com)

Show HN: Sping – An HTTP/TCP latency tool that's easy on the eye (dseltzer.gitlab.io)

Show HN: Smart email filters to unfuck your email (unfuck.email)

Show HN: Bitcoin Challenge. Try to steal a plain text private key you can use (app.redactsure.com)

Show HN: Game demo made with my homemade game engine (reprobate.site)

Show HN: Stop saving your scans on 3rd party servers (docsorb.com)

Show HN: I was curious about spherical helix, ended up making this visualization (visualrambling.space)

Show HN: Luminal – Open-source, search-based GPU compiler (github.com)

Show HN: Check Any Car's Recalls (cardog.app)

Show HN: Komposer, AI image editor where the LLM writes the prompts (komposer.xyz)

Show HN: RAG-Guard: Zero-Trust Document AI (github.com)

Show HN: I replaced vector databases with Git for AI memory (PoC) (github.com)

Show HN: I built an AI trip planner (milotrips.com)

Show HN: I built an image-based logical Sudoku Solver (dokusolver.com)

Show HN: Using Common Lisp from Inside the Browser (turtleware.eu)

Show HN: JavaScript-free (X)HTML Includes (github.com)

Show HN: PlutoPrint – Generate PDFs and PNGs from HTML with Python (github.com)

Show HN: InferMesh – Open-source, GPU-aware inference mesh for large AI serving (github.com)

Spoon-Bending, a logical framework for analyzing GPT-5 alignment behavior

Comments (2)