Show HN: TextPolicy – reinforcement learning for text generation on a MacBook

4 teilom 0 8/30/2025, 4:34:08 PM github.com ↗

I built TextPolicy because I wanted a way to study reinforcement learning for text generation without needing a cluster or cloud GPUs. A MacBook is enough. The toolkit is simple: Implements GRPO and GSPO algorithms Provides a decorator interface for custom reward functions Includes LoRA and QLoRA utilities Runs on MLX, so it is efficient on Apple Silicon It is not intended for production. The purpose is learning and experimentation: to understand algorithms, to test ideas, to see how reward shaping affects behavior. Installation is through pip: pip install textpolicy There is a minimal example in the README. I am interested in feedback on: the clarity of the API, the usefulness of the examples, and whether this lowers the barrier for people new to RL. Repository: github.com/teilomillet/textpolicy

Show HN: Hacker News em dash user leaderboard pre-ChatGPT (gally.net)

Show HN: I made an Animal Crossing style letter editor (acmail.idreesinc.com)

Show HN: I made an English version of the game "Funeral of Freiren" (github.com)

Show HN: An interface for doing research fast with an LLM (proread.ai)

Show HN: Sosumi.ai – Convert Apple Developer docs to AI-readable Markdown (sosumi.ai)

Show HN: Captan – Open-Source Cap Table Management CLI (github.com)

Show HN: An AI coding tool for unserious projects (crazycontext.com)

Show HN: Sometimes GitHub is boring, so I made a CLI tool to fix it (github.com)

Show HN: Find Hidden Gems on HN (pj4533.com)

Show HN: OpenAnimation – KMP app for exploring and editing Lottie animations (github.com)

Show HN: Yet another daily word game – wotd (wotd.is)

Show HN: Tool that helps you find domains for your idea (helpmefindagooddomainnameformyidea.com)

Show HN: Sourcerer – MCP for semantic code search that reduces token waste (github.com)

Show HN: Give Claude Code control of your browser (open-source) (cli-agents.click)

Show HN: Auto-Match – How We Built Receipt-to-Transaction Matching (Open Source) (midday.ai)

Show HN: TextPolicy – reinforcement learning for text generation on a MacBook (github.com)

Show HN: I made a mini site to see timezone shifts (tz.pert.dev)

Show HN: A simple CLI tool to list network ports and their associated bin (github.com)

Show HN: A minimal TS library that generates prompt injection attacks (prompt-injector.blueprintlab.io)

Show HN: Datacmd – Terminal-native dashboards from CSV/API in one command (github.com)

Show HN: Meetup.com and eventribe alternative to small groups (github.com)

Show HN: Octarine – a fast, lightweight, opinionated Markdown notes app (octarine.app)

Show HN: Magic links – Get video and dev logs without installing anything

Show HN: SwiftAI – open-source library to easily build LLM features on iOS/macOS (github.com)

Show HN: Clone of a Macintosh-Like Computer with a Built-In Basic Interpreter (reprobate.site)

Show HN: PageIndex – Vectorless RAG (github.com)

Show HN: Discover Fast-Growing Websites Before They Go Mainstream (websitegrowthtracker.com)

Show HN: Yoink AI – macOS AI app that edits directly in any textfield of any app (useyoink.ai)

Show HN: I integrated my from-scratch TCP/IP stack into the xv6-riscv OS (github.com)

Show HN: An open source implementation of OpenStreetMap in Electron (github.com)

Show HN: Warrify – Stop losing money on expired warranties

Show HN: A zoomable, searchable archive of BYTE magazine (byte.tsundoku.io)

Show HN: Turn Markdown into React/Svelte/Vue UI at runtime, zero build step (markdown-ui.com)

Show HN: Renamify – Case-aware search and replace for AI agents (docspring.github.io)

Show HN: Envoy – Command Logger (github.com)

Show HN: FilterQL – A tiny query language for filtering structured data (github.com)

Show HN: Async – Claude code and Linear and GitHub PRs in one opinionated tool (github.com)

Show HN: Base, an SQLite database editor for macOS (menial.co.uk)

Show HN: OAuth for AI Agents (github.com)

Show HN: Grammit – Local-only AI grammar checker (Chrome extension) (chromewebstore.google.com)

Show HN: Docustore – Vectorized Technical Documentations (github.com)

Show HN: Conversational Companion with realistic mood state machine (dmwithme.com)

Show HN: AI Conway's Game of Life (benlirio.com)

Show HN: Emergency SOS App (play.google.com)

Show HN: Modern UI Composition, Right Inside Django (gist.github.com)

Show HN: Diggit.dev – Git history for architecture archaeologists (diggit.dev)

Show HN: A private, flat monthly subscription for open-source LLMs (synthetic.new)

Show HN: I Built a XSLT Blog Framework (vgr.land)

Show HN: A collection of generic header only data structures written in C (github.com)

Show HN: Gonzo – A Go-based TUI for log analysis (OpenTelemetry/OTLP support) (github.com)

Show HN: TextPolicy – reinforcement learning for text generation on a MacBook

Comments (0)