Show HN: Evaluating LLMs on creative writing via reader usage, not benchmarks

Hey HN! I'd love for some people to mess around with a little side project I built to teach myself DSPy! I've been a big fan of fiction + webnovels for a while now, and have always been curious about two things: how LLMs can iteratively learn to write better from reader feedback, and which LLMs are actually best at creative writing (research benchmarks are cool, but don't necessarily translate to real-world usage).

That's exactly why I built narrator.sh! The platform takes a user's idea for a novel, then generates serialized fiction chapter by chapter, using DSPy to optimize the writing against real reader feedback. I'm using CoT and parallel modules to break down the writing task, Refine modules with LLM-as-a-judge reward functions, and the SIMBA optimizer to recompile the pipeline on user ratings from previous chapters so subsequent ones improve.
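
Here's a rough sketch of the shape of that pipeline (simplified and illustrative, not the production code; the model name, signatures, and Refine threshold are placeholders):

    import dspy

    # Placeholder model; swap in whichever LM is being ranked.
    dspy.configure(lm=dspy.LM("openai/gpt-4o-mini"))

    class WriteChapter(dspy.Signature):
        """Write the next chapter of a serialized novel."""
        premise: str = dspy.InputField()
        story_so_far: str = dspy.InputField(desc="summary of prior chapters")
        chapter: str = dspy.OutputField()

    class JudgeChapter(dspy.Signature):
        """Rate how engaging this chapter is for readers, from 0.0 to 1.0."""
        chapter: str = dspy.InputField()
        score: float = dspy.OutputField()

    judge = dspy.ChainOfThought(JudgeChapter)

    def judge_reward(args, pred):
        # LLM-as-a-judge reward that Refine uses to pick the best draft.
        return judge(chapter=pred.chapter).score

    # Draft up to 3 candidates, keep the highest-scoring one.
    writer = dspy.Refine(
        module=dspy.ChainOfThought(WriteChapter),
        N=3,
        reward_fn=judge_reward,
        threshold=0.8,
    )

    def reader_metric(example, pred, trace=None):
        # In production this would score against stored reader ratings;
        # the judge stands in for them here.
        return judge(chapter=pred.chapter).score

    # Once past chapters + ratings are collected as dspy.Example rows:
    # optimizer = dspy.SIMBA(metric=reader_metric, max_steps=8)
    # writer = optimizer.compile(writer, trainset=past_chapter_examples)

The recompile step is the feedback loop: SIMBA updates the writer's instructions and few-shot demos against the metric, which is how ratings on chapter N feed into chapter N+1.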

Instead of synthetic benchmarks, I track real reader metrics: time spent reading, ratings, bookmarks, comments, and return visits. This creates a leaderboard of which models actually write engaging fiction that people want to finish.
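
For a flavor of how those signals could roll up into a ranking (illustrative only, not the site's actual formula):

    from statistics import mean

    # Normalize each signal to [0, 1], average per chapter, then per model.
    def engagement(ch):
        return mean([
            min(ch["seconds_read"] / ch["expected_seconds"], 1.0),  # finished it?
            ch["rating"] / 5.0,                                     # explicit rating
            1.0 if ch["bookmarked"] else 0.0,
            1.0 if ch["reader_returned"] else 0.0,                  # came back later
        ])

    def leaderboard(chapters):
        by_model = {}
        for ch in chapters:
            by_model.setdefault(ch["model"], []).append(engagement(ch))
        return sorted(
            ((model, mean(scores)) for model, scores in by_model.items()),
            key=lambda item: item[1],
            reverse=True,
        )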

Right now the closest evals for creative-writing LLMs come from the author's perspective (e.g., OpenRouter's usage data for tools like Novelcrafter). But ultimately readers decide what's good, not authors.

You can try it at https://narrator.sh. Here's the current leaderboard: https://narrator.sh/llm-leaderboard (it's a bit bare right now b/c there aren't many users yet, haha)

(Fair warning: there's some adult content since I posted on Reddit for beta testers and people got creative with prompts. I'm working on diversifying the content!)
