Show HN: HN Watercooler – listen to HN threads as an audio conversation

I've seen many previous attempts to turn HN threads into podcasts, but they all shared a common issue IMO: trying to reduce the very rich back-and-forth into a single-thread single-reader boring podcast. Instead, I wanted to hear the actual debate from the actual thread!

So I asked Claude 3.7 to build this for me as a browser-only app. It just needs a thread URL and an Elevenlabs API key (this all remains in your browser, you can check the source code, it's only 3 files, there is no server storage of anything).

To make the resulting audio experience as natural as possible, each commenter has a different voice.

Commenters who appear multiple times in the thread have the same voice, and introduce themselves. A bit of context is also introduced when coming back "up" from deeply nested comments.
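
A minimal sketch of how that flattening could work, written in Python for brevity (the app itself runs in the browser, so this is not its actual code). The `Comment` type, `build_script`, the `min_jump` threshold and the spoken phrases are all hypothetical names chosen for illustration:

```python
from dataclasses import dataclass, field


@dataclass
class Comment:
    author: str
    text: str
    replies: list["Comment"] = field(default_factory=list)


def build_script(root, min_jump=2):
    """Flatten a comment tree depth-first into (speaker, line) pairs."""
    script = []
    prev_depth = 0

    def walk(comment, depth, parent):
        nonlocal prev_depth
        # Each commenter introduces themselves before speaking.
        line = f"{comment.author} here. {comment.text}"
        # After climbing back up by several levels, remind the listener
        # which comment this one is replying to.
        if parent is not None and prev_depth - depth >= min_jump:
            line = f"Going back to {parent.author}'s comment. " + line
        script.append((comment.author, line))
        prev_depth = depth
        for reply in comment.replies:
            walk(reply, depth + 1, comment)

    walk(root, 0, None)
    return script
```

The speaker name in each pair is what a voice would be looked up by, so the same commenter keeps the same voice throughout.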

You can play the resulting audio or download it for later listening. I'm planning to later add the ability to load multiple threads so I can have a playlist generated for listening in the gym!

Any comments or improvement suggestions are appreciated!

Comments (56)

sebastiennight · 152d ago

Issues I've noticed when running it against more threads:

- don't use Legacy voices as they seem to be of much lower quality (sounds like someone is calling in from an international landline)

- when the same poster appears many times, it gets tedious to hear them restate who they are. I think after the first 3, we should recognize the voice so that's not necessary anymore

Feature requests I'll add:

- emphasize quotes better

- add audio chapter marks if possible, so it's possible to skip ahead

- attach a speaker's voice to the relevant voice in the 11Labs account if there's a voice with the same name as the username

- add sound effects if people write down sound effects in their comments (this seems tough)

Anything I'm missing?

sebastiennight · 151d ago

Alright, I've made several updates based on feedback!

Cost Estimation

    - Shows (very rough) character count estimate (rounded to nearest thousand)
    - Displays approximate cost at $0.12 per thousand characters
    - Updates dynamically as selections change
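
A rough illustration of that estimate, in Python for brevity rather than the app's browser code. The function name and the half-up rounding are assumptions; only the $0.12 rate and the rounding to the nearest thousand come from the list above:

```python
PRICE_PER_THOUSAND_CHARS = 0.12  # USD, the rate quoted above


def estimate_cost(texts):
    """Return (characters rounded to the nearest thousand, approximate USD cost)."""
    total_chars = sum(len(text) for text in texts)
    # Round half up; Python's built-in round() would round halves to even.
    rounded_chars = (total_chars + 500) // 1000 * 1000
    cost = rounded_chars / 1000 * PRICE_PER_THOUSAND_CHARS
    return rounded_chars, round(cost, 2)
```

Calling this again whenever the selection changes would give the dynamic update.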

Advanced Input Options

    - Added toggle between single thread URL and top 100 stories selection
    - Implemented multi-thread selection with checkboxes
    - Saves input mode preference to localStorage

Comment Limit Improvements

    - Changed to "All" as default with option for custom limit
    - Original post no longer counts against comment limit
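
In sketch form (Python for brevity, with a hypothetical `select_items` helper), the limit now applies to the comments only and the original post is always prepended:

```python
def select_items(original_post, comments, limit=None):
    """Return the items to narrate: the post plus up to `limit` comments.

    limit=None stands in for the "All" default.
    """
    kept = comments if limit is None else comments[:limit]
    return [original_post] + list(kept)
```

With 26 comments and a limit of 26, this yields 27 items, so the last comment is no longer dropped.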

Quote Formatting

    - Text with > is now properly recognized as quotes
    - Quotes are transformed with random introduction phrases
    - Adds "End of quote" with variations at the end of quoted text
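
A possible shape for this transformation, sketched in Python rather than the app's browser code. The intro and outro phrases, the `speakable` name and the regex are assumptions for illustration:

```python
import random
import re

# Hypothetical phrase lists; the real app's wording may differ.
QUOTE_INTROS = ["Quote:", "You wrote:", "To quote:"]
QUOTE_OUTROS = ["End of quote.", "End quote."]

QUOTE_LINE = re.compile(r"^\s*>\s?(.*)$")


def speakable(comment_text, rng=random):
    """Wrap runs of '>'-prefixed lines in a spoken intro and outro."""
    out = []
    in_quote = False
    for line in comment_text.splitlines():
        match = QUOTE_LINE.match(line)
        if match:
            if not in_quote:
                out.append(rng.choice(QUOTE_INTROS))
                in_quote = True
            out.append(match.group(1))  # quoted text without the '>'
        else:
            if in_quote:
                out.append(rng.choice(QUOTE_OUTROS))
                in_quote = False
            out.append(line)
    if in_quote:  # comment ended while still inside a quote
        out.append(rng.choice(QUOTE_OUTROS))
    return "\n".join(out)
```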

Link Handling

    - Preserves shared links in expandable section at the bottom
    - Different random phrases for first, second, and multiple links
    - Links open in new tabs when clicked
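
A crude regex-based sketch of the link replacement, in Python for brevity. Only the first phrase is taken from this thread; the other phrases, the names and the URL pattern are assumptions:

```python
import re

# Simplification: this also swallows punctuation glued to the end of a URL.
URL_RE = re.compile(r"https?://\S+")

# One phrase each for the first, second, and any further link in a comment.
LINK_PHRASES = [
    "see the link I posted in the thread",
    "see the second link I posted",
    "see another link I posted",
]


def replace_links(text):
    """Swap URLs for spoken phrases and collect the URLs for display."""
    links = []

    def spoken(match):
        links.append(match.group(0))
        index = min(len(links), len(LINK_PHRASES)) - 1
        return LINK_PHRASES[index]

    return URL_RE.sub(spoken, text), links
```

The returned list is what would populate the expandable links section.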

Voice Matching

    - Matches commenter usernames to ElevenLabs voices if names match
    - Falls back to deterministic assignment if no match found
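
One way such matching could look, as a Python sketch rather than the app's actual code. The case-insensitive comparison and the hash-based fallback are assumptions; `voices` stands for the list of voice names available in the account:

```python
import hashlib


def pick_voice(username, voices):
    """Prefer a voice named after the user, else pick one deterministically."""
    by_name = {voice.lower(): voice for voice in voices}
    if username.lower() in by_name:
        return by_name[username.lower()]
    # A stable digest, so the same username maps to the same voice on
    # every run (Python's built-in hash() is salted per process).
    digest = hashlib.sha256(username.encode("utf-8")).digest()
    ordered = sorted(voices)
    return ordered[int.from_bytes(digest[:4], "big") % len(ordered)]
```

A plain hash can assign the same voice to two commenters, so keeping voices distinct within a thread would need an extra step on top of this.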

Error Handling & Recovery

    - Saves progress and allows resuming after errors
    - Shows "Retry" button with partial audio when errors occur
    - Audio generated so far is available for download
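
The resume behaviour can be sketched like this (Python for brevity, not the app's browser code). Here `synthesize` is a hypothetical stand-in for the text-to-speech request, and `done` holds the audio chunks produced so far:

```python
def generate_audio(segments, synthesize, done=None):
    """Synthesize segments in order, keeping whatever finished before an error.

    Returns (chunks, error). Pass the returned chunks back in as `done`
    to retry from the first segment that failed.
    """
    done = list(done or [])
    try:
        for segment in segments[len(done):]:
            done.append(synthesize(segment))
    except Exception as error:
        return done, error
    return done, None
```

After a failure, the partial chunks can still be joined and offered for download while a retry picks up where the error occurred.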

UI Improvements

    - Added tooltip with API key information
    - Persistent theme preferences via localStorage
    - Improved responsive design for mobile
    - Filename of the generated MP3 now matches the thread title

rgbrgb · 152d ago

This is cool. Any chance you can drop an example?

sebastiennight · 152d ago

Here's a quick example:

First 20 comments of "John Carmack: writing Rust code feels wholesome"

Here is the rendered MP3: https://drive.google.com/file/d/1yG1mwD70ZteXtdh8Jk_sXUXS_sQ...

The thread: https://news.ycombinator.com/item?id=19126795

First 30 comments of a recent thread, "AGI is still 30 years away": https://drive.google.com/file/d/1YbgRXBv1LC3IdMl8Xb4i9y98S2T...

The thread: https://news.ycombinator.com/item?id=43719280

sebastiennight · 152d ago

Given recent developments I think it might be fun to listen to this very thread as audio!

gojomo · 152d ago

Can I upload my own voiceprint so my comments are said in my voice, or a voice of my choosing?

Can I navigate by voice commands, for example if listening while driving?

sebastiennight · 152d ago

1. This should be possible, I think for example if you saved your cloned voice in your account with the same name as your HN handle. I'll add this. This should then work for using any voice for a specific user (just use the right username as the voice's name in 11Labs).

2. No navigation by voice commands sadly - it generates a single audio track. I might be able to insert chapter marks for each comment though, so that it'd be possible to "skip" to the next comment!

01HNNWZ0MV43FF · 152d ago

I don't have a voice print, can I put something in my profile to get a generic feminine voice? I don't suppose there's a pronouns field

sebastiennight · 152d ago

I would think once I introduce the feature above, you could just create a "01HNNWZ0MV43FF" voice with the Voice Lab[0] inside your account (not necessarily duplicating your real voice but just using 11Labs' tool to get a feminine voice). Would that work?

[0]: https://elevenlabs.io/app/voice-lab

mosquitobiten · 152d ago

One big post can have a bigger reply counter-arguing every point 1b1. It would be nice if the arguments go back and forth, basically segmenting the post and the replies into multiple lines of dialog, rather than feeling like you are listening to a speech.

sebastiennight · 152d ago

Wait... do you mean, quoting the original (or parent) poster in their own voice when there's a quote?

That seems less natural. I think what I can do though, is turn quotes into actual quotes, eg. turning

> One big post can have a bigger reply counter-arguing every point 1b1

into:

"Look; you said 'One big post can have a bigger reply counter-arguing every point 1b1'"

mosquitobiten · 152d ago

>Wait... do you mean, quoting the original (or parent) poster in their own voice when there's a quote?

yeah, I think what I'm getting at is when there is a big argumentative post crossing the line from chit-chat to speech, break out of the structure of the website, let the LLM get the arguments out and connect them to the counter-arguments and turn it into a back-and-forth with shorter dialog lines, without repeating too much or one person talking for very long.

Also I agree, the LLM should be free to transform or add dialog how it sees fit so it feels more natural but always keeping it true to what is written.

sebastiennight · 152d ago

In this app, the process runs entirely in the browser and has no LLM calls at all, so we don't have the ability to rewrite the conversation (other than performing regexes or other crude operations on the text of a comment, which is how links are turned into "See the link I posted in the thread").

I also think it's incredibly difficult (even with an LLM) to render properly a multi-turn multi-user conversation without sticking to the actual hierarchy of the thread. We would probably run into the "summarize the thread and lose nuance" problem again.

sebastiennight · 152d ago

Note: I'm particularly interested in feedback on making the conversation feel even more "natural", so that the audio sounds as close as possible to really listening in on the watercooler chat.

plun9 · 152d ago

It seems that in the generated audio, the number of comments is off by one. It is missing 1 comment.

sebastiennight · 152d ago

I think it counts the original post as a comment, so the total shown is (original post plus number of comments). Is it actually missing one comment in your audio? Which one? First or last?

plun9 · 152d ago

The last one. I did https://news.ycombinator.com/item?id=43552385 and entered 26 comments.

sebastiennight · 152d ago

Ah! You don't need to enter the exact number of comments in this field, you can leave it at 100.

Entering a max of "26" manually is what created the off-by-one error, I think, because of the original post being counted as a comment.

But yeah, I'll fix that.

If I leave the max at 100, then I get every comment (original post + all 26 comments), here's the output audio: https://drive.google.com/file/d/1fIis8yQn-YuOmJwq1J4cLtthQV0...

sebastiennight · 151d ago

Update: I fixed it. The parent post no longer counts towards the limit.

wewewedxfgdf · 152d ago

This is pretty good. I might listen to this as an alternative to a podcast.

Maybe publish it as a podcast.

sebastiennight · 152d ago

Thank you!

I have no plans to publish as a podcast (if I was going to go through all the trouble to put a podcast together, it would be an actual podcast for my startup, not for a hobby project!) but I'd love it if someone did it!

devrandoom · 152d ago

Oh nice cool water. It's a bit muddy looking? Is it safe to drink?

01HNNWZ0MV43FF · 152d ago

Continue straight for eleven thousand miles, then turn lreft

thegreatpeter · 152d ago

rips hair out

sebastiennight · 152d ago

This sounds painful! I think I'll add a feature so 11Labs generates sound effects for comments like this, so they can be enjoyed in their full glory

Show HN: HN Watercooler – listen to HN threads as an audio conversation
