Show HN: Experimental HN Discussion to Animated Video Pipeline
I spent 2 days playing with generating a "live action talking animals" 90-second short, 100% automated from that discussion.
The result is the YouTube link for this post.
---
Here is a more detailed breakdown:
Late Sunday night, around 2am, I had a question and a wild thought.
Could I point Gemini Deep Research at https://news.ycombinator.com/item?id=44685011 and ask it to analyze the discussion, reduce it to 5 - 7 personas, and extract key quotes, themes, etc.?
If it could, the next idea was to generate a short 30-second social video illustrating the discussion.
If I avoided curation, cherry-picking, and manual intervention... would it be any good? All effort went into prompts and context, not review or massaging.
I then spent Monday and Tuesday on the idea in a time-boxed spike.
I ended up with the following process:
- Gemini Deep Research ran over the HN discussion and produced the report
- Using the report, a Gemini Pro prompt output characters and the themes
- For each character, a Gemini Pro prompt chose an animal and described their character in detail
- Using the character descriptions and the themes, a Gemini Pro prompt output a time-coded script with actions, sound effects, and dialog
- For each character, Gemini Pro produced an image prompt, fed into Imagen 4, which output a character sheet showing that character from multiple angles with all of their accessories and quirks
- With the script, Gemini Pro split it into scene files
- For each scene and its character descriptions, Gemini Pro with a JSON schema produced the video generation prompt (sketched just after this list)
- For each scene and character sheet, Imagen 4 produced the first-frame image for that scene
- Given the first frame image and the video generation prompt, Veo 3 produced an 8 second clip which includes voice, sounds, etc.
- Some dialog ran longer than 8 seconds and required manual steps: using ffmpeg to grab the last video frame, save it as a .png, and use it as the first frame of a second extension clip (also sketched below)
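To give a flavor of the scene-prompt step, here is a minimal sketch of that kind of structured-output call using the google-genai Python SDK. The schema fields, file names, and model id are illustrative placeholders, not my exact setup:

    from google import genai
    from pydantic import BaseModel

    # Illustrative schema -- the real one had more fields.
    class ScenePrompt(BaseModel):
        scene_id: str
        setting: str
        characters: list[str]
        action: str
        dialog: str
        sound_effects: list[str]

    scene_text = open("scene_03.txt").read()
    character_descriptions = open("characters.txt").read()

    client = genai.Client()  # reads the API key from the environment

    response = client.models.generate_content(
        model="gemini-2.5-pro",
        contents=scene_text + "\n\n" + character_descriptions,
        config={
            "response_mime_type": "application/json",
            "response_schema": ScenePrompt,
        },
    )
    scene_prompt = response.parsed  # a ScenePrompt instance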
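The manual frame-extension step was basically an ffmpeg one-liner wrapped in a script, roughly like this (file names illustrative):

    import subprocess

    # Decode the last second of the Veo clip and keep overwriting the output
    # image, so the final write is (approximately) the last frame.
    subprocess.run([
        "ffmpeg", "-sseof", "-1", "-i", "scene_03.mp4",
        "-update", "1", "scene_03_last_frame.png",
    ], check=True)

That .png then became the first frame of the extension clip in Flow.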
For the LLM steps, I always used the first result (no re-rolls).
For Imagen 4 I generated between 2 and 4 outputs and manually chose the best. In 100% of these samples either the first or the second was acceptable.
For Veo 3 I generated between 2 and 8 outputs and manually chose the best. Voice continuity was the biggest challenge.
I originally had complicated plans for a 9:16 mobile-ratio video where the screen would be broken up into 3 panels. Tuesday at 2pm, I abandoned this and went with a simple linear approach. I slapped things into iMovie and got it done.
I was impressed with Gemini 2.5 Pro's ability to understand the 3-panel layout and use the verticality in its directions. It had characters looking up and down like in Hollywood Squares.
This experiment was as minimally "Cherry Picked" as possible. I'm impressed with the quality of the LLM, image, and video generation output.
Lastly, I had forgotten to have the LLM art direct the opening and closing frames, so I made some stuff up and finished the project.
I learned a lot and this was a fun experiment. It was a mixture of automation and manual steps. When I did this, you could not automate Imagen 4 with a character sheet, nor Veo 3 with frame-to-video; for each of those I had to manually use Whisk and Flow (their respective UIs).
I wrote 5 scripts. I ended up with about 142 artifacts (input and output files).