To the author: what happens to my voice after I upload it? What is your plan moving forward? I'm too far out in left field to understand how to build a business and monetize an open source product like this, even though I found it fun to play around with.

unmute-sh · 32d ago

Thanks! There is a model that turns the voice sample into an embedding, and that embedding determines the voice of the generated speech. Unlike the STT and TTS, we won't be releasing the weights of this voice cloning model, but we will provide it over an API so that we can do verification and prevent abuse.

edit: Ah yes, and we do not store the voice sample on our server. The voice embedding is cached for 24 hours.

ton4eg · 32d ago

Way more entertaining than I would expect! What TTS and ASR models do you use? What sort of latency do you get?

unmute-sh · 32d ago

Thank you! The TTS and ASR are our own unreleased models, but we'll open-source them soon :)

The latency is about 500ms once we detect that it's the bot's turn to speak (roughly 200ms for the LLM's time to first token and 300ms for the TTS audio to start), plus a variable time for the semantic pause detection (VAD).

If it's clear that you're done talking, like when you ask a question, the model will reply very fast. If you stop mid-sentence as if you have more to say, it will wait for longer to avoid interrupting you.
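For readers who want the arithmetic spelled out, here is a minimal sketch of the latency budget described above. It is not the project's code: the 200ms and 300ms figures are the ones quoted in this comment, while the function name `response_latency_ms` and the two pause-wait values are hypothetical placeholders picked only for illustration.

```python
# Illustrative only. The two constants are the figures quoted in the comment;
# everything else (names, pause-wait values) is a made-up placeholder.
LLM_TIME_TO_FIRST_TOKEN_MS = 200
TTS_AUDIO_START_MS = 300


def response_latency_ms(pause_wait_ms: int) -> int:
    """Delay between the user going quiet and the bot's audio starting."""
    if pause_wait_ms < 0:
        raise ValueError("pause_wait_ms must be non-negative")
    return pause_wait_ms + LLM_TIME_TO_FIRST_TOKEN_MS + TTS_AUDIO_START_MS


# Hypothetical waits: short after a clear question, longer after a sentence
# that trails off as if the speaker has more to say.
after_clear_question = response_latency_ms(pause_wait_ms=100)
after_trailing_off = response_latency_ms(pause_wait_ms=1500)
print(after_clear_question, after_trailing_off)
```

The fixed part of the budget stays at about 500ms; only the pause-detection wait changes with how finished the sentence sounds.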

karim79 · 32d ago

Incredible work. Short, sweet and simple. I hadn't expected to enjoy this as much as I did. I can't wait to see where it goes.

marnesh · 31d ago

Wow, this is actually pretty amazing. It is so natural.

marnesh · 31d ago

This is really amazing. It is so natural.

xingwu · 32d ago

Simple, functional, perfect.

android521 · 32d ago

Can't wait for the open source release.

Show HN: Make your own voice AI in two clicks

Comments (9)