Qwen3 30B A3B Hits 13 token/s on 4xRaspberry Pi 5

58 points by b4rtazz | 7 comments | 9/6/2025, 10:59:12 AM | github.com

Comments (7)

dingdingdang · 24m ago
Very impressive numbers... wonder how this would scale on 4 relatively modern desktop PCs, say something akin to an 8th-gen i5 Lenovo ThinkCentre; these can be had very cheap. But as @geerlingguy indicates, we need model compatibility to go up, up, up! As an example, it would be amazing to see something like fastsdcpu run distributed, to democratize the accessibility and practicality of image-gen models for people with limited budgets but large PC fleets ;)
rthnbgrredf · 11m ago
I think it is all well and good, but the most affordable option is probably still to buy a used MacBook with 16, 32, or 64 GB of unified memory (depending on the budget) and install Asahi Linux for tinkering.

Graphics cards with a decent amount of memory are still massively overpriced (even used), big, noisy, and draw a lot of power.

mehdibl · 22m ago
1. This is Q4.

2. This remains slow.

3. The context window used here is likely 8k or similar, which makes it unusable for bigger inputs/outputs.

Models already work fine on phones. Just try https://github.com/google-ai-edge/gallery and you will see local AI running on a phone without issue.
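(On point 1, some back-of-the-envelope math shows why Q4 is the natural choice for this cluster. A minimal sketch; the ~4.5 bits/weight figure assumes a Q4_K-style quant, and the 8 GB Pi 5 variant is assumed; neither detail is stated in the post.)

```python
# Back-of-the-envelope weight memory for a 30B-parameter model
# (assumed ~4.5 bits/weight for a Q4_K-style quant; illustrative only).

params = 30e9                       # Qwen3 30B A3B total parameters
q4_gb   = params * 4.5 / 8 / 1e9    # ~17 GB -> fits across 4x Pi 5 (8 GB each, 32 GB total)
fp16_gb = params * 16  / 8 / 1e9    # ~60 GB -> far beyond the cluster's RAM

print(f"Q4 weights:   ~{q4_gb:.0f} GB")
print(f"FP16 weights: ~{fp16_gb:.0f} GB")
```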

geerlingguy · 55m ago
distributed-llama is great; I just wish it worked with more models. I've been happy with its ease of setup and ongoing maintenance compared to Exo, and with its performance versus llama.cpp's RPC mode.
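(For context on the model-compatibility wish: distributed-llama shards each layer's weights across workers, i.e. tensor parallelism, which is part of why every new architecture needs explicit support. A toy sketch of the core idea, column-slicing one matmul across workers; this illustrates the technique and is not distributed-llama's actual code.)

```python
import numpy as np

def sharded_matmul(x, W, n_workers):
    """Column-slice W across workers; each computes a shard of the output.
    In a real cluster the partial results are gathered over the network."""
    shards = np.array_split(W, n_workers, axis=1)
    partials = [x @ shard for shard in shards]  # one matmul per worker
    return np.concatenate(partials, axis=-1)

x = np.random.randn(1, 64)
W = np.random.randn(64, 256)
assert np.allclose(sharded_matmul(x, W, 4), x @ W)  # matches the unsharded result
```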
alchemist1e9 · 22m ago
Any pointers to what is SOTA for a cluster of hosts with CUDA GPUs, each with too little VRAM for the full weights, but connected by low-latency 10 Gbit links?

If that problem gets solved, even if only with a batch approach that enables parallel batch inference (high total token/s but low per-session throughput), and for bigger models, then it would be a serious game changer for large-scale, low-cost AI automation without billions in capex. My intuition says it should be possible, so perhaps someone has done it or started on it already.
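(One shape that batch approach could take is pipeline parallelism: each host keeps only a contiguous slice of layers in VRAM, and micro-batches stream through the hosts so all GPUs stay busy. Per-request latency is poor, one full pipeline traversal, but aggregate token/s scales with the number of in-flight micro-batches. A minimal scheduling sketch follows, with a hypothetical Stage class standing in for a real GPU host; this is not any existing library's API.)

```python
# Pipeline-parallel batch inference, sketched: each Stage stands in for a
# CUDA host holding only its slice of the model's layers, so no host needs
# VRAM for the full weights. Activations advance one stage per tick.

class Stage:
    def __init__(self, name):
        self.name = name

    def forward(self, activations):
        # Real version: run the local transformer layers on the GPU, then
        # send the activation tensor to the next host over the 10 Gbit link.
        return activations + [self.name]

def run_pipeline(stages, micro_batches):
    slots = [None] * len(stages)   # activation currently sitting at each stage
    done, feed = [], iter(micro_batches)
    while len(done) < len(micro_batches):
        # Drain back-to-front so every activation advances exactly one stage.
        for s in reversed(range(len(stages))):
            if slots[s] is None:
                continue
            out = stages[s].forward(slots[s])
            slots[s] = None
            if s + 1 == len(stages):
                done.append(out)
            else:
                slots[s + 1] = out
        slots[0] = next(feed, None)  # admit the next micro-batch, if any
    return done

if __name__ == "__main__":
    stages = [Stage(f"host{i}") for i in range(4)]
    batches = [[f"mb{i}"] for i in range(8)]
    print(run_pipeline(stages, batches))
```

(Note that only activations cross hosts between stages, never weights, so the per-hop traffic is small relative to a 10 Gbit link; the interconnect mostly determines per-token latency, not total throughput.)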

echelon · 37m ago
This is really impressive.

If we can get this down to a single Raspberry Pi, then we have crazy embedded toys and tools. Local, at the edge, with no internet connection.

Kids will be growing up with toys that talk to them and remember their stories.

We're living in the sci-fi future. This was unthinkable ten years ago.

taminka · 20m ago
i feel sorry for your kids if you think this shit is inspiring lol

chatgpt is literally leading people with higher education to have full-on psychosis by feeding into their insane delusions and confirmation bias; i'm sure a less smart version of this is the perfect toy for a kid w/o a fully developed brain yet

literally go touch grass bro...