We should have the ability to run any code we want on hardware we own (hugotunius.se)

2071 points by K0nserv 15d ago 1203 comments

Cognitive load is what matters (github.com)

1582 points by nromiun 16d ago 526 comments

NPM debug and chalk packages compromised (aikido.dev)

1366 points by universesquid 7d ago 754 comments

I didn't bring my son to a museum to look at screens (sethpurcell.com)

1165 points by arch_deluxe 5d ago 387 comments

Show HN: A store that generates products from anything you type in search (anycrap.shop)

1132 points by kafked 2d ago 324 comments

I ditched Docker for Podman (codesmash.dev)

1118 points by codesmash 10d ago 654 comments

Germany is not supporting ChatControl – blocking minority secured (digitalcourage.social)

1111 points by xyzal 4d ago 355 comments

30 minutes with a stranger (pudding.cool)

1100 points by MaxLeiter 11d ago 375 comments

Charlie Kirk killed at event in Utah (nbcnews.com)

1074 points by david927 5d ago 3269 comments

Show HN: Term.everything – Run any GUI app in the terminal (github.com)

1074 points by mmulet 6d ago 144 comments

996 (lucumr.pocoo.org)

1053 points by genericlemon24 9d ago 532 comments

Next.js is infuriating (blog.meca.sh)

1033 points by Bogdanp 13d ago 579 comments

Show HN: I recreated Windows XP as my portfolio (mitchivin.com)

1028 points by mitchivin 9d ago 322 comments

EU court rules nuclear energy is clean energy (weplanet.org)

1026 points by mpweiher 3d ago 1203 comments

The MacBook has a sensor that knows the exact angle of the screen hinge (twitter.com)

1021 points by leephillips 8d ago 487 comments

Anthropic agrees to pay $1.5B to settle lawsuit with book authors (nytimes.com)

988 points by acomjean 10d ago 738 comments

Signal Secure Backups (signal.org)

987 points by keyboardJones 7d ago 442 comments

Using Claude Code to modernize a 25-year-old kernel driver (dmitrybrant.com)

920 points by dmitrybrant 8d ago 319 comments

iPhone Air (apple.com)

902 points by excerionsforte 6d ago 1949 comments

Pontevedra, Spain declares its entire urban area a "reduced traffic zone" (greeneuropeanjournal.eu)

870 points by robtherobber 5d ago 942 comments

I replaced Animal Crossing's dialogue with a live LLM by hacking GameCube memory (joshfonseca.com)

865 points by vuciv 6d ago 185 comments

Google can keep its Chrome browser but will be barred from exclusive contracts (cnbc.com)

864 points by colesantiago 13d ago 632 comments

UTF-8 is a brilliant design (iamvishnu.com)

836 points by vishnuharidas 3d ago 338 comments

We all dodged a bullet (xeiaso.net)

827 points by WhyNotHugo 6d ago 483 comments

Stripe Launches L1 Blockchain: Tempo (tempo.xyz)

807 points by _nvs 11d ago 1070 comments

Mistral raises 1.7B€, partners with ASML (mistral.ai)

802 points by TechTechTech 6d ago 422 comments

New Mexico is first state in US to offer universal child care (governor.state.nm.us)

787 points by toomuchtodo 6d ago 661 comments

Chat Control Must Be Stopped (privacyguides.org)

786 points by Improvement 7d ago 258 comments

The treasury is expanding the Patriot Act to attack Bitcoin self custody (tftc.io)

777 points by bilsbie 3d ago 555 comments

“This telegram must be closely paraphrased before being communicated to anyone” (history.stackexchange.com)

775 points by azeemba 15d ago 135 comments

Almost anything you give sustained attention to will begin to loop on itself (henrikkarlsson.xyz)

771 points by jger15 11d ago 223 comments

Where's the shovelware? Why AI coding claims don't add up (mikelovesrobots.substack.com)

769 points by dbalatero 12d ago 482 comments

Hosting a website on a disposable vape (bogdanthegeek.github.io)

763 points by BogdanTheGeek 10h ago 386 comments

Models of European metro stations (stations.albertguillaumes.cat)

726 points by tcumulus 1d ago 154 comments

Google AI Overview made up an elaborate story about me (bsky.app)

698 points by jsheard 14d ago 278 comments

iPhone dumbphone (stopa.io)

687 points by joshmanders 7d ago 394 comments

Claude Code: Now in Beta in Zed (zed.dev)

682 points by meetpateltech 12d ago 406 comments

Why our website looks like an operating system (posthog.com)

680 points by bnc319 4d ago 486 comments

Eternal Struggle (yoavg.github.io)

680 points by yurivish 15d ago 136 comments

Corporations are trying to hide job openings from US citizens (thehill.com)

675 points by b_mc2 3d ago 520 comments

KDE launches its own distribution (lwn.net)

675 points by Bogdanp 5d ago 525 comments

Many hard LeetCode problems are easy constraint problems (buttondown.com)

672 points by mpweiher 3d ago 525 comments

ICE is using fake cell towers to spy on people's phones (forbes.com)

667 points by coloneltcb 6d ago 255 comments

Court rejects Verizon claim that selling location data without consent is legal (arstechnica.com)

658 points by nobody9999 5d ago 86 comments

Claude now has access to a server-side container environment (anthropic.com)

658 points by meetpateltech 6d ago 343 comments

Hosting a website on a disposable vape (bogdanthegeek.github.io)

651 points by dmazin 14h ago 15 comments

I'm absolutely right (absolutelyright.lol)

651 points by yoavfr 10d ago 266 comments

LLM Visualization (bbycroft.net)

640 points by gmays 11d ago 46 comments

Notes on Managing ADHD (borretti.me)

633 points by amrrs 15d ago 330 comments

Serverless Horrors (serverlesshorrors.com)

622 points by operator-name 8d ago 484 comments

Strengths and limitations of diffusion language models

72 rbanffy 7 5/22/2025, 10:10:09 AM seangoedecke.com ↗

Comments (7)

cubefox · 116d ago

That's a nice explanation. I wonder whether autoregressive and diffusion language models could be combined such that the model only denoises the (most recent) end of a sequence of text, like a paragraph, while the rest is unchangeable and allows for key-value caching.

gfysfm · 116d ago

Hi, I wrote the post. Thank you!

That’s how it does work, but unfortunately denoising the last paragraph requires computing attention scores for every token in that paragraph, which requires checking those tokens against every token in the sequence. So it’s still much less cacheable than the equivalent autoregressive model.

billconan · 116d ago

I'm curious, in image generation, flow matching is said to be better than diffusion, then why do these language models still start from diffusion, instead of jumping to flow matching directly?

gessha · 116d ago

This is just a guess but I think it’s due to diffusion training being more popular so we’ve figured more of the kinks with those models. Flow matching models might follow after you figure out some of their hyperparameters.

mountainriver · 116d ago

A big discussion on this happened here as well https://news.ycombinator.com/item?id=44057820

There is quite a bit of evidence diffusion models work better at reasoning because they don't suffer from early token bias.

https://github.com/HKUNLP/diffusion-vs-ar https://arxiv.org/html/2410.14157v3

accrual · 116d ago

Great overview. I wonder if we'll start to see more text diffusion models from other players, or maybe even a mixture of diffusion and transformer models alternating roles behind a single UI, depending on the context and request.

shrubhub · 116d ago

The diffusion models are (or can be) transformer models! They're just not autoregressive.