Also interesting in this context is the PyTorch Developer Podcast [1] by the same author. Very comforting to learn about PyTorch internals while doing the dishes.
I think the problem with the podcast format (ironic for me to say) is that it assumes a much higher familiarity with the APIs than any visual medium, including blogs, affords.
smokel · 37d ago
Agreed, but I'm still very happy that some people try. I'm really not that interested in the weather or in idle chit-chat, and for some reason most podcasts seem to focus on exactly that.
alexrigler · 38d ago
This is a fun blast from the near past. I helped organize the PyTorch NYC meetup where Ed presented this, and I still think it's one of the best technical presentations I've seen. Hand-drawn slides for the win. Wish I had recorded it :\
It is a modern and clean codebase without legacy baggage, and I could understand most of it without consulting external articles.
ForceBru · 38d ago
Why is MLX Apple silicon only? Is there something fundamental that prevents it from working on x86? Are some core features only possible on Apple silicon? Or do the devs specifically refuse to port to x86? (Which is understandable, I guess)
I'm asking because it seems to have nice autodiff functionality. It even supports differentiating array mutation (https://ml-explore.github.io/mlx/build/html/usage/indexing.h...), which is something JAX and Zygote.jl can't do. Instead, both have ugly tricks like `array.at[index].set` and the `Buffer` struct.
So it would be cool to have this functionality on a "regular" CPU.
zcbenz · 38d ago
Most features are already supported on x86 CPUs: you can `pip install mlx` on Linux, and you can even use it on Windows (no official binary release yet, but it builds and the tests pass).
Edward taught a Programming Languages class I took nearly a decade ago, and clicking through here I immediately recognized the illustrated slides. Brought a smile to my face.
lyeager · 37d ago
Me too, he was great. Tried his darndest to help me understand Haskell monads.
aostiles · 37d ago
He was really nice in Stanford's CS 240h. He helped me better understand Safe Haskell and GHC internals.
vimgrinder · 38d ago
In case it helps someone: if you have trouble reading long articles, try text-to-audio with line highlighting. It helps a lot. It cured my lack of attention.
PeterStuer · 38d ago
No trouble reading the article. Those slides though. Make my eyes hurt :(
vimgrinder · 38d ago
they were constantly referred to in the text :/ impossible to skip
quotemstr · 38d ago
Huh. I'd have written TORCH_CHECK like this:
    TORCH_CHECK(self.dim() == 1)
        << "Expected dim to be a 1-D tensor "
        << "but was " << self.dim() << "-D tensor";
Turns out it's possible to write TORCH_CHECK() so that it evaluates the streaming operators only if the check fails. (Check out how glog works.)
bilal2vec · 38d ago
See also the dev forum roadmaps [1] and design docs (e.g. [2], [3], [4]).
I used this to onboard to the PyTorch team a few years ago. It’s useful for understanding the key concepts of the framework. Torch.compile isn’t covered but the rest of it is still pretty relevant.
kadushka · 38d ago
I’m guessing about 80%
sidkshatriya · 38d ago
To understand a complex system, it is sometimes better to first understand a (simpler) model system. Sometimes an older version of the same system is that good model system. This is not always true, but it is a good rule of thumb.
pizza · 38d ago
Btw, would anyone have good resources on using PyTorch as a general-purpose graph library? I.e., anything beyond the assumption that nets are forward-only (acyclic) digraphs.
brutus1979 · 38d ago
Is there a video version of this? It seems to be from a talk.
[1] https://pytorch-dev-podcast.simplecast.com/
https://web.mit.edu/~ezyang/Public/pytorch-internals.pdf
[1]: https://dev-discuss.pytorch.org/t/meta-pytorch-team-2025-h1-...
[2]: https://dev-discuss.pytorch.org/t/pytorch-symmetricmemory-ha...
[3]: https://dev-discuss.pytorch.org/t/where-do-the-2000-pytorch-...
[4]: https://dev-discuss.pytorch.org/t/rethinking-pytorch-fully-s...