TPU (Tensor Processing Unit) Deep Dive (henryhmko.github.io)

Can you suggest a good reference for understanding which algorithms map well onto the regular grid systolic arrays used by TPUs? The fine article says dese matmul and convolution are good, but is there anything else? Eigendecomposition? SVD? matrix exponential? Solving Ax = b or AX = B? Cholesky?

musebox35 · 36m ago

I think https://jax-ml.github.io/scaling-book/ is one of the best references to go through. It details how single device and distributed computations map to TPU hardware features. The emphasis is on mapping the transformer computations, both forwards and backwards, so requires some familiarity with how transformer networks are structured.

WithinReason · 58m ago

Anything that you can express as 128x128 (but ideally much larger) dense matrix multiplication and nothing else

serf · 53m ago

does that cooling channel have a NEMA stepper on it as a pump or metering valve?[0]

If so, wild. That seems like overkill.

[0]: https://henryhmko.github.io/posts/tpu/images/tpu_tray.png

almostgotcaught · 2h ago

> In essence, caches allow hardware to be flexible and adapt to a wide range of applications. This is a large reason why GPUs are very flexible hardware (note: compared to TPUs).

this is correct but mis-stated - it's not the caches themselves that cost energy but MMUs that automatically load/fetch/store to cache on "page faults". TPUs don't have MMUs and furthermore are a push architecture (as opposed to pull).

jan_Sate · 3h ago

I thought that it would be about 3D printer filament.

TPU (Tensor Processing Unit) Deep Dive (henryhmko.github.io)

Sound As Pure Form: Music Language Inspired by Supercollider, APL, and Forth (github.com)

P-Hacking in Startups (briefer.cloud)

LaborBerlin: State-of-the-Art 16mm Projector (filmlabs.org)

Remote MCP Support in Claude Code (anthropic.com)

The bad boy of bar charts: William Playfair (2023) (blog.engora.com)

Denmark's Archaeology Experiment Is Paying Off in Gold and Knowledge (scientificamerican.com)

Type Inference Zoo (zoo.cuichen.cc)

U.S. bombs Iranian nuclear sites (bbc.co.uk)

Airpass – Easily overcome WiFi time limits (airpass.tiagoalves.me)

P2piano: A P2P collaboration space for the musically inclined (p2piano.com)

Finally, a Makefile formatter (50 years overdue) (github.com)

Samsung embeds IronSource spyware app on phones across WANA (smex.org)

Show HN: Luna Rail – treating night trains as a spatial optimization problem (luna-rail.com)

AllTracker: Efficient Dense Point Tracking at High Resolution (alltracker.github.io)

Phoenix.new – Remote AI Runtime for Phoenix (fly.io)

Linux on the Behringer X32 [video] (youtube.com)

Scaling our observability platform by embracing wide events and replacing OTel (clickhouse.com)

Compact Representations for Arrays in Lua [pdf] (sol.sbc.org.br)

Using Microsoft's New CLI Text Editor on Ubuntu (omgubuntu.co.uk)

Tell HN: Beware confidentiality agreements that act as lifetime non competes

Compiler for the B Programming Language (github.com)

uBlock Origin Lite Beta for Safari iOS (testflight.apple.com)

Axolotls May Hold the Key to Regrowing Limbs (smithsonianmag.com)

Unexpected security footguns in Go's parsers (blog.trailofbits.com)

Delta Chat is a decentralized and secure messenger app (delta.chat)

Debunking NIST's calculation of the Kyber-512 security level (2023) (blog.cr.yp.to)

'Gwada negative': French scientists find new blood type in woman (lemonde.fr)

AI is ushering in a 'tiny team' era (bloomberg.com)

Show HN: MMOndrian (mmondrian.com)

Weave (YC W25) is hiring a founding AI engineer (ycombinator.com)

Balatro for the Nintendo E-Reader (mattgreer.dev)

The Nyanja new PC-Engine/TurboGrafx 16-bit console game in development (sarupro.itch.io)

Show HN: To-Userscript: Chrome Extension to Userscript Converter (github.com)

Application First – Media over QUIC (quic.video)

Horse Browser (gethorse.com)

Life as Slime (asimov.press)

Tiny Undervalued Hardware Companions (2024) (vermaden.wordpress.com)

Augmented Vertex Block Descent (AVBD) (graphics.cs.utah.edu)

YouTube's new anti-adblock measures (iter.ca)

Andrej Karpathy: Software in the era of AI [video] (youtube.com)

ARIA, the UK's Bet to Build Scientific Revolutions (asimov.press)

Harper – an open-source alternative to Grammarly (writewithharper.com)

Learn you Galois fields for great good (2023) (xorvoid.com)

Captain Cook's missing ship found after sinking 250 years ago (independent.co.uk)

Sega mistakenly reveals sales numbers of popular games (gematsu.com)

Show HN: A color name API that maps hex to the closest human-readable name (meodai.github.io)

Don't Read This If You Have a Security Clearance (2023) (theatlantic.com)

AbsenceBench: Language models can't tell what's missing (arxiv.org)

Cosmoe: BeOS Class Library on Top of Wayland (cosmoe.org)

TPU (Tensor Processing Unit) Deep Dive

Comments (6)