This is awesome! It would be cool if these LLM visualizations were turned into teaching tools, like showing how attention moves during generation or how prompts shift the model’s output. That kind of interactive view could really help people get what’s going on under the hood.
th0ma5 · 1h ago
I always liked this visualization from a while ago https://alphacode.deepmind.com/
(Press play, zoom all the way out and scroll down if on mobile)
LLM Visualization - https://news.ycombinator.com/item?id=38505211 - Dec 2023 (131 comments)
The Illustrated Transformer: https://jalammar.github.io/illustrated-transformer/
Sebastian Raschka, PhD has a post on the architectures: https://magazine.sebastianraschka.com/p/from-gpt-2-to-gpt-os...
This HN comment has numerous resources: https://news.ycombinator.com/item?id=35712334