Understanding Transformers Using a Minimal Example

133 rttti 6 9/3/2025, 3:30:06 PM rti.github.io ↗

Comments (6)

CGMthrowaway · 4h ago
Honest feedback - I was really excited when I read the opening. However, I did not come away from this without a greater understanding than I already had.

For reference, my initial understanding was somewhat low: basically I know a) what embedding is basically b) transformers work by matrix multiplication, and c) it's something like a multi-threaded Markov chain generator with the benefit of prior-trained embeddings

onename · 4h ago
Have you checked out this video from 3Blue1Brown that talks bit about transformers?

https://youtu.be/wjZofJX0v4M

CGMthrowaway · 34m ago
I've seen it but I don't believe I've watched it all the way through. I will now
busymom0 · 5h ago
I'd also recommend another article on this topic of LLMs discussed a few days ago. I read it to the finish line and understood everything fully:

> How can AI ID a cat?

https://news.ycombinator.com/item?id=44964800

xwowsersx · 3h ago
So glad you shared this. Super accessible without diluting. Thank you!
aabdel0181 · 5h ago
very cool!