Understanding Transformers Using a Minimal Example

118 rttti 5 9/3/2025, 3:30:06 PM rti.github.io ↗

Comments (5)

CGMthrowaway · 3h ago
Honest feedback - I was really excited when I read the opening. However, I did not come away from this without a greater understanding than I already had.

For reference, my initial understanding was somewhat low: basically I know a) what embedding is basically b) transformers work by matrix multiplication, and c) it's something like a multi-threaded Markov chain generator with the benefit of prior-trained embeddings

onename · 3h ago
Have you checked out this video from 3Blue1Brown that talks bit about transformers?

https://youtu.be/wjZofJX0v4M

busymom0 · 3h ago
I'd also recommend another article on this topic of LLMs discussed a few days ago. I read it to the finish line and understood everything fully:

> How can AI ID a cat?

https://news.ycombinator.com/item?id=44964800

xwowsersx · 2h ago
So glad you shared this. Super accessible without diluting. Thank you!
aabdel0181 · 3h ago
very cool!