GPUs go brrr with Mojo: Algorithms

2 shubhamg2208 1 7/20/2025, 3:16:04 PM shubhamg.in

Comments (1)

shubhamg2208 · 7h ago
Part 2 of my Mojo GPU-puzzles series dives into the workhorse kernels of deep learning: sliding-window pooling, halo-aware 1-D/2-D convolutions, warp-level prefix sums, and more. Lots of diagrams and runnable kernels; it builds directly on Part 1 (https://shubhamg.in/posts/2025-07-06-gpu-puzzles-p1.html). Feedback and perf tips welcome!