Gluon: a GPU programming language based on the same compiler stack as Triton

28 matt_d 7 9/17/2025, 7:50:11 PM github.com ↗

Comments (7)

lukax · 51m ago
Is this Triton's reply to NVIDIA's tilus[1]. Tilus is suposed to be lower level (e.g. you have control over registers). NVIDIA really does not want the CUDA ecosystem to move to Triton as Triton also supports AMD and other accelerators. So with Gluon you get access to lower level features and you can stay within Triton ecosystem.

[1] https://github.com/NVIDIA/tilus

reasonableklout · 11m ago
It sounds like they share that goal. Gluon is a thing because the Triton team realized over the last few months that Blackwell is a significant departure from the Hopper, and achieving >80% SoL kernels is becoming intractable as the triton middle-end simply can't keep up.

Some more info in this issue: https://github.com/triton-lang/triton/issues/7392

mdaniel · 31m ago
Also it REALLY jams me up that this is a thing, complicating discussions: https://github.com/triton-inference-server/server
ronsor · 1h ago
The fact that the "language" is still Python code which has to be traced in some way is a bit off-putting. It feels a bit hacky. I'd rather a separate compiler, honestly.
JonChesterfield · 25m ago
Mojo for python syntax without the ast walking decorator, cuda for c++ syntax over controlling the machine, ah hoc code generators writing mlir for data driven parametric approaches. The design space is filling out over time.
derbOac · 47m ago
Yeah that struck me as odd. It's more like a Python library or something.
huevosabio · 20m ago
Not to be confused with gluon the embbedable language in Rust: https://github.com/gluon-lang/gluon