Fp8 runs ~100 tflops faster when the kernel name has "cutlass" in it

4 mmastrac 0 7/10/2025, 2:40:53 PM github.com ↗

Comments (0)

No comments yet