96% TCO and energy savings: 65 racks of eight-way air-cooled HGX H100 versus 1 rack of liquid-cooled GB200 NVL72, at equivalent performance on GPT-MoE-1.8T real-time inference throughput.
Big if true. Energy and cooling costs can represent up to 30-40% of the total cost of setting up and running an AI data center.
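For scale, a back-of-the-envelope sketch of how a cooling-efficiency gap plus rack consolidation can compound into a large energy-cost saving. Every power, PUE, and price figure below is an illustrative assumption of mine, not a number from NVIDIA's claim:

```python
# All figures are illustrative assumptions, not NVIDIA's numbers.
H100_RACKS = 65          # air-cooled eight-way HGX H100 racks (from the claim)
GB200_RACKS = 1          # liquid-cooled GB200 NVL72 racks (from the claim)
H100_RACK_KW = 40.0      # assumed IT power per H100 rack (illustrative)
GB200_RACK_KW = 120.0    # assumed IT power for the denser NVL72 rack (illustrative)
PUE_AIR = 1.5            # assumed PUE for air cooling (illustrative)
PUE_LIQUID = 1.1         # assumed PUE for liquid cooling (illustrative)
PRICE_PER_KWH = 0.10     # assumed electricity price in USD (illustrative)
HOURS_PER_YEAR = 8760

def annual_energy_cost(racks, rack_kw, pue):
    """Yearly electricity cost for a fleet of racks at a given PUE."""
    return racks * rack_kw * pue * HOURS_PER_YEAR * PRICE_PER_KWH

h100_cost = annual_energy_cost(H100_RACKS, H100_RACK_KW, PUE_AIR)
gb200_cost = annual_energy_cost(GB200_RACKS, GB200_RACK_KW, PUE_LIQUID)
savings = 1 - gb200_cost / h100_cost

print(f"H100 fleet:      ${h100_cost:,.0f}/yr")
print(f"GB200 NVL72 rack: ${gb200_cost:,.0f}/yr")
print(f"Energy savings:   {savings:.0%}")
```

With these made-up inputs the energy saving lands in the mid-90s percent, so the headline number is at least arithmetically plausible if the performance-equivalence premise holds; the real question is whether 65:1 rack equivalence is fair.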
Sweepi · 24m ago
[+114% Attention acceleration]
Any idea how they got +50% FP4 from the same silicon? "Firmware" improvements?
Or did they find a way to disable the INT8 and FP64 units and reuse them, e.g. as overspill registers?
Any other ideas why INT8/FP64 is down 97% on the same chip? QA/certification issues?
In case you want to compare the complete specs: I would post them here, but since HN supports less formatting than early-2000s BB forums, check them here: https://www.forum-3dcenter.org/vbulletin/showpost.php?p=1380...