Nvidia Just Released Llama Nemotron Ultra
13 devaniranjan 1 4/8/2025, 11:02:40 PM
NVIDIA just released Llama 3.1 Nemotron Ultra (253B parameter model) that’s showing great performance on GPQA-Diamond, AIME, and LiveCodeBench.
Their blog goes into detail but it shows up to 4x throughput over DeepSeek-R1 with better benchmarks.
Blog: https://developer.nvidia.com/blog/build-enterprise-ai-agents-with-advanced-open-nvidia-llama-nemotron-reasoning-models/
The model is available on HF and as a NIM. Has anyone tried it?
HF: https://huggingface.co/nvidia/Llama-3_1-Nemotron-Ultra-253B-v1
NIM: https://build.nvidia.com/nvidia/llama-3_1-nemotron-ultra-253b-v1
HF: https://huggingface.co/nvidia/Llama-3_1-Nemotron-Ultra-253B-...
NIM: https://build.nvidia.com/nvidia/llama-3_1-nemotron-ultra-253...