Nemotron 3 Super

By NVIDIA
Nemotron 3 Super is a hybrid MoE model with 120B total parameters, of which 12B are active. It introduces LatentMoE for accuracy, multi-token prediction layers for faster inference, and NVFP4 pretraining. It supports context lengths of up to 1M, and its checkpoints and training datasets are published.
Text Gen 7
Released: March 11, 2026

Overview

Nemotron 3 Super is NVIDIA's open Mixture-of-Experts hybrid Mamba-Transformer model designed for high-throughput agentic workloads and long-context reasoning.
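
To make the parameter counts in the description concrete, the short Python sketch below computes the share of parameters that are active and a rough weight footprint. The function names are hypothetical, and the 4-bit figure is only an assumption for illustration: the listing mentions NVFP4 pretraining but does not state the storage precision of the published checkpoints, and the estimate ignores scale factors, activations, and KV cache.

```python
# Figures taken from the listing: 12B active, 120B total parameters.
ACTIVE_PARAMS_B = 12
TOTAL_PARAMS_B = 120


def active_fraction(active_b: float, total_b: float) -> float:
    """Share of the model's parameters that are active."""
    if total_b <= 0:
        raise ValueError("total_b must be positive")
    return active_b / total_b


def weight_footprint_gb(params_b: float, bits_per_param: float) -> float:
    """Rough weight size in decimal gigabytes.

    Assumption: every parameter is stored at bits_per_param bits, with no
    overhead for scale factors or metadata.
    """
    total_bits = params_b * 1e9 * bits_per_param
    return total_bits / 8 / 1e9


if __name__ == "__main__":
    share = active_fraction(ACTIVE_PARAMS_B, TOTAL_PARAMS_B)
    print(f"Active share: {share:.0%}")
    # Hypothetical precisions, for comparison only.
    for bits in (4, 8, 16):
        size = weight_footprint_gb(TOTAL_PARAMS_B, bits)
        print(f"{bits}-bit weights: about {size:.0f} GB")
```

Because only about a tenth of the parameters are active, per-token compute scales with the 12B figure, while memory for the weights still scales with the full 120B.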

About NVIDIA

Industry: Computer Hardware Manufacturing
Company Size: 42000
Location: Santa Clara, California, US
Website: nvidia.com

Benchmark scores

Nemotron 3 Super's scores on tracked AI benchmarks.

Benchmark       Category    Score
HumanEval       Coding      79.4%
MBPP            Coding      78.4%
(unlabeled)                 53.3%
GSM8K           Math        90.7%
MATH            Math        84.8%
ARC Challenge   Knowledge   96.1%
GPQA Diamond    Knowledge   60.0%
MMLU            Knowledge   86.0%
MMLU-Pro        Knowledge   75.7%
MGSM            Language    87.5%
RULER 128k      Language    88.3%

Tools using Nemotron 3 Super

Last updated: April 6, 2026