Nemotron models
Browse all models from this model family.
-
By NVIDIANewTextReleased 2mo ago
-
NewTextReleased 2mo ago
-
By NVIDIANemotron-4 is NVIDIA’s open-weight LLM family (from compact to 340B) built for high-quality reasoning, coding, and synthetic data generation. It supports tool/function calling, JSON outputs, and long-context variants, and is production-ready via NVIDIA NIM/TensorRT-LLM for fast, scalable deployment.TextReleased 5mo ago
-
By NVIDIANemotron-Nano 2 is NVIDIA’s ultra-compact LLM tuned for on-device and edge deployment. It delivers fast instruction following, coding help, and reasoning with low memory use, supports tool/function calling and structured (JSON) outputs, and runs efficiently on Jetson, RTX laptops/desktops, and server GPUs.TextReleased 6mo ago
-
By NVIDIALlama 3.1 Nemotron Ultra is an NVIDIA-optimized deployment of Meta’s Llama 3.1, packaged for high-throughput production. It delivers strong reasoning and coding, long-context support (≈128K), tool/function calling, and JSON mode—served as a fast, scalable NIM for apps and agents.TextReleased 10mo ago
-
By NVIDIACosmos Nemotron VLM is NVIDIA’s multimodal model that fuses Cosmos world-model perception with Nemotron language reasoning. It understands images and video alongside text, performs step-by-step visual reasoning, and supports tool/function calling and JSON outputs—optimized for fast, scalable deployment via TensorRT-LLM and NIM.TextReleased 1y ago
No models found
Try adjusting your search or filters.
