TAAFT
Free mode
100% free
Freemium
Free Trial
Deals

Nemotron models

Browse all models from this model family.

to
  • NewText
    Released 2mo ago
  • NewText
    Released 2mo ago
  • By NVIDIA
    Nemotron-4 is NVIDIA’s open-weight LLM family (from compact to 340B) built for high-quality reasoning, coding, and synthetic data generation. It supports tool/function calling, JSON outputs, and long-context variants, and is production-ready via NVIDIA NIM/TensorRT-LLM for fast, scalable deployment.
    Text
    Released 5mo ago
  • By NVIDIA
    Nemotron-Nano 2 is NVIDIA’s ultra-compact LLM tuned for on-device and edge deployment. It delivers fast instruction following, coding help, and reasoning with low memory use, supports tool/function calling and structured (JSON) outputs, and runs efficiently on Jetson, RTX laptops/desktops, and server GPUs.
    Text
    Released 6mo ago
  • By NVIDIA
    Llama 3.1 Nemotron Ultra is an NVIDIA-optimized deployment of Meta’s Llama 3.1, packaged for high-throughput production. It delivers strong reasoning and coding, long-context support (≈128K), tool/function calling, and JSON mode—served as a fast, scalable NIM for apps and agents.
    Text
    Released 10mo ago
  • By NVIDIA
    Cosmos Nemotron VLM is NVIDIA’s multimodal model that fuses Cosmos world-model perception with Nemotron language reasoning. It understands images and video alongside text, performs step-by-step visual reasoning, and supports tool/function calling and JSON outputs—optimized for fast, scalable deployment via TensorRT-LLM and NIM.
    Text
    Released 1y ago

No models found

Try adjusting your search or filters.

0 AIs selected
Clear selection
#
Name
Task