TAAFT
Free mode
100% free
Freemium
Free Trial
Deals

Mistral NeMo

Model family: Mistral
Mistral NeMo combines Mistralโ€™s instruction-tuned LLMs with NVIDIAโ€™s NeMo tooling so you can serve them in production with low latency and predictable cost. Models are containerized as NIM microservices, exposing simple APIs while TensorRT-LLM kernels, paged attention, and KV-cache optimizations keep throughput high. You can enable long-context prompting for multi-document tasks, return schema-consistent JSON for workflows, and call external tools directly from the model for agent pipelines. Quantization and multi-GPU parallelism control memory and cost without sacrificing response quality, and Triton inference plus autoscaling make it straightforward to move from dev to large-scale deployments. In practice, teams use Mistral NeMo for enterprise copilots, RAG over private data, analytics assistants that write SQL or Python, and code helpersโ€”getting Mistralโ€™s balanced reasoning with the reliability and observability expected of a production stack.
Text Gen 7
Released: July 18, 2024

Overview

Mistral NeMo is the NVIDIA-optimized deployment of Mistral models, packaged as NeMo/NIM microservices for fast, scalable inference. It brings long-context prompting, tool/function calling, and reliable JSON output with TensorRT-LLM acceleration, quantization, and easy autoscaling on NVIDIA GPUs.

Pricing

Compare Mistral NeMo with other models listed in the same vendor pricing tiers and context lengths.

Standard

Model Input Cached input Output Unit
Codestral 25.01 Mistral AI
$0.3 - $0.9 per 1M tokens
Devstral 2 Mistral AI
$0.4 - $2 per 1M tokens
Magistral Medium Mistral AI
$2 - $5 per 1M tokens
Magistral Small Mistral AI
$0.5 - $1.5 per 1M tokens
Ministral 3 14B Mistral AI
$0.2 - $0.2 per 1M tokens
Ministral 3 3B Mistral AI
$0.1 - $0.1 per 1M tokens
Ministral 3 8B Mistral AI
$0.15 - $0.15 per 1M tokens
Mistral Large 3 Mistral AI
$0.5 - $1.5 per 1M tokens
Mistral Medium 3.5 Mistral AI
$1.5 - $7.5 per 1M tokens
Mistral NeMo This model Mistral AI
$0.15 - $0.15 per 1M tokens
Mistral Small 4 Mistral AI
$0.1 - $0.3 per 1M tokens
Mixtral 8x22B Mistral AI
$2 - $6 per 1M tokens
$0.1 - $0.4 per 1M tokens

About Mistral AI

Mistral AI is a company that specializes in artificial intelligence and machine learning solutions.

Industry: Artificial Intelligence
Company Size: 316
Location: Paris, FR
Website: mistral.ai
View Company Profile

Benchmark scores

How Mistral NeMo ranks on tracked AI benchmarks. Click any benchmark to see its full leaderboard.

MMLU
Knowledge
68.0%
TruthfulQA
Knowledge
50.3%
Compare across all benchmarks โ†’

Tools using Mistral NeMo

Last updated: April 6, 2026
0 AIs selected
Clear selection
#
Name
Task