Grok models
Browse all models from this model family.
-
By xAIImage-to-video generation model that produces short clips from a starting image and text prompt. Generates 6-second, 720p videos in about 25 seconds with synchronized audio including sound effects, ambience, and dialogue. Available via xAI API as grok-imagine-video-1.5 and on grok.com/imagine.NewVideoReleased 10d ago
-
By xAIGrok 4.3 is xAI’s flagship large language model for high-end reasoning, coding, and agentic workflows. xAI positions it as its most truth-seeking model, with a 1,000,000-token context window and support across the broader xAI API stack for tools, reasoning, and advanced integrations.NewTextReleased 1mo ago
-
MultimodalReleased 4mo ago
-
By xAIGrok Imagine is xAI’s video-audio generative model, exposed through the Imagine API, that turns text or images into cinematic videos and supports text-to-image, text-to-video, image-to-video and rich video editing with strong instruction following, latency and cost performance.VideoReleased 4mo ago
-
By xAIGrok 4.1 is xAI’s reasoning model tuned for stronger analysis and coding with faster, more direct answers. It supports long context, tool and function calling, streaming, and clean JSON, making it a solid default for assistants, RAG, and agent workflows.MultimodalReleased 7mo ago
-
By xAIGrok 4.1 is xAI’s reasoning model tuned for stronger analysis and coding with faster, more direct answers. It supports long context, tool and function calling, streaming, and clean JSON, making it a solid default for assistants, RAG, and agent workflows.MultimodalReleased 7mo ago
-
By xAILing-flash-2.0 is a high-speed multilingual instruction model built for very low latency and high throughput. It supports long context, tool and function calling, and clean JSON outputs, which makes it ideal for live chat, voice assistants, and real-time automation.MultimodalReleased 8mo ago
-
By xAIGrok 4 Fast is xAI’s optimized variant of the Grok 4 family. It delivers the same reasoning backbone as Grok 4 Heavy but tuned for speed and efficiency, with lower latency and cost. It’s designed for high-throughput applications, real-time assistants, and workloads where responsiveness matters more than maximum depth.MultimodalReleased 9mo ago
-
By xAIGrok Code Fast 1 is xAI’s low-cost, high-speed coding model for “agentic” workflows. It offers a 256K-token context, function calling, structured outputs, and pricing at $0.20/M input, $1.50/M output ($0.02/M cached). Available via the xAI APITextReleased 11mo ago
-
By xAIGrok 4 Heavy is xAI’s most powerful Grok 4 variant—frontier-level reasoning with native tool use, real-time search, a 256K-token context window, and higher rate limits. It’s available via the SuperGrok Heavy tier and the xAI API.TextReleased 11mo ago
-
MultimodalReleased 11mo ago
-
By xAIGrok Image 2 is xAI’s fast vision-language model. It reads images with text, handles OCR and layout, explains charts and screenshots, and returns grounded answers or JSON with long context, tool calling, and streaming for real-time multimodal assistants.ImageReleased 1y ago
-
By xAIGrok 3 Think is xAI’s reasoning-tuned mode of Grok 3 that “thinks before responding” and can expose a step-by-step reasoning trace. It targets harder math, coding, and analysis, trading speed for depth. Built on the Grok-3 model (131,072-token context, function calling, structured outputs), it’s available via the xAI API and to X Premium+/SuperGrok usersTextReleased 1y ago
-
By xAIGrok 3 Mini is xAI’s lightweight reasoning model that “thinks before responding,” exposes raw reasoning traces, supports function calling and structured outputs, and offers a 131,072-token context window. It also has a grok-3-mini-fast serving option and costs $0.30/M input tokens and $0.50/M output tokens.TextReleased 2y ago
