GLM models
Browse all models from this model family.
-
By Z.aiMultimodal coding foundation model from Z.AI that natively processes images, video, and text. Built for vision-based coding tasks including frontend recreation, GUI autonomous exploration, and code debugging. Supports long-horizon planning, function calling, and agent workflows with a 200K context window and optional thinking mode.MultimodalReleased 3mo ago
-
By Z.aiGLM 5 is Z.ai's 744B parameter open Mixture-of-Experts flagship model for agentic engineering, delivering frontier level coding, reasoning and long horizon agent performance while remaining Apache 2.0 licensed and optimized for Chinese and global hardware.TextReleased 4mo ago
-
By Z.aiGLM-4.7-FlashX is a lightweight, high-speed text model from Z.AI's GLM-4.7 series, optimized for high-throughput API use. It supports a 200K context window and up to 128K output tokens, with thinking mode, function calling, context caching, and structured JSON output.TextReleased 5mo ago
-
By Z.aiGLM-4.7-Flash is ZhipuAIโs 30B-class MoE reasoning and coding model with about 3B active parameters, a 200K context window, and strong tool-calling, tuned as a high-speed, lightweight variant of GLM-4.7 for local deployment and agentic coding.TextReleased 5mo ago
-
By Z.aiGLM-Image is Z.ai's open-source auto-regressive plus diffusion image model for dense-knowledge, high-fidelity text-to-image and image-to-image generation, with strong text rendering, layout control and identity-preserving edits.ImageReleased 5mo ago
-
By Z.aiGLM-4.7 is a large language model from Z.AI with enhanced coding, reasoning, and tool use capabilities. It features a 200K token context window, interleaved thinking mode, and strong performance on benchmarks including SWE-bench Verified (73.8%) and LiveCodeBench V6 (84.9%). Model weights are publicly available; the model is also accessible via API.TextReleased 6mo ago
-
By Z.aiOpen-source multimodal model series with native function calling. Available in 106B (cloud) and 9B Flash (local) variants. Features a 128K context window, supports image and video understanding, and can perform agentic tasks combining visual perception with direct tool execution, including web search and document-to-article generation.MultimodalReleased 6mo ago
-
By Z.aiMultimodal LLM with a 128K token context window accepting image, video, text, and file inputs with text output. The paid tier of the GLM-4.6V-Flash series offering higher capacity and stability. First visual model to natively integrate Function Call capability, enabling direct multimodal tool use without intermediate text conversion.MultimodalReleased 6mo ago
-
By Z.aiGLM 4.6 is Zhipu AIโs newest general-purpose model, improving on 4.5 with steadier reasoning, stronger coding, and smoother bilingual ChineseโEnglish performance. It supports long context, tool/function calling, structured JSON, streaming, and vision-language variantsโready for RAG, agents, and enterprise copilots.TextReleased 9mo ago
-
By Z.aiGLM-4.5V is a 106B parameter (12B active) MoE-based visual reasoning model supporting image, video, document, and GUI task understanding. It achieves open-source SOTA performance across 42 visual multimodal benchmarks and features a switchable Thinking Mode for balancing response speed against deep reasoning depth.MultimodalReleased 10mo ago
-
By Z.aiThe higher-accuracy GLM 4.5 tier for harder reasoning and coding. Very long context, precise tool calling, and strict JSON guarantees for production use.TextReleased 11mo ago
-
By Z.aiA balanced GLM 4.5 profile focused on efficiency. Strong instruction following, long context, reliable tool calls, and structured outputs for everyday copilots.TextReleased 11mo ago
-
By Z.aiA speed-tuned GLM 4.5 for low-latency assistants. Long context, tool and function calling, streaming, and clean JSON outputs with solid Chinese-English performance.TextReleased 11mo ago
-
By Z.aiGLM 4.5 is Zhipu AIโs flagship general-purpose model, tuned for strong reasoning, coding, and bilingual Chinese-English work. It supports long context, tool/function calling, reliable JSON output, streaming, and VL variants for image understandingโready for RAG, agents, and enterprise copilots.TextReleased 11mo ago
-
By Z.aiA 32B-parameter GLM-4 checkpoint with 128k context. Strong general reasoning and coding, tool and function calling, and clean JSON for long-form tasks.TextReleased 1y ago
-
Improved version with enhanced bilingual capabilities.TextReleased 2y ago
-
TextReleased 3y ago
