TAAFT
Free mode
100% free
Freemium
Free Trial
Deals

GLM 4.6V FlashX

By Z.ai
Model family: GLM
GLM-4.6V-FlashX is a multimodal large language model with a 128K token context window. It is the first visual model with native Function Call capability built directly into its architecture, allowing images and screenshots to be passed as tool parameters without text conversion, and enabling the model to visually interpret tool outputs. Supports toggleable reasoning modes. Key use cases include intelligent image-text content creation, visual web search and report generation, frontend replication, long-context document understanding, and multi-image agent workflows. It achieves SOTA visual understanding performance at its parameter scale across benchmarks including MMBench, MathVista, and OCRBench. This is the paid tier of the flash series, offering higher throughput and stability versus the free GLM-4.6V-Flash variant.
Multimodal Gen 3
Released: December 8, 2025

Overview

Multimodal LLM with a 128K token context window accepting image, video, text, and file inputs with text output. The paid tier of the GLM-4.6V-Flash series offering higher capacity and stability. First visual model to natively integrate Function Call capability, enabling direct multimodal tool use without intermediate text conversion.

About Z.ai

Z.ai (formerly Zhipu AI) is a Chinese AI company developing large language models (GLM series), combining reasoning, coding, and agent capabilities, and offering open models and APIs.

Industry: Artificial Intelligence
Company Size: 800
Location: Beijing, Beijing, CN
Website: chat.z.ai
View Company Profile
Last updated: June 22, 2026
0 AIs selected
Clear selection
#
Name
Task