Zamba2 VL 7B

Zamba2-VL-7B is Zyphra’s vision-language model built on the Zamba2 LLM architecture. It supports single-image and multi-image understanding, visual grounding, document and chart reasoning, OCR-style reading of text in images, and general image-text assistant workflows. According to Zyphra, the model is based on Zamba2-7B, uses a Qwen2.5-VL vision encoder, and was trained on 100B tokens of vision-text and text-only data. It is designed to keep compute and memory requirements low for efficient deployment. The model is released under the Apache 2.0 license.

Overview

Zamba2-VL-7B is Zyphra’s open 7B-class vision-language model for single-image and multi-image understanding, visual grounding, OCR, charts, documents, and on-device multimodal applications.

🔍 Image interpretation, 📄 Document analysis, 📜 OCR

About Zyphra AI

Superintelligence that Empowers.
Zyphra is a full-stack, open-source superintelligence company based in San Francisco, California.
Our mission is to build human-aligned AI that helps individuals and organizations reach their fullest potential.

Industry: Artificial Intelligence

Company Size: 67

Location: Palo Alto, CA, US

Website: zyphra.com

Last updated: June 11, 2026
