TAAFT
Free mode
100% free
Freemium
Free Trial
Deals

Penguin VL 2B

By Tencent
Penguin-VL pairs Qwen3 backbones with an LLM-based vision encoder approach (instead of contrastive-pretrained encoders) to preserve fine-grained visual cues for dense captioning and complex VLM reasoning, released as a 2B-scale variant with inference code and demos.
New Multimodal Gen 3
Released: March 12, 2026

Overview

Penguin-VL-2B is a compact vision-language model that uses an LLM-based vision encoder to push efficiency limits in multimodal reasoning.

About Tencent

Tencent is a world-leading internet and technology company that develops innovative products and services to improve the quality of life of people around the world.

Industry: Technology, Information and Media
Company Size: 110,000
Location: Shenzhen, CN
Website: tencent.com
View Company Profile

Tools using Penguin VL 2B

No tools found for this model yet.

Last updated: March 12, 2026
0 AIs selected
Clear selection
#
Name
Task