Seed 1.5 VL

Model family: Seed

Seed1.5-VL is ByteDance’s flagship vision-language model, pairing a 532M-parameter SeedViT vision encoder with a Mixture-of-Experts (MoE) LLM that has 20B active parameters. It handles images and videos of arbitrary aspect ratio, performs fine-grained grounding, OCR, and visual puzzle solving, and powers GUI agents for device control and gameplay, while matching top VLMs at much lower compute.

Overview

Seed1.5-VL is a compact vision-language foundation model from ByteDance Seed. It combines a 532M-parameter vision encoder with an MoE LLM that has 20B active parameters to deliver strong image, video, and GUI understanding and multimodal reasoning, with state-of-the-art results on many benchmarks at low inference cost.

🖼️Image generation 📷Images 🎥Videos

About ByteDance

ByteDance is a multinational technology company known for its content platforms, including TikTok and Douyin.

Industry: Internet

Company Size: 10001+

Location: Beijing, CN

Website: bytedance.com

Tools using Seed 1.5 VL

Seed by ByteDance v1.8

Advancing intelligence to serve humanity.

Task automation

🇨🇳 China
Released 8d ago
No pricing

Last updated: February 25, 2026
