Seed 1.5 VL
Overview
Seed 1.5 VL is a compact vision-language foundation model from ByteDance Seed. It pairs a 532M-parameter vision encoder with a Mixture-of-Experts (MoE) LLM that has 20B active parameters. The model targets image, video, and GUI understanding as well as multimodal reasoning, and reports state-of-the-art results on many benchmarks at low inference cost.
About ByteDance
ByteDance is a multinational technology company known for its content platforms, including TikTok and Douyin.