Vidi2
Overview
Vidi2 is ByteDance’s second-generation large multimodal video model for video understanding and creation. It adds fine-grained spatio-temporal grounding, long-video retrieval, and video question answering, so it can locate both the relevant time ranges and the object bounding boxes that match a natural-language query.
About ByteDance
ByteDance is a multinational technology company known for its content platforms, including TikTok and Douyin.