Marlin 2B
Overview
Marlin-2B is NemoStation’s open-source 2B video-language model for dense video captioning and natural-language temporal grounding.
About NemoStation
NemoStation is a video AI research and product lab focused on building small, grounded video understanding models that convert video data into structured, machine-readable information. Flagship product: Marlin-2B, a 2B-parameter video VLM fine-tuned on Qwen3.5-2B that produces dense scene/event captions with timestamps and resolves natural-language temporal queries. State-of-the-art in its weight class on CaReBench, DREAM-1K, and TimeLens-Bench. Also produces CaReBench, a video captioning benchm
Tools using Marlin 2B
No tools found for this model yet.
