OmniVinci
Overview
OmniVinci is NVIDIA’s 9B omni-modal LLM that jointly understands images, video, audio, and text, achieving strong cross-modal reasoning with only about 0.2T training tokens.
About NVIDIA
Tools using OmniVinci
No tools found for this model yet.
