TAAFT
Free mode
100% free
Freemium
Free Trial
Deals

Voyage multimodal 3

voyage-multimodal-3 is a multimodal embedding model designed for RAG and semantic search across knowledge bases containing both text and visual data. Unlike CLIP-based models that process text and images through separate encoders, it runs all inputs through a single transformer backbone, enabling vectorization of interleaved text and images in a unified representation. This eliminates the modality gap that causes CLIP-based models to favor same-modality results over semantically relevant cross-modal ones.

The model supports screenshots of PDFs, slide decks, tables, charts, and figures, removing the need for heuristic-based parsing or OCR pipelines. In evaluations across 20 multimodal retrieval datasets, it outperforms OpenAI CLIP large and Cohere multimodal v3 by over 40% on table/figure retrieval and over 25% on document screenshot retrieval, while matching voyage-3 on pure text retrieval. Accessible via API with query and document input types for retrieval-optimized embeddings.
Multimodal Gen 3
Released: November 12, 2024

Overview

Multimodal embedding model that vectorizes interleaved text and images through a unified transformer encoder. Supports screenshots of PDFs, slides, tables, and figures without complex document parsing. Unlike CLIP-based models, eliminates the modality gap, enabling accurate mixed-modality retrieval across text and visual content.

About Voyage AI

Voyage AI provides best-in-class embedding models and rerankers for search and retrieval over unstructured data, used to power retrieval-augmented generation (RAG) and AI applications. It offers general-purpose, domain-specific (finance, legal, code) and company-specific fine-tuned models. Founded in 2023 and based in Palo Alto, the company was acquired by MongoDB, Inc. in February 2025 and now operates as a MongoDB subsidiary.

Industry: Artificial Intelligence
Location: Palo Alto, California, US
View Company Profile
Last updated: June 23, 2026
0 AIs selected
Clear selection
#
Name
Task