Overview
Rerank 3.5 is a high-accuracy cross-encoder reranking model for search and RAG. It re-scores candidate passages or documents given a query, boosting precision at the top of the results and improving answer quality downstream.
Description
Rerank 3.5 sits after your first-stage retriever and makes finer-grained relevance judgments by jointly encoding the query with each candidate chunk. That cross-attention lets it catch subtle semantics—synonyms, paraphrase, discourse cues—and demote near-duplicates or off-topic snippets, so your top-k actually contains what the user needs. It’s optimized for fast batch inference and stable scoring, works well with cosine or raw logits, and handles long queries and multi-sentence passages without falling back to keyword bias. Teams typically pair it with an embedding retriever for recall, then let Rerank 3.5 sort the shortlist before generation. The result is higher click-through for search, better grounding and fewer hallucinations in RAG, and cleaner inputs for agents that depend on precise evidence.
About Cohere
Visually guide customers over phone or live chat with instant, no-download cobrowsing.
View Company ProfileRelated Models
Last updated: September 22, 2025