AI benchmarks
Browse benchmarks tracking how AI models perform on reasoning, coding, multimodal and knowledge tasks. Each benchmark has its own leaderboard with the latest results from frontier models.
Cross-benchmark model leaderboard
Compare how every tracked model ranks across the headline benchmarks in one matrix.
ChartQA
Question answering over charts and plots, mixing extraction and visual-reasoning questions.
DocVQA
Document VQA over scanned business documents. Tests OCR-grounded reading.
MathVista
Mathematical reasoning over visual contexts: figures, charts, diagrams, geometric drawings.
MMMU
11.5k college-level questions across 30 subjects requiring image + text reasoning (charts, diagrams, medical scans, music notation, …).
MMMU-Pro
Harder MMMU variant: filters out text-only-solvable items and adds a vision-only setting where the question itself is rendered into the image.
