DocVQA
Document VQA over scanned business documents. Tests OCR-grounded reading.
Best results
Frontier over time
All results
| # | Model | Score | Conditions | Eval date | Source | Flags |
|---|---|---|---|---|---|---|
| 1 | Claude Sonnet 3.5 | 95.2% | 0-shot · standard | 20 Jun 2024 | Self-reported | |
| 2 | Llama 4 Maverick | 94.4% | — | 05 Apr 2025 | Self-reported | Primary |
| 3 | Llama 4 Scout | 94.4% | — | 05 Apr 2025 | Self-reported | Primary |
| 4 | Pixtral Large | 93.3% | — | 18 Nov 2024 | Self-reported | Primary |
| 5 | GPT-4o | 92.8% | — | 16 Apr 2025 | Self-reported | Primary |
| 6 | Gemini Ultra | 90.9% | 0-shot · standard | 06 Dec 2023 | Self-reported | |
| 7 | Claude Haiku 3 | 88.8% | 0-shot · standard | 04 Mar 2024 | Self-reported | |
| 8 | Pixtral 12B | 78.6% | — | 10 Oct 2024 | Self-reported | Primary |
