TAAFT
Free mode
100% free
Freemium
Free Trial
Deals

AIME 2025

American Invitational Mathematics Examination 2025

30 problems from the 2025 AIME I and II contests. High-school competition math with integer answers 0-999; valuable post-cutoff signal for 2024-trained models.

Math Text Accuracy Max 100.0% Released Feb 2025
37
Results
35
Models scored
100.0%
Top: Grok 4 Heavy
88.0%
Median

Best results

Top primary scores; one row per model.

Frontier over time

Each dot is one model result; the line traces the running best score.
Best score over time0.0025.050.075.0100.0Jan 2025Sep 2025May 2026

All results

Showing one canonical row per model. Show all configurations
# Model Score Conditions Eval date Source Flags
1 Grok 4 Heavy 100.0% CoT 09 Jul 2025 Self-reported Primary
2 GPT 5.2 Thinking 100.0% CoT 11 Dec 2025 Self-reported Primary
3 DeepSeek 3.2 Speciale 96.0% 01 Dec 2025 Paper Primary
4 Gemini 3 Flash (Thinking) 95.2% 17 Dec 2025 Self-reported Primary
5 Gemini 3 Pro 95.0% CoT 18 Nov 2025 Self-reported Primary
6 GPT 5.1 94.6% 0-shot · CoT 13 Nov 2025 Self-reported Primary
7 GPT 5.1 Thinking 94.6% CoT 12 Nov 2025 Self-reported Primary
8 GPT 5 (Thinking) 94.6% 07 Aug 2025 Self-reported Primary
9 GLM 4.6 93.9% CoT 30 Sep 2025 Self-reported Primary
10 Grok 3 Think 93.3% CoT · cons@64 19 Feb 2025 Self-reported Primary
11 Grok 3 Think 93.3% 18 Feb 2025 Self-reported Primary
12 Deepseek 3.2 93.1% 01 Dec 2025 Paper Primary
13 o4 mini 92.7% 16 Apr 2025 Self-reported Primary
14 Grok 4 91.7% CoT 09 Jul 2025 Self-reported Primary
15 DeepSeek V3.2 Exp 89.3% CoT 29 Sep 2025 Self-reported Primary
16 Nemotron 3 Nano 89.1% 15 Dec 2025 Self-reported Primary
17 o3 88.9% 16 Apr 2025 Self-reported Primary
18 DeepSeek V3.1 Terminus 88.4% 22 Sep 2025 Self-reported Primary
19 Gemini 2.5 Pro (Thinking) 88.0% 17 Dec 2025 Self-reported Primary
20 Claude Sonnet 4.5 87.0% 29 Sep 2025 Self-reported Primary
21 Gemini 2.5 Pro 86.7% CoT 17 May 2025 Self-reported Primary
22 Qwen3-235B-A22B 81.5% CoT 28 Apr 2025 Self-reported Primary
23 Qwen3 235B A22B 81.5% 28 Apr 2025 Self-reported Primary
24 GPT 5.5 Instant 81.2% 0-shot 05 May 2026 Self-reported Primary
25 Claude Haiku 4.5 80.7% 15 Oct 2025 Self-reported Primary
26 Claude Haiku 4.5 80.7% 15 Oct 2025 Self-reported Primary
27 o1 79.2% 16 Apr 2025 Self-reported Primary
28 Phi 4 reasoning plus 78.0% CoT 08 Jul 2025 Self-reported Primary
29 Gemini 2.5 Flash (Thinking) 72.0% 17 Dec 2025 Self-reported Primary
30 Qwen3 30B A3B 70.9% 28 Apr 2025 Self-reported Primary
31 Claude Sonnet 4 70.5% 22 May 2025 Self-reported Primary
32 R1 1776 70.0% 18 Feb 2025 Self-reported Primary
33 DeepSeek-R1 70.0% CoT 21 Jan 2025 Self-reported Primary
34 Magistral Medium 64.9% CoT 10 Jun 2025 Self-reported Primary
35 GPT 5 61.9% 07 Aug 2025 Self-reported Primary
36 Gemini 2.5 Flash-Lite 49.8% 26 Sep 2025 Self-reported Primary
37 Kimi K2 Instruct 49.5% 02 Jul 2025 Paper Primary
0 AIs selected
Clear selection
#
Name
Task