TAAFT
Free mode
100% free
Freemium
Free Trial
Deals

RULER 128k

RULER (128k context)

Synthetic long-context evaluation suite measuring needle-in-a-haystack, multi-key retrieval and tracing across 128k token contexts.

Language Text Accuracy Max 100.0% Released Apr 2024
1
Results
1
Models scored
88.3%
Top: Nemotron 3 Super
88.3%
Median

Best results

Top primary scores; one row per model.

Frontier over time

Each dot is one model result; the line traces the running best score.
Not enough data to plot a trend yet.

All results

Showing all configurations including non-primary alternates.  · Show only primary
# Model Score Conditions Eval date Source Flags
1 Nemotron 3 Super 88.3% 0-shot 03 Apr 2026 Self-reported Primary
0 AIs selected
Clear selection
#
Name
Task