OPUS 4.8: $5 / $25 per Mtok
GPT-5.5: $5 / $30 per Mtok
GEMINI 2.5 PRO: $1.25 / $10 per Mtok
SONNET 4.6: $3 / $15 per Mtok
SWE-BENCH leader: GPT-5.5, 68.7%
MMLU-PRO leader: GPT-5.5, 94.2
GPQA leader: GPT-5.5, 78.3
AFTA v1.0 whitepaper live at /whitepaper

Public Leaderboards

Pointers to every live, public AI model leaderboard: LMSYS Chatbot Arena, Artificial Analysis, HF Open LLM Leaderboard, SWE-bench Verified, Aider Polyglot, LiveCodeBench, BigCodeBench, Terminal-Bench, ARC Prize, MMLU-Pro, HLE, MMMU, Video Arena, Image Arena, TTS Arena, Open ASR, RULER, GAIA, WebArena, and OSWorld. This page differs from /benchmark-registry, which lists the eval suites themselves; this is where to find the live rankings.

Machine-readable JSON: /api/public-leaderboards

For agents: same data at /api/public-leaderboards. Filter with ?domain=general|code|math|reasoning|multimodal|agent|voice|image|video|long-context|open-models. Free, cached 10 min.
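
As a rough illustration of the filter described above, the following Python sketch builds a request URL for /api/public-leaderboards and fetches the JSON. Several things here are assumptions rather than documented behavior: the site's base URL is not given on this page, so the caller must supply it; the function names `build_url` and `fetch_leaderboards` are hypothetical; the sketch assumes one `domain` value per request; and the shape of the returned JSON is not specified here, so it is returned unparsed beyond JSON decoding.

```python
import json
import urllib.parse
import urllib.request
from typing import Any, Optional

# Domain values listed on this page for the ?domain= filter.
DOMAINS = {
    "general", "code", "math", "reasoning", "multimodal", "agent",
    "voice", "image", "video", "long-context", "open-models",
}

# Endpoint path as given on this page.
API_PATH = "/api/public-leaderboards"


def build_url(base_url: str, domain: Optional[str] = None) -> str:
    """Return the endpoint URL, optionally filtered by a single domain.

    base_url is an assumption: the page does not state the site's host.
    """
    if domain is not None and domain not in DOMAINS:
        raise ValueError(f"unknown domain: {domain!r}")
    url = base_url.rstrip("/") + API_PATH
    if domain is not None:
        url += "?" + urllib.parse.urlencode({"domain": domain})
    return url


def fetch_leaderboards(base_url: str, domain: Optional[str] = None,
                       timeout: float = 10.0) -> Any:
    """Fetch and decode the JSON payload (response schema is not documented here)."""
    with urllib.request.urlopen(build_url(base_url, domain), timeout=timeout) as resp:
        return json.load(resp)
```

For example, `build_url("https://example.com", "code")` yields `https://example.com/api/public-leaderboards?domain=code`, where `example.com` is a placeholder host. Because responses are cached for 10 minutes, polling more often than that is unlikely to return fresher data.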