25 models
| | Model | | | | | | | |
|---|---|---|---|---|---|---|---|---|
| 1 | Claude Fable 5 (High) (Anthropic · Proprietary) | ▲ 13.68%±1.59% | ▲ 17.21%±2.59% | ▲ 27.74%±5.97% | ▲ 11.27%±3.24% | ▲ 10.16%±1.40% | ▲ 2.00%±0.24% | 0 |
| 2 | GPT 5.5 (xHigh) (OpenAI · Proprietary) | ▲ 11.03%±2.52% | ▲ 5.69%±4.32% | ▲ 27.74%±9.54% | ▲ 5.05%±5.05% | ▲ 14.67%±1.76% | ▲ 2.00%±0.24% | 9,153 |
| 3 | Claude Opus 4.8 (Thinking) (Anthropic · Proprietary) | ▲ 9.05%±1.34% | ▲ 10.12%±2.36% | ▲ 16.10%±4.84% | ▲ 9.34%±2.42% | ▲ 9.23%±1.22% | ▲ 0.44%±1.77% | 26,448 |
| 4 | Claude Opus 4.7 (Thinking) (Anthropic · Proprietary) | ▲ 8.45%±1.26% | ▲ 7.38%±2.57% | ▲ 11.31%±4.52% | ▲ 9.11%±2.53% | ▲ 12.51%±0.94% | ▲ 1.96%±0.24% | 28,786 |
| 5 | GPT 5.5 (High) (OpenAI · Proprietary) | ▲ 7.80%±1.10% | ▲ 5.64%±2.10% | ▲ 10.87%±3.91% | ▲ 7.10%±2.15% | ▲ 13.40%±1.14% | ▲ 2.00%±0.24% | 35,515 |
| 6 | Claude Opus 4.7 (Anthropic · Proprietary) | ▲ 7.68%±1.27% | ▲ 6.72%±2.58% | ▲ 9.01%±4.51% | ▲ 8.81%±2.57% | ▲ 11.90%±1.11% | ▲ 1.96%±0.24% | 28,850 |
| 7 | Claude Opus 4.6 (Anthropic · Proprietary) | ▲ 7.65%±1.26% | ▲ 7.75%±2.57% | ▲ 10.09%±4.20% | ▲ 8.03%±2.52% | ▲ 10.36%±1.76% | ▲ 2.00%±0.24% | 28,864 |
| 8 | GPT 5.4 (High) (OpenAI · Proprietary) | ▲ 6.80%±1.14% | ▲ 5.92%±2.16% | ▲ 8.45%±4.13% | ▲ 5.48%±2.23% | ▲ 12.14%±1.04% | ▲ 2.00%±0.24% | 35,682 |
| 9 | GPT 5.5 (OpenAI · Proprietary) | ▲ 6.57%±1.07% | ▲ 3.43%±2.06% | ▲ 7.77%±3.80% | ▲ 8.04%±2.02% | ▲ 11.60%±1.29% | ▲ 2.00%±0.24% | 35,852 |
| 10 | Claude Opus 4.8 (Anthropic · Proprietary) | ▲ 4.56%±1.66% | ▲ 7.99%±2.64% | ▲ 13.59%±5.27% | ▲ 6.86%±2.70% | ▲ 7.64%±1.54% | ▲ 13.27%±4.17% | 23,959 |
| 11 | Claude Sonnet 4.6 (Anthropic · Proprietary) | ▲ 3.02%±1.20% | ▲ 1.73%±2.70% | ▲ 2.40%±3.70% | ▲ 2.37%±2.39% | ▲ 11.38%±1.90% | ▲ 1.99%±0.24% | 28,832 |
| 12 | GLM 5.1 (Z.ai · Proprietary) | ▲ 2.88%±1.21% | ▲ 5.16%±2.46% | ▲ 1.22%±4.21% | ▲ 1.09%±2.46% | ▲ 4.94%±1.38% | ▲ 2.00%±0.24% | 30,770 |
| 13 | DeepSeek V4 Pro (DeepSeek · Proprietary) | ▲ 0.22%±1.18% | ▲ 2.42%±2.50% | ▲ 1.35%±4.12% | ▲ 2.74%±2.46% | ▲ 2.15%±1.10% | ▲ 0.61%±0.37% | 27,593 |
| 14 | Gemini 3.5 Flash (Google · Proprietary) | ▲ 0.06%±1.06% | ▲ 2.85%±2.38% | ▲ 1.65%±3.55% | ▲ 0.24%±2.02% | ▲ 3.08%±1.27% | ▲ 1.94%±0.24% | 28,666 |
| 15 | Kimi K2.6 (Moonshot · Proprietary) | ▲ 0.63%±1.09% | ▲ 0.62%±2.34% | ▲ 4.08%±3.66% | ▲ 3.14%±2.25% | ▲ 1.45%±1.49% | ▲ 2.00%±0.24% | 32,162 |
| 16 | Gemini 3.1 Pro Preview (Google · Proprietary) | ▲ 0.66%±0.96% | ▲ 0.57%±2.15% | ▲ 1.84%±3.07% | ▲ 2.09%±1.79% | ▲ 4.94%±1.59% | ▲ 1.97%±0.24% | 35,676 |
| 17 | DeepSeek V4 Flash (DeepSeek · Proprietary) | ▲ 0.80%±1.38% | ▲ 3.58%±2.68% | ▲ 0.85%±4.78% | ▲ 6.31%±2.85% | ▲ 0.93%±1.63% | ▲ 0.52%±0.41% | 30,903 |
| 18 | Qwen 3.6 Plus (Alibaba · Proprietary) | ▲ 4.52%±1.21% | ▲ 1.74%±2.51% | ▲ 8.04%±4.02% | ▲ 9.57%±2.61% | ▲ 1.79%±1.75% | ▲ 1.47%±0.59% | 30,235 |
| 19 | Grok Build 0.1 (xAI · Proprietary) | ▲ 6.28%±1.11% | ▲ 6.19%±2.62% | ▲ 12.45%±3.51% | ▲ 9.84%±2.37% | ▲ 1.93%±1.60% | ▲ 4.85%±0.69% | 25,614 |
| 20 | Nemotron 3 Ultra (Nvidia · Proprietary) | ▲ 6.81%±5.22% | ▲ 3.00%±8.86% | ▲ 2.58%±19.26% | ▲ 23.87%±10.36% | ▲ 11.75%±9.55% | ▲ 2.00%±0.24% | 3,484 |
Jun 12, 2026 · 667,524 sessions · 25 models