Gemini 2.5 Pro
Google · 56.5%
0255075100
If this date looks stale, you may be seeing an older build or cached deploy.
This view shows where public sources refuse to tell the same story. A wide score range is not noise to hide. It is the main fact.
Not a moral rating. A quick check on how inspectable each source is when you need to dispute the headline number.
| Arena verified | 11 | 793 | no | May 13, 2026 | 0 |
| LiveBench verified | 6 | 773 | yes | May 13, 2026 | 0 |
| Artificial Analysis verified | 7 | 638 | yes | May 13, 2026 | 1 |
| BridgeBench verified | 5 | 122 | no | May 13, 2026 | 0 |
| Scale Labs verified | 8 | 98 | no | May 13, 2026 | 0 |
| Terminal-Bench verified | 1 | 31 | no | May 13, 2026 | 0 |
| OpenCompass verified | 1 | 15 | no | May 13, 2026 | 0 |
| MTEB verified | 1 | 11 | no | May 13, 2026 | 0 |
| LLMBase relay | 0 | 0 | no | May 13, 2026 | 0 |