Grok 4
xAI
- Reasoning / math / science
- Long context
xAI
0 shared benchmarks are still too close to call, so the win stays conditional. This compare uses all public sources, with provider-official evidence labeled separately.
Unknown
| Text Arena AR · rating Text · Chat / text | 1,41072.9% exact aliasverified runtime Row details
| 1,068n/a exact aliasverified runtimeContext only Row details
| n/a |