UABUnbiased AI BenchAI model rankings with source links.
Every score links back to its source.
Home/Models/qwen3-235b-a22b-instruct-2507
qwen3-235b-a22b-instruct-2507
Live · updated continuously
Models · /models/qwen3-235b-a22b-instruct-2507

qwen3-235b-a22b-instruct-2507

Qwen · Open weights · mid · registry tag 2026 benchmark-derived
text2 aliases2 official source links
Open compare
Last verified · May 13, 2026
Visible coverage · 6.5%
Verified coverage · 6.5%
Benchmark fit · n/a
Benchmark spread · n/a
Data version

Read this before trusting a headline.

Data version May 13, 2026Model list checked9 providers · 800 tracked modelsPage refreshed May 18, 2026

Model pages expose the current registry snapshot and page stamp so stale deployments are visible without reading the code.

Source-linked scores by benchmark

Each row keeps the benchmark source, source type, raw metric, and percentile inside its fair comparison set.

Thin verified coverageThis model currently reads as thin verified coverage across the resolved evidence surface.
Intelligence Index
AA · Chat / text · Composite
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
58.7% percentile inside its fair comparison set
25Raw benchmark value
Time to first token
AA · Chat / text · Speed / cost
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
44.3% percentile inside its fair comparison set
2.48sRaw benchmark value
Text Arena
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
81.7% percentile inside its fair comparison set
1,419Raw benchmark value

Source links and registry checks

official
Arena

May 13, 2026

source →
official
Artificial Analysis

May 13, 2026

source →