Model profile · Qwen

Qwen3 32B

Open weightsbudget · registry tag 2026 open dense

Thin verified coverage

Reads as thin verified coverage across the resolved source data.

Visible coverage: 12.3%
Verified coverage: 12.3%
Spread: 59.9%
Last verified: Jun 20, 2026

47%bench fit

textcodedocument6 aliases25 official source links

Open compare

Data version

Current snapshot.

Data version Jun 20, 2026Model list checked9 providers · 1081 tracked modelsPage refreshed Jul 5, 2026

The registry snapshot and page stamp are shown so a stale deploy is visible at a glance.

Source-linked scores by benchmark

Each row keeps the benchmark source, source type, raw metric, and percentile inside its fair comparison set.

Thin verified coverageThis model currently reads as thin verified coverage across the resolved source data.

Chat / text28 benchmarks52.8%

Intelligence Index

AA · Chat / text · Combined

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #246 · Source label: Qwen3 32B (Non-reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: 9
Percentile: 38%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `intelligenceIndex`.

38% percentile inside its fair comparison set

9Raw benchmark value

AA-Omniscience accuracy

AA · Chat / text · Objective

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #152 · Source label: Qwen3 32B (Reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: 17.3%
Percentile: 49.7%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `omniscienceAccuracy`.

49.7% percentile inside its fair comparison set

17.3%Raw benchmark value

AA-Omniscience non-hallucination

AA · Chat / text · Objective

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #127 · Source label: Qwen3 32B (Reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: 18%
Percentile: 57.7%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `omniscienceNonHallucination`.

57.7% percentile inside its fair comparison set

18%Raw benchmark value

IFBench

AA · Chat / text · Objective

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #250 · Source label: Qwen3 32B (Non-reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: 31.5%
Percentile: 21.3%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `ifbench`.

21.3% percentile inside its fair comparison set

31.5%Raw benchmark value

Blended price

AA · Chat / text · Speed / cost

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #90 · Source label: Qwen3 32B (Reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: $0.3 /1M tokens
Percentile: 67.8%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `price1mBlended0To3To1`.

67.8% percentile inside its fair comparison set

$0.3 /1M tokensRaw benchmark value

Input price

AA · Chat / text · Speed / cost

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #93 · Source label: Qwen3 32B (Reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: $0.2 /1M input tokens
Percentile: 66.7%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `price1mInputTokens`.

66.7% percentile inside its fair comparison set

$0.2 /1M input tokensRaw benchmark value

Output price

AA · Chat / text · Speed / cost

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #90 · Source label: Qwen3 32B (Non-reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: $0.6 /1M output tokens
Percentile: 67.8%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `price1mOutputTokens`.

67.8% percentile inside its fair comparison set

$0.6 /1M output tokensRaw benchmark value

Output Speed

AA · Chat / text · Speed / cost

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #107 · Source label: Qwen3 32B (Non-reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: 93.6 tokens/s
Percentile: 49.5%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `medianOutputTokensPerSecond`.

49.5% percentile inside its fair comparison set

93.6 tokens/sRaw benchmark value

Time to first token

AA · Chat / text · Speed / cost

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #128 · Source label: Qwen3 32B (Non-reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: 2.61s
Percentile: 39.5%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `medianTimeToFirstTokenSeconds`.

39.5% percentile inside its fair comparison set

2.61sRaw benchmark value

Time to first answer token

AA · Chat / text · Speed / cost

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #137 · Source label: Qwen3 32B (Reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: 23.07s
Percentile: 35.2%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `medianTimeToFirstAnswerTokenSeconds`.

35.2% percentile inside its fair comparison set

23.07sRaw benchmark value

Text Arena

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #149

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,347
Percentile: 54.5%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: overall. Source rank: #178. Votes: 3926. Organization: alibaba. License: Apache 2.0.

54.5% percentile inside its fair comparison set

1,347Raw benchmark valueCI 1,338 - 1,357

Text Arena · Creative Writing

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #154

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,305
Percentile: 52.6%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: creative_writing. Source rank: #185. Votes: 616. Organization: alibaba. License: Apache 2.0.

52.6% percentile inside its fair comparison set

1,305Raw benchmark valueCI 1,282 - 1,327

Text Arena · English

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #150

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,366
Percentile: 54.2%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: english. Source rank: #178. Votes: 2230. Organization: alibaba. License: Apache 2.0.

54.2% percentile inside its fair comparison set

1,366Raw benchmark valueCI 1,353 - 1,378

Text Arena · Exclude Ties

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #148

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,307
Percentile: 54.8%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: exclude_ties. Source rank: #176. Votes: 2544. Organization: alibaba. License: Apache 2.0.

54.8% percentile inside its fair comparison set

1,307Raw benchmark valueCI 1,293 - 1,322

Text Arena · Hard Prompts

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #143

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,368
Percentile: 56.3%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: hard_prompts. Source rank: #172. Votes: 1207. Organization: alibaba. License: Apache 2.0.

56.3% percentile inside its fair comparison set

1,368Raw benchmark valueCI 1,351 - 1,384

Text Arena · Hard Prompts English

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #144

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,382
Percentile: 55.9%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: hard_prompts_english. Source rank: #172. Votes: 729. Organization: alibaba. License: Apache 2.0.

55.9% percentile inside its fair comparison set

1,382Raw benchmark valueCI 1,361 - 1,403

Text Arena · Instruction Following

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #148

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,332
Percentile: 54.8%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: instruction_following. Source rank: #178. Votes: 858. Organization: alibaba. License: Apache 2.0.

54.8% percentile inside its fair comparison set

1,332Raw benchmark valueCI 1,313 - 1,351

Text Arena · Longer Query

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #142

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,355
Percentile: 53.6%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: longer_query. Source rank: #171. Votes: 504. Organization: alibaba. License: Apache 2.0.

53.6% percentile inside its fair comparison set

1,355Raw benchmark valueCI 1,330 - 1,380

Text Arena · Multi Turn

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #151

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,338
Percentile: 53.6%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: multi_turn. Source rank: #182. Votes: 590. Organization: alibaba. License: Apache 2.0.

53.6% percentile inside its fair comparison set

1,338Raw benchmark valueCI 1,315 - 1,361

Text Arena · No Style Control

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #142

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,340
Percentile: 56.6%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: overall. Source rank: #168. Votes: 3926. Organization: alibaba. License: Apache 2.0.

56.6% percentile inside its fair comparison set

1,340Raw benchmark valueCI 1,331 - 1,349

Text Arena · Creative Writing · No Style Control

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #147

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,298
Percentile: 54.8%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: creative_writing. Source rank: #175. Votes: 616. Organization: alibaba. License: Apache 2.0.

54.8% percentile inside its fair comparison set

1,298Raw benchmark valueCI 1,276 - 1,320

Text Arena · English · No Style Control

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #143

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,359
Percentile: 56.3%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: english. Source rank: #169. Votes: 2230. Organization: alibaba. License: Apache 2.0.

56.3% percentile inside its fair comparison set

1,359Raw benchmark valueCI 1,346 - 1,371

Text Arena · Exclude Ties · No Style Control

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #142

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,297
Percentile: 56.6%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: exclude_ties. Source rank: #168. Votes: 2544. Organization: alibaba. License: Apache 2.0.

56.6% percentile inside its fair comparison set

1,297Raw benchmark valueCI 1,282 - 1,311

Text Arena · Hard Prompts · No Style Control

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #148

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,335
Percentile: 54.8%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: hard_prompts. Source rank: #175. Votes: 1207. Organization: alibaba. License: Apache 2.0.

54.8% percentile inside its fair comparison set

1,335Raw benchmark valueCI 1,319 - 1,351

Text Arena · Hard Prompts English · No Style Control

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #149

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,350
Percentile: 54.3%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: hard_prompts_english. Source rank: #176. Votes: 729. Organization: alibaba. License: Apache 2.0.

54.3% percentile inside its fair comparison set

1,350Raw benchmark valueCI 1,329 - 1,370

Text Arena · Instruction Following · No Style Control

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #152

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,305
Percentile: 53.5%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: instruction_following. Source rank: #181. Votes: 858. Organization: alibaba. License: Apache 2.0.

53.5% percentile inside its fair comparison set

1,305Raw benchmark valueCI 1,287 - 1,324

Text Arena · Longer Query · No Style Control

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #144

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,327
Percentile: 53%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: longer_query. Source rank: #172. Votes: 504. Organization: alibaba. License: Apache 2.0.

53% percentile inside its fair comparison set

1,327Raw benchmark valueCI 1,303 - 1,351

Text Arena · Multi Turn · No Style Control

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #149

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,331
Percentile: 54.2%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: multi_turn. Source rank: #177. Votes: 590. Organization: alibaba. License: Apache 2.0.

54.2% percentile inside its fair comparison set

1,331Raw benchmark valueCI 1,308 - 1,354

Coding4 benchmarks47.1%

Terminal-Bench Hard

AA · Coding · Objective

It tells you whether the model can generate, repair, and reason over code under evaluator pressure rather than marketing examples.

Rank #224 · Source label: Qwen3 32B (Reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: 3%
Percentile: 28.5%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `terminalbenchHard`.

28.5% percentile inside its fair comparison set

3%Raw benchmark value

SciCode

AA · Coding · Objective

It tells you whether the model can generate, repair, and reason over code under evaluator pressure rather than marketing examples.

Rank #198 · Source label: Qwen3 32B (Non-reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: 28%
Percentile: 46.5%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `scicode`.

46.5% percentile inside its fair comparison set

28%Raw benchmark value

Text Arena · Coding

AR · Coding · Human

It tells you whether the model can generate, repair, and reason over code under evaluator pressure rather than marketing examples.

Rank #134

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,408
Percentile: 58.4%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: coding. Source rank: #161. Votes: 513. Organization: alibaba. License: Apache 2.0.

58.4% percentile inside its fair comparison set

1,408Raw benchmark valueCI 1,384 - 1,432

Text Arena · Coding · No Style Control

AR · Coding · Human

It tells you whether the model can generate, repair, and reason over code under evaluator pressure rather than marketing examples.

Rank #145

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,358
Percentile: 55%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: coding. Source rank: #173. Votes: 513. Organization: alibaba. License: Apache 2.0.

55% percentile inside its fair comparison set

1,358Raw benchmark valueCI 1,334 - 1,382

Reasoning / math / science5 benchmarks55.3%

Humanity's Last Exam

AA · Reasoning / math / science · Objective

It is one of the cleaner reads on deliberate reasoning strength rather than style or popularity.

Rank #283 · Source label: Qwen3 32B (Non-reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: 4.3%
Percentile: 23.8%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `hle`.

23.8% percentile inside its fair comparison set

4.3%Raw benchmark value

GPQA

AA · Reasoning / math / science · Objective

It is one of the cleaner reads on deliberate reasoning strength rather than style or popularity.

Rank #230 · Source label: Qwen3 32B (Non-reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: 53.5%
Percentile: 38.8%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `gpqa`.

38.8% percentile inside its fair comparison set

53.5%Raw benchmark value

CritPt

AA · Reasoning / math / science · Objective

It is one of the cleaner reads on deliberate reasoning strength rather than style or popularity.

Rank #101 · Source label: Qwen3 32B (Reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: 0.3%
Percentile: 75.2%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `critpt`.

75.2% percentile inside its fair comparison set

0.3%Raw benchmark value

Text Arena · Math

AR · Reasoning / math / science · Human

It is one of the cleaner reads on deliberate reasoning strength rather than style or popularity.

Rank #98

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,399
Percentile: 69.1%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: math. Source rank: #121. Votes: 316. Organization: alibaba. License: Apache 2.0.

69.1% percentile inside its fair comparison set

1,399Raw benchmark valueCI 1,369 - 1,429

Text Arena · Math · No Style Control

AR · Reasoning / math / science · Human

It is one of the cleaner reads on deliberate reasoning strength rather than style or popularity.

Rank #97

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,398
Percentile: 69.4%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: math. Source rank: #116. Votes: 316. Organization: alibaba. License: Apache 2.0.

69.4% percentile inside its fair comparison set

1,398Raw benchmark valueCI 1,368 - 1,428

Professional reasoning18 benchmarks56.1%

Text Arena · Expert

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena expert leaderboard.

Rank #112

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,398
Percentile: 59.6%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: expert. Source rank: #137. Votes: 236. Organization: alibaba. License: Apache 2.0.

59.6% percentile inside its fair comparison set

1,398Raw benchmark valueCI 1,360 - 1,435

Text Arena · Industry Business And Management And Financial Operations

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_business_and_management_and_financial_operations leaderboard.

Rank #156

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,340
Percentile: 51.3%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: industry_business_and_management_and_financial_operations. Source rank: #184. Votes: 437. Organization: alibaba. License: Apache 2.0.

51.3% percentile inside its fair comparison set

1,340Raw benchmark valueCI 1,314 - 1,366

Text Arena · Industry Entertainment And Sports And Media

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_entertainment_and_sports_and_media leaderboard.

Rank #173

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,286
Percentile: 46.7%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: industry_entertainment_and_sports_and_media. Source rank: #208. Votes: 704. Organization: alibaba. License: Apache 2.0.

46.7% percentile inside its fair comparison set

1,286Raw benchmark valueCI 1,266 - 1,307

Text Arena · Industry Legal And Government

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_legal_and_government leaderboard.

Rank #188

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,326
Percentile: 37.2%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: industry_legal_and_government. Source rank: #222. Votes: 285. Organization: alibaba. License: Apache 2.0.

37.2% percentile inside its fair comparison set

1,326Raw benchmark valueCI 1,294 - 1,359

Text Arena · Industry Life And Physical And Social Science

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_life_and_physical_and_social_science leaderboard.

Rank #133

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,378
Percentile: 59.1%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: industry_life_and_physical_and_social_science. Source rank: #160. Votes: 798. Organization: alibaba. License: Apache 2.0.

59.1% percentile inside its fair comparison set

1,378Raw benchmark valueCI 1,358 - 1,399

Text Arena · Industry Mathematical

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_mathematical leaderboard.

Rank #71

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,426
Percentile: 77.3%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: industry_mathematical. Source rank: #86. Votes: 282. Organization: alibaba. License: Apache 2.0.

77.3% percentile inside its fair comparison set

1,426Raw benchmark valueCI 1,394 - 1,458

Text Arena · Industry Medicine And Healthcare

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_medicine_and_healthcare leaderboard.

Rank #135

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,378
Percentile: 54.6%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: industry_medicine_and_healthcare. Source rank: #162. Votes: 212. Organization: alibaba. License: Apache 2.0.

54.6% percentile inside its fair comparison set

1,378Raw benchmark valueCI 1,339 - 1,418

Text Arena · Industry Software And It Services

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_software_and_it_services leaderboard.

Rank #135

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,399
Percentile: 58.8%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: industry_software_and_it_services. Source rank: #161. Votes: 843. Organization: alibaba. License: Apache 2.0.

58.8% percentile inside its fair comparison set

1,399Raw benchmark valueCI 1,379 - 1,418

Text Arena · Industry Writing And Literature And Language

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_writing_and_literature_and_language leaderboard.

Rank #170

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,308
Percentile: 47.8%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: industry_writing_and_literature_and_language. Source rank: #203. Votes: 1121. Organization: alibaba. License: Apache 2.0.

47.8% percentile inside its fair comparison set

1,308Raw benchmark valueCI 1,291 - 1,325

Text Arena · Expert · No Style Control

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena expert leaderboard.

Rank #115

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,362
Percentile: 58.5%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: expert. Source rank: #140. Votes: 236. Organization: alibaba. License: Apache 2.0.

58.5% percentile inside its fair comparison set

1,362Raw benchmark valueCI 1,325 - 1,399

Text Arena · Industry Business And Management And Financial Operations · No Style Control

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_business_and_management_and_financial_operations leaderboard.

Rank #149

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,318
Percentile: 53.5%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: industry_business_and_management_and_financial_operations. Source rank: #175. Votes: 437. Organization: alibaba. License: Apache 2.0.

53.5% percentile inside its fair comparison set

1,318Raw benchmark valueCI 1,292 - 1,344

Text Arena · Industry Entertainment And Sports And Media · No Style Control

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_entertainment_and_sports_and_media leaderboard.

Rank #159

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,279
Percentile: 51.1%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: industry_entertainment_and_sports_and_media. Source rank: #188. Votes: 704. Organization: alibaba. License: Apache 2.0.

51.1% percentile inside its fair comparison set

1,279Raw benchmark valueCI 1,259 - 1,300

Text Arena · Industry Legal And Government · No Style Control

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_legal_and_government leaderboard.

Rank #160

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,325
Percentile: 46.6%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: industry_legal_and_government. Source rank: #187. Votes: 285. Organization: alibaba. License: Apache 2.0.

46.6% percentile inside its fair comparison set

1,325Raw benchmark valueCI 1,293 - 1,357

Text Arena · Industry Life And Physical And Social Science · No Style Control

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_life_and_physical_and_social_science leaderboard.

Rank #123

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,372
Percentile: 62.2%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: industry_life_and_physical_and_social_science. Source rank: #147. Votes: 798. Organization: alibaba. License: Apache 2.0.

62.2% percentile inside its fair comparison set

1,372Raw benchmark valueCI 1,352 - 1,392

Text Arena · Industry Mathematical · No Style Control

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_mathematical leaderboard.

Rank #76

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,419
Percentile: 75.6%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: industry_mathematical. Source rank: #91. Votes: 282. Organization: alibaba. License: Apache 2.0.

75.6% percentile inside its fair comparison set

1,419Raw benchmark valueCI 1,388 - 1,451

Text Arena · Industry Medicine And Healthcare · No Style Control

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_medicine_and_healthcare leaderboard.

Rank #121

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,368
Percentile: 59.3%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: industry_medicine_and_healthcare. Source rank: #145. Votes: 212. Organization: alibaba. License: Apache 2.0.

59.3% percentile inside its fair comparison set

1,368Raw benchmark valueCI 1,329 - 1,406

Text Arena · Industry Software And It Services · No Style Control

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_software_and_it_services leaderboard.

Rank #136

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,367
Percentile: 58.5%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: industry_software_and_it_services. Source rank: #162. Votes: 843. Organization: alibaba. License: Apache 2.0.

58.5% percentile inside its fair comparison set

1,367Raw benchmark valueCI 1,348 - 1,386

Text Arena · Industry Writing And Literature And Language · No Style Control

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_writing_and_literature_and_language leaderboard.

Rank #158

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,298
Percentile: 51.5%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: industry_writing_and_literature_and_language. Source rank: #186. Votes: 1121. Organization: alibaba. License: Apache 2.0.

51.5% percentile inside its fair comparison set

1,298Raw benchmark valueCI 1,282 - 1,315

Search / tool use1 benchmark43.4%

Tau2-Bench Telecom

AA · Search / tool use · Objective

It matters when the model must browse, call tools, and recover useful answers from external systems.

Rank #176 · Source label: Qwen3 32B (Reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: 29.8%
Percentile: 43.4%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `tau2`.

43.4% percentile inside its fair comparison set

29.8%Raw benchmark value

Long context1 benchmark15.2%

Long Context Reasoning

AA · Long context · Objective

It checks whether long-context claims survive contact with retrieval, memory, or long-document tasks.

Rank #307 · Source label: Qwen3 32B (Reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: 0%
Percentile: 15.2%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `lcr`.

15.2% percentile inside its fair comparison set

0%Raw benchmark value

Multilingual6 benchmarks50.4%

Text Arena · Chinese

AR · Multilingual · Human

Observed user preference in Arena's Text Arena chinese leaderboard.

Rank #146

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,365
Percentile: 50.8%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: chinese. Source rank: #175. Votes: 187. Organization: alibaba. License: Apache 2.0.

50.8% percentile inside its fair comparison set

1,365Raw benchmark valueCI 1,326 - 1,404

Text Arena · German

AR · Multilingual · Human

Observed user preference in Arena's Text Arena german leaderboard.

Rank #124

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,333
Percentile: 48.1%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: german. Source rank: #150. Votes: 172. Organization: alibaba. License: Apache 2.0.

48.1% percentile inside its fair comparison set

1,333Raw benchmark valueCI 1,292 - 1,375

Text Arena · Russian

AR · Multilingual · Human

Observed user preference in Arena's Text Arena russian leaderboard.

Rank #152

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,328
Percentile: 47.8%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: russian. Source rank: #182. Votes: 412. Organization: alibaba. License: Apache 2.0.

47.8% percentile inside its fair comparison set

1,328Raw benchmark valueCI 1,303 - 1,354

Text Arena · Chinese · No Style Control

AR · Multilingual · Human

Observed user preference in Arena's Text Arena chinese leaderboard.

Rank #132

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,358
Percentile: 55.6%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: chinese. Source rank: #158. Votes: 187. Organization: alibaba. License: Apache 2.0.

55.6% percentile inside its fair comparison set

1,358Raw benchmark valueCI 1,320 - 1,395

Text Arena · German · No Style Control

AR · Multilingual · Human

Observed user preference in Arena's Text Arena german leaderboard.

Rank #116

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,342
Percentile: 51.5%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: german. Source rank: #139. Votes: 172. Organization: alibaba. License: Apache 2.0.

51.5% percentile inside its fair comparison set

1,342Raw benchmark valueCI 1,301 - 1,382

Text Arena · Russian · No Style Control

AR · Multilingual · Human

Observed user preference in Arena's Text Arena russian leaderboard.

Rank #149

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,311
Percentile: 48.8%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: russian. Source rank: #176. Votes: 412. Organization: alibaba. License: Apache 2.0.

48.8% percentile inside its fair comparison set

1,311Raw benchmark valueCI 1,286 - 1,336

Source links and registry checks

official

Qwen3 announcement

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Artificial Analysis

Jun 20, 2026

source →

Model profile · Qwen

Qwen3 32B

Open weightsbudget · registry tag 2026 open dense

Thin verified coverage

Reads as thin verified coverage across the resolved source data.

Visible coverage: 12.3%
Verified coverage: 12.3%
Spread: 59.9%
Last verified: Jun 20, 2026

47%bench fit

textcodedocument6 aliases25 official source links

Open compare

Data version

Current snapshot.

Data version Jun 20, 2026Model list checked9 providers · 1081 tracked modelsPage refreshed Jul 5, 2026

The registry snapshot and page stamp are shown so a stale deploy is visible at a glance.

Source-linked scores by benchmark

Each row keeps the benchmark source, source type, raw metric, and percentile inside its fair comparison set.

Thin verified coverageThis model currently reads as thin verified coverage across the resolved source data.

Chat / text28 benchmarks52.8%

Intelligence Index

AA · Chat / text · Combined

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #246 · Source label: Qwen3 32B (Non-reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: 9
Percentile: 38%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `intelligenceIndex`.

38% percentile inside its fair comparison set

9Raw benchmark value

AA-Omniscience accuracy

AA · Chat / text · Objective

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #152 · Source label: Qwen3 32B (Reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: 17.3%
Percentile: 49.7%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `omniscienceAccuracy`.

49.7% percentile inside its fair comparison set

17.3%Raw benchmark value

AA-Omniscience non-hallucination

AA · Chat / text · Objective

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #127 · Source label: Qwen3 32B (Reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: 18%
Percentile: 57.7%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `omniscienceNonHallucination`.

57.7% percentile inside its fair comparison set

18%Raw benchmark value

IFBench

AA · Chat / text · Objective

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #250 · Source label: Qwen3 32B (Non-reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: 31.5%
Percentile: 21.3%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `ifbench`.

21.3% percentile inside its fair comparison set

31.5%Raw benchmark value

Blended price

AA · Chat / text · Speed / cost

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #90 · Source label: Qwen3 32B (Reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: $0.3 /1M tokens
Percentile: 67.8%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `price1mBlended0To3To1`.

67.8% percentile inside its fair comparison set

$0.3 /1M tokensRaw benchmark value

Input price

AA · Chat / text · Speed / cost

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #93 · Source label: Qwen3 32B (Reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: $0.2 /1M input tokens
Percentile: 66.7%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `price1mInputTokens`.

66.7% percentile inside its fair comparison set

$0.2 /1M input tokensRaw benchmark value

Output price

AA · Chat / text · Speed / cost

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #90 · Source label: Qwen3 32B (Non-reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: $0.6 /1M output tokens
Percentile: 67.8%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `price1mOutputTokens`.

67.8% percentile inside its fair comparison set

$0.6 /1M output tokensRaw benchmark value

Output Speed

AA · Chat / text · Speed / cost

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #107 · Source label: Qwen3 32B (Non-reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: 93.6 tokens/s
Percentile: 49.5%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `medianOutputTokensPerSecond`.

49.5% percentile inside its fair comparison set

93.6 tokens/sRaw benchmark value

Time to first token

AA · Chat / text · Speed / cost

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #128 · Source label: Qwen3 32B (Non-reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: 2.61s
Percentile: 39.5%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `medianTimeToFirstTokenSeconds`.

39.5% percentile inside its fair comparison set

2.61sRaw benchmark value

Time to first answer token

AA · Chat / text · Speed / cost

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #137 · Source label: Qwen3 32B (Reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: 23.07s
Percentile: 35.2%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `medianTimeToFirstAnswerTokenSeconds`.

35.2% percentile inside its fair comparison set

23.07sRaw benchmark value

Text Arena

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #149

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,347
Percentile: 54.5%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: overall. Source rank: #178. Votes: 3926. Organization: alibaba. License: Apache 2.0.

54.5% percentile inside its fair comparison set

1,347Raw benchmark valueCI 1,338 - 1,357

Text Arena · Creative Writing

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #154

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,305
Percentile: 52.6%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: creative_writing. Source rank: #185. Votes: 616. Organization: alibaba. License: Apache 2.0.

52.6% percentile inside its fair comparison set

1,305Raw benchmark valueCI 1,282 - 1,327

Text Arena · English

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #150

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,366
Percentile: 54.2%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: english. Source rank: #178. Votes: 2230. Organization: alibaba. License: Apache 2.0.

54.2% percentile inside its fair comparison set

1,366Raw benchmark valueCI 1,353 - 1,378

Text Arena · Exclude Ties

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #148

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,307
Percentile: 54.8%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: exclude_ties. Source rank: #176. Votes: 2544. Organization: alibaba. License: Apache 2.0.

54.8% percentile inside its fair comparison set

1,307Raw benchmark valueCI 1,293 - 1,322

Text Arena · Hard Prompts

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #143

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,368
Percentile: 56.3%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: hard_prompts. Source rank: #172. Votes: 1207. Organization: alibaba. License: Apache 2.0.

56.3% percentile inside its fair comparison set

1,368Raw benchmark valueCI 1,351 - 1,384

Text Arena · Hard Prompts English

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #144

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,382
Percentile: 55.9%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: hard_prompts_english. Source rank: #172. Votes: 729. Organization: alibaba. License: Apache 2.0.

55.9% percentile inside its fair comparison set

1,382Raw benchmark valueCI 1,361 - 1,403

Text Arena · Instruction Following

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #148

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,332
Percentile: 54.8%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: instruction_following. Source rank: #178. Votes: 858. Organization: alibaba. License: Apache 2.0.

54.8% percentile inside its fair comparison set

1,332Raw benchmark valueCI 1,313 - 1,351

Text Arena · Longer Query

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #142

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,355
Percentile: 53.6%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: longer_query. Source rank: #171. Votes: 504. Organization: alibaba. License: Apache 2.0.

53.6% percentile inside its fair comparison set

1,355Raw benchmark valueCI 1,330 - 1,380

Text Arena · Multi Turn

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #151

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,338
Percentile: 53.6%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: multi_turn. Source rank: #182. Votes: 590. Organization: alibaba. License: Apache 2.0.

53.6% percentile inside its fair comparison set

1,338Raw benchmark valueCI 1,315 - 1,361

Text Arena · No Style Control

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #142

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,340
Percentile: 56.6%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: overall. Source rank: #168. Votes: 3926. Organization: alibaba. License: Apache 2.0.

56.6% percentile inside its fair comparison set

1,340Raw benchmark valueCI 1,331 - 1,349

Text Arena · Creative Writing · No Style Control

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #147

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,298
Percentile: 54.8%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: creative_writing. Source rank: #175. Votes: 616. Organization: alibaba. License: Apache 2.0.

54.8% percentile inside its fair comparison set

1,298Raw benchmark valueCI 1,276 - 1,320

Text Arena · English · No Style Control

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #143

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,359
Percentile: 56.3%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: english. Source rank: #169. Votes: 2230. Organization: alibaba. License: Apache 2.0.

56.3% percentile inside its fair comparison set

1,359Raw benchmark valueCI 1,346 - 1,371

Text Arena · Exclude Ties · No Style Control

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #142

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,297
Percentile: 56.6%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: exclude_ties. Source rank: #168. Votes: 2544. Organization: alibaba. License: Apache 2.0.

56.6% percentile inside its fair comparison set

1,297Raw benchmark valueCI 1,282 - 1,311

Text Arena · Hard Prompts · No Style Control

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #148

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,335
Percentile: 54.8%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: hard_prompts. Source rank: #175. Votes: 1207. Organization: alibaba. License: Apache 2.0.

54.8% percentile inside its fair comparison set

1,335Raw benchmark valueCI 1,319 - 1,351

Text Arena · Hard Prompts English · No Style Control

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #149

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,350
Percentile: 54.3%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: hard_prompts_english. Source rank: #176. Votes: 729. Organization: alibaba. License: Apache 2.0.

54.3% percentile inside its fair comparison set

1,350Raw benchmark valueCI 1,329 - 1,370

Text Arena · Instruction Following · No Style Control

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #152

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,305
Percentile: 53.5%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: instruction_following. Source rank: #181. Votes: 858. Organization: alibaba. License: Apache 2.0.

53.5% percentile inside its fair comparison set

1,305Raw benchmark valueCI 1,287 - 1,324

Text Arena · Longer Query · No Style Control

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #144

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,327
Percentile: 53%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: longer_query. Source rank: #172. Votes: 504. Organization: alibaba. License: Apache 2.0.

53% percentile inside its fair comparison set

1,327Raw benchmark valueCI 1,303 - 1,351

Text Arena · Multi Turn · No Style Control

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #149

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,331
Percentile: 54.2%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: multi_turn. Source rank: #177. Votes: 590. Organization: alibaba. License: Apache 2.0.

54.2% percentile inside its fair comparison set

1,331Raw benchmark valueCI 1,308 - 1,354

Coding4 benchmarks47.1%

Terminal-Bench Hard

AA · Coding · Objective

It tells you whether the model can generate, repair, and reason over code under evaluator pressure rather than marketing examples.

Rank #224 · Source label: Qwen3 32B (Reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: 3%
Percentile: 28.5%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `terminalbenchHard`.

28.5% percentile inside its fair comparison set

3%Raw benchmark value

SciCode

AA · Coding · Objective

It tells you whether the model can generate, repair, and reason over code under evaluator pressure rather than marketing examples.

Rank #198 · Source label: Qwen3 32B (Non-reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: 28%
Percentile: 46.5%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `scicode`.

46.5% percentile inside its fair comparison set

28%Raw benchmark value

Text Arena · Coding

AR · Coding · Human

It tells you whether the model can generate, repair, and reason over code under evaluator pressure rather than marketing examples.

Rank #134

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,408
Percentile: 58.4%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: coding. Source rank: #161. Votes: 513. Organization: alibaba. License: Apache 2.0.

58.4% percentile inside its fair comparison set

1,408Raw benchmark valueCI 1,384 - 1,432

Text Arena · Coding · No Style Control

AR · Coding · Human

It tells you whether the model can generate, repair, and reason over code under evaluator pressure rather than marketing examples.

Rank #145

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,358
Percentile: 55%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: coding. Source rank: #173. Votes: 513. Organization: alibaba. License: Apache 2.0.

55% percentile inside its fair comparison set

1,358Raw benchmark valueCI 1,334 - 1,382

Reasoning / math / science5 benchmarks55.3%

Humanity's Last Exam

AA · Reasoning / math / science · Objective

It is one of the cleaner reads on deliberate reasoning strength rather than style or popularity.

Rank #283 · Source label: Qwen3 32B (Non-reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: 4.3%
Percentile: 23.8%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `hle`.

23.8% percentile inside its fair comparison set

4.3%Raw benchmark value

GPQA

AA · Reasoning / math / science · Objective

It is one of the cleaner reads on deliberate reasoning strength rather than style or popularity.

Rank #230 · Source label: Qwen3 32B (Non-reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: 53.5%
Percentile: 38.8%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `gpqa`.

38.8% percentile inside its fair comparison set

53.5%Raw benchmark value

CritPt

AA · Reasoning / math / science · Objective

It is one of the cleaner reads on deliberate reasoning strength rather than style or popularity.

Rank #101 · Source label: Qwen3 32B (Reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: 0.3%
Percentile: 75.2%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `critpt`.

75.2% percentile inside its fair comparison set

0.3%Raw benchmark value

Text Arena · Math

AR · Reasoning / math / science · Human

It is one of the cleaner reads on deliberate reasoning strength rather than style or popularity.

Rank #98

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,399
Percentile: 69.1%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: math. Source rank: #121. Votes: 316. Organization: alibaba. License: Apache 2.0.

69.1% percentile inside its fair comparison set

1,399Raw benchmark valueCI 1,369 - 1,429

Text Arena · Math · No Style Control

AR · Reasoning / math / science · Human

It is one of the cleaner reads on deliberate reasoning strength rather than style or popularity.

Rank #97

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,398
Percentile: 69.4%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: math. Source rank: #116. Votes: 316. Organization: alibaba. License: Apache 2.0.

69.4% percentile inside its fair comparison set

1,398Raw benchmark valueCI 1,368 - 1,428

Professional reasoning18 benchmarks56.1%

Text Arena · Expert

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena expert leaderboard.

Rank #112

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,398
Percentile: 59.6%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: expert. Source rank: #137. Votes: 236. Organization: alibaba. License: Apache 2.0.

59.6% percentile inside its fair comparison set

1,398Raw benchmark valueCI 1,360 - 1,435

Text Arena · Industry Business And Management And Financial Operations

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_business_and_management_and_financial_operations leaderboard.

Rank #156

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,340
Percentile: 51.3%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: industry_business_and_management_and_financial_operations. Source rank: #184. Votes: 437. Organization: alibaba. License: Apache 2.0.

51.3% percentile inside its fair comparison set

1,340Raw benchmark valueCI 1,314 - 1,366

Text Arena · Industry Entertainment And Sports And Media

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_entertainment_and_sports_and_media leaderboard.

Rank #173

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,286
Percentile: 46.7%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: industry_entertainment_and_sports_and_media. Source rank: #208. Votes: 704. Organization: alibaba. License: Apache 2.0.

46.7% percentile inside its fair comparison set

1,286Raw benchmark valueCI 1,266 - 1,307

Text Arena · Industry Legal And Government

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_legal_and_government leaderboard.

Rank #188

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,326
Percentile: 37.2%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: industry_legal_and_government. Source rank: #222. Votes: 285. Organization: alibaba. License: Apache 2.0.

37.2% percentile inside its fair comparison set

1,326Raw benchmark valueCI 1,294 - 1,359

Text Arena · Industry Life And Physical And Social Science

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_life_and_physical_and_social_science leaderboard.

Rank #133

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,378
Percentile: 59.1%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: industry_life_and_physical_and_social_science. Source rank: #160. Votes: 798. Organization: alibaba. License: Apache 2.0.

59.1% percentile inside its fair comparison set

1,378Raw benchmark valueCI 1,358 - 1,399

Text Arena · Industry Mathematical

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_mathematical leaderboard.

Rank #71

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,426
Percentile: 77.3%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: industry_mathematical. Source rank: #86. Votes: 282. Organization: alibaba. License: Apache 2.0.

77.3% percentile inside its fair comparison set

1,426Raw benchmark valueCI 1,394 - 1,458

Text Arena · Industry Medicine And Healthcare

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_medicine_and_healthcare leaderboard.

Rank #135

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,378
Percentile: 54.6%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: industry_medicine_and_healthcare. Source rank: #162. Votes: 212. Organization: alibaba. License: Apache 2.0.

54.6% percentile inside its fair comparison set

1,378Raw benchmark valueCI 1,339 - 1,418

Text Arena · Industry Software And It Services

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_software_and_it_services leaderboard.

Rank #135

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,399
Percentile: 58.8%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: industry_software_and_it_services. Source rank: #161. Votes: 843. Organization: alibaba. License: Apache 2.0.

58.8% percentile inside its fair comparison set

1,399Raw benchmark valueCI 1,379 - 1,418

Text Arena · Industry Writing And Literature And Language

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_writing_and_literature_and_language leaderboard.

Rank #170

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,308
Percentile: 47.8%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: industry_writing_and_literature_and_language. Source rank: #203. Votes: 1121. Organization: alibaba. License: Apache 2.0.

47.8% percentile inside its fair comparison set

1,308Raw benchmark valueCI 1,291 - 1,325

Text Arena · Expert · No Style Control

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena expert leaderboard.

Rank #115

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,362
Percentile: 58.5%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: expert. Source rank: #140. Votes: 236. Organization: alibaba. License: Apache 2.0.

58.5% percentile inside its fair comparison set

1,362Raw benchmark valueCI 1,325 - 1,399

Text Arena · Industry Business And Management And Financial Operations · No Style Control

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_business_and_management_and_financial_operations leaderboard.

Rank #149

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,318
Percentile: 53.5%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: industry_business_and_management_and_financial_operations. Source rank: #175. Votes: 437. Organization: alibaba. License: Apache 2.0.

53.5% percentile inside its fair comparison set

1,318Raw benchmark valueCI 1,292 - 1,344

Text Arena · Industry Entertainment And Sports And Media · No Style Control

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_entertainment_and_sports_and_media leaderboard.

Rank #159

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,279
Percentile: 51.1%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: industry_entertainment_and_sports_and_media. Source rank: #188. Votes: 704. Organization: alibaba. License: Apache 2.0.

51.1% percentile inside its fair comparison set

1,279Raw benchmark valueCI 1,259 - 1,300

Text Arena · Industry Legal And Government · No Style Control

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_legal_and_government leaderboard.

Rank #160

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,325
Percentile: 46.6%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: industry_legal_and_government. Source rank: #187. Votes: 285. Organization: alibaba. License: Apache 2.0.

46.6% percentile inside its fair comparison set

1,325Raw benchmark valueCI 1,293 - 1,357

Text Arena · Industry Life And Physical And Social Science · No Style Control

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_life_and_physical_and_social_science leaderboard.

Rank #123

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,372
Percentile: 62.2%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: industry_life_and_physical_and_social_science. Source rank: #147. Votes: 798. Organization: alibaba. License: Apache 2.0.

62.2% percentile inside its fair comparison set

1,372Raw benchmark valueCI 1,352 - 1,392

Text Arena · Industry Mathematical · No Style Control

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_mathematical leaderboard.

Rank #76

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,419
Percentile: 75.6%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: industry_mathematical. Source rank: #91. Votes: 282. Organization: alibaba. License: Apache 2.0.

75.6% percentile inside its fair comparison set

1,419Raw benchmark valueCI 1,388 - 1,451

Text Arena · Industry Medicine And Healthcare · No Style Control

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_medicine_and_healthcare leaderboard.

Rank #121

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,368
Percentile: 59.3%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: industry_medicine_and_healthcare. Source rank: #145. Votes: 212. Organization: alibaba. License: Apache 2.0.

59.3% percentile inside its fair comparison set

1,368Raw benchmark valueCI 1,329 - 1,406

Text Arena · Industry Software And It Services · No Style Control

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_software_and_it_services leaderboard.

Rank #136

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,367
Percentile: 58.5%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: industry_software_and_it_services. Source rank: #162. Votes: 843. Organization: alibaba. License: Apache 2.0.

58.5% percentile inside its fair comparison set

1,367Raw benchmark valueCI 1,348 - 1,386

Text Arena · Industry Writing And Literature And Language · No Style Control

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_writing_and_literature_and_language leaderboard.

Rank #158

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,298
Percentile: 51.5%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: industry_writing_and_literature_and_language. Source rank: #186. Votes: 1121. Organization: alibaba. License: Apache 2.0.

51.5% percentile inside its fair comparison set

1,298Raw benchmark valueCI 1,282 - 1,315

Search / tool use1 benchmark43.4%

Tau2-Bench Telecom

AA · Search / tool use · Objective

It matters when the model must browse, call tools, and recover useful answers from external systems.

Rank #176 · Source label: Qwen3 32B (Reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: 29.8%
Percentile: 43.4%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `tau2`.

43.4% percentile inside its fair comparison set

29.8%Raw benchmark value

Long context1 benchmark15.2%

Long Context Reasoning

AA · Long context · Objective

It checks whether long-context claims survive contact with retrieval, memory, or long-document tasks.

Rank #307 · Source label: Qwen3 32B (Reasoning)

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Artificial Analysis
Raw value: 0%
Percentile: 15.2%
Last updated: recent
Eligibility: headline eligible

Parsed from Artificial Analysis public leaderboard field `lcr`.

15.2% percentile inside its fair comparison set

0%Raw benchmark value

Multilingual6 benchmarks50.4%

Text Arena · Chinese

AR · Multilingual · Human

Observed user preference in Arena's Text Arena chinese leaderboard.

Rank #146

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,365
Percentile: 50.8%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: chinese. Source rank: #175. Votes: 187. Organization: alibaba. License: Apache 2.0.

50.8% percentile inside its fair comparison set

1,365Raw benchmark valueCI 1,326 - 1,404

Text Arena · German

AR · Multilingual · Human

Observed user preference in Arena's Text Arena german leaderboard.

Rank #124

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,333
Percentile: 48.1%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: german. Source rank: #150. Votes: 172. Organization: alibaba. License: Apache 2.0.

48.1% percentile inside its fair comparison set

1,333Raw benchmark valueCI 1,292 - 1,375

Text Arena · Russian

AR · Multilingual · Human

Observed user preference in Arena's Text Arena russian leaderboard.

Rank #152

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,328
Percentile: 47.8%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: russian. Source rank: #182. Votes: 412. Organization: alibaba. License: Apache 2.0.

47.8% percentile inside its fair comparison set

1,328Raw benchmark valueCI 1,303 - 1,354

Text Arena · Chinese · No Style Control

AR · Multilingual · Human

Observed user preference in Arena's Text Arena chinese leaderboard.

Rank #132

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,358
Percentile: 55.6%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: chinese. Source rank: #158. Votes: 187. Organization: alibaba. License: Apache 2.0.

55.6% percentile inside its fair comparison set

1,358Raw benchmark valueCI 1,320 - 1,395

Text Arena · German · No Style Control

AR · Multilingual · Human

Observed user preference in Arena's Text Arena german leaderboard.

Rank #116

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,342
Percentile: 51.5%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: german. Source rank: #139. Votes: 172. Organization: alibaba. License: Apache 2.0.

51.5% percentile inside its fair comparison set

1,342Raw benchmark valueCI 1,301 - 1,382

Text Arena · Russian · No Style Control

AR · Multilingual · Human

Observed user preference in Arena's Text Arena russian leaderboard.

Rank #149

verified runtimeexact alias

Raw row drilldownsource row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,311
Percentile: 48.8%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `qwen3-32b`. Category: russian. Source rank: #176. Votes: 412. Organization: alibaba. License: Apache 2.0.

48.8% percentile inside its fair comparison set

1,311Raw benchmark valueCI 1,286 - 1,336

Source links and registry checks

official

Qwen3 announcement

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Arena

Jun 20, 2026

source →

official

Artificial Analysis

Jun 20, 2026

source →

Qwen3 32B

Current snapshot.

Source-linked scores by benchmark

Source links and registry checks

Loading model evidence.

Qwen3 32B

Current snapshot.

Source-linked scores by benchmark

Source links and registry checks