Intelligence Index
AA · Chat / text · Combined
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #64 · Source label: Qwen3.5 122B A10B (Non-reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- 28
- Percentile
- 84.1%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `intelligenceIndex`.
84.1% percentile inside its fair comparison set28Raw benchmark value
AA-Omniscience accuracy
AA · Chat / text · Objective
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #128 · Source label: Qwen3.5 122B A10B (Non-reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- 18.6%
- Percentile
- 57.4%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `omniscienceAccuracy`.
57.4% percentile inside its fair comparison set18.6%Raw benchmark value
AA-Omniscience non-hallucination
AA · Chat / text · Objective
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #199 · Source label: Qwen3.5 122B A10B (Non-reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- 10.9%
- Percentile
- 33.6%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `omniscienceNonHallucination`.
33.6% percentile inside its fair comparison set10.9%Raw benchmark value
IFBench
AA · Chat / text · Objective
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #90 · Source label: Qwen3.5 122B A10B (Non-reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- 50.8%
- Percentile
- 71.7%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `ifbench`.
71.7% percentile inside its fair comparison set50.8%Raw benchmark value
Blended price
AA · Chat / text · Speed / cost
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #176 · Source label: Qwen3.5 122B A10B (Non-reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- $1.1 /1M tokens
- Percentile
- 36.6%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `price1mBlended0To3To1`.
36.6% percentile inside its fair comparison set$1.1 /1M tokensRaw benchmark value
Input price
AA · Chat / text · Speed / cost
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #148 · Source label: Qwen3.5 122B A10B (Non-reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- $0.4 /1M input tokens
- Percentile
- 47.5%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `price1mInputTokens`.
47.5% percentile inside its fair comparison set$0.4 /1M input tokensRaw benchmark value
Output price
AA · Chat / text · Speed / cost
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #191 · Source label: Qwen3.5 122B A10B (Non-reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- $3.2 /1M output tokens
- Percentile
- 31.2%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `price1mOutputTokens`.
31.2% percentile inside its fair comparison set$3.2 /1M output tokensRaw benchmark value
Output Speed
AA · Chat / text · Speed / cost
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #60 · Source label: Qwen3.5 122B A10B (Reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- 140.4 tokens/s
- Percentile
- 71.9%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `medianOutputTokensPerSecond`.
71.9% percentile inside its fair comparison set140.4 tokens/sRaw benchmark value
Time to first token
AA · Chat / text · Speed / cost
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #127 · Source label: Qwen3.5 122B A10B (Non-reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- 2.59s
- Percentile
- 40%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `medianTimeToFirstTokenSeconds`.
40% percentile inside its fair comparison set2.59sRaw benchmark value
Time to first answer token
AA · Chat / text · Speed / cost
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #118 · Source label: Qwen3.5 122B A10B (Reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- 16.63s
- Percentile
- 44.3%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `medianTimeToFirstAnswerTokenSeconds`.
44.3% percentile inside its fair comparison set16.63sRaw benchmark value
Openness Index
AA · Chat / text · Combined
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #113 · Source label: Qwen3.5 122B A10B (Non-reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- 39
- Percentile
- 54.8%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `opennessBreakdown.opennessIndex`.
54.8% percentile inside its fair comparison set39Raw benchmark value
Text Arena
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #76 · Source label: qwen3.5-122b-a10b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,417
- Percentile
- 76.9%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-122b-a10b`. Category: overall. Source rank: #94. Votes: 28575. Organization: alibaba. License: Apache 2.0.
76.9% percentile inside its fair comparison set1,417Raw benchmark valueCI 1,413 - 1,422
Text Arena · Creative Writing
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #96 · Source label: qwen3.5-122b-a10b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,367
- Percentile
- 70.6%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-122b-a10b`. Category: creative_writing. Source rank: #119. Votes: 4361. Organization: alibaba. License: Apache 2.0.
70.6% percentile inside its fair comparison set1,367Raw benchmark valueCI 1,357 - 1,376
Text Arena · English
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #73 · Source label: qwen3.5-122b-a10b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,431
- Percentile
- 77.8%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-122b-a10b`. Category: english. Source rank: #91. Votes: 13479. Organization: alibaba. License: Apache 2.0.
77.8% percentile inside its fair comparison set1,431Raw benchmark valueCI 1,425 - 1,437
Text Arena · Exclude Ties
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #75 · Source label: qwen3.5-122b-a10b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,404
- Percentile
- 77.2%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-122b-a10b`. Category: exclude_ties. Source rank: #94. Votes: 20913. Organization: alibaba. License: Apache 2.0.
77.2% percentile inside its fair comparison set1,404Raw benchmark valueCI 1,398 - 1,410
Text Arena · Hard Prompts
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #80 · Source label: qwen3.5-122b-a10b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,433
- Percentile
- 75.7%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-122b-a10b`. Category: hard_prompts. Source rank: #100. Votes: 17878. Organization: alibaba. License: Apache 2.0.
75.7% percentile inside its fair comparison set1,433Raw benchmark valueCI 1,427 - 1,438
Text Arena · Hard Prompts English
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #76 · Source label: qwen3.5-122b-a10b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,444
- Percentile
- 76.9%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-122b-a10b`. Category: hard_prompts_english. Source rank: #94. Votes: 8868. Organization: alibaba. License: Apache 2.0.
76.9% percentile inside its fair comparison set1,444Raw benchmark valueCI 1,437 - 1,451
Text Arena · Instruction Following
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #75 · Source label: qwen3.5-122b-a10b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,407
- Percentile
- 77.2%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-122b-a10b`. Category: instruction_following. Source rank: #94. Votes: 9045. Organization: alibaba. License: Apache 2.0.
77.2% percentile inside its fair comparison set1,407Raw benchmark valueCI 1,400 - 1,413
Text Arena · Longer Query
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #85 · Source label: qwen3.5-122b-a10b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,418
- Percentile
- 72.4%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-122b-a10b`. Category: longer_query. Source rank: #106. Votes: 10944. Organization: alibaba. License: Apache 2.0.
72.4% percentile inside its fair comparison set1,418Raw benchmark valueCI 1,411 - 1,424
Text Arena · Multi Turn
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #78 · Source label: qwen3.5-122b-a10b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,417
- Percentile
- 76.2%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-122b-a10b`. Category: multi_turn. Source rank: #97. Votes: 4953. Organization: alibaba. License: Apache 2.0.
76.2% percentile inside its fair comparison set1,417Raw benchmark valueCI 1,408 - 1,426
Text Arena · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #73 · Source label: qwen3.5-122b-a10b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,418
- Percentile
- 77.8%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-122b-a10b`. Category: overall. Source rank: #86. Votes: 28575. Organization: alibaba. License: Apache 2.0.
77.8% percentile inside its fair comparison set1,418Raw benchmark valueCI 1,413 - 1,422
Text Arena · Creative Writing · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #83 · Source label: qwen3.5-122b-a10b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,369
- Percentile
- 74.6%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-122b-a10b`. Category: creative_writing. Source rank: #103. Votes: 4361. Organization: alibaba. License: Apache 2.0.
74.6% percentile inside its fair comparison set1,369Raw benchmark valueCI 1,359 - 1,378
Text Arena · English · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #71 · Source label: qwen3.5-122b-a10b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,429
- Percentile
- 78.5%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-122b-a10b`. Category: english. Source rank: #83. Votes: 13479. Organization: alibaba. License: Apache 2.0.
78.5% percentile inside its fair comparison set1,429Raw benchmark valueCI 1,423 - 1,435
Text Arena · Exclude Ties · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #75 · Source label: qwen3.5-122b-a10b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,404
- Percentile
- 77.2%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-122b-a10b`. Category: exclude_ties. Source rank: #89. Votes: 20913. Organization: alibaba. License: Apache 2.0.
77.2% percentile inside its fair comparison set1,404Raw benchmark valueCI 1,398 - 1,410
Text Arena · Hard Prompts · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #74 · Source label: qwen3.5-122b-a10b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,422
- Percentile
- 77.5%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-122b-a10b`. Category: hard_prompts. Source rank: #91. Votes: 17878. Organization: alibaba. License: Apache 2.0.
77.5% percentile inside its fair comparison set1,422Raw benchmark valueCI 1,416 - 1,427
Text Arena · Hard Prompts English · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #71 · Source label: qwen3.5-122b-a10b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,431
- Percentile
- 78.4%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-122b-a10b`. Category: hard_prompts_english. Source rank: #86. Votes: 8868. Organization: alibaba. License: Apache 2.0.
78.4% percentile inside its fair comparison set1,431Raw benchmark valueCI 1,424 - 1,438
Text Arena · Instruction Following · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #73 · Source label: qwen3.5-122b-a10b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,399
- Percentile
- 77.8%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-122b-a10b`. Category: instruction_following. Source rank: #88. Votes: 9045. Organization: alibaba. License: Apache 2.0.
77.8% percentile inside its fair comparison set1,399Raw benchmark valueCI 1,393 - 1,406
Text Arena · Longer Query · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #74 · Source label: qwen3.5-122b-a10b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,410
- Percentile
- 76%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-122b-a10b`. Category: longer_query. Source rank: #90. Votes: 10944. Organization: alibaba. License: Apache 2.0.
76% percentile inside its fair comparison set1,410Raw benchmark valueCI 1,403 - 1,416
Text Arena · Multi Turn · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #76 · Source label: qwen3.5-122b-a10b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,414
- Percentile
- 76.8%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-122b-a10b`. Category: multi_turn. Source rank: #92. Votes: 4953. Organization: alibaba. License: Apache 2.0.
76.8% percentile inside its fair comparison set1,414Raw benchmark valueCI 1,405 - 1,423