Intelligence Index
AA · Chat / text · Combined
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #95 · Source label: Qwen3.5 35B A3B (Non-reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- 23
- Percentile
- 76.2%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `intelligenceIndex`.
76.2% percentile inside its fair comparison set23Raw benchmark value
AA-Omniscience accuracy
AA · Chat / text · Objective
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #179 · Source label: Qwen3.5 35B A3B (Non-reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- 15.7%
- Percentile
- 40.3%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `omniscienceAccuracy`.
40.3% percentile inside its fair comparison set15.7%Raw benchmark value
AA-Omniscience non-hallucination
AA · Chat / text · Objective
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #241 · Source label: Qwen3.5 35B A3B (Non-reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- 7.9%
- Percentile
- 19.5%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `omniscienceNonHallucination`.
19.5% percentile inside its fair comparison set7.9%Raw benchmark value
IFBench
AA · Chat / text · Objective
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #125 · Source label: Qwen3.5 35B A3B (Non-reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- 44.5%
- Percentile
- 60.6%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `ifbench`.
60.6% percentile inside its fair comparison set44.5%Raw benchmark value
Blended price
AA · Chat / text · Speed / cost
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #138 · Source label: Qwen3.5 35B A3B (Reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- $0.7 /1M tokens
- Percentile
- 50.4%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `price1mBlended0To3To1`.
50.4% percentile inside its fair comparison set$0.7 /1M tokensRaw benchmark value
Input price
AA · Chat / text · Speed / cost
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #115 · Source label: Qwen3.5 35B A3B (Reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- $0.3 /1M input tokens
- Percentile
- 59.8%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `price1mInputTokens`.
59.8% percentile inside its fair comparison set$0.3 /1M input tokensRaw benchmark value
Output price
AA · Chat / text · Speed / cost
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #155 · Source label: Qwen3.5 35B A3B (Reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- $2 /1M output tokens
- Percentile
- 45.3%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `price1mOutputTokens`.
45.3% percentile inside its fair comparison set$2 /1M output tokensRaw benchmark value
Output Speed
AA · Chat / text · Speed / cost
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #46 · Source label: Qwen3.5 35B A3B (Reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- 161.6 tokens/s
- Percentile
- 78.6%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `medianOutputTokensPerSecond`.
78.6% percentile inside its fair comparison set161.6 tokens/sRaw benchmark value
Time to first token
AA · Chat / text · Speed / cost
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #100 · Source label: Qwen3.5 35B A3B (Reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- 2.24s
- Percentile
- 52.9%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `medianTimeToFirstTokenSeconds`.
52.9% percentile inside its fair comparison set2.24sRaw benchmark value
Time to first answer token
AA · Chat / text · Speed / cost
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #112 · Source label: Qwen3.5 35B A3B (Reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- 14.62s
- Percentile
- 47.1%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `medianTimeToFirstAnswerTokenSeconds`.
47.1% percentile inside its fair comparison set14.62sRaw benchmark value
Openness Index
AA · Chat / text · Combined
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #114 · Source label: Qwen3.5 35B A3B (Reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- 39
- Percentile
- 54.8%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `opennessBreakdown.opennessIndex`.
54.8% percentile inside its fair comparison set39Raw benchmark value
Text Arena
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #102 · Source label: qwen3.5-35b-a3b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,396
- Percentile
- 68.9%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-35b-a3b`. Category: overall. Source rank: #124. Votes: 29248. Organization: alibaba. License: Apache 2.0.
68.9% percentile inside its fair comparison set1,396Raw benchmark valueCI 1,391 - 1,400
Text Arena · Creative Writing
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #119 · Source label: qwen3.5-35b-a3b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,344
- Percentile
- 63.5%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-35b-a3b`. Category: creative_writing. Source rank: #144. Votes: 4499. Organization: alibaba. License: Apache 2.0.
63.5% percentile inside its fair comparison set1,344Raw benchmark valueCI 1,334 - 1,353
Text Arena · English
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #100 · Source label: qwen3.5-35b-a3b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,412
- Percentile
- 69.5%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-35b-a3b`. Category: english. Source rank: #121. Votes: 13761. Organization: alibaba. License: Apache 2.0.
69.5% percentile inside its fair comparison set1,412Raw benchmark valueCI 1,406 - 1,417
Text Arena · Exclude Ties
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #102 · Source label: qwen3.5-35b-a3b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,376
- Percentile
- 68.9%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-35b-a3b`. Category: exclude_ties. Source rank: #124. Votes: 21373. Organization: alibaba. License: Apache 2.0.
68.9% percentile inside its fair comparison set1,376Raw benchmark valueCI 1,370 - 1,382
Text Arena · Hard Prompts
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #109 · Source label: qwen3.5-35b-a3b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,413
- Percentile
- 66.8%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-35b-a3b`. Category: hard_prompts. Source rank: #131. Votes: 18409. Organization: alibaba. License: Apache 2.0.
66.8% percentile inside its fair comparison set1,413Raw benchmark valueCI 1,408 - 1,419
Text Arena · Hard Prompts English
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #105 · Source label: qwen3.5-35b-a3b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,423
- Percentile
- 67.9%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-35b-a3b`. Category: hard_prompts_english. Source rank: #128. Votes: 9004. Organization: alibaba. License: Apache 2.0.
67.9% percentile inside its fair comparison set1,423Raw benchmark valueCI 1,417 - 1,430
Text Arena · Instruction Following
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #97 · Source label: qwen3.5-35b-a3b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,388
- Percentile
- 70.5%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-35b-a3b`. Category: instruction_following. Source rank: #120. Votes: 9340. Organization: alibaba. License: Apache 2.0.
70.5% percentile inside its fair comparison set1,388Raw benchmark valueCI 1,382 - 1,395
Text Arena · Longer Query
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #102 · Source label: qwen3.5-35b-a3b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,402
- Percentile
- 66.8%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-35b-a3b`. Category: longer_query. Source rank: #127. Votes: 11132. Organization: alibaba. License: Apache 2.0.
66.8% percentile inside its fair comparison set1,402Raw benchmark valueCI 1,396 - 1,409
Text Arena · Multi Turn
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #103 · Source label: qwen3.5-35b-a3b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,395
- Percentile
- 68.4%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-35b-a3b`. Category: multi_turn. Source rank: #126. Votes: 5203. Organization: alibaba. License: Apache 2.0.
68.4% percentile inside its fair comparison set1,395Raw benchmark valueCI 1,386 - 1,404
Text Arena · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #98 · Source label: qwen3.5-35b-a3b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,396
- Percentile
- 70.2%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-35b-a3b`. Category: overall. Source rank: #118. Votes: 29248. Organization: alibaba. License: Apache 2.0.
70.2% percentile inside its fair comparison set1,396Raw benchmark valueCI 1,391 - 1,400
Text Arena · Creative Writing · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #103 · Source label: qwen3.5-35b-a3b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,348
- Percentile
- 68.4%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-35b-a3b`. Category: creative_writing. Source rank: #127. Votes: 4499. Organization: alibaba. License: Apache 2.0.
68.4% percentile inside its fair comparison set1,348Raw benchmark valueCI 1,339 - 1,357
Text Arena · English · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #97 · Source label: qwen3.5-35b-a3b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,408
- Percentile
- 70.5%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-35b-a3b`. Category: english. Source rank: #116. Votes: 13761. Organization: alibaba. License: Apache 2.0.
70.5% percentile inside its fair comparison set1,408Raw benchmark valueCI 1,402 - 1,414
Text Arena · Exclude Ties · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #98 · Source label: qwen3.5-35b-a3b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,374
- Percentile
- 70.2%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-35b-a3b`. Category: exclude_ties. Source rank: #118. Votes: 21373. Organization: alibaba. License: Apache 2.0.
70.2% percentile inside its fair comparison set1,374Raw benchmark valueCI 1,369 - 1,380
Text Arena · Hard Prompts · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #99 · Source label: qwen3.5-35b-a3b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,401
- Percentile
- 69.8%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-35b-a3b`. Category: hard_prompts. Source rank: #120. Votes: 18409. Organization: alibaba. License: Apache 2.0.
69.8% percentile inside its fair comparison set1,401Raw benchmark valueCI 1,395 - 1,406
Text Arena · Hard Prompts English · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #95 · Source label: qwen3.5-35b-a3b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,409
- Percentile
- 71%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-35b-a3b`. Category: hard_prompts_english. Source rank: #115. Votes: 9004. Organization: alibaba. License: Apache 2.0.
71% percentile inside its fair comparison set1,409Raw benchmark valueCI 1,402 - 1,416
Text Arena · Instruction Following · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #93 · Source label: qwen3.5-35b-a3b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,380
- Percentile
- 71.7%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-35b-a3b`. Category: instruction_following. Source rank: #113. Votes: 9340. Organization: alibaba. License: Apache 2.0.
71.7% percentile inside its fair comparison set1,380Raw benchmark valueCI 1,373 - 1,387
Text Arena · Longer Query · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #95 · Source label: qwen3.5-35b-a3b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,392
- Percentile
- 69.1%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-35b-a3b`. Category: longer_query. Source rank: #117. Votes: 11132. Organization: alibaba. License: Apache 2.0.
69.1% percentile inside its fair comparison set1,392Raw benchmark valueCI 1,386 - 1,399
Text Arena · Multi Turn · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #100 · Source label: qwen3.5-35b-a3b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,390
- Percentile
- 69.3%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-35b-a3b`. Category: multi_turn. Source rank: #121. Votes: 5203. Organization: alibaba. License: Apache 2.0.
69.3% percentile inside its fair comparison set1,390Raw benchmark valueCI 1,382 - 1,399