Intelligence Index
AA · Chat / text · Combined
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #57 · Source label: Qwen3.5 27B (Non-reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- 29
- Percentile
- 85.8%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `intelligenceIndex`.
85.8% percentile inside its fair comparison set29Raw benchmark value
AA-Omniscience accuracy
AA · Chat / text · Objective
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #176 · Source label: Qwen3.5 27B (Non-reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- 15.7%
- Percentile
- 41.3%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `omniscienceAccuracy`.
41.3% percentile inside its fair comparison set15.7%Raw benchmark value
AA-Omniscience non-hallucination
AA · Chat / text · Objective
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #110 · Source label: Qwen3.5 27B (Reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- 20.3%
- Percentile
- 63.4%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `omniscienceNonHallucination`.
63.4% percentile inside its fair comparison set20.3%Raw benchmark value
IFBench
AA · Chat / text · Objective
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #108 · Source label: Qwen3.5 27B (Non-reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- 46.9%
- Percentile
- 66%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `ifbench`.
66% percentile inside its fair comparison set46.9%Raw benchmark value
Blended price
AA · Chat / text · Speed / cost
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #166 · Source label: Qwen3.5 27B (Non-reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- $0.9 /1M tokens
- Percentile
- 40.2%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `price1mBlended0To3To1`.
40.2% percentile inside its fair comparison set$0.9 /1M tokensRaw benchmark value
Input price
AA · Chat / text · Speed / cost
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #141 · Source label: Qwen3.5 27B (Non-reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- $0.3 /1M input tokens
- Percentile
- 56.5%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `price1mInputTokens`.
56.5% percentile inside its fair comparison set$0.3 /1M input tokensRaw benchmark value
Output price
AA · Chat / text · Speed / cost
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #180 · Source label: Qwen3.5 27B (Non-reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- $2.6 /1M output tokens
- Percentile
- 35.1%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `price1mOutputTokens`.
35.1% percentile inside its fair comparison set$2.6 /1M output tokensRaw benchmark value
Output Speed
AA · Chat / text · Speed / cost
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #123 · Source label: Qwen3.5 27B (Reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- 79.8 tokens/s
- Percentile
- 41.9%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `medianOutputTokensPerSecond`.
41.9% percentile inside its fair comparison set79.8 tokens/sRaw benchmark value
Time to first token
AA · Chat / text · Speed / cost
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #173 · Source label: Qwen3.5 27B (Non-reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- 5.85s
- Percentile
- 18.1%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `medianTimeToFirstTokenSeconds`.
18.1% percentile inside its fair comparison set5.85sRaw benchmark value
Time to first answer token
AA · Chat / text · Speed / cost
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #159 · Source label: Qwen3.5 27B (Reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- 30.87s
- Percentile
- 24.8%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `medianTimeToFirstAnswerTokenSeconds`.
24.8% percentile inside its fair comparison set30.87sRaw benchmark value
Openness Index
AA · Chat / text · Combined
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #121 · Source label: Qwen3.5 27B (Non-reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- 39
- Percentile
- 54.8%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `opennessBreakdown.opennessIndex`.
54.8% percentile inside its fair comparison set39Raw benchmark value
Text Arena
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #91 · Source label: qwen3.5-27b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,409
- Percentile
- 72.3%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-27b`. Category: overall. Source rank: #111. Votes: 27421. Organization: alibaba. License: Apache 2.0.
72.3% percentile inside its fair comparison set1,409Raw benchmark valueCI 1,404 - 1,413
Text Arena · Creative Writing
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #103 · Source label: qwen3.5-27b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,360
- Percentile
- 68.4%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-27b`. Category: creative_writing. Source rank: #128. Votes: 4099. Organization: alibaba. License: Apache 2.0.
68.4% percentile inside its fair comparison set1,360Raw benchmark valueCI 1,350 - 1,370
Text Arena · English
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #80 · Source label: qwen3.5-27b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,426
- Percentile
- 75.7%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-27b`. Category: english. Source rank: #99. Votes: 12928. Organization: alibaba. License: Apache 2.0.
75.7% percentile inside its fair comparison set1,426Raw benchmark valueCI 1,421 - 1,432
Text Arena · Exclude Ties
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #91 · Source label: qwen3.5-27b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,392
- Percentile
- 72.3%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-27b`. Category: exclude_ties. Source rank: #111. Votes: 20054. Organization: alibaba. License: Apache 2.0.
72.3% percentile inside its fair comparison set1,392Raw benchmark valueCI 1,386 - 1,398
Text Arena · Hard Prompts
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #88 · Source label: qwen3.5-27b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,427
- Percentile
- 73.2%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-27b`. Category: hard_prompts. Source rank: #108. Votes: 17115. Organization: alibaba. License: Apache 2.0.
73.2% percentile inside its fair comparison set1,427Raw benchmark valueCI 1,422 - 1,433
Text Arena · Hard Prompts English
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #77 · Source label: qwen3.5-27b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,441
- Percentile
- 76.5%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-27b`. Category: hard_prompts_english. Source rank: #96. Votes: 8482. Organization: alibaba. License: Apache 2.0.
76.5% percentile inside its fair comparison set1,441Raw benchmark valueCI 1,434 - 1,448
Text Arena · Instruction Following
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #86 · Source label: qwen3.5-27b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,401
- Percentile
- 73.8%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-27b`. Category: instruction_following. Source rank: #106. Votes: 8631. Organization: alibaba. License: Apache 2.0.
73.8% percentile inside its fair comparison set1,401Raw benchmark valueCI 1,394 - 1,408
Text Arena · Longer Query
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #78 · Source label: qwen3.5-27b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,423
- Percentile
- 74.7%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-27b`. Category: longer_query. Source rank: #98. Votes: 10267. Organization: alibaba. License: Apache 2.0.
74.7% percentile inside its fair comparison set1,423Raw benchmark valueCI 1,416 - 1,429
Text Arena · Multi Turn
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #79 · Source label: qwen3.5-27b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,416
- Percentile
- 75.9%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-27b`. Category: multi_turn. Source rank: #98. Votes: 4595. Organization: alibaba. License: Apache 2.0.
75.9% percentile inside its fair comparison set1,416Raw benchmark valueCI 1,406 - 1,425
Text Arena · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #86 · Source label: qwen3.5-27b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,409
- Percentile
- 73.8%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-27b`. Category: overall. Source rank: #103. Votes: 27421. Organization: alibaba. License: Apache 2.0.
73.8% percentile inside its fair comparison set1,409Raw benchmark valueCI 1,404 - 1,413
Text Arena · Creative Writing · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #90 · Source label: qwen3.5-27b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,364
- Percentile
- 72.4%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-27b`. Category: creative_writing. Source rank: #111. Votes: 4099. Organization: alibaba. License: Apache 2.0.
72.4% percentile inside its fair comparison set1,364Raw benchmark valueCI 1,354 - 1,374
Text Arena · English · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #77 · Source label: qwen3.5-27b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,423
- Percentile
- 76.6%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-27b`. Category: english. Source rank: #91. Votes: 12928. Organization: alibaba. License: Apache 2.0.
76.6% percentile inside its fair comparison set1,423Raw benchmark valueCI 1,417 - 1,429
Text Arena · Exclude Ties · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #87 · Source label: qwen3.5-27b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,391
- Percentile
- 73.5%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-27b`. Category: exclude_ties. Source rank: #104. Votes: 20054. Organization: alibaba. License: Apache 2.0.
73.5% percentile inside its fair comparison set1,391Raw benchmark valueCI 1,385 - 1,397
Text Arena · Hard Prompts · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #86 · Source label: qwen3.5-27b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,414
- Percentile
- 73.8%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-27b`. Category: hard_prompts. Source rank: #103. Votes: 17115. Organization: alibaba. License: Apache 2.0.
73.8% percentile inside its fair comparison set1,414Raw benchmark valueCI 1,409 - 1,419
Text Arena · Hard Prompts English · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #75 · Source label: qwen3.5-27b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,425
- Percentile
- 77.2%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-27b`. Category: hard_prompts_english. Source rank: #91. Votes: 8482. Organization: alibaba. License: Apache 2.0.
77.2% percentile inside its fair comparison set1,425Raw benchmark valueCI 1,418 - 1,432
Text Arena · Instruction Following · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #81 · Source label: qwen3.5-27b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,392
- Percentile
- 75.4%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-27b`. Category: instruction_following. Source rank: #99. Votes: 8631. Organization: alibaba. License: Apache 2.0.
75.4% percentile inside its fair comparison set1,392Raw benchmark valueCI 1,385 - 1,399
Text Arena · Longer Query · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #68 · Source label: qwen3.5-27b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,413
- Percentile
- 78%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-27b`. Category: longer_query. Source rank: #81. Votes: 10267. Organization: alibaba. License: Apache 2.0.
78% percentile inside its fair comparison set1,413Raw benchmark valueCI 1,407 - 1,420
Text Arena · Multi Turn · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #79 · Source label: qwen3.5-27b
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,411
- Percentile
- 75.9%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `qwen3.5-27b`. Category: multi_turn. Source rank: #96. Votes: 4595. Organization: alibaba. License: Apache 2.0.
75.9% percentile inside its fair comparison set1,411Raw benchmark valueCI 1,402 - 1,421