Intelligence Index
AA · Chat / text · Combined
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #280 · Source label: Gemini 2.5 Flash-Lite (Non-reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- 7
- Percentile
- 29.4%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `intelligenceIndex`.
29.4% percentile inside its fair comparison set7Raw benchmark value
AA-Omniscience accuracy
AA · Chat / text · Objective
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #203 · Source label: Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- 14.3%
- Percentile
- 32.2%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `omniscienceAccuracy`.
32.2% percentile inside its fair comparison set14.3%Raw benchmark value
AA-Omniscience non-hallucination
AA · Chat / text · Objective
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #168 · Source label: Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- 13.2%
- Percentile
- 44%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `omniscienceNonHallucination`.
44% percentile inside its fair comparison set13.2%Raw benchmark value
IFBench
AA · Chat / text · Objective
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #249 · Source label: Gemini 2.5 Flash-Lite (Non-reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- 31.5%
- Percentile
- 21.3%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `ifbench`.
21.3% percentile inside its fair comparison set31.5%Raw benchmark value
Blended price
AA · Chat / text · Speed / cost
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #64 · Source label: Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- $0.2 /1M tokens
- Percentile
- 77.2%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `price1mBlended0To3To1`.
77.2% percentile inside its fair comparison set$0.2 /1M tokensRaw benchmark value
Input price
AA · Chat / text · Speed / cost
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #50 · Source label: Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- $0.1 /1M input tokens
- Percentile
- 83%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `price1mInputTokens`.
83% percentile inside its fair comparison set$0.1 /1M input tokensRaw benchmark value
Output price
AA · Chat / text · Speed / cost
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #70 · Source label: Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- $0.4 /1M output tokens
- Percentile
- 75.4%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `price1mOutputTokens`.
75.4% percentile inside its fair comparison set$0.4 /1M output tokensRaw benchmark value
Output Speed
AA · Chat / text · Speed / cost
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #18 · Source label: Gemini 2.5 Flash-Lite (Non-reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- 236 tokens/s
- Percentile
- 91.9%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `medianOutputTokensPerSecond`.
91.9% percentile inside its fair comparison set236 tokens/sRaw benchmark value
Time to first token
AA · Chat / text · Speed / cost
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #201 · Source label: Gemini 2.5 Flash-Lite (Reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- 38.85s
- Percentile
- 4.8%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `medianTimeToFirstTokenSeconds`.
4.8% percentile inside its fair comparison set38.85sRaw benchmark value
Time to first answer token
AA · Chat / text · Speed / cost
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #172 · Source label: Gemini 2.5 Flash-Lite (Reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- 38.85s
- Percentile
- 18.6%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `medianTimeToFirstAnswerTokenSeconds`.
18.6% percentile inside its fair comparison set38.85sRaw benchmark value
Openness Index
AA · Chat / text · Combined
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #174 · Source label: Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning)
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Artificial Analysis
- Raw value
- 6
- Percentile
- 7.5%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Artificial Analysis public leaderboard field `opennessBreakdown.opennessIndex`.
7.5% percentile inside its fair comparison set6Raw benchmark value
Text Arena
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #119 · Source label: gemini-2.5-flash-lite-preview-09-2025-no-thinking
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,380
- Percentile
- 63.7%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `gemini-2.5-flash-lite-preview-09-2025-no-thinking`. Category: overall. Source rank: #144. Votes: 47228. Organization: google. License: Proprietary.
63.7% percentile inside its fair comparison set1,380Raw benchmark valueCI 1,376 - 1,383
Text Arena · Creative Writing
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #92 · Source label: gemini-2.5-flash-lite-preview-06-17-thinking
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,371
- Percentile
- 71.8%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `gemini-2.5-flash-lite-preview-06-17-thinking`. Category: creative_writing. Source rank: #115. Votes: 4187. Organization: google. License: Proprietary.
71.8% percentile inside its fair comparison set1,371Raw benchmark valueCI 1,362 - 1,380
Text Arena · English
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #126 · Source label: gemini-2.5-flash-lite-preview-09-2025-no-thinking
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,385
- Percentile
- 61.5%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `gemini-2.5-flash-lite-preview-09-2025-no-thinking`. Category: english. Source rank: #151. Votes: 22616. Organization: google. License: Proprietary.
61.5% percentile inside its fair comparison set1,385Raw benchmark valueCI 1,380 - 1,389
Text Arena · Exclude Ties
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #120 · Source label: gemini-2.5-flash-lite-preview-09-2025-no-thinking
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,351
- Percentile
- 63.4%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `gemini-2.5-flash-lite-preview-09-2025-no-thinking`. Category: exclude_ties. Source rank: #145. Votes: 33061. Organization: google. License: Proprietary.
63.4% percentile inside its fair comparison set1,351Raw benchmark valueCI 1,346 - 1,356
Text Arena · Hard Prompts
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #125 · Source label: gemini-2.5-flash-lite-preview-09-2025-no-thinking
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,390
- Percentile
- 61.8%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `gemini-2.5-flash-lite-preview-09-2025-no-thinking`. Category: hard_prompts. Source rank: #151. Votes: 25066. Organization: google. License: Proprietary.
61.8% percentile inside its fair comparison set1,390Raw benchmark valueCI 1,386 - 1,395
Text Arena · Hard Prompts English
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #132 · Source label: gemini-2.5-flash-lite-preview-09-2025-no-thinking
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,396
- Percentile
- 59.6%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `gemini-2.5-flash-lite-preview-09-2025-no-thinking`. Category: hard_prompts_english. Source rank: #158. Votes: 12545. Organization: google. License: Proprietary.
59.6% percentile inside its fair comparison set1,396Raw benchmark valueCI 1,391 - 1,402
Text Arena · Instruction Following
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #117 · Source label: gemini-2.5-flash-lite-preview-06-17-thinking
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,369
- Percentile
- 64.3%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `gemini-2.5-flash-lite-preview-06-17-thinking`. Category: instruction_following. Source rank: #143. Votes: 8103. Organization: google. License: Proprietary.
64.3% percentile inside its fair comparison set1,369Raw benchmark valueCI 1,362 - 1,376
Text Arena · Longer Query
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #115 · Source label: gemini-2.5-flash-lite-preview-09-2025-no-thinking
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,386
- Percentile
- 62.5%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `gemini-2.5-flash-lite-preview-09-2025-no-thinking`. Category: longer_query. Source rank: #140. Votes: 11474. Organization: google. License: Proprietary.
62.5% percentile inside its fair comparison set1,386Raw benchmark valueCI 1,380 - 1,392
Text Arena · Multi Turn
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #122 · Source label: gemini-2.5-flash-lite-preview-09-2025-no-thinking
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,373
- Percentile
- 62.5%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `gemini-2.5-flash-lite-preview-09-2025-no-thinking`. Category: multi_turn. Source rank: #147. Votes: 8193. Organization: google. License: Proprietary.
62.5% percentile inside its fair comparison set1,373Raw benchmark valueCI 1,366 - 1,381
Text Arena · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #107 · Source label: gemini-2.5-flash-lite-preview-09-2025-no-thinking
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,379
- Percentile
- 67.4%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `gemini-2.5-flash-lite-preview-09-2025-no-thinking`. Category: overall. Source rank: #129. Votes: 47228. Organization: google. License: Proprietary.
67.4% percentile inside its fair comparison set1,379Raw benchmark valueCI 1,376 - 1,382
Text Arena · Creative Writing · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #88 · Source label: gemini-2.5-flash-lite-preview-06-17-thinking
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,366
- Percentile
- 73.1%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `gemini-2.5-flash-lite-preview-06-17-thinking`. Category: creative_writing. Source rank: #108. Votes: 4187. Organization: google. License: Proprietary.
73.1% percentile inside its fair comparison set1,366Raw benchmark valueCI 1,356 - 1,375
Text Arena · English · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #121 · Source label: gemini-2.5-flash-lite-preview-09-2025-no-thinking
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,382
- Percentile
- 63.1%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `gemini-2.5-flash-lite-preview-09-2025-no-thinking`. Category: english. Source rank: #143. Votes: 22616. Organization: google. License: Proprietary.
63.1% percentile inside its fair comparison set1,382Raw benchmark valueCI 1,377 - 1,386
Text Arena · Exclude Ties · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #107 · Source label: gemini-2.5-flash-lite-preview-09-2025-no-thinking
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,349
- Percentile
- 67.4%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `gemini-2.5-flash-lite-preview-09-2025-no-thinking`. Category: exclude_ties. Source rank: #129. Votes: 33061. Organization: google. License: Proprietary.
67.4% percentile inside its fair comparison set1,349Raw benchmark valueCI 1,344 - 1,354
Text Arena · Hard Prompts · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #110 · Source label: gemini-2.5-flash-lite-preview-09-2025-no-thinking
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,377
- Percentile
- 66.5%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `gemini-2.5-flash-lite-preview-09-2025-no-thinking`. Category: hard_prompts. Source rank: #133. Votes: 25066. Organization: google. License: Proprietary.
66.5% percentile inside its fair comparison set1,377Raw benchmark valueCI 1,372 - 1,381
Text Arena · Hard Prompts English · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #117 · Source label: gemini-2.5-flash-lite-preview-09-2025-no-thinking
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,381
- Percentile
- 64.2%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `gemini-2.5-flash-lite-preview-09-2025-no-thinking`. Category: hard_prompts_english. Source rank: #141. Votes: 12545. Organization: google. License: Proprietary.
64.2% percentile inside its fair comparison set1,381Raw benchmark valueCI 1,375 - 1,387
Text Arena · Instruction Following · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #104 · Source label: gemini-2.5-flash-lite-preview-06-17-thinking
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,367
- Percentile
- 68.3%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `gemini-2.5-flash-lite-preview-06-17-thinking`. Category: instruction_following. Source rank: #127. Votes: 8103. Organization: google. License: Proprietary.
68.3% percentile inside its fair comparison set1,367Raw benchmark valueCI 1,360 - 1,374
Text Arena · Longer Query · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #107 · Source label: gemini-2.5-flash-lite-preview-06-17-thinking
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,373
- Percentile
- 65.1%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `gemini-2.5-flash-lite-preview-06-17-thinking`. Category: longer_query. Source rank: #131. Votes: 6298. Organization: google. License: Proprietary.
65.1% percentile inside its fair comparison set1,373Raw benchmark valueCI 1,366 - 1,381
Text Arena · Multi Turn · No Style Control
AR · Chat / text · Human
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Rank #118 · Source label: gemini-2.5-flash-lite-preview-09-2025-no-thinking
verified runtimeexact alias
Raw row drilldownsource row, percentile, last updated, eligibility
- Source
- Arena
- Raw value
- 1,366
- Percentile
- 63.8%
- Last updated
- recent
- Eligibility
- headline eligible
Parsed from Arena leaderboard dataset row `gemini-2.5-flash-lite-preview-09-2025-no-thinking`. Category: multi_turn. Source rank: #141. Votes: 8193. Organization: google. License: Proprietary.
63.8% percentile inside its fair comparison set1,366Raw benchmark valueCI 1,359 - 1,373