Vision Arena
AR · Vision understanding · Human
It is useful when the model must read charts, UI, screenshots, or visual scenes rather than text alone.
Rank #61
verified runtime · exact alias · Background only
Raw row drilldown (source row, percentile, last updated, eligibility)
- Source: Arena
- Raw value: 1,159
- Percentile: 45%
- Last updated: recent
- Eligibility: benchmark_derived_model
Parsed from Arena leaderboard dataset row `gemma-3-27b-it`. Category: overall. Source rank: #78. Votes: 17406. Organization: google. License: Gemma.
45% percentile inside its fair comparison set · Raw benchmark value: 1,159 · CI 1,151 - 1,168
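Each card carries a provenance sentence of the form "Parsed from Arena leaderboard dataset row ...", followed by `Key: value.` pairs. A minimal sketch of how such a sentence could be split into fields is shown here; the function name `parse_provenance` and the output key names are hypothetical and not part of the source page, and the pattern assumes values contain no periods.

```python
import re

def parse_provenance(line: str) -> dict:
    """Split a 'Parsed from ...' provenance sentence into named fields.

    Hypothetical helper: assumes the 'Key: value.' layout seen on this page.
    """
    # The dataset row name is wrapped in backticks.
    row = re.search(r"dataset row `([^`]+)`", line)
    fields = {"row": row.group(1) if row else None}

    # Remaining fields appear as "Key: value." pairs (values without periods).
    for key, value in re.findall(r"([A-Z][A-Za-z ]*?): ([^.]+)\.", line):
        fields[key.strip().lower().replace(" ", "_")] = value.strip()

    # Convert the numeric fields, dropping the leading '#' on ranks.
    for key in ("source_rank", "votes"):
        if key in fields:
            fields[key] = int(fields[key].lstrip("#").replace(",", ""))
    return fields

example = (
    "Parsed from Arena leaderboard dataset row `gemma-3-27b-it`. "
    "Category: overall. Source rank: #78. Votes: 17406. "
    "Organization: google. License: Gemma."
)
print(parse_provenance(example))
```

The same helper should apply to the other cards, since they share this sentence layout and differ only in category, source rank, and vote count.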
Vision Arena · Captioning
AR · Vision understanding · Human
It is useful when the model must read charts, UI, screenshots, or visual scenes rather than text alone.
Rank #25
verified runtime · exact alias · Background only
Raw row drilldown (source row, percentile, last updated, eligibility)
- Source: Arena
- Raw value: 1,110
- Percentile: 7.7%
- Last updated: recent
- Eligibility: benchmark_derived_model
Parsed from Arena leaderboard dataset row `gemma-3-27b-it`. Category: captioning. Source rank: #27. Votes: 273. Organization: google. License: Gemma.
7.7% percentile inside its fair comparison set · Raw benchmark value: 1,110 · CI 1,073 - 1,147
Vision Arena · Creative Writing Vision
AR · Vision understanding · Human
It is useful when the model must read charts, UI, screenshots, or visual scenes rather than text alone.
Rank #48
verified runtime · exact alias · Background only
Raw row drilldown (source row, percentile, last updated, eligibility)
- Source: Arena
- Raw value: 1,176
- Percentile: 14.5%
- Last updated: recent
- Eligibility: benchmark_derived_model
Parsed from Arena leaderboard dataset row `gemma-3-27b-it`. Category: creative_writing_vision. Source rank: #60. Votes: 630. Organization: google. License: Gemma.
14.5% percentile inside its fair comparison set · Raw benchmark value: 1,176 · CI 1,151 - 1,201
Vision Arena · Diagram
AR · Vision understanding · Human
It is useful when the model must read charts, UI, screenshots, or visual scenes rather than text alone.
Rank #61
verified runtime · exact alias · Background only
Raw row drilldown (source row, percentile, last updated, eligibility)
- Source: Arena
- Raw value: 1,165
- Percentile: 14.3%
- Last updated: recent
- Eligibility: benchmark_derived_model
Parsed from Arena leaderboard dataset row `gemma-3-27b-it`. Category: diagram. Source rank: #77. Votes: 1758. Organization: google. License: Gemma.
14.3% percentile inside its fair comparison set · Raw benchmark value: 1,165 · CI 1,150 - 1,180
Vision Arena · English
AR · Vision understanding · Human
It is useful when the model must read charts, UI, screenshots, or visual scenes rather than text alone.
Rank #62
verified runtime · exact alias · Background only
Raw row drilldown (source row, percentile, last updated, eligibility)
- Source: Arena
- Raw value: 1,170
- Percentile: 44%
- Last updated: recent
- Eligibility: benchmark_derived_model
Parsed from Arena leaderboard dataset row `gemma-3-27b-it`. Category: english. Source rank: #77. Votes: 8371. Organization: google. License: Gemma.
44% percentile inside its fair comparison set · Raw benchmark value: 1,170 · CI 1,160 - 1,181
Vision Arena · Entity Recognition
AR · Vision understanding · Human
It is useful when the model must read charts, UI, screenshots, or visual scenes rather than text alone.
Rank #27
verified runtime · exact alias · Background only
Raw row drilldown (source row, percentile, last updated, eligibility)
- Source: Arena
- Raw value: 1,174
- Percentile: 18.8%
- Last updated: recent
- Eligibility: benchmark_derived_model
Parsed from Arena leaderboard dataset row `gemma-3-27b-it`. Category: entity_recognition. Source rank: #30. Votes: 326. Organization: google. License: Gemma.
18.8% percentile inside its fair comparison set · Raw benchmark value: 1,174 · CI 1,142 - 1,207
Vision Arena · Homework
AR · Vision understanding · Human
It is useful when the model must read charts, UI, screenshots, or visual scenes rather than text alone.
Rank #57
verified runtime · exact alias · Background only
Raw row drilldown (source row, percentile, last updated, eligibility)
- Source: Arena
- Raw value: 1,169
- Percentile: 17.6%
- Last updated: recent
- Eligibility: benchmark_derived_model
Parsed from Arena leaderboard dataset row `gemma-3-27b-it`. Category: homework. Source rank: #73. Votes: 1067. Organization: google. License: Gemma.
17.6% percentile inside its fair comparison set · Raw benchmark value: 1,169 · CI 1,151 - 1,188
Vision Arena · Humor
AR · Vision understanding · Human
It is useful when the model must read charts, UI, screenshots, or visual scenes rather than text alone.
Rank #42
verified runtime · exact alias · Background only
Raw row drilldown (source row, percentile, last updated, eligibility)
- Source: Arena
- Raw value: 1,149
- Percentile: 16.3%
- Last updated: recent
- Eligibility: benchmark_derived_model
Parsed from Arena leaderboard dataset row `gemma-3-27b-it`. Category: humor. Source rank: #54. Votes: 780. Organization: google. License: Gemma.
16.3% percentile inside its fair comparison set · Raw benchmark value: 1,149 · CI 1,126 - 1,173
Vision Arena · OCR
AR · Vision understanding · Human
It is useful when the model must read charts, UI, screenshots, or visual scenes rather than text alone.
Rank #59
verified runtime · exact alias · Background only
Raw row drilldown (source row, percentile, last updated, eligibility)
- Source: Arena
- Raw value: 1,162
- Percentile: 17.1%
- Last updated: recent
- Eligibility: benchmark_derived_model
Parsed from Arena leaderboard dataset row `gemma-3-27b-it`. Category: ocr. Source rank: #75. Votes: 6337. Organization: google. License: Gemma.
17.1% percentile inside its fair comparison set · Raw benchmark value: 1,162 · CI 1,153 - 1,172
Vision Arena · No Style Control
AR · Vision understanding · Human
It is useful when the model must read charts, UI, screenshots, or visual scenes rather than text alone.
Rank #62
verified runtime · exact alias · Background only
Raw row drilldown (source row, percentile, last updated, eligibility)
- Source: Arena
- Raw value: 1,166
- Percentile: 44%
- Last updated: recent
- Eligibility: benchmark_derived_model
Parsed from Arena leaderboard dataset row `gemma-3-27b-it`. Category: overall. Source rank: #77. Votes: 17406. Organization: google. License: Gemma.
44% percentile inside its fair comparison set · Raw benchmark value: 1,166 · CI 1,158 - 1,174
Vision Arena · Captioning · No Style Control
AR · Vision understanding · Human
It is useful when the model must read charts, UI, screenshots, or visual scenes rather than text alone.
Rank #24
verified runtime · exact alias · Background only
Raw row drilldown (source row, percentile, last updated, eligibility)
- Source: Arena
- Raw value: 1,144
- Percentile: 11.5%
- Last updated: recent
- Eligibility: benchmark_derived_model
Parsed from Arena leaderboard dataset row `gemma-3-27b-it`. Category: captioning. Source rank: #26. Votes: 273. Organization: google. License: Gemma.
11.5% percentile inside its fair comparison set · Raw benchmark value: 1,144 · CI 1,107 - 1,181
Vision Arena · Creative Writing Vision · No Style Control
AR · Vision understanding · Human
It is useful when the model must read charts, UI, screenshots, or visual scenes rather than text alone.
Rank #47
verified runtime · exact alias · Background only
Raw row drilldown (source row, percentile, last updated, eligibility)
- Source: Arena
- Raw value: 1,186
- Percentile: 16.4%
- Last updated: recent
- Eligibility: benchmark_derived_model
Parsed from Arena leaderboard dataset row `gemma-3-27b-it`. Category: creative_writing_vision. Source rank: #60. Votes: 630. Organization: google. License: Gemma.
16.4% percentile inside its fair comparison set · Raw benchmark value: 1,186 · CI 1,162 - 1,211
Vision Arena · Diagram · No Style Control
AR · Vision understanding · Human
It is useful when the model must read charts, UI, screenshots, or visual scenes rather than text alone.
Rank #58
verified runtime · exact alias · Background only
Raw row drilldown (source row, percentile, last updated, eligibility)
- Source: Arena
- Raw value: 1,158
- Percentile: 18.6%
- Last updated: recent
- Eligibility: benchmark_derived_model
Parsed from Arena leaderboard dataset row `gemma-3-27b-it`. Category: diagram. Source rank: #73. Votes: 1758. Organization: google. License: Gemma.
18.6% percentile inside its fair comparison set · Raw benchmark value: 1,158 · CI 1,143 - 1,172
Vision Arena · English · No Style Control
AR · Vision understanding · Human
It is useful when the model must read charts, UI, screenshots, or visual scenes rather than text alone.
Rank #57
verified runtime · exact alias · Background only
Raw row drilldown (source row, percentile, last updated, eligibility)
- Source: Arena
- Raw value: 1,186
- Percentile: 48.6%
- Last updated: recent
- Eligibility: benchmark_derived_model
Parsed from Arena leaderboard dataset row `gemma-3-27b-it`. Category: english. Source rank: #70. Votes: 8371. Organization: google. License: Gemma.
48.6% percentile inside its fair comparison set · Raw benchmark value: 1,186 · CI 1,175 - 1,196
Vision Arena · Entity Recognition · No Style Control
AR · Vision understanding · Human
It is useful when the model must read charts, UI, screenshots, or visual scenes rather than text alone.
Rank #26
verified runtime · exact alias · Background only
Raw row drilldown (source row, percentile, last updated, eligibility)
- Source: Arena
- Raw value: 1,199
- Percentile: 21.9%
- Last updated: recent
- Eligibility: benchmark_derived_model
Parsed from Arena leaderboard dataset row `gemma-3-27b-it`. Category: entity_recognition. Source rank: #28. Votes: 326. Organization: google. License: Gemma.
21.9% percentile inside its fair comparison set · Raw benchmark value: 1,199 · CI 1,167 - 1,232
Vision Arena · Homework · No Style Control
AR · Vision understanding · Human
It is useful when the model must read charts, UI, screenshots, or visual scenes rather than text alone.
Rank #58
verified runtime · exact alias · Background only
Raw row drilldown (source row, percentile, last updated, eligibility)
- Source: Arena
- Raw value: 1,164
- Percentile: 16.2%
- Last updated: recent
- Eligibility: benchmark_derived_model
Parsed from Arena leaderboard dataset row `gemma-3-27b-it`. Category: homework. Source rank: #73. Votes: 1067. Organization: google. License: Gemma.
16.2% percentile inside its fair comparison set · Raw benchmark value: 1,164 · CI 1,146 - 1,183
Vision Arena · Humor · No Style Control
AR · Vision understanding · Human
It is useful when the model must read charts, UI, screenshots, or visual scenes rather than text alone.
Rank #43
verified runtime · exact alias · Background only
Raw row drilldown (source row, percentile, last updated, eligibility)
- Source: Arena
- Raw value: 1,164
- Percentile: 14.3%
- Last updated: recent
- Eligibility: benchmark_derived_model
Parsed from Arena leaderboard dataset row `gemma-3-27b-it`. Category: humor. Source rank: #56. Votes: 780. Organization: google. License: Gemma.
14.3% percentile inside its fair comparison set · Raw benchmark value: 1,164 · CI 1,141 - 1,187
Vision Arena · OCR · No Style Control
AR · Vision understanding · Human
It is useful when the model must read charts, UI, screenshots, or visual scenes rather than text alone.
Rank #60
verified runtime · exact alias · Background only
Raw row drilldown (source row, percentile, last updated, eligibility)
- Source: Arena
- Raw value: 1,159
- Percentile: 15.7%
- Last updated: recent
- Eligibility: benchmark_derived_model
Parsed from Arena leaderboard dataset row `gemma-3-27b-it`. Category: ocr. Source rank: #75. Votes: 6337. Organization: google. License: Gemma.
15.7% percentile inside its fair comparison set · Raw benchmark value: 1,159 · CI 1,149 - 1,169
Vision Arena · Creative Writing
AR · Vision understanding · Human
It is useful when the model must read charts, UI, screenshots, or visual scenes rather than text alone.
Rank #24
verified runtime · exact alias · Background only
Raw row drilldown (source row, percentile, last updated, eligibility)
- Source: Arena
- Raw value: 1,175
- Percentile: 28.1%
- Last updated: archived
- Eligibility: benchmark_derived_model
Parsed from Arena leaderboard dataset row `gemma-3-27b-it`. Category: creative_writing. Source rank: #25. Votes: 732. Organization: google. License: Gemma.
28.1% percentile inside its fair comparison set · Raw benchmark value: 1,175 · CI 1,153 - 1,197
Vision Arena · Creative Writing · No Style Control
AR · Vision understanding · Human
It is useful when the model must read charts, UI, screenshots, or visual scenes rather than text alone.
Rank #24
verified runtime · exact alias · Background only
Raw row drilldown (source row, percentile, last updated, eligibility)
- Source: Arena
- Raw value: 1,182
- Percentile: 28.1%
- Last updated: archived
- Eligibility: benchmark_derived_model
Parsed from Arena leaderboard dataset row `gemma-3-27b-it`. Category: creative_writing. Source rank: #25. Votes: 732. Organization: google. License: Gemma.
28.1% percentile inside its fair comparison set · Raw benchmark value: 1,182 · CI 1,161 - 1,204
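Several categories appear twice, once by default and once under "No Style Control". One rough way to read such a pair is to check whether the two confidence intervals overlap. The sketch below uses the Creative Writing values listed above; the `Score` type and `intervals_overlap` function are hypothetical names. The page does not state the confidence level of the intervals, so an overlap is only a heuristic hint that the gap may be within noise, not a formal test.

```python
from typing import NamedTuple

class Score(NamedTuple):
    """Raw benchmark value with its reported confidence interval."""
    value: int
    ci_low: int
    ci_high: int

def intervals_overlap(a: Score, b: Score) -> bool:
    """Return True when the two closed intervals share at least one point."""
    return a.ci_low <= b.ci_high and b.ci_low <= a.ci_high

# Values copied from the two Creative Writing cards above.
creative_writing = Score(1175, 1153, 1197)
creative_writing_no_style = Score(1182, 1161, 1204)

delta = creative_writing_no_style.value - creative_writing.value
print(delta, intervals_overlap(creative_writing, creative_writing_no_style))
# prints: 7 True
```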