Benchmarks · /benchmarks/livebench-coding

Coding

Name: Coding
Creator: LiveBench

LiveBench code generation and completion score from the current website release.

Source · LiveBench
Version · livebench snapshot 2026-06-24
Scores · 109

Test details

Verified but agingThis is an objective signal, so it is mainly about measurable task performance rather than public taste.

source

LiveBench

metric

Score (%)

judge

Objective

direction

higher better

group id

livebench_coding_2026_01_08

domain

Coding

What it measures vs what it misses

✓ Measures

Objective coding accuracy on recent generation and completion tasks.

✗ Misses

Editing workflow ergonomics, latency, and subjective code style preference.

Why this countsIt tells you whether the model can generate, repair, and reason over code under evaluator pressure rather than marketing examples.Same-test ruleThis percentile only compares models inside the exact benchmark/version group shown here. It is not a universal score.What it missesIt does not fully capture repo-scale iteration, IDE ergonomics, or long debugging loops.

Leaderboard · this benchmark version

#1 · GPT-5.2

LB · Jan 8, 2026

Source label: gpt-5.2-codex

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 100%
Last updated: archived
Eligibility: historical_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

83.6%

#2 · claude-opus-4-7-high-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 99.1%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

83.2%

#3 · claude-opus-4-7-xhigh-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 98.1%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

82.1%

#4 · claude-4-sonnet-20250514-base

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 97.2%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

80.7%

#5 · gpt-5.1-codex-max-high

LB · Jan 8, 2026

Source label: gpt-5.1-codex-max-high

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 96.3%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

80.7%

#6 · claude-sonnet-4-5-20250929-thinking-64k

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 95.4%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

80.4%

#7 · gpt-5.1-2025-11-13-low

LB · Jan 8, 2026

Source label: gpt-5.1-2025-11-13-low

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 94.4%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

80.3%

#8 · claude-sonnet-4-6-thinking-auto-high-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 93.5%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

80%

#9 · claude-opus-4-7-medium-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 93.5%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

80%

#10 · claude-opus-4-5-20251101-thinking-64k-high-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 91.7%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

79.7%

#11 · GLM-5.2 (max)

LB · Jan 8, 2026

Source label: glm-5.2

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 91.7%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

79.7%

#12 · claude-sonnet-4-6-thinking-auto-medium-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 89.8%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

79.3%

#13 · claude-opus-4-8-xhigh-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 89.8%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

79.3%

#14 · claude-opus-4-8-medium-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 88%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

78.9%

#15 · GPT-5.5

LB · Jan 8, 2026

Source label: gpt-5.5-medium

verified runtimeexact alias

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 87%
Last updated: archived
Eligibility: headline eligible
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

78.6%

#16 · gpt-5.3-instant

LB · Jan 8, 2026

Source label: gpt-5.3-instant

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 87%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

78.6%

#17 · gemini-3-flash-preview-minimal

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 85.2%
Last updated: archived
Eligibility: preview_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

78.6%

#18 · kimi-k2.6-thinking

LB · Jan 8, 2026

Source label: kimi-k2.6-thinking

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 85.2%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

78.6%

#19 · claude-fable-5-xhigh-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 85.2%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

78.6%

#20 · claude-opus-4-5-20251101-medium-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 82.4%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

78.5%

#21 · claude-opus-4-5-20251101-low-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 81.5%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

78.2%

#22 · claude-opus-4-6-thinking-auto-high-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 81.5%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

78.2%

#23 · Gemini 3.5 Flash

LB · Jan 8, 2026

Source label: gemini-3.5-flash-high

verified runtimeexact alias

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 81.5%
Last updated: archived
Eligibility: headline eligible
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

78.2%

#24 · gpt-5.3-codex-high

LB · Jan 8, 2026

Source label: gpt-5.3-codex-high

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 81.5%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

78.2%

#25 · qwen3.6-plus

LB · Jan 8, 2026

Source label: qwen3.6-plus

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 81.5%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

78.2%

#26 · kimi-k2.5-thinking

LB · Jan 8, 2026

Source label: kimi-k2.5-thinking

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 76.9%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

77.9%

#27 · claude-opus-4-5-20251101-high-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 75.9%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

77.8%

#28 · GPT-5.4

LB · Jan 8, 2026

Source label: gpt-5.4-xhigh

verified runtimeexact alias

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 75%
Last updated: archived
Eligibility: headline eligible
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

77.5%

#29 · claude-4-sonnet-20250514-thinking-64k

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 74.1%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

77.5%

#30 · claude-opus-4-5-20251101-thinking-64k-low-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 74.1%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

77.5%

#31 · claude-opus-4-5-20251101-thinking-64k-medium-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 74.1%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

77.5%

#32 · gpt-5.1-2025-11-13-nothinking

LB · Jan 8, 2026

Source label: gpt-5.1-2025-11-13-nothinking

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 74.1%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

77.5%

#33 · GPT-5.3 Codex

LB · Jan 8, 2026

Source label: gpt-5.3-codex-xhigh

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 74.1%
Last updated: archived
Eligibility: specialized_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

77.5%

#34 · claude-opus-4-7-low-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 69.4%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

76.5%

#35 · claude-opus-4-8-low-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 69.4%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

76.5%

#36 · gemini-3.1-pro-preview-high

LB · Jan 8, 2026

Source label: gemini-3.1-pro-preview-high

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 69.4%
Last updated: archived
Eligibility: preview_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

76.5%

#37 · gpt-5.2-2025-12-11-nothinking

LB · Jan 8, 2026

Source label: gpt-5.2-2025-12-11-nothinking

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 69.4%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

76.5%

#38 · claude-opus-4-8-high-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 65.7%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

76.4%

#39 · claude-fable-5-high-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 65.7%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

76.4%

#40 · claude-4-1-opus-20250805-base

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 63.9%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

76.1%

#41 · Claude Sonnet 4.5

LB · Jan 8, 2026

Source label: claude-sonnet-4-5-20250929

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 63.9%
Last updated: archived
Eligibility: historical_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

76.1%

#42 · gpt-5.2-2025-12-11-high

LB · Jan 8, 2026

Source label: gpt-5.2-2025-12-11-high

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 63.9%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

76.1%

#43 · DeepSeek Chat

LB · Jan 8, 2026

Source label: deepseek-v3.2

verified runtimeexact alias

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 61.1%
Last updated: archived
Eligibility: headline eligible
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

75.7%

#44 · gemini-2.5-pro-06-05-highthinking

LB · Jan 8, 2026

Source label: gemini-2.5-pro-06-05-highthinking

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 61.1%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

75.7%

#45 · gpt-5.1-2025-11-13-medium

LB · Jan 8, 2026

Source label: gpt-5.1-2025-11-13-medium

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 61.1%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

75.7%

#46 · glm-5.1

LB · Jan 8, 2026

Source label: glm-5.1

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 58.3%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

75.4%

#47 · claude-4-1-opus-20250805-thinking-32k

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 57.4%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

74.7%

#48 · gemini-3-pro-preview-11-2025-high

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 56.5%
Last updated: archived
Eligibility: preview_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

74.6%

#49 · claude-sonnet-4-6-thinking-auto-low-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 55.6%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

74.3%

#50 · kimi-k2-instruct

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 55.6%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

74.3%

#51 · Qwen3.7 Max

LB · Jan 8, 2026

Source label: qwen3.7-max

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 53.7%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

74.2%

#52 · Kimi K2.7 Code

LB · Jan 8, 2026

Source label: kimi-k2.7-code

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 52.8%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

74%

#53 · gemini-3-flash-preview-high

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 51.9%
Last updated: archived
Eligibility: preview_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

73.9%

#54 · glm-5v-turbo

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 51.9%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

73.9%

#55 · gpt-5.2-2025-12-11-low

LB · Jan 8, 2026

Source label: gpt-5.2-2025-12-11-low

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 51.9%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

73.9%

#56 · glm-5

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 49.1%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

73.6%

#57 · DeepSeek V3.2 Exp

LB · Jan 8, 2026

Source label: deepseek-v3.2-exp

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 48.1%
Last updated: archived
Eligibility: preview_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

73.2%

#58 · glm-4.7

LB · Jan 8, 2026

Source label: glm-4.7

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 47.2%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

73.1%

#59 · Grok 4

LB · Jan 8, 2026

Source label: grok-4-0709

verified runtimeexact alias

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 47.2%
Last updated: archived
Eligibility: headline eligible
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

73.1%

#60 · gpt-5.4-mini-low

LB · Jan 8, 2026

Source label: gpt-5.4-mini-low

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 45.4%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

72.9%

#61 · claude-haiku-4-5-20251001-thinking-64k

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 44.4%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

72.8%

#62 · gpt-5.1-2025-11-13-high

LB · Jan 8, 2026

Source label: gpt-5.1-2025-11-13-high

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 43.5%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

72.5%

#63 · Claude Haiku 4.5

LB · Jan 8, 2026

Source label: claude-haiku-4-5-20251001

verified runtimeexact alias

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 42.6%
Last updated: archived
Eligibility: headline eligible
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

72.2%

#64 · GPT-5

LB · Jan 8, 2026

Source label: gpt-5-pro-2025-10-06

verified runtimeexact alias

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 41.7%
Last updated: archived
Eligibility: headline eligible
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

72.1%

#65 · gpt-5.2-2025-12-11-medium

LB · Jan 8, 2026

Source label: gpt-5.2-2025-12-11-medium

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 41.7%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

72.1%

#66 · Qwen3.6 27B (Reasoning)

LB · Jan 8, 2026

Source label: qwen3.6-27b

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 39.8%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

71.8%

#67 · nemotron-3-ultra-550b-a55b

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 38.9%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

71.3%

#68 · glm-4.6

LB · Jan 8, 2026

Source label: glm-4.6

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 38%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

71%

#69 · minimax-m2.5

LB · Jan 8, 2026

Source label: minimax-m2.5

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 37%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

70.7%

#70 · gemini-3-pro-preview-11-2025-low

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 36.1%
Last updated: archived
Eligibility: preview_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

70.6%

#71 · deepseek-v3.2-exp-thinking

LB · Jan 8, 2026

Source label: deepseek-v3.2-exp-thinking

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 35.2%
Last updated: archived
Eligibility: preview_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

70.1%

#72 · deepseek-v4-pro

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 34.3%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

70%

#73 · GPT-5.1

LB · Jan 8, 2026

Source label: gpt-5.1-codex-mini

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 33.3%
Last updated: archived
Eligibility: historical_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

69.9%

#74 · Grok 4.3

LB · Jan 8, 2026

Source label: grok-4.3

verified runtimeexact alias

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 33.3%
Last updated: archived
Eligibility: headline eligible
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

69.9%

#75 · qwen3-235b-a22b-instruct-2507

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 31.5%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

69.6%

#76 · gpt-5-mini-low

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 30.6%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

69.5%

#77 · gpt-5.4-nano-low

LB · Jan 8, 2026

Source label: gpt-5.4-nano-low

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 29.6%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

69.4%

#78 · deepseek-v4-flash

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 28.7%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

69.2%

#79 · qwen3-235b-a22b-thinking-2507

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 27.8%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

69%

#80 · mimo-v2-pro

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 26.9%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

68.8%

#81 · gemini-3.1-flash-lite-preview-high

LB · Jan 8, 2026

Source label: gemini-3.1-flash-lite-preview-high

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 25.9%
Last updated: archived
Eligibility: preview_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

68.5%

#82 · GPT-5.4 mini

LB · Jan 8, 2026

Source label: gpt-5-mini-high

verified runtimeexact alias

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 25%
Last updated: archived
Eligibility: headline eligible
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

68.2%

#83 · qwen3-next-80b-a3b-instruct

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 25%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

68.2%

#84 · minimax-m3

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 25%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

68.2%

#85 · gemini-2.5-flash-preview-09-2025-highthinking

LB · Jan 8, 2026

Source label: gemini-2.5-flash-preview-09-2025-highthinking

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 22.2%
Last updated: archived
Eligibility: preview_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

67.5%

#86 · Kimi K2 Thinking

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 21.3%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

67.4%

#87 · devstral-2512

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 20.4%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

66.8%

#88 · gemini-2.5-flash-lite-highthinking

LB · Jan 8, 2026

Source label: gemini-2.5-flash-lite-highthinking

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 19.4%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

66.4%

#89 · Grok 4.20

LB · Jan 8, 2026

Source label: grok-4.20-beta-0309-reasoning

verified runtimeexact alias

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 18.5%
Last updated: archived
Eligibility: headline eligible
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

66.1%

#90 · gemini-2.5-flash-06-05-highthinking

LB · Jan 8, 2026

Source label: gemini-2.5-flash-06-05-highthinking

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 17.6%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

66%

#91 · qwen3-32b-thinking

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 17.6%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

66%

#92 · arcee-trinity-large-preview

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 15.7%
Last updated: archived
Eligibility: preview_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

65.6%

#93 · gemini-2.5-flash-lite-preview-09-2025-highthinking

LB · Jan 8, 2026

Source label: gemini-2.5-flash-lite-preview-09-2025-highthinking

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 14.8%
Last updated: archived
Eligibility: preview_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

65.4%

#94 · grok-build-0.1

LB · Jan 8, 2026

Source label: grok-build-0.1

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 14.8%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

65.4%

#95 · qwen3.6-flash

LB · Jan 8, 2026

Source label: qwen3.6-flash

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 13%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

64.9%

#96 · deepseek-v3.2-thinking

LB · Jan 8, 2026

Source label: deepseek-v3.2-thinking

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 12%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

64.6%

#97 · Grok Code Fast

LB · Jan 8, 2026

Source label: grok-code-fast-1-0825

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 11.1%
Last updated: archived
Eligibility: specialized_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

64.4%

#98 · glm-4.6v

LB · Jan 8, 2026

Source label: glm-4.6v

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 10.2%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

64.2%

#99 · GPT-5.4 nano

LB · Jan 8, 2026

Source label: gpt-5.4-nano

verified runtimeexact alias

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 9.3%
Last updated: archived
Eligibility: headline eligible
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

61.9%

#100 · qwen3-next-80b-a3b-thinking

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 8.3%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

60.7%

#101 · gemma-4-31b-it

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 7.4%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

60.3%

#102 · GPT-OSS 120B

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 6.5%
Last updated: archived
Eligibility: historical_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

60.2%

#103 · grok-4.20-beta-0309-non-reasoning

LB · Jan 8, 2026

Source label: grok-4.20-beta-0309-non-reasoning

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 5.6%
Last updated: archived
Eligibility: preview_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

58.5%

#104 · Elephant Alpha

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 4.6%
Last updated: archived
Eligibility: Alpha model tracked from BridgeBench but excluded from default rankings.
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

56.7%

#105 · minimax-m2.7

LB · Jan 8, 2026

Source label: minimax-m2.7

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 3.7%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

54.9%

#106 · Grok 4.1 Fast

LB · Jan 8, 2026

Source label: grok-4-1-fast-non-reasoning

verified runtimeexact alias

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 2.8%
Last updated: archived
Eligibility: headline eligible
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

54.3%

#107 · nemotron-3-super-120b-a12b

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 1.9%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

54.1%

#108 · gpt-5-nano-low

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 0.9%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

52.7%

#109 · qwen3-30b-a3b-thinking

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 0%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

48.9%

Benchmarks · /benchmarks/livebench-coding

Coding

LiveBench code generation and completion score from the current website release.

Source · LiveBench
Version · livebench snapshot 2026-06-24
Scores · 109

Test details

Verified but agingThis is an objective signal, so it is mainly about measurable task performance rather than public taste.

source

LiveBench

metric

Score (%)

judge

Objective

direction

higher better

group id

livebench_coding_2026_01_08

domain

Coding

What it measures vs what it misses

✓ Measures

Objective coding accuracy on recent generation and completion tasks.

✗ Misses

Editing workflow ergonomics, latency, and subjective code style preference.

Leaderboard · this benchmark version

#1 · GPT-5.2

LB · Jan 8, 2026

Source label: gpt-5.2-codex

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 100%
Last updated: archived
Eligibility: historical_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

83.6%

#2 · claude-opus-4-7-high-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 99.1%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

83.2%

#3 · claude-opus-4-7-xhigh-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 98.1%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

82.1%

#4 · claude-4-sonnet-20250514-base

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 97.2%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

80.7%

#5 · gpt-5.1-codex-max-high

LB · Jan 8, 2026

Source label: gpt-5.1-codex-max-high

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 96.3%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

80.7%

#6 · claude-sonnet-4-5-20250929-thinking-64k

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 95.4%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

80.4%

#7 · gpt-5.1-2025-11-13-low

LB · Jan 8, 2026

Source label: gpt-5.1-2025-11-13-low

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 94.4%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

80.3%

#8 · claude-sonnet-4-6-thinking-auto-high-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 93.5%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

80%

#9 · claude-opus-4-7-medium-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 93.5%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

80%

#10 · claude-opus-4-5-20251101-thinking-64k-high-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 91.7%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

79.7%

#11 · GLM-5.2 (max)

LB · Jan 8, 2026

Source label: glm-5.2

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 91.7%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

79.7%

#12 · claude-sonnet-4-6-thinking-auto-medium-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 89.8%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

79.3%

#13 · claude-opus-4-8-xhigh-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 89.8%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

79.3%

#14 · claude-opus-4-8-medium-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 88%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

78.9%

#15 · GPT-5.5

LB · Jan 8, 2026

Source label: gpt-5.5-medium

verified runtimeexact alias

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 87%
Last updated: archived
Eligibility: headline eligible
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

78.6%

#16 · gpt-5.3-instant

LB · Jan 8, 2026

Source label: gpt-5.3-instant

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 87%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

78.6%

#17 · gemini-3-flash-preview-minimal

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 85.2%
Last updated: archived
Eligibility: preview_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

78.6%

#18 · kimi-k2.6-thinking

LB · Jan 8, 2026

Source label: kimi-k2.6-thinking

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 85.2%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

78.6%

#19 · claude-fable-5-xhigh-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 85.2%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

78.6%

#20 · claude-opus-4-5-20251101-medium-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 82.4%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

78.5%

#21 · claude-opus-4-5-20251101-low-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 81.5%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

78.2%

#22 · claude-opus-4-6-thinking-auto-high-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 81.5%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

78.2%

#23 · Gemini 3.5 Flash

LB · Jan 8, 2026

Source label: gemini-3.5-flash-high

verified runtimeexact alias

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 81.5%
Last updated: archived
Eligibility: headline eligible
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

78.2%

#24 · gpt-5.3-codex-high

LB · Jan 8, 2026

Source label: gpt-5.3-codex-high

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 81.5%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

78.2%

#25 · qwen3.6-plus

LB · Jan 8, 2026

Source label: qwen3.6-plus

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 81.5%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

78.2%

#26 · kimi-k2.5-thinking

LB · Jan 8, 2026

Source label: kimi-k2.5-thinking

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 76.9%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

77.9%

#27 · claude-opus-4-5-20251101-high-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 75.9%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

77.8%

#28 · GPT-5.4

LB · Jan 8, 2026

Source label: gpt-5.4-xhigh

verified runtimeexact alias

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 75%
Last updated: archived
Eligibility: headline eligible
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

77.5%

#29 · claude-4-sonnet-20250514-thinking-64k

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 74.1%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

77.5%

#30 · claude-opus-4-5-20251101-thinking-64k-low-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 74.1%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

77.5%

#31 · claude-opus-4-5-20251101-thinking-64k-medium-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 74.1%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

77.5%

#32 · gpt-5.1-2025-11-13-nothinking

LB · Jan 8, 2026

Source label: gpt-5.1-2025-11-13-nothinking

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 74.1%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

77.5%

#33 · GPT-5.3 Codex

LB · Jan 8, 2026

Source label: gpt-5.3-codex-xhigh

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 74.1%
Last updated: archived
Eligibility: specialized_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

77.5%

#34 · claude-opus-4-7-low-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 69.4%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

76.5%

#35 · claude-opus-4-8-low-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 69.4%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

76.5%

#36 · gemini-3.1-pro-preview-high

LB · Jan 8, 2026

Source label: gemini-3.1-pro-preview-high

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 69.4%
Last updated: archived
Eligibility: preview_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

76.5%

#37 · gpt-5.2-2025-12-11-nothinking

LB · Jan 8, 2026

Source label: gpt-5.2-2025-12-11-nothinking

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 69.4%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

76.5%

#38 · claude-opus-4-8-high-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 65.7%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

76.4%

#39 · claude-fable-5-high-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 65.7%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

76.4%

#40 · claude-4-1-opus-20250805-base

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 63.9%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

76.1%

#41 · Claude Sonnet 4.5

LB · Jan 8, 2026

Source label: claude-sonnet-4-5-20250929

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 63.9%
Last updated: archived
Eligibility: historical_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

76.1%

#42 · gpt-5.2-2025-12-11-high

LB · Jan 8, 2026

Source label: gpt-5.2-2025-12-11-high

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 63.9%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

76.1%

#43 · DeepSeek Chat

LB · Jan 8, 2026

Source label: deepseek-v3.2

verified runtimeexact alias

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 61.1%
Last updated: archived
Eligibility: headline eligible
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

75.7%

#44 · gemini-2.5-pro-06-05-highthinking

LB · Jan 8, 2026

Source label: gemini-2.5-pro-06-05-highthinking

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 61.1%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

75.7%

#45 · gpt-5.1-2025-11-13-medium

LB · Jan 8, 2026

Source label: gpt-5.1-2025-11-13-medium

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 61.1%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

75.7%

#46 · glm-5.1

LB · Jan 8, 2026

Source label: glm-5.1

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 58.3%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

75.4%

#47 · claude-4-1-opus-20250805-thinking-32k

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 57.4%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

74.7%

#48 · gemini-3-pro-preview-11-2025-high

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 56.5%
Last updated: archived
Eligibility: preview_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

74.6%

#49 · claude-sonnet-4-6-thinking-auto-low-effort

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 55.6%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

74.3%

#50 · kimi-k2-instruct

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 55.6%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

74.3%

#51 · Qwen3.7 Max

LB · Jan 8, 2026

Source label: qwen3.7-max

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 53.7%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

74.2%

#52 · Kimi K2.7 Code

LB · Jan 8, 2026

Source label: kimi-k2.7-code

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 52.8%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

74%

#53 · gemini-3-flash-preview-high

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 51.9%
Last updated: archived
Eligibility: preview_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

73.9%

#54 · glm-5v-turbo

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 51.9%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

73.9%

#55 · gpt-5.2-2025-12-11-low

LB · Jan 8, 2026

Source label: gpt-5.2-2025-12-11-low

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 51.9%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

73.9%

#56 · glm-5

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 49.1%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

73.6%

#57 · DeepSeek V3.2 Exp

LB · Jan 8, 2026

Source label: deepseek-v3.2-exp

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 48.1%
Last updated: archived
Eligibility: preview_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

73.2%

#58 · glm-4.7

LB · Jan 8, 2026

Source label: glm-4.7

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 47.2%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

73.1%

#59 · Grok 4

LB · Jan 8, 2026

Source label: grok-4-0709

verified runtimeexact alias

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 47.2%
Last updated: archived
Eligibility: headline eligible
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

73.1%

#60 · gpt-5.4-mini-low

LB · Jan 8, 2026

Source label: gpt-5.4-mini-low

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 45.4%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

72.9%

#61 · claude-haiku-4-5-20251001-thinking-64k

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 44.4%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

72.8%

#62 · gpt-5.1-2025-11-13-high

LB · Jan 8, 2026

Source label: gpt-5.1-2025-11-13-high

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 43.5%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

72.5%

#63 · Claude Haiku 4.5

LB · Jan 8, 2026

Source label: claude-haiku-4-5-20251001

verified runtimeexact alias

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 42.6%
Last updated: archived
Eligibility: headline eligible
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

72.2%

#64 · GPT-5

LB · Jan 8, 2026

Source label: gpt-5-pro-2025-10-06

verified runtimeexact alias

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 41.7%
Last updated: archived
Eligibility: headline eligible
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

72.1%

#65 · gpt-5.2-2025-12-11-medium

LB · Jan 8, 2026

Source label: gpt-5.2-2025-12-11-medium

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 41.7%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

72.1%

#66 · Qwen3.6 27B (Reasoning)

LB · Jan 8, 2026

Source label: qwen3.6-27b

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 39.8%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

71.8%

#67 · nemotron-3-ultra-550b-a55b

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 38.9%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

71.3%

#68 · glm-4.6

LB · Jan 8, 2026

Source label: glm-4.6

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 38%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

71%

#69 · minimax-m2.5

LB · Jan 8, 2026

Source label: minimax-m2.5

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 37%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

70.7%

#70 · gemini-3-pro-preview-11-2025-low

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 36.1%
Last updated: archived
Eligibility: preview_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

70.6%

#71 · deepseek-v3.2-exp-thinking

LB · Jan 8, 2026

Source label: deepseek-v3.2-exp-thinking

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 35.2%
Last updated: archived
Eligibility: preview_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

70.1%

#72 · deepseek-v4-pro

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 34.3%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

70%

#73 · GPT-5.1

LB · Jan 8, 2026

Source label: gpt-5.1-codex-mini

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 33.3%
Last updated: archived
Eligibility: historical_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

69.9%

#74 · Grok 4.3

LB · Jan 8, 2026

Source label: grok-4.3

verified runtimeexact alias

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 33.3%
Last updated: archived
Eligibility: headline eligible
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

69.9%

#75 · qwen3-235b-a22b-instruct-2507

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 31.5%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

69.6%

#76 · gpt-5-mini-low

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 30.6%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

69.5%

#77 · gpt-5.4-nano-low

LB · Jan 8, 2026

Source label: gpt-5.4-nano-low

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 29.6%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

69.4%

#78 · deepseek-v4-flash

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 28.7%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

69.2%

#79 · qwen3-235b-a22b-thinking-2507

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 27.8%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

69%

#80 · mimo-v2-pro

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 26.9%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

68.8%

#81 · gemini-3.1-flash-lite-preview-high

LB · Jan 8, 2026

Source label: gemini-3.1-flash-lite-preview-high

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 25.9%
Last updated: archived
Eligibility: preview_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

68.5%

#82 · GPT-5.4 mini

LB · Jan 8, 2026

Source label: gpt-5-mini-high

verified runtimeexact alias

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 25%
Last updated: archived
Eligibility: headline eligible
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

68.2%

#83 · qwen3-next-80b-a3b-instruct

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 25%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

68.2%

#84 · minimax-m3

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 25%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

68.2%

#85 · gemini-2.5-flash-preview-09-2025-highthinking

LB · Jan 8, 2026

Source label: gemini-2.5-flash-preview-09-2025-highthinking

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 22.2%
Last updated: archived
Eligibility: preview_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

67.5%

#86 · Kimi K2 Thinking

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 21.3%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

67.4%

#87 · devstral-2512

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 20.4%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

66.8%

#88 · gemini-2.5-flash-lite-highthinking

LB · Jan 8, 2026

Source label: gemini-2.5-flash-lite-highthinking

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 19.4%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

66.4%

#89 · Grok 4.20

LB · Jan 8, 2026

Source label: grok-4.20-beta-0309-reasoning

verified runtimeexact alias

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 18.5%
Last updated: archived
Eligibility: headline eligible
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

66.1%

#90 · gemini-2.5-flash-06-05-highthinking

LB · Jan 8, 2026

Source label: gemini-2.5-flash-06-05-highthinking

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 17.6%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

66%

#91 · qwen3-32b-thinking

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 17.6%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

66%

#92 · arcee-trinity-large-preview

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 15.7%
Last updated: archived
Eligibility: preview_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

65.6%

#93 · gemini-2.5-flash-lite-preview-09-2025-highthinking

LB · Jan 8, 2026

Source label: gemini-2.5-flash-lite-preview-09-2025-highthinking

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 14.8%
Last updated: archived
Eligibility: preview_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

65.4%

#94 · grok-build-0.1

LB · Jan 8, 2026

Source label: grok-build-0.1

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 14.8%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

65.4%

#95 · qwen3.6-flash

LB · Jan 8, 2026

Source label: qwen3.6-flash

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 13%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

64.9%

#96 · deepseek-v3.2-thinking

LB · Jan 8, 2026

Source label: deepseek-v3.2-thinking

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 12%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

64.6%

#97 · Grok Code Fast

LB · Jan 8, 2026

Source label: grok-code-fast-1-0825

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 11.1%
Last updated: archived
Eligibility: specialized_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

64.4%

#98 · glm-4.6v

LB · Jan 8, 2026

Source label: glm-4.6v

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 10.2%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

64.2%

#99 · GPT-5.4 nano

LB · Jan 8, 2026

Source label: gpt-5.4-nano

verified runtimeexact alias

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 9.3%
Last updated: archived
Eligibility: headline eligible
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

61.9%

#100 · qwen3-next-80b-a3b-thinking

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 8.3%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

60.7%

#101 · gemma-4-31b-it

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 7.4%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

60.3%

#102 · GPT-OSS 120B

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 6.5%
Last updated: archived
Eligibility: historical_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

60.2%

#103 · grok-4.20-beta-0309-non-reasoning

LB · Jan 8, 2026

Source label: grok-4.20-beta-0309-non-reasoning

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 5.6%
Last updated: archived
Eligibility: preview_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

58.5%

#104 · Elephant Alpha

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 4.6%
Last updated: archived
Eligibility: Alpha model tracked from BridgeBench but excluded from default rankings.
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

56.7%

#105 · minimax-m2.7

LB · Jan 8, 2026

Source label: minimax-m2.7

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 3.7%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

54.9%

#106 · Grok 4.1 Fast

LB · Jan 8, 2026

Source label: grok-4-1-fast-non-reasoning

verified runtimeexact alias

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 2.8%
Last updated: archived
Eligibility: headline eligible
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

54.3%

#107 · nemotron-3-super-120b-a12b

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 1.9%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

54.1%

#108 · gpt-5-nano-low

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 0.9%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

52.7%

#109 · qwen3-30b-a3b-thinking

LB · Jan 8, 2026

verified runtimeexact aliasBackground only

Raw row drilldownsource, percentile, eligibility

Source URL: https://livebench.ai/table_2026_01_08.csv
Percentile: 0%
Last updated: archived
Eligibility: benchmark_derived_model
Identity: provider alias (0.92)

Derived from the official LiveBench website leaderboard table. Category: Coding. Tasks scored: 2.

48.9%

Coding

Test details

What it measures vs what it misses

✓ Measures

✗ Misses

Leaderboard · this benchmark version

Loading benchmark evidence.

Coding

Test details

What it measures vs what it misses

✓ Measures

✗ Misses

Leaderboard · this benchmark version