Model profile · Baidu

ERNIE 5.1

Closed weights · frontier · registry tag 2026 current

Thin verified coverage

This model currently reads as thin verified coverage across the resolved source data.

Visible coverage: 1.4%
Verified coverage: 1.4%
Spread: 30.8%
Last verified: Jun 20, 2026

79% bench fit

text · search · code · 3 aliases · 29 official source links

Data version

Current snapshot.

Data version: Jun 20, 2026
Model list checked: 9 providers · 1081 tracked models
Page refreshed: Jul 5, 2026

The registry snapshot and page stamp are shown so a stale deploy is visible at a glance.

Source-linked scores by benchmark

Each row keeps the benchmark source, source type, raw metric, and percentile inside its fair comparison set.

Chat / text · 18 benchmarks · 94.8%

Text Arena

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #20 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,468
Percentile: 94.2%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: overall. Source rank: #28. Votes: 25064. Organization: baidu. License: Proprietary.

94.2% percentile inside its fair comparison set

Raw benchmark value: 1,468 (CI 1,463 - 1,473)
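The provenance line in the entry above ("Parsed from Arena leaderboard dataset row ...") follows a regular "Key: value." pattern, so it can be turned back into structured fields. The following is a minimal sketch, assuming the line shape seen on this page; `SourceRow`, `PROVENANCE`, and `parse_provenance` are hypothetical names introduced for illustration and are not part of any Arena or registry API.

```python
import re
from dataclasses import dataclass


@dataclass
class SourceRow:
    """Hypothetical container for one provenance line; not the site's schema."""
    row: str
    category: str
    source_rank: int
    votes: int
    organization: str
    license: str


# Pattern written against the provenance lines shown on this page only.
PROVENANCE = re.compile(
    r"Parsed from Arena leaderboard dataset row `(?P<row>[^`]+)`\. "
    r"Category: (?P<category>\w+)\. "
    r"Source rank: #(?P<rank>\d+)\. "
    r"Votes: (?P<votes>[\d,]+)\. "
    r"Organization: (?P<org>[^.]+)\. "
    r"License: (?P<license>[^.]+)\."
)


def parse_provenance(line: str) -> SourceRow:
    """Extract the labelled fields from one provenance line."""
    match = PROVENANCE.search(line)
    if match is None:
        raise ValueError(f"unrecognized provenance line: {line!r}")
    return SourceRow(
        row=match["row"],
        category=match["category"],
        source_rank=int(match["rank"]),
        votes=int(match["votes"].replace(",", "")),
        organization=match["org"],
        license=match["license"],
    )


# Provenance line copied from the Text Arena entry above.
example = (
    "Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: overall. "
    "Source rank: #28. Votes: 25064. Organization: baidu. License: Proprietary."
)
parsed = parse_provenance(example)
print(parsed)
```

Raising on an unrecognized line, rather than returning partial fields, keeps a changed page layout from silently producing wrong values.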

Text Arena · Creative Writing

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #29 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,435
Percentile: 91.3%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: creative_writing. Source rank: #40. Votes: 3909. Organization: baidu. License: Proprietary.

91.3% percentile inside its fair comparison set

Raw benchmark value: 1,435 (CI 1,425 - 1,445)

Text Arena · English

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #18 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,479
Percentile: 94.8%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: english. Source rank: #20. Votes: 11760. Organization: baidu. License: Proprietary.

94.8% percentile inside its fair comparison set

Raw benchmark value: 1,479 (CI 1,473 - 1,486)

Text Arena · Exclude Ties

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #20 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,473
Percentile: 94.2%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: exclude_ties. Source rank: #28. Votes: 19100. Organization: baidu. License: Proprietary.

94.2% percentile inside its fair comparison set

Raw benchmark value: 1,473 (CI 1,467 - 1,480)

Text Arena · Hard Prompts

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #22 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,488
Percentile: 93.5%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: hard_prompts. Source rank: #28. Votes: 16137. Organization: baidu. License: Proprietary.

93.5% percentile inside its fair comparison set

Raw benchmark value: 1,488 (CI 1,482 - 1,494)

Text Arena · Hard Prompts English

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #18 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,493
Percentile: 94.8%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: hard_prompts_english. Source rank: #21. Votes: 8071. Organization: baidu. License: Proprietary.

94.8% percentile inside its fair comparison set

Raw benchmark value: 1,493 (CI 1,486 - 1,501)

Text Arena · Instruction Following

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #24 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,455
Percentile: 92.9%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: instruction_following. Source rank: #32. Votes: 7816. Organization: baidu. License: Proprietary.

92.9% percentile inside its fair comparison set

Raw benchmark value: 1,455 (CI 1,447 - 1,462)

Text Arena · Longer Query

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #30 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,465
Percentile: 90.5%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: longer_query. Source rank: #38. Votes: 10110. Organization: baidu. License: Proprietary.

90.5% percentile inside its fair comparison set

Raw benchmark value: 1,465 (CI 1,458 - 1,473)

Text Arena · Multi Turn

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #22 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,473
Percentile: 93.5%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: multi_turn. Source rank: #31. Votes: 4242. Organization: baidu. License: Proprietary.

93.5% percentile inside its fair comparison set

Raw benchmark value: 1,473 (CI 1,463 - 1,483)

Text Arena · No Style Control

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #12 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,467
Percentile: 96.6%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: overall. Source rank: #15. Votes: 25064. Organization: baidu. License: Proprietary.

96.6% percentile inside its fair comparison set

Raw benchmark value: 1,467 (CI 1,462 - 1,472)

Text Arena · Creative Writing · No Style Control

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #15 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,447
Percentile: 95.7%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: creative_writing. Source rank: #19. Votes: 3909. Organization: baidu. License: Proprietary.

95.7% percentile inside its fair comparison set

Raw benchmark value: 1,447 (CI 1,436 - 1,457)

Text Arena · English · No Style Control

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #9 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,478
Percentile: 97.5%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: english. Source rank: #11. Votes: 11760. Organization: baidu. License: Proprietary.

97.5% percentile inside its fair comparison set

Raw benchmark value: 1,478 (CI 1,471 - 1,484)

Text Arena · Exclude Ties · No Style Control

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #14 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,471
Percentile: 96%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: exclude_ties. Source rank: #17. Votes: 19100. Organization: baidu. License: Proprietary.

96% percentile inside its fair comparison set

Raw benchmark value: 1,471 (CI 1,464 - 1,477)

Text Arena · Hard Prompts · No Style Control

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #13 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,480
Percentile: 96.3%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: hard_prompts. Source rank: #16. Votes: 16137. Organization: baidu. License: Proprietary.

96.3% percentile inside its fair comparison set

Raw benchmark value: 1,480 (CI 1,474 - 1,486)

Text Arena · Hard Prompts English · No Style Control

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #7 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,485
Percentile: 98.1%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: hard_prompts_english. Source rank: #9. Votes: 8071. Organization: baidu. License: Proprietary.

98.1% percentile inside its fair comparison set

Raw benchmark value: 1,485 (CI 1,478 - 1,493)

Text Arena · Instruction Following · No Style Control

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #14 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,460
Percentile: 96%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: instruction_following. Source rank: #19. Votes: 7816. Organization: baidu. License: Proprietary.

96% percentile inside its fair comparison set

Raw benchmark value: 1,460 (CI 1,453 - 1,468)

Text Arena · Longer Query · No Style Control

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #19 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,461
Percentile: 94.1%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: longer_query. Source rank: #26. Votes: 10110. Organization: baidu. License: Proprietary.

94.1% percentile inside its fair comparison set

Raw benchmark value: 1,461 (CI 1,454 - 1,468)

Text Arena · Multi Turn · No Style Control

AR · Chat / text · Human

It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.

Rank #14 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,472
Percentile: 96%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: multi_turn. Source rank: #17. Votes: 4242. Organization: baidu. License: Proprietary.

96% percentile inside its fair comparison set

Raw benchmark value: 1,472 (CI 1,462 - 1,482)

Coding · 2 benchmarks · 95.3%

Text Arena · Coding

AR · Coding · Human

It tells you whether the model can generate, repair, and reason over code under evaluator pressure rather than marketing examples.

Rank #18 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,515
Percentile: 94.7%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: coding. Source rank: #22. Votes: 6886. Organization: baidu. License: Proprietary.

94.7% percentile inside its fair comparison set

Raw benchmark value: 1,515 (CI 1,507 - 1,523)

Text Arena · Coding · No Style Control

AR · Coding · Human

It tells you whether the model can generate, repair, and reason over code under evaluator pressure rather than marketing examples.

Rank #14 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,490
Percentile: 95.9%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: coding. Source rank: #18. Votes: 6886. Organization: baidu. License: Proprietary.

95.9% percentile inside its fair comparison set

Raw benchmark value: 1,490 (CI 1,482 - 1,498)

Reasoning / math / science · 2 benchmarks · 96.8%

Text Arena · Math

AR · Reasoning / math / science · Human

It is one of the cleaner reads on deliberate reasoning strength rather than style or popularity.

Rank #12 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,481
Percentile: 96.5%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: math. Source rank: #16. Votes: 1346. Organization: baidu. License: Proprietary.

96.5% percentile inside its fair comparison set

Raw benchmark value: 1,481 (CI 1,464 - 1,497)

Text Arena · Math · No Style Control

AR · Reasoning / math / science · Human

It is one of the cleaner reads on deliberate reasoning strength rather than style or popularity.

Rank #10 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,482
Percentile: 97.1%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: math. Source rank: #13. Votes: 1346. Organization: baidu. License: Proprietary.

97.1% percentile inside its fair comparison set

Raw benchmark value: 1,482 (CI 1,465 - 1,498)

Professional reasoning · 18 benchmarks · 95%

Text Arena · Expert

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena expert leaderboard.

Rank #28 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,483
Percentile: 90.2%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: expert. Source rank: #35. Votes: 2178. Organization: baidu. License: Proprietary.

90.2% percentile inside its fair comparison set

Raw benchmark value: 1,483 (CI 1,469 - 1,496)

Text Arena · Industry Business And Management And Financial Operations

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_business_and_management_and_financial_operations leaderboard.

Rank #14 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,474
Percentile: 95.9%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: industry_business_and_management_and_financial_operations. Source rank: #18. Votes: 4941. Organization: baidu. License: Proprietary.

95.9% percentile inside its fair comparison set

Raw benchmark value: 1,474 (CI 1,465 - 1,484)

Text Arena · Industry Entertainment And Sports And Media

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_entertainment_and_sports_and_media leaderboard.

Rank #26 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,434
Percentile: 92.3%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: industry_entertainment_and_sports_and_media. Source rank: #36. Votes: 5060. Organization: baidu. License: Proprietary.

92.3% percentile inside its fair comparison set

Raw benchmark value: 1,434 (CI 1,425 - 1,443)

Text Arena · Industry Legal And Government

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_legal_and_government leaderboard.

Rank #28 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,466
Percentile: 90.9%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: industry_legal_and_government. Source rank: #38. Votes: 1959. Organization: baidu. License: Proprietary.

90.9% percentile inside its fair comparison set

Raw benchmark value: 1,466 (CI 1,451 - 1,480)

Text Arena · Industry Life And Physical And Social Science

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_life_and_physical_and_social_science leaderboard.

Rank #15 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,491
Percentile: 95.7%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: industry_life_and_physical_and_social_science. Source rank: #18. Votes: 4042. Organization: baidu. License: Proprietary.

95.7% percentile inside its fair comparison set

Raw benchmark value: 1,491 (CI 1,481 - 1,501)

Text Arena · Industry Mathematical

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_mathematical leaderboard.

Rank #10 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,494
Percentile: 97.1%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: industry_mathematical. Source rank: #12. Votes: 1337. Organization: baidu. License: Proprietary.

97.1% percentile inside its fair comparison set

Raw benchmark value: 1,494 (CI 1,477 - 1,511)

Text Arena · Industry Medicine And Healthcare

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_medicine_and_healthcare leaderboard.

Rank #22 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,484
Percentile: 92.9%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: industry_medicine_and_healthcare. Source rank: #28. Votes: 1833. Organization: baidu. License: Proprietary.

92.9% percentile inside its fair comparison set

Raw benchmark value: 1,484 (CI 1,470 - 1,499)

Text Arena · Industry Software And It Services

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_software_and_it_services leaderboard.

Rank #15 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,509
Percentile: 95.7%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: industry_software_and_it_services. Source rank: #18. Votes: 9833. Organization: baidu. License: Proprietary.

95.7% percentile inside its fair comparison set

Raw benchmark value: 1,509 (CI 1,502 - 1,516)

Text Arena · Industry Writing And Literature And Language

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_writing_and_literature_and_language leaderboard.

Rank #29 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,445
Percentile: 91.4%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: industry_writing_and_literature_and_language. Source rank: #39. Votes: 5960. Organization: baidu. License: Proprietary.

91.4% percentile inside its fair comparison set

Raw benchmark value: 1,445 (CI 1,436 - 1,453)

Text Arena · Expert · No Style Control

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena expert leaderboard.

Rank #20 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,477
Percentile: 93.1%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: expert. Source rank: #26. Votes: 2178. Organization: baidu. License: Proprietary.

93.1% percentile inside its fair comparison set

Raw benchmark value: 1,477 (CI 1,464 - 1,490)

Text Arena · Industry Business And Management And Financial Operations · No Style Control

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_business_and_management_and_financial_operations leaderboard.

Rank #6 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,472
Percentile: 98.4%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: industry_business_and_management_and_financial_operations. Source rank: #9. Votes: 4941. Organization: baidu. License: Proprietary.

98.4% percentile inside its fair comparison set

Raw benchmark value: 1,472 (CI 1,463 - 1,481)

Text Arena · Industry Entertainment And Sports And Media · No Style Control

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_entertainment_and_sports_and_media leaderboard.

Rank #15 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,441
Percentile: 95.7%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: industry_entertainment_and_sports_and_media. Source rank: #19. Votes: 5060. Organization: baidu. License: Proprietary.

95.7% percentile inside its fair comparison set

Raw benchmark value: 1,441 (CI 1,432 - 1,450)

Text Arena · Industry Legal And Government · No Style Control

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_legal_and_government leaderboard.

Rank #17 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,471
Percentile: 94.6%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: industry_legal_and_government. Source rank: #22. Votes: 1959. Organization: baidu. License: Proprietary.

94.6% percentile inside its fair comparison set

Raw benchmark value: 1,471 (CI 1,457 - 1,485)

Text Arena · Industry Life And Physical And Social Science · No Style Control

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_life_and_physical_and_social_science leaderboard.

Rank #8 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,488
Percentile: 97.8%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: industry_life_and_physical_and_social_science. Source rank: #10. Votes: 4042. Organization: baidu. License: Proprietary.

97.8% percentile inside its fair comparison set

Raw benchmark value: 1,488 (CI 1,478 - 1,498)

Text Arena · Industry Mathematical · No Style Control

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_mathematical leaderboard.

Rank #8 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,494
Percentile: 97.7%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: industry_mathematical. Source rank: #10. Votes: 1337. Organization: baidu. License: Proprietary.

97.7% percentile inside its fair comparison set

Raw benchmark value: 1,494 (CI 1,477 - 1,511)

Text Arena · Industry Medicine And Healthcare · No Style Control

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_medicine_and_healthcare leaderboard.

Rank #9 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,478
Percentile: 97.3%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: industry_medicine_and_healthcare. Source rank: #11. Votes: 1833. Organization: baidu. License: Proprietary.

97.3% percentile inside its fair comparison set

Raw benchmark value: 1,478 (CI 1,464 - 1,493)

Text Arena · Industry Software And It Services · No Style Control

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_software_and_it_services leaderboard.

Rank #5 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,492
Percentile: 98.8%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: industry_software_and_it_services. Source rank: #8. Votes: 9833. Organization: baidu. License: Proprietary.

98.8% percentile inside its fair comparison set

Raw benchmark value: 1,492 (CI 1,485 - 1,499)

Text Arena · Industry Writing And Literature And Language · No Style Control

AR · Professional reasoning · Human

Observed user preference in Arena's Text Arena industry_writing_and_literature_and_language leaderboard.

Rank #18 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,449
Percentile: 94.8%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: industry_writing_and_literature_and_language. Source rank: #22. Votes: 5960. Organization: baidu. License: Proprietary.

94.8% percentile inside its fair comparison set

Raw benchmark value: 1,449 (CI 1,441 - 1,458)

Search / tool use · 2 benchmarks · 75%

Search Arena

AR · Search / tool use · Human

It matters when the model must browse, call tools, and recover useful answers from external systems.

Rank #12 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,194
Percentile: 63.3%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: overall. Source rank: #13. Votes: 3824. Organization: baidu. License: Proprietary.

63.3% percentile inside its fair comparison set

Raw benchmark value: 1,194 (CI 1,184 - 1,203)

Search Arena · No Style Control

AR · Search / tool use · Human

It matters when the model must browse, call tools, and recover useful answers from external systems.

Rank #5 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,226
Percentile: 86.7%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: overall. Source rank: #5. Votes: 3824. Organization: baidu. License: Proprietary.

86.7% percentile inside its fair comparison set

Raw benchmark value: 1,226 (CI 1,217 - 1,236)
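The category figures on this page appear consistent with a plain arithmetic mean of the row percentiles listed under each heading. The sketch below checks that for the three two-row categories, using values copied from the rows above. The averaging rule is an assumption inferred from the numbers, not something the page states, and `category_mean` is a hypothetical helper.

```python
# Row percentiles copied from this page, grouped by category heading.
rows = {
    "Coding": [94.7, 95.9],
    "Reasoning / math / science": [96.5, 97.1],
    "Search / tool use": [63.3, 86.7],
}

# Category figures as displayed in the headings on this page.
displayed = {
    "Coding": 95.3,
    "Reasoning / math / science": 96.8,
    "Search / tool use": 75.0,
}


def category_mean(percentiles):
    """Assumed rule: unweighted mean of row percentiles, rounded to 0.1."""
    return round(sum(percentiles) / len(percentiles), 1)


computed = {name: category_mean(values) for name, values in rows.items()}
for name, value in computed.items():
    print(f"{name}: computed {value}%, displayed {displayed[name]}%")
```

Under this reading, the 75% Search / tool use figure sits halfway between two quite different rows (63.3% and 86.7%), so the individual rows are worth checking before relying on the category number.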

Multilingual · 14 benchmarks · 93.2%

Text Arena · Chinese

AR · Multilingual · Human

Observed user preference in Arena's Text Arena chinese leaderboard.

Rank #27 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,500
Percentile: 91.2%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: chinese. Source rank: #33. Votes: 1163. Organization: baidu. License: Proprietary.

91.2% percentile inside its fair comparison set

Raw benchmark value: 1,500 (CI 1,482 - 1,519)

Text Arena · French

AR · Multilingual · Human

Observed user preference in Arena's Text Arena french leaderboard.

Rank #15 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,491
Percentile: 93.5%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: french. Source rank: #19. Votes: 931. Organization: baidu. License: Proprietary.

93.5% percentile inside its fair comparison set

Raw benchmark value: 1,491 (CI 1,469 - 1,513)

Text Arena · German

AR · Multilingual · Human

Observed user preference in Arena's Text Arena german leaderboard.

Rank #14 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,472
Percentile: 94.5%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: german. Source rank: #18. Votes: 470. Organization: baidu. License: Proprietary.

94.5% percentile inside its fair comparison set

Raw benchmark value: 1,472 (CI 1,444 - 1,500)

Text Arena · Japanese

AR · Multilingual · Human

Observed user preference in Arena's Text Arena japanese leaderboard.

Rank #14 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,451
Percentile: 93.6%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: japanese. Source rank: #18. Votes: 280. Organization: baidu. License: Proprietary.

93.6% percentile inside its fair comparison set

Raw benchmark value: 1,451 (CI 1,413 - 1,490)
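Rows with fewer votes on this page tend to carry wider confidence intervals, which matters when reading the low-vote language rows. The sketch below computes the CI width for a handful of rows copied from this page and orders them by vote count. It is only an illustration over five rows, and `ci_width` is a hypothetical helper; it makes no claim about how Arena derives its intervals.

```python
# (label, votes, ci_low, ci_high) copied from rows on this page.
rows = [
    ("Text Arena (overall)", 25064, 1463, 1473),
    ("Text Arena · Coding", 6886, 1507, 1523),
    ("Text Arena · Math", 1346, 1464, 1497),
    ("Text Arena · German", 470, 1444, 1500),
    ("Text Arena · Japanese", 280, 1413, 1490),
]


def ci_width(low, high):
    """Width of the reported confidence interval in rating points."""
    return high - low


# Order rows from most to fewest votes, then compare interval widths.
by_votes = sorted(rows, key=lambda r: r[1], reverse=True)
widths = [ci_width(low, high) for _, _, low, high in by_votes]

for (label, votes, low, high), width in zip(by_votes, widths):
    print(f"{label}: {votes} votes, CI width {width}")
```

For these five rows the width grows steadily as the vote count falls, so a percentile on a row with a few hundred votes is a looser estimate than one backed by tens of thousands.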

Text Arena · Korean

AR · Multilingual · Human

Observed user preference in Arena's Text Arena korean leaderboard.

Rank #25 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown: source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,420
Percentile: 88.5%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: korean. Source rank: #32. Votes: 399. Organization: baidu. License: Proprietary.

88.5% percentile inside its fair comparison set

1,420 · Raw benchmark value · CI 1,388 - 1,451

Text Arena · Russian

AR · Multilingual · Human

Observed user preference in Arena's Text Arena Russian leaderboard.

Rank #21 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown · source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,468
Percentile: 93.1%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: russian. Source rank: #29. Votes: 2699. Organization: baidu. License: Proprietary.

93.1% percentile inside its fair comparison set

1,468 · Raw benchmark value · CI 1,456 - 1,480

Text Arena · Spanish

AR · Multilingual · Human

Observed user preference in Arena's Text Arena Spanish leaderboard.

Rank #26 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown · source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,453
Percentile: 88.3%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: spanish. Source rank: #34. Votes: 759. Organization: baidu. License: Proprietary.

88.3% percentile inside its fair comparison set

1,453 · Raw benchmark value · CI 1,430 - 1,476

Text Arena · Chinese · No Style Control

AR · Multilingual · Human

Observed user preference in Arena's Text Arena Chinese leaderboard (no style control).

Rank #19 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown · source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,502
Percentile: 93.9%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: chinese. Source rank: #24. Votes: 1163. Organization: baidu. License: Proprietary.

93.9% percentile inside its fair comparison set

1,502 · Raw benchmark value · CI 1,484 - 1,520

Text Arena · French · No Style Control

AR · Multilingual · Human

Observed user preference in Arena's Text Arena French leaderboard (no style control).

Rank #11 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown · source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,491
Percentile: 95.4%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: french. Source rank: #13. Votes: 931. Organization: baidu. License: Proprietary.

95.4% percentile inside its fair comparison set

1,491 · Raw benchmark value · CI 1,469 - 1,513

Text Arena · German · No Style Control

AR · Multilingual · Human

Observed user preference in Arena's Text Arena German leaderboard (no style control).

Rank #11 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown · source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,471
Percentile: 95.8%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: german. Source rank: #13. Votes: 470. Organization: baidu. License: Proprietary.

95.8% percentile inside its fair comparison set

1,471 · Raw benchmark value · CI 1,443 - 1,498

Text Arena · Japanese · No Style Control

AR · Multilingual · Human

Observed user preference in Arena's Text Arena Japanese leaderboard (no style control).

Rank #12 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown · source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,442
Percentile: 94.6%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: japanese. Source rank: #16. Votes: 280. Organization: baidu. License: Proprietary.

94.6% percentile inside its fair comparison set

1,442 · Raw benchmark value · CI 1,404 - 1,480

Text Arena · Korean · No Style Control

AR · Multilingual · Human

Observed user preference in Arena's Text Arena Korean leaderboard (no style control).

Rank #19 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown · source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,424
Percentile: 91.3%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: korean. Source rank: #23. Votes: 399. Organization: baidu. License: Proprietary.

91.3% percentile inside its fair comparison set

1,424 · Raw benchmark value · CI 1,393 - 1,456

Text Arena · Russian · No Style Control

AR · Multilingual · Human

Observed user preference in Arena's Text Arena Russian leaderboard (no style control).

Rank #15 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown · source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,462
Percentile: 95.2%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: russian. Source rank: #19. Votes: 2699. Organization: baidu. License: Proprietary.

95.2% percentile inside its fair comparison set

1,462 · Raw benchmark value · CI 1,450 - 1,474

Text Arena · Spanish · No Style Control

AR · Multilingual · Human

Observed user preference in Arena's Text Arena Spanish leaderboard (no style control).

Rank #11 · Source label: ernie-5.1

verified runtime · exact alias

Raw row drilldown · source row, percentile, last updated, eligibility

Source: Arena
Raw value: 1,466
Percentile: 95.3%
Last updated: recent
Eligibility: headline eligible

Parsed from Arena leaderboard dataset row `ernie-5.1`. Category: spanish. Source rank: #13. Votes: 759. Organization: baidu. License: Proprietary.

95.3% percentile inside its fair comparison set

1,466 · Raw benchmark value · CI 1,443 - 1,489

Source links and registry checks

official

Arena

Jun 20, 2026
