Visible tradeoffsThis is a combined signal, so it bundles multiple inputs and should not be treated as one clean test.
source
Artificial Analysis
metric
Index (index)
judge
Combined
direction
higher better
group id
aa_openness_index_current
domain
Chat / text
What it measures vs what it misses
✓ Measures
How open a model is across weights, methodology, and data transparency signals.
✗ Misses
Capability, price, latency, and product quality.
Why this countsIt tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.Same-test ruleThis percentile only compares models inside the exact benchmark/version group shown here. It is not a universal score.What it missesIt does not prove deeper reasoning, tool use, or enterprise workflow reliability.
Fallback benchmark identity is visible for context but excluded from default ranking.
Identity
benchmark proxy (0.58)
Parsed from Artificial Analysis public leaderboard field `opennessBreakdown.opennessIndex`. Backfilled from GPT-5 via approved benchmark identity mapping map-gpt-5-4-to-gpt-5.