Visible tradeoffs
This is an efficiency signal, so it belongs beside quality rather than being mistaken for quality.
source: Artificial Analysis
metric: Blended price (/1M tokens)
judge: Speed / cost
direction: lower better
group id: aa_price_blended_3_1_current
domain: Chat / text
What it measures vs what it misses
✓ Measures
Approximate API cost under a fixed 3:1 input-output token mix.
✗ Misses
Application-specific cache hit rates, discounts, and subscription-plan economics.
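To make the 3:1 mix concrete, here is a minimal sketch of how a blended price could be computed as a weighted average of input and output prices. The function name, parameter names, and example prices are hypothetical and are not taken from Artificial Analysis; the only assumption carried over from the card is the fixed 3:1 input-to-output weighting.

```python
def blended_price(input_price_per_m: float,
                  output_price_per_m: float,
                  input_parts: float = 3.0,
                  output_parts: float = 1.0) -> float:
    """Weighted average price per 1M tokens for a fixed input:output mix.

    Assumption: the blend is a plain weighted mean, with 3 parts input
    tokens to 1 part output tokens by default.
    """
    total_parts = input_parts + output_parts
    weighted_sum = (input_parts * input_price_per_m
                    + output_parts * output_price_per_m)
    return weighted_sum / total_parts


# Hypothetical prices in USD per 1M tokens: $1.00 input, $5.00 output.
# (3 * 1.00 + 1 * 5.00) / 4 = 2.00
print(blended_price(1.00, 5.00))
```

The sketch deliberately ignores cache hit rates, discounts, and subscription plans, which is the gap noted under "Misses": a workload with a very different token mix or heavy caching may see a noticeably different effective cost.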
Why this counts
It tests whether the model is actually useful in normal conversational turns, not just on narrow correctness tasks.
Same-test rule
This percentile only compares models inside the exact benchmark/version group shown here. It is not a universal score.
What it misses
It does not prove deeper reasoning, tool use, or enterprise workflow reliability.
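The same-test rule can be illustrated with a small sketch that ranks a model only against rows sharing its group id, with lower values treated as better. The page does not state its exact percentile formula, so the definition below (share of other models in the group that the target beats) is an assumption, and the model names, values, and second group id are hypothetical.

```python
def percentile_in_group(rows, group_id, model):
    """Percent of other models in the same group that `model` beats.

    Assumptions: lower value is better, and ties do not count as wins.
    Rows from any other group id are ignored entirely.
    """
    group = [r for r in rows if r["group_id"] == group_id]
    target = next(r for r in group if r["model"] == model)
    others = len(group) - 1
    if others == 0:
        return 100.0  # a lone model has nothing to be compared against
    beaten = sum(1 for r in group if r["value"] > target["value"])
    return 100.0 * beaten / others


# Hypothetical data: three models in the price group, one in another group.
rows = [
    {"group_id": "aa_price_blended_3_1_current", "model": "model-a", "value": 1.0},
    {"group_id": "aa_price_blended_3_1_current", "model": "model-b", "value": 2.0},
    {"group_id": "aa_price_blended_3_1_current", "model": "model-c", "value": 4.0},
    {"group_id": "other_group_hypothetical", "model": "model-d", "value": 0.1},
]

print(percentile_in_group(rows, "aa_price_blended_3_1_current", "model-b"))
```

Note that model-d, although cheapest overall, has no effect on the result because it sits in a different group; that is the sense in which the percentile is not a universal score.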