The product keeps parser and mapping ambiguity visible instead of silently guessing.
Changelog
What changed this week
0 benchmark rows were added, 0 removed, and 134 existing rows changed value or evaluation date. Window: 2026-05-13T01:05:56Z -> 2026-05-13T01:19:35Z.
Added matrix homepage, comparable-group normalization, per-cell receipts, source pages, and custom composite preview.
Documented comparability rules, raw-vs-normalized behavior, and why unlike metrics are never averaged by default.
Stable model and creator IDs are now the preferred external identity keys when available.
Added alternate selectors for category headers after leaderboard markup drift.
Cross-benchmark spread sits at 100.0 points, which means rankings still depend heavily on which visible benchmark slices you weight most.
Cross-benchmark spread sits at 100.0 points, which means rankings still depend heavily on which visible benchmark slices you weight most.
What changed this week
The product keeps parser and mapping ambiguity visible instead of silently guessing.
0 benchmark rows were added, 0 removed, and 134 existing rows changed value or evaluation date. Window: 2026-05-13T01:05:56Z -> 2026-05-13T01:19:35Z.
Added matrix homepage, comparable-group normalization, per-cell receipts, source pages, and custom composite preview.
Documented comparability rules, raw-vs-normalized behavior, and why unlike metrics are never averaged by default.
Stable model and creator IDs are now the preferred external identity keys when available.
Added alternate selectors for category headers after leaderboard markup drift.
Cross-benchmark spread sits at 100.0 points, which means rankings still depend heavily on which visible benchmark slices you weight most.
Cross-benchmark spread sits at 100.0 points, which means rankings still depend heavily on which visible benchmark slices you weight most.