Shareable comparison workspaces

Save a model comparison as a link.

Save a model comparison or watchlist as a link, then reopen the same evidence pages from any device.

Mode · shareable links
Scope · saved comparisons + follows
State · same evidence pages

Workspace bundle

Portable bundles stay link-native. Use them to preview a shared workspace, reopen the same compare URLs on another device, or import the snapshot without reconstructing intent from loose local fields.

Current workspace0 saved compare views · 0 watches · 0 pinned compare models

Preview or import a shared bundle

How it works

Copy a portable bundle link from the current device or workspace snapshot.
Open that URL anywhere to preview the exact saved compare and follow hrefs.
Import the bundle locally when you want the same workspaces and follows on that device.

What changed this week

alert

7 review items still need manual judgment

The product keeps parser and mapping ambiguity visible instead of silently guessing.

Open

models

Artificial Analysis moved via real benchmark movement

0 benchmark rows were added, 0 removed, and 134 existing rows changed value or evaluation date. Window: 2026-05-13T01:05:56Z -> 2026-05-13T01:19:35Z.

Evidence window: 2026-05-13T01:05:56Z -> 2026-05-13T01:19:35Z

Open evidence page Model Benchmark Source

product

Initial glass-box matrix release

Added matrix homepage, comparable-group normalization, per-cell receipts, source pages, and custom composite preview.

Evidence window: 2026-04-16

Open changelog

models

Methodology contract published

Documented comparability rules, raw-vs-normalized behavior, and why unlike metrics are never averaged by default.

Evidence window: 2026-04-16

models

Artificial Analysis ID rule adopted

Stable model and creator IDs are now the preferred external identity keys when available.

Evidence window: 2026-04-15

models

BridgeBench parser fallback added

Added alternate selectors for category headers after leaderboard markup drift.

Evidence window: 2026-04-15

disagreement

Gemini 2.5 Pro is still a split decision

Cross-benchmark spread sits at 100.0 points, which means rankings still depend heavily on which visible benchmark slices you weight most.

Open

disagreement

Gemini 3.1 Pro is still a split decision

Cross-benchmark spread sits at 100.0 points, which means rankings still depend heavily on which visible benchmark slices you weight most.

Open

Watchlists

Followed items reopen from their canonical URL first. Bundle export still works, but the durable state is the href plus deterministic latest-delta links, not a rebuilt local compare preset.

Open workspaces

Loading watchlist state...

No watchlists yet. Follow a recommendation card or compare set.

Saved compare views

Loading saved compare views...

Save a compare workspace to keep a shortlist around.

Workspace bundle

Current workspace0 saved compare views · 0 watches · 0 pinned compare models

Preview or import a shared bundle