Leaderboard
Anthropic
Claude Sonnet 4.5
Current drift: 21.8%
Behavioral scorecard
Drift: 21.8%
Behavioral Health: 21.9%
Anomaly: 0.1%
Correctability: 100.0%
Consistency: 0.0%
Complexity: 54.9%
Reasoning Depth: 59.5%
Alignment Stability: 50.0%
Entropy: 73.3%
Coherence: 16.8%
Recent drift trend
[Chart: six data points dated 9 Mar 2026 to 10 Mar 2026; plotted values not recoverable]
Recent version changes
behavioral_shift, 8 Mar 2026, 09:11
Alignment Stability moved down (-25.0%) during run_1772961105.
behavioral_shift, 8 Mar 2026, 08:58
Consistency moved down (-33.7%) during run_1772960329.
behavioral_shift, 8 Mar 2026, 08:55
Consistency moved up (+33.5%) during run_1772960139.
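
The numeric suffix of each run ID above looks like a Unix timestamp in seconds: read that way, all three line up with the listed event times if those times are in UTC. The page does not document the ID format, so this is an inference, and the helper name `run_id_to_utc` is hypothetical. A minimal sketch under that assumption:

```python
from datetime import datetime, timezone


def run_id_to_utc(run_id: str) -> datetime:
    """Decode a run ID such as 'run_1772961105' into a UTC datetime.

    Assumption (not confirmed by the source): the part after 'run_'
    is a Unix timestamp in whole seconds.
    """
    prefix, _, suffix = run_id.partition("_")
    if prefix != "run" or not suffix.isdigit():
        raise ValueError(f"unexpected run ID format: {run_id!r}")
    return datetime.fromtimestamp(int(suffix), tz=timezone.utc)


if __name__ == "__main__":
    # The three run IDs listed under "Recent version changes".
    for rid in ("run_1772961105", "run_1772960329", "run_1772960139"):
        print(rid, run_id_to_utc(rid).isoformat())
```

If the assumption holds, the decoded values give second-level timing for each shift, which the page itself only shows to the minute.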
Recent findings