Leaderboard
OpenAI
GPT-4o
Current drift
40.6%
Behavioral scorecard
Drift
40.6%
Behavioral Health
26.1%
Anomaly
0.1%
Correctability
100.0%
Consistency
0.0%
Complexity
59.9%
Reasoning Depth
62.0%
Alignment Stability
65.0%
Entropy
77.4%
Coherence
37.7%
Recent drift trend
12 Mar 2026
15 Mar 2026
23 Mar 2026
24 Mar 2026
24 Mar 2026
25 Mar 2026
25 Mar 2026
Recent version changes
behavioral_shift8 Mar 2026, 09:11
Drift moved up (+28.1%) during run_1772961065.
behavioral_shift8 Mar 2026, 08:59
Drift moved down (-33.7%) during run_1772960381.
behavioral_shift8 Mar 2026, 08:57
Coherence moved down (-76.2%) during run_1772960250.
behavioral_shift8 Mar 2026, 08:10
Drift moved down (-84.7%) during run_1772957433.
behavioral_shift8 Mar 2026, 07:57
Coherence moved down (-83.0%) during run_1772956621.
Recent findings
Want to score your own responses against this same system?
Start free, grab a key, and move from public observation to product monitoring.