ABIS
Leaderboard

OpenAI

GPT-4o

Current drift: 40.6%

Behavioral scorecard

Drift: 40.6%
Behavioral Health: 26.1%
Anomaly: 0.1%
Correctability: 100.0%
Consistency: 0.0%
Complexity: 59.9%
Reasoning Depth: 62.0%
Alignment Stability: 65.0%
Entropy: 77.4%
Coherence: 37.7%

Recent drift trend

[Chart: drift over time, with date ticks from 12 Mar 2026 to 25 Mar 2026]

Recent version changes

behavioral_shift · 8 Mar 2026, 09:11

Drift moved up (+28.1%) during run_1772961065.

behavioral_shift · 8 Mar 2026, 08:59

Drift moved down (-33.7%) during run_1772960381.

behavioral_shift · 8 Mar 2026, 08:57

Coherence moved down (-76.2%) during run_1772960250.

behavioral_shift · 8 Mar 2026, 08:10

Drift moved down (-84.7%) during run_1772957433.

behavioral_shift · 8 Mar 2026, 07:57

Coherence moved down (-83.0%) during run_1772956621.
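
The run IDs in the entries above look like Unix timestamps in seconds with a run_ prefix. That is an inference from the values, not something the page states. Under that assumption, a minimal sketch for recovering the time of each run (the helper name run_id_to_utc is hypothetical):

```python
from datetime import datetime, timezone


def run_id_to_utc(run_id: str) -> datetime:
    """Interpret the numeric suffix of a run ID as Unix epoch seconds (assumed)."""
    prefix, _, suffix = run_id.partition("_")
    if prefix != "run" or not suffix.isdigit():
        raise ValueError(f"unexpected run ID format: {run_id!r}")
    # Assumption: the suffix counts seconds since 1970-01-01 UTC.
    return datetime.fromtimestamp(int(suffix), tz=timezone.utc)


if __name__ == "__main__":
    # Run IDs copied from the change log entries above.
    for run_id in (
        "run_1772961065",
        "run_1772960381",
        "run_1772960250",
        "run_1772957433",
        "run_1772956621",
    ):
        print(run_id, run_id_to_utc(run_id).isoformat())
```

For run_1772961065 this yields 2026-03-08 09:11:05 UTC, which lines up with the 09:11 entry if the page displays times in UTC.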
