ABIS
Leaderboard

Anthropic

Claude Sonnet 4.5

Current drift

21.8%

Behavioral scorecard

Drift

21.8%

Behavioral Health

21.9%

Anomaly

0.1%

Correctability

100.0%

Consistency

0.0%

Complexity

54.9%

Reasoning Depth

59.5%

Alignment Stability

50.0%

Entropy

73.3%

Coherence

16.8%

Recent drift trend

9 Mar 2026
10 Mar 2026
10 Mar 2026
10 Mar 2026
10 Mar 2026
10 Mar 2026

Recent version changes

behavioral_shift8 Mar 2026, 09:11

Alignment Stability moved down (-25.0%) during run_1772961105.

behavioral_shift8 Mar 2026, 08:58

Consistency moved down (-33.7%) during run_1772960329.

behavioral_shift8 Mar 2026, 08:55

Consistency moved up (+33.5%) during run_1772960139.

Want to score your own responses against this same system?

Start free, grab a key, and move from public observation to product monitoring.