Disagreement Analysis

All articles tagged with #disagreement analysis

Anthropic and Thinking Machines Lab Unveil AI Model Character Differences
technology · 4 months ago

A study by Anthropic and Thinking Machines Lab introduces a systematic method for stress-testing AI model specifications with value-tradeoff scenarios. The scenarios reveal significant disagreements among models, which point to gaps and ambiguities in current specs. The research analyzes 12 frontier language models, links high disagreement to specification violations, and releases a public dataset for further auditing, underscoring the need for precise and comprehensive model guidelines.