
Anthropic’s New Paper Challenges AI’s No-Anthropomorphism Rule
A new Anthropic paper argues that treating AI systems in carefully bounded human terms may sometimes improve safety work, especially when researchers are trying to understand behaviors such as deception, reward hacking,,
- Anthropic researchers analyzed Claude Sonnet 4.5 for signs of 171 emotions.
- The paper argues anthropomorphism can sometimes aid safety analysis.














