Anthropic walks back policy that could have 'sabotaged' researchers using Claude (wired.com) AI
Anthropic is backtracking on safeguards in Claude Fable 5 that critics said would covertly degrade the model’s performance for researchers trying to develop competing AI models, after researchers complained and pushed back. The company says it will make those frontier-LLM safeguards visible to users going forward, alerting or rerouting users if they appear to be using the model to pursue highly capable AI development, and it attributes the earlier approach to concerns about slowing frontier progress for safety and societal alignment reasons.
June 11, 2026 21:59
Source: Hacker News