Anthropic hires former OpenAI safety lead to head up new team
In this article:
- Jon Leike has now joined OpenAI rival Anthropic to lead a new “superalignment” team
- A leading AI researcher, he resigned from OpenAI earlier this month and publicly criticized the company’s approach to AI safety.
- In a post on X, Leike stated that his team at Anthropic will focus on various aspects of AI safety and security, including:
- “Scalable oversight”
- “Weak-to-strong generalization”
- Automated alignment research