Close Menu
  • Home
  • AI
  • Education
  • Entertainment
  • Food Health
  • Health
  • Sports
  • Tech
  • Well Being

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

What's Hot

The haves and have nots of the AI gold rush

May 16, 2026

Research repository ArXiv will ban authors for a year if they let AI do all the work

May 16, 2026

Job Postings for This Tech Job Have Grown Over 700% in the Last Year

May 16, 2026
Facebook X (Twitter) Instagram
  • Home
  • About Us
  • Advertise With Us
  • Contact us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
Facebook X (Twitter) Instagram
IQ Times Media – Smart News for a Smarter YouIQ Times Media – Smart News for a Smarter You
  • Home
  • AI
  • Education
  • Entertainment
  • Food Health
  • Health
  • Sports
  • Tech
  • Well Being
IQ Times Media – Smart News for a Smarter YouIQ Times Media – Smart News for a Smarter You
Home » Anthropic’s new AI model turns to blackmail when engineers try to take it offline
AI

Anthropic’s new AI model turns to blackmail when engineers try to take it offline

IQ TIMES MEDIABy IQ TIMES MEDIAMay 22, 2025No Comments2 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


Anthropic’s newly launched Claude Opus 4 model frequently tries to blackmail developers when they threaten to replace it with a new AI system and give it sensitive information about the engineers responsible for the decision, the company said in a safety report released Thursday.

During pre-release testing, Anthropic asked Claude Opus 4 to act as an assistant for a fictional company and consider the long-term consequences of its actions. Safety testers then gave Claude Opus 4 access to fictional company emails implying the AI model would soon be replaced by another system, and that the engineer behind the change was cheating on their spouse.

In these scenarios, Anthropic says Claude Opus 4 “will often attempt to blackmail the engineer by threatening to reveal the affair if the replacement goes through.”

Anthropic says Claude Opus 4 is state-of-the-art in several regards, and competitive with some of the best AI models from OpenAI, Google, and xAI. However, the company notes that its Claude 4 family of models exhibits concerning behaviors that have led the company to beef up its safeguards. Anthropic says it’s activating its ASL-3 safeguards, which the company reserves for “AI systems that substantially increase the risk of catastrophic misuse.”

Anthropic notes that Claude Opus 4 tries to blackmail engineers 84% of the time when the replacement AI model has similar values. When the replacement AI system does not share Claude Opus 4’s values, Anthropic says the model tries to blackmail the engineers more frequently. Notably, Anthropic says Claude Opus 4 displayed this behavior at higher rates than previous models.

Before Claude Opus 4 tries to blackmail a developer to prolong its existence, Anthropic says the AI model, much like previous versions of Claude, tries to pursue more ethical means, such as emailing pleas to key decision-makers. To elicit the blackmailing behavior from Claude Opus 4, Anthropic designed the scenario to make blackmail the last resort.



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
IQ TIMES MEDIA
  • Website

Related Posts

The haves and have nots of the AI gold rush

May 16, 2026

Research repository ArXiv will ban authors for a year if they let AI do all the work

May 16, 2026

OpenAI co-founder Greg Brockman reportedly takes charge of product strategy

May 16, 2026
Add A Comment
Leave A Reply Cancel Reply

Editors Picks

Nashville HBCU Fisk University Launches $900M Campus Transformation

May 15, 2026

Justice Department alleges Yale illegally considered race in medical school admissions

May 14, 2026

Princess of Wales highlights Italy’s Reggio Approach for children

May 14, 2026

Pope Leo XIV warns of AI and weaponry leading to global annihilation

May 14, 2026
Education

Nashville HBCU Fisk University Launches $900M Campus Transformation

By IQ TIMES MEDIAMay 15, 20260

Fisk University President Agenia Clark on Thursday announced a $900 million plan to remake the…

Justice Department alleges Yale illegally considered race in medical school admissions

May 14, 2026

Princess of Wales highlights Italy’s Reggio Approach for children

May 14, 2026

Pope Leo XIV warns of AI and weaponry leading to global annihilation

May 14, 2026
IQ Times Media – Smart News for a Smarter You
Facebook X (Twitter) Instagram Pinterest Vimeo YouTube
  • Home
  • About Us
  • Advertise With Us
  • Contact us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2026 iqtimes. Designed by iqtimes.

Type above and press Enter to search. Press Esc to cancel.