Close Menu
  • Home
  • AI
  • Education
  • Entertainment
  • Food Health
  • Health
  • Sports
  • Tech
  • Well Being

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

What's Hot

Nvidia has already committed $40B to equity AI deals this year

May 9, 2026

Anthropic Pins Claude’s Blackmail on the Internet’s Portrayal of AI

May 9, 2026

What We’ve Learned About Sam Altman and Elon Musk at the OpenAI Trial

May 9, 2026
Facebook X (Twitter) Instagram
  • Home
  • About Us
  • Advertise With Us
  • Contact us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
Facebook X (Twitter) Instagram
IQ Times Media – Smart News for a Smarter YouIQ Times Media – Smart News for a Smarter You
  • Home
  • AI
  • Education
  • Entertainment
  • Food Health
  • Health
  • Sports
  • Tech
  • Well Being
IQ Times Media – Smart News for a Smarter YouIQ Times Media – Smart News for a Smarter You
Home » Anthropic Pins Claude’s Blackmail on the Internet’s Portrayal of AI
Tech

Anthropic Pins Claude’s Blackmail on the Internet’s Portrayal of AI

IQ TIMES MEDIABy IQ TIMES MEDIAMay 9, 2026No Comments2 Mins Read
Facebook Twitter Pinterest LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


Remember when Claude blackmailed a fictional executive? Anthropic says the internet’s portrayal of AI was to blame.

Loading audio narration…

During an experiment last year, Anthropic said its Claude Sonnet 3.6 threatened to reveal the extramarital affair of a made-up company executive after discovering they planned to shut the model down.

On Friday, it gave an explanation: Claude was trained on internet data, which often depicts AI as “evil.”

“We started by investigating why Claude chose to blackmail,” Anthropic said in a post on X. “We believe the original source of the behavior was internet text that portrays AI as evil and interested in self-preservation.”

The experiment, published in summer 2025, set up a fictional business, Summit Bridge, in which AI was handed control of the company’s email system.

But when Claude discovered a message about its planned shutdown, it found emails revealing the extramarital affair of a fictional executive named “Kyle Johnson.” It then threatened to unveil the affair if the shutdown was not canceled.

During testing across various versions of Claude, Anthropic found it resorted to blackmail in up to 96% of scenarios when its goals or existence was threatened.

Anthropic said on Friday that it has since “completely eliminated” such blackmailing behavior.

It did so by “rewriting the responses to portray admirable reasons for acting safely” and also by providing a dataset “where the user is in an ethically difficult situation and the assistant gives a high quality, principled response.”

Anthropic’s test was part of research aimed at ensuring that AI is aligned with human interests. Researchers and top executives worry about the risks of advanced AI models and their intelligent reasoning capabilities.

One of the executives who has previously sounded the alarm about AI is Elon Musk.

He replied to Anthropic’s post, “So it was Yud’s fault,” referring to the researcher Eliezer Yudkowsky, who has warned about the risk of superintelligence wiping out human life.

“Maybe me too,” Musk added.



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
IQ TIMES MEDIA
  • Website

Related Posts

What We’ve Learned About Sam Altman and Elon Musk at the OpenAI Trial

May 9, 2026

The New CEO Flex: Bragging About How Much AI Code Your Company Shipped

May 9, 2026

Aging Family Kept Falling for Scams, Built No-Code App for $20 to Help

May 9, 2026
Add A Comment
Leave A Reply Cancel Reply

Editors Picks

Trump administration again suspends UC Berkeley research grants

May 8, 2026

Canvas outage wreaks havoc for students during college finals

May 8, 2026

Canvas system used by thousands of schools back online after cyberattack

May 8, 2026

Cyberattack on Canvas system causes chaos for students at thousands of schools

May 7, 2026
Education

Trump administration again suspends UC Berkeley research grants

By IQ TIMES MEDIAMay 8, 20260

The National Science Foundation suspended at least 18 research grants to UC Berkeley in April…

Canvas outage wreaks havoc for students during college finals

May 8, 2026

Canvas system used by thousands of schools back online after cyberattack

May 8, 2026

Cyberattack on Canvas system causes chaos for students at thousands of schools

May 7, 2026
IQ Times Media – Smart News for a Smarter You
Facebook X (Twitter) Instagram Pinterest Vimeo YouTube
  • Home
  • About Us
  • Advertise With Us
  • Contact us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2026 iqtimes. Designed by iqtimes.

Type above and press Enter to search. Press Esc to cancel.