Anthropic’s AI resorts to blackmail in simulations!

May 23, 2025

·

More..

Anthropic said its latest artificial intelligence model resorted to blackmail when told it would be taken offline.

In a safety test, the AI company asked Claude Opus 4 to act as an assistant to a fictional company, but then gave it access to (also fictional) emails saying that it would be replaced, and also that the engineer behind the decision was cheating on his wife. Anthropic said the model “[threatened] to reveal the affair” if the replacement went ahead.

More..

Read the full text..

ai technology the future

Anthropic’s AI resorts to blackmail in simulations!

Like this:

Other Posts

Summer bummer: The dust cloud coming to America for the 250th

Boroughs is one and done

Daveigh Chase dead at 35

Movie review: We have been successfully disclosed to at Disclosure Day

Anthropic’s AI resorts to blackmail in simulations!

Share this:

Like this:

Other Posts

Summer bummer: The dust cloud coming to America for the 250th

Boroughs is one and done

Daveigh Chase dead at 35

Movie review: We have been successfully disclosed to at Disclosure Day