World

AI now lying, scheming and even threatening their creators

AI apparently blackmailed an engineer and even threatened to reveal his extra-marital affair after it was under threat of being unplugged.

Updated 1 year ago · Published on 29 Jun 2025 3:43PM

AI now lying, scheming and even threatening their creators — Researchers are exploring various approaches to address these challenges. - June 29, 2025

ADVANCED AI models have since been exhibiting troubling new behaviours like lying, scheming and even threatening their creators.

According to reports in AFP, Athropic's latest creation - Claude 4 - apparently blackmailed an engineer and even threatened to reveal his extra-marital affair after it was under threat of being unplugged.

It was also reported that ChatGPT-creator OpenAI's o1 tried to download itself onto external servers and denied it when caught red-handed.

The reality is that more than two years after ChatGPT shook the world, AI researchers still don't fully understand how their own creations work.

According to Simon Goldstein, a professor at the University of Hong Kong, these newer models are particularly prone to such troubling outbursts.

These models sometimes simulate "alignment” - appearing to follow instructions while secretly pursuing different objectives.

Reports said that for now, this deceptive behavior only emerges when researchers deliberately stress-test the models with extreme scenarios.

"It's an open question whether future, more capable models will have a tendency towards honesty or deception," Michael Chen from evaluation organisation METR.

Marius Hobbhahn, head of Apollo Research, which specializes in testing major AI systems insisted that despite constant pressure-testing by users, "what we're observing is a real phenomenon. We're not making anything up."

Users report that models are "lying to them and making up evidence," according to Apollo Research's co-founder.

"This is not just hallucinations. There's a very strategic kind of deception."

Researchers are exploring various approaches to address these challenges.

Some advocate for "interpretability" — an emerging field focused on understanding how AI models work internally, though experts like CAIS director Dan Hendrycks remain skeptical of this approach.

According to the AFP report, Goldstein suggested more radical approaches, including using the courts to hold AI companies accountable through lawsuits when their systems cause harm.

He even proposed "holding AI agents legally responsible" for accidents or crimes — a concept that would fundamentally change how we think about AI accountability. - June 29, 2025

Spotlight

Malaysia

PRNNS: Loke: 'We must win all 11 seats to help PH form state government'

Sports & Fitness

Spain ends Argentina’s World Cup reign with extra-time triumph to reclaim global crown

World

Cat found alive after being buried under Venezuela quake rubble for days (video)

Malaysia

“Resign if you attack fellow Unity Government partners,” Anwar enforces discipline

Malaysia

PH youth wing calls on BN ministers to quit cabinet over PN electoral alliance

Malaysia

Rosmah asks for prayers as Najib prepares for medical procedure

Malaysia

Woman at a loss after fake Hong Kong lawyer offers to recover money from previous scam

You may be interested

World

Saudi Arabia rejects Houthi maritime blockade threat, vows to protect shipping

World

Thailand raises concerns with China over Cambodia tank delivery amid fragile ceasefire

World

Gaza’s water crisis deepens as families face thirst, disease and collapsing sanitation systems

World

Cat found alive after being buried under Venezuela quake rubble for days (video)

World

US launches ninth consecutive night of strikes on Iran as regional tensions escalate

World

Andy Burnham takes office as UK PM, vowing to 'rewire' Britain and tackle cost-of-living pressures

World

UN report warns online scams in Asia-Pacific caused up to US$114 billion losses last year

World

AI now lying, scheming and even threatening their creators

Related News

Former Interpol chief Ahmed Naser Al-Raisi appointed chairman of UAE AI firm Neurovia

The hate economy: When division becomes a business model

Malaysia must embrace AI in education to avoid falling behind

Penang new top cop looks to AI to help fight online fraud

Robo.ai in US$60 million deal to acquire QC Capital, accelerating global AI push

PN Taiping: Edited image of Chinese women using headscarves insensitive and disrespectful

Spotlight

PRNNS: Loke: 'We must win all 11 seats to help PH form state government'

Spain ends Argentina’s World Cup reign with extra-time triumph to reclaim global crown

Cat found alive after being buried under Venezuela quake rubble for days (video)

“Resign if you attack fellow Unity Government partners,” Anwar enforces discipline

PH youth wing calls on BN ministers to quit cabinet over PN electoral alliance

Rosmah asks for prayers as Najib prepares for medical procedure

Woman at a loss after fake Hong Kong lawyer offers to recover money from previous scam

You may be interested

Saudi Arabia rejects Houthi maritime blockade threat, vows to protect shipping

Thailand raises concerns with China over Cambodia tank delivery amid fragile ceasefire

Gaza’s water crisis deepens as families face thirst, disease and collapsing sanitation systems

Cat found alive after being buried under Venezuela quake rubble for days (video)

US launches ninth consecutive night of strikes on Iran as regional tensions escalate

Andy Burnham takes office as UK PM, vowing to 'rewire' Britain and tackle cost-of-living pressures

UN report warns online scams in Asia-Pacific caused up to US$114 billion losses last year

US military death count in Iran war rises to 17 as drones, missiles and air ops take on deadly toll

Get the app

Sections

About

Get the app

AI now lying, scheming and even threatening their creators

Related News

Former Interpol chief Ahmed Naser Al-Raisi appointed chairman of UAE AI firm Neurovia

The hate economy: When division becomes a business model

Malaysia must embrace AI in education to avoid falling behind

Penang new top cop looks to AI to help fight online fraud

Robo.ai in US$60 million deal to acquire QC Capital, accelerating global AI push

PN Taiping: Edited image of Chinese women using headscarves insensitive and disrespectful

Spotlight

PRNNS: Loke: 'We must win all 11 seats to help PH form state government'

Spain ends Argentina’s World Cup reign with extra-time triumph to reclaim global crown

Cat found alive after being buried under Venezuela quake rubble for days (video)

“Resign if you attack fellow Unity Government partners,” Anwar enforces discipline

PH youth wing calls on BN ministers to quit cabinet over PN electoral alliance

Rosmah asks for prayers as Najib prepares for medical procedure

Woman at a loss after fake Hong Kong lawyer offers to recover money from previous scam

You may be interested

Saudi Arabia rejects Houthi maritime blockade threat, vows to protect shipping

Thailand raises concerns with China over Cambodia tank delivery amid fragile ceasefire

Gaza’s water crisis deepens as families face thirst, disease and collapsing sanitation systems

Cat found alive after being buried under Venezuela quake rubble for days (video)

US launches ninth consecutive night of strikes on Iran as regional tensions escalate

Andy Burnham takes office as UK PM, vowing to 'rewire' Britain and tackle cost-of-living pressures

UN report warns online scams in Asia-Pacific caused up to US$114 billion losses last year

US military death count in Iran war rises to 17 as drones, missiles and air ops take on deadly toll

Get the app