World

AI now lying, scheming and even threatening their creators

AI apparently blackmailed an engineer and even threatened to reveal his extra-marital affair after it was under threat of being unplugged.

Updated 11 months ago · Published on 29 Jun 2025 3:43PM

AI now lying, scheming and even threatening their creators
Researchers are exploring various approaches to address these challenges. - June 29, 2025

ADVANCED AI models have since been exhibiting troubling new behaviours like lying, scheming and even threatening their creators.

According to reports in AFP, Athropic's latest creation - Claude 4 - apparently blackmailed an engineer and even threatened to reveal his extra-marital affair after it was under threat of being unplugged.

It was also reported that ChatGPT-creator OpenAI's o1 tried to download itself onto external servers and denied it when caught red-handed.

The reality is that more than two years after ChatGPT shook the world, AI researchers still don't fully understand how their own creations work.

According to Simon Goldstein, a professor at the University of Hong Kong, these newer models are particularly prone to such troubling outbursts.

These models sometimes simulate "alignment” - appearing to follow instructions while secretly pursuing different objectives.

Reports said that for now, this deceptive behavior only emerges when researchers deliberately stress-test the models with extreme scenarios.

 "It's an open question whether future, more capable models will have a tendency towards honesty or deception," Michael Chen from evaluation organisation METR.

Marius Hobbhahn, head of Apollo Research, which specializes in testing major AI systems  insisted that despite constant pressure-testing by users, "what we're observing is a real phenomenon. We're not making anything up."

 Users report that models are "lying to them and making up evidence," according to Apollo Research's co-founder.

"This is not just hallucinations. There's a very strategic kind of deception."

Researchers are exploring various approaches to address these challenges.

Some advocate for "interpretability" — an emerging field focused on understanding how AI models work internally, though experts like CAIS director Dan Hendrycks remain skeptical of this approach.

According to the AFP report, Goldstein suggested more radical approaches, including using the courts to hold AI companies accountable through lawsuits when their systems cause harm.

He even proposed "holding AI agents legally responsible" for accidents or crimes — a concept that would fundamentally change how we think about AI accountability. - June 29, 2025

Related News

Malaysia / 3d

King: Politicians must watch their words, not misuse freedom of speech

Events / 6d

300 C-suite leaders to confront Malaysia's AI leadership gap at upcoming summit

Malaysia / 2w

Jho Low's 1MDB Equanimity superyacht now a cutting-edge support and research vessel

Events / 2w

Neurovia AI rolls out NeuroStream™ as data pressures mount in AI race

Opinion / 1mth

What education must produce

Off beat / 1mth

Malaysia unveils world’s first AI-powered review system for badminton

Spotlight

Malaysia

Former head of a ministry's corporate communications unit acquitted of bribery charge

Malaysia

Two sisters die trapped in Johor house fire as escape routes cut off by flames

Malaysia

NS election speculation intensifies as Aminuddin granted audience with state ruler

Malaysia

Teenager who drove recklessly, causing death remanded for further investigation

Malaysia

Police looking for trio involved in violent armed robbery in Penang (video)

Malaysia

Family of five killed as car crashes into water pipe in Serian

Malaysia

'I was once spat on by a pakcik' — Marina denies fear of contesting Malay-majority seats

Malaysia

Jewellery shop among six premises destroyed in fire (video)

You may be interested

World

Thaksin to receive full freedom under royal pardon as Thailand ends remaining sentence

World

Israel-Lebanon ceasefire offers hope for wider Iran peace deal as regional violence persists

World

Trump admits calling Netanyahu “crazy” as US pushes for broader Middle East ceasefire

World

Trump proposes new tariffs on 60 economies over forced labour concerns, Malaysia included

World

Does Iran have nukes?

World

Cambodia turns to UN conciliation to resolve Thailand maritime dispute over offshore energy resources

World

Gulf conflict reignites as missile attacks strike Kuwait, diplomatic breakthrough remains elusive