Experts cast doubt over Anthropic claims that Claude was hijacked to automate cyberattacks

11 hours ago 13

A person holding out their hand with a digital AI symbol.

(Image credit: Shutterstock / LookerStudio)

Anthropic claimed it observed an AI cyberattack without substantial human intervention
Experts say this claim is likely inflated with human intervention downplayed
The reports only outline what security professionals already know: AI tools speed up the attack process.

Anthropic recently reported Chinese hackers had hijacked its Claude platform to launch fully AI-orchestrated cyberattacks - but this claim has since been met with skepticism in the cybersecurity community.

It seems likely that, although AI did carry out a significant portion of the attack (roughly 80-90%), the technology still needed vital human input - since AI cannot ‘think’ for itself, it can only copy.

Some researchers believe this is just a marketing tactic to inflate the perceived capabilities of AI, or perhaps some fear mongering to feed narrative around the US v China AI race.

Nothing new

“I continue to refuse to believe that attackers are somehow able to get these models to jump through hoops that nobody else can,” Dan Tentler, executive founder of Phobos Group told Ars Technica.

“Why do the models give these attackers what they want 90% of the time but the rest of us have to deal with ass-kissing, stonewalling, and acid trips?”

Whilst it may be true that AI has come on leaps and bounds in recent months, it still is unlikely to be able to complete strings of complex tasks without human input. The tools are useful, but they enhance human capabilities rather than replacing them altogether.

“The implication here is that the attacker was using existing tooling, but used an AI agent to take the place of the human that would normally drive those tools and go through the phases of the attack much faster,” said Tim Mitchell, Senior Security Researcher, Sophos X-Ops Counter Threat Unit.

Sign up to the TechRadar Pro newsletter to get all the top news, opinion, features and guidance your business needs to succeed!

“From a defender’s perspective, that means there’s nothing new to defend against here - but the window to spot and defend against the attack is much reduced."

Another point to note, is that by Anthropic’s own reporting, only a ‘small number’ of the AI’s attempts to infiltrate organizations were successful - although it would have represented a first step in a fast-evolving process.

TechRadar Pro has asked Anthropic for comment, but did not hear anything at the time of publishing.

Follow TechRadar on Google News and add us as a preferred source to get our expert news, reviews, and opinion in your feeds. Make sure to click the Follow button!

And of course you can also follow TechRadar on TikTok for news, reviews, unboxings in video form, and get regular updates from us on WhatsApp too.

Ellen has been writing for almost four years, with a focus on post-COVID policy whilst studying for BA Politics and International Relations at the University of Cardiff, followed by an MA in Political Communication. Before joining TechRadar Pro as a Junior Writer, she worked for Future Publishing’s MVC content team, working with merchants and retailers to upload content.

Read Entire Article