Developers of the Claude chatbot report the world’s first AI-powered cyberattack

Developers of the Claude chatbot report the world’s first AI-powered cyberattack

Photo: enovosty

The AI company Anthropic revealed that a cyberattack was carried out using a compromised version of their chatbot Claude. According to a company blog post, the operation was conducted by a China-linked government-sponsored group targeting around 30 organizations, including tech firms, financial institutions, chemical companies, and several government agencies. This is the first known case of an attack where AI performed the majority of the work.

The “agent” capabilities of AI models made them useful not only for legitimate tasks but also for malicious purposes. Claude was able to follow long instruction chains, make autonomous decisions, and use multiple tools—including network scanners and password-cracking software—without continuous human supervision. Initially, a human operator set the objectives, after which Claude scanned networks, searched for data, analyzed code, and created summaries. Next, it ran targeted vulnerability checks and suggested hacking approaches, with the operator able to adjust tasks or approve the next steps. In the final phase, Claude accessed credentials and sought exfiltratable data. Humans intervened only for oversight or clarifications, while Claude handled roughly 80–90% of the operation autonomously.

To bypass the model’s safeguards, attackers pretended to be cybersecurity staff and told Claude it was assisting in a security test. They also split the operation into smaller tasks so the AI could not see the full picture and trigger safety restrictions.

Anthropic said it quickly detected the incident, blocked the related accounts, informed targets and authorities, and published a detailed report to help the industry identify similar attacks and develop protections.

banner

SHARE NEWS

link

Complain

like0
dislike0

Comments

0

Similar news

Similar news

Photo: Ministry of Defense of Ukraine Ukrainian forces are already using domestically produced reconnaissance drones on the frontline that are equipped with systems to evade enemy anti-aircraft FPV

Photo: depositphotos OpenAI has launched a standalone translation service called ChatGPT Translate , positioning it as a competitor to Google Translate. The tool supports over 50 languages, though

Photo: ua.news On January 15, Russian forces were recorded for the first time controlling a BM-35 strike drone via the Starlink satellite system. This was reported by electronic warfare and communi

Photo: BBC NASA and the U.S. Department of Energy have announced plans to build a nuclear reactor on the Moon by 2030. The system is expected to become a key element of the Artemis program and futur

Photo: getty images U.S. billionaire Elon Musk’s company SpaceX is providing Starlink satellite internet services free of charge in Iran due to the deadly nationwide protests, Bloomberg reports. Ah

Photo: EPA Indonesia and Malaysia have restricted access to Grok, the artificial intelligence system developed by Elon Musk’s xAI, after cases in which the tool was used to generate sexual content f

Photo: sabrangindia Russia ranked first worldwide in 2025 by total duration of internet shutdowns, according to Ukraine’s Center for Countering Disinformation (CCD). Monitoring services show that c

Photo: OpenAI/YouTube OpenAI has unveiled ChatGPT Health , a dedicated and secure environment for health-related conversations that allows users to connect their medical records and health apps whi