Developers of the Claude chatbot report the world’s first AI-powered cyberattack


Photo: enovosty

The AI company Anthropic has revealed that a cyberattack was carried out using a compromised version of its chatbot Claude. According to a company blog post, the operation was conducted by a China-linked, state-sponsored group that targeted around 30 organizations, including tech firms, financial institutions, chemical companies, and several government agencies. Anthropic describes it as the first known case of an attack in which AI performed the majority of the work.

The agentic capabilities of AI models have made them useful not only for legitimate tasks but also for malicious purposes. Claude was able to follow long chains of instructions, make autonomous decisions, and use multiple tools, including network scanners and password-cracking software, without continuous human supervision. First, a human operator set the objectives, after which Claude scanned networks, searched for data, analyzed code, and produced summaries. Next, it ran targeted vulnerability checks and suggested ways to break in, while the operator could adjust tasks or approve the next steps. In the final phase, Claude obtained credentials and searched for data worth exfiltrating. Humans intervened only for oversight or clarification, while Claude handled roughly 80–90% of the operation autonomously.

To bypass the model’s safeguards, the attackers posed as cybersecurity staff and told Claude it was assisting in a security test. They also split the operation into smaller tasks so that the AI could not see the full picture, which would otherwise have triggered its safety restrictions.

Anthropic said it quickly detected the incident, blocked the related accounts, informed targets and authorities, and published a detailed report to help the industry identify similar attacks and develop protections.

