Categories: Cyber Security News

Critical RCE Vulnerabilities in AI Inference Engines Exposes Meta, Nvidia and Microsoft Frameworks

As artificial intelligence infrastructure rapidly expands, critical security flaws threaten the backbone of enterprise AI deployments.

Security researchers at Oligo Security have uncovered a series of dangerous Remote Code Execution (RCE) vulnerabilities affecting major AI frameworks from Meta, NVIDIA, Microsoft, and PyTorch projects, including vLLM and SGLang.

The vulnerabilities, collectively termed “ShadowMQ,” stem from the unsafe implementation of ZeroMQ (ZMQ) communications combined with Python’s pickle deserialization.

What makes this threat particularly alarming is how it spread across the AI ecosystem through code reuse and copy-paste development practices.

How the Vulnerability Spread Across Frameworks

The investigation began in 2024 when researchers analyzed Meta’s Llama Stack and discovered the dangerous use of ZMQ’s recv_pyobj() method, which deserializes data using Python’s pickle module.

ShadowMQ Vulnerability CVE Data Table

CVE ID Product Severity CVSS Score Vulnerability Type
CVE-2024-50050 Meta Llama Stack Critical 9.8 Remote Code Execution
CVE-2025-30165 vLLM Critical 9.8 Remote Code Execution
CVE-2025-23254 NVIDIA TensorRT-LLM Critical 9.3 Remote Code Execution
CVE-2025-60455 Modular Max Server Critical 9.8 Remote Code Execution
N/A (Unpatched) Microsoft Sarathi-Serve Critical 9.8 Remote Code Execution
N/A (Incomplete Fix) SGLang Critical 9.8 Remote Code Execution

This configuration created unauthenticated network sockets that could execute arbitrary code during deserialization, enabling remote attackers to compromise systems.

After Meta patched the vulnerability (CVE-2024-50050), Oligo researchers found identical security flaws across multiple frameworks.

NVIDIA’s TensorRT-LLM, PyTorch projects vLLM and SGLang, and Modular’s Max Server all contained nearly identical vulnerable patterns.

Oligo Code analysis revealed that entire files were copied between projects, spreading the security flaw like a virus. These AI inference servers power critical enterprise infrastructure, processing sensitive data across GPU clusters.

Organizations trusting SGLang include xAI, AMD, NVIDIA, Intel, LinkedIn, Oracle Cloud, Google Cloud, Microsoft Azure, AWS, MIT, Stanford, UC Berkeley, and numerous other major technology companies.

Successful exploitation could allow attackers to execute arbitrary code, escalate privileges, exfiltrate model data, or install cryptocurrency miners.

Oligo researchers identified thousands of exposed ZMQ sockets communicating unencrypted over the public internet. However, Microsoft’s Sarathi-Serve and SGLang remain vulnerable with incomplete fixes.

Organizations should immediately update to patched versions, avoid using pickle with untrusted data, implement authentication for ZMQ communications, and restrict network access to ZMQ endpoints.

Follow us on Google News, LinkedIn, and X for daily cybersecurity updates. Contact us to feature your stories.

The post Critical RCE Vulnerabilities in AI Inference Engines Exposes Meta, Nvidia and Microsoft Frameworks appeared first on Cyber Security News.

rssfeeds-admin

Recent Posts

The Best Deals Today: Apple AirTag 4-Pack, 3 for $33 Movie Deal, Sony WH-1000XM5 Headphones, and More

A new weekend has arrived, and today, you can save big on Apple AirTags, 4K…

1 hour ago

Avengers: Endgame Re-Release Footage is ‘Critical’ to the Plot of Doomsday

Director Joe Russo has confirmed the upcoming Avengers: Endgame re-release will include new footage that…

2 hours ago

Windrose Celebrates 1 Million Copies Sold as Player Counts Climb

Cooperative pirate survival game Windrose has reached 1 million copies sold less than a week…

3 hours ago

Franklin is looking to grow its downtown. Liberty Utilities’s gas capacity is posing a problem.

The space in the heart of Franklin’s downtown, a former department store, excited Patrick McDevitt…

5 hours ago

‘Not cosmetic’: NH lawmaker wants state to cover GLP-1 drugs for weight loss

Two years ago, Sue Prentiss got a sobering reality check at her doctor’s office. The…

5 hours ago

Franklin is looking to grow its downtown. Liberty Utilities’s gas capacity is posing a problem.

The space in the heart of Franklin’s downtown, a former department store, excited Patrick McDevitt…

5 hours ago

This website uses cookies.