Less than 13 hours after becoming public, a high-gravity vulnerability in LMDeploy - the open source toolkit to compress, deploy and serve language and vision models - was already being exploited in real environments. The failure, recorded as CVE-2026-33626 with CVSS 7.5 score, is a variant of Server-Side Request Forgery (SSRF) in the vision-language module: a function that downloads images from URLs not valid if these addresses belong to internal networks or cloud metadata services, opening the door to unauthorized access to sensitive resources.
The vector is simple and dangerous: the image loader accepts arbitrary URLs. An input-control attacker may force the server to request internal resources such as the AWS instance metadata service (IMDS), local databases, administrative interfaces in loopback or internal endpoints that are normally not accessible from the Internet. In the reported case, the attackers combined internal port scanning and exfiltration by DNS OOB to confirm and exploit the SSRF capacity as "HTTP primitive."

The fact that the holding took place in a matter of hours is no coincidence. In practice, technical divulgations that include the vulnerable code fragment, the parameter name and functional examples act as precise instructions for malicious actors - and today also as prompts for commercial models - accelerating the translation of a detected problem to an operational explosion. This shows a worrying dynamic in the IA infrastructure ecosystem: extremely short response time between publication and attack.
The potential consequences are material and multifaceted. Cloud credentials theft by IMDS, recognition and lateral movement to internal services, commitment of databases or queues, and the possibility of persistence if the server concerned has extended permissions are plausible scenarios. For industrial environments or that run exposed controllers and PLCs, the combination of massive tests and selective scans is a recipe for interruptions and manipulation.
From the operational point of view, the first immediate priority should be to contain the exposure. If your organization uses LMDeploy with vision-language support, apply the official correction if it is already available or temporarily disable the remote image loading features. As an additional containment measure, block or restrict outgoing traffic from inference servers, implement egress rules that allow only explicitly necessary domains and ranges and ensure that there is no direct access to 169.254.169.254 (IMDS) or to loopback interfaces from processes that accept customer URLs.
There are specific mitigation in the cloud that reduce the impact of this type of SSRF. For AWS instances, enable IMDSv2 and set the hop limit or disable access to metadata when it is not necessary to limit an attacker's ability to steal credentials. It is also recommended to apply strict network segmentation, apply white lists of domains for outgoing fets and use inverse or validator proxies that check all / blacklist of IPs and RFC1918 ranges before allowing an external application. In general terms, validating and normalizing URLs entries is mandatory; avoiding this validation was the root of the incident.
In detection and response, look for clear indicators: server requests about 169.254.169.254, loopback connections 127.0.0.1 from processes that do not normally perform them, consultation chains containing URLs provided by third parties, and atypical DNS consultations to OOB domains (out-of-band). Check the models service for request patterns that change between different models or show attempts to scan internal ports. If you suspect commitment, isolate the instance, collect evidence and rotate exposed credentials.

This incident should serve as a reminder that security in model infrastructure is not just traditional hardening: it involves considering how functions designed for flexibility (such as loading a remote image) can become vectors of access to the internal network. Product managers and safety equipment should incorporate network exposure-oriented design reviews, SSRF tests and default output controls in any component that processes external URLs.
To deepen SSRF prevention and mitigation patterns, please refer to the OWASP guide on the subject and AWS documentation for the metadata service, which describe specific production controls: OWASP SSRF Prevention Cheat Sheet and AWS Instance Metadata Service (IMDS) documentation. In addition, the very publication of the finding and the correction can be found in the public advice of the project: GitHub Advisory GHSA-6w67-hwm5-92mq.
In short, organizations that deploy inference models and gateways must assume that the exploits will arrive very quickly after disclosure and prepare both patches and compensatory measures and detection capabilities. Update, segment, monitor and validate inputs are the pillars for reducing immediate risk; sustained improvement requires integrating these practices into the IA infrastructure development and deployment cycle.
Related
More news on the same subject.

18-year-old Ukrainian youth leads a network of infostealers that violated 28,000 accounts and left $250,000 in losses
The Ukrainian authorities, in coordination with US agents. They have focused on an operation of infostealer which, according to the Ukrainian Cyber Police, was allegedly adminis...

The digital signature is in check: Microsoft dismands a service that turned malware into apparently legitimate software
Microsoft announced the disarticulation of a "malware-signing-as-a-service" operation that exploited its device signature system to convert malicious code into seemingly legitim...

A single GitHub workflow token opened the door to the software supply chain
A single GitHub workflow token failed in the rotation and opened the door. This is the central conclusion of the incident in Grafana Labs following the recent wave of malicious ...

WebWorm 2025: the malware that is hidden in Discord and Microsoft Graphh to evade detection
The latest observations by cyber security researchers point to a change in worrying tactics of an actor linked to China known as WebWorm: in 2025 it has incorporated back doors ...

Identity is no longer enough: continuous verification of the device for real-time security
Identity remains the backbone of many security architectures, but today that column is cracking under new pressures: advanced phishing, real-time proxyan authentication kits and...

The dark matter of identity is changing the rules of corporate security
The Identity Gap: Snapshot 2026 report published by Orchid Security puts numbers to a dangerous trend: the "dark matter" of identity - accounts and credentials that are neither ...

PinTheft the public explosion that could give you root on Arch Linux
A new public explosion has brought to the surface again the fragility of the Linux privilege model: the V12 Security team named the failure as PinTheft and published a concept t...