Critical Vulnerability in SGLang Enables Remote Code Execution via Malicious Model Files
AI-generated from multiple sources. Verify before acting on this reporting.
A critical security vulnerability in the open-source SGLang library allows attackers to execute arbitrary code on systems processing malicious large language model files, researchers disclosed Monday.
The flaw, designated CVE-2026-5760, stems from the library's handling of GGUF model files, a binary format that packages model weights together with metadata such as chat templates. The vulnerability was identified by security researcher Stuart Beck and reported to the CERT Coordination Center (CERT/CC), which confirmed the issue on April 20, 2026, noting that the defect enables remote code execution (RCE) when an application loads a compromised model file.
SGLang is a widely used library for serving large language models, particularly in high-throughput environments. The vulnerability arises because the library renders templates embedded in model files with the jinja2.Environment class rather than the sandboxed jinja2.sandbox.ImmutableSandboxedEnvironment. Without sandboxing, template expressions have unrestricted access to Python functions and system resources during rendering, so an attacker who can supply a crafted GGUF file to a vulnerable system can inject code that executes with the privileges of the application process.
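The difference between the two Jinja2 environments can be sketched with a classic server-side template injection (SSTI) payload. The snippet below is illustrative only: it uses the well-known escape path through Jinja2's default `cycler` global (assuming Jinja2 3.x), not necessarily the exact technique used against SGLang.

```python
from jinja2 import Environment
from jinja2.exceptions import SecurityError
from jinja2.sandbox import ImmutableSandboxedEnvironment

# Hypothetical attacker-controlled template, e.g. embedded in a GGUF
# file's metadata. It walks from a default Jinja2 global (`cycler`)
# to the interpreter's `os` module -- a classic SSTI escape chain.
payload = "{{ cycler.__init__.__globals__.os.popen('echo pwned').read() }}"

# Unsandboxed rendering, as in the vulnerable pattern: the payload
# runs a shell command with the privileges of the serving process.
unsafe_out = Environment().from_string(payload).render()

# Sandboxed rendering: access to underscore-prefixed internals is
# refused, and Jinja2 raises SecurityError before any command runs.
try:
    ImmutableSandboxedEnvironment().from_string(payload).render()
    blocked = False
except SecurityError:
    blocked = True
```

Here `echo pwned` stands in for an arbitrary command; in a real attack the payload would run with whatever privileges the inference server holds.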
The CERT Coordination Center classified the vulnerability as critical due to the potential for complete system compromise. Successful exploitation could allow attackers to steal sensitive data, deploy ransomware, or pivot to other systems within a network. The flaw affects any deployment of SGLang that processes untrusted GGUF model files, a common practice in AI development and testing workflows.
Security experts recommend immediate mitigation steps for organizations using the affected library. Developers should upgrade to the latest patched version of SGLang as soon as it becomes available. Until a patch is deployed, administrators are advised to restrict the library to processing only trusted model files from verified sources. Implementing network segmentation and monitoring for unusual outbound connections from AI inference servers may also help detect exploitation attempts.
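Until a fixed release ships, the "trusted files only" advice can be enforced mechanically. The sketch below is one illustrative approach, not SGLang functionality: `TRUSTED_SHA256` is a hypothetical allowlist of digests for vetted GGUF files, checked before a file is ever handed to the inference stack.

```python
import hashlib

# Hypothetical allowlist of SHA-256 digests for vetted GGUF files,
# e.g. published by the model provider alongside the download.
TRUSTED_SHA256: set[str] = set()


def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Hash the file in chunks so multi-gigabyte models never sit in RAM."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def check_model_trusted(path: str, trusted: set[str] = TRUSTED_SHA256) -> str:
    """Return the digest if the file is on the allowlist, else refuse to load."""
    digest = sha256_of(path)
    if digest not in trusted:
        raise PermissionError(f"untrusted GGUF file {path!r} (sha256={digest})")
    return digest
```

A gate like this narrows which files can reach the template renderer; it does not fix the underlying unsandboxed rendering, so it complements rather than replaces the patch.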
The discovery highlights ongoing risks in the rapidly expanding ecosystem of artificial intelligence tools. As organizations increasingly integrate large language models into production environments, the supply chain for model files becomes a potential attack vector. The use of template engines like Jinja2 within AI frameworks introduces complexity that can lead to oversights in security configurations.
Industry observers note that this vulnerability underscores the need for rigorous security auditing of open-source AI components. The incident follows a trend of supply chain attacks targeting software dependencies, where attackers compromise legitimate tools to gain access to downstream users. The specific mechanics of how the malicious GGUF files are constructed and distributed remain under investigation.
Questions remain about the scope of existing compromises and whether the vulnerability was exploited in the wild before disclosure. CERT/CC has not reported any known active exploitation campaigns, but the severity of the flaw suggests attackers may already have been probing for the weakness. Widespread patch deployment across the global developer community is expected to take several days, leaving a window of exposure for systems that have not yet updated.