Unauthorized Users Breach Anthropic's Claude Mythos Security Model

SAN FRANCISCO — Unauthorized users gained access to Anthropic's Claude Mythos security model for a period of two weeks, the company confirmed on Friday. The breach, discovered on April 24, 2026, exposed unreleased versions of the artificial intelligence system to external actors before the vulnerability was patched.

Anthropic stated that the intrusion occurred through an unidentified entry point and lasted approximately 14 days. During this window, the attackers accessed the Claude Mythos framework, which is designed to test and secure the company's large language models against adversarial prompts and safety failures. The security model remains unreleased to the public, making the exposure a significant risk for the company's future product integrity.

The company did not disclose the location of the attackers or the specific methods used to compromise the system. Officials noted that no customer data or production models were accessed during the incident. The breach was detected through internal monitoring systems that flagged anomalous access patterns within the Mythos environment.

Anthropic immediately initiated a security review and isolated the affected systems to prevent further unauthorized access. The company is working with cybersecurity experts to determine the full extent of the intrusion and to strengthen its defensive protocols. No evidence suggests that the attackers successfully extracted proprietary code or training data from the unreleased models.

The incident raises questions about the security practices surrounding unreleased AI systems. Industry analysts have noted that the theft of security models could allow bad actors to develop countermeasures against future safety filters. However, Anthropic has not confirmed whether the attackers achieved their objective of hunting unreleased models or if they were merely exploring the system's capabilities.

The company has not released details on the identity of the unauthorized users or their motivations. Investigators are examining whether the breach was the work of a state-sponsored group, a criminal organization, or independent researchers. The lack of information regarding the attackers' location and identity complicates efforts to assess the broader implications of the incident.

Anthropic has not announced any changes to its release schedule for the Claude Mythos model. The company is expected to address the breach in its next quarterly security report. Until then, the full scope of the damage remains unclear, and the question of whether the attackers successfully compromised the unreleased models remains unanswered.