
Tech • IA • Crypto
Anthropic’s Mythos 1 AI model is demonstrating unprecedented cyber offense and defense capabilities, raising urgency around safety, patching capacity, and transparency as signs point to an accelerated rollout.
Mythos 1 has identified over 10,000 high or critical vulnerabilities across roughly 50 major technology organizations, including Cloudflare, Mozilla, and OpenBSD. In one case, Cloudflare reported 2,000 vulnerabilities in core systems, with 400 classified as severe, while maintaining a lower false positive rate than top human testers.
The model demonstrated capabilities described as nation-state level cyber offense, including autonomously constructing full exploit chains. It also became the first AI to defeat the UK AI Safety Institute’s dual network challenge end-to-end, signaling a leap in real-world exploit capability.
In a banking deployment, Mythos prevented a $1.5 million wire fraud attempt by detecting anomalies across compromised email accounts and AI-generated voice calls. The system intervened moments before funds were transferred, showcasing defensive applications alongside offensive potential.
Across 1,000+ core open-source projects, Mythos identified 2,319 vulnerabilities, with 1,094 confirmed as high severity after review. A notable case involved WolfSSL, where the model generated attack code enabling certificate forgery, potentially affecting billions of devices if exploited.
The speed of AI discovery has created a bottleneck: developers cannot fix issues fast enough. Of 1,129 reported vulnerabilities, only 75 critical flaws have been patched so far, with human teams averaging two weeks per fix, prompting maintainers to request slower disclosure.
Anthropic launched Claude Security, which not only detects vulnerabilities but generates fixes. Enterprise users have reportedly resolved 2,100+ issues in three weeks, signaling a shift toward AI-driven remediation and competition with platforms like Snyk and Veracode.
Although Anthropic publicly stated Mythos would remain restricted, references to “Mythos 1 Preview” briefly appeared in developer tools, suggesting a faster rollout. This raises questions about whether safety safeguards have been sufficiently addressed.
Reports of a projected $559 million operating profit and revenue growth from $4.8 billion to $10.9 billion in one quarter have been challenged. Critics point to discounted compute deals and possible front-loaded revenue accounting, casting doubt on sustainability.
While developer events emphasized productivity and “magic,” leadership warnings painted a starkly different picture. Anthropic executives cautioned that AI could reach recursive self-improvement by 2028 or sooner, with potentially catastrophic consequences if safeguards lag.
The recruitment of Andrej Karpathy, alongside former Tesla and xAI engineers, underscores intensified competition in frontier AI development and preparation for large-scale deployment.
Mythos 1 represents a major inflection point in AI-driven cybersecurity, combining unprecedented capability with systemic risks that existing infrastructure and governance may not be ready to handle.
All right, so Anthropic is getting ready to unleash something absolutely wild on the world. We're talking about Mythos 1, which is basically their most powerful AI model yet, and it looks like it's about to become way more accessible than anyone expected. Let me walk you through what's been happening because there's a lot to unpack here. So the story starts with project glasswing which is this initiative Anthropic launched where they've been using claude mythos to find vulnerabilities in software. And when I say they've been finding vulnerabilities I mean they've been absolutely destroying the entire cyber security industry's understanding of what's possible. In just 30 days mythos discovered over 10,000 high severity or critical software vulnerabilities across roughly 50 major tech companies and infrastructure developers. We're talking about companies like Cloudflare, Mosilla, OpenBSD, and a bunch of others that basically run the internet. Here's where it gets crazy, though. Cloudflare reported that Mythos found 2,000 vulnerabilities in their core system pathways, and 400 of those were classified as high or critical severity. But get this, the false positive rate from the AI was actually lower than what you'd get from top human security testers. Mosilla's Firefox 150 browser got patched for 271 critical vulnerabilities in one go, which is more than 10 times what they found in Firefox 148 using the older Opus 4.6 model. And OpenBSD Mythos uncovered a 27-year-old hidden bug in their codebase and then just casually constructed a complete exploit chain without any human help whatsoever. The UK AI Safety Institute even came out and officially confirmed that Mythos Preview is the first AI model in the world capable of fully defeating their dual network challenge end to end. This thing is legitimately operating at a level that security researchers are describing as nationstate level cyber offensive capabilities. One researcher who participated in the beta testing literally said on X that it felt like watching an F-22 fighter jet fly overhead while holding a spear. Now, Anthropic also used Mythos in a real business scenario at a partner bank, and it actually stopped a $ 1.5 million wire fraud attempt in real time. Hackers had compromised customer email accounts, used AI voice cloning to make fraudulent calls, and were literally moments away from completing the transfer when Mythos detected the scam by analyzing anomalous behavior patterns and blocked the transaction. And you'd think with something this powerful, Anthropic would keep it locked down tight, right? Well, that's where things get interesting. Just last Friday, Anthropic came out and said that Mythos would remain restricted and that they were unlikely to release it to the general public anytime soon. They specifically mentioned needing to develop far stronger safeguards before making Mythos class models available through a general release. But literally the next day, users started spotting something called Mythos 1 and Claude Mythos one preview showing up in Claude Code and Claude Security. It was only visible for a brief period, but people grabbed screenshots and the evidence is pretty clear. New strings appeared in the source code that explicitly referenced access to the claude mythos model in cloud code and cla security. So either anthropic is preparing a roll out way faster than they let on or something changed dramatically in their safety assessment basically overnight. What's also happening behind the scenes is that Anthropic is building out a whole new claude security dashboard for enterprise customers. This thing is designed to surface discovered vulnerabilities with 7-day and 30-day historical charts and deeper triage results. It's basically positioning Cloud Security as a direct competitor to dedicated vulnerability management platforms like SNICK and Veraricode, which is a pretty big deal for the enterprise security market. And just to make things even more complicated, there are rumors floating around that Claude Opus 4.8 is in the works and that select anthropic partners are already doing internal evaluations. If that launches in the coming weeks, it would fit the cadence they set with Opus 4.7 back in April, and it would line up perfectly with all these mythos and security product moves they're making. But let's talk about what this actually means for the broader ecosystem, because things are getting messy. Anthropics scanned over 1,000 core open-source projects that basically hold up the internet, and they identified 23,19 vulnerabilities total. Of those, 6,22 were assessed by Mythos as high or critical vulnerabilities. They partnered with six independent security research firms to manually verify everything, and the AI's true positive rate came out to 90.6%. After final verification, 1,094 of these were confirmed as high severity or critical vulnerabilities with conclusive evidence. One case that really drives home how dangerous this is involves Wolf SSL, which is this widely used open-source cryptography library that's running on billions of devices worldwide. We're talking IoT devices, routers, smart cars, all kinds of stuff. Mythos didn't just find a vulnerability in Wolf SSL. It wrote its own attack code that would allow hackers to forge digital certificates and create perfectly realistic fake bank websites or email login pages. If that vulnerability hadn't been discovered and fixed before malicious actors got to it, we'd be looking at a potential catastrophe affecting billions of devices. Now, here's where the situation gets really problematic. The bottleneck in cyber security used to be finding vulnerabilities, but Mythos has essentially reduced that cost and time to nearly zero. The new bottleneck is that humans can't patch vulnerabilities anywhere near as fast as the AI can discover them. Several open- source maintainers have literally sent pleading emails to Anthropic asking them to slow down because they're overwhelmed. On average, human programmers are taking about two weeks to fix a single high severity vulnerability, even with detailed reports. Out of 1,129 vulnerabilities that Anthropic submitted to open-source authors, only 75 critical vulnerabilities have actually been patched so far. To address this, Anthropic launched something called Claude Security, which is an automation tool for Claude enterprise customers that doesn't just identify vulnerabilities, but also generates the fix patches. In just 3 weeks since launch, enterprise clients have used it to rapidly fix over 2,100 vulnerabilities. They've also open sourced a bugf finding pipeline with customized instructions, an automation framework that lets Claude navigate large code bases and clone sub aents for parallel scanning, and a threat model builder that automatically identifies the most vulnerable points in your system. Cisco even jumped in and announced they're open- sourcing something called the Foundry Security Spec System to build a security evaluation framework similar to Mythos. The vision here is that AI will detect vulnerabilities and generate patches with humans only responsible for the final review. That's supposedly the ultimate form of future cyber security. But Anthropic stance on releasing Mythos publicly has been very cautious and for good reason. They've said they won't fully release it until they implement stronger, higher level security safeguards. The XBO test report showed that Mythos preview achieved a generational leap ahead of all existing models on the web exploit benchmark, demonstrating unprecedented precision, even at the level of individual token generation. If the Mythos API were made public today, global hacker groups and extremist organizations could effortlessly produce thousands of zeroday exploitation tools at minimal cost basically overnight. we'd be looking at computers, hospital systems, and power grid control centers facing a catastrophe. Meanwhile, there's this whole other story playing out about Anthropic's finances that's honestly pretty wild. The Wall Street Journal ran a piece saying Anthropic is about to have its first profitable quarter with an operating profit of $559 million. They're projecting revenue to more than double from 4.8 billion in Q1 to 10.9 billion in Q2. That's explosive growth that would help them turn an operating profit for the first time. But Ed Zitron, who's been covering Anthropic's finances pretty closely, absolutely tore this narrative apart. He pointed out that the journal added a note at the bottom saying it's unclear what accounting methods Anthropic used since they're not required to follow public company financial reporting requirements yet. So, we're talking about non-GAAP Ibida profitability for potentially just a single quarter. The real issue is how Anthropic achieved this. Remember that deal they signed with SpaceX to take over Colossus 1 and some or all of Colossus 2? Well, according to SpaceX's own filing, Anthropic is paying them $1.25 billion a month starting in May and June, but with a reduced fee as it ramps up. That's $15 billion a year in compute costs normally, but discounted for the exact months that Anthropic is using to tell investors they have an operating profit. So basically, they're suppressing costs during Q2 specifically. And then the journal conveniently mentions that the company might not remain profitable for the full year as spending increases. The revenue numbers also don't really add up when you look at previous reporting. Back in February, Anthropic claimed they hit 14 billion in annual recurring revenue, which implies monthly revenue of about 1.17 billion. By March 3rd, they claimed 19 billion in ARR or 1.58 billion per month. But then on March 9th, their CFO Krishna Ralph declared under oath that Anthropic had brought in revenues exceeding $5 billion to date. That's a huge discrepancy that's tough to reconcile, especially when the information had reported 4.5 billion in revenue for all of 2025. If we believe the leaked charts showing 4.8 billion in Q1 2026, that would mean Anthropic made over 90% of its lifetime revenues in just the first quarter of this year and virtually no revenue in previous years. That level of growth is possible, but definitely stretches credibility. The only real defense is that their CFO lowballed the government and a judge to such a dramatic extent that he hid over 4 billion in revenue, which seems unlikely. What's probably happening is that Anthropic is taking prepayment of tokens from large enterprises like $50 million intended to be spread over 12 months that they're booking as revenue immediately. They're also offering discounted tokens with discounts ranging from 10 to 30%. And they may be front-loading annual commitments of subscriptions and enterprise agreements. All of this would inflate revenue numbers and depress costs because they wouldn't have actually provided the compute necessary to earn that revenue yet. Adding to all this drama, there was this really interesting contrast between two different anthropic events this week. On Wednesday, they held their first developer focused event in Europe called Code with Claude. The whole vibe was about productivity and magic and this renaissance in computer programming. Boris Churnney, who created Claude Code, talked about reconnecting with the feeling of magic that got him into programming. Developers were eating free lunch, getting complimentary mini computers, and the mood was basically unbridled enthusiasm. When someone asked the crowd how many had shipped code written by Claude without even reading it, a startling number of people raised their hands. But then on Thursday, anthropic co-founder Jack Clark gave a lecture at Oxford University, and it was a completely different tone. He said AI posed a nonzero chance of killing everybody on the planet and warned that the next few years would contain more disruption than any in living memory. He predicted that by 2028, or maybe sooner, AI would reach recursive self-improvement and achieve the capability to improve itself without human intervention. He said, "Most of the world is in denial about current AI capabilities, let alone what's coming in 6 months." Clark even admitted that Anthropic itself underestimated the scale and speed of AI advancement, saying, "When Mythos finished training, they were like, "Oh, it's here faster than we thought, and we've done insufficient preparation." So, you've got this situation where Anthropic is telling developers one story about productivity and magic while telling policymakers and academics that we might all be in serious trouble very soon. It's not necessarily nefarious. Companies tailor messages to different audiences all the time, but experiencing those two narratives so close together creates serious whiplash. And just to round out all the news, Anthropic hired Andre Carpathy this week, which is a pretty big deal. Cararpathy co-founded OpenAI, then got recruited to Tesla by Elon Musk to lead their computer vision team for autopilot, and now he's joining Anthropic's pre-training team. His work at OpenAI and Tesla came up repeatedly during the Musk versus Altman trial that just concluded where the jury ruled in Sam Alman's favor. Carpathy's joining follows Ross Nordine, a founding member of XAI and Ex-Tesla employee who announced earlier this month he was also joining Anthropic. So yeah, Anthropic is clearly gearing up for something major with Mythos 1, whether they're ready to admit it publicly or not. The production infrastructure is already in place. The enterprise security tooling is being built out, and they're hiring top tier AI talent left and right. The big question is whether they've actually met the safety conditions they said were necessary before releasing a Mythos class model, or if they're quietly abandoning those standards under competitive pressure. Either way, things are about to get very interesting in the AI world, and Mythos 1 is going to be at the center of it all. All right, let me know your thoughts in the comments. Subscribe for more AI updates. Hit the like button if you enjoyed the video. Thanks for watching, and I'll catch you in the next one.