The Claude Apocalypse has been averted—Dario Amodei is a nice guy, just like Donald Trump

Oh my, that was a close call! Thankfully, Anthropic launched Project Glasswing:

Today we’re announcing Project Glasswing, a new initiative that brings together Amazon Web Services, Anthropic, Apple, Broadcom, Cisco, CrowdStrike, Google, JPMorganChase, the Linux Foundation, Microsoft, NVIDIA, and Palo Alto Networks in an effort to secure the world’s most critical software.

We formed Project Glasswing because of capabilities we’ve observed in a new frontier model trained by Anthropic that we believe could reshape cybersecurity. Claude Mythos Preview is a general-purpose, unreleased frontier model that reveals a stark fact: AI models have reached a level of coding capability where they can surpass all but the most skilled humans at finding and exploiting software vulnerabilities.

Mythos Preview has already found thousands of high-severity vulnerabilities, including some in every major operating system and web browser. Given the rate of AI progress, it will not be long before such capabilities proliferate, potentially beyond actors who are committed to deploying them safely. The fallout—for economies, public safety, and national security—could be severe. Project Glasswing is an urgent attempt to put these capabilities to work for defensive purposes.

As part of Project Glasswing, the launch partners listed above will use Mythos Preview as part of their defensive security work; Anthropic will share what we learn, so the whole industry can benefit. We have also extended access to a group of over 40 additional organizations that build or maintain critical software infrastructure so they can use the model to scan and secure both first-party and open-source systems. Anthropic is committing up to $100M in usage credits for Mythos Preview across these efforts, as well as $4M in direct donations to open-source security organizations.

Project Glasswing is a starting point. No one organization can solve these cybersecurity problems alone: frontier AI developers, other software companies, security researchers, open-source maintainers, and governments across the world all have essential roles to play. The work of defending the world’s cyber infrastructure might take years; frontier AI capabilities are likely to advance substantially over just the next few months. For cyber defenders to come out ahead, we need to act now.

…

We have already seen the serious consequences of cyberattacks for important corporate networks, healthcare systems, energy infrastructure, transport hubs, and the information security of government agencies across the world. On the global stage, state-sponsored attacks from actors like China, Iran, North Korea, and Russia have threatened to compromise the infrastructure that underpins both civilian life and military readiness. Even smaller-scale attacks, such as those where individual hospitals or schools are targeted, can still inflict substantial economic damage, expose sensitive data, and even put lives at risk. The current global financial costs of cybercrime are challenging to estimate, but might be around $500B every year.

Many flaws in software go unnoticed for years because finding and exploiting them has required expertise held by only a few skilled security experts. With the latest frontier AI models, the cost, effort, and level of expertise required to find and exploit software vulnerabilities have all dropped dramatically. Over the past year, AI models have become increasingly effective at reading and reasoning about code—in particular, they show a striking ability to spot vulnerabilities and work out ways to exploit them. Claude Mythos Preview demonstrates a leap in these cyber skills—the vulnerabilities it has spotted have in some cases survived decades of human review and millions of automated security tests, and the exploits it develops are increasingly sophisticated.

…

Although the risks from AI-augmented cyberattacks are serious, there is reason for optimism: the same capabilities that make AI models dangerous in the wrong hands make them invaluable for finding and fixing flaws in important software—and for producing new software with far fewer security bugs. Project Glasswing is an important step toward giving defenders a durable advantage in the coming AI-driven era of cybersecurity.

…

Over the past few weeks, we have used Claude Mythos Preview to identify thousands of zero-day vulnerabilities (that is, flaws that were previously unknown to the software’s developers), many of them critical, in every major operating system and every major web browser, along with a range of other important pieces of software.

In a post on our Frontier Red Team blog, we provide technical details for a subset of these vulnerabilities that have already been patched and, in some cases, the ways that Mythos Preview found to exploit them. It was able to identify nearly all of these vulnerabilities—and develop many related exploits—entirely autonomously, without any human steering. The following are three examples:

Mythos Preview found a 27-year-old vulnerability in OpenBSD—which has a reputation as one of the most security-hardened operating systems in the world and is used to run firewalls and other critical infrastructure. The vulnerability allowed an attacker to remotely crash any machine running the operating system just by connecting to it;

It also discovered a 16-year-old vulnerability in FFmpeg—which is used by innumerable pieces of software to encode and decode video—in a line of code that automated testing tools had hit five million times without ever catching the problem;

The model autonomously found and chained together several vulnerabilities in the Linux kernel—the software that runs most of the world’s servers—to allow an attacker to escalate from ordinary user access to complete control of the machine.

We have reported the above vulnerabilities to the maintainers of the relevant software, and they have all now been patched. For many other vulnerabilities, we are providing a cryptographic hash of the details today (see the Red Team blog), and we will reveal the specifics after a fix is in place.
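The "hash now, reveal later" step described in that last paragraph is an ordinary commitment scheme. Here is a minimal Python sketch, assuming SHA-256 and a random nonce. The announcement does not say which hash function or format is actually used, and `commit` and `verify` are hypothetical names chosen for illustration:

```python
import hashlib
import hmac
import secrets


def commit(details: bytes) -> tuple[str, bytes]:
    """Return (digest, nonce). Publish the digest now; keep details and nonce private."""
    # Assumption: a random nonce is mixed in so short or guessable
    # details cannot be recovered from the digest by brute force.
    nonce = secrets.token_bytes(32)
    digest = hashlib.sha256(nonce + details).hexdigest()
    return digest, nonce


def verify(details: bytes, nonce: bytes, digest: str) -> bool:
    """Check that later-revealed details and nonce match the published digest."""
    recomputed = hashlib.sha256(nonce + details).hexdigest()
    return hmac.compare_digest(recomputed, digest)
```

In this sketch the digest is published on day one, and the details and nonce are released once a fix ships, so anyone can run `verify` and confirm the write-up was not altered after the fact.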

…

In addition to our own work, many of our partners have already been using Claude Mythos Preview for several weeks.

The NYT is slightly hysterical: Anthropic Claims Its New A.I. Model, Mythos, Is a Cybersecurity ‘Reckoning’:

Anthropic, the artificial intelligence company that recently fought the Pentagon over the use of its technology, has built a new A.I. model that it claims is too powerful to be released to the public.

Instead, Anthropic said on Tuesday, it will make the new model — known as Claude Mythos Preview — available to a consortium of more than 40 technology companies, including Apple, Amazon and Microsoft, which will use the model to find and patch security vulnerabilities in critical software programs.

Anthropic said it had no plans to release its new technology more widely, but was announcing the new model’s capabilities in one area in particular — identifying security vulnerabilities in software — in an effort to sound the alarm over what the company believes will be a new, scarier era of A.I. threats.

…

The company’s decision to hold back Claude Mythos Preview, while giving access only to partners out of concern for how it might be misused, has some precedent. In 2019, OpenAI announced it had built a new model, GPT-2, but was not releasing the full version right away. The company claimed that its text-generation capabilities could be used to automate the mass-production of propaganda or misinformation. (It later released the model, after conducting additional safety testing on it.) Many of the leaders of the GPT-2 project later left OpenAI to start Anthropic.

This time, Anthropic is making a different, more urgent claim. The company’s executives say Claude Mythos Preview is already capable of carrying out autonomous security research, including scanning for and exploiting so-called zero-day vulnerabilities in critical software programs, flaws that are unknown even to the software’s developer. These efforts can often be triggered by amateurs with simple prompts. The company claims that the new model has already identified “thousands” of bugs and vulnerabilities in popular software programs, including every major operating system and browser.

One of the vulnerabilities Claude found, the company said, was a 27-year-old bug in OpenBSD, an open-source operating system that was designed to be difficult to hack. Many internet routers and secure firewalls incorporate OpenBSD’s technology. Another was a longstanding issue in a piece of popular video software that automated testing tools had scanned five million times, without finding any problems.

“This model is good at finding vulnerabilities that would be well understood and findable by security researchers,” Mr. Graham said. “At the same time, it has found vulnerabilities, and in some cases crafted exploits, sophisticated enough that they were both missed by literally decades of security researchers, as well as all the automated tools designed to find them.”

Anthropic announced on Monday that its projected annual revenue had more than tripled in 2026, to more than $30 billion from $9 billion. The growth has come largely because of the popularity of Anthropic’s Claude as a tool for programming.

Anthropic has focused on making Claude good at completing lengthy coding tasks, in hopes of making it more useful to professional programmers and amateur “vibecoders.” But an A.I. system designed to be good at coding is also good at spotting the flaws in code — running automated scans for bugs and vulnerabilities that can allow hackers to take control of users’ machines, expose sensitive user information or wreak other havoc.

The cybersecurity industry has been bracing for years for what more capable A.I. models could do to critical tech infrastructure. Until recently, only expert human researchers with access to specialized tools were capable of finding the most severe security vulnerabilities. Now, the fear is that a powerful A.I. model could discover them on its own.

“Imagine a horde of agents methodically cataloging every weakness in your technology infrastructure, constantly,” Nikesh Arora, the chief executive of Palo Alto Networks, wrote in a blog post last week.

Mr. Graham said one of the unanswered questions about Claude Mythos Preview, and other future models that will be capable of doing similar things, was whether most or all of the world’s critical software would need to be patched or rewritten as a result of these new models.

“There are a lot of really critical systems around the world, whether it’s physical infrastructure or things that protect your personal data, that are running on old versions of code,” Mr. Graham said. “If these previously were mostly secure because it took a lot of human effort to attack them, does that paradigm of security even work anymore?”

It is wise to take claims about unreleased model capabilities from A.I. companies with a grain of salt. In this case, though, cybersecurity researchers who have been given access to Claude Mythos Preview have characterized the model as a significant cybersecurity risk.

Indeed, it’s not difficult to make $30 billion once you literally steal subscribers’ money: your shitty Claude Code is so buggy that a user can say, “Hi, please continue!” and see almost all of their credits consumed!

So, in the end, the Apocalypse is going to be Made in the USA, not Made in China. We’re living on borrowed time.

