Claude Opus 4.6 breaks schemes by detecting more than 500 high-risk vulnerabilities in open source libraries

Anthropic has taken a striking step in the conversation on artificial intelligence and security: his new language model, Claude Opus 4.6, has helped to identify more than 500 high-gravity vulnerabilities in open source libraries, including projects such as Ghostscript, OpenSC and CGIF. The company explains that the model did not need specific instructions for these tasks and that the findings were validated by humans before notifying the maintainers.

Claude Opus 4.6 was presented as a version of the model able to improve in programming capabilities: code review, cleansing and reasoning on complex routines. Anthropic says he tested it in a virtualized environment where the model had access to traditional fault research tools, such as depurators and input generators (fuzzers), but without receiving guidance on how to use them. The idea was to measure their "standard" performance, without custom scaffolding that would guide the search.

Claude Opus 4.6 breaks schemes by detecting more than 500 high-risk vulnerabilities in open source libraries — Image generated with IA.

What makes the news interesting is not only the number of findings, but the type of errors discovered. Some of the examples cited include the identification, by analysis of the history of commitments, of a lack of verification of limits in Ghostscript that could cause a failure; the detection of calls to unsafe functions such as strrchr () and strcat () in OpenSC that led to buffer overflows; and an overflow in the heap in CGIF that required understanding how the LZW algorithm is related to the GIF format in order to trigger vulnerability. In the case of CGIF, the maintainers published a correction in version 0.5.1.

These stories explain why an IA can be useful where traditional methods stumble. Coverage fuzzers and other automatic techniques generate random inputs or guided by running metrics, and work very well for many types of failures. However, there are errors that only appear when a specific sequence of algorithmic decisions is produced or when a conceptual relationship between a format and a process has to be understood - scenarios where an approach of "reasoning" can make a difference. For those who want to deepen on fuzzing techniques and their limits, resources like OSS-Fuzz or projects such as American Fuzzy Lop (AFL) The state of art is well documented.

It is important to stress that Anthropic did not present these results as the last autonomous resource: the company claims to have validated each failure to avoid false positives or model hallucinations. In particular, they used Claude to prioritize memory corruption vulnerabilities and then verified that these problems were reproducible and real before informing the maintainers. This human verification stage is key to making language model-based detection useful and responsible.

At the same time, this announcement revives an ethical and security debate. The same ability to find faults can be exploited for offensive purposes. Anthropic recognizes it and, in previous communications, has pointed out that its models could also facilitate complex attacks if used maliciously. The company says it is adjusting safeguards and adding controls to avoid abuse; a reading of its reflections on risks and autonomous tools can be found on its technical blog ( report on the zero- days and other related entries).

For the security community and for those responsible for open source projects, the implications are double. On the one hand, having a tool that accelerates the location of serious vulnerabilities can relieve the burden on often overloaded maintainers and help to park before a problem is exploited freely. On the other hand, it opens the need to establish sound processes for responsible disclosure, prioritization and maintenance funding to ensure that corrections are quickly integrated. Agencies such as CISA They insist on the importance of rapidly patching the vulnerabilities known as the basic defence measure.

The current capacity of these models should also be relativized: identifying patterns, suggesting assumptions and prioritizing failures is not yet a substitute for security equipment. Human work remains essential to reproduce, exploit in a controlled manner and correct the failures, as well as to judge the real impact in each context. What these tools do change is the temporal dynamics: they can discover attack vectors that would pass unnoticed and therefore reduce the window in which an attacker could take advantage.

The collaboration between IA and security opens an opportunity to balance forces on the digital battlefield. If IA techniques are applied with controls, transparency and ethics, they can become "multipliers" for those who defend infrastructure. For that potential to materialize, it will be necessary to improve open ecosystem maintenance practices, finance audits and educate both developers and managers on how to interpret and validate automated reports.

In short, the case of Claude Opus 4.6 is a sample of what is already possible: a language model that supports the search for complex errors and helps prioritize corrections. But it's not a magic wand or a substitute for human work. The challenge now is to manage the adoption of these tools in a responsible way, combining the speed and scope of the IA with the verification, outreach and patching processes that protect users and systems. For those who want to follow Claude's development and Anthropic's analysis, their announcement of the version and subsequent reflections are available on their website ( Opus 4.6 launch) and on his technical blog.

If you are interested in knowing more about why some bugs escape to traditional fuzzers or how algorithms like LZW work in formats like GIF, an accessible explanation is on the page of LZW on Wikipedia which helps to understand why certain errors require a conceptual understanding to be activated.

The arrival of IA with code audit capabilities poses a powerful promise and a call to collective responsibility: using it to improve security without opening new attack windows.

Coverage

More news on the same subject.

Safety alert Drug critical vulnerability of SQL injection in PostgreSQL requires immediate update

May 21, 2026 4 min de lectura 12

Explore RadarBytes

Claude Opus 4.6 breaks schemes by detecting more than 500 high-risk vulnerabilities in open source libraries

Disable your ad blocker

Claude Opus 4.6 breaks schemes by detecting more than 500 high-risk vulnerabilities in open source libraries

Related

Safety alert Drug critical vulnerability of SQL injection in PostgreSQL requires immediate update

18-year-old Ukrainian youth leads a network of infostealers that violated 28,000 accounts and left $250,000 in losses

RAMPART and Clarity redefine the safety of IA agents with reproducible testing and governance from the start

The digital signature is in check: Microsoft dismands a service that turned malware into apparently legitimate software

A single GitHub workflow token opened the door to the software supply chain

WebWorm 2025: the malware that is hidden in Discord and Microsoft Graphh to evade detection

Identity is no longer enough: continuous verification of the device for real-time security

Manage your cookies