SUBSCRIBE
Tech Journal Now
  • Home
  • News
  • AI
  • Reviews
  • Guides
  • Best Buy
  • Software
  • Games
  • More Articles
Reading: Microsoft’s multi-agent AI system tops Anthropic’s Mythos on cybersecurity benchmark – GeekWire
Share
Tech Journal NowTech Journal Now
Font ResizerAa
  • News
  • Reviews
  • Guides
  • AI
  • Best Buy
  • Games
  • Software
Search
  • Home
  • News
  • AI
  • Reviews
  • Guides
  • Best Buy
  • Software
  • Games
  • More Articles
Have an existing account? Sign In
Follow US
© Foxiz News Network. Ruby Design Company. All Rights Reserved.
Tech Journal Now > News > Microsoft’s multi-agent AI system tops Anthropic’s Mythos on cybersecurity benchmark – GeekWire
News

Microsoft’s multi-agent AI system tops Anthropic’s Mythos on cybersecurity benchmark – GeekWire

News Room
Last updated: May 14, 2026 12:47 am
News Room
Share
3 Min Read
SHARE

by Todd Bishop on May 13, 2026 at 5:16 pmMay 13, 2026 at 5:23 pm

CyberGym benchmark scores over time, showing the rapid improvement in AI vulnerability discovery capabilities. Microsoft’s multi-model MDASH system (top right) tops the leaderboard at 88.4%. (CyberGym / UC Berkeley)

Mythos has been MDASH’d.

A new AI-powered system from Microsoft surpassed a headline-grabbing rival from Anthropic on a leading cybersecurity benchmark, using more than 100 specialized AI agents working together across multiple AI models to find real-world software vulnerabilities.

Microsoft’s system, codenamed MDASH, was introduced this week alongside the disclosure of 16 new vulnerabilities it found in different versions of Windows, including four “critical” remote code execution flaws fixed in this month’s Patch Tuesday release. 

The company, which has faced persistent criticism over security lapses, is betting that multiple models can discover vulnerabilities at a pace that individual models can’t match. 

MDASH, derived from the term “multi-model agentic scanning harness,” works by running specialized AI agents through a staged pipeline. Different agents scan code for potential vulnerabilities, then a separate set of agents debate whether each finding is real and exploitable, and a final stage constructs proof-of-concept attacks to confirm the bugs exist.

By comparison, Anthropic’s Mythos, which raised concerns over its ability to find and exploit software vulnerabilities when it was previewed earlier this year, is a single AI model running inside an agent framework. Anthropic restricted its release to a handful of companies through a consortium called Project Glasswing, which includes Microsoft.

OpenAI’s GPT-5.5 and others on the leaderboard are also single-model systems.

MDASH scored 88.45% on the CyberGym benchmark, a test developed by UC Berkeley researchers that measures how well AI systems can reproduce real-world vulnerabilities across 1,507 tasks drawn from 188 open-source software projects. 

Mythos Preview was second at 83.1%, followed by GPT-5.5 at 81.8%. 

The benchmark gives each system a description of a known vulnerability and an unpatched codebase, and measures whether it can produce a working attack that triggers the bug. 

The scores on the CyberGym leaderboard are self-reported by the companies, including Anthropic’s Mythos result. The benchmark code is public, but no independent party has verified any of the scores. Also, benchmark results don’t necessarily reflect real-world performance.

The results also highlight growing concerns about AI’s use as an offensive hacking tool. The same capabilities that allow AI to find vulnerabilities in friendly hands can be used to discover them for exploitation by attackers. Microsoft said MDASH is being used internally by its security engineering teams and will be entering a limited private preview with customers. 

Microsoft is telling customers to expect bigger Patch Tuesdays going forward as AI accelerates the discovery of vulnerabilities.

Read the full article here

You Might Also Like

Amazon secures multi-year rights to Oprah’s video podcasts

Make Democracy capitalist again – GeekWire

Microsoft tops Wall Street expectations, reports accelerating Azure growth and $37B AI run rate – GeekWire

Former Tableau product chief launches Golden Analytics, using AI to challenge the BI old guard – GeekWire

Washington state needs a ‘coherent’ story to compete in AI, leaders agree

Share This Article
Facebook Twitter Email Print
Leave a comment Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

- Advertisement -
Ad image

Trending Stories

Games

Turn-based tactics game Warhammer 40,000: Mechanicus 2 will release next week to coincide with a Warhammer Skulls event stacked with reveals

May 14, 2026
Games

Bungie is turning Marathon’s loot dial to 11 ahead of its first wipe, opening up Cryo Archive 24/7

May 14, 2026
News

Cowboy Space raises $275M as it seeks 40-60 employees for new satellite and rocket hub in Seattle – GeekWire

May 13, 2026
Games

No, Eric Barone is not adding infidelity to Stardew Valley, although he did briefly consider letting you ruin marriages, to Grandpa’s deep disappointment

May 13, 2026
AI

Nearly every enterprise is investing in AI, but only 5% say their data is ready – Computerworld

May 13, 2026
Games

Doom is 10 years old today, and you can pick it up for just $2 on Steam

May 13, 2026

Always Stay Up to Date

Subscribe to our newsletter to get our newest articles instantly!

Follow US on Social Media

Facebook Youtube Steam Twitch Unity

2024 © Prices.com LLC. All Rights Reserved.

Tech Journal Now

Quick Links

  • Privacy Policy
  • Terms of use
  • For Advertisers
  • Contact
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?