know.2nth.ai › Agents › agents › microsoft-mai

agents · Models · Reference leaf

MAI, and
humanist superintelligence.

At Build 2026, Microsoft AI — led by Mustafa Suleyman — unveiled seven in-house MAI frontier models and reframed its strategy around "humanist superintelligence": efficiency, specialisation, data sovereignty, and hardware-software co-design. This leaf tracks what was actually announced, what it means for agent-OS architecture, and where the open-standard anchors sit. The MAI models are proprietary; the anchors are not.

Microsoft Build 2026 7 MAI models Proprietary · private preview Open anchors: MCP · Skills Validated 2026-06-04

01 · What was announced

Seven frontier models, built in-house from scratch.

San Francisco, 2 June 2026. The Microsoft AI (MAI) Superintelligence Team — formed November 2025, led by CEO Mustafa Suleyman — unveiled a family of seven frontier models spanning image, voice, transcription, reasoning and coding. A deliberate move away from Microsoft's posture as the enterprise delivery vehicle for OpenAI and Anthropic models inside Microsoft 365.

Source-validation note

This leaf was rebuilt from a supplied draft; five claims were corrected against primary sources, marked [corrected] inline and listed in the Resources section. The MAI models are proprietary (private preview / rolling release on Microsoft Foundry), not open source — the open anchors below are the standards the agent-OS thesis rests on, not the MAI models themselves.

Model	Type	Notable spec
MAI-Thinking-1	Reasoning	Trillion-parameter mixture-of-experts, 35B active, 128k context. Trained with no distillation from other companies' models. Private preview on Foundry. `[corrected]`
MAI-Code-1-Flash	Coding	5B-parameter model, rolling out in VS Code and GitHub Copilot. `[corrected]`
MAI-Image-2.5 + Flash	Image	Microsoft says it ranks #2 on a leading image-editing leaderboard, ahead of Google's Nano Banana Pro.
MAI-Voice-2	Voice	Natural-sounding output across ~15 languages.
MAI-Transcribe-1.5	Transcription	Speech-to-text.
Aion Instruct / Aion Plan	On-device SLMs	Small models for local reasoning/planning on Windows.

The unifying message from Suleyman: "This is all about long-term self-sufficiency for Microsoft and our partners. It's about models you can trust."

02 · Why it matters

The headline is cost-per-task and control, not size.

The numbers Microsoft put on stage are all vendor-reported — treat as directional until independently reproduced — but they point the conversation away from "how large is the model?".

10× cost efficiency

Using RL environments tuned on McKinsey's tasks, a MAI model beat GPT-5.5 with 10× greater cost efficiency. A separate Excel-agent example matched GPT-5.4 at 10× lower cost. [corrected — two distinct benchmarks]

30% better perf-per-dollar

The Maia 200 inference chip versus the leading GPU, on Microsoft's measurement.

1.4× perf-per-watt

Running MAI-Thinking-1 end-to-end on Microsoft's own silicon.

1 trillion-fold

Suleyman's framing of the increase in frontier-training compute over 15 years — the scaling backdrop.

The strategic shift for buyers: the question moves from "how large is the model?" to "how efficient is the task execution, and who owns the resulting intelligence?"

03 · How it works — four pillars

Four pillars, and the open anchors under them.

The strategy rests on four ideas. Each proprietary pillar has an open-standard analog — the part you can reproduce without renting Microsoft's stack.

Pillar 1 — Humanist superintelligence. State-of-the-art AI explicitly architected to augment, not replace, humans and organisations. Microsoft positions value as a function of what the system is anchored to, not raw capability alone.

Pillar 2 — Frontier Tuning + Reinforcement Learning Environments (RLEs). Rather than renting shared intelligence, organisations build custom, task-specific agents inside private "training gyms" adapted to their own workflows, evals, tools and data. The tuned model becomes a proprietary operational moat.

Open anchor — Gymnasium

RLEs are proprietary, but the working open standard for agent training environments is the Gymnasium API (Farama Foundation, the maintained successor to OpenAI Gym). To reproduce the pattern without renting Microsoft's gyms, that's where it lives: github.com/Farama-Foundation/Gymnasium.

Pillar 3 — Hardware-software co-design. MAI-Thinking-1 was co-designed and optimised on the Maia 200 inference chip (TSMC 3nm; ~10 PFLOPS FP4; 216GB HBM3e). Maia 200 was first revealed 26 January 2026 and confirmed in production at Build (Iowa + Arizona, expanding to Italy, Australia, South Korea). [corrected — not first announced at Build]

Open anchor — MRC

The reliability layer for extreme-scale workloads, MRC (Multipath Reliable Connection), was announced as an open network protocol co-developed with AMD, Broadcom, Intel, OpenAI and NVIDIA — a rare open artefact in an otherwise proprietary stack.

Pillar 4 — Clean data lineage. MAI-Thinking-1's selling point to enterprises is no distillation and legally clean training data — a direct play for regulated, production-grade deployment.

Agent-OS open anchors — MCP & Agent Skills

The two standards that make the "personal agent → sub-tree of skill agents → elastic OS" model portable across vendors:

Model Context Protocol (MCP) — open agent-to-tool standard: github.com/modelcontextprotocol. Agent Skills — open standard, Apache 2.0 + CC-BY-4.0: agentskills.io. Both are live leaves in this tree.

04 · The ecosystem

Build 2026 in context: proprietary layer, open analog.

Where each Microsoft piece sits, and the open standard that keeps the equivalent layer vendor-portable.

Layer	Microsoft (proprietary)	Open analog
Frontier reasoning	MAI-Thinking-1 (Foundry preview)	— (no open frontier-reasoning peer)
Coding agent	MAI-Code-1-Flash (VS Code, Copilot)	—
On-device	Aion Instruct / Aion Plan (Windows)	Local SLM runtimes (llama.cpp ecosystem)
Agent tooling	Foundry, Agent Mode, Microsoft Scout	MCP, Agent Skills
Custom training	Frontier Tuning / RLEs	Gymnasium API
Silicon	Maia 200, Cobalt 200	—
Networking	MRC	MRC is itself the open protocol
Vertical model	Mayo Clinic healthcare model	—

Other Build 2026 announcements relevant to agent-OS work: Azure HorizonDB (managed PostgreSQL for agentic apps, vector + semantic search), Project Solara (chip-to-cloud platform for agent-first devices), Microsoft Discovery (GA — agentic scientific-research platform), and MDASH (multi-agent security system).

05 · Use cases

As demonstrated on stage.

Enterprise reasoning at lower cost — MAI-Thinking-1 for medium-weight-class reasoning where GPT-class cost is hard to justify. Inference-efficient coding — MAI-Code-1-Flash inside existing GitHub Copilot / VS Code workflows. Company-specific agents — Frontier Tuning + RLEs to build a private "hill-climbing machine" around proprietary workflows (the McKinsey and Excel examples). Verticalised healthcare — the Mayo Clinic frontier model (below). On-device agents — Aion SLMs for local reasoning/planning without cloud round-trips. Agentic security — MDASH using teams of agents to find exploitable bugs.

06 · The healthcare frontier

Mayo Clinic — a data-sovereignty story, not a Microsoft-owns-it story.

Microsoft and Mayo Clinic announced a strategic collaboration to develop a frontier AI model purpose-built for healthcare, pairing Mayo's de-identified clinical data and longitudinal insights with Microsoft's AI, cloud and engineering. Two corrections to the common framing:

Mayo Clinic owns the model

Consistent with its stewardship of clinical data. Microsoft plans to offer access via Azure AI Foundry APIs. A data-sovereignty story, not a Microsoft-owns-it story. [corrected]

Not yet a clinical team member

Suleyman said it will take "many years" to train and refine the model to be trustworthy for high-stakes use. It is initially deployed inside Mayo's environment for testing and refinement. [corrected — draft overstated maturity]

07 · Decision guide

When to pay attention — and when to stay sceptical.

Pay attention when	Stay sceptical when
You're cost-bound on inference and your tasks are narrow / repeatable	You need the absolute capability frontier — MAI-Thinking-1 is medium weight class by design
Data lineage and ownership are compliance requirements ("own it vs rent it")	You need it now — much is private preview, rolling release, or months out
You already live in Foundry / 365 and want first-party models	You need open weights — none of the MAI models are open source
The RLE / Frontier-Tuning economics fit your workflow	Vendor benchmarks are the only evidence — the 10× / 30% figures are Microsoft's own

The strongest part of the announcement is the own-it-vs-rent-it argument for regulated workloads. If portability and openness are the requirement, build on the open anchors in section 03 — not on MAI.

08 · The 2nth.ai read

It maps almost directly onto the agent-OS thesis.

Humans operate a personal agent that fans out to a sub-tree of skill-specific agents, treating cloud and SaaS as an elastic operating system to be continuously reinforced and optimised. Build 2026 is the proprietary version of that picture.

Human / Personal Agent — "humanist superintelligence" is the same augment-not-replace stance the 2nth.ai grid is built on. Skills — RLEs are Microsoft's proprietary version of the skill-agent training loop; the open, portable equivalent is the Agent Skills standard (already live in the 2nth-skills registry) plus the Gymnasium API for the environments. Elastic OS — Maia 200 + Cobalt 200 + Foundry are Microsoft's elastic-OS substrate; MCP and MRC are the open protocols that keep the OS layer vendor-portable.

SA deployment note — data residency

Maia 200's confirmed rollout regions (Iowa, Arizona → Italy, Australia, South Korea) do not yet include an African region. For POPIA / residency-sensitive workloads, MAI inference and the Mayo model would currently route out of country. Where in-country inference is an architectural requirement, validate against Azure southafricanorth model availability (deployment-type specific — Regional vs Global SKU matters) and the Vertex AI africa-south1 / AWS af-south-1 alternatives before committing.

Educational node only. This section maps the announcement to the 2nth.ai model; it does not describe a 2nth.ai or Imbila service offering.

09 · How this leaf connects

The open anchors are live leaves in this tree.

Where the portable equivalents of Microsoft's proprietary pillars actually live.

agents/mcp

Model Context Protocol

The open agent-to-tool standard — the portable equivalent of Microsoft's Foundry / Agent Mode tooling layer.

agents/skills

Agent Skills

The open skill standard (Apache 2.0 + CC-BY-4.0) — the portable equivalent of RLEs as a skill-agent loop.

agents/claude

Claude (model sibling)

A frontier model family in the same Models band — the kind of provider Microsoft was the delivery vehicle for before going in-house.

agents/gpt

GPT · OpenAI

The models MAI benchmarks against (GPT-5.4 / 5.5) and the partnership Microsoft is diversifying away from.

tech/microsoft

Microsoft sub-hub

The implementation side — Azure OpenAI from Workers, AI Gateway, Copilot in SA — where Foundry-hosted models actually get wired in.

media

Media domain

MAI-Voice-2, MAI-Image-2.5 and MAI-Transcribe-1.5 are generative-media models — the domain that tracks that band.

10 · Resources & source validation

Primary sources and the corrections applied.

First-party and reputable secondary coverage, the open-standard anchors, and the five corrections made against the supplied draft.

Primary Microsoft Source — Mayo Clinic + Maia 200 news.microsoft.com/source Open anchor Gymnasium — RL environments github.com/Farama-Foundation/Gymnasium Open anchor Model Context Protocol github.com/modelcontextprotocol Open anchor Agent Skills standard agentskills.io

Corrections applied to the supplied draft

1. MAI-Thinking-1 is a trillion-param MoE with 35B active params — not a flat "35B model". 2. The coding model is MAI-Code-1-Flash (5B), hyphenated. 3. Maia 200 was revealed 26 Jan 2026, confirmed in production at Build — not first announced at Build. 4. The 10× cost-efficiency figure covers two distinct benchmarks (McKinsey vs GPT-5.5; Excel RLE vs GPT-5.4), not one. 5. The Mayo Clinic model is owned by Mayo, is early-stage ("many years" to maturity per Suleyman), and is not yet a real-time clinical team member.

Reputable secondary: GeekWire, CNBC, TechRadar, DataCenterDynamics, Fierce Healthcare / CNN / Euronews. Last validated 4 June 2026.

MAI, andhumanist superintelligence.