Microsoft introduces AI accelerator for US Azure customers
The company has developed Maia 200, an AI accelerator that promises to boost inference workloads
Microsoft has announced that Azure’s US Central datacentre region is the first to receive a new artificial intelligence (AI) inference accelerator, Maia 200.
Microsoft describes Maia 200 as an inference powerhouse, built on TSMC’s 3nm process with native FP8/FP4 (floating point) tensor cores and a redesigned memory system that uses 216GB of the latest high-speed memory architecture (HBM3e), capable of transferring data at 7TB per second. Maia 200 also provides 272MB of on-chip memory plus data movement engines, which Microsoft said are used to keep massive models fed, fast and highly utilised.
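The FP8/FP4 formats Microsoft cites are hardware-specific, but the underlying idea of low-precision inference can be illustrated generically. The sketch below (an illustration only, not Maia’s actual number formats or software) snaps floating-point weights onto a small signed grid, showing why 4-bit storage halves memory traffic versus 8-bit at the cost of coarser values:

```python
import random

def quantize_dequantize(values, bits):
    """Symmetric quantization: snap floats onto a small signed integer grid
    and back, roughly what low-precision tensor cores do to model weights."""
    qmax = 2 ** (bits - 1) - 1              # e.g. 127 for 8-bit, 7 for 4-bit
    scale = max(abs(v) for v in values) / qmax
    return [max(-qmax, min(qmax, round(v / scale))) * scale for v in values]

# Illustrative stand-in for a tensor of model weights
rng = random.Random(0)
weights = [rng.gauss(0.0, 1.0) for _ in range(1000)]
w8 = quantize_dequantize(weights, bits=8)
w4 = quantize_dequantize(weights, bits=4)

# Coarser grid -> larger reconstruction error, but half the bytes moved
err8 = sum(abs(a - b) for a, b in zip(weights, w8)) / len(weights)
err4 = sum(abs(a - b) for a, b in zip(weights, w4)) / len(weights)
```

The trade-off this models is the one accelerator designers exploit: inference tolerates the added quantization error, while the smaller formats let the chip stream twice as many weights per second from memory.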
According to the company, these hardware features mean Maia 200 is capable of delivering three times the FP4 performance of the third-generation Amazon Trainium, and FP8 performance above Google’s seventh-generation tensor processing unit. Microsoft said Maia 200 represents its most efficient inference system yet, offering 30% better cost performance than existing systems. At the time of writing, however, the company was unable to give a date for when the product would be available outside of the US.
Along with its US Central datacentre region, Microsoft announced that its US West 3 datacentre region near Phoenix, Arizona, will be the next to be updated with Maia 200.
In a blog post describing how Maia 200 is being deployed, Scott Guthrie, Microsoft executive vice-president for cloud and AI, said the setup comprises racks of trays configured with four Maia accelerators. Each tray is fully connected with direct, non‑switched links, to keep high‑bandwidth communication local for optimal inference efficiency.
He said the same Maia AI transport protocol is used for both intra-rack and inter-rack networking, providing a way to scale clusters of Maia 200 accelerators with minimal network hops.
“This unified fabric simplifies programming, improves workload flexibility and reduces stranded capacity while maintaining consistent performance and cost efficiency at cloud scale,” added Guthrie.
Guthrie said Maia 200 introduces a new type of two-tier scale-up design built on standard Ethernet. “A custom transport layer and tightly integrated NIC [network interface card] unlocks performance, strong reliability and significant cost advantages without relying on proprietary fabrics,” he added.
In practice, this means each accelerator offers up to 1.4TB per second of dedicated scale-up bandwidth and, according to Guthrie, enables Microsoft to provide predictable, high-performance collective operations across clusters of up to 6,144 accelerators.
What this all means, at least from Guthrie’s perspective, is that the Maia 200 architecture is capable of delivering scalable performance for dense inference clusters while reducing power usage and total cost of ownership across Azure’s global fleet of datacentres.
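The bandwidth figures quoted above set a hard ceiling on inference speed, which a back-of-envelope calculation makes concrete. The sketch below uses the 7TB/s HBM3e figure from the article; the 70-billion-parameter model size is a hypothetical assumption chosen purely for illustration:

```python
# Back-of-envelope: the memory-bandwidth ceiling on single-batch decode.
# Only the 7TB/s HBM3e bandwidth is from the article; the model size and
# FP4 weight format are illustrative assumptions.
params = 70e9                       # hypothetical 70B-parameter model
bytes_per_param_fp4 = 0.5           # 4-bit weights = half a byte each
weight_bytes = params * bytes_per_param_fp4   # 35 GB of weights

hbm_bandwidth = 7e12                # 7 TB/s, as quoted for Maia 200

# At batch size 1, generating each token requires streaming every weight
# from memory once, so bandwidth alone bounds decode speed at roughly:
max_tokens_per_s = hbm_bandwidth / weight_bytes
print(f"{max_tokens_per_s:.0f} tokens/s ceiling")  # prints "200 tokens/s ceiling"
```

This kind of arithmetic is why the article's emphasis falls on memory bandwidth and narrow FP8/FP4 formats rather than raw compute: halving the bytes per weight doubles the bandwidth-bound token rate.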
On the software side, he said a sophisticated simulation pipeline was used to guide the Maia 200 architecture from its earliest stages. The pipeline involved modelling the computation and communication patterns of large language models with high fidelity.
“This early co-development environment enabled us to optimise silicon, networking and system software as a unified whole – long before first silicon,” said Guthrie, adding that Microsoft also developed a significant emulation environment, which was used from low-level kernel validation all the way to full model execution and performance tuning.
As part of the roll-out, the company is offering AI developers a preview of the Maia 200 software development kit (SDK).
