Microsoft Achieves Global First with Nvidia Vera Rubin AI Supercomputer Validation
In a significant development for artificial intelligence infrastructure, Microsoft CEO Satya Nadella has announced that his company has become the world's first cloud provider to install and bring up the Nvidia Vera Rubin NVL72 system for validation. This milestone represents a crucial step forward in the global race to build the next generation of AI computing capabilities.
Unveiling the Vera Rubin NVL72 System
Nvidia CEO Jensen Huang originally unveiled the Vera Rubin NVL72 AI supercomputer at CES, with the company promising groundbreaking performance improvements. The system offers up to five times greater inference performance and ten times lower cost per token compared to Nvidia's current top-tier Blackwell AI chip available in the market.
Nadella shared the news alongside a photograph of the installed system, stating: "We're the first cloud to bring up an NVIDIA Vera Rubin NVL72 system for validation, another big step in building the next generation of AI infrastructure with NVIDIA."
Deepening Microsoft-Nvidia Partnership
This achievement builds upon the strengthened partnership between Microsoft and Nvidia announced in October last year, when both companies revealed they were deepening their collaboration to power the next wave of AI industrial innovation. For years, their partnership has helped fuel the AI revolution by bringing advanced supercomputing to the cloud, enabling breakthrough frontier models, and making AI more accessible to organizations worldwide.
The companies announced they were building on that foundation with new advancements delivering greater performance, capability, and flexibility. Key developments include:
- Nvidia added support for Nvidia RTX PRO 6000 Blackwell Server Edition on Azure Local
- New NVIDIA Nemotron and NVIDIA Cosmos models in Azure AI Foundry
- NVIDIA Run:ai on Azure for enterprises to optimize GPU utilization
- The world's first deployment of NVIDIA GB300 NVL72
These innovations allow customers to deploy AI and visual computing workloads in distributed and edge environments with easy orchestration and management in the cloud, while giving businesses an enterprise-grade platform to build, deploy, and scale AI applications and agents.
Understanding the Vera Rubin Architecture
The Vera Rubin represents Nvidia's most ambitious AI data center architecture to date. Huang described it as the product of what Nvidia calls "extreme co-design," allowing six different types of chips to work together as one unified system.
The six components that form this revolutionary architecture are:
- The Vera CPU
- The Rubin GPU
- The NVLink 6 switch
- The ConnectX-9 SuperNIC
- The BlueField-4 data processing unit
- The Spectrum-6 Ethernet switch
Together, these components form the building blocks of the Vera Rubin NVL72 rack—a single unit of AI computing infrastructure more powerful than anything Nvidia has built previously. This unified system approach represents a significant leap forward in AI processing capability and efficiency.
The validation of this system by Microsoft marks a pivotal moment in cloud computing and artificial intelligence development, positioning both companies at the forefront of next-generation technological infrastructure.
