ENFR

Tech • IA • Crypto

Today Topics Videos Crypto Archives Favorites

NVIDIA Vera Rubin Platform Ramping into Full Production | Built for the Era of Agents

10/10

NVIDIANVIDIAJune 2, 2026 at 04:00 PM4:08

Audio player

0:00 / 0:00

TL;DR

NVIDIA has launched Vera Rubin, a multi-rack, pod-scale supercomputer designed to power agentic AI systems that can autonomously reason, plan, and act.

KEY POINTS

Shift to Agentic AI

AI development is moving beyond simple response generation toward “agentic” systems capable of observing, reasoning, planning, and using tools. These systems require managing vast context, combining working and long-term memory, and coordinating specialized sub-agents. This shift significantly increases computational and architectural demands.

Vera Rubin Architecture

Vera Rubin is described as the first multi-rack, pod-scale supercomputer built specifically for agentic AI workloads. It integrates multiple interconnected rack-scale systems into a unified platform designed to handle large-scale reasoning, orchestration, and memory-intensive processing.

Advanced Chip and Board Design

The system is built on seven new chips manufactured using TSMC’s 3-nanometer process, with advanced packaging and HBM4 memory from Micron, SK hynix, and Samsung. A single compute board contains 6 trillion transistors and over 18,000 components, highlighting the scale of integration required.

High-Density Compute and Networking

Each system includes 18 compute trays and NVLink switch trays, connected without cables for resilience. Liquid-cooled infrastructure delivers over 5,000 amps, equivalent to the power draw of 20 electric cars at full acceleration. Networking is powered by ConnectX-9, BlueField-4 DPUs, and Spectrum-X photonic Ethernet.

Specialized Processing Roles

The MVL72 system handles reasoning, planning, and high-throughput token generation, while Grace CPU racks with 256 CPUs orchestrate workloads and memory. Complementing this, Groq LPX systems deliver ultra-low-latency inference with 40 petabytes per second of SRAM bandwidth.

Massive Supply Chain Effort

Production spans 150 supply chain partners across Taiwan, involving millions of square feet of manufacturing space and hundreds of facilities. Companies including Foxconn, Quanta, Microsoft, Dell, and CoreWeave are already deploying engineering racks.

CONCLUSION

Vera Rubin represents a major leap in AI infrastructure, combining extreme-scale hardware and global manufacturing coordination to support the next generation of autonomous, agent-driven computing systems.

Full transcript

More from NVIDIA