
DCBBS Blueprints for NVIDIA Vera Rubin NVL72

Built to Scale from 5MW to 1GW

  • Complete 1,152-GPU NVIDIA Vera Rubin NVL72 Scalable Unit per 5MW power envelope, multiplied to scale from a single unit to gigawatt-class AI factories
  • 331 TB of HBM4 GPU memory* and 864 TB of LPDDR5X CPU memory per Scalable Unit, coherently accessible across the NVLink fabric
  • Industry-leading DLC-2 direct liquid cooling, from direct-to-chip cold plates through 1MW cooling towers, sized for 227 kW per rack
  • Dedicated Supermicro team across the full lifecycle: site survey, project design, integration, deployment, and ongoing support
  • Supporting NVIDIA’s latest reference architecture integrating NVIDIA Context Memory Storage Platform, NVIDIA Spectrum™-X Ethernet, and NVIDIA Quantum-X800 InfiniBand Platform
  • Management Software Suite: End-to-end SuperCloud software delivers unified infrastructure control, deployment automation, developer tools, and multi-tenant GPU cloud management

* Physical GPU memory
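The Scalable Unit figures above can be cross-checked with simple arithmetic. The sketch below is a back-of-the-envelope illustration, not a Supermicro sizing tool. It assumes decimal units (1 TB = 1,000 GB), takes the 72 GPUs and 36 CPUs per rack from the NVL72 description elsewhere on this page, and assumes every GPU in the unit sits in an NVL72 compute rack. How the power left over in the 5MW envelope is divided among networking, storage, and cooling is not stated on this page, so it is reported only as unattributed headroom.

```python
# Figures quoted on this page.
GPUS_PER_SU = 1152          # GPUs per Scalable Unit
GPUS_PER_RACK = 72          # Rubin GPUs per NVL72 rack
CPUS_PER_RACK = 36          # Vera CPUs per NVL72 rack
SU_POWER_MW = 5             # power envelope per Scalable Unit
RACK_POWER_KW = 227         # cooling design point per rack
HBM4_TB_PER_SU = 331        # physical GPU memory per Scalable Unit
LPDDR5X_TB_PER_SU = 864     # CPU memory per Scalable Unit

# Derived values (assumption: all GPUs live in NVL72 compute racks).
racks_per_su = GPUS_PER_SU // GPUS_PER_RACK
cpus_per_su = racks_per_su * CPUS_PER_RACK
compute_rack_power_kw = racks_per_su * RACK_POWER_KW
unattributed_headroom_kw = SU_POWER_MW * 1000 - compute_rack_power_kw

# Implied per-device memory (assumption: decimal TB and GB).
hbm4_gb_per_gpu = HBM4_TB_PER_SU * 1000 / GPUS_PER_SU
lpddr5x_tb_per_cpu = LPDDR5X_TB_PER_SU / cpus_per_su

# Scaling from one 5MW unit to a 1GW site.
units_per_gw = 1000 // SU_POWER_MW
gpus_per_gw = units_per_gw * GPUS_PER_SU

print(f"Racks per Scalable Unit:      {racks_per_su}")
print(f"Vera CPUs per Scalable Unit:  {cpus_per_su}")
print(f"Compute rack power (kW):      {compute_rack_power_kw}")
print(f"Unattributed headroom (kW):   {unattributed_headroom_kw}")
print(f"Implied HBM4 per GPU (GB):    {hbm4_gb_per_gpu:.1f}")
print(f"Implied LPDDR5X per CPU (TB): {lpddr5x_tb_per_cpu:.2f}")
print(f"Scalable Units per 1GW:       {units_per_gw}")
print(f"GPUs per 1GW:                 {gpus_per_gw}")
```

Under these assumptions, one Scalable Unit works out to 16 NVL72 racks, and a 1GW site to 200 Scalable Units.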

End-to-end solution spanning compute, networking, storage, power and cooling for streamlined deployment
Supermicro DCBBS DLC-2 liquid cooling stack with in-rack or in-row CDU, RDHx, and L2A sidecar options
NVIDIA Context Memory Storage Platform and High Performance Storage integrated for long-context and agentic AI workloads
Dedicated Supermicro team manages deployment from site survey through commissioning and ongoing support
DCBBS & NVIDIA Vera Rubin NVL72 Scalable Unit – 1,152 GPUs total

NVIDIA Vera Rubin NVL72 SuperCluster

Supermicro is engineering its NVIDIA Vera Rubin NVL72 with new DCBBS liquid-cooling components to fully support the power and thermal envelope at rack and cluster scale. This includes manufacturing optimized NVIDIA MGX racks, in-rack or in-row CDUs, RDHx units, and L2A sidecars to streamline production and deployment of the rack-scale AI supercomputer. The Vera Rubin NVL72 operates as a single rack-scale accelerator, unifying six co-designed chips (Rubin GPU, Vera CPU, NVLink 6, ConnectX-9, BlueField-4, and Spectrum-X) to deliver 3.6 exaflops of inference, 75 TB of fast memory, and 1.6 PB/s of HBM4 bandwidth, targeting up to 10x the throughput per watt and one-tenth the token cost compared to NVIDIA Blackwell.

NVIDIA Vera Rubin NVL72 Unifies 72 Rubin GPUs and 36 Vera CPUs in a rack through the latest NVIDIA NVLink-C2C and NVLink 6
Power-efficient Scale-out and Scale-across Connectivity using NVIDIA Quantum-X800 InfiniBand or Spectrum™-X Ethernet
Extreme co-design of Rubin GPU, Vera CPU, NVLink 6, ConnectX-9, BlueField®-4 and Spectrum-X
Supermicro DCBBS DLC-2 Liquid-cooling Optimized for Supermicro NVIDIA MGX racks, in-rack or in-row CDU, RDHx and L2A sidecar
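The per-rack figures quoted for the Vera Rubin NVL72 can be reconciled with the Scalable Unit totals given earlier on this page. The sketch below is a rough consistency check only. It assumes that "fast memory" means HBM4 plus LPDDR5X, that a Scalable Unit is 1,152 / 72 = 16 racks, and that per-rack figures simply multiply across racks. The aggregates are nominal sums of the quoted per-rack numbers, not measured cluster performance.

```python
# Figures quoted on this page.
GPUS_PER_SU = 1152
GPUS_PER_RACK = 72
HBM4_TB_PER_SU = 331
LPDDR5X_TB_PER_SU = 864
RACK_FAST_MEMORY_TB = 75        # quoted per NVL72 rack
RACK_INFERENCE_EXAFLOPS = 3.6   # quoted per NVL72 rack
RACK_HBM4_BW_PBS = 1.6          # quoted per NVL72 rack, PB/s

racks_per_su = GPUS_PER_SU // GPUS_PER_RACK

# Assumption: "fast memory" = HBM4 + LPDDR5X.
su_fast_memory_tb = HBM4_TB_PER_SU + LPDDR5X_TB_PER_SU
implied_rack_fast_memory_tb = su_fast_memory_tb / racks_per_su

# Nominal aggregates: per-rack figure times rack count, nothing more.
su_inference_exaflops = RACK_INFERENCE_EXAFLOPS * racks_per_su
su_hbm4_bw_pbs = RACK_HBM4_BW_PBS * racks_per_su

print(f"Fast memory per Scalable Unit (TB): {su_fast_memory_tb}")
print(f"Implied fast memory per rack (TB):  {implied_rack_fast_memory_tb:.2f}")
print(f"Quoted fast memory per rack (TB):   {RACK_FAST_MEMORY_TB}")
print(f"Nominal inference per unit (EF):    {su_inference_exaflops:.1f}")
print(f"Nominal HBM4 bandwidth (PB/s):      {su_hbm4_bw_pbs:.1f}")
```

With these inputs the implied per-rack memory rounds to the quoted 75 TB, which suggests the rack and Scalable Unit figures are mutually consistent.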

2U NVIDIA HGX Rubin NVL8 System

The 2U HGX Rubin NVL8 system is the densest and most flexible HGX platform, and the first to offer a choice of CPUs: NVIDIA Vera CPUs alongside next-generation AMD and Intel x86 processors. Built on the NVIDIA MGX rack architecture with Supermicro’s blind-mate busbar and manifold for tool-free rack integration, it gives customers the freedom to pair eight Rubin GPUs with the CPU platform that best fits their workload and software stack.

Up to 9 systems and 72 GPUs in a rack
Supports the new NVIDIA Vera CPUs and next-generation x86 CPUs
Extreme co-design of Rubin GPU, NVLink 6, ConnectX-9, BlueField®-4 and Spectrum-X
DLC-2 98+% Heat Capture with DCBBS L2A Sidecar Option
Ready to Build the Future of AI?

Contact Supermicro today to design your next-generation AI data center.

Contact Us

Certain products may not be available in your region