Build AI Factories with Supermicro and NVIDIA

What is a Supermicro AI Factory?

AI factories from Supermicro and NVIDIA are complete, turnkey solutions simplifying the deployment of enterprise AI at scale for faster time-to-online and time-to-revenue, with full-stack solutions including compute, software, networking, and storage. Supermicro delivers AI infrastructure optimized for performance and efficiency, with fully-integrated solutions based on NVIDIA Enterprise Reference Architectures and NVIDIA-Certified Systems™ for guaranteed full-stack performance and compatibility. Supermicro’s industry-leading rack-level testing, validation, and deployment services ensure quality and seamless plug-and-play deployment for complete AI confidence.

Supermicro

First-to-Market NVIDIA-Certified Systems

Rack-Scale Integration, Testing, and Validation before Shipping

Cluster-scale Deployment, Services, and Support

Storage and Networking Integration

NVIDIA

NVIDIA Accelerated Compute

NVIDIA Spectrum™-X Ethernet Networking Platform

NVIDIA Software Stack

NVIDIA AI Data Platform

Full-Stack Solution

Complete Rack-Level Integration

NVIDIA
AI Enterprise

NVIDIA
Omniverse

NVIDIA
Run:ai

Based on NVIDIA Enterprise Reference Architectures

Supermicro and NVIDIA Deliver Everything You Need to Reduce Complexity and Deploy AI Faster

Industry-leading Time-to-Online for the Latest AI Technologies.

Proven first-to-market track record for new NVIDIA acceleration technologies to market
Flexible building block approach enables faster adoption cycles
Production capacity in the USA of over 5,000 racks per month
Supermicro Data Center Building Block Solutions® (DCBBS) provides everything needed to facilitate the deployment of AI factories

Flexible, End-to-End AI Solutions Tailored to Your Enterprise.

Industry-leading broad portfolio of accelerated AI systems
Flexible, modular architectures fine-tuned to maximize performance and efficiency in enterprise environments
Cluster-level integration expertise including networking, testing, and validation
Storage solutions for all stages of the AI data pipeline

Proven Quality. Unmatched Performance. Complete AI Confidence.

Close cooperation between Supermicro and NVIDIA ensures performance-optimized AI hardware can be easily integrated into full-stack AI solutions
Full portfolio of NVIDIA-Certified Systems for guaranteed performance
Single-vendor solutions with complete quality, integrity, and compatibility control throughout the entire supply chain
Complete L11 testing and validation beyond industry standards for seamless plug-and-play deployment

Supermicro NVIDIA-Certified Systems™ – The Foundation of AI Factories

Supermicro’s flexible, modular architectures mean configurations and form factors have been fine-tuned to maximize performance and efficiency in enterprise environments, resulting in a simplified process of integrating AI-optimized hardware into existing enterprise environments where thermal, power, and space constraints may limit the use of one-size-fits-all or ready-made solutions. Supermicro’s industry-leading portfolio of NVIDIA-Certified Systems have been fully tested and validated for performance, reliability, and compatibility with NVIDIA Enterprise software and NVIDIA Spectrum-X networking, and form the building blocks for seamlessly scaling AI factories.

Learn More

NVIDIA RTX PRO™ Servers

Available in a range of form factors and densities to optimized for enterprise environments. From industry-standard 2U systems designed to replace CPU-based servers to thermally-optimized high density systems designed for maximum performance.

2U, 4U, and 5U form factors
Up to 8 NVIDIA RTX PRO 6000 Blackwell Server Edition GPUs per system
Multi-workload support including AI, HPC, and visual computing
Optimized for air cooled environments with support for ambient temperatures up to 35°C
AMD EPYC™ or Intel® Xeon® CPU options

Supermicro X14 5U 8-GPU system SYS-522GA-NRT — Thermally-optimized 5U system supporting up to 8 GPUs and Intel Xeon 6 CPUs
Learn More

Supermicro H14 5U 8-GPU system AS -5126GS-TNRT2 — Thermally-optimized 5U system supporting up to 8 GPUs and AMD EPYC 9005 series CPUs
Learn More

Supermicro H14 5U 4-GPU system AS -5126GS-TNRT — Thermally-optimized 5U system supporting up to 8 GPUs and AMD EPYC 9005 series CPUs
Learn More

Supermicro X14 4U 8-GPU system SYS-422GL-NR — 4U RTX PRO Server supporting up to 8 GPUs and Intel Xeon 6 GPUs
Learn More

Supermicro X13 5U 8-GPU system SYS-521GE-TNRT — 5U RTX PRO Server supporting up to 8 GPUs and 5th Gen Intel Xeon GPUs
Learn More

Supermicro portfolio of NVIDIA RTX PRO™ servers

NVIDIA HGX™ Systems

Specialized architectures designed for maximum AI performance. NVIDIA HGX systems offer unprecedented computational performance, density, and efficiency with next-generation air-cooled architectures as well as multiple CPU options.

8U and 10U form factors allow optimal system performance in air cooled environments
NVIDIA B300, B200 and H200 8-GPU with NVIDIA NVLink® for maximum GPU-GPU communication
AMD EPYC 9005 or Intel Xeon 6 CPU options

Supermicro X14 8U 8-GPU system SYS-822GS-NB3RT — Air cooled 8U system with NVIDIA HGX B300 8-GPU and Intel Xeon 6 CPUs
Learn More

Supermicro X14 10U 8-GPU system SYS-A22GA-NBRT — Air-cooled 10U system with NVIDIA HGX B200 8-GPU and Intel Xeon 6 CPUs
Learn More

Supermicro X13 8U 8-GPU system SYS-821GE-TNHR — Air-cooled 8U system with NVIDIA HGX H200 8-GPU and 5th Gen Intel Xeon CPUs
Learn More

Supermicro H13 8U 8-GPU system AS -8125GS-TNHR — Air-cooled 8U system with NVIDIA HGX H200 8-GPU and AMD EPYC 9004 CPUs
Learn More

Supermicro portfolio of NVIDIA HGX™ servers

Storage Platforms

Complete solutions require integration of storage hardware that is easy for enterprises to deploy. Together with leading data management ISVs, Supermicro is able to support the NVIDIA AI Data Platform and enable AI workflows within the data management environment.

Available in a range of form factors and storage densities
Storage drive options including EDSFF E3.S, U.2, and spinning media

Supermicro H13 1U Hyper system AS -1115HS-TNR — 1U 12-drive storage system
Learn More

AI at Scale with Supermicro AI Factory SuperClusters

Supermicro’s AI Factory SuperClusters are based on NVIDIA Enterprise Reference Architectures and provide enterprise customers with complete, rack-scale and cluster-scale solutions that ensure full-stack performance and compatibility, simplifying the deployment of complete AI factories. Supermicro’s testing and validation goes beyond industry standards, with complete testing of all nodes and cluster-level (L12) testing before shipment to ensure seamless plug-and-play deployment for customers of any size. Supermicro AI Factory solutions are endorsed by NVIDIA for Infrastructure Configuration, Spectrum-X networking, and Software Reference Stack and based on the NVIDIA Enterprise Reference Architecture for RTX PRO 6000 Blackwell Server Edition and HGX B200.

Complete Rack-Level Integration and Validation
Built by Supermicro’s expert teams, delivered ready to power on from day one
NVIDIA Spectrum-X Networking
High-speed AI compute fabric tuned and validated across the full stack of NVIDIA hardware and software, creating an unmatched Ethernet solution for AI factories.
NVIDIA Software Stack
Run NVIDIA AI Enterprise, NVIDIA Omniverse, and NVIDIA Run:AI with guaranteed compatibility
NVIDIA GPUs
Choose from NVIDIA RTX PRO 6000 Blackwell Server Edition GPU and HGX B200 to handle a wide range of enterprise AI workloads
Storage
Supermicro works with leading ISVs to create storage solutions that support the NVIDIA AI Data Platform and can be seamlessly connected to AI factories
Thermal Optimization
Built by Supermicro’s expert teams, delivered ready to power on from day one
Supermicro NVIDIA-Certified Systems
Tested and validated for guaranteed compatibility and performance

GPU	NVIDIA RTX PRO 6000 Blackwell Server Edition GPU	NVIDIA HGX B200	NVIDIA HGX B300
Maximum Cluster Size	Up to 32 nodes, 256 GPUs per scalable unit	Up to 32 nodes, 256 GPUs per scalable unit	Up to 32 nodes, 256 GPUs
Nodes per Rack (Typical)	4–8 per rack	4 per rack	4 per rack
GPU System Node SKU(s)	SYS-522GA-NRT SYS-422GL-NR AS -5126GS-TNRT2	SYS-A22GA-NBRT	SYS-822GS-NB3RT AS- 8126GS-NB3RT
GPU Configuration per Node	8x NVIDIA RTX PRO 6000 Blackwell Server Edition (96GB GDDR7 per GPU)	8x NVIDIA HGX B200 (192GB HBM3e per GPU)	8x NVIDIA HGX B300 (288GB HBM3e per GPU)
Rack Power (4 Nodes)	33.3–36.6kW	53.6kW	60kW
Networking	NVIDIA Spectrum-X	NVIDIA Spectrum-X	NVIDIA Spectrum-X
NVIDIA Software Stack	NVIDIA AI Enterprise/NVIDIA Omniverse/NVIDIA Run:ai	NVIDIA AI Enterprise/NVIDIA Omniverse/NVIDIA Run:ai	NVIDIA AI Enterprise/NVIDIA Omniverse/NVIDIA Run:ai
Target Deployment Use Case	AI inference / Retrieval Augmented Generation (RAG), HPC, and visual computing	Foundational AI model training, large-scale AI inference, and FP64 HPC workloads	Foundational AI model training, large-scale AI inference, and HPC workloads
Links	Learn More Datasheet	Learn More Datasheet	Learn More Datasheet

NVIDIA AI Software Platforms

Supermicro’s NVIDIA-Certified Systems™ have been fully tested and validated for performance, reliability, and compatibility with the NVIDIA AI software stack including NVIDIA AI Enterprise, NVIDIA Omniverse, and NVIDIA Run:ai, enabling the building and deployment of production-ready agentic AI and physical AI systems anywhere—across clouds, data centers, or at the edge.

NVIDIA AI Enterprise

NVIDIA AI Enterprise is a cloud-native suite of software tools, libraries, and frameworks, including NVIDIA NIM and NeMo microservices, that accelerate and simplify the development, deployment, and scaling of AI applications.

Learn More

NVIDIA Omniverse

NVIDIA Omniverse is a platform of APIs, SDKs, and services that enable developers to integrate OpenUSD, NVIDIA RTX™ rendering technologies, and generative physical AI into existing software tools and simulation workflows for industrial and robotic use cases.

Learn More

NVIDIA Run:ai

NVIDIA Run:ai accelerates AI operations with dynamic orchestration across the AI life cycle, maximizing GPU efficiency, scaling workloads, and integrating seamlessly into hybrid AI infrastructure with zero manual effort.

Learn More

A Complete AI Data Ecosystem, Built to Accelerate on Supermicro Systems

Supermicro storage servers provide the high-performance hardware foundation for the NVIDIA AI Data Platform, validated with industry-leading storage software partners. Whether you are running DDN, VAST Data, Weka, IBM Storage Scale, Cloudian, Everpure, or Nutanix, Supermicro delivers the compute, networking, and storage infrastructure to enable intelligent knowledge retrieval for RAG, semantic search, and AI agents across your enterprise data.

HyperPOD AI Data Platform

The HyperPOD AI Data Platform with DDN’s Infinia data intelligence platform and built on Supermicro servers is designed for enterprise AI data pipelines and applications.

Learn More

Supermicro VAST CNode-X

The Supermicro CNode-X and EBox systems create a unified AI data platform, bringing high-performance GPU compute directly to where your enterprise data lives.

Learn More

High-Speed Parallel File System for AI

Weka's cloud-native parallel file system on Supermicro NVMe storage systems delivers extreme-throughput data access for AI embedding and retrieval pipelines

Learn More

Accelerate AI at Any Scale

IBM Storage Scale on Supermicro systems provides a proven, enterprise-grade parallel file system with deep integration for AI workloads – enabling fast, secure data access across distributed environments.

Learn More

HyperScale® AI Data Platform

The HyperScale AI Data Platform combines Cloudian’s HyperStore data management with Supermicro systems to deliver a complete enterprise AI solution.

Learn More

All-Flash Performance for AI Data

The Everpure AI Data Platform integrates the Everpure Platform and FlashBlade with Supermicro PCIe GPU servers to accelerate enterprise AI data workloads.

Agentic AI Solution for AI Factories

The Nutanix AI Platform using Supermicro systems delivers pre-validated high performance infrastructure for enterprises.

Learn More

Unlock the Full Value of Your Enterprise Data

Supermicro storage infrastructure supports the NVIDIA AI Data Platform – transforming your existing data into an AI-ready knowledge base for RAG, semantic search, and intelligent AI agents. Your data stays secure and on-premises while becoming instantly accessible to AI applications in near real time.

Learn more about Supermicro Solutions for NVIDIA AI Data Platform

Built for the AI Factory

The AI Data Platform works alongside Supermicro AI Factory compute infrastructure, bridging intelligent data access with high-performance model training and inference.

Intelligent Knowledge Retrieval

Continuously ingest, embed, and index unstructured enterprise data – documents, images, video, and more – so AI applications can retrieve the right information instantly.

Security Without Compromise

Data never moves or gets duplicated. Vector embeddings are created in place, preserving existing access controls and data sovereignty requirements.

Always Current, Always Relevant

Continuous data processing keeps AI outputs grounded in your most up-to-date information, reducing hallucinations and improving decision quality.

Services and Onsite Deployment

Accelerate your data center deployment with comprehensive professional services from planning through ongoing support. Supermicro Global Services delivers end-to-end expertise including data center design, solution validation, and professional on-site deployment – whether you're building from bare land, retrofitting air-to-liquid cooling, or deploying in co-location facilities. Our integrated approach reduces time-to-online and ensures higher-quality installations, backed by continued on-site support and 4-hour response time options for mission-critical uptime.

Featured Resources

AI Infrastructure

Data Center Building Block Solutions® (DCBBS)

AI Factory

Edge AI

AI Storage

Industry AI Solutions

NVIDIA Solutions

AMD Solutions

Intel Solutions

Rackmount Servers

1U Dual Processor

2U Dual Processor

Single Processor

Multi-Processor

Product Families

GPU Servers

8U/10U GPU Lines

4U/5U GPU Lines

2U GPU Lines

1U GPU Lines

Twin Servers

FlexTwin™

BigTwin®

GrandTwin®

TwinPro®

FatTwin®

Blade Servers

SuperBlade®

MicroBlade®

MicroCloud

Storage Servers

All Storage Systems

All-Flash NVMe

Top-Loading Storage

JBOF

Petascale Grace Storage

Enterprise-Optimized Storage

JBOD Storage Enclosures

Motherboards

Server Boards

Workstation Boards

Embedded / IoT Boards

Desktop / Gaming Boards

Motherboard Matrix

Global SKUs

Chassis

1U Chassis

2U Chassis

3U Chassis

4U / Tower Chassis

Mid / Mini-Tower

Embedded / IoT Chassis

Mobile Racks / Drive Kits

JBOD Storage Enclosures

Global SKUs

SuperRack®

Rack Integration Service

Accessories

Cable Matrix

Riser Card Matrix

Storage AOC Matrix

Power Supply Matrix

Heatsink Matrix

System Fan Matrix

Mobile Racks / Drive Kits

Front Chassis Bezels

Storage, I/O, Security

Edge AI and IoT Systems

Compact Edge Systems

Compact Edge Servers

Rackmount Edge Servers

Embedded Components

Embedded Motherboards

Embedded Chassis

Switches

Adapters

SuperWorkstations

Liquid-Cooled AI Development Platform

Single-Processor

Dual-Processor