What is a Supermicro AI Factory?

AI factories from Supermicro and NVIDIA are complete, turnkey solutions that simplify the deployment of enterprise AI at scale for faster time-to-online and time-to-revenue, with full-stack solutions including compute, software, networking, and storage. Supermicro delivers AI infrastructure optimized for performance and efficiency, with fully integrated solutions based on NVIDIA Enterprise Reference Architectures and NVIDIA-Certified Systems™ for guaranteed full-stack performance and compatibility. Supermicro’s industry-leading rack-level testing, validation, and deployment services ensure quality and seamless plug-and-play deployment for complete AI confidence.

Supermicro

First-to-Market NVIDIA-Certified Systems

Rack-Scale Integration, Testing, and Validation before Shipping

Cluster-scale Deployment, Services, and Support

Storage and Networking Integration

NVIDIA

NVIDIA Accelerated Compute

NVIDIA Spectrum™-X Ethernet Networking Platform

NVIDIA Software Stack

NVIDIA AI Data Platform

Full-Stack Solution

[Diagram: Complete Rack-Level Integration. Labels: NVIDIA AI Enterprise, NVIDIA Omniverse, NVIDIA Run:ai; Based on NVIDIA Enterprise Reference Architectures; Data; Power; Supermicro NVIDIA GPU Server Racks; Intelligence]

Supermicro and NVIDIA Deliver Everything You Need to Reduce Complexity and Deploy AI Faster

Industry-leading Time-to-Online for the Latest AI Technologies.

  • Proven first-to-market track record for new NVIDIA acceleration technologies
  • Flexible building block approach enables faster adoption cycles
  • Production capacity in the USA of over 5,000 racks per month
  • Supermicro Data Center Building Block Solutions® (DCBBS) provides everything needed to facilitate the deployment of AI factories

Flexible, End-to-End AI Solutions Tailored to Your Enterprise.

  • Industry-leading broad portfolio of accelerated AI systems
  • Flexible, modular architectures fine-tuned to maximize performance and efficiency in enterprise environments
  • Cluster-level integration expertise including networking, testing, and validation
  • Storage solutions for all stages of the AI data pipeline

Proven Quality. Unmatched Performance. Complete AI Confidence.

  • Close cooperation between Supermicro and NVIDIA ensures performance-optimized AI hardware can be easily integrated into full-stack AI solutions
  • Full portfolio of NVIDIA-Certified Systems for guaranteed performance
  • Single-vendor solutions with complete quality, integrity, and compatibility control throughout the entire supply chain
  • Complete L11 testing and validation beyond industry standards for seamless plug-and-play deployment

AI at Scale with Supermicro AI Factory SuperClusters

Supermicro’s AI Factory SuperClusters are based on NVIDIA Enterprise Reference Architectures and provide enterprise customers with complete, rack-scale and cluster-scale solutions that ensure full-stack performance and compatibility, simplifying the deployment of complete AI factories. Supermicro’s testing and validation goes beyond industry standards, with complete testing of all nodes and cluster-level (L12) testing before shipment to ensure seamless plug-and-play deployment for customers of any size. Supermicro AI Factory solutions are endorsed by NVIDIA for Infrastructure Configuration, Spectrum-X networking, and Software Reference Stack and based on the NVIDIA Enterprise Reference Architecture for RTX PRO 6000 Blackwell Server Edition and HGX B200.

  • Complete Rack-Level Integration and Validation: Built by Supermicro’s expert teams, delivered ready to power on from day one.
  • NVIDIA Spectrum-X Networking: High-speed AI compute fabric tuned and validated across the full stack of NVIDIA hardware and software, creating an unmatched Ethernet solution for AI factories.
  • NVIDIA Software Stack: Run NVIDIA AI Enterprise, NVIDIA Omniverse, and NVIDIA Run:ai with guaranteed compatibility.
  • NVIDIA GPUs: Choose from NVIDIA RTX PRO 6000 Blackwell Server Edition GPU and HGX B200 to handle a wide range of enterprise AI workloads.
  • Storage: Supermicro works with leading ISVs to create storage solutions that support the NVIDIA AI Data Platform and can be seamlessly connected to AI factories.
  • Thermal Optimization: Architectures designed for maximum performance in air-cooled environments.
  • Supermicro NVIDIA-Certified Systems: Tested and validated for guaranteed compatibility and performance.

| GPU | NVIDIA RTX PRO 6000 Blackwell Server Edition GPU | NVIDIA HGX B200 |
|---|---|---|
| Maximum Cluster Size | Up to 32 nodes, 256 GPUs | Up to 128 nodes, 1,024 GPUs |
| Nodes per Rack (Typical) | 4–8 per rack | 4 per rack |
| GPU System Node SKU(s) | SYS-522GA-NRT, SYS-422GL-NR, AS -5126GS-TNRT2 | SYS-A22GA-NBRT |
| Rack Power (4 Nodes) | 33.3–36.6 kW | 53.6 kW |
| Networking | NVIDIA Spectrum-X | NVIDIA Spectrum-X |
| NVIDIA Software Stack | NVIDIA AI Enterprise / NVIDIA Omniverse / NVIDIA Run:ai | NVIDIA AI Enterprise / NVIDIA Omniverse / NVIDIA Run:ai |
| Target Deployment Use Case | AI inference / Retrieval Augmented Generation (RAG), HPC, and visual computing | Foundational AI model training, large-scale AI inference, and HPC workloads |
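
For rough capacity planning, the cluster-size and rack-power figures above can be combined into a back-of-the-envelope estimate of compute rack count and compute rack power. The sketch below is illustrative only: the names `ClusterConfig`, `compute_racks`, and `compute_power_kw` are hypothetical, it assumes every compute rack holds exactly 4 nodes (the density the rack power figures are quoted for), and it ignores networking, storage, and cooling overhead.

```python
from dataclasses import dataclass
from math import ceil


@dataclass(frozen=True)
class ClusterConfig:
    """Figures copied from the comparison table; field names are hypothetical."""
    name: str
    max_nodes: int
    max_gpus: int
    rack_kw_low: float   # rack power with 4 nodes, lower bound (kW)
    rack_kw_high: float  # rack power with 4 nodes, upper bound (kW)


RTX_PRO = ClusterConfig("RTX PRO 6000 Blackwell Server Edition", 32, 256, 33.3, 36.6)
HGX_B200 = ClusterConfig("HGX B200", 128, 1024, 53.6, 53.6)

NODES_PER_RACK = 4  # assumption: matches the "Rack Power (4 Nodes)" row


def compute_racks(cfg: ClusterConfig) -> int:
    """Number of compute racks needed at maximum cluster size."""
    return ceil(cfg.max_nodes / NODES_PER_RACK)


def compute_power_kw(cfg: ClusterConfig) -> tuple:
    """(low, high) total compute rack power in kW at maximum cluster size."""
    racks = compute_racks(cfg)
    return racks * cfg.rack_kw_low, racks * cfg.rack_kw_high


if __name__ == "__main__":
    for cfg in (RTX_PRO, HGX_B200):
        low, high = compute_power_kw(cfg)
        gpus_per_node = cfg.max_gpus // cfg.max_nodes
        print(f"{cfg.name}: {gpus_per_node} GPUs/node, "
              f"{compute_racks(cfg)} racks, {low:.1f} to {high:.1f} kW")
```

Under these assumptions, a full 32-node RTX PRO 6000 cluster works out to 8 compute racks drawing roughly 266.4 to 292.8 kW, and a full 128-node HGX B200 cluster to 32 compute racks drawing roughly 1,715.2 kW. Both configurations come to 8 GPUs per node. Actual deployments at 8 nodes per rack, or with additional infrastructure racks, will differ.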

NVIDIA AI Software Platforms

Supermicro’s NVIDIA-Certified Systems™ have been fully tested and validated for performance, reliability, and compatibility with the NVIDIA AI software stack including NVIDIA AI Enterprise, NVIDIA Omniverse, and NVIDIA Run:ai, enabling the building and deployment of production-ready agentic AI and physical AI systems anywhere—across clouds, data centers, or at the edge.

NVIDIA AI Enterprise

NVIDIA AI Enterprise is a cloud-native suite of software tools, libraries, and frameworks, including NVIDIA NIM and NeMo microservices, that accelerate and simplify the development, deployment, and scaling of AI applications.

NVIDIA Omniverse

NVIDIA Omniverse is a platform of APIs, SDKs, and services that enable developers to integrate OpenUSD, NVIDIA RTX™ rendering technologies, and generative physical AI into existing software tools and simulation workflows for industrial and robotic use cases.

NVIDIA Run:ai

NVIDIA Run:ai accelerates AI operations with dynamic orchestration across the AI life cycle, maximizing GPU efficiency, scaling workloads, and integrating seamlessly into hybrid AI infrastructure with zero manual effort.

Services and Onsite Deployment

Accelerate your data center deployment with comprehensive professional services from planning through ongoing support. Supermicro Global Services delivers end-to-end expertise including data center design, solution validation, and professional on-site deployment – whether you're building from bare land, retrofitting air-to-liquid cooling, or deploying in co-location facilities. Our integrated approach reduces time-to-online and ensures higher-quality installations, backed by continued on-site support and 4-hour response time options for mission-critical uptime.

Certain products may not be available in your region